Please use this identifier to cite or link to this item: http://dspace.dtu.ac.in:8080/jspui/handle/repository/14399
Title: ENSEMBLE BASED ACTIVE ANNOTATION FOR BIOMEDICAL NAMED ENTITY RECOGNITION
Authors: SRIVASTAVA, RITU
Keywords: Biomedical
Entity Recognition
Active Annotation
Issue Date: Jan-2016
Series/Report no.: TD 1267;
Abstract: ABSTRACT An important prospect of machine learning for information extraction to deal with the problems of high cost of collecting labelled examples. Active Learning makes more efficient use of the learner’s time by asking them to label only instances that are most useful for the trainer. In random sampling approach, unlabeled data is selected for annotation at random and thus can’t yield desired result. In contrast, active learning selects the useful data from a huge pool of unlabeled data for the classifier. The strategies used often classify the corpus tokens (or, data points) under wrong classes. The classifier is confused between two categories if the token is located near the margin. We propose a novel method for solving this problem and show that it favourably results in the increased performance. Our proposed framework is based on an ensemble approach, where ID3 and C5 algorithms are used as the base classifiers. The proposed approach is applied for solving the problem of named entity recognition (NER) in the Bio-medical domain. Results show that the proposed technique indeed improves the performance of the system
URI: http://dspace.dtu.ac.in:8080/jspui/handle/repository/14399
Appears in Collections:M.E./M.Tech. Computer Technology & Applications

Files in This Item:
File Description SizeFormat 
Ritu_Srivastava.pdf827.74 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.