NAMED ENTITY RECOGNITION FROM BIOMEDICAL TEXT - AN INFORMATION EXTRACTION TASK

ICTACT Journal on Soft Computing (Volume: 6, Issue: 4)

Abstract

Biomedical Text Mining (BioTM) targets the extraction of significant information from biomedical archives. BioTM encompasses Information Retrieval (IR) and Information Extraction (IE). The IR process retrieves the relevant biomedical literature from repositories such as PubMed and MEDLINE, based on a search query, and ends with a corpus of the documents retrieved from these publication databases. The IE task comprises preprocessing of the documents, Named Entity Recognition (NER), and relationship extraction. It draws on natural language processing, data mining techniques, and machine learning algorithms. The preprocessing task includes tokenization, stop-word removal, shallow parsing, and part-of-speech tagging. The NER phase recognizes well-defined objects such as genes, proteins, or cell lines. This leads to the next phase of IE, the extraction of relationships. The work is based on the machine learning algorithm Conditional Random Field (CRF).
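The preprocessing and NER steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the stop-word list, the tokenization pattern, the feature names, and the example sentence are all assumptions chosen for illustration. It shows tokenization, stop-word removal, and the kind of per-token feature dictionary that is commonly passed to a CRF sequence labeler; shallow parsing, part-of-speech tagging, and CRF training itself are left out.

```python
import re

# Tiny illustrative stop-word list (an assumption; real systems use larger lists).
STOP_WORDS = {"the", "of", "in", "a", "an", "and", "is", "by", "to", "with"}

# Words may keep internal hyphens (e.g. "IL-2"); other symbols become single tokens.
TOKEN_RE = re.compile(r"[A-Za-z0-9]+(?:-[A-Za-z0-9]+)*|[^\sA-Za-z0-9]")


def tokenize(text):
    """Split text into word tokens and punctuation tokens."""
    return TOKEN_RE.findall(text)


def remove_stop_words(tokens):
    """Drop tokens whose lowercase form is in the stop-word list."""
    return [t for t in tokens if t.lower() not in STOP_WORDS]


def token_features(tokens, i):
    """Build a feature dictionary for token i (hypothetical feature set)."""
    tok = tokens[i]
    return {
        "lower": tok.lower(),
        "is_capitalized": tok[:1].isupper(),
        "has_digit": any(c.isdigit() for c in tok),
        "has_hyphen": "-" in tok,
        "suffix3": tok[-3:].lower(),
        # Neighboring tokens give the labeler local context.
        "prev": tokens[i - 1].lower() if i > 0 else "<BOS>",
        "next": tokens[i + 1].lower() if i < len(tokens) - 1 else "<EOS>",
    }


def sentence_features(tokens):
    """Feature dictionaries for every token in one sentence."""
    return [token_features(tokens, i) for i in range(len(tokens))]


if __name__ == "__main__":
    sentence = "IL-2 activates the NF-kappa B pathway."
    tokens = tokenize(sentence)
    print(tokens)
    print(remove_stop_words(tokens))
    print(sentence_features(tokens)[0])
```

Surface features such as digits, hyphens, and capitalization are included because gene and protein names often contain them. A CRF would learn weights over such features jointly with the label transitions between neighboring tokens.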

Authors

N. Kanya (1), T. Ravi (2)
(1) Manonmaniam Sundaranar University, India; (2) Madanapalle Institute of Technology and Science, India

Keywords

Information Extraction, Information Retrieval, Text Mining, Named Entity Recognition, Data Mining

Published By
ICTACT
Published In
ICTACT Journal on Soft Computing
(Volume: 6, Issue: 4)
Date of Publication
July 2016
Pages
1302-1307
