A DYNAMIC FEATURE SELECTION METHOD FOR DOCUMENT RANKING WITH RELEVANCE FEEDBACK APPROACH

ICTACT Journal on Soft Computing ( Volume: 1 , Issue: 1 )

Abstract

vioft2nntf2t|tblJournal|Abstract_paper|0xf4ff3e0b01000000fb3f000001000200
Ranking search results is essential for information retrieval and Web search. Search engines need to not only return highly relevant results, but also be fast to satisfy users. As a result, not all available features can be used for ranking, and in fact only a small percentage of these features can be used. Thus, it is crucial to have a feature selection mechanism that can find a subset of features that both meets latency requirements and achieves high relevance. In this paper we describe a 0/1 knapsack procedure for automatically selecting features to use within Generalization model for Document Ranking. We propose an approach for Relevance Feedback using Expectation Maximization method and evaluate the algorithm on the TREC Collection for describing classes of feedback textual information retrieval features. Experimental results, evaluated on standard TREC-9 part of the OHSUMED collections, show that our feature selection algorithm produces models that are either significantly more effective than, or equally effective as, models such as Markov Random Field model, Correlation Co-efficient and Count Difference method.

Authors

K. Latha1, B. Bhargavi2, C. Dharani3, R. Rajaram4
Anna University of Technology, Tiruchirappali, Tamil Nadu, India1, Anna University of Technology, Tiruchirappali, Tamil Nadu, India2, Anna University of Technology, Tiruchirappali, Tamil Nadu, India3, Thiagarajar College of Engineering, Madurai, Tamil Nadu, India4

Keywords

Feature Selection, Expectation Maximization, Markov Random Field, Generalization, Document Ranking

Published By
ICTACT
Published In
ICTACT Journal on Soft Computing
( Volume: 1 , Issue: 1 )
Date of Publication
July 2010
Pages
1 - 8

ICT Academy is an initiative of the Government of India in collaboration with the state Governments and Industries. ICT Academy is a not-for-profit society, the first of its kind pioneer venture under the Public-Private-Partnership (PPP) model

Contact Us

ICT Academy
Module No E6 -03, 6th floor Block - E
IIT Madras Research Park
Kanagam Road, Taramani,
Chennai 600 113,
Tamil Nadu, India

For Journal Subscription: journalsales@ictacademy.in

For further Queries and Assistance, write to us at: ictacademy.journal@ictacademy.in