NAACL
1994

Learning from Relevant Documents in Large Scale Routing Retrieval

Download acl.ldc.upenn.edu

The normal practice in selecting relevant documents for training routing queries is to use either all relevant documents or the 'best n' of them after a retrieval ranking with respect to each query. Using all relevant documents can introduce noise and ambiguity into training, because documents can be long and contain many irrelevant portions. Using only the 'best n' risks leaving out relevant documents that do not resemble the query. Based on a method of segmenting documents into subdocuments of more uniform size, a better approach is to use the top-ranked subdocument of every relevant document. An alternative selection strategy is based on document properties, without ranking. We found experimentally that short relevant documents are the highest-quality items for training, and that the beginning portions of longer relevant documents are also useful. Using both types provides a strategy that is both effective and efficient.
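The two selection strategies named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the fixed word-count chunking, the term-overlap score, the 5-word limit, and all function names below are assumptions chosen only to keep the example small.

```python
from collections import Counter


def split_into_subdocuments(text, max_words):
    """Split text into consecutive word chunks of at most max_words words."""
    words = text.split()
    return [words[i:i + max_words] for i in range(0, len(words), max_words)]


def overlap_score(query_terms, chunk):
    """Toy score (an assumption): number of chunk tokens that are query terms."""
    counts = Counter(word.lower() for word in chunk)
    return sum(counts[term] for term in query_terms)


def top_ranked_subdocument(query, document, max_words):
    """Return the chunk of one relevant document that scores highest for the query."""
    query_terms = set(query.lower().split())
    chunks = split_into_subdocuments(document, max_words) or [[]]
    # max() keeps the earliest chunk when scores tie.
    return max(chunks, key=lambda chunk: overlap_score(query_terms, chunk))


def short_or_leading_portion(document, max_words):
    """Ranking-free selection: a short document is kept whole,
    a longer one is cut down to its first max_words words."""
    return document.split()[:max_words]


query = "oil prices"
relevant_documents = [
    "oil prices rise sharply",
    "weather report sunny today and also oil prices fall later",
]

# Strategy 1: top-ranked subdocument of every relevant document.
training_by_rank = [top_ranked_subdocument(query, d, 5) for d in relevant_documents]

# Strategy 2: short documents plus beginning portions of longer ones.
training_by_property = [short_or_leading_portion(d, 5) for d in relevant_documents]
```

In this toy data the second document mentions the query terms only in its later half, so the two strategies pick different portions of it: the ranking-based one selects the second chunk, while the property-based one keeps the opening words without scoring anything.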

K. L. Kwok, Laszlo Grunfeld

Related Content

» Inferring document relevance via average precision

» NonRelevance Feedback Document Retrieval based on One Class SVM and SVDD

» Relevance feedback using semantic association between indexing terms in large free text co...

» Dragon Toolkit Incorporating AutoLearned Semantic Knowledge into LargeScale Text Retrieval...

» Approximating true relevance distribution from a mixture model based on irrelevance data

» Liberal relevance criteria of TREC counting on negligible documents

» Contentbased document routing and index partitioning for scalable similaritybased searches...

» DOCODELite A MetaSearch Engine for Document Similarity Retrieval

» Efficient Representation of Local Geometry for Large Scale Object Retrieval

Post Info

Added	02 Nov 2010
Updated	02 Nov 2010
Type	Conference
Year	1994
Where	NAACL
Authors	K. L. Kwok, Laszlo Grunfeld