Sciweavers

2151 search results - page 246 / 431
» Using Document Dimensions for Enhanced Information Retrieval
EMNLP
2010
15 years 2 months ago
NLP on Spoken Documents Without ASR
There is considerable interest in interdisciplinary combinations of automatic speech recognition (ASR), machine learning, natural language processing, text classification and info...
Mark Dredze, Aren Jansen, Glen Coppersmith, Ken Wa...
INEX
2007
Springer
15 years 11 months ago
Using and Detecting Links in Wikipedia
In this paper, we document our efforts at INEX 2007 where we participated in the Ad Hoc Track, the Link the Wiki Track, and the Interactive Track that continued from INEX 2006. Ou...
Khairun Nisa Fachry, Jaap Kamps, Marijn Koolen, Ju...
MMDB
2004
ACM
15 years 10 months ago
A unified framework for image database clustering and content-based retrieval
With the proliferation of image data, the need to search and retrieve images efficiently and accurately from a large image database or a collection of image databases has drastica...
Mei-Ling Shyu, Shu-Ching Chen, Min Chen, Chengcui ...
ISMIR
2005
Springer
15 years 10 months ago
Using the Gamera Framework for Building a Lute Tablature Recognition System
In this article we describe an optical recognition system for historic lute tablature prints that we have built with the aid of the Gamera toolkit for document analysis and recogn...
Christophe Dalitz, Thomas Karsten
WWW
2006
ACM
16 years 5 months ago
Towards practical genre classification of web documents
Classification of documents by genre is typically done either using linguistic analysis or term frequency based techniques. The former provides better classification accuracy than...
George Ferizis, Peter Bailey