Sciweavers

924 search results - page 9 / 185
» Measuring Information Understanding in Large Document Collec...
Sort
View
52
Voted
AIL
2010
60views more  AIL 2010»
14 years 7 months ago
Network-based filtering for large email collections in E-Discovery
The information overload in E-Discovery proceedings makes reviewing expensive and it increases the risk of failure to produce results on time and consistently. New interactive tec...
Hans Henseler
ECIR
2009
Springer
15 years 6 months ago
Revisiting N-Gram Based Models for Retrieval in Degraded Large Collections
The traditional retrieval models based on term matching are not effective in collections of degraded documents (output of OCR or ASR systems for instance). This paper presents a n...
Javier Parapar, Ana Freire, Alvaro Barreiro
82
Voted
WSDM
2010
ACM
173views Data Mining» more  WSDM 2010»
15 years 7 months ago
Measuring the Reusability of Test Collections
While test collection construction is a time-consuming and expensive process, the true cost is amortized by reusing the collection over hundreds or thousands of experiments. Some ...
Ben Carterette, Evgeniy Gabrilovich, Vanja Josifov...
77
Voted
IADIS
2004
14 years 11 months ago
Relevance feedback using semantic association between indexing terms in large free text corpuses
Relevance feedback has been considered as a means of incorporating learning into information retrieval systems for quite sometime now. This paper discusses the research results of...
Shahzad Khan, Kenan Azam
AI
2007
Springer
15 years 3 months ago
Fuzzy Clustering for Topic Analysis and Summarization of Document Collections
Abstract. Large document collections, such as those delivered by Internet search engines, are difficult and time-consuming for users to read and analyse. The detection of common an...
René Witte, Sabine Bergler