Sciweavers

3090 search results - page 228 / 618
» Document Processing with LinkIT
Sort
View
CEAS
2007
Springer
15 years 12 months ago
Hardening Fingerprinting by Context
Near-duplicate detection is not only an important pre and post processing task in Information Retrieval but also an effective spam-detection technique. Among different approache...
Aleksander Kolcz, Abdur Chowdhury
CIKM
2007
Springer
15 years 12 months ago
Developing learning strategies for topic-based summarization
Most up-to-date well-behaved topic-based summarization systems are built upon the extractive framework. They score the sentences based on the associated features by manually assig...
Ouyang You, Sujian Li, Wenjie Li
CIKM
2001
Springer
15 years 10 months ago
Mining the Web to Create Minority Language Corpora
The Web is a valuable source of language speci c resources but the process of collecting, organizing and utilizing these resources is di cult. We describe CorpusBuilder, an approa...
Rayid Ghani, Rosie Jones, Dunja Mladenic
CIKM
2001
Springer
15 years 10 months ago
Merging Techniques for Performing Data Fusion on the Web
Data fusion on the Web refers to the merging, into a unified single list, of the ranked document lists, which are retrieved in response to a user query by more than one Web search...
Theodora Tsikrika, Mounia Lalmas
DL
2000
Springer
162views Digital Library» more  DL 2000»
15 years 10 months ago
Snowball: extracting relations from large plain-text collections
Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...
Eugene Agichtein, Luis Gravano