Sciweavers

1319 search results - page 143 / 264
» Using the Structure of HTML Documents to Improve Retrieval
CIKM
2004
Springer
15 years 9 months ago
Hierarchical document categorization with support vector machines
Automatically categorizing documents into pre-defined topic hierarchies or taxonomies is a crucial step in knowledge and content management. Standard machine learning techniques ...
Lijuan Cai, Thomas Hofmann
VLDB
1997
ACM
15 years 7 months ago
Integrating SQL Databases with Content-Specific Search Engines
In recent years, database research and product development activities have focused on support for non-traditional data types, such as text or multi-media documents. This paper describes an a...
Stefan Deßloch, Nelson Mendonça Matto...
SIGIR
2005
ACM
15 years 9 months ago
When will information retrieval be "good enough"?
We describe a user study that examined the relationship between the quality of an Information Retrieval system and the effectiveness of its users in performing a task. The task i...
James Allan, Ben Carterette, Joshua Lewis
ICTAI
2007
IEEE
15 years 10 months ago
Dragon Toolkit: Incorporating Auto-Learned Semantic Knowledge into Large-Scale Text Retrieval and Mining
The majority of text retrieval and mining techniques are still based on exact feature (e.g. words) matching and unable to incorporate text semantics. Many researchers believe that...
Xiaohua Zhou, Xiaodan Zhang, Xiaohua Hu
CIKM
2005
Springer
15 years 5 months ago
Fast on-line index construction by geometric partitioning
Inverted index structures are the mainstay of modern text retrieval systems. They can be constructed quickly using off-line merge-based methods, and provide efficient support for ...
Nicholas Lester, Alistair Moffat, Justin Zobel