Sciweavers

368 search results - page 57 / 74
» Template-Based Information Mining from HTML Documents
Sort
View
KDD
2008
ACM
128views Data Mining» more  KDD 2008»
15 years 10 months ago
Scaling up text classification for large file systems
: We combine the speed and scalability of information retrieval with the generally superior classification accuracy offered by machine learning, yielding a two-phase text classifie...
George Forman, Shyamsundar Rajaram
KDD
2008
ACM
115views Data Mining» more  KDD 2008»
15 years 10 months ago
Topical query decomposition
We introduce the problem of query decomposition, where we are given a query and a document retrieval system, and we want to produce a small set of queries whose union of resulting...
Francesco Bonchi, Carlos Castillo, Debora Donato, ...
ICDAR
2003
IEEE
15 years 2 months ago
A Constraint-based Approach to Table Structure Derivation
er presents an approach to deriving an abstract geometric model of a table from a physical representation. The technique developed uses a graph of constraints between cells which ...
Matthew Hurst
TREC
2008
14 years 11 months ago
UTDallas at TREC 2008 Blog Track
This paper describes our participation in the 2008 TREC Blog track. Our system consists of 3 components: data preprocessing, topic retrieval, and opinion finding. In the topic ret...
Bin Li, Feifan Liu, Yang Liu
ICDIM
2008
IEEE
15 years 4 months ago
Flexible Question Answering System for mobile devices
This paper presents a Flexible Question Answering System (FQAS) for mobile wireless devices. FQAS comprises a fuzzy logic-based Information Retrieval (IR) System together with a q...
Daniel Ortiz Arroyo