Sciweavers

467 search results - page 8 / 94
» Pat-tree-based Keyword Extraction for Chinese Information Re...
Sort
View
WWW
2005
ACM
15 years 5 months ago
An information extraction engine for web discussion forums
In this poster, we present an information extraction engine for web-based forums. The engine analyzes the HTML files crawled from web forums, deduces the wrapper (template) of the...
Hanny Yulius Limanto, Nguyen Ngoc Giang, Vo Tan Tr...
JCDL
2005
ACM
95views Education» more  JCDL 2005»
15 years 5 months ago
Resolving the unencoded character problem for chinese digital libraries
Constructing a Chinese digital library, especially for a historical article archiving, is often bothered by the small character sets supported by the current computer systems. Thi...
Derming Juang, Jenq-Haur Wang, Chen-Yu Lai, Ching-...
SIGIR
2009
ACM
15 years 6 months ago
Web-derived resources for web information retrieval: from conceptual hierarchies to attribute hierarchies
A weakly-supervised extraction method identifies concepts within conceptual hierarchies, at the appropriate level of specificity (e.g., Bank vs. Institution), to which attribute...
Marius Pasca, Enrique Alfonseca
IR
2006
14 years 11 months ago
Table extraction for answer retrieval
The ability to find tables and extract information from them is a necessary component of many information retrieval tasks. Documents often contain tables in order to communicate d...
Xing Wei, W. Bruce Croft, Andrew McCallum
CIKM
2008
Springer
15 years 1 months ago
Using structured text for large-scale attribute extraction
We propose a weakly-supervised approach for extracting class attributes from structured text available within Web documents. The overall precision of the extracted attributes is a...
Sujith Ravi, Marius Pasca