Sciweavers

1437 search results - page 100 / 288
» Content Extraction Signatures
WWW
2007
ACM
16 years 3 months ago
Towards domain-independent information extraction from web tables
Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...
Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...
ADC
2009
Springer
15 years 9 months ago
Ranking-Constrained Keyword Sequence Extraction from Web Documents
Given a large volume of Web documents, we consider the problem of finding the shortest keyword sequences for each of the documents such that a keyword sequence can be rendered to a g...
Ding-Yi Chen, Xue Li, Jing Liu, Xia Chen
MM
2006
ACM
15 years 9 months ago
Extraction of social context and application to personal multimedia exploration
Personal media collections are often viewed and managed along the social dimension (the places we spend time at and the people we see); thus tools for extracting and using this inf...
Brett Adams, Dinh Q. Phung, Svetha Venkatesh
WIRI
2005
IEEE
15 years 8 months ago
Extended Link Analysis for Extracting Spatial Information Hubs
Recently, web mining, which tries to find useful knowledge in the vast number of web pages, has attracted a lot of research interest. Besides, it is becoming an essential task to...
Jianwei Zhang 0002, Yoshiharu Ishikawa, Hiroyuki K...
SIGIR
2003
ACM
15 years 8 months ago
Text categorization by boosting automatically extracted concepts
Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Lijuan Cai, Thomas Hofmann