Sciweavers

502 search results - page 54 / 101
» Extracting Partial Structures from HTML Documents
Sort
View
CIKM
2005
Springer
15 years 4 months ago
Structural features in content oriented XML retrieval
The structural features of XML components are an extra source of information that should be used in a contentoriented retrieval task on this type of documents. This paper explores...
Georgina Ramírez, Thijs Westerveld, Arjen P...
CG
2007
Springer
14 years 11 months ago
Visual text mining using association rules
In many situations, individuals or groups of individuals are faced with the need to examine sets of documents to achieve understanding of their structure and to locate relevant in...
Alneu de Andrade Lopes, Roberto Pinho, Fernando Vi...
WWW
2001
ACM
15 years 12 months ago
IEPAD: information extraction based on pattern discovery
The research in information extraction (IE) regards the generation of wrappers that can extract particular information from semistructured Web documents. Similar to compiler gener...
Chia-Hui Chang, Shao-Chen Lui
RIAO
2000
15 years 20 days ago
Assisting requirements engineering with semantic document analysis
Requirements engineering is the first stage in the software life-cycle and is concerned with discovering and managing a software system's services, constraints and goals. Req...
Paul Rayson, Roger Garside, Peter Sawyer
KDD
2008
ACM
120views Data Mining» more  KDD 2008»
15 years 11 months ago
Entity categorization over large document collections
Extracting entities (such as people, movies) from documents and identifying the categories (such as painter, writer) they belong to enable structured querying and data analysis ov...
Arnd Christian König, Rares Vernica, Venkates...