Sciweavers

288 search results - page 15 / 58
» Extracting compound terms from domain corpora
WWW
2008
ACM
16 years 1 month ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs): groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev
ACL
2010
14 years 10 months ago
Automatically Generating Term Frequency Induced Taxonomies
We propose a novel method to automatically acquire a term-frequency-based taxonomy from a corpus using an unsupervised method. A term-frequency-based taxonomy is useful for applic...
Karin Murthy, Tanveer A. Faruquie, L. Venkata Subr...
WWW
2007
ACM
16 years 1 month ago
Towards domain-independent information extraction from web tables
Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...
Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...
ECAI
2000
Springer
15 years 4 months ago
Using Description Logics for Ontology Extraction
The paper presents a prototype of a system for querying the Web in natural language (French) for a limited domain. The domain knowledge, represented in description logics (DL), is ...
Amalia Todirascu, François de Bertrand de B...
CIKM
2001
Springer
15 years 4 months ago
A Domain Independent Environment for Creating Information Extraction Modules
Text-Mining is a growing area of interest within the field of Data Mining and Knowledge Discovery. Given a collection of text documents, most approaches to Text Mining perform kno...
Ronen Feldman, Yonatan Aumann, Yair Liberzon, Kfir...