Sciweavers

469 search results - page 19 / 94
» On Compressing the Textual Web
Sort
View
78
Voted
IJCAI
2003
14 years 11 months ago
Web Page Cleaning for Web Mining through Feature Weighting
Unlike conventional data or text, Web pages typically contain a large amount of information that is not part of the main contents of the pages, e.g., banner ads, navigation bars, ...
Lan Yi, Bing Liu
AAAI
2008
15 years 20 hour ago
Turning Web Text and Search Queries into Factual Knowledge: Hierarchical Class Attribute Extraction
A seed-based framework for textual information extraction allows for weakly supervised acquisition of open-domain class attributes over conceptual hierarchies, from a combination ...
Marius Pasca
DOCENG
2004
ACM
15 years 1 months ago
A document-based approach to the generation of web applications
: XML is unique in its very broad acceptance throughout both the document engineering and data processing community. This creates a unique opportunity for unifying the traditionall...
Andrea R. de Andrade, Ethan V. Munson, Maria da Gr...
LREC
2008
102views Education» more  LREC 2008»
14 years 11 months ago
Unsupervised Learning-based Anomalous Arabic Text Detection
The growing dependence of modern society on the Web as a vital source of information and communication has become inevitable. However, the Web has become an ideal channel for vari...
Nasser Abouzakhar, Ben Allison, Louise Guthrie
NLPRS
2001
Springer
15 years 2 months ago
Hierarchical Concept Description and Learning for Information Extraction
This paper addresses the problem of extracting information from textual documents, either normal documents or web pages. A new approach for extracting complicate information from ...
Luo Xiao, Dieter Wissmann, Michael Brown, Stefan J...