Sciweavers

258 search results - page 3 / 52
» Classifying Document Titles Based on Information Inference
Sort
View
IJCAI
2003
13 years 6 months ago
Information Extraction from Web Documents Based on Local Unranked Tree Automaton Inference
Information extraction (IE) aims at extracting specific information from a collection of documents. A lot of previous work on 10 from semi-structured documents (in XML or HTML) us...
Raymond Kosala, Maurice Bruynooghe, Jan Van den Bu...
EMNLP
2010
13 years 3 months ago
Staying Informed: Supervised and Semi-Supervised Multi-View Topical Analysis of Ideological Perspective
With the proliferation of user-generated articles over the web, it becomes imperative to develop automated methods that are aware of the ideological-bias implicit in a document co...
Amr Ahmed, Eric P. Xing
TREC
2004
13 years 6 months ago
Indri at TREC 2004: Terabyte Track
This paper provides an overview of experiments carried out at the TREC 2004 Terabyte Track using the Indri search engine. Indri is an efficient, effective distributed search engin...
Donald Metzler, Trevor Strohman, Howard R. Turtle,...
WWW
2006
ACM
14 years 6 months ago
HTML2RSS: automatic generation of RSS feed based on structure analysis of HTML document
We present a system to automatically generate RSS feeds from HTML documents that consist of time-series items with date expressions, e.g., archives of weblogs, BBSs, chats, mailin...
Tomoyuki Nanno, Manabu Okumura
DKE
2006
139views more  DKE 2006»
13 years 5 months ago
Information extraction from structured documents using k-testable tree automaton inference
Information extraction (IE) addresses the problem of extracting specific information from a collection of documents. Much of the previous work on IE from structured documents, suc...
Raymond Kosala, Hendrik Blockeel, Maurice Bruynoog...