Search Sciweavers | Sciweavers

62 search results - page 2 / 13

» Learning Page-Independent Heuristics for Extracting Data fro...

ACL
2009

167 views, Computational Linguistics

Mining Bilingual Data from the Web with Adaptively Learnt Patterns

13 years 3 months ago

Download www.aclweb.org

Mining bilingual data (including bilingual sentences and terms) from the Web can benefit many NLP applications, such as machine translation and cross language information retrie...

Long Jiang, Shiquan Yang, Ming Zhou, Xiaohua Liu, ...

BNCOD
2006

88 views, Database

The Lixto Project: Exploring New Frontiers of Web Data Extraction

13 years 6 months ago

Download www.dbai.tuwien.ac.at

The Lixto project is an ongoing research effort in the area of Web data extraction. Whereas the project originally started out with the idea to develop a logic-based extraction lan...

Julien Carme, Michal Ceresna, Oliver Frölich,...

IPM
2007

149 views

Web page title extraction and its application

13 years 5 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...

Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...

DL
2000
Springer

351 views, Digital Library

Acrophile: an automated acronym extractor and server

13 years 9 months ago

Download ir.iit.edu

We implemented a web server for acronym and abbreviation lookup, containing a collection of acronyms and their expansions gathered from a large number of web pages by a heuristic ...

Leah S. Larkey, Paul Ogilvie, M. Andrew Price, Bre...

SIGIR
2005
ACM

156 views, Information Technology

Title extraction from bodies of HTML documents and its application to web page retrieval

13 years 11 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...
