Search Sciweavers | Sciweavers

111

ACL
2007

140views Computational Linguistics» more ACL 2007»

Sparse Information Extraction: Unsupervised Language Models to the Rescue

15 years 4 months ago

Even in a massive corpus such as the Web, a substantial fraction of extractions appear infrequently. This paper shows how to assess the correctness of sparse extractions by utiliz...

Doug Downey, Stefan Schoenmackers, Oren Etzioni

claim paper

Read More »

122

click to vote

KDD
2008
ACM

211views Data Mining» more KDD 2008»

ArnetMiner: extraction and mining of academic social networks

16 years 3 months ago

Download keg.cs.tsinghua.edu.cn

This paper addresses several key issues in the ArnetMiner system, which aims at extracting and mining academic social networks. Specifically, the system focuses on: 1) Extracting ...

Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zha...

claim paper

Read More »

127

click to vote

IJCAI
2003

120views Artificial Intelligence» more IJCAI 2003»

Information Extraction from Tree Documents by Learning Subtree Delimiters

15 years 4 months ago

Download www.isi.edu

Information extraction from HTML pages has been conventionally treated as plain text documents extended with HTML tags. However, the growing maturity and correct usage of HTML/XHT...

Boris Chidlovskii

claim paper

Read More »

134

Voted

WWW
2007
ACM

171views Internet Technology» more WWW 2007»

SPARQ2L: towards support for subgraph extraction queries in rdf databases

16 years 3 months ago

Download www2007.org

Many applications in analytical domains often have the need to "connect the dots" i.e., query about the structure of data. In bioinformatics for example, it is typical t...

Kemafor Anyanwu, Angela Maduko, Amit P. Sheth

claim paper

Read More »

131

click to vote

IDEAS
2005
IEEE

142views Database» more IDEAS 2005»

Automatically Maintaining Wrappers for Web Sources

15 years 8 months ago

Download www.tic.udc.es

A substantial subset of the web data follows some kind of underlying structure. Nevertheless, HTML does not contain any schema or semantic information about the data it represents...

Juan Raposo, Alberto Pan, Manuel Álvarez, J...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers