Search Sciweavers | Sciweavers

167

WWW
2010
ACM

193views Internet Technology» more WWW 2010»

Web-scale knowledge extraction from semi-structured tables

15 years 10 months ago

A wealth of knowledge is encoded in the form of tables on the World Wide Web. We propose a classification algorithm and a rich feature set for automatically recognizing layout tab...

Eric Crestan, Patrick Pantel

claim paper

Read More »

198

click to vote

CIARP
2007
Springer

107views Pattern Recognition» more CIARP 2007»

Information Extraction and Classification from Free Text Using a Neural Approach

15 years 9 months ago

Download eprints.pascal-network.org

Many approaches to Information Extraction (IE) have been proposed in literature capable of finding and extract specific facts in relatively unstructured documents. Their applicatio...

Ignazio Gallo, Elisabetta Binaghi

claim paper

Read More »

163

click to vote

ITCC
2005
IEEE

105views Information Technology» more ITCC 2005»

Elimination of Redundant Information for Web Data Mining

15 years 11 months ago

Download eprints.utas.edu.au

These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditi...

Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang

claim paper

Read More »

172

click to vote

WWW
2001
ACM

187views Internet Technology» more WWW 2001»

IEPAD: information extraction based on pattern discovery

16 years 6 months ago

Download www10.org

The research in information extraction (IE) regards the generation of wrappers that can extract particular information from semistructured Web documents. Similar to compiler gener...

Chia-Hui Chang, Shao-Chen Lui

claim paper

Read More »

148

click to vote

IJCNLP
2005
Springer

168views Natural Language Processing» more IJCNLP 2005»

Aligning Needles in a Haystack: Paraphrase Acquisition Across the Web

15 years 11 months ago

Download www.aclweb.org

This paper presents a lightweight method for unsupervised extraction of paraphrases from arbitrary textual Web documents. The method diﬀers from previous approaches to paraphrase...

Marius Pasca, Péter Dienes

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers