Search Sciweavers | Sciweavers

26 search results - page 4 / 6

» Information extraction from structured documents using k-tes...

click to vote

WWW
2005
ACM

150views Internet Technology» more WWW 2005»

Extracting context to improve accuracy for HTML content extraction

14 years 6 months ago

Download www1.cs.columbia.edu

Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...

Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo

claim paper

Read More »

click to vote

WWW
2009
ACM

189views Internet Technology» more WWW 2009»

Extracting data records from the web using tag path clustering

13 years 10 months ago

Download www2009.org

Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the ﬁrst step of this object extraction process, identiﬁes...

Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...

claim paper

Read More »

click to vote

AAAI
2007

123views Intelligent Agents» more AAAI 2007»

Recognizing Textual Entailment Using a Subsequence Kernel Method

13 years 7 months ago

Download www.aaai.org

We present a novel approach to recognizing Textual nt. Structural features are constructed from abstract tree descriptions, which are automatically extracted from syntactic depend...

Rui Wang 0005, Günter Neumann

claim paper

Read More »

click to vote

IJCNN
2006
IEEE

94views Neural Networks» more IJCNN 2006»

A Self-Organising Map Approach for Clustering of XML Documents

13 years 11 months ago

Download www.math.unipd.it

— The number of XML documents produced and available on the Internet is steadily increasing. It is thus important to devise automatic procedures to extract useful information fro...

Francesca Trentini, Markus Hagenbuchner, Alessandr...

claim paper

Read More »

click to vote

WWW
2005
ACM

173views Internet Technology» more WWW 2005»

Automatically learning document taxonomies for hierarchical classification

14 years 6 months ago

Download www.ideal.ece.utexas.edu

While several hierarchical classification methods have been applied to web content, such techniques invariably rely on a pre-defined taxonomy of documents. We propose a new techni...

Kunal Punera, Suju Rajan, Joydeep Ghosh

claim paper

Read More »

« Prev « First page 4 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers