Search Sciweavers | Sciweavers

10 search results - page 1 / 2

» Using urls and table layout for web classification tasks

click to vote

WWW
2004
ACM

151views Internet Technology» more WWW 2004»

Using urls and table layout for web classification tasks

14 years 5 months ago

Download www.iw3c2.org

We propose new features and algorithms for automating Web-page classification tasks such as content recommendation and ad blocking. We show that the automated classification of We...

L. K. Shih, David R. Karger

claim paper

Read More »

click to vote

HICSS
2008
IEEE

105views Biometrics» more HICSS 2008»

Using Visual Features for Fine-Grained Genre Classification of Web Pages

13 years 11 months ago

Download csdl2.computer.org

The field of automatic genre classification has primarily focused on extracting textual features from documents. The goal of this research is to investigate whether visual feature...

Ryan Levering, Michal Cutler, Lei Yu

claim paper

Read More »

click to vote

WWW
2009
ACM

131views Internet Technology» more WWW 2009»

Purely URL-based topic classification

14 years 5 months ago

Download www2009.org

Given only the URL of a web page, can we identify its topic? This is the question that we examine in this paper. Usually, web pages are classified using their content [7], but a U...

Eda Baykan, Monika Rauch Henzinger, Ludmila Marian...

claim paper

Read More »

click to vote

IR
2006

155views Natural Language Processing» more IR 2006»

Table extraction for answer retrieval

13 years 4 months ago

Download www.cs.umass.edu

The ability to find tables and extract information from them is a necessary component of many information retrieval tasks. Documents often contain tables in order to communicate d...

Xing Wei, W. Bruce Croft, Andrew McCallum

claim paper

Read More »

click to vote

ITCC
2005
IEEE

105views Information Technology» more ITCC 2005»

Elimination of Redundant Information for Web Data Mining

13 years 10 months ago

Download eprints.utas.edu.au

These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditi...

Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang

claim paper

Read More »

« Prev « First page 1 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers