Search Sciweavers | Sciweavers

22 search results - page 3 / 5

» Comparing Keyword Extraction Techniques for WEBSOM Text Arch...

click to vote

WSDM
2010
ACM

215views Data Mining» more WSDM 2010»

Boilerplate Detection using Shallow Text Features

14 years 3 months ago

Download www.wsdm-conference.org

In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...

Christian Kohlschütter, Peter Fankhauser, Wol...

claim paper

Read More »

click to vote

HT
2009
ACM

148views Internet Technology» more HT 2009»

Retrieving broken web links using an approach based on contextual information

13 years 3 months ago

Download nlp.uned.es

In this short note we present a recommendation system for automatic retrieval of broken Web links using an approach based on contextual information. We extract information from th...

Juan Martinez-Romo, Lourdes Araujo

claim paper

Read More »

click to vote

AIRWEB
2009
Springer

252views Internet Technology» more AIRWEB 2009»

Looking into the past to better classify web spam

14 years 11 days ago

Download airweb.cse.lehigh.edu

Web spamming techniques aim to achieve undeserved rankings in search results. Research has been widely conducted on identifying such spam and neutralizing its inﬂuence. However,...

Na Dai, Brian D. Davison, Xiaoguang Qi

claim paper

Read More »

click to vote

CIVR
2007
Springer

177views Image Analysis» more CIVR 2007»

Matching ottoman words: an image retrieval approach to historical document indexing

13 years 12 months ago

Download cs-people.bu.edu

Large archives of Ottoman documents are challenging to many historians all over the world. However, these archives remain inaccessible since manual transcription of such a huge vo...

Esra Ataer, Pinar Duygulu

claim paper

Read More »

click to vote

SIGMOD
2011
ACM

250views Database» more SIGMOD 2011»

Hybrid in-database inference for declarative information extraction

12 years 8 months ago

Download db.cs.berkeley.edu

In the database community, work on information extraction (IE) has centered on two themes: how to effectively manage IE tasks, and how to manage the uncertainties that arise in th...

Daisy Zhe Wang, Michael J. Franklin, Minos N. Garo...

claim paper

Read More »

« Prev « First page 3 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers