Search Sciweavers | Sciweavers

23 search results - page 3 / 5

» Focused web crawling in the acquisition of comparable corpor...

WWW
2006
ACM

Internet Technology

Status of the African Web

14 years 5 months ago

Download www2006.org

As part of the Language Observatory Project [4], we have been crawling all the web space since 2004. We have collected terabytes of data mostly from Asian and African ccTLDs. In t...

Rizza Camus Caminero, Pavol Zavarsky, Yoshiki Mika...

ECIR
2009
Springer

Information Technology

Quality-Oriented Search for Depression Portals

14 years 2 months ago

Download david-hawking.net

The problem of low-quality information on the Web is nowhere more important than in the domain of health, where unsound information and misleading advice can have serious consequen...

Thanh Tin Tang, David Hawking, Ramesh S. Sankarana...

WWW
2007
ACM

Internet Technology

Towards domain-independent information extraction from web tables

14 years 5 months ago

Download www2007.org

Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...

Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...

SIGMOD
2010
ACM

Database

Optimizing content freshness of relations extracted from the web using keyword search

13 years 5 months ago

Download www2.hawaii.edu

An increasing number of applications operate on data obtained from the Web. These applications typically maintain local copies of the web data to avoid network latency in data acc...

Mohan Yang, Haixun Wang, Lipyeow Lim, Min Wang

EACL
2006
ACL Anthology

Natural Language Processing

Compiling French-Japanese Terminologies from the Web

13 years 6 months ago

Download acl.ldc.upenn.edu

We propose a method for compiling bilingual terminologies of multi-word terms (MWTs) for given translation pairs of seed terms. Traditional methods for bilingual terminology compi...

Xavier Robitaille, Yasuhiro Sasaki, Masatsugu Tono...
