Search Sciweavers | Sciweavers

219 search results - page 12 / 44

» Web page language identification based on URLs

158

click to vote

WISE
2002
Springer

161views Internet Technology» more WISE 2002»

Applying the Site Information to the Information Retrieval from the Web

15 years 10 months ago

Download www.tkl.iis.u-tokyo.ac.jp

In recent years, several information retrieval methods using information about the Web-links are developed, such as HITS and Trawling. In order to analyze the Web-links dividing i...

Yasuhito Asano, Hiroshi Imai, Masashi Toyoda, Masa...

claim paper

Read More »

131

click to vote

INTR
2002

50views more INTR 2002»

Methodologies for crawler based Web surveys

15 years 5 months ago

Download cybermetrics.wlv.ac.uk

There have been many attempts to study the content of the web, either through human or automatic agents. Five different previously used web survey methodologies are described and ...

Mike Thelwall

claim paper

Read More »

150

click to vote

WWW
2006
ACM

138views Internet Technology» more WWW 2006»

Geographically focused collaborative crawling

16 years 6 months ago

Download www2006.org

A collaborative crawler is a group of crawling nodes, in which each crawling node is responsible for a specific portion of the web. We study the problem of collecting geographical...

Weizheng Gao, Hyun Chul Lee, Yingbo Miao

claim paper

Read More »

177

click to vote

AAAI
2008

206views Intelligent Agents» more AAAI 2008»

Mining Translations of Web Queries from Web Click-through Data

15 years 8 months ago

Download www.cse.ust.hk

Query translation for Cross-Lingual Information Retrieval (CLIR) has gained increasing attention in the research area. Previous work mainly used machine translation systems, bilin...

Rong Hu, Weizhu Chen, Jian Hu, Yansheng Lu, Zheng ...

claim paper

Read More »

143

click to vote

LREC
2008

108views Education» more LREC 2008»

A Lightweight and Efficient Tool for Cleaning Web Pages

15 years 7 months ago

Download www.lrec-conf.org

Originally conceived as a "naive" baseline experiment using traditional n-gram language models as classifiers, the NCLEANER system has turned out to be a fast and lightw...

Stefan Evert

claim paper

Read More »

« Prev « First page 12 / 44 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers