Search Sciweavers | Sciweavers

CIKM
2005
Springer

Focused crawling for both topical relevance and quality of medical information

15 years 7 months ago

Subject-specific search facilities on health sites are usually built using manual inclusion and exclusion rules. These can be expensive to maintain and often provide incomplete c...

Thanh Tin Tang, David Hawking, Nick Craswell, Kath...

CIKM
2009
Springer

Graph-based seed selection for web-scale crawlers

15 years 8 months ago

Download clgiles.ist.psu.edu

One of the most important steps in web crawling is determining the starting points, or seed selection. This paper identifies and explores the problem of seed selection in webscal...

Shuyi Zheng, Pavel Dmitriev, C. Lee Giles

WEBI
2007
Springer

Question Answering over Implicitly Structured Web Content

15 years 8 months ago

Download www.mathcs.emory.edu

Implicitly structured content on the Web such as HTML tables and lists can be extremely valuable for web search, question answering, and information retrieval, as the implicit str...

Eugene Agichtein, Chris Burges, Eric Brill

GCC
2005
Springer

Parallel Web Spiders for Cooperative Information Gathering

15 years 7 months ago

Download www.semgrid.net

Web spider is a widely used approach to obtain information for search engines. As the size of the Web grows, it becomes a natural choice to parallelize the spider’s crawling proc...

Jiewen Luo, Zhongzhi Shi, Maoguang Wang, Wei Wang

ICML
2007
IEEE

Focused crawling with scalable ordinal regression solvers

16 years 2 months ago

Download www.machinelearning.org

In this paper we propose a novel, scalable, clustering based Ordinal Regression formulation, which is an instance of a Second Order Cone Program (SOCP) with one Second Order Cone ...

Rashmin Babaria, J. Saketha Nath, S. Krishnan, K. ...
