Search Sciweavers | Sciweavers

179 search results - page 18 / 36

» Improvement of HITS-based algorithms on web documents

102

click to vote

SIGIR
2008
ACM

133views Information Technology» more SIGIR 2008»

Classifiers without borders: incorporating fielded text from neighboring web pages

14 years 11 months ago

Download www.cse.lehigh.edu

Accurate web page classification often depends crucially on information gained from neighboring pages in the local web graph. Prior work has exploited the class labels of nearby p...

Xiaoguang Qi, Brian D. Davison

claim paper

Read More »

click to vote

SIGIR
2010
ACM

205views Information Technology» more SIGIR 2010»

Adaptive near-duplicate detection via similarity learning

15 years 3 months ago

Download research.microsoft.com

In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...

Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz

claim paper

Read More »

click to vote

GRID
2006
Springer

123views Distributed And Parallel Com...» more GRID 2006»

A Parallel Approach to XML Parsing

14 years 11 months ago

Download www.cs.indiana.edu

A language for semi-structured documents, XML has emerged as the core of the web services architecture, and is playing crucial roles in messaging systems, databases, and document p...

Wei Lu, Kenneth Chiu, Yinfei Pan

claim paper

Read More »

122

click to vote

WSDM
2012
ACM

285views Data Mining» more WSDM 2012»

Probabilistic models for personalizing web search

13 years 7 months ago

Download www.cs.nyu.edu

We present a new approach for personalizing Web search results to a speciﬁc user. Ranking functions for Web search engines are typically trained by machine learning algorithms u...

David Sontag, Kevyn Collins-Thompson, Paul N. Benn...

claim paper

Read More »

Voted

WWW
2009
ACM

152views Internet Technology» more WWW 2009»

Bootstrapped extraction of class attributes

15 years 6 months ago

Download www2009.eprints.org

As an alternative to previous studies on extracting class attributes from unstructured text, which consider either Web documents or query logs as the source of textual data, A boo...

Joseph Reisinger, Marius Pasca

claim paper

Read More »

« Prev « First page 18 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers