Search Sciweavers | Sciweavers

367 search results - page 66 / 74

» Indexing Text Documents Based on Topic Identification

206

Voted

VLDB
2002
ACM

161views Database» more VLDB 2002»

Distributed Search over the Hidden Web: Hierarchical Database Sampling and Selection

15 years 7 months ago

Download qprober.cs.columbia.edu

Many valuable text databases on the web have non-crawlable contents that are "hidden" behind search interfaces. Metasearchers are helpful tools for searching over many s...

Panagiotis G. Ipeirotis, Luis Gravano

claim paper

Read More »

161

click to vote

SIGIR
2006
ACM

99views Information Technology» more SIGIR 2006»

Distributed query sampling: a quality-conscious approach

16 years 1 months ago

Download www.cc.gatech.edu

We present an adaptive distributed query-sampling framework that is quality-conscious for extracting high-quality text database samples. The framework divides the query-based samp...

James Caverlee, Ling Liu, Joonsoo Bae

claim paper

Read More »

184

click to vote

WWW
2006
ACM

158views Internet Technology» more WWW 2006»

Finding advertising keywords on web pages

16 years 8 months ago

Download www2006.org

A large and growing number of web pages display contextual advertising based on keywords automatically extracted from the text of the page, and this is a substantial source of rev...

Wen-tau Yih, Joshua Goodman, Vitor R. Carvalho

claim paper

Read More »

180

click to vote

EWMF
2005
Springer

149views Internet Technology» more EWMF 2005»

Discovering a Term Taxonomy from Term Similarities Using Principal Component Analysis

16 years 29 days ago

Download lahuen.dcc.uchile.cl

Abstract. We show that eigenvector decomposition can be used to extract a term taxonomy from a given collection of text documents. So far, methods based on eigenvector decompositio...

Holger Bast, Georges Dupret, Debapriyo Majumdar, B...

claim paper

Read More »

222

click to vote

IR
2010

157views Natural Language Processing» more IR 2010»

Learning to rank with (a lot of) word features

15 years 5 months ago

Download ronan.collobert.com

In this article we present Supervised Semantic Indexing (SSI) which deﬁnes a class of nonlinear (quadratic) models that are discriminatively trained to directly map from the word...

Bing Bai, Jason Weston, David Grangier, Ronan Coll...

claim paper

Read More »

« Prev « First page 66 / 74 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers