Search Sciweavers | Sciweavers

367 search results - page 22 / 74

» Indexing Text Documents Based on Topic Identification

TOIS
2010

128 views

Learning author-topic models from text corpora

15 years 2 months ago

Download www.ics.uci.edu

We propose a new unsupervised learning technique for extracting information about authors and topics from large text collections. We model documents as if they were generated by a...

Michal Rosen-Zvi, Chaitanya Chemudugunta, Thomas L...

TREC
2007

133 views, Information Technology

Parsimonious Language Models for a Terabyte of Text

15 years 4 months ago

Download trec.nist.gov

The aims of this paper are twofold. Our first aim is to compare results of the earlier Terabyte tracks to the Million Query track. We submitted a number of runs using different ...

Djoerd Hiemstra, Rongmei Li, Jaap Kamps, Rianne Ka...

ADC
2007
Springer

108 views, Database

Distributed Text Retrieval From Overlapping Collections

15 years 10 months ago

Download crpit.com

In standard text retrieval systems, the documents are gathered and indexed on a single server. In distributed information retrieval (DIR), the documents are held in multiple colle...

Milad Shokouhi, Justin Zobel, Yaniv Bernstein

CICLING
2009
Springer

335 views, Natural Language Processing

Language Identification on the Web: Extending the Dictionary Method

15 years 7 months ago

Download www.fi.muni.cz

Automated language identification of written text is a well-established research domain that has received considerable attention in the past. By now, efficient and effecti...

Radim Rehurek, Milan Kolkus

ICDAR
2009
IEEE

159 views, Document Analysis

Finding Images and Line-Drawings in Document-Scanning Systems

15 years 10 months ago

Download www.mangolassi.org

The system presented in this paper finds images and line-drawings in scanned pages; it is a crucial processing step in the creation of a large-scale system to detect and index ima...

Shumeet Baluja, Michele Covell
