Search Sciweavers | Sciweavers

241 search results - page 17 / 49

» Detecting Co-Derivative Documents in Large Text Collections

111

Voted

ICDE
2008
IEEE

113views Database» more ICDE 2008»

A rank-rewrite framework for summarizing XML documents

15 years 6 months ago

Download www.mpi-inf.mpg.de

Abstract— With XML becoming a standard for data representation and exchange, we can expect to see large scale repositories and warehouses of XML data. In order for users to under...

Maya Ramanath, Kondreddi Sarath Kumar

claim paper

Read More »

click to vote

IPM
2007

95views more IPM 2007»

Using structural contexts to compress semistructured text collections

14 years 11 months ago

Download www.dcc.uchile.cl

We describe a compression model for semistructured documents, called Structural Contexts Model (SCM), which takes advantage of the context information usually implicit in the stru...

Joaquín Adiego, Gonzalo Navarro, Pablo de l...

claim paper

Read More »

click to vote

IPM
2006

151views more IPM 2006»

Document clustering using nonnegative matrix factorization

14 years 11 months ago

Download www.math.wfu.edu

A methodology for automatically identifying and clustering semantic features or topics in a heterogeneous text collection is presented. Textual data is encoded using a low rank no...

Farial Shahnaz, Michael W. Berry, V. Paul Pauca, R...

claim paper

Read More »

click to vote

CIKM
2006
Springer

138views Information Technology» more CIKM 2006»

A document-centric approach to static index pruning in text retrieval systems

15 years 3 months ago

Download stefan.buettcher.org

We present a static index pruning method, to be used in ad-hoc document retrieval tasks, that follows a documentcentric approach to decide whether a posting for a given term shoul...

Stefan Büttcher, Charles L. A. Clarke

claim paper

Read More »

click to vote

EMNLP
2009

159views Natural Language Processing» more EMNLP 2009»

Polylingual Topic Models

14 years 9 months ago

Download www.cs.umass.edu

Topic models are a useful tool for analyzing large text collections, but have previously been applied in only monolingual, or at most bilingual, contexts. Meanwhile, massive colle...

David M. Mimno, Hanna M. Wallach, Jason Naradowsky...

claim paper

Read More »

« Prev « First page 17 / 49 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers