large text collections

14

EMNLP
2009

159views Natural Language Processing» more EMNLP 2009»

13 years 2 months ago

Topic models are a useful tool for analyzing large text collections, but have previously been applied in only monolingual, or at most bilingual, contexts. Meanwhile, massive colle...

David M. Mimno, Hanna M. Wallach, Jason Naradowsky...

claim paper

Read More »

10

click to vote

APWEB
2006
Springer

102views Internet Technology» more APWEB 2006»

The Case of the Duplicate Documents Measurement, Search, and Science

13 years 8 months ago

Download goanna.cs.rmit.edu.au

Many of the documents in large text collections are duplicates and versions of each other. In recent research, we developed new methods for finding such duplicates; however, as the...

Justin Zobel, Yaniv Bernstein

claim paper

Read More »

9

click to vote

CIKM
2000
Springer

97views Information Technology» more CIKM 2000»

Collection Selection and Results Merging with Topically Organized U.S. Patents and TREC Data

13 years 9 months ago

Download delivery.acm.org

We investigate three issues in distributed information retrieval, considering both TREC data and U.S. Patents: (1) topical organization of large text collections, (2) collection r...

Leah S. Larkey, Margaret E. Connell, James P. Call...

claim paper

Read More »

16

click to vote

SAC
2005
ACM

141views Applied Computing» more SAC 2005»

Mining concept associations for knowledge discovery in large textual databases

13 years 10 months ago

Download www.ualr.edu

In this paper, we describe a new approach for mining concept associations from large text collections. The concepts are short sequences of words that occur frequently together acr...

Xiaowei Xu, Mutlu Mete, Nurcan Yuruk

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers