text collection | Sciweavers

CATA
1998

123 views | Computer Science

QVI: Query-based virtual index for distributed information retrieval

13 years 5 months ago

The large unstructured text collections demand full-text search capabilities from IR systems. Current systems typically allow users only to connect to a single database (or site) ...

Dong-gyu Kim, Sang-goo Lee

FLAIRS
2004

105 views | Artificial Intelligence

Mining On-line Sources for Definition Knowledge

13 years 5 months ago

Download www.aaai.org

Finding definitions in huge text collections is a challenging problem, not only because of the many ways in which definitions can be conveyed in natural language texts but also be...

Horacio Saggion, Robert J. Gaizauskas

ACL
2006

75 views | Computational Linguistics

A Term Recognition Approach to Acronym Recognition

13 years 5 months ago

Download acl.ldc.upenn.edu

We present a term recognition approach to extract acronyms and their definitions from a large text collection. Parenthetical expressions appearing in a text collection are identif...

Naoaki Okazaki, Sophia Ananiadou

SCCC
1998
IEEE

108 views | Theoretical Computer Science

Parallel Generation of Inverted Files for Distributed Text Collections

13 years 8 months ago

Download www.dcc.uchile.cl

We present a scalable algorithm for the parallel computation of inverted files for large text collections. The algorithm takes into account an environment of a high bandwidth netw...

Berthier A. Ribeiro-Neto, Joao Paulo Kitajima, Gon...

SIGIR
1999
ACM

111 views | Information Technology

Efficient Distributed Algorithms to Build Inverted Files

13 years 8 months ago

Download homepages.dcc.ufmg.br

We present three distributed algorithms to build global inverted files for very large text collections. The distributed environment we use is a high bandwidth network of workstati...

Berthier A. Ribeiro-Neto, Edleno Silva de Moura, M...

SIGIR
2000
ACM

116 views | Information Technology

On the design and evaluation of a multi-dimensional approach to information retrieval

13 years 8 months ago

Download www.ir.iit.edu

We present a method of searching text collections that takes advantage of hierarchical information within documents and integrates searches of structured and unstructured data. W...

M. Catherine McCabe, Jinho Lee, Abdur Chowdhury, D...

ERCIMDL
2005
Springer

114 views | Education

Compressing Dynamic Text Collections via Phrase-Based Coding

13 years 10 months ago

Download www.dcc.uchile.cl

We present a new statistical compression method, which we call Phrase Based Dense Code (PBDC), aimed at compressing large digital libraries. PBDC compresses the text collection to ...

Nieves R. Brisaboa, Antonio Fariña, Gonzalo...

CIKM
2007
Springer

125 views | Information Technology

"More like these": growing entity classes from seeds

13 years 10 months ago

Download staff.science.uva.nl

We present a corpus-based approach to the class expansion task. For a given set of seed entities we use co-occurrence statistics taken from a text collection to define a membersh...

Luís Sarmento, Valentin Jijkoun, Maarten de...
