Sciweavers

CATA
1998
13 years 5 months ago
QVI: Query-based virtual index for distributed information retrieval
The large unstructured text collections demand full-text search capabilities from IR systems. Current systems typically allow users only to connect to a single database (or site) ...
Dong-gyu Kim, Sang-goo Lee
FLAIRS
2004
13 years 5 months ago
Mining On-line Sources for Definition Knowledge
Finding definitions in huge text collections is a challenging problem, not only because of the many ways in which definitions can be conveyed in natural language texts but also be...
Horacio Saggion, Robert J. Gaizauskas
ACL
2006
13 years 5 months ago
A Term Recognition Approach to Acronym Recognition
We present a term recognition approach to extract acronyms and their definitions from a large text collection. Parenthetical expressions appearing in a text collection are identif...
Naoaki Okazaki, Sophia Ananiadou
SCCC
1998
IEEE
13 years 8 months ago
Parallel Generation of Inverted Files for Distributed Text Collections
We present a scalable algorithm for the parallel computation of inverted files for large text collections. The algorithm takes into account an environment of a high bandwidth netw...
Berthier A. Ribeiro-Neto, Joao Paulo Kitajima, Gon...
SIGIR
1999
ACM
13 years 8 months ago
Efficient Distributed Algorithms to Build Inverted Files
We present three distributed algorithms to build global inverted files for very large text collections. The distributed environment we use is a high bandwidth network of workstati...
Berthier A. Ribeiro-Neto, Edleno Silva de Moura, M...
SIGIR
2000
ACM
13 years 8 months ago
On the design and evaluation of a multi-dimensional approach to information retrieval
We present a method of searching text collections that takes advantage of hierarchrical information within documents and integrates searches of structured and unstructured data. W...
M. Catherine McCabe, Jinho Lee, Abdur Chowdhury, D...
ERCIMDL
2005
Springer
114views Education» more  ERCIMDL 2005»
13 years 10 months ago
Compressing Dynamic Text Collections via Phrase-Based Coding
We present a new statistical compression method, which we call Phrase Based Dense Code (PBDC), aimed at compressing large digital libraries. PBDC compresses the text collection to ...
Nieves R. Brisaboa, Antonio Fariña, Gonzalo...
CIKM
2007
Springer
13 years 10 months ago
"More like these": growing entity classes from seeds
We present a corpus-based approach to the class expansion task. For a given set of seed entities we use co-occurrence statistics taken from a text collection to define a membersh...
Luís Sarmento, Valentin Jijkoun, Maarten de...