Sciweavers

IEEEVAST
2010
12 years 10 months ago
Understanding text corpora with multiple facets
Text visualization becomes an increasingly more important research topic as the need to understand massive-scale textual information is proven to be imperative for many people and...
Lei Shi, Furu Wei, Shixia Liu, Li Tan, Xiaoxiao Li...
FCSC
2010
238views more  FCSC 2010»
13 years 1 months ago
Knowledge discovery through directed probabilistic topic models: a survey
Graphical models have become the basic framework for topic based probabilistic modeling. Especially models with latent variables have proved to be effective in capturing hidden str...
Ali Daud, Juanzi Li, Lizhu Zhou, Faqir Muhammad
COLING
2002
13 years 3 months ago
Scaled Log Likelihood Ratios for the Detection of Abbreviations in Text Corpora
We describe a language-independent, flexible, and accurate method for the detection of abbreviations in text corpora. It is based on the idea that an abbreviation can be viewed as...
Tibor Kiss, Jan Strunk
BMCBI
2008
150views more  BMCBI 2008»
13 years 3 months ago
BibGlimpse: The case for a light-weight reprint manager in distributed literature research
Background: While text-mining and distributed annotation systems both aim at capturing knowledge and presenting it in a standardized form, there have been few attempts to investig...
Thomas Tüchler, Golda Velez, Alexandra Graf, ...
ISIWI
2000
13 years 5 months ago
Aiding Web Searches by Statistical Classification Tools
We describe an infrastructure for the collection and management of large amounts of text, and discuss the possibility of information extraction and visualisation from text corpora...
Gerhard Heyer, Uwe Quasthoff, Christian Wolff
COLING
2000
13 years 5 months ago
Automatic Extraction of Semantic Relations from Specialized Corpora
In this paper we address the problem of discovering word semantic similarities via statistical processing of text corpora. We propose a knowledge-poor method that exploits the sen...
Aristomenis Thanopoulos, Nikos Fakotakis, George K...
ICML
2005
IEEE
14 years 4 months ago
Hierarchical Dirichlet model for document classification
The proliferation of text documents on the web as well as within institutions necessitates their convenient organization to enable efficient retrieval of information. Although tex...
Sriharsha Veeramachaneni, Diego Sona, Paolo Avesan...