JUCS 2010
LemmaGen: Multilingual Lemmatisation with Induced Ripple-Down Rules
Lemmatisation is the process of finding the normalised forms of words appearing in text. It is a useful preprocessing step for a number of language engineering and text mining t...
Matjaz Jursic, Igor Mozetic, Tomaz Erjavec, Nada L...
BMCBI 2005
BioCreAtIvE Task 1A: gene mention finding evaluation
Background: The biological research literature is a major repository of knowledge. As the amount of literature increases, it will get harder to find the information of interest on...
Alexander S. Yeh, Alexander A. Morgan, Marc E. Col...
BMCBI 2010
Moara: a Java library for extracting and normalizing gene and protein mentions
Background: Gene/protein recognition and normalization are important preliminary steps for many biological text mining tasks, such as information retrieval, protein-protein intera...
Mariana L. Neves, José María Carazo,...
KDD 2008 (Data Mining)
Mining multi-faceted overviews of arbitrary topics in a text collection
A common task in many text mining applications is to generate a multi-faceted overview of a topic in a text collection. Such an overview not only directly serves as an informative...
Xu Ling, Qiaozhu Mei, ChengXiang Zhai, Bruce R. Sc...