This paper examines several different approaches to exploiting structural information in semi-structured document categorization. The methods under consideration are designed for ...
Our algorithm, ParaMor, fared well in Morpho Challenge 2007 (Kurimo et al., 2007), a peer operated competition pitting against one another algorithms designed to discover the morp...
Christian Monson, Jaime G. Carbonell, Alon Lavie, ...
In this work we try to bridge the gap often encountered by researchers who find themselves with few or no labeled examples from their desired target domain, yet still have access ...
Biomedical corpora annotated with event-level information provide an important resource for the training of domain-specific information extraction (IE) systems. These corpora conc...
Raheel Nawaz, Paul Thompson, John McNaught, Sophia...
Abstract. The goal of the INEX 2009 Book Track is to evaluate approaches for supporting users in reading, searching, and navigating the full texts of digitized books. The investiga...
Gabriella Kazai, Antoine Doucet, Marijn Koolen, Mo...