Sciweavers

1719 search results - page 223 / 344
» Supervised Dictionary Learning
Sort
View
EMNLP
2007
14 years 11 months ago
Bootstrapping Information Extraction from Field Books
We present two machine learning approaches to information extraction from semi-structured documents that can be used if no annotated training data are available, but there does ex...
Sander Canisius, Caroline Sporleder
LREC
2008
88views Education» more  LREC 2008»
14 years 11 months ago
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization
Tokenization is one of the initial steps done for almost any text processing task. It is not particularly recognized as a challenging task for English monolingual systems but it r...
Oana Frunza
LREC
2008
160views Education» more  LREC 2008»
14 years 11 months ago
Automatic Extraction of Textual Elements from News Web Pages
In this paper we present an algorithm for automatic extraction of textual elements, namely titles and full text, associated with news stories in news web pages. We propose a super...
Hossam Ibrahim, Kareem Darwish, Abdel-Rahim Madany
LREC
2008
183views Education» more  LREC 2008»
14 years 11 months ago
Active Annotation in the LUNA Italian Corpus of Spontaneous Dialogues
In this paper we present an active approach to annotate with lexical and semantic labels an Italian corpus of conversational human-human and Wizard-of-Oz dialogues. This procedure...
Christian Raymond, Kepa Joseba Rodriguez, Giuseppe...
NIPS
2007
14 years 11 months ago
Hierarchical Penalization
Hierarchical penalization is a generic framework for incorporating prior information in the fitting of statistical models, when the explicative variables are organized in a hiera...
Marie Szafranski, Yves Grandvalet, Pierre Morizet-...