Sciweavers

2714 search results - page 250 / 543
» Machine Learning for Information Extraction
Sort
View
117
Voted
ACMICEC
2006
ACM
141views ECommerce» more  ACMICEC 2006»
15 years 9 months ago
From HTML documents to web tables and rules
We present a browser-extending Semantic Web extraction system that maps HTML documents to tables and, where possible, to rules. First, the basic data extractor ViPER distills and ...
Kai Simon, Georg Lausen, Harold Boley
LREC
2010
145views Education» more  LREC 2010»
15 years 4 months ago
Generic Ontology Learners on Application Domains
In ontology learning from texts, we have ontology-rich domains where we have large structured domain knowledge repositories or we have large general corpora with large general str...
Francesca Fallucchi, Maria Teresa Pazienza, Fabio ...
LREC
2008
90views Education» more  LREC 2008»
15 years 4 months ago
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
Parallel corpora are critical resources for machine translation research and development since parallel corpora contain translation equivalences of various granularities. Manual a...
Yujie Zhang, Zhulong Wang, Kiyotaka Uchimoto, Qing...
204
Voted
RIDE
2003
IEEE
15 years 8 months ago
Exploiting multi-lingual text potentialities in EBMT systems
Translating documents from a source to a target language is a repetitive activity. The attempt to automate such a difficult task has been a long-term scientific dream. Among the...
Federica Mandreoli, Riccardo Martoglia, Paolo Tibe...
116
Voted
ECML
2003
Springer
15 years 8 months ago
A Two-Level Learning Method for Generalized Multi-instance Problems
In traditional multi-instance (MI) learning, a single positive instance in a bag produces a positive class label. Hence, the learner knows how the bag’s class label depends on th...
Nils Weidmann, Eibe Frank, Bernhard Pfahringer