Sciweavers

735 search results - page 101 / 147
» Corpora and data preparation
Sort
View
ADMA
2009
Springer
142views Data Mining» more  ADMA 2009»
15 years 4 months ago
Crawling Deep Web Using a New Set Covering Algorithm
Abstract. Crawling the deep web often requires the selection of an appropriate set of queries so that they can cover most of the documents in the data source with low cost. This ca...
Yan Wang, Jianguo Lu, Jessica Chen
MLMI
2007
Springer
15 years 4 months ago
Meeting State Recognition from Visual and Aural Labels
In this paper we present a meeting state recognizer based on a combination of multi-modal sensor data in a smart room. Our approach is based on the training of a statistical model ...
Jan Curín, Pascal Fleury, Jan Kleindienst, ...
ACL
2007
14 years 11 months ago
Multilingual Transliteration Using Feature based Phonetic Method
In this paper we investigate named entity transliteration based on a phonetic scoring method. The phonetic method is computed using phonetic features and carefully designed pseudo...
Su-Youn Yoon, Kyoung-Young Kim, Richard Sproat
LREC
2010
139views Education» more  LREC 2010»
14 years 11 months ago
Creation of Lexical Resources for a Characterisation of Multiword Expressions in Italian
The theoretical characterisation of multiword expressions (MWEs) is tightly connected to their actual occurrences in data and to their representation in lexical resources. We pres...
Andrea Zaninello, Malvina Nissim
LREC
2008
171views Education» more  LREC 2008»
14 years 11 months ago
Corpus Co-Occurrence, Dictionary and Wikipedia Entries as Resources for Semantic Relatedness Information
Distributional, corpus-based descriptions have frequently been applied to model aspects of word meaning. However, distributional models that use corpus data as their basis have on...
Michael Roth, Sabine Schulte im Walde