Sciweavers

48 search results - page 7 / 10
» A Syllabification Algorithm for Spanish
Sort
View
KDD
2009
ACM
168views Data Mining» more  KDD 2009»
15 years 10 months ago
Name-ethnicity classification from open sources
The problem of ethnicity identification from names has a variety of important applications, including biomedical research, demographic studies, and marketing. Here we report on th...
Anurag Ambekar, Charles B. Ward, Jahangir Mohammed...
WSDM
2009
ACM
148views Data Mining» more  WSDM 2009»
15 years 4 months ago
Information arbitrage across multi-lingual Wikipedia
The rapid globalization of Wikipedia is generating a parallel, multi-lingual corpus of unprecedented scale. Pages for the same topic in many different languages emerge both as a r...
Eytan Adar, Michael Skinner, Daniel S. Weld
SIGIR
2005
ACM
15 years 3 months ago
Bootstrapping dictionaries for cross-language information retrieval
The bottleneck for dictionary-based cross-language information retrieval is the lack of comprehensive dictionaries, in particular for many different languages. We here introduce a...
Kornél G. Markó, Stefan Schulz, Olen...
CICLING
2005
Springer
15 years 3 months ago
A Machine Learning Approach to Information Extraction
Information extraction is concerned with applying natural language processing to automatically extract the essential details from text documents. A great disadvantage of current ap...
Alberto Téllez-Valero, Manuel Montes-y-G&oa...
IBPRIA
2007
Springer
15 years 1 months ago
A Simple But Effective Approach to Speaker Tracking in Broadcast News
The automatic transcription of broadcast news and meetings involves the segmentation, identification and tracking of speaker turns during each session, which is known as speaker di...
Luis Javier Rodríguez, Mikel Peñagar...