Sciweavers

735 search results - page 18 / 147
» Corpora and data preparation
Sort
View
KDD
2005
ACM
185views Data Mining» more  KDD 2005»
15 years 10 months ago
Mining comparable bilingual text corpora for cross-language information integration
Integrating information in multiple natural languages is a challenging task that often requires manually created linguistic resources such as a bilingual dictionary or examples of...
Tao Tao, ChengXiang Zhai
SDM
2012
SIAM
258views Data Mining» more  SDM 2012»
13 years 10 days ago
Feature Selection with Linked Data in Social Media
Feature selection is widely used in preparing highdimensional data for effective data mining. Increasingly popular social media data presents new challenges to feature selection....
Jiliang Tang, Huan Liu
LREC
2008
82views Education» more  LREC 2008»
14 years 11 months ago
SpatialML: Annotation Scheme, Corpora, and Tools
SpatialML is an annotation scheme for marking up references to places in natural language. It covers both named and nominal references to places, grounding them where possible wit...
Inderjeet Mani, Janet Hitzeman, Justin Richer, Dav...
LREC
2008
109views Education» more  LREC 2008»
14 years 11 months ago
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion
Parallel text is one of the most valuable resources for development of statistical machine translation systems and other NLP applications. The Linguistic Data Consortium (LDC) has...
Kazuaki Maeda, Xiaoyi Ma, Stephanie Strassel
TREC
2004
14 years 11 months ago
Question Answering by Searching Large Corpora With Linguistic Methods
In this paper we describe the QuALiM Question Answering system which uses linguistic analysis of questions as well as candidate sentences in its answer finding process. To this en...
Michael Kaißer, Tilman Becker