Sciweavers

4199 search results - page 125 / 840
» Generalizing Data in Natural Language
Sort
View
193
Voted
KDD
2008
ACM
199views Data Mining» more  KDD 2008»
16 years 6 months ago
Building semantic kernels for text classification using wikipedia
Document classification presents difficult challenges due to the sparsity and the high dimensionality of text data, and to the complex semantics of the natural language. The tradi...
Pu Wang, Carlotta Domeniconi
154
Voted
CICLING
2009
Springer
15 years 10 months ago
Language Identification on the Web: Extending the Dictionary Method
Abstract. Automated language identification of written text is a wellestablished research domain that has received considerable attention in the past. By now, efficient and effecti...
Radim Rehurek, Milan Kolkus
OOPSLA
2010
Springer
15 years 4 months ago
Managing ambiguity in programming by finding unambiguous examples
We propose a new way to raise the level of discourse in the programming process: permit ambiguity, but manage it by linking it to unambiguous examples. This allows programming env...
Kenneth C. Arnold, Henry Lieberman
KI
2004
Springer
15 years 11 months ago
Generation of Sentence Parse Trees Using Parts of Speech
This paper proposes a new corpus-based approach for deriving syntactic structures and generating parse trees of natural language sentences. The parts of speech (word categories) of...
Tunga Güngör
160
Voted
EMNLP
2010
15 years 4 months ago
Discriminative Instance Weighting for Domain Adaptation in Statistical Machine Translation
We describe a new approach to SMT adaptation that weights out-of-domain phrase pairs according to their relevance to the target domain, determined by both how similar to it they a...
George F. Foster, Cyril Goutte, Roland Kuhn