Sciweavers

1372 search results for "Information retrieval on Turkish texts"
COLING
2010
14 years 9 months ago
Fast-Champollion: A Fast and Robust Sentence Alignment Algorithm
Sentence-level aligned parallel texts are important resources for a number of natural language processing (NLP) tasks and applications such as statistical machine translation and ...
Peng Li, Maosong Sun, Ping Xue
COLING
2000
15 years 3 months ago
The Week at a Glance - Cross-language Cross-document Information Extraction and Translation
Work on the production of texts in English describing instances of a particular event type from multiple news sources will be described. A system has been developed which extracts...
James R. Cowie, Yevgeny Ludovik, Hugo Molina-Salga...
KDD
2007
ACM
16 years 2 months ago
Raising the baseline for high-precision text classifiers
Many important application areas of text classifiers demand high precision and it is common to compare prospective solutions to the performance of Naive Bayes. This baseline is us...
Aleksander Kolcz, Wen-tau Yih
JCDL
2006
ACM
15 years 7 months ago
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
Jie Zou, Daniel X. Le, George R. Thoma
SIGIR
2003
ACM
15 years 7 months ago
Domain-independent text segmentation using anisotropic diffusion and dynamic programming
This paper presents a novel domain-independent text segmentation method, which identifies the boundaries of topic changes in long text documents and/or text streams. The method c...
Xiang Ji, Hongyuan Zha