Sciweavers

8795 search results - page 213 / 1759
» Measuring Generality of Documents
Sort
View
ACL
2011
14 years 8 months ago
From Bilingual Dictionaries to Interlingual Document Representations
Mapping documents into an interlingual representation can help bridge the language barrier of a cross-lingual corpus. Previous approaches use aligned documents as training data to...
Jagadeesh Jagarlamudi, Hal Daumé III, Ragha...
SIGIR
2011
ACM
14 years 7 months ago
When documents are very long, BM25 fails!
We reveal that the Okapi BM25 retrieval function tends to overly penalize very long documents. To address this problem, we present a simple yet effective extension of BM25, namel...
Yuanhua Lv, ChengXiang Zhai
CLEF
2005
Springer
15 years 10 months ago
Pitt at CLEF05: Data Fusion for Spoken Document Retrieval
Abstract. This paper describes an investigation of data fusion techniques for spoken document retrieval. The effectiveness of retrievals solely based on the outputs from automatic...
Daqing He, Jae-wook Ahn
ICAIL
2007
ACM
15 years 8 months ago
The Legal-RDF Ontology. A Generic Model for Legal Documents
Legal-RDF.org1 publishes a practical ontology that models both the layout and content of a document and metadata about the document; these have been built using data models implici...
John McClure
ICPR
2006
IEEE
16 years 5 months ago
Detecting Text Lines in Handwritten Documents
Although detecting text lines in machine printed documents is typically considered a solved problem, it is still a challenge to segment handwritten text lines in the general sense...
Yi Li, Yefeng Zheng, David S. Doermann