Sciweavers

2190 search results - page 111 / 438
» Unweaving a web of documents
TREC
2008
15 years 2 months ago
Combining Candidate and Document Models for Expert Search
We describe our participation in the TREC 2008 Enterprise track and detail our language modeling-based approaches. For document search, our focus was on query expansion using pro...
Krisztian Balog, Maarten de Rijke
COLING
1996
15 years 2 months ago
Identifying the Coding System and Language of On-line Documents on the Internet
This paper proposes a new algorithm that simultaneously identifies the coding system and language of a code string fetched from the Internet, especially World-Wide Web. The algori...
Gen-itiro Kikui
COLING
2010
14 years 7 months ago
Large Scale Parallel Document Mining for Machine Translation
A distributed system is described that reliably mines parallel text from large corpora. The approach can be regarded as cross-language near-duplicate detection, enabled by an init...
Jakob Uszkoreit, Jay Ponte, Ashok C. Popat, Moshe ...
LREC
2008
15 years 2 months ago
Producing an Encyclopedic Dictionary using Patent Documents
Although the World Wide Web has of late become an important source to consult for the meaning of words, a number of technical terms related to high technology are not found on the...
Atsushi Fujii
IADIS
2003
15 years 2 months ago
Querying Databases and XML Documents: Comparative Study and a New Proposal
XML has become the most useful standard of data interchange in the web and e-business world and there is a large amount of information stored in this format. Nonetheless, a large ...
Ana Fermoso García, María José...