Sciweavers

241 search results - page 9 / 49
» Detecting Co-Derivative Documents in Large Text Collections
ICDAR
2005
IEEE
15 years 3 months ago
Document Ranking by Layout Relevance
This paper describes the development of a new document ranking system based on layout similarity. The user has a need represented by a set of "wanted" documents, and the syste...
May Huang, Daniel DeMenthon, David S. Doermann, Ly...
TKDE
1998
14 years 9 months ago
Performance Analysis of Three Text-Join Algorithms
When a multidatabase system contains textual database systems (i.e., information retrieval systems), queries against the global schema of the multidatabase system may contain a ...
Weiyi Meng, Clement T. Yu, Wei Wang 0010, Naphtali...
KDD
2002
ACM
15 years 10 months ago
A parallel learning algorithm for text classification
Text classification is the process of classifying documents into predefined categories based on their content. Existing supervised learning algorithms to automatically classify te...
Canasai Kruengkrai, Chuleerat Jaruskulchai
LREC
2010
14 years 11 months ago
Enhanced Infrastructure for Creation and Collection of Translation Resources
Statistical Machine Translation (MT) systems have achieved impressive results in recent years, due in large part to the increasing availability of parallel text for system trainin...
Zhiyi Song, Stephanie Strassel, Gary Krug, Kazuaki...
KDD
2007
ACM
15 years 10 months ago
Detecting research topics via the correlation between graphs and texts
In this paper we address the problem of detecting topics in large-scale linked document collections. Recently, topic detection has become a very active area of research due to its...
Yookyung Jo, Carl Lagoze, C. Lee Giles