Sciweavers

19 search results - page 4 / 4
» Efficient Phrase-Based Document Similarity for Clustering
Sort
View
CORR
2006
Springer
142views Education» more  CORR 2006»
13 years 4 months ago
Exploiting multilingual nomenclatures and language-independent text features as an interlingua for cross-lingual text analysis a
We are proposing a simple, but efficient basic approach for a number of multilingual and cross-lingual language technology applications that are not limited to the usual two or th...
Ralf Steinberger, Bruno Pouliquen, Camelia Ignat
ICDAR
2009
IEEE
13 years 2 months ago
Low Cost Correction of OCR Errors Using Learning in a Multi-Engine Environment
We propose a low cost method for the correction of the output of OCR engines through the use of human labor. The method employs an error estimator neural network that learns to as...
Ahmad Abdulkader, Mathew R. Casey
WWW
2008
ACM
14 years 5 months ago
Tag-based social interest discovery
The success and popularity of social network systems, such as del.icio.us, Facebook, MySpace, and YouTube, have generated many interesting and challenging problems to the research...
Xin Li, Lei Guo, Yihong Eric Zhao
BMCBI
2010
138views more  BMCBI 2010»
13 years 5 months ago
UFFizi: a generic platform for ranking informative features
Background: Feature selection is an important pre-processing task in the analysis of complex data. Selecting an appropriate subset of features can improve classification or cluste...
Assaf Gottlieb, Roy Varshavsky, Michal Linial, Dav...