Sciweavers

39 search results - page 3 / 8
» Automatic metadata extraction from multilingual enterprise c...
Sort
View
SIGMOD
2006
ACM
107views Database» more  SIGMOD 2006»
14 years 5 months ago
Documentum ECI self-repairing wrappers: performance analysis
Documentum Enterprise Content Integration (ECI) services is a content integration middleware that provides one-query access to the Intranet and Internet content resources. The ECI...
Boris Chidlovskii, Bruno Roustant, Marc Brette
LREC
2010
189views Education» more  LREC 2010»
13 years 3 months ago
NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems
Availability of labeled language resources, such as annotated corpora and domain dependent labeled language resources is crucial for experiments in the field of Natural Language ...
Eric Charton, Juan Manuel Torres Moreno
MT
2002
297views more  MT 2002»
13 years 4 months ago
MARS: A Statistical Semantic Parsing and Generation-Based Multilingual Automatic tRanslation System
We present MARS (Multilingual Automatic tRanslation System), a research prototype speech-to-speech translation system. MARS is aimed at two-way conversational spoken language trans...
Yuqing Gao, Bowen Zhou, Zijian Diao, Jeffrey S. So...
COLING
2010
12 years 12 months ago
Unsupervised Synthesis of Multilingual Wikipedia Articles
In this paper, we propose an unsupervised approach to automatically synthesize Wikipedia articles in multiple languages. Taking an existing high-quality version of any entry as co...
Yuncong Chen, Pascale Fung
ICDAR
2009
IEEE
13 years 11 months ago
Scalable Feature Extraction from Noisy Documents
We cope with the metadata recognition in layoutoriented documents. We address the problem as a classification task and propose a method for automatic extraction of relevant featu...
Loïc Lecerf, Boris Chidlovskii