Sciweavers

735 search results - page 120 / 147
» Corpora and data preparation
Sort
View
ICASSP
2008
IEEE
15 years 4 months ago
Optimizing the acoustic modeling from an unbalanced bi-lingual corpus
Phoneme set clustering of accurate modeling is important in the task of multilingual speech recognition, especially when each of the available language training corpora is mismatc...
Dau-cheng Lyu, Ren-yuan Lyu
MICAI
2007
Springer
15 years 4 months ago
Fuzzifying Clustering Algorithms: The Case Study of MajorClust
Among various document clustering algorithms that have been proposed so far, the most useful are those that automatically reveal the number of clusters and assign each target docum...
Eugene Levner, David Pinto, Paolo Rosso, David Alc...
AMR
2005
Springer
117views Multimedia» more  AMR 2005»
15 years 3 months ago
Learning User Queries in Multimodal Dissimilarity Spaces
Abstract. Different strategies to learn user semantic queries from dissimilarity representations of video audio-visual content are presented. When dealing with large corpora of vi...
Eric Bruno, Nicolas Moënne-Loccoz, Sté...
IJCNLP
2005
Springer
15 years 3 months ago
A Lexicon-Constrained Character Model for Chinese Morphological Analysis
Abstract. This paper proposes a lexicon-constrained character model that combines both word and character features to solve complicated issues in Chinese morphological analysis. A ...
Yao Meng, Hao Yu, Fumihito Nishino
AIRS
2004
Springer
15 years 3 months ago
Document Clustering Using Linear Partitioning Hyperplanes and Reallocation
This paper presents a novel algorithm for document clustering based on a combinatorial framework of the Principal Direction Divisive Partitioning (PDDP) algorithm [1] and a simpli...
Canasai Kruengkrai, Virach Sornlertlamvanich, Hito...