Sciweavers

1527 search results - page 63 / 306
» Hidden word statistics
Sort
View
ANLP
1997
80views more  ANLP 1997»
15 years 1 months ago
Sequential Model Selection for Word Sense Disambiguation
Statistical models of word-sense disambiguation are often based on a small number of contextual features or on a model that is assumed to characterize the interactions among a set...
Ted Pedersen, Rebecca F. Bruce, Janyce Wiebe
CORR
2008
Springer
115views Education» more  CORR 2008»
14 years 12 months ago
Determining the Unithood of Word Sequences using Mutual Information and Independence Measure
Most works related to unithood were conducted as part of a larger effort for the determination of termhood. Consequently, the number of independent research that study the notion ...
Wilson Wong, Wei Liu, Mohammed Bennamoun
NIPS
2004
15 years 1 months ago
Hierarchical Distributed Representations for Statistical Language Modeling
Statistical language models estimate the probability of a word occurring in a given context. The most common language models rely on a discrete enumeration of predictive contexts ...
John Blitzer, Kilian Q. Weinberger, Lawrence K. Sa...
CORR
2002
Springer
90views Education» more  CORR 2002»
14 years 11 months ago
Mostly-Unsupervised Statistical Segmentation of Japanese Kanji Sequences
Given the lack of word delimiters in written Japanese, word segmentation is generally considered a crucial first step in processing Japanese texts. Typical Japanese segmentation a...
Rie Kubota Ando, Lillian Lee
BMCBI
2008
132views more  BMCBI 2008»
14 years 12 months ago
The SeqWord Genome Browser: an online tool for the identification and visualization of atypical regions of bacterial genomes thr
Background: Data mining in large DNA sequences is a major challenge in microbial genomics and bioinformatics. Oligonucleotide usage (OU) patterns provide a wealth of information f...
Hamilton Ganesan, Anna S. Rakitianskaia, Colin F. ...