Sciweavers

910 search results - page 166 / 182
» Standardization of Speech Corpus
Sort
View
FLAIRS
2009
14 years 8 months ago
Improving Biomedical Document Retrieval by Mining Domain Knowledge
When research articles introduce new findings or concepts they typically relate them only to knowledge and domain concepts of immediate relevance. However, many domain concepts re...
Shuguang Wang, Milos Hauskrecht
87
Voted
COLING
2010
14 years 6 months ago
Discriminative Induction of Sub-Tree Alignment using Limited Labeled Data
We employ Maximum Entropy model to conduct sub-tree alignment between bilingual phrasal structure trees. Various lexical and structural knowledge is explored to measure the syntac...
Jun Sun, Min Zhang, Chew Lim Tan
110
Voted
EMNLP
2011
13 years 10 months ago
Named Entity Recognition in Tweets: An Experimental Study
People tweet more than 100 Million times daily, yielding a noisy, informal, but sometimes informative corpus of 140-character messages that mirrors the zeitgeist in an unprecedent...
Alan Ritter, Sam Clark, Mausam, Oren Etzioni
103
Voted
WSDM
2012
ACM
236views Data Mining» more  WSDM 2012»
13 years 6 months ago
Effective query formulation with multiple information sources
Most standard information retrieval models use a single source of information (e.g., the retrieval corpus) for query formulation tasks such as term and phrase weighting and query ...
Michael Bendersky, Donald Metzler, W. Bruce Croft

Dataset
924views
15 years 6 months ago
SCUT-COUCH2009 - A Comprehensive Online Unconstrained Handwriting Database
SCUT-COUCH 2009 database is a comprehensive database that consists of 12 datasets, namely GB1, GB2, TradGB1, Big5, Pinyin, Letters, Digit, Symbol, Word8888, Word17366, Word44208 an...
Lianwen Jin