Search Sciweavers | Sciweavers

TSD
2001
Springer

Finding Semantically Related Words in Large Corpora

15 years 2 months ago

The paper deals with the linguistic problem of fully automatic grouping of semantically related words. We discuss the measures of semantic relatedness of basic word forms and descr...

Pavel Smrz, Pavel Rychlý

ACL
2009

Extracting Paraphrases of Technical Terms from Noisy Parallel Software Corpora

14 years 7 months ago

In this paper, we study the problem of extracting technical paraphrases from a parallel software corpus, namely, a collection of duplicate bug reports. Paraphrase acquisition is a...

Xiaoyin Wang, David Lo, Jing Jiang, Lu Zhang, Hong...

ACL
2010

How Spoken Language Corpora Can Refine Current Speech Motor Training Methodologies

14 years 8 months ago

The growing availability of spoken language corpora presents new opportunities for enriching the methodologies of speech and language therapy. In this paper, we present a novel ap...

Daniil Umanski, Federico Sangati

EMNLP
2008

N-gram Weighting: Reducing Training Data Mismatch in Cross-Domain Language Model Estimation

14 years 11 months ago

In domains with insufficient matched training data, language models are often constructed by interpolating component models trained from partially matched corpora. Since the ngram...

Bo-June Paul Hsu, James R. Glass

EACL
2003
ACL Anthology

Experiments on Candidate Data for Collocation Extraction

14 years 11 months ago

The paper describes ongoing work on the evaluation of methods for extracting collocation candidates from large text corpora. Our research is based on a German treebank corpus used...

Stefan Evert, Hannah Kermes
