Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

12

ACL
2011

favoriteEmaildiscussreport

180views Computational Linguistics» more ACL 2011»

Effective Measures of Domain Similarity for Parsing

12 years 8 months ago

Effective Measures of Domain Similarity for Parsing

Download www.let.rug.nl

It is well known that parsing accuracy suffers when a model is applied to out-of-domain data. It is also known that the most beneﬁcial data to parse a given domain is data that matches the domain (Sekine, 1997; Gildea, 2001). Hence, an important task is to select appropriate domains. However, most previous work on domain adaptation relied on the implicit assumption that domains are somehow given. As more and more data is becoming available, automatic ways to select data that is beneﬁcial for a new (unknown) target domain are becoming attractive. This paper evaluates various ways to automatically acquire related training data for a given test set. The results show that an unsupervised technique based on topic models is effective – it outperforms random data selection on both examined languages, English and Dutch. Moreover, the technique works better than manually assigned labels gathered from meta-data that is available for English.

Barbara Plank, Gertjan van Noord

Real-time Traffic

ACL 2011 | Computational Linguistics | Implicit Assumption | Meta Data | Target Domain |

claim paper

Related Content

» On domain similarity and effectiveness of adaptingtorank

» Automatic Prediction of Parser Accuracy

» SimRank a measure of structuralcontext similarity

» A comparative study of similarity measures for contentbased multimedia retrieval

» A Comparative Study of Ontology Based Term Similarity Measures on PubMed Document Clusteri...

» A new video similarity measure model based on video time density function and dynamic prog...

» Effects of user similarity in social media

» A Fuzzy Nonlinear Similarity Measure for CaseBased Reasoning Systems for Radiotherapy Trea...

» CLUSS Clustering of protein sequences based on a new similarity measure

Post Info
More Details (n/a)

Added	23 Aug 2011
Updated	23 Aug 2011
Type	Journal
Year	2011
Where	ACL
Authors	Barbara Plank, Gertjan van Noord

Comments (0)