Sciweavers

COLING
2010

Exploring variation across biomedical subdomains

Previous research has demonstrated the importance of handling differences between domains such as "newswire" and "biomedicine" when porting NLP systems from one domain to another. In this paper we identify the related issue of subdomain variation, i.e., differences between subsets of a domain that might be expected to behave homogeneously. Using a large corpus of research articles, we explore how subdomains of biomedicine vary across a variety of linguistic dimensions and discover that there is rich variation. We conclude that an awareness of such variation is necessary when deploying NLP systems for use in single or multiple subdomains.
Added 13 May 2011
Updated 13 May 2011
Type Journal
Year 2010
Where COLING
Authors Tom Lippincott, Diarmuid Ó Séaghdha, Lin Sun, Anna Korhonen