Protein names and how to find them

14 years 12 months ago

Download www.sics.se

A prerequisite for all higher level information extraction tasks is the identication of unknown names in text. Today, when large corpora can consist of billions of words, it is of utmost importance to develop accurate techniques for the automatic detection, extraction and categorization of named entities in these corpora. Although named entity recognition might be regarded a solved problem in some domains, it still poses a signicant challenge in others. In this work we focus on one of the more dicult tasks, the identication of protein names in text. This task presents several interesting diculties because of the named entities' variant structural characteristics, their sometimes unclear status as names, the lack of common standards and xed nomenclatures, and the specics of the texts in the molecular biology domain in which they appear. We describe how we approached these and other diculties in the implementation of Yapex, a system for the automatic identication of protein names i...

Kristofer Franzén, Gunnar Eriksson, Fredrik

Real-time Traffic

IJMI 2002 | Information Extraction Tasks | Protein Names | Unknown Names |

claim paper

» Matching Names and Definitions of Topological Operators

» New kernels for protein structural motif discovery and function classification

» Information Integration Across Heterogeneous Sources Where Do We Stand and How to Proceed

» ProtSweep 2Dsweep and DomainSweep protein analysis suite at DKFZ

» Multiclass protein fold recognition using large margin logic based divide and conquer lear...

» An enhanced partial order curve comparison algorithm and its application to analyzing prot...

» Diffusion of Protein Receptors on a Cylindrical Dendritic Membrane with Partially Absorbin...

» A hubattachment based method to detect functional modules from confidencescored protein in...

Post Info
More Details (n/a)

Added	22 Dec 2010
Updated	22 Dec 2010
Type	Journal
Year	2002
Where	IJMI
Authors	Kristofer Franzén, Gunnar Eriksson, Fredrik Olsson, Lars Asker, Per Lidén, Joakim Cöster

Comments (0)

Sciweavers

Protein names and how to find them

IJMI 2002 | Information Extraction Tasks | Protein Names | Unknown Names |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers