Hybrid semantic tagging for information extraction

16 years 8 months ago

Download www.www2005.org

The semantic web is expected to have an impact at least as big as that of the existing HTML based web, if not greater. However, the challenge lays in creating this semantic web and in converting existing web information into the semantic paradigm. One of the core technologies that can help in migration process is automatic markup, the semantic markup of content, providing the semantic tags to describe the raw content. This paper describes a hybrid statistical and knowledge-based information extraction model, able to extract entities and relations at the sentence level. The model attempts to retain and improve the high accuracy levels of knowledge-based systems while drastically reducing the amount of manual labor by relying on statistics drawn from a training corpus. The implementation of the model, called TEG (Trainable Extraction Grammar), can be adapted to any IE domain by writing a suitable set of rules in a SCFG (Stochastic Context Free Grammar) based extraction language, and tra...

Ronen Feldman, Binyamin Rosenfeld, Moshe Fresko, B

Real-time Traffic

Internet Technology | Keywords Semantic Web | Semantic Web | Semantic Web Pages | WWW 2005 |

claim paper

Post Info
More Details (n/a)

Added	22 Nov 2009
Updated	22 Nov 2009
Type	Conference
Year	2005
Where	WWW
Authors	Ronen Feldman, Binyamin Rosenfeld, Moshe Fresko, Brian D. Davison

Comments (0)

Sciweavers

Hybrid semantic tagging for information extraction

Internet Technology | Keywords Semantic Web | Semantic Web | Semantic Web Pages | WWW 2005 |

Explore & Download

Productivity Tools

Sciweavers