BTW
2007
Springer

YAWN: A Semantically Annotated Wikipedia XML Corpus

Download www.btw2007.de

Abstract: The paper presents YAWN, a system that converts the well-known and widely used Wikipedia collection into an XML corpus with semantically rich, self-explaining tags. We introduce algorithms to annotate pages and links with concepts from the WordNet thesaurus. The annotation process exploits Wikipedia's category information, which is a high-quality, manually assigned source of information; it also extracts additional information from lists and uses the invocations of templates with named parameters. We give examples of how such annotations can be exploited for high-precision queries.
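As a rough illustration of the category-based annotation idea in the abstract, the following Python sketch maps a page's Wikipedia categories to concept names and wraps the page in XML elements named after those concepts. It is only a toy under stated assumptions, not the algorithm from the paper: the `TOY_CONCEPTS` dictionary is a hypothetical stand-in for a WordNet lookup, the head-noun heuristic is a simplification, and the element layout (`article`, `title`, `body`, nested concept tags) is invented for the example.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical stand-in for a WordNet lookup: plural head noun -> concept name.
# A real system would consult the WordNet thesaurus instead of this table.
TOY_CONCEPTS = {
    "physicists": "physicist",
    "scientists": "scientist",
    "cities": "city",
}


def concepts_for_categories(categories):
    """Map category names to concepts using a crude head-noun heuristic."""
    found = []
    for category in categories:
        # Assumption: the head noun is the last word before a preposition,
        # e.g. "laureates" in "Nobel laureates in Physics".
        head_phrase = re.split(r"\s+(?:of|in|from|by)\s+", category.lower())[0]
        words = head_phrase.split()
        if not words:
            continue
        concept = TOY_CONCEPTS.get(words[-1])
        if concept and concept not in found:
            found.append(concept)
    return found


def annotate_page(title, text, categories):
    """Wrap the page content in elements named after its concepts."""
    root = ET.Element("article")
    parent = root
    for concept in concepts_for_categories(categories):
        parent = ET.SubElement(parent, concept)
    ET.SubElement(parent, "title").text = title
    ET.SubElement(parent, "body").text = text
    return root


page = annotate_page(
    "Albert Einstein",
    "Albert Einstein was a physicist.",
    ["German physicists", "Nobel laureates in Physics"],
)
xml_string = ET.tostring(page, encoding="unicode")
print(xml_string)
```

With tags like these, a structured query could ask for `physicist` elements directly instead of matching the word "physicist" in free text, which is the kind of high-precision querying the abstract refers to.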

Ralf Schenkel, Fabian M. Suchanek, Gjergji Kasneci

Related Content

» Anaphoric Annotation of Wikipedia and Blogs in the Live Memories Corpus

» Semantically Annotated Snapshot of the English Wikipedia

» WikiWoods: Syntacto-Semantic Annotation for English Wikipedia

» Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus

» Annotating Wikipedia articles with semantic tags for structured retrieval

» Coarse Lexical Semantic Annotation with Supersenses: An Arabic Case Study

» Learning to Tag and Tagging to Learn: A Case Study on Wikipedia

» A Corpus Representation Format for Linguistic Web Services The DSPIN Text Corpus Format an...

» ANAWIKI: Creating Anaphorically Annotated Resources through Web Cooperation

Post Info

Added	07 Jun 2010
Updated	07 Jun 2010
Type	Conference
Year	2007
Where	BTW
Authors	Ralf Schenkel, Fabian M. Suchanek, Gjergji Kasneci