Sciweavers

17 search results - page 2 / 4
» A Term-Based Methodology for Template Creation in Informatio...
ESWS
2007
Springer
13 years 11 months ago
What Have Innsbruck and Leipzig in Common? Extracting Semantics from Wiki Content
Wikis are established means for the collaborative authoring, versioning and publishing of textual articles. The Wikipedia project, for example, succeeded in creating the by far lar...
Sören Auer, Jens Lehmann
ELPUB
2006
ACM
13 years 10 months ago
Automated Building of OAI Compliant Repository from Legacy Collection
In this paper, we report on our experience with the creation of an automated, human-assisted process to extract metadata from documents in a large (>100,000), dynamically growi...
Jianfeng Tang, Kurt Maly, Steven J. Zeil, Mohammad...
WSDM
2010
ACM
14 years 2 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content, Web pages consist of navigational elements, templates, and advertisements. This boilerplate text is typically not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
LREC
2010
13 years 6 months ago
Cultural Heritage: Knowledge Extraction from Web Documents
This article presents the use of NLP techniques (text mining, text analysis) to develop specific tools that make it possible to create linguistic resources related to the cultural heritage d...
Eva Sassolini, Alessandra Cinini
ICDIM
2006
IEEE
13 years 11 months ago
Creating an Historical Archive Ontology: Guidelines and Evaluation
Ontologies have been proven invaluable tools both for the semantic web and for personal information management. In the context of a historical archive an ontology may provide mean...
Elena Torou, Akrivi Katifori, Costas Vassilakis, G...