Sciweavers

2677 search results - page 308 / 536
» Extracting Structured Data from Web Pages
Sort
View
LREC
2010
217views Education» more  LREC 2010»
15 years 3 months ago
Building a Web Corpus of Czech
Large corpora are essential to modern methods of computational linguistics and natural language processing. In this paper, we describe an ongoing project whose aim is to build a l...
Drahomíra "johanka" Spoustová, Miros...
ECIR
2008
Springer
15 years 3 months ago
Towards an Automatically Generated Music Information System Via Web Content Mining
Abstract. This paper presents first steps towards building a music information system like last.fm, but with the major difference that the data is automatically retrieved from the ...
Markus Schedl, Peter Knees, Tim Pohle, Gerhard Wid...
ICCS
2005
Springer
15 years 7 months ago
Games of Inquiry for Collaborative Concept Structuring
Google’s project to digitize five of the world's greatest libraries will dramatically extend their search engine reach in the future. Current search-engine philosophy, which...
Mary A. Keeler, Heather D. Pfeiffer
RIAO
2000
15 years 2 months ago
SgmlQL + XGQL = Powerful XML Pattern-Matching and Data-Manipulation in a Single Language
The presence of XML in many recent hypermedia management tools and methods (W3I3, SMIL, etc.) shows better than ever that both structural and textual criteria will continue to pla...
Jacques Le Maitre, Yves Marcoux, Elisabeth Murisas...
SIGMOD
2006
ACM
202views Database» more  SIGMOD 2006»
16 years 1 months ago
Avatar semantic search: a database approach to information retrieval
We present Avatar Semantic Search, a prototype search engine that exploits annotations in the context of classical keyword search. The process of annotations is accomplished offli...
Eser Kandogan, Rajasekar Krishnamurthy, Sriram Rag...