Sciweavers

622 search results - page 112 / 125
» Extractive spoken document summarization for information ret...
Sort
View
WWW
2005
ACM
16 years 10 days ago
Thresher: automating the unwrapping of semantic content from the World Wide Web
We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...
Andrew Hogue, David R. Karger
KDD
2005
ACM
194views Data Mining» more  KDD 2005»
16 years 2 days ago
Web object indexing using domain knowledge
Web object is defined to represent any meaningful object embedded in web pages (e.g. images, music) or pointed to by hyperlinks (e.g. downloadable files). Users usually search for...
Muyuan Wang, Zhiwei Li, Lie Lu, Wei-Ying Ma, Naiya...
CIKM
2008
Springer
15 years 1 months ago
A random walk on the red carpet: rating movies with user reviews and pagerank
Although PageRank has been designed to estimate the popularity of Web pages, it is a general algorithm that can be applied to the analysis of other graphs other than one of hypert...
Derry Tanti Wijaya, Stéphane Bressan
106
Voted
TKDE
2010
224views more  TKDE 2010»
14 years 10 months ago
Probabilistic Topic Models for Learning Terminological Ontologies
—Probabilistic topic models were originally developed and utilised for document modeling and topic extraction in Information Retrieval. In this paper we describe a new approach f...
Wang Wei, Payam M. Barnaghi, Andrzej Bargiela
92
Voted
CICLING
2006
Springer
15 years 3 months ago
Creating a Testbed for the Evaluation of Automatically Generated Back-of-the-Book Indexes
The automatic generation of back-of-the book indexes seems to be out of sight of the Information Retrieval and Natural Language Processing communities, although the increasingly la...
Andras Csomai, Rada Mihalcea