Sciweavers

» OCELOT: a system for summarizing Web pages
KCAP
2005
ACM
13 years 11 months ago
AutoFeed: an unsupervised learning system for generating webfeeds
The AutoFeed system automatically extracts data from semistructured web sites. Previously, researchers have developed two types of supervised learning approaches for extracting we...
Bora Gazen, Steven Minton
DSS
2006
13 years 5 months ago
CMedPort: An integrated approach to facilitating Chinese medical information seeking
As the number of non-English resources available on the Web is increasing rapidly, developing information retrieval techniques for non-English languages is becoming an urgent and ...
Yilu Zhou, Jialun Qin, Hsinchun Chen
WWW
2008
ACM
14 years 6 months ago
Query-sets: using implicit feedback and query patterns to organize web documents
In this paper we present a new document representation model based on implicit user feedback obtained from search engine queries. The main objective of this model is to achieve be...
Barbara Poblete, Ricardo A. Baeza-Yates
WWW
2007
ACM
14 years 6 months ago
U-REST: an unsupervised record extraction system
In this paper, we describe a system that can extract record structures from web pages with no direct human supervision. Records are commonly occurring HTML-embedded data tuples th...
Yuan Kui Shen, David R. Karger
SIGIR
2009
ACM
14 years 3 days ago
Web derived pronunciations for spoken term detection
Indexing and retrieval of speech content in various forms such as broadcast news, customer care data and on-line media has gained a lot of interest for a wide range of application...
Dogan Can, Erica Cooper, Arnab Ghoshal, Martin Jan...