Sciweavers

21 search results - page 4 / 5
» Semi-supervised Information Extraction from Variable-length ...
Sort
View
CHI
2006
ACM
14 years 6 months ago
Marmite: end-user programming for the web
A tremendous amount of semi-structured data is available today on the web but is not necessarily in a form which is suitable for a user's tasks. For example, a website may sh...
Jason I. Hong, Jeffrey Wong
ICDM
2008
IEEE
186views Data Mining» more  ICDM 2008»
14 years 20 days ago
xCrawl: A High-Recall Crawling Method for Web Mining
Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...
Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...
MM
2010
ACM
207views Multimedia» more  MM 2010»
13 years 6 months ago
TalkMiner: a search engine for online lecture video
TalkMiner is a search engine for lecture webcasts. Lecture videos are processed to recover a set of distinct slide images and OCR is used to generate a list of indexable terms fro...
John Adcock, Matthew Cooper, Laurent Denoue, Hamed...
SIGIR
2009
ACM
14 years 22 days ago
Web derived pronunciations for spoken term detection
Indexing and retrieval of speech content in various forms such as broadcast news, customer care data and on-line media has gained a lot of interest for a wide range of application...
Dogan Can, Erica Cooper, Arnab Ghoshal, Martin Jan...
JCDL
2004
ACM
128views Education» more  JCDL 2004»
13 years 11 months ago
Panorama: extending digital libraries with topical crawlers
A large amount of research, technical and professional documents are available today in digital formats. Digital libraries are created to facilitate search and retrieval of inform...
Gautam Pant, Kostas Tsioutsiouliklis, Judy Johnson...