Sciweavers

108 search results - page 3 / 22
» Web data cleansing for information retrieval using key resou...
Sort
View
IPPS
2008
IEEE
13 years 11 months ago
Multi-threaded data mining of EDGAR CIKs (Central Index Keys) from ticker symbols
This paper describes how use the Java Swing HTMLEditorKit to perform multi-threaded web data mining on the EDGAR system (Electronic DataGathering, Analysis, and Retrieval system)....
Dougal A. Lyon
CVPR
2008
IEEE
13 years 11 months ago
Object image retrieval by exploiting online knowledge resources
We describe a method to retrieve images found on web pages with specified object class labels, using an analysis of text around the image and of image appearance. Our method dete...
Gang Wang, David A. Forsyth
AAAI
1998
13 years 6 months ago
Optimizing Information Agents by Selectively Materializing Data
We present an approach for optimizing the performance of information agents by materializing useful information . A critical problem with information agents, particularly those ga...
Naveen Ashish
CIKM
2009
Springer
13 years 8 months ago
Classification-based resource selection
In some retrieval situations, a system must search across multiple collections. This task, referred to as federated search, occurs for example when searching a distributed index o...
Jaime Arguello, Jamie Callan, Fernando Diaz
WWW
2008
ACM
14 years 5 months ago
iRobot: an intelligent crawler for web forums
We study in this paper the Web forum crawling problem, which is a very fundamental step in many Web applications, such as search engine and Web data mining. As a typical user-crea...
Rui Cai, Jiang-Ming Yang, Wei Lai, Yida Wang, Lei ...