Sciweavers

77 search results - page 1 / 16
» Building a dynamic classifier for large text data collection...
Sort
View
ADC
2010
Springer
214views Database» more  ADC 2010»
13 years 13 hour ago
Building a dynamic classifier for large text data collections
Due to the lack of in-built tools to navigate the web, people have to use external solutions to find information. The most popular of these are search engines and web directories....
Pavel Kalinov, Bela Stantic, Abdul Sattar
WWW
2008
ACM
14 years 5 months ago
Learning to classify short and sparse text & web with hidden topics from large-scale data collections
This paper presents a general framework for building classifiers that deal with short and sparse text & Web segments by making the most of hidden topics discovered from larges...
Xuan Hieu Phan, Minh Le Nguyen, Susumu Horiguchi
ELPUB
2006
ACM
13 years 11 months ago
Automated Building of OAI Compliant Repository from Legacy Collection
In this paper, we report on our experience with the creation of an automated, human-assisted process to extract metadata from documents in a large (>100,000), dynamically growi...
Jianfeng Tang, Kurt Maly, Steven J. Zeil, Mohammad...
VLDB
2002
ACM
184views Database» more  VLDB 2002»
14 years 5 months ago
Database indexing for large DNA and protein sequence collections
Our aim is to develop new database technologies for the approximate matching of unstructured string data using indexes. We explore the potential of the suffix tree data structure i...
Ela Hunt, Malcolm P. Atkinson, Robert W. Irving
ICDM
2009
IEEE
151views Data Mining» more  ICDM 2009»
13 years 2 months ago
TagLearner: A P2P Classifier Learning System from Collaboratively Tagged Text Documents
The amount of text data on the Internet is growing at a very fast rate. Online text repositories for news agencies, digital libraries and other organizations currently store gigaan...
Haimonti Dutta, Xianshu Zhu, Tushar Mahule, Hillol...