Sciweavers

30 search results - page 4 / 6
» Dictionary-based text categorization of chemical web pages
Sort
View
CIKM
2010
Springer
13 years 4 months ago
Using Wikipedia categories for compact representations of chemical documents
Today, Web pages are usually accessed using text search engines, whereas documents stored in the deep Web are accessed through domain-specific Web portals. These portals rely on e...
Benjamin Köhncke, Wolf-Tilo Balke
ADCS
2004
13 years 7 months ago
Co-Training on Textual Documents with a Single Natural Feature Set
Co-training is a semi-supervised technique that allows classifiers to learn with fewer labelled documents by taking advantage of the more abundant unclassified documents. However, ...
Jason Chan, Irena Koprinska, Josiah Poon
WEBDB
2001
Springer
137views Database» more  WEBDB 2001»
13 years 10 months ago
Using Database Technology to Improve Performance of Web Proxy Servers
In this paper, we propose to use database technology to improve performance of web proxy servers. We view the cache at a proxy server as a web warehouse with data organized in a h...
Kai Cheng, Yahiko Kambayashi, Mukesh K. Mohania
IICS
2004
Springer
13 years 11 months ago
Towards Logical Hypertext Structure
Facing the retrieval problem according to the overwhelming set of documents online the adaptation of text categorization to web units has recently been pushed. The aim is to utiliz...
Alexander Mehler, Matthias Dehmer, Rüdiger Gl...
CIKM
2005
Springer
13 years 11 months ago
Fast webpage classification using URL features
We demonstrate the usefulness of the uniform resource locator (URL) alone in performing web page classification. This approach is magnitudes faster than typical web page classific...
Min-Yen Kan, Hoang Oanh Nguyen Thi