Sciweavers

311 search results - page 61 / 63
» Cleaning Web Pages for Effective Web Content Mining
Sort
View
99
Voted
IJCNN
2008
IEEE
15 years 3 months ago
A neural network approach to ordinal regression
— Ordinal regression is an important type of learning, which has properties of both classification and regression. Here we describe an effective approach to adapt a traditional ...
Jianlin Cheng, Zheng Wang, Gianluca Pollastri
HCI
2009
14 years 7 months ago
COBRA - A Visualization Solution to Monitor and Analyze Consumer Generated Medias
Consumer Generated Medias (CGMs) -- such as blogs, news forums, message boards, and web pages -- are emerging as locations where consumers trade, discuss and influence each other&#...
Amit Behal, Julia Grace, Linda Kato, Ying Chen, Sh...
DGO
2006
134views Education» more  DGO 2006»
14 years 10 months ago
Next steps in near-duplicate detection for eRulemaking
Large volume public comment campaigns and web portals that encourage the public to customize form letters produce many near-duplicate documents, which increases processing and sto...
Hui Yang, Jamie Callan, Stuart W. Shulman
GFKL
2007
Springer
152views Data Mining» more  GFKL 2007»
15 years 3 months ago
Supporting Web-based Address Extraction with Unsupervised Tagging
Abstract. The manual acquisition and modeling of tourist information as e.g. addresses of points of interest is time and, therefore, cost intensive. Furthermore, the encoded inform...
Berenike Loos, Chris Biemann
KDD
2008
ACM
147views Data Mining» more  KDD 2008»
15 years 9 months ago
Extracting shared subspace for multi-label classification
Multi-label problems arise in various domains such as multitopic document categorization and protein function prediction. One natural way to deal with such problems is to construc...
Shuiwang Ji, Lei Tang, Shipeng Yu, Jieping Ye