Sciweavers

536 search results - page 32 / 108
» Feature Engineering for Text Classification
AIRWEB
2008
Springer
14 years 11 months ago
Cleaning search results using term distance features
The presence of Web spam in query results is one of the critical challenges facing search engines today. While search engines try to combat the impact of spam pages on their resul...
Josh Attenberg, Torsten Suel
PAKDD
2000
ACM
15 years 1 month ago
A Comparative Study of Classification Based Personal E-mail Filtering
This paper addresses personal E-mail filtering by casting it in the framework of text classification. Modeled as semi-structured documents, Email messages consist of a set of field...
Yanlei Diao, Hongjun Lu, Dekai Wu
MSR
2010
ACM
14 years 11 months ago
Identifying security bug reports via text mining: An industrial case study
A bug-tracking system such as Bugzilla contains bug reports (BRs) collected from various sources such as development teams, testing teams, and end users. When bug reporters subm...
Michael Gegick, Pete Rotella, Tao Xie
ERCIMDL
2005
Springer
15 years 3 months ago
Importance of HTML Structural Elements and Metadata in Automated Subject Classification
The aim of the study was to determine how significance indicators assigned to different Web page elements (internal metadata, title, headings, and main text) influence automated cl...
Koraljka Golub, Anders Ardö
KDD
1995
ACM
15 years 1 month ago
Feature Extraction for Massive Data Mining
Techniques for learning from data typically require data to be in standard form. Measurements must be encoded in a numerical format such as binary true-or-false features, numerica...
V. Seshadri, Raguram Sasisekharan, Sholom M. Weiss