Sciweavers

6388 search results - page 219 / 1278
» High Performance Data Mining
Sort
View
GFKL
2006
Springer
108views Data Mining» more  GFKL 2006»
15 years 8 months ago
Identifying and Exploiting Ultrametricity
We begin with pervasive ultrametricity due to high dimensionality and/or spatial sparsity. How extent or degree of ultrametricity can be quantified leads us to the discussion of va...
Fionn Murtagh
KDD
2005
ACM
99views Data Mining» more  KDD 2005»
16 years 5 months ago
Determining an author's native language by mining a text for errors
In this paper, we show that stylistic text features can be exploited to determine an anonymous author's native language with high accuracy. Specifically, we first use automat...
Moshe Koppel, Jonathan Schler, Kfir Zigdon
HICSS
2007
IEEE
176views Biometrics» more  HICSS 2007»
15 years 11 months ago
Mining Fuzzy Weighted Association Rules
The paper combines and extends the technologies of fuzzy sets and association rules, considering users’ differential emphasis on each attribute through fuzzy regions. A fuzzy da...
David L. Olson, Yanhong Li
ICMLA
2008
15 years 6 months ago
Highly Scalable SVM Modeling with Random Granulation for Spam Sender Detection
Spam sender detection based on email subject data is a complex large-scale text mining task. The dataset consists of email subject lines and the corresponding IP address of the em...
Yuchun Tang, Yuanchen He, Sven Krasser
KDD
2007
ACM
153views Data Mining» more  KDD 2007»
16 years 5 months ago
Exploiting duality in summarization with deterministic guarantees
Summarization is an important task in data mining. A major challenge over the past years has been the efficient construction of fixed-space synopses that provide a deterministic q...
Panagiotis Karras, Dimitris Sacharidis, Nikos Mamo...