Search Sciweavers | Sciweavers

101

KDD
2009
ACM

211views Data Mining» more KDD 2009»

Address standardization with latent semantic association

15 years 10 months ago

Address standardization is a very challenging task in data cleansing. To provide better customer relationship management and business intelligence for customer-oriented cooperates...

Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang,...

claim paper

Read More »

95

click to vote

KDD
2008
ACM

183views Data Mining» more KDD 2008»

Structured entity identification and document categorization: two tasks with one joint model

15 years 10 months ago

Download www.godbole.net

Traditionally, research in identifying structured entities in documents has proceeded independently of document categorization research. In this paper, we observe that these two t...

Indrajit Bhattacharya, Shantanu Godbole, Sachindra...

claim paper

Read More »

77

click to vote

KDD
2007
ACM

167views Data Mining» more KDD 2007»

Generalized component analysis for text with heterogeneous attributes

15 years 10 months ago

Download www.cs.umass.edu

We present a class of richly structured, undirected hidden variable models suitable for simultaneously modeling text along with other attributes encoded in different modalities. O...

Xuerui Wang, Chris Pal, Andrew McCallum

claim paper

Read More »

83

click to vote

KDD
2006
ACM

128views Data Mining» more KDD 2006»

On privacy preservation against adversarial data mining

15 years 10 months ago

Download charuaggarwal.net

Privacy preserving data processing has become an important topic recently because of advances in hardware technology which have lead to widespread proliferation of demographic and...

Charu C. Aggarwal, Jian Pei, Bo Zhang 0002

claim paper

Read More »

83

click to vote

KDD
2005
ACM

125views Data Mining» more KDD 2005»

Email data cleaning

15 years 10 months ago

Download research.microsoft.com

Addressed in this paper is the issue of `email data cleaning' for text mining. Many text mining applications need take emails as input. Email data is usually noisy and thus i...

Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers