Data Mining | Sciweavers

KDD
2009
ACM

Collective annotation of Wikipedia entities in web text

14 years 5 months ago

To take the first step beyond keyword-based search toward entity-based search, suitable token spans ("spots") on documents must be identified as references to real-world...

Sayali Kulkarni, Amit Singh, Ganesh Ramakrishnan, ...

KDD
2009
ACM

TANGENT: a novel, 'Surprise me', recommendation algorithm

14 years 5 months ago

Download www.cs.cmu.edu

Most recommender systems try to find items that are most relevant to the older choices of a given user. Here we focus on the "surprise me" query: A user may be bored ...

Kensuke Onuma, Hanghang Tong, Christos Faloutsos

KDD
2009
ACM

Meme-tracking and the dynamics of the news cycle

14 years 5 months ago

Download www.cs.cornell.edu

Tracking new topics, ideas, and "memes" across the Web has been an issue of considerable interest. Recent work has developed methods for tracking topic shifts over long ...

Jure Leskovec, Lars Backstrom, Jon M. Kleinberg

KDD
2009
ACM

Effective multi-label active learning for text classification

14 years 5 months ago

Download research.microsoft.com

Labeling text data is time-consuming but essential for automatic text classification. In particular, manually creating multiple labels for each document may become impractical ...

Bishan Yang, Jian-Tao Sun, Tengjiao Wang, Zheng Ch...

KDD
2009
ACM

Mining social networks for personalized email prioritization

14 years 5 months ago

Download nyc.lti.cs.cmu.edu

Email is one of the most prevalent communication tools today, and solving the email overload problem is urgent. A good way to alleviate email overload is to automatical...

Shinjae Yoo, Yiming Yang, Frank Lin, Il-Chul Moon

KDD
2009
ACM

Structured correspondence topic models for mining captioned figures in biological literature

14 years 5 months ago

Download www.cs.cmu.edu

A major source of information (often the most crucial and informative part) in scholarly articles from scientific journals, proceedings and books is the figures that directly pro...

Amr Ahmed, Eric P. Xing, William W. Cohen, Robert ...

KDD
2009
ACM

Name-ethnicity classification from open sources

14 years 5 months ago

Download www.cs.sunysb.edu

The problem of ethnicity identification from names has a variety of important applications, including biomedical research, demographic studies, and marketing. Here we report on th...

Anurag Ambekar, Charles B. Ward, Jahangir Mohammed...

KDD
2009
ACM

Mining broad latent query aspects from search sessions

14 years 5 months ago

Download www.cs.cmu.edu

Search queries are typically very short, which means they are often underspecified or have senses that the user did not think of. A broad latent query aspect is a set of keywords ...

Xuanhui Wang, Deepayan Chakrabarti, Kunal Punera

KDD
2009
ACM

Scalable pseudo-likelihood estimation in hybrid random fields

14 years 5 months ago

Download www.dii.unisi.it

Learning probabilistic graphical models from high-dimensional datasets is a computationally challenging task. In many interesting applications, the domain dimensionality is such a...

Antonino Freno, Edmondo Trentin, Marco Gori

KDD
2009
ACM

Combining link and content for community detection: a discriminative approach

14 years 5 months ago

Download www.nec-labs.com

In this paper, we consider the problem of combining link and content analysis for community detection from networked data, such as paper citation networks and the World Wide Web. Most ...

Tianbao Yang, Rong Jin, Yun Chi, Shenghuo Zhu
