Search Sciweavers | Sciweavers

126

ICDM
2007
IEEE

116views Data Mining» more ICDM 2007»

A Computational Approach to Style in American Poetry

15 years 10 months ago

We develop a quantitative method to assess the style of American poems and to visualize a collection of poems in relation to one another. Qualitative poetry criticism helped guide...

David M. Kaplan, David M. Blei

claim paper

Read More »

139

Voted

PAKDD
2000
ACM

128views Data Mining» more PAKDD 2000»

A Comparative Study of Classification Based Personal E-mail Filtering

15 years 7 months ago

Download www.cs.umass.edu

This paper addresses personal E-mail filtering by casting it in the framework of text classification. Modeled as semi-structured documents, Email messages consist of a set of field...

Yanlei Diao, Hongjun Lu, Dekai Wu

claim paper

Read More »

135

click to vote

KDD
2008
ACM

120views Data Mining» more KDD 2008»

Entity categorization over large document collections

16 years 4 months ago

Download www.ics.uci.edu

Extracting entities (such as people, movies) from documents and identifying the categories (such as painter, writer) they belong to enable structured querying and data analysis ov...

Arnd Christian König, Rares Vernica, Venkates...

claim paper

Read More »

150

Voted

ESCIENCE
2006
IEEE

184views Distributed And Parallel Com...» more ESCIENCE 2006»

ODIN: A Model for Adapting and Enriching Legacy Infrastructure

15 years 7 months ago

Download faculty.washington.edu

The Online Database of Interlinear Text (ODIN)1 is a database of interlinear text "snippets", harvested mostly from scholarly documents posted to the Web. Although large...

William D. Lewis

claim paper

Read More »

137

Voted

KDD
2008
ACM

183views Data Mining» more KDD 2008»

De-duping URLs via rewrite rules

16 years 4 months ago

Download research.yahoo.com

A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...

Anirban Dasgupta, Ravi Kumar, Amit Sasturkar

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers