Search Sciweavers | Sciweavers

523 search results - page 50 / 105

» Metric Learning for Text Documents

141

click to vote

KDD
2006
ACM

118views Data Mining» more KDD 2006»

Reducing the human overhead in text categorization

16 years 5 months ago

Download research.microsoft.com

Many applications in text processing require significant human effort for either labeling large document collections (when learning statistical models) or extrapolating rules from...

Arnd Christian König, Eric Brill

claim paper

Read More »

146

click to vote

SYNASC
2007
IEEE

136views Algorithms» more SYNASC 2007»

Wikipedia-Based Kernels for Text Categorization

15 years 11 months ago

Download datamin.ubbcluj.ro

In recent years several models have been proposed for text categorization. Within this, one of the widely applied models is the vector space model (VSM), where independence betwee...

Zsolt Minier, Zalan Bodo, Lehel Csató

claim paper

Read More »

156

click to vote

DL
2000
Springer

162views Digital Library» more DL 2000»

Snowball: extracting relations from large plain-text collections

15 years 9 months ago

Download www.cs.columbia.edu

Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...

Eugene Agichtein, Luis Gravano

claim paper

Read More »

185

click to vote

GECCO
2007
Springer

206views Optimization» more GECCO 2007»

Using code metric histograms and genetic algorithms to perform author identification for software forensics

15 years 9 months ago

Download www.cs.bham.ac.uk

We have developed a technique to characterize software developers' styles using a set of source code metrics. This style fingerprint can be used to identify the likely author...

Robert Charles Lange, Spiros Mancoridis

claim paper

Read More »

158

click to vote

DASFAA
2004
IEEE

135views Database» more DASFAA 2004»

Semi-supervised Text Classification Using Partitioned EM

15 years 9 months ago

Download www.cs.uic.edu

Text classification using a small labeled set and a large unlabeled data is seen as a promising technique to reduce the labor-intensive and time consuming effort of labeling traini...

Gao Cong, Wee Sun Lee, Haoran Wu, Bing Liu

claim paper

Read More »

« Prev « First page 50 / 105 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers