Search Sciweavers | Sciweavers

115 search results - page 12 / 23

» Training Data Cleaning for Text Classification

177

click to vote

IPM
2002

106views more IPM 2002»

A feature mining based approach for the classification of text documents into disjoint classes

15 years 5 months ago

Download www.csc.lsu.edu

This paper proposes a new approach for classifying text documents into two disjoint classes. The new approach is based on extracting patterns, in the form of two logical expressio...

Salvador Nieto Sánchez, Evangelos Triantaph...

claim paper

Read More »

159

Voted

WWW
2008
ACM

173views Internet Technology» more WWW 2008»

Learning to classify short and sparse text & web with hidden topics from large-scale data collections

16 years 6 months ago

Download www2008.org

This paper presents a general framework for building classifiers that deal with short and sparse text & Web segments by making the most of hidden topics discovered from larges...

Xuan Hieu Phan, Minh Le Nguyen, Susumu Horiguchi

claim paper

Read More »

152

click to vote

CLEF
2010
Springer

136views Information Technology» more CLEF 2010»

ZOT! to Wikipedia Vandalism - Lab Report for PAN at CLEF 2010

15 years 6 months ago

Download clef2010.org

Abstract This vandalism detector uses features primarily derived from a wordpreserving differencing of the text for each Wikipedia article from before and after the edit, along wit...

James White, Rebecca Maessen

claim paper

Read More »

147

click to vote

LREC
2008

141views Education» more LREC 2008»

New Resources for Document Classification, Analysis and Translation Technologies

15 years 7 months ago

Download www.lrec-conf.org

The goal of the DARPA MADCAT (Multilingual Automatic Document Classification Analysis and Translation) Program is to automatically convert foreign language text images into Englis...

Stephanie Strassel, Lauren Friedman, Safa Ismael, ...

claim paper

Read More »

126

click to vote

COMPSAC
2004
IEEE

104views Software Engineering» more COMPSAC 2004»

N-Gram-Based Detection of New Malicious Code

15 years 9 months ago

Download tony.abou-assaleh.net

The current commercial anti-virus software detects a virus only after the virus has appeared and caused damage. Motivated by the standard signature-based technique for detecting v...

Tony Abou-Assaleh, Nick Cercone, Vlado Keselj, Ray...

claim paper

Read More »

« Prev « First page 12 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers