character n-grams | Sciweavers

19

LREC
2010

159views Education» more LREC 2010»

The Web Library of Babel: evaluating genre collections

13 years 3 months ago

We present experiments in automatic genre classiﬁcation on web corpora, comparing a wide variety of features on several different genreannotated datasets (HGC, I-EN, KI-04, KRYS...

Serge Sharoff, Zhili Wu, Katja Markert

claim paper

Read More »

12

click to vote

ICWSM
2008

113views Internet Technology» more ICWSM 2008»

A Shallow Approach to Subjectivity Classification

13 years 6 months ago

Download www.aaai.org

We present a shallow linguistic approach to subjectivity classification. Using multinomial kernel machines, we demonstrate that a data representation based on counting character n...

Stephan Raaijmakers, Wessel Kraaij

claim paper

Read More »

7

click to vote

CLEF
2008
Springer

98views Information Technology» more CLEF 2008»

JHU Ad Hoc Experiments at CLEF 2008

13 years 6 months ago

Download clef.isti.cnr.it

For CLEF 2008 JHU conducted monolingual and bilingual experiments in the ad hoc TEL and Persian tasks. The TEL task involved focused on searching electronic card catalog records i...

Paul McNamee

claim paper

Read More »

11

click to vote

CLEF
2006
Springer

110views Information Technology» more CLEF 2006»

A First Approach to CLIR Using Character N -Grams Alignment

13 years 8 months ago

Download www.grupocole.org

Abstract. This paper describes the technique for translation of character n-grams we developed for our participation in CLEF 2006. This solution avoids the need for word normalizat...

Jesús Vilares, Michael P. Oakes, John Tait

claim paper

Read More »

9

click to vote

AIMSA
2006
Springer

122views Artificial Intelligence» more AIMSA 2006»

N-Gram Feature Selection for Authorship Identification

13 years 8 months ago

Download www.icsd.aegean.gr

Automatic authorship identification offers a valuable tool for supporting crime investigation and security. It can be seen as a multi-class, single-label text categorization task. ...

John Houvardas, Efstathios Stamatatos

claim paper

Read More »

8

click to vote

NLDB
2007
Springer

113views Natural Language Processing» more NLDB 2007»

Character N-Grams Translation in Cross-Language Information Retrieval

13 years 10 months ago

Download www.grupocole.org

Abstract. This paper describes a new technique for the direct translation of character n-grams for use in Cross-Language Information Retrieval systems. This solution avoids the nee...

Jesús Vilares, Michael P. Oakes, Manuel Vil...

claim paper

Read More »

8

click to vote

SIGIR
2009
ACM

134views Information Technology» more SIGIR 2009»

Addressing morphological variation in alphabetic languages

13 years 11 months ago

Download web.jhu.edu

The selection of indexing terms for representing documents is a key decision that limits how eﬀective subsequent retrieval can be. Often stemming algorithms are used to normaliz...

Paul McNamee, Charles K. Nicholas, James Mayfield

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers