Search Sciweavers | Sciweavers

160 search results - page 4 / 32

» Exploiting structural information for semi-structured docume...

AIIA
2005
Springer

140 views, Artificial Intelligence

A Semantic Kernel to Exploit Linguistic Knowledge

13 years 11 months ago

Download dit.unitn.it

Abstract. Improving accuracy in Information Retrieval tasks via semantic information is a complex problem characterized by three main aspects: the document representation model, th...

Roberto Basili, Marco Cammisa, Alessandro Moschitt...

ICDAR
2009
IEEE

175 views, Document Analysis

Using top n Recognition Candidates to Categorize On-line Handwritten Documents

13 years 3 months ago

Download www.cvc.uab.es

The traditional weighting schemes used in text categorization for the vector space model (VSM) cannot exploit information intrinsic to texts obtained through on-line handwriting r...

Sebastián Peña Saldarriaga, Emmanuel...

KDD
2008
ACM

120 views, Data Mining

Entity categorization over large document collections

14 years 6 months ago

Download www.ics.uci.edu

Extracting entities (such as people, movies) from documents and identifying the categories (such as painter, writer) they belong to enable structured querying and data analysis ov...

Arnd Christian König, Rares Vernica, Venkates...

CIKM
2008
ACM

118 views, Information Technology

Semi-supervised text categorization by active search

13 years 7 months ago

Download www.cse.cuhk.edu.hk

In automated text categorization, given a small number of labeled documents, it is very challenging, if not impossible, to build a reliable classifier that is able to achieve high...

Zenglin Xu, Rong Jin, Kaizhu Huang, Michael R. Lyu...

WEBI
2005
Springer

216 views, Internet Technology

A Semi-Supervised Document Clustering Algorithm Based on EM

13 years 11 months ago

Download www.dii.unisi.it

Document clustering is a very hard task in Automatic Text Processing since it requires extracting regular patterns from a document collection without a priori knowledge on the cat...

Leonardo Rigutini, Marco Maggini
