Search Sciweavers | Sciweavers

298 search results - page 27 / 60

» An information-theoretic measure for document similarity

click to vote

DKE
2008

96views more DKE 2008»

Fragment-based approximate retrieval in highly heterogeneous XML collections

15 years 2 months ago

Download krono.act.uji.es

Due to the heterogeneous nature of XML data for internet applications exact matching of queries is often inadequate. The need arises to quickly identify subtrees of XML documents ...

Ismael Sanz, Marco Mesiti, Giovanna Guerrini, Rafa...

claim paper

Read More »

125

Voted

RIAO
2004

157views Information Technology» more RIAO 2004»

Multilingual document clusters discovery

15 years 3 months ago

Download www-list.cea.fr

Cross Language Information Retrieval community has brought up search engines over multilingual corpora, and multilingual text categorization systems. In this paper, we focus on th...

Benoît Mathieu, Romaric Besançon, Chr...

claim paper

Read More »

145

click to vote

CIVR
2005
Springer

205views Image Analysis» more CIVR 2005»

Automatic Image Semantic Annotation Based on Image-Keyword Document Model

15 years 7 months ago

Download homepage.fudan.edu.cn

Abstract. This paper presents a novel method of automatic image semantic annotation. Our approach is based on the Image-Keyword Document Model (IKDM) with image features discretiza...

Xiangdong Zhou, Lian Chen, Jianye Ye, Qi Zhang, Ba...

claim paper

Read More »

click to vote

CIKM
2008
Springer

133views Information Technology» more CIKM 2008»

Achieving both high precision and high recall in near-duplicate detection

15 years 3 months ago

Download www.infomall.cn

To find near-duplicate documents, fingerprint-based paradigms such as Broder's shingling and Charikar's simhash algorithms have been recognized as effective approaches a...

Lian'en Huang, Lei Wang, Xiaoming Li

claim paper

Read More »

157

click to vote

CLEF
2011
Springer

255views Information Technology» more CLEF 2011»

A Language-Independent Approach to Identify the Named Entities in Under-Resourced Languages and Clustering Multilingual Document

14 years 1 months ago

Download web2py.iiit.ac.in

Abstract. This paper presents a language-independent Multilingual Document Clustering (MDC) approach on comparable corpora. Named entites (NEs) such as persons, locations, organiza...

N. Kiran Kumar, G. S. K. Santosh, Vasudeva Varma

claim paper

Read More »

« Prev « First page 27 / 60 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers