Sciweavers

» A Novel Method for Detecting Similar Documents
TREC
2007
15 years 3 months ago
On Retrieving Legal Files: Shortening Documents and Weeding Out Garbage
This paper describes our participation in the TREC Legal experiments in 2007. We have applied novel normalization techniques that are designed to slightly favor longer documents i...
Scott Kulp, April Kontostathis
EDBT
2009
ACM
15 years 6 months ago
G-hash: towards fast kernel-based similarity search in large graph databases
Structured data, including sets, sequences, trees and graphs, pose significant challenges to fundamental aspects of data management such as efficient storage, indexing, and simila...
Xiaohong Wang, Aaron M. Smalter, Jun Huan, Gerald ...
WWW
2009
ACM
16 years 2 months ago
Measuring the similarity between implicit semantic relations from the web
Measuring the similarity between semantic relations that hold among entities is an important and necessary step in various Web-related tasks such as relation extraction, informati...
Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuk...
ICDAR
2009
IEEE
15 years 8 months ago
Automatic Corresponding Control Points Selection for Historical Document Image Registration
Image registration is crucial for various image analysis tasks. In particular, most approaches to correction of bleed-through distortion on handwritten document images require the...
Jie Wang, Michael S. Brown, Chew Lim Tan
ICPR
2004
IEEE
16 years 3 months ago
Morphological Tagging Approach in Document Analysis of Invoices
In this paper a morphological tagging approach for document image invoice analysis is described. Tokens close by their morphology and confirmed in their location within different ...
Abdel Belaïd, Yolande Belaïd