Sciweavers

112 search results for "Text-Based Document Similarity Matching Using Sdtext"
FSKD
2007
Springer
Using Fuzzy-Word Correlation Factors to Compute Document Similarity Based on Phrase Matching
One of the Web information retrieval (IR) problems these days is to identify redundant information that exists in (replicated) Web documents. These documents can easily be found in...
Jun won Lee, Yiu-Kai Ng
ICDM
2002
IEEE
Phrase-based Document Similarity Based on an Index Graph Model
Document clustering techniques mostly rely on single term analysis of the document data set, such as the Vector Space Model. To better capture the structure of documents, the unde...
Khaled M. Hammouda, Mohamed S. Kamel
BMCBI
2008
Evading the annotation bottleneck: using sequence similarity to search non-sequence gene data
Background: Non-sequence gene data (images, literature, etc.) can be found in many different public databases. Access to these data is mostly by text based methods using gene name...
Michael J. Gilchrist, Mikkel B. Christensen, Richa...
HICSS
2000
IEEE
Anti-Serendipity: Finding Useless Documents and Similar Documents
The problem of finding your way through a relatively unknown collection of digital documents can be daunting. Such collections sometimes have few categories and little hierarchy, ...
James W. Cooper, John M. Prager
KSEM
2007
Springer
Finding Similar RSS News Articles Using Correlation-Based Phrase Matching
Traditional phrase matching approaches, which can discover documents containing exactly the same phrases, fail to detect documents including phrases that are semantically relevant,...
Maria Soledad Pera, Yiu-Kai Ng