Sciweavers

23 search results - page 5 / 5
» MatchDetectReveal: finding overlapping and similar digital d...
Sort
View
SIGMOD
2009
ACM
269views Database» more  SIGMOD 2009»
14 years 5 months ago
Efficient approximate entity extraction with edit distance constraints
Named entity recognition aims at extracting named entities from unstructured text. A recent trend of named entity recognition is finding approximate matches in the text with respe...
Wei Wang 0011, Chuan Xiao, Xuemin Lin, Chengqi Zha...
KDD
2003
ACM
99views Data Mining» more  KDD 2003»
14 years 5 months ago
Fragments of order
High-dimensional collections of 0-1 data occur in many applications. The attributes in such data sets are typically considered to be unordered. However, in many cases there is a n...
Aristides Gionis, Teija Kujala, Heikki Mannila
GECCO
2007
Springer
206views Optimization» more  GECCO 2007»
13 years 8 months ago
Using code metric histograms and genetic algorithms to perform author identification for software forensics
We have developed a technique to characterize software developers' styles using a set of source code metrics. This style fingerprint can be used to identify the likely author...
Robert Charles Lange, Spiros Mancoridis