Sciweavers

18306 search results - page 3035 / 3662
» Algorithmics in Exponential Time
Sort
View
KDD
2007
ACM
167views Data Mining» more  KDD 2007»
16 years 6 months ago
Multiscale topic tomography
Modeling the evolution of topics with time is of great value in automatic summarization and analysis of large document collections. In this work, we propose a new probabilistic gr...
Ramesh Nallapati, Susan Ditmore, John D. Lafferty,...
KDD
2006
ACM
175views Data Mining» more  KDD 2006»
16 years 6 months ago
A mixture model for contextual text mining
Contextual text mining is concerned with extracting topical themes from a text collection with context information (e.g., time and location) and comparing/analyzing the variations...
Qiaozhu Mei, ChengXiang Zhai
KDD
2006
ACM
157views Data Mining» more  KDD 2006»
16 years 6 months ago
Using structure indices for efficient approximation of network properties
Statistics on networks have become vital to the study of relational data drawn from areas such as bibliometrics, fraud detection, bioinformatics, and the Internet. Calculating man...
Matthew J. Rattigan, Marc Maier, David Jensen
188
Voted
KDD
2006
ACM
162views Data Mining» more  KDD 2006»
16 years 6 months ago
Simultaneous record detection and attribute labeling in web data extraction
Recent work has shown the feasibility and promise of templateindependent Web data extraction. However, existing approaches use decoupled strategies ? attempting to do data record ...
Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang, Wei-Y...
165
Voted
KDD
2004
ACM
195views Data Mining» more  KDD 2004»
16 years 6 months ago
Improved robustness of signature-based near-replica detection via lexicon randomization
Detection of near duplicate documents is an important problem in many data mining and information filtering applications. When faced with massive quantities of data, traditional d...
Aleksander Kolcz, Abdur Chowdhury, Joshua Alspecto...
« Prev « First page 3035 / 3662 Last » Next »