Sciweavers

2227 search results - page 303 / 446
» Graph Mining based on a Data Partitioning Approach
Sort
View
ICDM
2008
IEEE
147views Data Mining» more  ICDM 2008»
15 years 9 months ago
Clustering Documents with Active Learning Using Wikipedia
Wikipedia has been applied as a background knowledge base to various text mining problems, but very few attempts have been made to utilize it for document clustering. In this pape...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
WWW
2010
ACM
15 years 10 months ago
A pattern tree-based approach to learning URL normalization rules
Duplicate URLs have brought serious troubles to the whole pipeline of a search engine, from crawling, indexing, to result serving. URL normalization is to transform duplicate URLs...
Tao Lei, Rui Cai, Jiang-Ming Yang, Yan Ke, Xiaodon...
ESWS
2008
Springer
15 years 5 months ago
Instance Based Clustering of Semantic Web Resources
Abstract. The original Semantic Web vision was explicit in the need for intelligent autonomous agents that would represent users and help them navigate the Semantic Web. We argue t...
Gunnar Aastrand Grimnes, Peter Edwards, Alun D. Pr...
PKDD
2009
Springer
118views Data Mining» more  PKDD 2009»
15 years 10 months ago
Protein Identification from Tandem Mass Spectra with Probabilistic Language Modeling
This paper presents an interdisciplinary investigation of statistical information retrieval (IR) techniques for protein identification from tandem mass spectra, a challenging probl...
Yiming Yang, Abhay Harpale, Subramaniam Ganapathy
146
Voted
WSDM
2010
ACM
215views Data Mining» more  WSDM 2010»
16 years 24 days ago
GeoFolk: Latent spatial semantics in Web 2.0 social media
We describe an approach for multi-modal characterization of social media by combining text features (e.g. tags as a prominent example of short, unstructured text labels) with spat...
Sergej Sizov