Sciweavers

150 search results - page 2 / 30
» A neighborhood-based approach for clustering of linked docum...
ICAIL
2005
ACM
13 years 11 months ago
Effective Document Clustering for Large Heterogeneous Law Firm Collections
Computational resources for research in legal environments have historically implied remote access to large databases of legal documents such as case law, statutes, law reviews an...
Jack G. Conrad, Khalid Al-Kofahi, Ying Zhao, Georg...
KDD
2010
ACM
13 years 3 months ago
Document clustering via Dirichlet process mixture model with feature selection
One essential issue of document clustering is to estimate the appropriate number of clusters for a document collection to which documents should be partitioned. In this paper, we ...
Guan Yu, Ruizhang Huang, Zhaojun Wang
AAAI
2010
13 years 7 months ago
Utilizing Context in Generative Bayesian Models for Linked Corpus
In an interlinked corpus of documents, the context in which a citation appears provides extra information about the cited document. However, associating terms in the context to th...
Saurabh Kataria, Prasenjit Mitra, Sumit Bhatia
SIGIR
2009
ACM
14 years 8 days ago
A latent topic model for linked documents
Documents in many corpora, such as digital libraries and webpages, contain both content and link information. To explicitly consider the document relations represented by links, i...
Zhen Guo, Shenghuo Zhu, Yun Chi, Zhongfei Zhang, Y...
KDD
2005
ACM
14 years 6 months ago
A hybrid unsupervised approach for document clustering
We propose a hybrid, unsupervised document clustering approach that combines a hierarchical clustering algorithm with Expectation Maximization. We developed several heuristics to ...
Mihai Surdeanu, Jordi Turmo, Alicia Ageno