Ontology generation for large email collections

This paper presents a new approach to identifying the concepts expressed in a collection of email messages and organizing them into an ontology or taxonomy for browsing. The approach combines techniques from text mining, information retrieval, natural language processing, and machine learning to generate a concept ontology. Nominal N-gram mining identifies candidate concepts. WordNet and surface text pattern matching identify relationships among the concepts. A supervised clustering algorithm then groups the concepts further. The experiments show that the approach is effective.
Categories and Subject Descriptors: H.4 [Information Systems Applications]: Miscellaneous
Keywords: concept ontology, supervised clustering, eRulemaking
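The first two stages named in the abstract (candidate mining and pattern-based relation extraction) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the stopword list, the `min_count` threshold, and the single "X such as Y" pattern are all assumptions made for illustration. The stopword filter is only a crude stand-in for true nominal filtering, which would need a part-of-speech tagger. The WordNet lookup and the supervised clustering step are omitted.

```python
import re
from collections import Counter

# Hypothetical stopword list; a real system would filter by part of speech
# to keep only nominal N-grams.
STOPWORDS = {"the", "a", "an", "of", "and", "or", "to", "in", "on",
             "for", "is", "are", "such", "as"}


def candidate_ngrams(messages, max_n=3, min_count=2):
    """Count word N-grams (N <= max_n) that contain no stopword and
    return those seen at least min_count times across all messages."""
    counts = Counter()
    for text in messages:
        tokens = re.findall(r"[a-z]+", text.lower())
        for n in range(1, max_n + 1):
            for i in range(len(tokens) - n + 1):
                gram = tokens[i:i + n]
                if any(tok in STOPWORDS for tok in gram):
                    continue
                counts[" ".join(gram)] += 1
    return {g: c for g, c in counts.items() if c >= min_count}


# One assumed surface text pattern: "<hypernym> such as <hyponym>".
IS_A_PATTERN = re.compile(r"(\w+) such as (\w+)")


def is_a_pairs(messages):
    """Return (hyponym, hypernym) pairs matched by the assumed pattern."""
    pairs = set()
    for text in messages:
        for hypernym, hyponym in IS_A_PATTERN.findall(text.lower()):
            pairs.add((hyponym, hypernym))
    return pairs
```

Called on a list of message bodies, `candidate_ngrams` yields frequent candidate phrases and `is_a_pairs` yields single-word is-a links that could seed the hierarchy. Multi-word concepts and further patterns would need a richer matcher.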
Hui Yang, Jamie Callan
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where DGO