Sciweavers

285 search results - page 33 / 57
» Ontology-based Text Document Clustering
EMNLP
2004
15 years 1 month ago
Trained Named Entity Recognition using Distributional Clusters
This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured documents, to the problem of named entity recogniti...
Dayne Freitag
JILT
2000
15 years 7 days ago
Automatic Classification and Intelligent Clustering for WWWeb Information Retrieval Systems
In this paper we present some aspects of an intelligent interface for a WWWeb legal information retrieval system. Our system is able to keep the context of the user interaction in...
Paulo Quaresma, Irene Pimenta Rodrigues
DASFAA
2004
IEEE
15 years 4 months ago
Semi-supervised Text Classification Using Partitioned EM
Text classification using a small labeled set and a large unlabeled data is seen as a promising technique to reduce the labor-intensive and time consuming effort of labeling traini...
Gao Cong, Wee Sun Lee, Haoran Wu, Bing Liu
IPM
2007
15 years 10 days ago
Text mining techniques for patent analysis
Patent documents contain important research results. However, they are lengthy and rich in technical terminology such that it takes a lot of human efforts for analyses. Automatic...
Yuen-Hsien Tseng, Chi-Jen Lin, Yu-I Lin
NIPS
2008
15 years 1 month ago
Semi-supervised Learning with Weakly-Related Unlabeled Data: Towards Better Text Categorization
The cluster assumption is exploited by most semi-supervised learning (SSL) methods. However, if the unlabeled data is merely weakly related to the target classes, it becomes quest...
Liu Yang, Rong Jin, Rahul Sukthankar