Sciweavers

ACL
2006
13 years 6 months ago
Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization
Cross-language Text Categorization is the task of assigning semantic classes to documents written in a target language (e.g. English) while the system is trained using labeled doc...
Alfio Massimiliano Gliozzo, Carlo Strapparava
AAAI
2006
13 years 6 months ago
Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge
When humans approach the task of text categorization, they interpret the specific wording of the document in the much larger context of their background knowledge and experience. ...
Evgeniy Gabrilovich, Shaul Markovitch
MEDINFO
2007
13 years 6 months ago
Using Discourse Analysis to Improve Text Categorization in MEDLINE
PROBLEM: Automatic keyword assignment has been largely studied in medical informatics in the context of the MEDLINE database, both for helping search in MEDLINE and in order to pr...
Patrick Ruch, Antoine Geissbühler, Julien Gob...
LREC
2008
82views Education» more  LREC 2008»
13 years 6 months ago
An eRulemaking Corpus: Identifying Substantive Issues in Public Comments
We describe the creation of a corpus that supports a real-world hierarchical text categorization task in the domain of electronic rulemaking (eRulemaking). Features of the task an...
Claire Cardie, Cynthia Farina, Matt Rawding, Adil ...
DGO
2008
113views Education» more  DGO 2008»
13 years 6 months ago
A study in rule-specific issue categorization for e-rulemaking
We address the e-rulemaking problem of categorizing public comments according to the issues that they address. In contrast to previous text categorization research in e-rulemaking...
Claire Cardie, Cynthia Farina, Adil Aijaz, Matt Ra...
COLING
2008
13 years 6 months ago
Exact Inference for Multi-label Classification using Sparse Graphical Models
This paper describes a parameter estimation method for multi-label classification that does not rely on approximate inference. It is known that multi-label classification involvin...
Yusuke Miyao, Jun-ichi Tsujii
ICALT
2007
IEEE
13 years 6 months ago
An Evaluation of Automatic Text Categorization in Online Discussion Analysis
Content analysis is often employed by teachers and research to analyse online discussion forums to serve various purposes such as assessment, evaluation, and educational research....
Andrew Kwok-Fai Lui, Siu Cheung Li, Sheung-On Choy
ECML
2006
Springer
13 years 6 months ago
Distributional Features for Text Categorization
Abstract-- Text categorization is the task of assigning predefined categories to natural language text. With the widely used `bag of words' representation, previous researches...
Xiao-Bing Xue, Zhi-Hua Zhou
CIKM
2008
Springer
13 years 6 months ago
Semi-supervised text categorization by active search
In automated text categorization, given a small number of labeled documents, it is very challenging, if not impossible, to build a reliable classifier that is able to achieve high...
Zenglin Xu, Rong Jin, Kaizhu Huang, Michael R. Lyu...
PRICAI
2000
Springer
13 years 8 months ago
A Comparative Study on Chinese Text Categorization Methods
Abstract. This paper reports our comparative evaluation of three machine learning methods on Chinese text categorization. Whereas a wide range of methods have been applied to Engli...
Ji He, Ah-Hwee Tan, Chew Lim Tan