Sciweavers

374 search results - page 23 / 75
» Modeling Chinese Documents with Topical Word-Character Model...
ICDAR
2009
IEEE
14 years 9 months ago
Using Kernel Density Classifier with Topic Model and Cost Sensitive Learning for Automatic Text Categorization
This paper proposes a novel framework for the automatic text categorization problem based on the kernel density classifier. The overall goal is to tackle two main issues in automatic ...
Dwi Sianto Mansjur, Ted S. Wada, Biing-Hwang Juang
NIPS
2007
15 years 1 months ago
Supervised Topic Models
We introduce supervised latent Dirichlet allocation (sLDA), a statistical model of labelled documents. The model accommodates a variety of response types. We derive a maximum-like...
David M. Blei, Jon D. McAuliffe
ICDAR
2011
IEEE
13 years 11 months ago
A Novel Italic Detection and Rectification Method for Chinese Advertising Images
The italic detection and slant rectification is a key step of optical character recognition (OCR). In this paper, a novel method is proposed to detect and rectify italic charact...
Jie Liu, Heping Li, Shuwu Zhang, Wei Liang
WWW
2011
ACM
14 years 6 months ago
Geographical topic discovery and comparison
This paper studies the problem of discovering and comparing geographical topics from GPS-associated documents. GPS-associated documents become popular with the pervasiveness of loc...
Zhijun Yin, Liangliang Cao, Jiawei Han, Chengxiang...
EMNLP
2010
14 years 9 months ago
Staying Informed: Supervised and Semi-Supervised Multi-View Topical Analysis of Ideological Perspective
With the proliferation of user-generated articles over the web, it becomes imperative to develop automated methods that are aware of the ideological-bias implicit in a document co...
Amr Ahmed, Eric P. Xing