Search Sciweavers | Sciweavers

ECAI
2006
Springer

93 views, Artificial Intelligence

Text Sampling and Re-Sampling for Imbalanced Authorship Identification Cases

15 years 8 months ago

Authorship identification can be seen as a single-label multi-class text categorization problem. Very often, there are extremely few training texts at least for some of the candida...

Efstathios Stamatatos

ICDM
2010
IEEE

122 views, Data Mining

Learning Preferences with Millions of Parameters by Enforcing Sparsity

15 years 2 months ago

Download www.cs.cmu.edu

We study the retrieval task that ranks a set of objects for a given query in the pairwise preference learning framework. Recently researchers found out that raw features (e.g. word...

Xi Chen, Bing Bai, Yanjun Qi, Qihang Lin, Jaime G....

ICDAR
2011
IEEE

228 views, Document Analysis

A Handwritten Character Extraction Algorithm for Multi-language Document Image

14 years 4 months ago

Download www.icdar2011.org

In this paper, we propose a novel method for extracting handwritten characters from multi-language document images, which may contain various types of characters, e.g. Chinese, ...

Yonghong Song, Guilin Xiao, Yuanlin Zhang, Lei Yan...

EMNLP
2004

143 views, Natural Language Processing

A New Approach for English-Chinese Named Entity Alignment

15 years 5 months ago

Download research.microsoft.com

Traditional word alignment approaches cannot come up with satisfactory results for Named Entities. In this paper, we propose a novel approach using a maximum entropy model for nam...

Donghui Feng, Yajuan Lü, Ming Zhou

TREC
1997

95 views, Information Technology

Conceptual Indexing Using Thematic Representation of Texts

15 years 5 months ago

Download www.cir.ru

We present the thesaurus-based indexing technology developed by the Center for Information Research under the Information System RUSSIA project. The technology is based on using b...

Boris V. Dobrov, Natalia V. Loukachevitch, Tatyana...
