Sciweavers

637 search results - page 92 / 128
» Training and documentation
Sort
View
ICDAR
2009
IEEE
14 years 7 months ago
An Open Source Tesseract Based Optical Character Recognizer for Bangla Script
BanglaOCR is currently the only open source optical character recognition (OCR) software for the Bangla (Bengali) script developed by the Center for Research on Bangla Language Pr...
Md. Abul Hasnat, Muttakinur Rahman Chowdhury, Mumi...
ICML
2005
IEEE
15 years 10 months ago
Learning hierarchical multi-category text classification models
We present a kernel-based algorithm for hierarchical text classification where the documents are allowed to belong to more than one category at a time. The classification model is...
Craig Saunders, John Shawe-Taylor, Juho Rousu, S&a...
MMM
2009
Springer
151views Multimedia» more  MMM 2009»
15 years 6 months ago
Large Scale Concept Detection in Video Using a Region Thesaurus
This paper presents an approach on high-level feature detection within video documents, using a Region Thesaurus. A video shot is represented by a single keyframe and MPEG-7 featur...
Evaggelos Spyrou, Giorgos Tolias, Yannis S. Avrith...
IAT
2009
IEEE
15 years 4 months ago
Opinion Mining on Newspaper Quotations
— Opinion mining is the task of extracting from a set of documents opinions expressed by a source on a specified target. This article presents a comparative study on the methods ...
Alexandra Balahur, Ralf Steinberger, Erik Van der ...
SIGIR
2009
ACM
15 years 4 months ago
Incorporating prior knowledge into a transductive ranking algorithm for multi-document summarization
This paper presents a transductive approach to learn ranking functions for extractive multi-document summarization. At the first stage, the proposed approach identifies topic th...
Massih-Reza Amini, Nicolas Usunier