A Hybrid Model for Part-of-Speech Tagging and its Application to Bengali

15 years 2 months ago

Download shiva.iiit.ac.in

This paper describes our work on Bengali Part of Speech (POS) tagging using a corpus-based approach. There are several approaches for part of speech tagging. This paper deals with a model that uses a combination of supervised and unsupervised learning using a Hidden Markov Model (HMM). We make use of small tagged corpus and a large untagged corpus. We also make use of Morphological Analyzer. Bengali is a highly ambiguous and relatively free word order language. We have obtained an overall accuracy of 95%. Keywords--Natural Language Processing, Machine Learning and Statistical Technology .

Sandipan Dandapat, Sudeshna Sarkar, Anupam Basu

Real-time Traffic

Artificial Intelligence | Bengali Part | IJIT 2004 | Large Untagged Corpus | Small Tagged Corpus |

claim paper

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2004
Where	IJIT
Authors	Sandipan Dandapat, Sudeshna Sarkar, Anupam Basu

Comments (0)

Sciweavers

A Hybrid Model for Part-of-Speech Tagging and its Application to Bengali

Artificial Intelligence | Bengali Part | IJIT 2004 | Large Untagged Corpus | Small Tagged Corpus |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers