Sciweavers

LRE
2008

Automatic building of an ontology on the basis of text corpora in Thai

13 years 4 months ago
Automatic building of an ontology on the basis of text corpora in Thai
This paper presents a methodology for automatic learning of ontologies from Thai text corpora, by extraction of terms and relations. A shallow parser is used to chunk texts on which we identify taxonomic relations with the help of cues: lexico-syntactic patterns and item lists. The main advantage of the approach is that it simplifies the task of concept and relation labeling since cues help for identifying the ontological concept and hinting their relation. However, these techniques pose certain problems, i.e. cue word ambiguity, item list identification, and numerous candidate terms. We also propose the methodology to solve these problems by using lexicon and cooccurrence features and weighting them with information gain. The precision, recall and F-measure of the system are 0.74, 0.78 and 0.76, respectively.
Aurawan Imsombut, Asanee Kawtrakul
Added 13 Dec 2010
Updated 13 Dec 2010
Type Journal
Year 2008
Where LRE
Authors Aurawan Imsombut, Asanee Kawtrakul
Comments (0)