Sciweavers

ACL
2006

Concept Unification of Terms in Different Languages for IR

13 years 6 months ago
Concept Unification of Terms in Different Languages for IR
Due to the historical and cultural reasons, English phases, especially the proper nouns and new words, frequently appear in Web pages written primarily in Asian languages such as Chinese and Korean. Although these English terms and their equivalences in the Asian languages refer to the same concept, they are erroneously treated as independent index units in traditional Information Retrieval (IR). This paper describes the degree to which the problem arises in IR and suggests a novel technique to solve it. Our method firstly extracts an English phrase from Asian language Web pages, and then unifies the extracted phrase and its equivalence(s) in the language as one index unit. Experimental results show that the high precision of our conceptual unification approach greatly improves the IR performance.
Qing Li, Sung-Hyon Myaeng, Yun Jin, Bo-Yeong Kang
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2006
Where ACL
Authors Qing Li, Sung-Hyon Myaeng, Yun Jin, Bo-Yeong Kang
Comments (0)