We present an integrated approach to speech and natural language processing which uses a single parser to create training for a statistical speech recognition component and for in...
This paper presents an unsupervised learning approach to building a non-English (Arabic) stemmer. The stemming model is based on statistical machine translation and it uses an Eng...
This paper describes Automated Industry and Occupation Coding System (AIOCS). The main function of the system is to classify natural language responses to survey questionnaires int...
Most databases contain “name constants” like course numbers, personal names, and place names that correspond to entities in the real world. Previous work in integration of het...
In Chinese, phrases and named entities play a central role in information retrieval. Abbreviations, however, make keyword-based approaches less effective. This paper presents an em...