In this paper, we try to leverage a large-scale and multilingual knowledge base, Wikipedia, to help effectively analyze and organize Web information written in different languages...
This paper studies the problem of unified ranked retrieval of heterogeneous XML documents and Web data. We propose an effective search engine called Sailer to adaptively and versa...
Yellow pages catalogs and corresponding directory services on the web are a widely used business concept for helping people to find companies providing services and selling product...
Government regulations are semi-structured text documents that are often voluminous, heavily cross-referenced between provisions and even ambiguous. Multiple sources of regulation...
We present an event based system for storing, managing, and presenting personal multimedia history. The development of such systems is a challenge because information about person...