Recently, the credibility of information on the Web has become an important issue. In addition to telling about content of source documents, indicating how to interpret the conten...
Corruption of data by class-label noise is an important practical concern impacting many classification problems. Studies of data cleaning techniques often assume a uniform label ...
Abstract. Extractive text summarization is the process of selecting relevant sentences from a collection of documents, perhaps only a single document, and arranging such sentences ...
This paper describes a light-weight, content-based, functional query language. The Mathematical Services Query Language (MSQL) has been developed for querying mathematical web ser...
Web archives preserve the history of Web sites and have high long-term value for media and business analysts. Such archives are maintained by periodically re-crawling entire Web s...
Marc Spaniol, Dimitar Denev, Arturas Mazeika, Gerh...