This paper presents a general framework for building classifiers that deal with short and sparse text & Web segments by making the most of hidden topics discovered from larges...
To classify spectroscopic measurements it is necessary to have comparable methods of evaluation. In Terahertz (THz) time-domain spectroscopy, as a new technology, neither the pres...
Henrike Stephani, Joachim Jonuscheit, Christoph Ro...
The most basic assumption used in statistical learning theory is that training data and test data are drawn from the same underlying distribution. Unfortunately, in many applicati...
People enacting processes deviate from the process definition for a variety of different reasons, the consequences of which may be either positive or negative. Detecting deviation...
Tagged data is rapidly becoming more available on the World Wide Web. Web sites which populate tagging services offer a good way for Internet users to share their knowledge. An in...