We present D-HOTM, a framework for Distributed Higher Order Text Mining based on named entities extracted from textual data that are stored in distributed relational databases. Unl...
This paper describes a language-independent, scalable system for both challenges of cross-document co-reference: name variation and entity disambiguation. We provide system results...
The dimensionality reduction problem has been widely studied in the database literature because of its use for concise data representation in a variety of database applica...
In this paper, we describe our Brand Association Map™ (BAM) tool, which maps and visualizes the way consumers naturally think and talk about brands across billions of unaided conv...
Optimal Component Analysis (OCA) is a linear method for feature extraction and dimension reduction. It has been widely used in many applications such as face and object recognitio...