BOAT-Optimistic Decision Tree Construction

Classification is an important data mining problem. Given a training database of records, each tagged with a class label, the goal of classification is to build a concise model that can be used to predict the class label of future, unlabeled records. Decision trees are a very popular class of classifiers. All current algorithms for constructing decision trees, including all main-memory algorithms, make one scan over the training database per level of the tree. We introduce a new algorithm (BOAT) for decision tree construction that improves upon earlier algorithms in both performance and functionality. BOAT constructs several levels of the tree in only two scans over the training database, resulting in an average performance gain of 300% over previous work. The key to this performance improvement is a novel optimistic approach to tree construction, in which we construct an initial tree using a small subset of the data and then refine it to arrive at the final tree. We guarantee that any dif...
Added 03 Aug 2010
Updated 03 Aug 2010
Type Conference
Year 1999
Authors Johannes Gehrke, Venkatesh Ganti, Raghu Ramakrishnan, Wei-Yin Loh