Scalable look-ahead linear regression trees

14 years 5 months ago

Download www.dataminingsolutions.net

Most decision tree algorithms base their splitting decisions on a piecewise constant model. Often these splitting algorithms are extrapolated to trees with non-constant models at the leaf nodes. The motivation behind Look-ahead Linear Regression Trees (LLRT) is that out of all the methods proposed to date, there has been no scalable approach to exhaustively evaluate all possible models in the leaf nodes in order to obtain an optimal split. Using several optimizations, LLRT is able to generate and evaluate thousands of linear regression models per second. This allows for a near-exhaustive evaluation of all possible splits in a node, based on the quality of fit of linear regression models in the resulting branches. We decompose the calculation of the Residual Sum of Squares in such a way that a large part of it is pre-computed. The resulting method is highly scalable. We observe it to obtain high predictive accuracy for problems with strong mutual dependencies between attributes. We rep...

David S. Vogel, Ognian Asparouhov, Tobias Scheffer

Real-time Traffic

Data Mining | KDD 2007 | Linear Regression Models | Linear Regression Tree | Linear Regression Trees |

claim paper

Added	30 Nov 2009
Updated	30 Nov 2009
Type	Conference
Year	2007
Where	KDD
Authors	David S. Vogel, Ognian Asparouhov, Tobias Scheffer

Sciweavers

Scalable look-ahead linear regression trees

Data Mining | KDD 2007 | Linear Regression Models | Linear Regression Tree | Linear Regression Trees |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers