
SIGKDD
2000

KDD-Cup 2000 Organizers' Report: Peeling the Onion

We describe KDD-Cup 2000, the yearly competition in data mining. For the first time the Cup included insight problems in addition to prediction problems, thus posing new challenges in both the knowledge discovery and the evaluation criteria, and highlighting the need to "peel the onion" and drill deeper into the reasons for the initial patterns found. We chronicle the data generation phase starting from the collection at the site through its conversion to a star schema in a warehouse through data cleansing, data obfuscation for privacy protection, and data aggregation. We describe the information given to the participants, including the questions, site structure, the marketing calendar, and the data schema. Finally, we discuss interesting insights, common mistakes, and lessons learned. Three winners were announced and they describe their own experiences and lessons in the pages following this paper.
Keywords KDD-Cup, e-commerce, competition, data mining, real-world data, ins...
Added 19 Dec 2010
Updated 19 Dec 2010
Type Journal
Year 2000
Where SIGKDD
Authors Ron Kohavi, Carla E. Brodley, Brian Frasca, Llew Mason, Zijian Zheng