Getting George H. John's Thesis/Book

Now available! Enhancements to the Data Mining Process by George H. John, a doctoral dissertation from Stanford University with lessons for data mining practitioners, researchers, and students alike.

"The introduction should be required reading for all data analysts... I couldn't put it down!"
-- Prof. Jerry Friedman, co-Inventor of CART

"Insightful, readable, and with a touch of humor"
-- Herb Edelstein, President, Two Crows Corporation

The book is organized around the data mining process, with each chapter discussing a new method for handling one step, such as data extraction, data cleaning, or data engineering. The bulk of the dissertation is geared towards a technical audience, but the introductory chapter should be readable by a wide audience. All of the technical chapters begin with motivating exmples, and with nearly 60 figures and tables, readers who wish to skip the mathematical formulas should benefit from reading the technical chapters as well.

Please fill out the optional short survey below, then click the button at the bottom of the page to proceed. You can also view the ad or read the abstract & table of contents of the thesis.

Your Name:

Email address:

Your Company/University:

Your Position:

Your Level of Education:

How many years' experience do you have with data mining?

Any other comments?

Click to proceed.

Recently there has been a problem with the way this website handles CGI scripts. If the submit button doesn't work, please click here to go directly to the thesis page.