Our ability to generate and collect data has been increasing rapidly.  Not only are all of our business, scientific, and government  transactions now computerized, but the widespread use of digital  cameras, publication tools, and bar codes also generate data. On the  collection side, scanned text and image platforms, satellite remote  sensing systems, and the World Wide Web have flooded us with a  tremendous amount of data. This explosive growth has generated an even  more urgent need for new techniques and automated tools that can help us  transform this data into useful information and knowledge.
Like  the first edition, voted the most popular data mining book by KD Nuggets  readers, this book explores concepts and techniques for the discovery  of patterns hidden in large data sets, focusing on issues relating to  their feasibility, usefulness, effectiveness, and scalability. However,  since the publication of the first edition, great progress has been made  in the development of new data mining methods, systems, and  applications. This new edition substantially enhances the first edition,  and new chapters have been added to address recent developments on  mining complex types of data including stream data, sequence data, graph  structured data, social network data, and multi-relational data.
Whether  you are a seasoned professional or a new student of data mining, this  book has much to offer you:
* A comprehensive, practical look at the  concepts and techniques you need to know to get the most out of real  business data.
* Updates that incorporate input from readers, changes  in the field, and more material on statistics and machine learning.
*  Dozens of algorithms and implementation examples, all in easily  understood pseudo-code and suitable for use in real-world, large-scale  data mining projects.
* Complete classroom support for instructors at  www.mkp.com/datamining2e companion site.
Download (27.3  MB)
or
Download  ebook
or
Download ebook
Data Mining, Second Edition : Concepts and Techniques (The Morgan Kaufmann Series in Data Management Systems)
Labels: Data Mining