E-Book, Englisch, 139 Seiten, eBook
Olson / Lauhoff Descriptive Data Mining
2. Auflage 2019
ISBN: 978-981-13-7181-3
Verlag: Springer Singapore
Format: PDF
Kopierschutz: 1 - PDF Watermark
E-Book, Englisch, 139 Seiten, eBook
Reihe: Computational Risk Management
ISBN: 978-981-13-7181-3
Verlag: Springer Singapore
Format: PDF
Kopierschutz: 1 - PDF Watermark
This book provides an overview of data mining methods demonstrated by software. Knowledge management involves application of human knowledge (epistemology) with the technological advances of our current society (computer systems) and big data, both in terms of collecting data and in analyzing it. We see three types of analytic tools. Descriptive analytics focus on reports of what has happened. Predictive analytics extend statistical and/or artificial intelligence to provide forecasting capability. It also includes classification modeling. Diagnostic analytics can apply analysis to sensor input to direct control systems automatically. Prescriptive analytics applies quantitative models to optimize systems, or at least to identify improved systems. Data mining includes descriptive and predictive modeling. Operations research includes all three. This book focuses on descriptive analytics.
The book seeks to provide simple explanations and demonstration of some descriptive tools. This second edition provides more examples of big data impact, updates the content on visualization, clarifies some points, and expands coverage of association rules and cluster analysis. Chapter 1 gives an overview in the context of knowledge management. Chapter 2 discusses some basic software support to data visualization. Chapter 3 covers fundamentals of market basket analysis, and Chapter 4 provides demonstration of RFM modeling, a basic marketing data mining tool. Chapter 5 demonstrates association rule mining. Chapter 6 is a more in-depth coverage of cluster analysis. Chapter 7 discusses link analysis.
Models are demonstrated using business related data. The style of the book is intended to be descriptive, seeking to explain how methods work, with some citations, but without deep scholarly reference. The data sets and software are all selected for widespread availability and access by any reader with computer links.
David L. Olson is the James & H.K. Stuart Professor in MIS and Chancellor's Professor at the University of Nebraska. He has published over 200 articles in refereed journals, primarily on the topic of multiple objective decision-making and information technology. He has authored over 20 books, is co-editor-in-chief of the International Journal of Services Sciences and associate editor of a number of journals. He has given over 150 presentations at international and national conferences. He is a member of the Decision Sciences Institute, the Institute for Operations Research and Management Sciences, and the Multiple Criteria Decision Making Society. He was a Lowry Mays endowed Professor at Texas A&M University from 1999 to 2001, was named the Raymond E. Miles Distinguished Scholar in 2002, and was James C. and Rhonda Seacrest Fellow from 2005 to 2006. He was named Best Enterprise Information Systems Educator by IFIP in 2006. He is a Fellow of the Decision Sciences Institute.
Zielgruppe
Research
Autoren/Hrsg.
Weitere Infos & Material
1;Preface;6
2;Book Concept;7
3;Contents;8
4;About the Authors;10
5;1 Knowledge Management;11
5.1;Computer Support Systems;12
5.2;Examples of Knowledge Management;14
5.3;Data Mining Descriptive Applications;17
5.4;Summary;18
5.5;References;18
6;2 Data Visualization;20
6.1;Data Visualization;20
6.2;R Software;21
6.2.1;Loan Data;22
6.3;Energy Data;29
6.4;Basic Visualization of Time Series;30
6.5;Conclusion;37
6.6;References;39
7;3 Market Basket Analysis;40
7.1;Definitions;41
7.2;Co-occurrence;42
7.3;Demonstration;46
7.3.1;Fit;47
7.3.2;Profit;47
7.3.3;Lift;50
7.4;Market Basket Limitations;52
7.5;References;53
8;4 Recency Frequency and Monetary Analysis;54
8.1;Dataset 1;55
8.2;Balancing Cells;59
8.3;Lift;61
8.4;Value Function;62
8.5;Data Mining Classification Models;67
8.5.1;Logistic Regression;67
8.5.2;Decision Tree;68
8.5.3;Neural Networks;68
8.6;Dataset 2;68
8.7;Conclusions;72
8.8;References;74
9;5 Association Rules;76
9.1;Methodology;77
9.2;The Apriori Algorithm;78
9.3;Association Rules from Software;80
9.4;Non-negative Matric Factorization;84
9.5;Conclusion;85
9.6;References;85
10;6 Cluster Analysis;86
10.1;K-Means Clustering;87
10.1.1;A Clustering Algorithm;87
10.1.2;Loan Data;88
10.2;Clustering Methods Used in Software;90
10.3;Software;91
10.3.1;R (Rattle) K-Means Clustering;91
10.3.2;Other R Clustering Algorithms;97
10.3.3;KNIME;105
10.3.4;WEKA;107
10.4;Summary;114
10.5;References;115
11;7 Link Analysis;116
11.1;Link Analysis Terms;116
11.2;Basic Network Graphics with NodeXL;123
11.3;Network Analysis of Facebook Network or Other Networks;127
11.4;Link Analysis of Your Emails;133
11.5;Link Analysis Application with PolyAnalyst (Olson and Shi 2007);134
11.6;Summary;137
11.7;References;137
12;8 Descriptive Data Mining;138