Preface

This is Dr. Jesse Crawford’s Materials from his Data Mining class. I was given an opportunity to convert his scripts and powerpoint into r markdown document. After all these years, I am making an attempt to convert this into a book like structure to keep it relevant.

Structure of the book

[Starts with basic R]
[Cover Statistics or math]
[Working code and projects]

Technical aspects of this book

The packages used throughout this book and R version are as follows

## R version 4.0.2 (2020-06-22)
## Platform: x86_64-apple-darwin19.5.0 (64-bit)
## Running under: macOS Catalina 10.15.5
## 
## Matrix products: default
## BLAS:   /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
## LAPACK: /usr/local/Cellar/openblas/0.3.10_1/lib/libopenblasp-r0.3.10.dylib
## 
## locale:
## [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
## 
## attached base packages:
## [1] stats4    grid      stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
##  [1] DMwR_0.4.1          gridExtra_2.3       proto_1.0.0         expm_0.999-5        mixtools_1.2.0     
##  [6] fpc_2.2-7           cluster_2.1.0       fields_10.3         maps_3.3.0          spam_2.5-1         
## [11] dotCall64_1.0-0     mlbench_2.1-1       cvTools_0.3.2       robustbase_0.93-6   lattice_0.20-41    
## [16] exact2x2_1.6.5      exactci_1.3-3       ssanv_1.1           party_1.3-5         strucchange_1.5-2  
## [21] sandwich_2.5-1      zoo_1.8-8           modeltools_0.2-23   mvtnorm_1.1-1       randomForest_4.6-14
## [26] kknn_1.3.1          bestglm_0.37.3      leaps_3.1           lawstat_3.4         MASS_7.3-51.6      
## [31] Matrix_1.2-18       nnet_7.3-14         e1071_1.7-3         class_7.3-17        GGally_2.0.0       
## [36] pROC_1.16.2         RColorBrewer_1.1-2  rattle_5.4.0        bitops_1.0-6        tibble_3.0.3       
## [41] rpart.plot_3.0.8    rpart_4.1-15        ggplot2_3.3.2      
## 
## loaded via a namespace (and not attached):
##  [1] TH.data_1.0-10     colorspace_1.4-1   ellipsis_0.3.1     mclust_5.4.6       pls_2.7-3         
##  [6] rstudioapi_0.11    farver_2.0.3       flexmix_2.3-15     coin_1.3-1         codetools_0.2-16  
## [11] splines_4.0.2      libcoin_1.0-6      knitr_1.29         jsonlite_1.7.0     kernlab_0.9-29    
## [16] servr_0.18         Kendall_2.2        compiler_4.0.2     later_1.1.0.1      htmltools_0.5.0   
## [21] prettyunits_1.1.1  tools_4.0.2        igraph_1.2.5       gtable_0.3.0       glue_1.4.1        
## [26] dplyr_1.0.2        Rcpp_1.0.5         vctrs_0.3.2        nlme_3.1-148       iterators_1.0.12  
## [31] grpreg_3.3.0       gbRd_0.4-11        xfun_0.16          stringr_1.4.0      mime_0.9          
## [36] lifecycle_0.2.0    DEoptimR_1.0-8     scales_1.1.1       hms_0.5.3          promises_1.1.1    
## [41] parallel_4.0.2     curl_4.3           quantmod_0.4.17    yaml_2.2.1         segmented_1.2-0   
## [46] reshape_0.8.8      stringi_1.4.6      foreach_1.5.0      TTR_0.24.0         boot_1.3-25       
## [51] bibtex_0.4.2.2     shape_1.4.4        Rdpack_1.0.0       rlang_0.4.7        pkgconfig_2.0.3   
## [56] prabclus_2.3-2     matrixStats_0.56.0 evaluate_0.14      ROCR_1.0-11        purrr_0.3.4       
## [61] labeling_0.3       tidyselect_1.1.0   plyr_1.8.6         magrittr_1.5       bookdown_0.20     
## [66] R6_2.4.1           generics_0.0.2     multcomp_1.4-13    pillar_1.4.6       withr_2.2.0       
## [71] mgcv_1.8-31        xts_0.12-0         abind_1.4-5        survival_3.1-12    crayon_1.3.4      
## [76] rmarkdown_2.3      progress_1.2.2     diptest_0.75-7     digest_0.6.25      httpuv_1.5.4      
## [81] munsell_0.5.0      glmnet_4.0-2