Hands on Data Science
2020-09-07
Preface
This book is based on Dr. Jesse Crawford's materials from his Data Mining class. I was given the opportunity to convert his scripts and PowerPoint slides into R Markdown documents. Years later, I am reworking them into a book-like structure to keep the material relevant.
Structure of the book
[Starts with basic R]
[Covers statistics or math]
[Working code and projects]
Technical aspects of this book
The R version and packages used throughout this book are as follows:
## R version 4.0.2 (2020-06-22)
## Platform: x86_64-apple-darwin19.5.0 (64-bit)
## Running under: macOS Catalina 10.15.5
##
## Matrix products: default
## BLAS: /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
## LAPACK: /usr/local/Cellar/openblas/0.3.10_1/lib/libopenblasp-r0.3.10.dylib
##
## locale:
## [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
##
## attached base packages:
## [1] stats4 grid stats graphics grDevices utils datasets methods base
##
## other attached packages:
## [1] DMwR_0.4.1 gridExtra_2.3 proto_1.0.0 expm_0.999-5 mixtools_1.2.0
## [6] fpc_2.2-7 cluster_2.1.0 fields_10.3 maps_3.3.0 spam_2.5-1
## [11] dotCall64_1.0-0 mlbench_2.1-1 cvTools_0.3.2 robustbase_0.93-6 lattice_0.20-41
## [16] exact2x2_1.6.5 exactci_1.3-3 ssanv_1.1 party_1.3-5 strucchange_1.5-2
## [21] sandwich_2.5-1 zoo_1.8-8 modeltools_0.2-23 mvtnorm_1.1-1 randomForest_4.6-14
## [26] kknn_1.3.1 bestglm_0.37.3 leaps_3.1 lawstat_3.4 MASS_7.3-51.6
## [31] Matrix_1.2-18 nnet_7.3-14 e1071_1.7-3 class_7.3-17 GGally_2.0.0
## [36] pROC_1.16.2 RColorBrewer_1.1-2 rattle_5.4.0 bitops_1.0-6 tibble_3.0.3
## [41] rpart.plot_3.0.8 rpart_4.1-15 ggplot2_3.3.2
##
## loaded via a namespace (and not attached):
## [1] TH.data_1.0-10 colorspace_1.4-1 ellipsis_0.3.1 mclust_5.4.6 pls_2.7-3
## [6] rstudioapi_0.11 farver_2.0.3 flexmix_2.3-15 coin_1.3-1 codetools_0.2-16
## [11] splines_4.0.2 libcoin_1.0-6 knitr_1.29 jsonlite_1.7.0 kernlab_0.9-29
## [16] servr_0.18 Kendall_2.2 compiler_4.0.2 later_1.1.0.1 htmltools_0.5.0
## [21] prettyunits_1.1.1 tools_4.0.2 igraph_1.2.5 gtable_0.3.0 glue_1.4.1
## [26] dplyr_1.0.2 Rcpp_1.0.5 vctrs_0.3.2 nlme_3.1-148 iterators_1.0.12
## [31] grpreg_3.3.0 gbRd_0.4-11 xfun_0.16 stringr_1.4.0 mime_0.9
## [36] lifecycle_0.2.0 DEoptimR_1.0-8 scales_1.1.1 hms_0.5.3 promises_1.1.1
## [41] parallel_4.0.2 curl_4.3 quantmod_0.4.17 yaml_2.2.1 segmented_1.2-0
## [46] reshape_0.8.8 stringi_1.4.6 foreach_1.5.0 TTR_0.24.0 boot_1.3-25
## [51] bibtex_0.4.2.2 shape_1.4.4 Rdpack_1.0.0 rlang_0.4.7 pkgconfig_2.0.3
## [56] prabclus_2.3-2 matrixStats_0.56.0 evaluate_0.14 ROCR_1.0-11 purrr_0.3.4
## [61] labeling_0.3 tidyselect_1.1.0 plyr_1.8.6 magrittr_1.5 bookdown_0.20
## [66] R6_2.4.1 generics_0.0.2 multcomp_1.4-13 pillar_1.4.6 withr_2.2.0
## [71] mgcv_1.8-31 xts_0.12-0 abind_1.4-5 survival_3.1-12 crayon_1.3.4
## [76] rmarkdown_2.3 progress_1.2.2 diptest_0.75-7 digest_0.6.25 httpuv_1.5.4
## [81] munsell_0.5.0 glmnet_4.0-2
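The listing above has the format printed by base R's `sessionInfo()`. To compare your own setup against it, a minimal sketch along the following lines may help. The three packages checked are an illustrative selection from the attached packages listed above, and the variable names (`book_r_version`, `pkgs`, `installed`) are assumptions made for this example, not part of the original materials.

```r
# Capture and print the R version, platform, and attached packages
# for the current session.
info <- sessionInfo()
print(info)

# Compare the running R version with the one shown in the listing (4.0.2).
book_r_version <- "4.0.2"
if (getRversion() != book_r_version) {
  message("Running R ", as.character(getRversion()),
          "; the listing above was produced with R ", book_r_version)
}

# Check whether a few of the attached packages from the listing are installed.
# This selection is illustrative, not a complete list.
pkgs <- c("ggplot2", "rpart", "randomForest")
installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
print(installed)
```

Package versions newer than those listed will usually work, but results may differ slightly from the ones printed in the book.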