A collection of R Markdown notebooks covering principles, concepts, and methods in the fields of data mining and knowledge discovery. Topics include: data visualization, exploration, clustering, classification, association rule mining, and anomaly detection, among others.
- Data Collection and Preprocessing
- Dimensionality Reduction and PCA
- Association Rules
- Clustering
- Text Mining
These notebooks were put together as part of Florida Poly's Data Mining and Text Mining course (Fall 2020), an application-driven introduction to data mining and text mining, covering fundamental and popular R packages for data mining and text mining, introduced as working examples. The notebook templates were prepared by Dr. Reinaldo (Rei) Sanchez-Arias.