Data Preparation Workshop
- Foundation
- 1 hour
Material prepared for Purdue's Women in HPC (WHPC) "Data 101 for Machine Learning" workshop. This workshop is a beginner-level discussion about data, focusing on how data fits into machine learning and data science workflows.
Topics:
- Types of data
- Data collection
- Basic data processing with Pandas
- Considerations for exploratory data analyses
Basic Pandas code, data, and a PDF of the presentation can be found in this repository.
Code, presentation, and data are from Sarah Rodenbeck's GitHub repository.
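
For a sense of what "basic data processing with Pandas" can look like, here is a minimal sketch. It is not taken from the workshop materials: the toy table and its column names (`participant`, `age`, `track`) are made up for illustration. It shows three common first steps: dropping duplicate rows, filling a missing value, and summarizing by group.

```python
import pandas as pd

# Hypothetical toy data; these rows and column names are assumptions
# for illustration only, not data from the workshop repository.
df = pd.DataFrame(
    {
        "participant": ["P1", "P2", "P3", "P4", "P1"],
        "age": [36, None, 41, 29, 36],
        "track": ["math", "cs", "math", "cs", "math"],
    }
)

# Step 1: remove exact duplicate rows (the last row repeats the first).
deduped = df.drop_duplicates()

# Step 2: fill the missing age with the median of the observed ages.
# Median imputation is just one simple choice; the right strategy
# depends on the data and the question being asked.
median_age = deduped["age"].median()
cleaned = deduped.assign(age=deduped["age"].fillna(median_age))

# Step 3: a first exploratory summary, the mean age per track.
mean_age_by_track = cleaned.groupby("track")["age"].mean()

print(cleaned)
print(mean_age_by_track)
```

Each step returns a new DataFrame rather than modifying `df` in place, which makes it easier to compare the data before and after each cleaning step.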