Data Preparation Workshop

  • Foundation
  • 1 hour
Material prepared for Purdue's Women in HPC (WHPC) "Data 101 for Machine Learning" workshop. This workshop is a beginner-level discussion about data, focusing on how data fits into machine learning and data science workflows.

Topics:
  • Types of data
  • Data collection
  • Basic data processing with Pandas
  • Considerations for exploratory data analyses

Basic Pandas code, data, and a PDF of the presentation can be found in this repository. Code, presentation, and data are from Sarah Rodenbeck's GitHub repository.
  1. Intro to Data and Data Processing
    PDF 6.82 MB
  2. Data 101 for Machine Learning
    Jupyter Notebook
  3. Data 101 for Machine Learning Workshop Information
    Reading 15 seconds