These are resources from the class.
Slides
Note: Slides are provided under Creative Commons 4.0 Share-Alike. You are encouraged to make comments on the slides so that they can be improved.
- Block 1: Scoping and EDA Slides
- Block 2: Modelling Slides
- Block 3: Visualisation Slides
- Block 4: Putting It All Together Slides
Block 1: Scoping and EDA
- Data Science for Social Good: Hitchhikers Guide - Data Exploration URL
- Ideas for Data Science Project Scoping URL
- Sample Sell Phone EDA Notebooks URL
Block 2: Modelling
- DSSG Hitchhiker’s Guide - Programming Best Practices URL
- DC-Check: Data-Centric AI Checklist URL
- Ten Simple Rules for Reproducible Computational Research URL
- CRediT Author Statement URL
- Introduction to Git (Microsoft Learn) URL
- VS Code + Jupyter Notebooks URL
Block 3: Visualisation and Deployment
Plotting Libraries (Python)
Deployment Frameworks
Online Tools
Block 4: Putting It All Together
Model Documentation
-
Model Cards for Model Reporting Paper Examples - Model Cards Writing Tool (Hugging Face) URL
- Model Card Toolkit URL
Dataset Documentation
PCS Framework
- Veridical Data Science (Video) URL
Video
- Lessons learned from practicing and teaching data science in Latin America - John Alexis Guerra Gómez URL
- Git & GitHub Crash Course For Beginners URL
Presentations/Readings from other Researchers
- “How to do good research, get it published in SIGKDD and get it cited!”, Eamonn Keogh, SIGKDD 2009 Tutorial. URL
- Heuristics for Scientific Writing (a Machine Learning Perspective) - Zachary C. Lipton URL
Tech Resources
- Github Student Pack URL
- Cookie Cutter Data Science URL
- Google Colab URL
- Python examples for The Art of Data Science URL
Other courses
- The Missing Semester of Your CS Education URL