GA Seattle DSI
Introduction
i. Students
a: Onboarding
1: Installfest
2: Technical Guide
3: Precourse Onboarding Tasks
b: Resources
ii. Projects
a: Weekly Projects
1: SAT Scores + Summary Statistics
2: Billboard Hits + Data Munging
3: Liquor Sales + Linear Regression
4: Web Scraping + Logistic Regression
5: Disaster Relief + Classification
6: IMDB API + Random Forests
7: Airport Delays + Cluster Analysis
b: Capstone Project
1: Capstone Topic + Dataset Validation
2: Problem Statement + EDA
3: Progress Report + Preliminary Findings
4: Report Writeup + Technical Analysis
5: Presentation + Recommendations
1. Python
1.1a Data Science Overview
1.1b Command Line
1.2 Git
1.3 Python pt.1
1.4 Python pt.2
2.1 Python pt.3
2.2 Lab
2.3 Python pt.4
2.4 Lab
3.1 Programming Fundamentals
3.2 iPython, Data Values, and CSV
3.3 Intro to NumPy
3.4 Lab
4.1 Math Primer pt. 2
4.2 Lab
4.3 Lab
4.4 Data Viz Principles
5.1 Plotting Tools
5.2 Lab
5.3 Python List Comprehensions
2. Pandas / Data Analysis
1.1 Pandas pt.1
1.2 Study Design
1.3 Stats 101
1.4 Lab
2.1 Pandas pt.2
2.2 Lab
2.3 Data Cleaning
2.4 Lab
3.1 Pandas pt.3
3.2 Lab
3.3 Categorical & Dummy Variables
3.4 Lab
4.1 SciPy
4.2 Lab
4.3 Joins and Pandas
4.4 Lab
5.1 Plotting and Pandas
5.2 Lab
3. Linear Regression / Stats Models
1.1 Modeling Linear Regression
1.2 Lab
1.3 Stats Models & SKLearn
1.4 Lab
2.1 Bias Variance Tradeoff
2.2 Lab
2.3 Regularization and Overfitting
2.4 Lab
3.1 Regression Metrics & Loss Functions
3.2 Train-Test Split
3.3 Lab
3.4 Lab
4.1 Gradient Descent
4.2 Feature Scaling
4.3 Study Design
5.1 Stakeholder Analysis
5.2 Lab
4. Logistic Regression
1.1 Classification
1.2 Web Scraping
1.3 Lab
1.4 Lab
2.1 Logistic Regression
2.2 Lab
2.3 Model Fit
2.4 Lab
3.1 Visualizing Classification Models
3.2 Lab
4.1 Advanced Model Evaluation
4.2 Lab
4.3 Lab
5.1 Communicating Results
5.2 Lab
5. Classification / Databases
1.1 Different Databases
1.2 SQL pt.1
1.3 Lab
1.4 Lab
2.1 Classification Case Studies
2.2 Pipelines and Custom Transformers in SKLearn
2.3 Lab
2.4 Lab
3.1 SQL pt.2
3.2 Lab
3.3 Lab
4.1 SQL Feature Selection
4.2 Lab
5.1 Support Vector Machines
5.2 Lab
6. Trees & Ensemble Methods
1.1 Classification and Regression Trees (CARTs)
1.2 Lab
1.3 Servers, JSON, & APIs
1.4 Lab
2.1 SQL Joins
2.2 Lab
2.3 Ensemble Methods - Decision Trees and Bagging
2.4 Lab
3.1 Ensemble Methods - Random Forests and Boosting
3.2 Lab
3.3 Model Evaluation & Feature Importance
3.4 Lab
4.1 Feature Extraction from Text
4.2 Lab
5.2 Communicating results
7. Unsupervised Learning
1.1 Intro to Clustering
1.2 Lab
1.3 Tuning Clusters
1.4 Advanced Database Skills
2.1 Dimensionality Reduction
2.2 Intro to Principal Component Analysis
2.3 Lab
2.4 Lab
3.1 Hierarchical Clustering
3.2 Hierarchical Clustering
3.3 Lab
3.4 Unsupervised Learning Trends
4.2 DBSCAN
4.3 Lab
5.1 Unsupervised Learning Case Study
8. Bayesian Inference
1.1 Bayes
1.2 Lab
1.3 Bayes Deep Dive
1.4 Lab
2.1 Lab
2.2 Lab
2.3 Lab
2.4 Lab
3.1 API Review
3.2 Lab
3.3 LDA
3.4 Lab
4.1 Bayesian Stat Testing
4.2 Lab
4.3 Naive Bayes
4.4 Lab
5.3 Communicating Bayesian Results
9. Time Series Data
1.1 Github for Teams
1.2 Lab
1.3 Pandas & Time Series Data
1.4 Lab
2.1 Lab
2.2 Autocorrelation & Time Series Data
2.3 Lab
3.1 ARIMA Model pt.1
3.2 Lab
4.1 ARIMA Model pt.2
4.2 Lab
5.1 Time Series Analysis Recap
5.2 Lab
10. Intro to Big Data
1.1 Intro to Big Data
1.2 Lab
1.3 Lab
1.4 Lab
2.1 AWS Amazon Web Services
2.2 AWS Elastic Map Reduce
2.3 Lab
2.4 Big Data Case Studies
3.1 Spark Overview
3.2 Lab
3.3 Lab
3.4 Spark Case Studies
4.1 Database Design Case Study
5.1 Big Data Review Case Study
11. Advanced Topics & Interviews
1.1 Intro to A/B Testing
1.2 Lab
3.1 Introductory Career Tips
3.2 Lab
4.1 Interview Prep
4.2 Lab
5.1 Data Science Portfolios
Powered by
GitBook
1. Python
DSI - Week 1
This is a summary of the materials provided for Week 1 of the Data Science Immersive.
Week 1: Programming Fundamentals
Session Time
Day 1
Day 2
Day 3
Day 4
Day 5
9 - 10
+Welcome to Data Science!
+Morning Exercise
#Outcomes
+Morning Exercise
Reflection
10 - 11:30
Command Line
Python Control Flow
Programming Fundamentals
Arrays & Functions
Plotting tools intro
11:30 - 1
Intro to Git
Lab: Python function practice
Notebooks & CSV Files
Lab: NumPy
Lab: Plotting
2 - 3:30
Python Data Types
Python Iteration
Intro to NumPy
Lab: Stats practice with Python
+Instructor choice
3:30 - 5
Python Collections
Lab: Python with GitHub
Lab: Datasets and NumPy
Dataviz Principles
Project 1: Workshop
results matching "
"
No results matching "
"