GA Seattle DSI
Introduction
i. Students
a: Onboarding
1: Installfest
2: Technical Guide
3: Precourse Onboarding Tasks
b: Resources
ii. Projects
a: Weekly Projects
1: SAT Scores + Summary Statistics
2: Billboard Hits + Data Munging
3: Liquor Sales + Linear Regression
4: Web Scraping + Logistic Regression
5: Disaster Relief + Classification
6: IMDB API + Random Forests
7: Airport Delays + Cluster Analysis
b: Capstone Project
1: Capstone Topic + Dataset Validation
2: Problem Statement + EDA
3: Progress Report + Preliminary Findings
4: Report Writeup + Technical Analysis
5: Presentation + Recommendations
1. Python
1.1a Data Science Overview
1.1b Command Line
1.2 Git
1.3 Python pt.1
1.4 Python pt.2
2.1 Python pt.3
2.2 Lab
2.3 Python pt.4
2.4 Lab
3.1 Programming Fundamentals
3.2 iPython, Data Values, and CSV
3.3 Intro to NumPy
3.4 Lab
4.1 Math Primer pt. 2
4.2 Lab
4.3 Lab
4.4 Data Viz Principles
5.1 Plotting Tools
5.2 Lab
5.3 Python List Comprehensions
2. Pandas / Data Analysis
1.1 Pandas pt.1
1.2 Study Design
1.3 Stats 101
1.4 Lab
2.1 Pandas pt.2
2.2 Lab
2.3 Data Cleaning
2.4 Lab
3.1 Pandas pt.3
3.2 Lab
3.3 Categorical & Dummy Variables
3.4 Lab
4.1 SciPy
4.2 Lab
4.3 Joins and Pandas
4.4 Lab
5.1 Plotting and Pandas
5.2 Lab
3. Linear Regression / Stats Models
1.1 Modeling Linear Regression
1.2 Lab
1.3 Stats Models & SKLearn
1.4 Lab
2.1 Bias Variance Tradeoff
2.2 Lab
2.3 Regularization and Overfitting
2.4 Lab
3.1 Regression Metrics & Loss Functions
3.2 Train-Test Split
3.3 Lab
3.4 Lab
4.1 Gradient Descent
4.2 Feature Scaling
4.3 Study Design
5.1 Stakeholder Analysis
5.2 Lab
4. Logistic Regression
1.1 Classification
1.2 Web Scraping
1.3 Lab
1.4 Lab
2.1 Logistic Regression
2.2 Lab
2.3 Model Fit
2.4 Lab
3.1 Visualizing Classification Models
3.2 Lab
4.1 Advanced Model Evaluation
4.2 Lab
4.3 Lab
5.1 Communicating Results
5.2 Lab
5. Classification / Databases
1.1 Different Databases
1.2 SQL pt.1
1.3 Lab
1.4 Lab
2.1 Classification Case Studies
2.2 Pipelines and Custom Transformers in SKLearn
2.3 Lab
2.4 Lab
3.1 SQL pt.2
3.2 Lab
3.3 Lab
4.1 SQL Feature Selection
4.2 Lab
5.1 Support Vector Machines
5.2 Lab
6. Trees & Ensemble Methods
1.1 Classification and Regression Trees (CARTs)
1.2 Lab
1.3 Servers, JSON, & APIs
1.4 Lab
2.1 SQL Joins
2.2 Lab
2.3 Ensemble Methods - Decision Trees and Bagging
2.4 Lab
3.1 Ensemble Methods - Random Forests and Boosting
3.2 Lab
3.3 Model Evaluation & Feature Importance
3.4 Lab
4.1 Feature Extraction from Text
4.2 Lab
5.2 Communicating results
7. Unsupervised Learning
1.1 Intro to Clustering
1.2 Lab
1.3 Tuning Clusters
1.4 Advanced Database Skills
2.1 Dimensionality Reduction
2.2 Intro to Principal Component Analysis
2.3 Lab
2.4 Lab
3.1 Hierarchical Clustering
3.2 Hierarchical Clustering
3.3 Lab
3.4 Unsupervised Learning Trends
4.2 DBSCAN
4.3 Lab
5.1 Unsupervised Learning Case Study
8. Bayesian Inference
1.1 Bayes
1.2 Lab
1.3 Bayes Deep Dive
1.4 Lab
2.1 Lab
2.2 Lab
2.3 Lab
2.4 Lab
3.1 API Review
3.2 Lab
3.3 LDA
3.4 Lab
4.1 Bayesian Stat Testing
4.2 Lab
4.3 Naive Bayes
4.4 Lab
5.3 Communicating Bayesian Results
9. Time Series Data
1.1 Github for Teams
1.2 Lab
1.3 Pandas & Time Series Data
1.4 Lab
2.1 Lab
2.2 Autocorrelation & Time Series Data
2.3 Lab
3.1 ARIMA Model pt.1
3.2 Lab
4.1 ARIMA Model pt.2
4.2 Lab
5.1 Time Series Analysis Recap
5.2 Lab
10. Intro to Big Data
1.1 Intro to Big Data
1.2 Lab
1.3 Lab
1.4 Lab
2.1 AWS Amazon Web Services
2.2 AWS Elastic Map Reduce
2.3 Lab
2.4 Big Data Case Studies
3.1 Spark Overview
3.2 Lab
3.3 Lab
3.4 Spark Case Studies
4.1 Database Design Case Study
5.1 Big Data Review Case Study
11. Advanced Topics & Interviews
1.1 Intro to A/B Testing
1.2 Lab
3.1 Introductory Career Tips
3.2 Lab
4.1 Interview Prep
4.2 Lab
5.1 Data Science Portfolios
Powered by
GitBook
2. Pandas / Data Analysis
DSI - Week 2
This is a summary of the materials provided for Week 2 of the Data Science Immersive.
Week 2: Exploratory Data Analysis & Pandas
Session Time
Day 1
Day 2
Day 3
Day 4
Day 5
9-10
(Project Review)
Morning Exercise
(Outcomes)
Morning Exercise
(Reflection)
10-11:30
Intro to Pandas 1
Intro to Pandas 2
Intro to Pandas 3
Stats Review & Intro to Scipy
Plotting With Pandas
11:30-1
Study Design & Pandas
Pandas Computation Lab
Pandas & Pivot Tables
Scipy & Stat Lab
Pandas, Plotting, & Project 2
2-3:30
Stats 101
Intro to Data Cleaning
Categorical & Dummy Variables
Joins & Pandas
+Instructor Choice
3:30-5
Pandas & Numpy
Data Cleaning Lab
Lambda Functions & Missing Data
Practicing Joins
Project 2: Workshop
results matching "
"
No results matching "
"