Python for Data Science
This badge is earned by successfully completing all of the courses in the Python for Data Science path at Maven Analytics.
PATH HOURS: 76.0
PATH DESCRIPTION:
This path is for data professionals looking to build job-ready data science & ML skills with Python.
We'll start by mastering the foundations of data prep & EDA, including scoping projects, gathering & cleaning data, performing exploratory data analysis, and preparing the data for modeling.
Next we'll dive into Regression Analysis, a popular supervised learning technique for making predictions with numerical data. We'll introduce simple & multiple linear regression, review key model assumptions, and walk through the steps for testing and validating your models. We'll also cover multiple techniques for regularized regression and time series analysis, including ridge & lasso regression, moving averages, decomposition, and more.
From there we'll explore Classification Modeling, another supervised learning technique for making predictions with categorical data. We'll build k-nearest neighbors and logistic regression models, review evaluation metrics like accuracy, precision & recall, then explore methods for working with imbalanced data. We'll then dive into decision trees and ensemble models, including random forests & gradient boosting.
Last but not least we'll cover Unsupervised Learning, a popular approach for discovering hidden patterns & relationships in data. We'll use clustering algorithms for segmentation & anomaly detection, and then leverage dimensionality reduction algorithms for visualizing complex data, identifying clusters, and building recommendation engines.
This path is designed to help you learn job-ready skills, solve real business problems, and build a project portfolio to showcase your skills to peers and employers.
Skills / Knowledge
- Python
- Pandas
- Data Visualization
- Jupyter
- data science
- scikit-learn
- machine learning