A carefully-curated repository of Artificial Intelligence (AI), Machine Learning (ML), and Data Science (DS) literature with practical implementations using Jupyter Notebook "skill-builders".
The Data Science Boot Camp (DSBC) team has spent years developing tools and training materials for Applied Math and Data Science. In this project you will find the following tools.
- "5 Questions a Data Scientist Should Ask Their Customer" is a one-page list of questions any data scientist should have with them to define the problem they are trying to solve and set expectations with their customer.
- "Notes on ..." these are cheat sheets on various topics of math for Machine Learning, e.g. algebra, probability theory, etc., as well as Machine Learning topics, e.g. linear regression, hypothesis testing, etc.
- "Machine Learning flow-charts" will help you navigate the enormous number of algorithms. These will help you select what algorithm or plot you need to complete your task, based on the data that you have and the desired outcome or story you are trying to tell.
- "Cheat-Sheets" are for various Python toolboxes, e.g. Numpy, MatPlotLib, SciKitLearn, Keras, etc.
- "10 steps to Data Science" is a series of notebooks to teach you the most common tools used in Data Science.
- "10 steps to Machine Learning" is a series of notebooks to teach you some advanced tools used in Machine Learning.
- "Python in 2 days" is a series of notebooks to get you started in Juypter notebooks and Python.
- "Machine Learning in 1 day" is a series of notebooks focused on a basic toolbox for Machine Learning.
- "Deep Learning in 1 day" is a series of notebooks focused on advanced CNNs, RNNs, GANs and Transformers.
- "Examples" is a series of notebooks that most folks will find useful for a variety of real-life applications of Data Science.
In the Top-Down approach, you don't need to know the math, or be a deep expert in Python. We will teach you the tools, using industry best practices and rules-of-thumb, so that you will be a solid contributing member of a Data Science team.
To get started, we recommend the following self-paced tutorials performed in order:
In the Bottom-Up approach, we assume that you already have a solid foundation in math and/or computer science. We will teach you the algorithms, both how to manipulate and optimize them for your application. With these powerful skills, you will be a technical leader enabling the full potential of a Data Science team.
To get started, we recommend that you review the following:
- ML technical notes.
- "Math Refresher for Machine Learning".
- "Machine Learning: A Conceptual Approach".
Have you heard about Jupyter Notebooks, but don't know how to get started? Here is a quick tutorial.
This project is licensed under the MIT License - see the LICENSE.md file for details.
This repo was built using material from our private industry and academic experience, as well as material borrowed from:
- UCLA ECE 239AS.
- UPenn CIS 229.
- UPenn CIS 520.
- Stanford CS 229.
- Python Data Science Handbook.
- Machine Learning Mastery.
- Towards Data Science.
- Randy Olson's data analysis and machine learning projects.
- Many thanks to Andreas Mueller for some of his examples in the Machine Learning section. We drew inspiration from several of his excellent examples.
- Many thanks to Kaggle for the datasets.
- Numerous others that we cannot remember.