Self-Paced Machine Learning Tutorials

tpike3/ml_training-1

AI/ML Training

A carefully curated repository of Artificial Intelligence (AI), Machine Learning (ML), and Data Science (DS) literature, with practical implementations in Jupyter Notebook "skill-builders".


Project Overview

The Data Science Boot Camp (DSBC) team has spent years developing tools and training materials for Applied Math and Data Science. This project contains the following tools:

  • "5 Questions a Data Scientist Should Ask Their Customer" is a one-page list of questions any data scientist should have with them to define the problem they are trying to solve and set expectations with their customer.
  • "Notes on ..." are cheat sheets on math topics for Machine Learning (e.g. algebra, probability theory) and on Machine Learning topics (e.g. linear regression, hypothesis testing).
  • "Machine Learning flow-charts" help you navigate the enormous number of algorithms and select which algorithm or plot you need, based on the data you have and the outcome or story you want to tell.
  • "Cheat-Sheets" cover various Python toolboxes, e.g. NumPy, Matplotlib, scikit-learn, Keras, etc.
  • "10 steps to Data Science" is a series of notebooks to teach you the most common tools used in Data Science.
  • "10 steps to Machine Learning" is a series of notebooks to teach you some advanced tools used in Machine Learning.
  • "Python in 2 days" is a series of notebooks to get you started in Jupyter notebooks and Python.
  • "Machine Learning in 1 day" is a series of notebooks focused on a basic toolbox for Machine Learning.
  • "Deep Learning in 1 day" is a series of notebooks focused on advanced CNNs, RNNs, GANs and Transformers.
  • "Examples" is a series of notebooks that most folks will find useful for a variety of real-life applications of Data Science.

The DSBC Approach

The DSBC team teaches AI/ML, and more broadly Data Science, with two approaches:

"Top-Down" Approach

In the Top-Down approach, you don't need to know the math or be a deep expert in Python. We will teach you the tools, using industry best practices and rules of thumb, so that you can be a solid contributing member of a Data Science team.

To get started, we recommend the following self-paced tutorials performed in order:

  1. "Python in 2 days".
  2. "Machine Learning in 1 day".
  3. "Deep Learning in 1 day".

"Bottom-Up" Approach

In the Bottom-Up approach, we assume that you already have a solid foundation in math and/or computer science. We will teach you the algorithms, including how to manipulate and optimize them for your application. With these skills, you can be a technical leader who enables the full potential of a Data Science team.

To get started, we recommend that you review the following:

  1. ML technical notes.
  2. "Math Refresher for Machine Learning".
  3. "Machine Learning: A Conceptual Approach".

New to Python and Jupyter Notebooks?

Have you heard about Jupyter Notebooks, but don't know how to get started? Here is a quick tutorial.

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Acknowledgments

This repo was built using material from our private industry and academic experience, as well as material borrowed from:

  • UCLA ECE 239AS.
  • UPenn CIS 229.
  • UPenn CIS 520.
  • Stanford CS 229.
  • Python Data Science Handbook.
  • Machine Learning Mastery.
  • Towards Data Science.
  • Randy Olson's data analysis and machine learning projects.
  • Many thanks to Andreas Mueller for some of his examples in the Machine Learning section. We drew inspiration from several of his excellent examples.
  • Many thanks to Kaggle for the datasets.
  • Numerous others that we cannot remember.

