Pandas for Everyone

Pandas for Everyone
Setup
- Install seaborn for plotting
- Install all the packages used in the book
  - (Optional) Create a Virtual Environment
  - Install the packages
Teaching Slides
- No Powerpoint (.ppt/.odp)
Data
Links to teaching sessions
Other random goodies

Pandas for Everyone

Repository to accompany "Pandas for Everyone".

If you have gone through the book, an Amazon review would be much appreciated! My mom would too :)

Setup

The easiest way to get everything you need to the tutorial is to install anaconda

You can download and install it here: https://www.continuum.io/downloads

To download just the data, see the Data section below. Otherwise you can choose to clone this repository, or click the "Clone or Download" link above and clicking Download Zip

Install seaborn for plotting

conda install seaborn

Install all the packages used in the book

There is an error in the preface of the book for installing packages. I am leaving this section here in the README to have an updated list of packages and installation instructions

(Optional) Create a Virtual Environment

You can choose to create a virtual envirionment for the packages used in the book, so it doesn't clash with other packages you plan to use later on.

# create a virtual environment named "book" using python 3.6
conda create -n book python=3.6

# activate the environment
# so all installed packages will go in there and not mess up your base python environment
source activate book

Install the packages

Whether you decited to create a virtual environment or not, you can install the packages with the below commands. If you did use virtual environments, remember to source activate book before you follow along with the book so the packages you installed can be loaded.

conda install pandas xlwt openpyxl seaborn numpy ipython jupyter statsmodels scikit-learn regex wget odo numba
conda install -c conda-forge pweave # you don't really need this package, it was used to build and create the book
conda install -c conda-forge feather-format
pip install lifelines pandas-datareader

Teaching Slides

For those instructors who are using the teaching slide deck version of the book. Each chapter is split into it's own slide deck. There are multiple versions for each chapter.

Jupyter notebook (ipynb)
PDF
HTML

The slides are created using Damian Avila's RISE Jupyter/IPython Slideshow Extension. Thus, you can choose to install the RISE extension and live render and display the Jupyter notebooks (ipynb). Since each chapter is a Jupyter notebook at heart, the conversions to PDF and HTML are performed using

jupyter nbconvert --to slides your_talk.ipynb --post serve

More about useage ange converting to the PDF can be found on the RISE documentation page on useage.

No Powerpoint (.ppt/.odp)

RISE's back end uses reveal.js. Unfortunately there is no way to go from a reveal.js presentation to powerpoint. Having said that, if there's a way we can jerry-rig something together using the the given capabilties of RISE and reveal.js please let me know.

Data

You can choose to just download the datasets by using Minhas Kamal's DownGit by clicking the link here

Ongoing list of data references:

Gapminder: https://github.com/jennybc/gapminder/
Survey: Comes from the Software-Carpentry SQL lesson
Ebola: www.github.com/cmrivers/ebola

Links to teaching sessions

I've taught out of the book while I was writing it. Here you can find the various tutorials and workshops I've taught (pre and post when the book was officially published). You can also checkout my talks page for other things not completely on Pandas.

Tables	URL	Video
Online Live Training	https://github.com/chendaniely/2017-12-04-pandas_live, https://github.com/chendaniely/2018-05-pandas_live, https://github.com/chendaniely/2018-06-pandas_live
Whirlwind tour of Python	https://github.com/chendaniely/2017-10-26-python_crash_course
SciPy 2017 Pandas Tutorial	https://github.com/chendaniely/scipy-2017-tutorial-pandas	https://www.youtube.com/watch?v=oGzU688xCUs
PyData Carolinas 2016 Tutorial	https://github.com/chendaniely/2016-pydata-carolinas-pandas	https://www.youtube.com/watch?v=dye7rDktJ2E

Other random goodies

pandas .head() to .tail() (Beginner)
- SciPy 2018 Tutorial
- Dillon Niederhut, Tom Augspurger, Joris Van den Bossche
- https://www.youtube.com/watch?v=lkLl_QKLgcA
Andreas C. Müller - Lecturer in Data Science Courses COMS W4995 Applied Machine Learning :
- https://amueller.github.io/COMS4995-s18/
- http://www.cs.columbia.edu/~amueller/comsw4995s18/schedule/
Scipy 2017 list of tutorial/talks/links: https://github.com/chendaniely/scipy_2017_notes
Titus Brown's "Data Intensive Biology" Training page: http://dib-training.readthedocs.io/en/pub/

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
data		data
misc/giveaway/scipy18		misc/giveaway/scipy18
notebooks		notebooks
training		training
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pandas for Everyone

Setup

Install seaborn for plotting

Install all the packages used in the book

(Optional) Create a Virtual Environment

Install the packages

Teaching Slides

No Powerpoint (.ppt/.odp)

Data

Links to teaching sessions

Other random goodies

About

Releases

Packages

Languages

License

ashishvaishno/pandas_for_everyone

Folders and files

Latest commit

History

Repository files navigation

Pandas for Everyone

Setup

Install seaborn for plotting

Install all the packages used in the book

(Optional) Create a Virtual Environment

Install the packages

Teaching Slides

No Powerpoint (.ppt/.odp)

Data

Links to teaching sessions

Other random goodies

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages