The Data Science Piscine offers a comprehensive learning experience designed to equip participants with essential skills in data science through a series of hands-on modules.

mbrettsc/Data-Science

Data Science Piscine

Welcome to the Data Science Piscine! This repository includes a series of training modules designed to enhance your skills in data science through hands-on exercises. Each module covers a unique aspect of the field, from foundational concepts to advanced predictive modeling techniques.

Contents

  • datascience-0: Create a PostgreSQL database and practice cleaning and preparing data for analysis.
  • datascience-1: Build a data warehouse using the ETL (Extract, Transform, Load) process, with practice in data integration, organization, and management.
  • datascience-2: Explore data visualization techniques and the data analyst's role in interpreting charts and making informed decisions from them.
  • datascience-3: Get to know the data at hand through visualizations, correlation analysis, standardization, normalization, and dataset splitting, all in preparation for predictive modeling.
  • datascience-4: Delve into predictive modeling with confusion matrices, heatmaps, variance calculations, feature selection, decision trees, KNN (k-nearest neighbors), and voting classifiers, while following the module's guidelines for software setup and collaborative submission.
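
To give a feel for the preprocessing and evaluation steps named in datascience-3 and datascience-4, here is a minimal sketch in plain Python. It is not taken from the modules themselves: the function names (standardize, normalize, train_test_split, confusion_matrix) and their parameters are illustrative assumptions, and the actual exercises may well rely on libraries instead of hand-written helpers.

```python
import random
from statistics import mean, pstdev

# Illustrative helpers only; names and signatures are assumptions,
# not code from the repository's modules.


def standardize(values):
    """Rescale values to zero mean and unit (population) standard deviation."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]  # constant column: nothing to scale
    return [(v - mu) / sigma for v in values]


def normalize(values):
    """Min-max scale values into the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant column: map everything to 0
    return [(v - lo) / (hi - lo) for v in values]


def train_test_split(rows, test_ratio=0.2, seed=42):
    """Shuffle a copy of rows and split it into (train, test) lists."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # seeded for reproducibility
    n_test = int(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


def confusion_matrix(actual, predicted, positive=1):
    """Count true/false positives and negatives for a binary classifier."""
    counts = {"tp": 0, "fp": 0, "fn": 0, "tn": 0}
    for a, p in zip(actual, predicted):
        if p == positive:
            counts["tp" if a == positive else "fp"] += 1
        else:
            counts["fn" if a == positive else "tn"] += 1
    return counts


if __name__ == "__main__":
    print(normalize([2, 4, 6]))
    train, test = train_test_split(range(10))
    print(len(train), len(test))
    print(confusion_matrix([1, 0, 1, 0], [1, 1, 0, 0]))
```

Standardization and min-max normalization are kept as separate functions because they answer different needs: the first centers a feature around zero, the second bounds it to a fixed range. The split is seeded so that repeated runs produce the same train and test sets.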

Getting Started

  1. Clone the Repository: git clone https://github.com/mbrettsc/Data-Science
  2. Navigate to the Module Directory: cd <module-directory>
  3. Follow the Instructions: Each module contains its own set of instructions and exercises. Follow them to complete the tasks.

Contributing

Contributions to improve or extend the modules are welcome. Please fork the repository, make your changes, and submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contact

For any questions or issues, please open an issue on the repository or contact the maintainers directly.

Happy coding!
