Added Text Roadmap: Data Scientists
andkret committed Feb 29, 2024
1 parent 670094d commit 0cad123
Showing 3 changed files with 24 additions and 1 deletion.
Binary file not shown.
Binary file added images/Data-Engineering-Roadmap-for.textClipping
Binary file not shown.
25 changes: 24 additions & 1 deletion sections/01-Introduction.md
@@ -334,12 +334,35 @@ In summary, the journey from understanding basic platform and pipeline concepts
| Live Stream -> Roadmap: Data Engineering for Data Analysts!
|------------------|
|In this live stream I showed step by step how to read this roadmap for Analysts, why I chose these tools, and why I think this is the right way to do it. I also answered many questions from the audience.
| [Watch on YouTube](https://youtube.com/live/w2t6SL5tQG0?feature=share)|
| [Watch on YouTube](https://youtube.com/live/w2t6SL5tQG0)|

### Roadmap for Data Scientists

![Building blocks of your curriculum](/images/Data-Engineering-Roadmap-for-Data-Scientists.jpg)

We’re going to tackle the data engineering roadmap for data scientists. It's a topic a lot of you have been curious about, especially after we explored the data analyst side of things. The goal here is to lay out a step-by-step path for those of you looking to make a pivot or deepen your understanding of data engineering.

The first thing I did was sit down and list out all the courses available in my academy. It’s designed to be super flexible, catering to different job roles. For a data scientist, your journey usually starts with a strong grasp of data science fundamentals, right? You know your way around machine learning, how to preprocess data, and maybe even deploy models on a basic level. But then, the question arises: How do you set up an entire platform or pipeline that takes data from ingestion to a point where it’s usable for others?

Here’s where it gets interesting. I thought about how we could structure this to really benefit data scientists. We start with the basics, like platform and pipeline design, and then move into choosing data storage solutions. That means understanding the differences between types of databases and when to use each one.

But it doesn’t stop there. I’ve included some optional topics, like platform security, because it’s always handy to know, even if you’re not directly responsible for it. And since you’re already familiar with data, why not dive deeper into data modeling? It’s all about making your data work for you in the most efficient way possible.

Now, let's talk about Docker. It lets you package your algorithms together with their dependencies, which makes deploying them far more predictable. After that, mastering API fundamentals and streaming with Apache Kafka will open up new possibilities for your projects.

Depending on your interests or where you see yourself in the future, you might want to explore cloud services like AWS, GCP, or Azure. Or maybe you’re more intrigued by the idea of document streaming and creating user interfaces with MongoDB and Streamlit. The roadmap I’ve laid out includes paths for all these directions.

Monitoring and observability are crucial, too. You’ll want to keep an eye on your algorithms and the data flowing through your systems. Tools like Elasticsearch or InfluxDB paired with Grafana can give you those insights.

And don’t forget about orchestration with Airflow. It lets you schedule your workflows and keep them organized as they grow.

So, this roadmap is more than just a list of topics. It’s about building a foundation that lets you, as a data scientist, expand into data engineering seamlessly. It’s about understanding the ecosystem around your data and how to leverage it to build robust, scalable solutions.

| Live Stream -> Roadmap: Data Engineering for Data Scientists!
|------------------|
|In this live stream you'll find even more details on how to read this roadmap for Data Scientists, why I chose these tools, and why I think this is the right way to do it. I also answered many questions from the audience.
| [Watch on YouTube](https://youtube.com/live/fusLAtA1Eu4)|

### Roadmap for Software Engineers

![Building blocks of your curriculum](/images/Data-Engineering-Roadmap-for-Software-Engineers.jpg)
