π Resource list
Legend:
symbol | description |
---|---|
* |
to explore |
π¨βπ¬ | experimented |
π΅πΌββοΈ | actively exploring |
π | certificate obtained |
βοΈ | confident in using |
For more domain-specific resources e.g. mobility, retail... it might be interesting to create a separate list
- βοΈ Cloud Computing
- π οΈ Data Engineering
- π Data Visualisation
- π Data Warehousing
- π€ Machine Learning
- π©βπ» Software Development
- General learning resources for programming
- π Books
- π¦Έ People
- https://aws.amazon.com/
- Learning Resources
- Certification
- https://cloud.google.com/
- Certification
- https://azure.microsoft.com/en-us/
- Learning Resources
- Certification
- Fundamentals
- Azure AI-900 - AI Fundamentals
(2021)π
- I really loved CloudReadySkills' session to get ready for this certification.
- Azure DP-900 - Data Fundamentals
*
- Azure AI-900 - AI Fundamentals
- Associate
- Expert
- Fundamentals
- Data Vault
- Kimball
- Inmon
- https://aws.amazon.com/quicksight/
- Currently I don't know how Amazon Quicksight differentiates itself from other data visualisation applications (aside from its pay-per-session model) because I haven't tested it out yet. Just like with Google Looker I'm however excited about how cloud providers are adding data visualisation solutions to their stack, and how this might change and improve the way we analyse data. The problem I currently have with solutions like Tableau and Power BI, is how scalable these solutions are in terms of creation and distribution of datasets and data governance. With the proliferation of custom data models it might be difficult to maintain an overview. It might be the case that these integrated visualisation solutions that are getting closer to the actual database, might provide some advantages in terms of data governance and reduce data model duplication.
- https://plotly.com/dash/
- When you search for the web application equivalent to RShiny for Python, this is the one that pops up.
- other resources
- https://d3js.org/
- The most stunning visualisations I've ever seen, were mostly made with this Javascript library. With the presence of Observable notebooks it's gotten even easier to get started with the tool, but I still haven't succeeded in moving beyond some beginner charts. It takes more time to create a simple barchart than the plug-and-play alternatives in other tools, but the amount of flexibility and the beauty can make it definitely worthwhile. This is one of the tools that's the highest on my "wanting to master"-list.
-
There are already a lot of great visualisation tools out there, but of course there's always room for improvement. It looks like Google Looker has a more backend driven and robust approach to data visualisation where the Don't Repeat Yourself principle isn't violated (at least from the database modeling side), contrary to tools like Tableau and Power BI. Tableau and Power BI both have data prep tools as well, but why would one need to have those data prep tools if the work is already done during the data engineering process (aside perhaps from some additional data cleaning work). From demos it seems one can just select import model from database making the model directly available for the visualisation
What I currently don't like about Google Looker is that I can't start just testing it out straight away. With other environments or tools at a click of a button you can start playing around straight away, for Looker I need to wait for a team to contact me. This takes the fun out of wanting to test something out straight away and adds unnecessary friction to get started
- https://grafana.com/
- I know nothing about this one. A team at the company I work uses this tool for data monitoring, but haven't tried it out myself
- https://powerbi.microsoft.com/en-us/
- Kudos to Microsoft and how they made Power BI one of the most widely adopted visualisation tools at the moment. By making the tool readily available to just about everyone, they've managed to decrease the distance between data and business users. I think in terms of look and feel it isn't as stunning as Tableau, but for people already familiar with the Microsoft stack, in particular Excel, the transition to this application and the additional insight into data it can provide is just what they need.
- other resources
- https://shiny.rstudio.com/
- When you're already working in R for data analytics, this is a great tool to translate your data to a web application.
- https://streamlit.io/
- As the website says
Streamlit turns data scripts into shareable web apps in minutes. All in Python. All for free. No frontβend experience required.
- other resources
- https://www.tableau.com/
- I've already played around and created dashboards in Tableau. I think with Tableau the beauty is in the details and in their beautiful visualisation library and user experience design. I think however for a lot of use cases it's not worth the additional cost and for a lot of business cases Power BI will be just as convenient, easy to use and more tailored to self-service than a tool like Tableau
The Kubeflow project is dedicated to making deployments of machine learning (ML) workflows on Kubernetes simple, portable and scalable.
An open source platform for the machine learning lifecycle
- https://www.hackerrank.com : platform containing exercises for different programming languages
- https://ss64.com/bash/ : command reference for Linux
The Data Warehouse Toolkit by Ralph Kimball π΅πΌββοΈ
- https://eugeneyan.com/
- Great posts about machine learning and career advice
- https://github.com/eugeneyan
- At Eugene Yan's github page you can find some great lists about papers on real-world machine learning, machine learning advances...
- His lists have inspired me be more
- https://maggieappleton.com/
- https://github.com/maggieappleton/
- The most beautiful example that I've seen so far with regard to Digital Gardens. She has her own curated list of digital gardens as well
- https://notes.andymatuschak.org/About_these_notes
- His website has introduced me to the concept of evergreen notes and has changed the way I think about structuring information. I also love to read about his views on non-linear note-taking and learning.