Skip to content

🧠 My personal Knowledge Base. Check out the wiki as well!

Notifications You must be signed in to change notification settings

smoens/knowledge-base

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

13 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“š Resource list

Legend:

symbol description
* to explore
πŸ‘¨β€πŸ”¬ experimented
πŸ•΅πŸΌβ€β™€οΈ actively exploring
πŸŽ“ certificate obtained
βœ”οΈ confident in using

For more domain-specific resources e.g. mobility, retail... it might be interesting to create a separate list

☁️ Cloud Computing

Applications

Amazon Web Services

Google Cloud Platform *

Microsoft Azure * ai-900 (2021)πŸŽ“

Snowflake

πŸ› οΈ Data Engineering

Concepts

Data Warehouse

  • Data Vault
  • Kimball
  • Inmon

Data Lake

Data Lakehouse

Data Mesh

Applications

IBM DataStage βœ”οΈ

Azure Data Factory

dbt (data build tool) *

πŸ“Š Data Visualisation

Applications

Amazon Quicksight *

  • https://aws.amazon.com/quicksight/
  • Currently I don't know how Amazon Quicksight differentiates itself from other data visualisation applications (aside from its pay-per-session model) because I haven't tested it out yet. Just like with Google Looker I'm however excited about how cloud providers are adding data visualisation solutions to their stack, and how this might change and improve the way we analyse data. The problem I currently have with solutions like Tableau and Power BI, is how scalable these solutions are in terms of creation and distribution of datasets and data governance. With the proliferation of custom data models it might be difficult to maintain an overview. It might be the case that these integrated visualisation solutions that are getting closer to the actual database, might provide some advantages in terms of data governance and reduce data model duplication.

Dash *

D3js πŸ‘¨β€πŸ”¬ *

  • https://d3js.org/
  • The most stunning visualisations I've ever seen, were mostly made with this Javascript library. With the presence of Observable notebooks it's gotten even easier to get started with the tool, but I still haven't succeeded in moving beyond some beginner charts. It takes more time to create a simple barchart than the plug-and-play alternatives in other tools, but the amount of flexibility and the beauty can make it definitely worthwhile. This is one of the tools that's the highest on my "wanting to master"-list.

Google Looker *

  • https://looker.com/

  • There are already a lot of great visualisation tools out there, but of course there's always room for improvement. It looks like Google Looker has a more backend driven and robust approach to data visualisation where the Don't Repeat Yourself principle isn't violated (at least from the database modeling side), contrary to tools like Tableau and Power BI. Tableau and Power BI both have data prep tools as well, but why would one need to have those data prep tools if the work is already done during the data engineering process (aside perhaps from some additional data cleaning work). From demos it seems one can just select import model from database making the model directly available for the visualisation

    What I currently don't like about Google Looker is that I can't start just testing it out straight away. With other environments or tools at a click of a button you can start playing around straight away, for Looker I need to wait for a team to contact me. This takes the fun out of wanting to test something out straight away and adds unnecessary friction to get started

    • Google Looker message stating that I need to wait for a Looker team

Grafana

  • https://grafana.com/
  • I know nothing about this one. A team at the company I work uses this tool for data monitoring, but haven't tried it out myself

Power BI βœ”οΈ

  • https://powerbi.microsoft.com/en-us/
  • Kudos to Microsoft and how they made Power BI one of the most widely adopted visualisation tools at the moment. By making the tool readily available to just about everyone, they've managed to decrease the distance between data and business users. I think in terms of look and feel it isn't as stunning as Tableau, but for people already familiar with the Microsoft stack, in particular Excel, the transition to this application and the additional insight into data it can provide is just what they need.
  • other resources

RShiny πŸ‘¨β€πŸ”¬ *

  • https://shiny.rstudio.com/
  • When you're already working in R for data analytics, this is a great tool to translate your data to a web application.

Streamlit

Tableau βœ”οΈ

  • https://www.tableau.com/
  • I've already played around and created dashboards in Tableau. I think with Tableau the beauty is in the details and in their beautiful visualisation library and user experience design. I think however for a lot of use cases it's not worth the additional cost and for a lot of business cases Power BI will be just as convenient, easy to use and more tailored to self-service than a tool like Tableau

πŸ”— Data Warehousing

πŸ€– Machine Learning

Applications

DVC data version control *

Keras deep learning πŸ‘¨β€πŸ”¬

Kubeflow machine learning workflow

The Kubeflow project is dedicated to making deployments of machine learning (ML) workflows on Kubernetes simple, portable and scalable.

MLFlow machine learning workflow

An open source platform for the machine learning lifecycle

Tensorflow deep learning

πŸ‘©β€πŸ’» Software Development

General learning resources for programming

Hackerrank

Linux

πŸ“™ Books

Machine Learning

Data Warehousing

The Data Warehouse Toolkit by Ralph Kimball πŸ•΅πŸΌβ€β™€οΈ


🦸 People

Cassie Kozyrkov data science decision scientist

Eugene Yan machine learning career

Susan Shu Chang machine learning career

Martin Fowler software development architecture

Maggie Appleton digital garden design

Andy Matuschak digital garden evergreen notes

  • https://notes.andymatuschak.org/About_these_notes
  • His website has introduced me to the concept of evergreen notes and has changed the way I think about structuring information. I also love to read about his views on non-linear note-taking and learning.

Azlen Elza digital garden evergreen notes

About

🧠 My personal Knowledge Base. Check out the wiki as well!

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published