Questions, answers, and materials I use to study for data science interviews
Some credits to: this repo
Note: contributions via PRs are welcome! The repo is quite messy at the moment, sorry.
- Used Stratascratch a lot for data analytics interview preparation
- Leetcode - Database
- HackerRank
- Grind75
- Leetcode - Top 150 Questions
- deep-ml
- StatQuest - Machine Learning / Statistics / Deep Learning
- 3b1b - Math for Machine Learning
- Andrej Karpathy - LLM Legend
- Indently - Good coding practices
- Linear Regression
- https://www.analyticsvidhya.com/blog/2021/06/linear-regression-in-machine-learning/
- Used for regression tasks to predict a continuous target variable.
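As a quick refresher, the one-feature case has a closed-form least-squares solution (slope = covariance / variance). A minimal sketch on made-up data, no library assumed:

```python
def fit_linear(xs, ys):
    """Return (slope, intercept) minimizing squared error for 1-D inputs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # slope = covariance(x, y) / variance(x)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    slope = cov / var
    intercept = mean_y - slope * mean_x
    return slope, intercept

slope, intercept = fit_linear([1, 2, 3, 4], [3, 5, 7, 9])  # data lies on y = 2x + 1
```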
- Logistic Regression
- https://www.analyticsvidhya.com/blog/2021/10/building-an-end-to-end-logistic-regression-model/
- Used for binary classification problems.
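The core idea is a sigmoid over a linear score, trained by gradient descent on the log-loss. A toy one-feature sketch (stochastic gradient descent, made-up data):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ys, lr=0.1, epochs=1000):
    """Fit 1-D logistic regression by stochastic gradient descent."""
    w, b = 0.0, 0.0
    for _ in range(epochs):
        for x, y in zip(xs, ys):
            p = sigmoid(w * x + b)
            # gradient of the log-loss w.r.t. w and b is (p - y) * x and (p - y)
            w -= lr * (p - y) * x
            b -= lr * (p - y)
    return w, b

w, b = fit_logistic([0, 1, 2, 3], [0, 0, 1, 1])
# the learned boundary separates the two classes; sigmoid(w*x + b) > 0.5 means class 1
```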
- Decision Trees
- https://www.analyticsvidhya.com/blog/2021/08/decision-tree-algorithm/
- A tree-based model for both classification and regression tasks.
- Random Forest
- https://www.analyticsvidhya.com/blog/2021/06/understanding-random-forest/
- An ensemble method based on multiple decision trees.
- Support Vector Machines (SVM)
- https://www.analyticsvidhya.com/blog/2021/10/support-vector-machinessvm-a-complete-guide-for-beginners/
- Used for classification tasks by finding the optimal hyperplane.
- K-Nearest Neighbors (KNN)
- https://www.analyticsvidhya.com/blog/2018/03/introduction-k-neighbours-algorithm-clustering/
- A simple classification algorithm that assigns a class based on the majority class of nearest neighbors.
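KNN is easy to sketch end-to-end, since there is no training step beyond storing the data. A minimal illustration with Euclidean distance and made-up points:

```python
from collections import Counter

def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among the k nearest training points.
    `train` is a list of ((features...), label) pairs."""
    dist = lambda a, b: sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5
    neighbors = sorted(train, key=lambda pair: dist(pair[0], query))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]

train = [((0, 0), "a"), ((0, 1), "a"), ((1, 0), "a"),
         ((5, 5), "b"), ((5, 6), "b"), ((6, 5), "b")]
knn_predict(train, (0.5, 0.5))  # all three nearest neighbors are labeled "a"
```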
- Naive Bayes
- https://www.analyticsvidhya.com/blog/2017/09/naive-bayes-explained/
- A probabilistic classifier based on Bayes' theorem with strong independence assumptions.
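A sketch of the categorical case: the class score is the log prior plus a sum of per-feature log likelihoods, which is exactly the "naive" independence assumption. Data and feature names below are invented for illustration:

```python
import math
from collections import Counter, defaultdict

def fit_naive_bayes(rows, labels):
    """Categorical naive Bayes with Laplace (+1) smoothing.
    rows: list of feature tuples; labels: parallel list of classes."""
    class_counts = Counter(labels)
    feat_counts = defaultdict(Counter)  # (feature_index, class) -> value counts
    for row, y in zip(rows, labels):
        for i, v in enumerate(row):
            feat_counts[(i, y)][v] += 1

    def predict(row):
        best, best_score = None, float("-inf")
        n = len(labels)
        for y, cy in class_counts.items():
            # log prior + sum of smoothed log likelihoods
            score = math.log(cy / n)
            for i, v in enumerate(row):
                vocab = len({r[i] for r in rows})
                score += math.log((feat_counts[(i, y)][v] + 1) / (cy + vocab))
            if score > best_score:
                best, best_score = y, score
        return best
    return predict

predict = fit_naive_bayes(
    [("sunny", "hot"), ("sunny", "mild"), ("rainy", "mild"), ("rainy", "cool")],
    ["no", "no", "yes", "yes"])
predict(("rainy", "mild"))
```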
- Gradient Boosting Machines (GBM)
- https://www.analyticsvidhya.com/blog/2021/09/gradient-boosting-algorithm-a-complete-guide-for-beginners/
- An ensemble method that builds models sequentially to reduce errors.
- AdaBoost
- https://www.analyticsvidhya.com/blog/2021/09/adaboost-algorithm-a-complete-guide-for-beginners/
- https://www.analyticsvidhya.com/blog/2021/03/introduction-to-adaboost-algorithm-with-python/
- A boosting method that combines weak learners to form a strong classifier.
- XGBoost
- https://www.analyticsvidhya.com/blog/2018/09/an-end-to-end-guide-to-understand-the-math-behind-xgboost/
- https://www.analyticsvidhya.com/blog/2018/06/comprehensive-guide-for-ensemble-models/
- A popular gradient-boosting framework optimized for performance.
- K-Means Clustering
- https://www.analyticsvidhya.com/blog/2019/08/comprehensive-guide-k-means-clustering/
- A popular clustering algorithm that partitions data into K clusters.
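The algorithm alternates two steps: assign each point to its nearest centroid, then move each centroid to the mean of its cluster. A minimal Lloyd's-algorithm sketch (naive "first k points" initialization; real implementations use k-means++):

```python
def kmeans(points, k, iters=20):
    """Minimal Lloyd's algorithm; points are tuples of numbers."""
    centroids = list(points[:k])  # naive init: first k points
    for _ in range(iters):
        # assignment step: each point joins its nearest centroid's cluster
        clusters = [[] for _ in range(k)]
        for p in points:
            i = min(range(k),
                    key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            clusters[i].append(p)
        # update step: each centroid moves to the mean of its cluster
        for i, cluster in enumerate(clusters):
            if cluster:  # keep the old centroid if its cluster emptied
                centroids[i] = tuple(sum(v) / len(cluster) for v in zip(*cluster))
    return centroids

centroids = kmeans([(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)], k=2)
# converges to one centroid per well-separated group
```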
- Hierarchical Clustering
- https://www.analyticsvidhya.com/blog/2022/11/hierarchical-clustering-in-machine-learning/
- https://www.analyticsvidhya.com/blog/2021/06/single-link-hierarchical-clustering-clearly-explained/
- A clustering technique that builds a hierarchy of clusters.
- DBSCAN (Density-Based Spatial Clustering of Applications with Noise)
- https://www.analyticsvidhya.com/blog/2021/06/understand-the-dbscan-clustering-algorithm/
- https://www.analyticsvidhya.com/blog/2020/09/how-dbscan-clustering-works/
- Clustering algorithm that groups together closely packed points and marks outliers.
- Principal Component Analysis (PCA)
- https://www.analyticsvidhya.com/blog/2016/03/pca-practical-guide-principal-component-analysis-python/
- https://towardsdatascience.com/the-mathematics-behind-principal-component-analysis-fff2d7f4b643
- A dimensionality reduction technique used to project data onto fewer dimensions.
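The principal components are the eigenvectors of the data's covariance matrix. A sketch that recovers just the leading component via power iteration (purely illustrative; libraries compute the full decomposition with SVD):

```python
def first_pc(data, iters=200):
    """Leading principal component via power iteration on the covariance matrix."""
    n = len(data)
    means = [sum(col) / n for col in zip(*data)]
    centered = [[x - m for x, m in zip(row, means)] for row in data]
    d = len(means)
    # sample covariance matrix of the centered data
    cov = [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(d)]
           for i in range(d)]
    v = [1.0] * d
    for _ in range(iters):
        # multiply by the covariance matrix and renormalize
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    return v

pc = first_pc([(1, 1), (2, 2.1), (3, 2.9), (4, 4.2)])
# the points lie near the line y = x, so the component is roughly (0.7, 0.7)
```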
- t-SNE (t-distributed Stochastic Neighbor Embedding)
- https://medium.com/@sachinsoni600517/mastering-t-sne-t-distributed-stochastic-neighbor-embedding-0e365ee898ea
- A non-linear dimensionality reduction method for visualizing high-dimensional data.
- Intro: https://ketanhdoshi.github.io/Reinforcement-Learning-Intro/
- Solution Approaches: https://ketanhdoshi.github.io/Reinforcement-Learning-Solutions/
- Model Free Solution: https://ketanhdoshi.github.io/Reinforcement-Learning-Model/
- Q-Learning
- https://ketanhdoshi.github.io/Reinforcement-Learning-Q-Learning/
- A model-free reinforcement learning algorithm based on learning a Q-value function.
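The update rule is the part worth memorizing: Q(s, a) ← Q(s, a) + α [r + γ max_a' Q(s', a') − Q(s, a)]. A tabular sketch on an invented corridor environment (all names and parameters here are illustrative):

```python
import random

def q_learning(n_states=5, episodes=300, alpha=0.5, gamma=0.9, eps=0.5, seed=0):
    """Tabular Q-learning on a toy corridor: actions 0 = left, 1 = right,
    reward 1 only for reaching the last state."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        while s != n_states - 1:
            # epsilon-greedy action selection
            if rng.random() < eps:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] >= q[s][1] else 1
            s2 = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s2 == n_states - 1 else 0.0
            # off-policy update: bootstrap off the best next action
            q[s][a] += alpha * (r + gamma * max(q[s2]) - q[s][a])
            s = s2
    return q

q = q_learning()
# after training, "right" has the higher Q-value in every non-terminal state
```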
- Deep Q-Networks (DQN)
- https://ketanhdoshi.github.io/Reinforcement-Learning-Deep-Q-Network/
- A combination of Q-learning and deep neural networks.
- Policy Gradient Methods
- https://ketanhdoshi.github.io/Reinforcement-Learning-Policy-Gradients/
- Learn policies directly instead of learning a value function.
- Proximal Policy Optimization (PPO)
- https://towardsdatascience.com/proximal-policy-optimization-ppo-explained-abed1952457b
- A modern, stable policy optimization method used in reinforcement learning.
- SARSA (State-Action-Reward-State-Action)
- An on-policy reinforcement learning algorithm that updates Q-values using the action actually taken by the current policy (unlike Q-learning's greedy max).
- Artificial Neural Networks (ANN)
- The basic feedforward network of weighted neurons; the building block for the deep learning models below.
- Convolutional Neural Networks (CNN)
- Use convolutional filters to learn spatial features; the standard choice for image tasks.
- Recurrent Neural Networks (RNN)
- https://www.analyticsvidhya.com/blog/2017/12/introduction-to-recurrent-neural-networks/
- https://www.analyticsvidhya.com/blog/2022/03/a-brief-overview-of-recurrent-neural-networks-rnn/
- https://karpathy.github.io/2015/05/21/rnn-effectiveness/
- Used for sequential data tasks like time series or natural language processing.
- Long Short-Term Memory Networks (LSTM)
- https://medium.com/@ottaviocalzone/an-intuitive-explanation-of-lstm-a035eb6ab42c
- https://colah.github.io/posts/2015-08-Understanding-LSTMs/
- https://weberna.github.io/blog/2017/11/15/LSTM-Vanishing-Gradients.html
- https://data-science-blog.com/blog/2020/09/07/back-propagation-of-lstm/
- A type of RNN capable of learning long-term dependencies.
- Transformer Networks
- An attention-based deep learning architecture behind modern NLP models (e.g., BERT, GPT).
- Generative Adversarial Networks (GANs)
- https://www.analyticsvidhya.com/blog/2021/10/an-end-to-end-introduction-to-generative-adversarial-networksgans/
- A framework involving two neural networks to generate new data.
- https://towardsdatascience.com/recommender-systems-a-complete-guide-to-machine-learning-models-96d3f94ea748
- Collaborative Filtering: recommends items based on the preferences of users with similar interaction histories.
- Content-Based Filtering: recommends items whose features are similar to those a user has liked before.
- Bagging
- Trains several base models on bootstrap resamples of the data and combines their predictions by averaging or voting (e.g., Random Forest).
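Bagging can be sketched with any weak base learner; here a decision stump (single-threshold classifier) is trained on each bootstrap resample and the stumps vote. The data and helper names are invented for illustration:

```python
import random
from collections import Counter

def fit_stump(data):
    """Best single-threshold classifier on 1-D labeled data [(x, y), ...]."""
    best = None
    for t in sorted({x for x, _ in data}):
        for left, right in ((0, 1), (1, 0)):
            preds = [(left if x <= t else right) for x, _ in data]
            acc = sum(p == y for p, (_, y) in zip(preds, data))
            if best is None or acc > best[0]:
                best = (acc, t, left, right)
    _, t, left, right = best
    return lambda x: left if x <= t else right

def bagged_predict(data, query, n_models=25, seed=0):
    """Bagging: train stumps on bootstrap resamples, then majority-vote."""
    rng = random.Random(seed)
    votes = Counter()
    for _ in range(n_models):
        sample = [rng.choice(data) for _ in data]  # bootstrap resample
        votes[fit_stump(sample)(query)] += 1
    return votes.most_common(1)[0][0]

data = [(0, 0), (1, 0), (2, 0), (8, 1), (9, 1), (10, 1)]
label = bagged_predict(data, 9)  # the stump ensemble votes on the class
```

The vote averages away the variance of the individual stumps, which is the whole point of bagging.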
- Boosting
- Sequentially builds models that correct the errors of previous models (e.g., XGBoost, AdaBoost).
- Stacking
- Combines multiple models by training a meta-model on their predictions.