Okancan-Balci/README.md

Hi 👋, I'm Okancan

okancan-balci-linkedin okancan-kaggle

As an aspiring Data Scientist/Machine Learning Engineer,

  • I enjoy the storytelling aspect of the data analysis process. A good data story depends on Exploratory Data Analysis (EDA) backed by solid data visualizations. During EDA, data integrity and data quality must be kept in check as part of data cleansing.
  • I believe a good data scientist should know their data's story in order to build strong, resilient Machine Learning applications with high accuracy.
  • I think productionizing Machine Learning algorithms is the most important part of the Data Science process, since even the best-performing ML models can't offer business value in a vacuum, without being deployed.
  • To keep deployed Machine Learning applications resilient and strong, auditing and monitoring must be taken very seriously, since data drift can degrade deployed ML models over time. Proper monitoring and maintenance of deployed ML models are only possible with Continuous Integration and Continuous Delivery (CI/CD).

I'm eager to ...

  • become a better Linux user, since Linux is the dominant OS across cloud platforms.
  • improve in common Data Science languages such as Python, R and SQL, and become proficient with pipeline tools such as Apache Beam and Apache Spark.
  • advance my Cloud Computing skills, since I believe the best way to do Data Science is through cloud-native applications. I am currently making progress on Google Cloud Platform.
  • develop and hone my Machine Learning Operations (MLOps) abilities on cloud platforms using tools such as Docker containers, Kubernetes (Kubeflow), GitHub Actions, BigQuery ML and Vertex AI.

Since I consider myself a data person, I enjoy learning new tools and tackling new challenges in Data Science related subjects such as programming, statistics, mathematics, or another Data/ML framework such as MLflow. Though the learning process can be hard, time-consuming and painful at times😓, I believe it's also quite rewarding intellectually.

You can find my personal projects here on GitHub or on my Kaggle.

If you wish to contact me, my e-mail is [email protected]. I am always open to a chat about Data Science 😊.

Languages

r python mysql postgresql bash zsh

Tools & Libraries

tidyverse tidymodels dplyr ggplot2 rmarkdown

numpy pandas scikit_learn seaborn tensorflow selenium

rstudio-IDE jupyter-lab docker git

Cloud Platform Tools

gcp google-cloud-storage data-flow apache-beam apache-spark bigquery automl vertex-ai

Pinned

  1. IMDB_Spider-Man_Text_Analysis Public

    I analyzed Spider-Man movie reviews from IMDb. I employed basic NLP techniques such as TF-IDF, Sentiment Analysis and Topic Modelling, and shared the results with solid visualizations. All done with R.

    RMarkdown

  2. Kaggle_Notebooks Public

    This repo contains my uncategorized Kaggle Notebooks. The rendered versions of the Notebooks can be found on Kaggle.

    RMarkdown

  3. Selenium_Web_Scrapers Public

    Contains Selenium WebDriver web scrapers for IMDb and BestBuy. The scrapers aren't automated: the scraping was run interactively in Jupyter Notebook sessions under manual supervision.

    Jupyter Notebook

  4. KPMG_AU_Data_Analytics Public

    Project for the virtual internship program from forage.com, in which I cleansed, analyzed and modeled the provided data using R and Python.

    Jupyter Notebook

  5. MA_Thesis Public

    Contains all of my thesis work, including experimental code and the thesis itself with its accompanying code in RMarkdown.

    RMarkdown

  6. freeCodeCamp_Projects Public

    These are the projects I completed to earn freeCodeCamp certificates.

    Jupyter Notebook