Skip to content

KristofVandewynckel/Telecom_Churn

Repository files navigation

Customer Churn Prediction Using Machine Learning

Churn

Project description:

In this project we are predicticting the customer churn rate - the percentage of customers who either cancel or don't renew their subscription. It is an important measure that can help businesses to retain their clients.

  • Data source - This Kaggle dataset tracks a telco company's customer churn based on a variety of possible factors.
    • Target value is churn - it indicates whether or not the customer left within the last month
    • Features include: customerID, gender, SeniorCitizen, Partner, Dependents, tenure, PhoneService, MultipleLines, InternetService, OnlineSecurity, OnlineBackup, DeviceProtection, TechSupport, StreamingTV, StreamingMovies, Contract, PaperlessBilling, PaymentMethod, MonthlyCharges, TotalCharges

Steps of the project:

  • Exploring and pre-processing the data (Pandas,Numpy)
  • Visualising (Matplotlib, Seaborn, Plotly)
  • Comparing performance of various ML models (Sklearn)
  • Fine tuning the best performing models (Sklearn)
  • Looking for clusters in the data (Sklearn)
  • Interpreting the results (Sklearn)

Machine Learning approach:

Overview of the models we tested:

models

Visuals:

Churn

Churn

Churn

Overview of the clusters.

In this unsupervised way of machine learning, we take away the target column (churn? yes/no) from the database and use Kmeans clustering to let the algorithm decide for itself which customers share certain patterns. Churn

ROC Curve

ROC

Odds and Probabilities

Here we have an overview of how certain featues impact the chance of churning. Churn

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published