⨠Executive Summary
Customer churn is a critical challenge for subscriptionâbased businesses, as retaining existing customers is often more costâeffective than acquiring new ones. This case study applies predictive modeling using Python (pandas, scikitâlearn) on the Telco Customer Churn dataset (~7,043 customers, 20+ features) to identify atârisk customers and recommend retention strategies.
Key findings (to be finalized after modeling):
- Overall churn rate is (~26.5%).
- Contract type, tenure, and monthly charges are the strongest predictors of churn.
- Customers on monthâtoâmonth contracts with high monthly charges are most likely to churn.
- Machine learning models (Logistic Regression, Random Forest) achieved strong predictive performance (ROCâAUC ~0.8).
- Targeted retention strategies could significantly reduce churn and improve customer lifetime value.
đ Project Overview
- Domain: Customer Analytics / Predictive Modeling
- Dataset: Telco Customer Churn dataset (~7,043 customers, 20+ service and account features, including churn labels)
- Tools Used: Python (pandas, NumPy, scikitâlearn, matplotlib, seaborn), Jupyter/Colab
- Deliverables: Clean dataset & reproducible notebook, churn prediction models (Logistic Regression, Random Forest), evaluation metrics (Accuracy, Precision, Recall, ROCâAUC), visuals (EDA plots, feature importance, ROC curve), executive summary, and optional slide deck
Scope: Build a predictive model to classify customers as âChurnâ or âNot Churn,â identify the strongest predictors of churn, and provide actionable retention strategies for business teams.
â Business Question
Primary Question:
How can we accurately predict which customers are most likely to churn and translate those insights into effective retention strategies?