This project is about building a classification model, where you will be given peoplePregnancies, Glucose, blood pressure, skin thickness, Insulin, BMI, DiabetesPedigreeFunction, Age and you would need to build a model that predicts if the new person is diabetic or not (Outcome). This helps patients to get diagnosed at an early stage to prevent any risk in the future
Based on the Pima Indians historical diabetes data, build a machine learning binary classification model to predict if the person is diabetic or Not based on the below features, Pregnancies, Glucose, blood pressure, skin thickness, Insulin, BMI, Diabetes Pedigree Function, Age
Evaluation will be done based on the below details,
- Document the hypothesis
- Loading the data
- Understanding the data
- Preprocessing with Exploratory Data Analysis (EDA)
- Missing value and outlier treatment (if any)
- Feature Engineering
- Model Building - Build multiple models and select the best one.
S.No. |
Description |
Code File |
Solution File |