This project focuses on Chronic Kidney Disease (CKD) analysis using a Jupyter Notebook. The goal is to analyze CKD data and build models for predicting the presence of CKD.
The project involves:
- Data loading and preprocessing
- Exploratory Data Analysis (EDA)
- Machine Learning model training and evaluation
- Visualizations
- Python
- Jupyter Notebook
- Pandas
- NumPy
- Scikit-learn
- Matplotlib
- Seaborn
Ensure you have the following libraries installed:
pip install pandas numpy scikit-learn matplotlib seaborn
INFO6105_Final_Project_CKD.ipynb # Jupyter Notebook with the project code
data/ # Dataset used for the analysis
README.md # Project documentation
- Clone the repository.
- Open the Jupyter Notebook:
jupyter notebook INFO6105_Final_Project_CKD.ipynb
- Run all cells to see the data analysis and model results.
The dataset used contains medical data related to CKD with various features like age, blood pressure, specific gravity, and more. Ensure the dataset is stored in the data/
folder.
The project explores the following models:
- Logistic Regression
- Decision Tree
- Random Forest
- Support Vector Machine (SVM)
The results include:
- Accuracy, Precision, Recall, and F1-Score
- Confusion Matrix
- Feature Importance
Feel free to fork the repository and submit pull requests for improvements.
This project is licensed under the MIT License.