Requirements:
Find 2 datasets, one for regression and the other for classification
Regression:
linear regression, polynomial regression (up to degree 3), random forest, SVM
Classification:
logistic regression, KNN, random forest, SVM
Project Requirements:
No. of rows >=1000
No. of variables > 2
No. of classes for the dependent variable must be more than 2 for classification
Do K-fold cross-validation for both.
For regression show: R2, Adjusted R2, RMSE, correlation matrix, p-values of independent variables (codes 10)
For classification show: accuracy, confusion matrix, and macro-averaged recall and precision (for multiclass classification) (codes 10)
Do hyper-parameter tuning using Grid Search
The report should discuss the properties of the datasets, your results, and model performance comparisons, and inferences/conclusions. (10)
Prepare a report to discuss the properties of the datasets, your results, and inferences. (10)
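The metrics named in the requirements can be written out directly. The following is a minimal sketch using only NumPy, not the solution's own code. The function names `regression_metrics` and `macro_precision_recall` are hypothetical, and the confusion matrix is assumed to have true classes as rows and predicted classes as columns.

```python
import numpy as np

def regression_metrics(y_true, y_pred, n_features):
    """Return (R2, adjusted R2, RMSE) for one set of predictions."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    n = y_true.size
    ss_res = np.sum((y_true - y_pred) ** 2)         # residual sum of squares
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)  # total sum of squares
    r2 = 1.0 - ss_res / ss_tot
    # Adjusted R2 penalises R2 for the number of independent variables.
    adj_r2 = 1.0 - (1.0 - r2) * (n - 1) / (n - n_features - 1)
    rmse = np.sqrt(ss_res / n)
    return r2, adj_r2, rmse

def macro_precision_recall(conf):
    """Macro-averaged precision and recall from a confusion matrix.

    Assumes rows are true classes, columns are predicted classes,
    and every class is predicted at least once (no zero column sums).
    """
    conf = np.asarray(conf, dtype=float)
    tp = np.diag(conf)                 # correctly classified counts per class
    precision = tp / conf.sum(axis=0)  # tp / everything predicted as the class
    recall = tp / conf.sum(axis=1)     # tp / everything truly in the class
    return precision.mean(), recall.mean()
```

scikit-learn offers equivalents for most of these (`r2_score`, `precision_score` and `recall_score` with `average="macro"`). Adjusted R2 is not provided by it and has to be derived from R2 as above.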
Here is a solution that fulfills the above requirements:
Import Libraries
>>> import pandas as pd
>>> import numpy as np
>>> import matplotlib.pyplot as plt  # data visualization libraries
>>> import seaborn as sns
>>> %matplotlib inline  # IPython/Jupyter magic; omit in a plain Python session
Load Data
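Once a dataset is loaded, it is worth checking it against the project requirements before modelling. The sketch below is an illustration only: `check_requirements` is a hypothetical helper, the DataFrame is synthetic stand-in data rather than the dataset used in this post, and the variable count is assumed to include the target column.

```python
import numpy as np
import pandas as pd

def check_requirements(df, target, task):
    """Return pass/fail flags for the size and class-count requirements."""
    checks = {
        "rows >= 1000": bool(len(df) >= 1000),
        # Assumption: the target column counts as one of the variables.
        "variables > 2": bool(df.shape[1] > 2),
    }
    if task == "classification":
        checks["classes > 2"] = bool(df[target].nunique() > 2)
    return checks

# Synthetic stand-in for a real dataset (hypothetical column names).
rng = np.random.default_rng(0)
demo = pd.DataFrame({
    "x1": rng.normal(size=1200),
    "x2": rng.normal(size=1200),
    "label": rng.integers(0, 3, size=1200),
})

result = check_requirements(demo, target="label", task="classification")
print(result)
```

For a real file, `demo` would be replaced by the DataFrame returned from `pd.read_csv` on your own dataset.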
Creating methods to update column values
Applying these methods to the pandas DataFrame to update values
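As a rough illustration of these two steps (defining helper methods, then applying them to columns), here is a minimal sketch. The helpers `encode_grade` and `to_number`, the column names, and the sample values are all hypothetical and are not taken from the original dataset.

```python
import pandas as pd

def encode_grade(value):
    """Map a text category to an integer code; unknown values become -1."""
    mapping = {"low": 0, "medium": 1, "high": 2}
    return mapping.get(str(value).strip().lower(), -1)

def to_number(value):
    """Remove thousands separators and convert the value to float."""
    return float(str(value).replace(",", ""))

# Small illustrative frame (made-up values, for demonstration only).
df = pd.DataFrame({
    "grade": ["Low", "high ", "Medium"],
    "price": ["1,200", "950", "3,400.5"],
})

# Apply each helper element-wise to its column.
df["grade"] = df["grade"].apply(encode_grade)
df["price"] = df["price"].apply(to_number)
```

`Series.apply` calls the helper once per value, which keeps the cleaning logic easy to test on its own. For simple dictionary lookups, `Series.map` is a common alternative.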
The next steps implement logistic regression. If you need the complete solution with K-fold cross-validation for logistic regression classification, please contact us here or leave a comment in the comments section below.
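For orientation, a logistic regression classifier with K-fold cross-validation and grid search could look roughly like the sketch below. This is not the complete solution referred to above. It assumes scikit-learn is installed and uses synthetic data from `make_classification` as a stand-in for a real dataset. The grid of `C` values and the choice of 5 folds are arbitrary illustrative choices.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import (accuracy_score, confusion_matrix,
                             precision_score, recall_score)
from sklearn.model_selection import (GridSearchCV, StratifiedKFold,
                                     cross_val_predict)
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in: 1200 rows, 6 features, 3 classes.
X, y = make_classification(n_samples=1200, n_features=6, n_informative=4,
                           n_redundant=0, n_classes=3, random_state=0)

# Stratified folds keep the class proportions similar in every split.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

# Scaling lives inside the pipeline so it is refit on each training fold.
pipe = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))

# Grid search over the regularisation strength C (illustrative values).
param_grid = {"logisticregression__C": [0.01, 0.1, 1, 10]}
search = GridSearchCV(pipe, param_grid, cv=cv, scoring="accuracy")
search.fit(X, y)

# Out-of-fold predictions for the tuned model. Reusing the tuning folds
# makes these scores slightly optimistic; nested CV would avoid that.
y_pred = cross_val_predict(search.best_estimator_, X, y, cv=cv)

acc = accuracy_score(y, y_pred)
cm = confusion_matrix(y, y_pred)
macro_p = precision_score(y, y_pred, average="macro")
macro_r = recall_score(y, y_pred, average="macro")
print(search.best_params_, acc, macro_p, macro_r)
print(cm)
```

The same pattern carries over to KNN, random forest, and SVM by swapping the estimator in the pipeline and adjusting the parameter grid keys to match.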