I cacluated Burn out Rate with help of Month of Joining, Gender, Company type, Designation, Resource Allocation and Mental Fatigue Score
- Before beginning with Machine Learning I have analysed the data and have done data pre processing.
- All the dates were of year 2008 so I have used th concept of feature engineering to add a new feature with the month as date and year will be insignificant.
- I used Label Encoder (refers to converting the labels into a numeric form so as to convert them into the machine-readable form) for Company Type, WFH Setup, and Gender for model to read values and predict accordingly
- The missing data with NaN values is replaced using ffill method which used to fill the missing value in the dataframe. ‘ffill’ stands for ‘forward fill’ and will propagate last valid observation forward.
- I have replaced all infinity values with NaN and then with 0 since such huge data will affect the model.
- I used linear regression to build the model.
- I have used skicit learns libraries to perform linear regression and have added the predicted values of burn rate in the test data set.
- Accuracy of the Model turns out to be 55.53% using Linear Regression