4 min to read
Neural Network Approach to Employee Attrition Analysis
Understanding Employee Attrition: Analyzing Key Factors Influencing Turnover in the Workplace
Overview
The dataset examines employee attrition within a company and includes 1,470 employee records with 35 attributes. These attributes provide insights into various factors influencing employee turnover, such as demographic information, job-related metrics, compensation details, work environment, and career progression. This comprehensive dataset is valuable for understanding the dynamics of employee attrition and identifying key factors that contribute to turnover, enabling targeted interventions to enhance employee retention and organizational stability.
Click this link to directly access the report: Neural Network Approach to Employee Attrition Analysis
Business Questions
- What is the breakdown of distance from home by job role and attrition?
- How does the average monthly income compare across different education levels and attrition?
Project Structure
- Introduction: Overview of the dataset and the purpose of the analysis.
- Exploring the Data: Initial data exploration and understanding the distribution of variables.
- Analyzing Trends: Identifying patterns and correlations related to employee attrition.
- Model Building: Developing predictive models to understand factors influencing employee attrition.
- Visualizations: Creating visual representations of the data to highlight key insights.
- Reporting/Conclusion: Summarizing findings and providing recommendations based on the analysis.
Column Descriptions
- Age: Age of the employee
- Attrition: Whether the employee left the company (Yes, No)
- BusinessTravel: Frequency of business travel (Non-Travel, Travel_Rarely, Travel_Frequently)
- DailyRate: Amount paid per day of work
- Department: Work department (Research & Development, Sales, Human Resources)
- DistanceFromHome: Distance between company and home
- Education: Level of education (1: Below College, 2: College, 3: Bachelor, 4: Master, 5: Doctor)
- EducationField: Field of education (Life Sciences, Medical, Human Resources, Technical Degree, Marketing, Other)
- EmployeeCount: Count of employees (always 1)
- EmployeeNumber: ID of the employee
- EnvironmentSatisfaction: Satisfaction with environment (1: Low, 2: Medium, 3: High, 4: Very High)
- Gender: Gender of the employee (Male, Female)
- HourlyRate: Amount paid per hour of work
- JobInvolvement: Level of job involvement (1: Low, 2: Medium, 3: High, 4: Very High)
- JobLevel: Level of the job (1 - 5)
- JobRole: Role of the job (Sales Executive, Research Scientist, Laboratory Technician, etc.)
- JobSatisfaction: Satisfaction with the job (1: Low, 2: Medium, 3: High, 4: Very High)
- MaritalStatus: Marital status (Married, Single, Divorced)
- MonthlyIncome: Monthly income
- MonthlyRate: Monthly rate of pay
- NumCompaniesWorked: Number of companies the employee has worked with
- Over18: Whether the employee is over 18 years old (Yes, No)
- OverTime: Whether the employee works overtime frequently (Yes, No)
- PercentSalaryHike: Percentage increase in salary
- PerformanceRating: Level of performance assessment (1: Low, 2: Good, 3: Excellent, 4: Outstanding)
- RelationshipSatisfaction: Satisfaction with relationships (1: Low, 2: Medium, 3: High, 4: Very High)
- StandardHours: Standard work hours (always 80)
- StockOptionLevel: Stock option level (0 - 3)
- TotalWorkingYears: Total number of years the employee has worked
- TrainingTimesLastYear: Number of training times in the last year
- WorkLifeBalance: Level of work-life balance (1: Bad, 2: Good, 3: Better, 4: Best)
- YearsAtCompany: Number of years at the current company
- YearsInCurrentRole: Number of years in the current role
- YearsSinceLastPromotion: Number of years since last promotion
- YearsWithCurrManager: Number of years with the current manager
Workflow
- Data Collection: Gather the dataset from the source.
- Data Cleaning: Handle missing values, remove duplicates, and ensure data quality.
- Exploratory Data Analysis (EDA): Analyze the dataset to understand the distribution and relationships between variables.
- Feature Engineering: Create new features or modify existing ones to improve model performance.
- Model Building: Develop machine learning models to predict employee attrition.
- Model Evaluation: Assess the performance of the models using accuracy, recall, precision, and other metrics.
- Visualization: Create visualizations to present findings and insights.
- Reporting: Summarize the analysis, model results, and provide actionable insights.
Exploring the Data
We will start by exploring the dataset to understand the distribution of key variables such as age, attrition rates, job roles, and income levels. We’ll also examine correlations between different predictors and the target variable, attrition.
Analyzing Trends
Identifying trends in employee attrition based on different attributes such as job role, department, and job satisfaction. This analysis will help in understanding which factors contribute most significantly to employee turnover.
Visualizations
Visualizations will be created to represent the data and findings, such as:
- Attrition rate by job role and department.
- Distribution of employee age and its correlation with attrition.
- Monthly income comparisons across different education levels and job roles.
Reporting
Based on the analysis, we conclude that while certain factors such as distance from home, job satisfaction, and work-life balance significantly influence employee attrition, the model’s performance needs further improvement. High accuracy with low recall suggests that the model may be biased towards the majority class. Future work will focus on enhancing model recall and precision through advanced modeling techniques and more balanced data.
Dataset Source
The dataset used in this project can be found at the following link: Employee Attrition Analysis.
The report for this project is available here: Employee Attrition Analysis Report.