Back To Projects
A Machine Learning Approach to Understanding the Determining Factors of the Gender Wage Gap
Sophia G. |
workspace_premium 2nd Place at San Diego BROADCOM Science Fair (Senior Division)

By studying the affect of different attributes on the gender wage gap, we can better understand both the scale of this issue and its possible solutions. So, we explore the question, how does a worker’s marital status, along with other variables, impact the gap in hourly wage between male and female workers? We seek to create a model able to predict the gender wage gap given a set of variables—age, years of education, race, state, and marital status.


Gender inequality is a complex subject consisting of a variety of issues and nuances. In this project, we choose to study gender income inequality—a prevalent issue in current society. Among the many factors that play a role in the gender wage gap, we focus on the affects of marital status, race, geographical location (by state), age, and years of education. By using these variables to create a model able to predict the hourly wage gap between a woman and their equivalent male counterpart, we can analyze the impact of each variable to better understand the role they play in the income gap. Utilizing income data from the Current Population Survey, we train and test five models—a Linear Regression, Decision Tree Regressor, Random Forest Regressor, KNeighbors Regressor, and MLP Regressor. Our Linear Regression model found that there is a correlation between being a never married worker and a smaller gender wage gap, as well as being a married worker with an absent spouse and a greater gender wage gap. In general, though, our models found little correlation between the variables provided and the predicted hourly age gap.

Explore More!

Source Code
Sophia G.
Ana Sofia Muñoz Valadez

Related Projects

workspace_premium
The Differentiation of Viral and Bacterial Pneumonia using Deep Learning

This project aims to find out whether a Convolutional Neural Network can be used to classify x-ray scans as having either bacterial or viral Pneumonia.
Arnav D.
Mentored by Erick Siavichay
workspace_premium
VisionAssist: Enhancing Accessibility for Individuals with Visual Impairment Through AI

This project explores how AI can support individuals with visual impairments by developing a system that converts images containing text or math into audio and Braille in near real-time. Using a fine-tuned OCR model, the system achieves high accuracy and low latency, demonstrating that AI can be a powerful tool for improving accessibility to educational content.
Azaan R.
Mentored by Joe Xiao
workspace_premium
Evaluating Machine Learning Models on Predicting Change in Enzyme Thermostability

Our research problem is finding the best machine learning model to predict the change in enzyme thermostability after a single point mutation in the amino acid sequence.
Avnith V.
Mentored by Jacklyn Luu