Back To Projects
Predicting Running Injuries with Machine Learning Models

Is it possible to predict running injuries with only a dataset and machine learning models? This paper explores this question by using classification models, including the Logistic Regression model and the Random Forest Classifier model.


Is it possible to predict running injuries with only a dataset and machine learning models? This paper explores this question by using classification models, including the Logistic Regression model and the Random Forest Classifier model. In the dataset used, ten features were taken into account when predicting running injuries. With slight modifications, the Weighted Logistic Regression and over and down-sampling Random Forest Classifier models were used to mitigate the imbalance in the dataset. The results suggested that the best model was Weighted Logistic Regression and that the best score metric to take into account was the F beta score.

Explore More!

Published Paper
Elgin V.
Joseph Vincent
Aerospace Engineering PhD Candidate at Stanford

Related Projects

workspace_premium
Combating Climate Fake News Using NLP

As fake news becomes more prevalent across the US, important issues become harder to solve. One such issue is climate change, where climate misinformation has worsened viewer’s abilities to distinguish between fake information and real information. This project’s objective is to tackle climate misinformation using an artificial intelligence model.
Rayyan M.
Mentored by Philip Bell
workspace_premium
A Machine Learning Approach to Understanding the Determining Factors of the Gender Wage Gap

By studying the affect of different attributes on the gender wage gap, we can better understand both the scale of this issue and its possible solutions. So, we explore the question, how does a worker’s marital status, along with other variables, impact the gap in hourly wage between male and female workers? We seek to create a model able to predict the gender wage gap given a set of variables—age, years of education, race, state, and marital status.
Sophia G.
Mentored by
Application of AI to Tennis Match Footage Transcription

One of the best ways for tennis players to improve their game is to record and watch their own match footage, find patterns in the points they win and lose, and practice based on these realizations. However, watching match footage and documenting each point shot by shot is a very time-consuming process. This paper investigates an AI approach to transcribing tennis match footage, combining a deep convolution neural network (YOLOv4), a pose estimation model (Movenet), and a long short- term memory (LSTM) deep neural network. Looking at a transcript of each point will be far more efficient than watching entire match footage for a player to understand how they are losing and winning and analyze patterns in their game. The LSTM model in this project achieved accura- cies of 73.33% and 79.31% when classifying shot type (forehand, forehand volley, forehand slice, backhand, backhand volley, backhand slice, over- head/smash, and serve) for players on the close side and opposite side of the net, respectively, and 55.17% and 60.00% when classifying the di- rection of a shot (cross-court, down the line, down the middle, inside in, inside out, out wide, down the t, and body) for players on the close side and opposite side of the net, respectively.
Marco Y.
Mentored by Eric Bradford