Comparing different machine learning methods on NFL defensive player performance prediction

dc.contributor.authorGweon, Jihyuk
dc.date.accessioned2024-08-08T18:19:18Z
dc.date.available2024-08-08T18:19:18Z
dc.date.graduationmonthAugust
dc.date.issued2024
dc.description.abstractThis study examines the effectiveness of various machine learning techniques in predicting defensive player performance in the National Football League (NFL). The dataset from Kaggle's "NFL BIG DATA BOWL 2024" includes numerous variables related to games, players, plays, and tackle details for NFL players. Six datasets were created by pairing each of the three subsets of predictive variables with either of the two response variables, 'attack' or 'tackle'. These datasets were analyzed using multiple machine learning methods. The research aims to provide a comprehensive analysis of defensive performance by incorporating a broad array of algorithms, including Logistic Regression, Random Forest, K-Nearest Neighbors (KNN), AdaBoost, Support Vector Machine (SVM), Decision Tree, and XGBoost. Hyperparameter tuning was employed to optimize each model's performance. Among the classifiers, Logistic Regression, XGBoost and AdaBoost consistently achieved the highest performance metrics, demonstrating their robustness and effectiveness in predicting NFL defensive performance. Decision Tree, SVM, and Random Forest have slightly lower AUC than the top three, but the difference is not statistically significant. KNN has lowest performance and the difference from the top three is significant even though the difference with the middle three classifiers is not significant. The analysis revealed that variables such as 'player_bmi', 'age', 'tackleSuccessRate', 'assistSuccessRate', 'forcedFumble', 'assist', and 'quarter' offer the most predictive power for both tackle and attack outcomes across all classifiers. This comprehensive evaluation underscores the importance of feature selection and hyperparameter tuning in optimizing machine learning models for sports analytics.
dc.description.advisorHaiyan Wang
dc.description.degreeMaster of Science
dc.description.departmentDepartment of Statistics
dc.description.levelMasters
dc.identifier.urihttps://hdl.handle.net/2097/44442
dc.language.isoen_US
dc.publisherKansas State University
dc.rights© the author. This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/
dc.subjectAUC
dc.subjectXGBoost
dc.subjectPrediction NFL defensive performance
dc.subjectMachine learning
dc.subjectLogistic Regression
dc.subjectRandom Forest
dc.titleComparing different machine learning methods on NFL defensive player performance prediction
dc.typeReport

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
JihyukGweon2024.pdf
Size:
2.91 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.6 KB
Format:
Item-specific license agreed upon to submission
Description: