Machine learning for high performance computing applications
dc.contributor.author | Hutchison, Scott | |
dc.date.accessioned | 2024-04-15T19:01:52Z | |
dc.date.available | 2024-04-15T19:01:52Z | |
dc.date.graduationmonth | May | |
dc.date.issued | 2024 | |
dc.description.abstract | The focus of this study was to apply state-of-the-art Machine Learning (ML) techniques to problems in the High Performance Computing (HPC) domain. The ML techniques included clustering, various types of regression, a recommendor system, and reinforcement learning using proximal policy optimization. Included are three different advancements applying these techniques. The first application used K-means clustering and Gradient Boosted Tree Regression (GBTR) to predict estimated queue time for jobs submitted to an HPC system. This method achieved a 96% accuracy when predicting whether or not a job would start prior to a specified deadline. The second application focused on optimizing hardware procurement for HPC systems while remaining under a fixed budget. Vendor quotes for new hardware were used with a custom Discrete Event Simulator (DES) to simulate the execution of a job workload on proposed hardware. An Extreme Gradient Boosting (XGBoost) regression model powers a recommendor system that provides a precision@50 of 92%. The third application used Proximal Policy Optimization (PPO) with Invalid Action Masking (IAM) to train a Reinforcement Learning (RL) agent to schedule jobs on a simulated HPC system. The performance of this RL agent was compared to modern scheduling algorithms. The RL agent performed 18.44% better than the algorithmic baselines for one metric and comparably to the baselines for another. | |
dc.description.advisor | Daniel A. Andresen | |
dc.description.degree | Doctor of Philosophy | |
dc.description.department | Department of Computer Science | |
dc.description.level | Doctoral | |
dc.identifier.uri | https://hdl.handle.net/2097/44307 | |
dc.language.iso | en_US | |
dc.publisher | Kansas State University | |
dc.rights | © the author. This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). | |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | |
dc.subject | High performance computing | |
dc.subject | Machine learning | |
dc.subject | Reinforcement learning | |
dc.subject | Regression | |
dc.subject | Artificial intellegence | |
dc.title | Machine learning for high performance computing applications | |
dc.type | Dissertation |