Optimizing high performance computing system’s, resource utilization and throughput by leveraging machine learning

dc.contributor.authorDunn, Brandon
dc.date.accessioned2021-08-25T14:29:59Z
dc.date.available2021-08-25T14:29:59Z
dc.date.graduationmonthMayen_US
dc.date.published2021en_US
dc.description.abstractHigh Performance Computing (HPC) facilitates a significant portion of research and analytics across many different fields, industries, and education. HPC is implemented using supercomputers, which can be comprised of a few servers to tens to thousands. HPC systems typically use a scheduler - such as Slurm - to manage the execution of tasks on the system. Schedulers typically have hundreds of configuration parameters. With such diverse workflows and hardware the question becomes: how do we adapt these HPC schedulers so that we keep a high utilization and throughput on the systems? Our research focuses on optimizing the SLURM scheduler by adapting its configuration options based on the type of hardware in the High Performance Computing system and types of workflows, utilizing Semi-supervised Machine Learning.en_US
dc.description.advisorDaniel A. Andresenen_US
dc.description.degreeMaster of Scienceen_US
dc.description.departmentDepartment of Computer Scienceen_US
dc.description.levelMastersen_US
dc.identifier.urihttps://hdl.handle.net/2097/41685
dc.language.isoen_USen_US
dc.subjectSLURMen_US
dc.subjectHPCen_US
dc.subjectMachine learningen_US
dc.titleOptimizing high performance computing system’s, resource utilization and throughput by leveraging machine learningen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
BrandonDunn2021.pdf
Size:
621.17 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: