Graph mining for role extraction in predictive analytics of high-performance computing systems

dc.contributor.author: Bobadilla Dias, Luis Enrique
dc.date.accessioned: 2020-04-20T15:31:50Z
dc.date.available: 2020-04-20T15:31:50Z
dc.date.graduationmonth: May (en_US)
dc.date.issued: 2020-05-01
dc.date.published: 2020 (en_US)
dc.description.abstract: This thesis addresses the task of analyzing property graphs in system log data from high-performance computing (HPC) systems to identify entity roles that aid in predicting job submission outcomes. This predictive analytics project uses inductive learning on historical logs to produce regression models for estimating resource needs and potential shortfalls, and classification models that predict when jobs will fail due to insufficient resource allocation. The log files are generated by the workload manager of an HPC compute cluster and include runtime parameters for every submitted job. The research objectives of the overall project consist of using these techniques to solve three extant problems: (1) predicting the sufficiency of resources requested in an HPC system at job submission time; (2) making HPC resource allocation more efficient; and (3) building a decision support system for HPC users. Previous approaches used features such as user demographics, together with simulations driven by simple optimization algorithms, to improve resource allocation usage on a large-scale compute cluster (Kansas State University’s Beocat). In this thesis, role extraction is applied with the goal of creating a user-specific feature for machine learning tasks. Specific use cases include personalized prediction of submitted job outcomes and reinforcement learning from simulation for optimization tasks in job scheduling. Objectives include improving on the accuracy, precision, recall, and utility of previous learning systems. (en_US)
dc.description.advisor: William H. Hsu (en_US)
dc.description.degree: Master of Science (en_US)
dc.description.department: Department of Computer Science (en_US)
dc.description.level: Masters (en_US)
dc.identifier.uri: https://hdl.handle.net/2097/40528
dc.language.iso: en_US (en_US)
dc.subject: Graph mining (en_US)
dc.subject: Role extraction (en_US)
dc.subject: High-performance computing systems (en_US)
dc.subject: Property graphs (en_US)
dc.subject: Predictive analytics (en_US)
dc.title: Graph mining for role extraction in predictive analytics of high-performance computing systems (en_US)
dc.type: Thesis (en_US)

Files

Original bundle
Name: LuisBobadillaDias2020.pdf
Size: 3.14 MB
Format: Adobe Portable Document Format