Statistical mechanics approaches to high-dimensional survival analysis

dc.contributor.authorChu, Guotao
dc.date.accessioned2022-01-31T16:20:09Z
dc.date.available2022-01-31T16:20:09Z
dc.date.graduationmonthMayen_US
dc.date.issued2022-05-01
dc.date.published2022en_US
dc.description.abstractWith the advent of high-dimensional data, variable selection has become a key step in survival data analysis. Recently, a general class of model selection criteria for high-dimensional data, called the generalized information criterion, has been developed. However, the use of the non-convex penalty functions in the generalized information criterion results in high-dimensional non-convex optimization problems. While many works have been proposed, their focus is limited to the application of a convex surrogate approach, which cannot ensure the convergence to the global optimal model with respect to the generalized information criterion. The objective of this dissertation is to develop new solutions to high-dimensional data challenges of survival analysis. To meet this goal, we develop a powerful framework for high-dimensional survival data analysis using the notion of statistical mechanics, which is one of the pillars of modern physics. The proposed methods in this dissertation are widely applicable to not only model fitting problems but also prediction problems. To investigate the performance of our proposed methods, simulation study and real data analysis are extensively implemented. In Chapter 1, the background, existing obstacles, rationale, and motivation are discussed. In Chapter 2, we develop a new fast variable selection procedure using the idea of simulated annealing with some modifications. The proposed method allows for rapidly finding the global optimal model with respect to the generalized information criterion. In Chapter 3, we develop a new best predictive model selection method for high-dimensional survival modeling. The proposed method relies on the idea of the optimal Bayesian predictive model, called the median probability model. In Chapter 4, we develop a robust variable selection approach to high-dimensional survival regression models. It is motivated by the "sandwich" estimator and provides a way for finding the global optimal model when the model is misspecified.en_US
dc.description.advisorGyuhyeong Gohen_US
dc.description.degreeDoctor of Philosophyen_US
dc.description.departmentDepartment of Statisticsen_US
dc.description.levelDoctoralen_US
dc.identifier.urihttps://hdl.handle.net/2097/41999
dc.language.isoen_USen_US
dc.subjectSurvival analysisen_US
dc.subjectHigh-dimensional variable selectionen_US
dc.subjectGeneralized information criterionen_US
dc.subjectStatistical mechanicsen_US
dc.subjectBoltzmann distributionen_US
dc.titleStatistical mechanics approaches to high-dimensional survival analysisen_US
dc.typeDissertationen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
GuotaoChu2022.pdf
Size:
574.34 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: