Sparse Bayesian kernel learning for high-dimensional regression and classification

Duan, Weikang

dc.contributor.author	Duan, Weikang
dc.date.accessioned	2022-05-10T19:41:20Z
dc.date.available	2022-05-10T19:41:20Z
dc.date.graduationmonth	May
dc.date.issued	2022
dc.description.abstract	In recent decades, statistical learning has become an increasingly popular topic that has drawn significant attention from researchers. Kernel-based nonlinear models, in particular, are powerful tools because of their flexibility in extracting information from complex datasets. A major challenge for kernel modeling in the current big data era is the curse of dimensionality. Although an abundance of variable selection methods has been proposed, the development of high-dimensional Bayesian kernel models is still in its infancy. Beyond variable selection, kernel-based models inherently incur heavy computational costs, which further limit the application of related methods. The goal of this dissertation is to develop new, fast variable selection and prediction procedures that address the problem of high-dimensional nonlinear regression and classification from the Bayesian perspective. To reduce the computational cost, we propose a novel hybrid search algorithm and Bayesian doubly-sparse frameworks for kernel-based models. In Chapter 1, we discuss the background, existing methods, and their limitations, and give the motivation for our study. In Chapter 2, we propose a Bayesian model hybrid search algorithm for Gaussian process (GP) regression models, which quickly scans the model space for a set of models with high posterior probabilities. In addition, we address the massive and high-dimensional data problem for GP by proposing an approach that combines quantile subsample hybrid search with a nearest neighbor GP scheme. In Chapter 3, we propose a novel Bayesian doubly-sparse framework for reproducing kernel Hilbert space (RKHS) regression models. The proposed doubly-sparse framework performs both variable selection and sparse kernel matrix estimation. In Chapter 4, we extend the proposed Bayesian doubly-sparse framework to the nonlinear Bayesian support vector machine.
dc.description.advisor	Gyuhyeong Goh
dc.description.degree	Doctor of Philosophy
dc.description.department	Department of Statistics
dc.description.level	Doctoral
dc.identifier.uri	https://hdl.handle.net/2097/42235
dc.language.iso	en_US
dc.publisher	Kansas State University
dc.rights	© the author. This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/
dc.subject	Bayesian
dc.subject	Gaussian process
dc.subject	Reproducing kernel Hilbert space
dc.subject	Variable selection
dc.title	Sparse Bayesian kernel learning for high-dimensional regression and classification
dc.type	Dissertation

Files

Original bundle

Name: WeikangDuan2022.pdf
Size: 561.84 KB
Format: Adobe Portable Document Format

Collections

K-State Electronic Theses, Dissertations, and Reports: 2004 -