Randomization test and correlation effects in high dimensional data

dc.contributor.authorWang, Xiaofei
dc.date.accessioned2012-07-17T20:35:11Z
dc.date.available2012-07-17T20:35:11Z
dc.date.graduationmonthAugusten_US
dc.date.issued2012-07-17
dc.date.published2012en_US
dc.description.abstractHigh-dimensional data (HDD) have been encountered in many fields and are characterized by a “large p, small n” paradigm that arises in genomic, lipidomic, and proteomic studies. This report used a simulation study that employed basic block diagonal covariance matrices to generate correlated HDD. Quantities of interests in such data are, among others, the number of ‘significant’ discoveries. This number can be highly variable when data are correlated. This project compared randomization tests versus usual t-tests for testing of significant effects across two treatment conditions. Of interest was whether the variance of the number of discoveries is better controlled in a randomization setting versus a t-test. The results showed that the randomization tests produced results similar to that of t-tests.en_US
dc.description.advisorGary L. Gadburyen_US
dc.description.degreeMaster of Scienceen_US
dc.description.departmentDepartment of Statisticsen_US
dc.description.levelMastersen_US
dc.identifier.urihttp://hdl.handle.net/2097/14039
dc.language.isoen_USen_US
dc.publisherKansas State Universityen
dc.subjectRandomization testen_US
dc.subjectCorrelation effecten_US
dc.subjectHigh dimensional dataen_US
dc.subject.umiStatistics (0463)en_US
dc.titleRandomization test and correlation effects in high dimensional dataen_US
dc.typeReporten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
XiaofeiWang2012.pdf
Size:
1.29 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: