Finding common support through largest connected components and its implementation

dc.contributor.authorHongyuan, Lu
dc.date.accessioned2019-04-18T15:27:59Z
dc.date.available2019-04-18T15:27:59Z
dc.date.graduationmonthMayen_US
dc.date.issued2019-05-01
dc.date.published2019en_US
dc.description.abstractIn an observational study, the average treatment effect may only be reliably estimated for a subset of units under which the covariate space of both treatment and control units overlap. This is known as the common support assumption. In this report, we develop a method to find a region of common support. The method is as follows. Given a distance function to measure dissimilarity between any two units with differing treatment statuses, we can construct an adjacency list by drawing edges between each pair of treated and control units that have distance no larger than some pre-specified threshold. Then, all connected components of the graph are found. Finally, a region of common support is found by obtain- ing the largest connected components (LCC) (e.g. the connected components with the most treated units) of this graph. We implement the LCC algorithm by using binary search trees to find all the connected graphs from sample data and sorting them by size. This algorithm requires O(n²) runtime and O(n) memory (where n is the number of units in the observational study. We then create an R package implementing this LCC algorithm. Finally, we use our R package to compare the performance of LCC to that of other common support methods on simulated data.en_US
dc.description.advisorMichael Higginsen_US
dc.description.degreeMaster of Scienceen_US
dc.description.departmentDepartment of Statisticsen_US
dc.description.levelMastersen_US
dc.identifier.urihttp://hdl.handle.net/2097/39595
dc.language.isoen_USen_US
dc.subjectConnected componentsen_US
dc.subjectLCCen_US
dc.subjectObservational studyen_US
dc.subjectCovariateen_US
dc.subjectPropensity scoreen_US
dc.subjectATTen_US
dc.subjectATEen_US
dc.subjectTreatmenten_US
dc.subjectControlen_US
dc.subjectTreatment effecten_US
dc.titleFinding common support through largest connected components and its implementationen_US
dc.typeReporten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
HongyuanLu2019.pdf
Size:
8.09 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: