Selected results from clustering and analyzing stock market trade data

dc.contributor.authorZhang, Zhihan
dc.date.accessioned2018-11-15T20:06:28Z
dc.date.available2018-11-15T20:06:28Z
dc.date.graduationmonthDecemberen_US
dc.date.issued2018-12-01
dc.date.published2018en_US
dc.description.abstractThe amount of data generated from stock market trading is massive. For example, roughly 10 million trades are performed each day on the NASDAQ stock exchange. A significant proportion of these trades are made by high-frequency traders. These entities make on the order of thousands or more trades a day. However, the stock-market factors that drive the decisions of high-frequency traders are poorly understood. Recently, hybridized threshold clustering (HTC) has been proposed as a way of clustering large-to-massive datasets. In this report, we use three months of NASDAQ HFT data---a dataset containing information on all trades of 120 different stocks including identifiers on whether the buyer and/or seller were high-frequency traders---to investigate the trading patterns of high-frequency traders, and we explore the use of HTC to identify these patterns. We find that, while HTC can be successfully performed on the NASDAQ HFT dataset, the amount of information gleaned from this clustering is limited. Instead, we show that an understanding of the habits of high-frequency traders may be gained by looking at \textit{janky} trades---those in which the number of shares traded is not a multiple of 10. We demonstrate evidence that janky trades are more common for high-frequency traders. Additionally, we suggest that a large number of small, janky trades may help signal that a large trade will happen shortly afterward.en_US
dc.description.advisorMichael Higginsen_US
dc.description.degreeMaster of Scienceen_US
dc.description.departmentDepartment of Statisticsen_US
dc.description.levelMastersen_US
dc.identifier.urihttp://hdl.handle.net/2097/39297
dc.language.isoen_USen_US
dc.subjectClusteringen_US
dc.subjectStock market dataseten_US
dc.subjectHigh-frequency tradesen_US
dc.titleSelected results from clustering and analyzing stock market trade dataen_US
dc.typeReporten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ZhihanZhang2018.pdf
Size:
1.62 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: