Leveraging a natural language processing approach towards a more informed vulnerability documentation process

dc.contributor.authorAnshutz, BreAnn
dc.date.accessioned2024-10-22T18:55:13Z
dc.date.available2024-10-22T18:55:13Z
dc.date.graduationmonthDecember
dc.date.issued2024
dc.description.abstractCybersecurity vulnerabilities are an ever-increasing threat to the current cybersecurity landscape. It has been previously suggested that Twitter is a robust data source for gathering Cyber Threat Intelligence data. This includes cyber vulnerabilities which can be retrieved via their Common Vulnerabilities and Exposures (CVE) identifier. However, the culture of post-disclosure vulnerability discussion is changing to sometimes include a ”nickname”, or a short name utilized instead of the CVE identifier. This trend poses a significant challenge to the retrieval of CVE-relevant information as not all text includes the CVE identifier. To address this challenge, a system was designed by utilizing an off-the-shelf machine learning model to link tweets that do not explicitly mention a CVE Identifier to their corresponding CVE. The system was tested utilizing several datasets and metrics to determine parameters required to obtain satisfactory performance with regards to retrieved information. The results show that machine learning makes it possible to retrieve relevant information corresponding to a specific CVE in the absence of the CVE identifier.
dc.description.advisorDoina Caragea
dc.description.degreeMaster of Science
dc.description.departmentDepartment of Computer Science
dc.description.levelMasters
dc.identifier.urihttps://hdl.handle.net/2097/44640
dc.language.isoen_US
dc.subjectCybersecurity
dc.subjectCyber Threat Intel
dc.subjectTwitter
dc.subjectNatural Language Processing
dc.titleLeveraging a natural language processing approach towards a more informed vulnerability documentation process
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
BreAnnAnshutz2024.pdf
Size:
401.19 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.65 KB
Format:
Item-specific license agreed upon to submission
Description: