Altmetric
BETH dataset: real cybersecurity data for anomaly detection research
Publication available at: | http://www.gatsby.ucl.ac.uk/~balaji/udl2021/accepted-papers/UDL2021-paper-033.pdf |
---|
Title: | BETH dataset: real cybersecurity data for anomaly detection research |
Authors: | Highnam, K Arulkumaran, K Hanif, Z Jennings, N |
Item Type: | Working Paper |
Abstract: | We present the BETH cybersecurity dataset for anomaly detection and out-of-distribution analysis. With real “anomalies” collected using a novel tracking system, our dataset contains over eight million data points tracking 23 hosts. Each host has captured benign activity and, at most, a single attack, enabling cleaner behavioural analysis. In addition to being one of the most modern and extensive cybersecurity datasets available, BETH enables the development of anomaly detection algorithms on heterogeneously-structured real-world data, with clear downstream applications. We give details on the data collection, suggestions on pre-processing, and analysis with initial anomaly detection benchmarks on a subset of the data. |
Issue Date: | 23-Jul-2021 |
URI: | http://hdl.handle.net/10044/1/90568 |
Publisher: | Gatsby Computational Neuroscience Unit |
Copyright Statement: | © 2021 The Author(s). |
Publication Status: | Published |
Open Access location: | http://www.gatsby.ucl.ac.uk/~balaji/udl2021/accepted-papers/UDL2021-paper-033.pdf |
Appears in Collections: | Computing Electrical and Electronic Engineering Faculty of Natural Sciences |