Altmetric

BETH dataset: real cybersecurity data for anomaly detection research

Title: BETH dataset: real cybersecurity data for anomaly detection research
Authors: Highnam, K
Arulkumaran, K
Hanif, Z
Jennings, N
Item Type: Working Paper
Abstract: We present the BETH cybersecurity dataset for anomaly detection and out-of-distribution analysis. With real “anomalies” collected using a novel tracking system, our dataset contains over eight million data points tracking 23 hosts. Each host has captured benign activity and, at most, a single attack, enabling cleaner behavioural analysis. In addition to being one of the most modern and extensive cybersecurity datasets available, BETH enables the development of anomaly detection algorithms on heterogeneously-structured real-world data, with clear downstream applications. We give details on the data collection, suggestions on pre-processing, and analysis with initial anomaly detection benchmarks on a subset of the data.
Issue Date: 23-Jul-2021
URI: http://hdl.handle.net/10044/1/90568
Publisher: Gatsby Computational Neuroscience Unit
Copyright Statement: © 2021 The Author(s).
Publication Status: Published
Open Access location: http://www.gatsby.ucl.ac.uk/~balaji/udl2021/accepted-papers/UDL2021-paper-033.pdf
Appears in Collections:Computing
Electrical and Electronic Engineering
Faculty of Natural Sciences