Synthetic datasets are beneficial for machine learning researchers due to the possibility of experimenting with new strategies and algorithms in the training and testing phases. These datasets can easily include more scenarios that might be costly to research with real data or can complement and, in some cases, replace real data measurements, depending on the quality of the synthetic data. They can also solve the unbalanced data problem, avoid overfitting, and can be used in training while testing can be done with real data. In this paper, we present, to the best of our knowledge, the first synthetic dataset for Unmanned Aerial Vehicle (UAV) attacks in 5G and beyond networks based on the following key observable network parameters that indicate power levels: the Received Signal Strength Indicator (RSSI) and the Signal to Interference-plus-Noise Ratio (SINR). The main objective of this data is to enable deep network development for UAV communication security. Especially, for algorithm development or the analysis of time-series data applied to UAV attack recognition. Our proposed dataset provides insights into network functionality when static or moving UAV attackers target authenticated UAVs in an urban environment. The dataset also considers the presence and absence of authenticated terrestrial users in the network, which may decrease the deep networks ability to identify attacks. Furthermore, the data provides deeper comprehension of the metrics available in the 5G physical and MAC layers for machine learning and statistics research. The dataset will available at link archive-beta.ics.uci.edu
翻译:由于有可能在培训和测试阶段试验新的战略和算法,合成数据集对机器学习研究人员是有益的,因为有可能在培训和测试阶段试验新的战略和算法,这些数据集可以很容易地包括更多的假设情景,这些假设情景对于用真实数据进行研究可能代价高昂,或者可以补充或在某些情况下取代真实的数据测量,这取决于合成数据的质量。它们还可以解决不平衡的数据问题,避免过度装配,并可用于培训,同时可以用真实数据进行测试。在本文中,我们根据我们的知识,提供第一个合成数据集,用于5G和5G以上网络中无人驾驶航空飞行器(UAV)袭击的合成数据集,其基础是显示功率水平的以下主要可观测网络参数:收到信号强度指标(RSSI)和信号到干涉度比率(SINR),这些数据的主要目的是为UAV通信安全提供深度网络的深度网络开发深度网络,特别是用于算法开发或分析用于UAVAV攻击识别的时间序列数据。我们提议的数据集在固定或移动UAV攻击目标在验证UAV攻击目标在深度网络中的固定或移动时提供网络功能,用于更深层数据库数据库中的现有数据库,这些数据还能够提供现有数据库。