Deep learning (DL) models, a specialized class of multilayer neural networks, have become central to time-series forecasting in critical domains such as environmental monitoring and the Internet of Things (IoT). Among these, Bidirectional Long Short-Term Memory (BiLSTM) architectures are particularly effective in capturing complex temporal dependencies. However, the robustness and generalization of such models are highly sensitive to input data characteristics - an aspect that remains underexplored in existing literature. This study presents a systematic empirical analysis of two key data-centric factors: input sequence length and additive noise. To support this investigation, a modular and reproducible forecasting pipeline is developed, incorporating standardized preprocessing, sequence generation, model training, validation, and evaluation. Controlled experiments are conducted on three real-world datasets with varying sampling frequencies to assess BiLSTM performance under different input conditions. The results yield three key findings: (1) longer input sequences significantly increase the risk of overfitting and data leakage, particularly in data-constrained environments; (2) additive noise consistently degrades predictive accuracy across sampling frequencies; and (3) the simultaneous presence of both factors results in the most substantial decline in model stability. While datasets with higher observation frequencies exhibit greater robustness, they remain vulnerable when both input challenges are present. These findings highlight important limitations in current DL-based forecasting pipelines and underscore the need for data-aware design strategies. This work contributes to a deeper understanding of DL model behavior in dynamic time-series environments and provides practical insights for developing more reliable and generalizable forecasting systems.
翻译:深度学习(DL)模型作为多层神经网络的一个专门类别,已成为环境监测和物联网(IoT)等关键领域中时间序列预测的核心方法。其中,双向长短期记忆(BiLSTM)架构在捕捉复杂时间依赖性方面尤为有效。然而,此类模型的鲁棒性和泛化能力对输入数据特征高度敏感——这一方面在现有文献中仍未得到充分探索。本研究对两个以数据为中心的关键因素进行了系统的实证分析:输入序列长度和加性噪声。为支持此项研究,我们开发了一个模块化且可复现的预测流程,包含标准化的预处理、序列生成、模型训练、验证和评估。我们在三个具有不同采样频率的真实世界数据集上进行了受控实验,以评估BiLSTM在不同输入条件下的性能。结果得出三个关键发现:(1)较长的输入序列会显著增加过拟合和数据泄漏的风险,尤其是在数据受限的环境中;(2)加性噪声在不同采样频率下均会持续降低预测精度;(3)两种因素同时存在时,模型稳定性下降最为显著。虽然观测频率较高的数据集表现出更强的鲁棒性,但当两种输入挑战同时存在时,它们仍然脆弱。这些发现凸显了当前基于DL的预测流程中的重要局限性,并强调了数据感知设计策略的必要性。本工作有助于更深入地理解DL模型在动态时间序列环境中的行为,并为开发更可靠、更可泛化的预测系统提供了实用见解。