The tools and technology that are currently used to analyze chemical compound structures that identify polymer types in microplastics are not well-calibrated for environmentally weathered microplastics. Microplastics that have been degraded by environmental weathering factors can offer less analytic certainty than samples of microplastics that have not been exposed to weathering processes. Machine learning tools and techniques allow us to better calibrate the research tools for certainty in microplastics analysis. In this paper, we investigate whether the signatures (Raman shift values) are distinct enough such that well studied machine learning (ML) algorithms can learn to identify polymer types using a relatively small amount of labeled input data when the samples have not been impacted by environmental degradation. Several ML models were trained on a well-known repository, Spectral Libraries of Plastic Particles (SLOPP), that contain Raman shift and intensity results for a range of plastic particles, then tested on environmentally aged plastic particles (SloPP-E) consisting of 22 polymer types. After extensive preprocessing and augmentation, the trained random forest model was then tested on the SloPP-E dataset resulting in an improvement in classification accuracy of 93.81% from 89%.
翻译:目前用于分析微塑料中识别聚合物类型的化学化合物结构的工具和技术目前用于分析微塑料中识别聚合物类型的化学化合物结构的工具和技术,对环境经风化的微塑料没有进行适当校准;环境经风化因素使微塑料退化的微塑料与未受风化过程接触的微塑料样本相比,可以提供较少分析的确定性;机器学习工具和技术使我们能够更好地校准研究工具,以便在微塑料分析中确定聚合物类型;在本文中,我们调查签名(拉曼转移值)是否足够不同,以便经过良好研究的机器学习算法能够学会如何在样品没有受到环境退化影响时使用相对少量的标签输入数据来识别聚合物类型;一些微塑料模型在众所周知的存放处,即塑料粒子的光谱图书馆(SLOPP)接受了培训,其中载有一系列塑料颗粒的拉曼转移和强度结果,随后对22种聚合物类型的环境老塑料颗粒进行了测试;经过广泛预处理和增强的经过培训的随机森林模型在SloPP-E数据设置方面进行了测试,从而改进了93.81的精确度。