VQ (Vendor Qualification) and IOQ (Installation and Operation Qualification) audits are implemented in warehouses to ensure all equipment being turned over in the fulfillment network meets the quality standards. Audit checks are likely to be skipped if there are many checks to be performed in a short time. In addition, exploratory data analysis reveals several instances of similar checks being performed on the same assets and thus, duplicating the effort. In this work, Natural Language Processing and Machine Learning are applied to trim a large checklist dataset for a network of warehouses by identifying similarities and duplicates, and predict the non-critical ones with a high passing rate. The study proposes ML classifiers to identify checks which have a high passing probability of IOQ and VQ and assign priorities to checks to be prioritized when the time is not available to perform all checks. This research proposes using NLP-based BlazingText classifier to throttle the checklists with a high passing rate, which can reduce 10%-37% of the checks and achieve significant cost reduction. The applied algorithm over performs Random Forest and Neural Network classifiers and achieves an area under the curve of 90%. Because of imbalanced data, down-sampling and upweighting have shown a positive impact on the models' accuracy using F1 score, which improve from 8% to 75%. In addition, the proposed duplicate detection process identifies 17% possible redundant checks to be trimmed.
翻译:VQ( Vendor 资格) 和 IOQ( 测试和运作资格) 审计在仓库中进行,以确保履行网络中所有设备都符合质量标准。如果在很短的时间内要进行许多检查,审计检查可能会被跳过。此外,探索性数据分析还揭示了在同一资产上进行类似检查的一些事例,从而重复了努力。在这项工作中,自然语言处理和机器学习应用到为仓库网络修剪一个大型的核对清单数据集,方法是查明相似和重复之处,并预测非关键设备的超速率很高。研究建议ML分类人员确定检查是否具有IOQ和VQ的高超过概率,并在没有时间进行所有检查时指定检查的优先次序。这项研究提议使用基于NLP的BlazingText分类器,用高超速速度将核对清单压坏,这可以减少检查的10%-37%,并实现大幅降低成本。 应用算法对随机森林和神经网络分类的分类,并用90%的评分级曲线下显示一个区域,因为评级为8 % 。