Influenza occurs every season and occasionally causes pandemics. Despite its low mortality rate, influenza is a major public health concern, as it can be complicated by severe diseases like pneumonia. A fast, accurate and low-cost method to predict the origin host and subtype of influenza viruses could help reduce virus transmission and benefit resource-poor areas. In this work, we propose multi-channel neural networks to predict antigenic types and hosts of influenza A viruses with hemagglutinin and neuraminidase protein sequences. An integrated data set containing complete protein sequences were used to produce a pre-trained model, and two other data sets were used for testing the model's performance. One test set contained complete protein sequences, and another test set contained incomplete protein sequences. The results suggest that multi-channel neural networks are applicable and promising for predicting influenza A virus hosts and antigenic subtypes with complete and partial protein sequences.
翻译:流感是一种快速、准确和低成本的预测流感病毒原宿主和亚型的方法,可以帮助减少病毒传播,使资源贫乏地区受益。在这项工作中,我们提议建立多通道神经网络,以预测抗原类型和携带长效甘油素和尼拉米丁酶蛋白序列的流感病毒宿主。一个包含完整的蛋白质序列的综合数据集用于制作一个预培训模型,另外两个数据集用于测试模型的性能。一个测试集包含完整的蛋白序列,另一个测试集包含不完整蛋白序列。结果显示多通道神经网络适用于并有望预测流感A病毒宿主和具有完整和部分蛋白序列的抗病毒亚型。