One of the significant steps in the process leading to the identification of proteins is mass spectrometry, which allows for obtaining information about the structure of proteins. Removing isotope peaks from the mass spectrum is vital and it is done in a process called deisotoping. There are different algorithms for deisotoping, but they have their limitations, they are dedicated to different methods of mass spectrometry. Data from experiments performed with the MALDI-ToF technique are characterized by high dimensionality. This paper presents a method for identifying isotope envelopes in MALDI-ToF molecular imaging data based on the Mamdani-Assilan fuzzy system and spatial maps of the molecular distribution of peaks included in the isotopic envelope. Several image texture measures were used to evaluate spatial molecular distribution maps. The algorithm was tested on eight datasets obtained from the MALDI-ToF experiment on samples from the National Institute of Oncology in Gliwice from patients with cancer of the head and neck region. The data were subjected to pre-processing and feature extraction. The results were collected and compared with three existing deisotoping algorithms. The analysis of the obtained results showed that the method for identifying isotopic envelopes proposed in this paper enables the detection of overlapping envelopes by using the approach oriented to study peak pairs. Moreover, the proposed algorithm enables the analysis of large data sets.
翻译:在确定蛋白质的过程中,一个重要步骤是质量光谱测量,以便获得关于蛋白质结构的信息。从质量频谱中去除同位素峰值至关重要,而且是在一种称为脱索方位的过程中完成的。有各种不同的脱索方程算法,但它们有其局限性,专门使用不同的质量光谱测量方法。通过MALDI-TOF技术的实验获得的数据具有高维度特征。本文介绍了一种方法,用以根据Mamdani-Assilan fuzzy系统以及同位素分布空间分布的分子分布图,在MALDI-TOF分子成像数据中识别同位素封套。根据Mamdani-Assilan fuzzy系统以及包含在等离子封封中的分子分布空间分布图绘制了同位素封顶峰。使用了几种图像纹度测量方法来评价空间分子分布图。该算法是用从MALDI-TOF实验获得的8个数据集测试的。从Gliwice国家肿瘤研究所从头部和颈部癌症患者取的样本采集的数据。这些数据是经过预处理和特征提取的。数据提取的。与特征提取的结果与现有脱索方位方位分析模型分析结果比较,通过现有的脱剖式分析模型分析结果显示分析结果显示了这一分析结果。