Accurate and efficient detection of ovarian cancer at early stages is critical to ensure proper treatments for patients. Among the first-line modalities investigated in studies of early diagnosis are features distilled from protein mass spectra. This method, however, considers only a specific subset of spectral responses and ignores the interplay among protein expression levels, which can also contain diagnostic information. We propose a new modality that automatically searches protein mass spectra for discriminatory features by considering the self-similar nature of the spectra. Self-similarity is assessed by taking a wavelet decomposition of protein mass spectra and estimating the rate of level-wise decay in the energies of the resulting wavelet coefficients. Level-wise energies are estimated in a robust manner using distance variance, and rates are estimated locally via a rolling window approach. This results in a collection of rates that can be used to characterize the interplay among proteins, which can be indicative of cancer presence. Discriminatory descriptors are then selected from these evolutionary rates and used as classifying features. The proposed wavelet-based features are used in conjunction with features proposed in the existing literature for early stage diagnosis of ovarian cancer using two datasets published by the American National Cancer Institute. Including the wavelet-based features from the new modality results in improvements in diagnostic performance for early-stage ovarian cancer detection. This demonstrates the ability of the proposed modality to characterize new ovarian cancer diagnostic information.
翻译:早期准确而高效地检测卵巢癌对于确保病人得到适当治疗至关重要。在早期诊断研究中调查的第一线模式中,有从蛋白质质质谱中蒸馏的特征。但是,这种方法只考虑一个特定的谱子反应,忽视蛋白表达水平之间的相互作用,而蛋白质表达水平也可以包含诊断信息。我们建议一种新的模式,通过考虑到蛋白质的自我相似性质,自动搜索蛋白质质质谱,以歧视性特征为特征。通过对蛋白质质质质谱进行分解并估计由此产生的波子系数能量水平衰减速度来评估自相异性。使用距离差异以稳健的方式估算水平能量,并通过滚动窗口方法估算当地比率。这可以用来描述蛋白质之间相互作用的速率,这可以表明癌症的存在。然后从这些进化速度中选择具有歧视性的描述器,并用作分类特征。拟议中的波质特征与现有文献中提议的用于早期诊断结果的特征一起使用,用于早期诊断结果分析结果,即用两种阶段的美国癌症诊断结果,用两种阶段数据显示美国癌症诊断结果。