Biclustering is a powerful approach to search for patterns in data, as it can be driven by a function that measures the quality of diverse types of patterns of interest. However, due to its computational complexity, the exploration of the search space is usually guided by an algorithmic strategy, sometimes introducing random factors that simplify the computational cost (e.g. greedy search or evolutionary computation). Shifting patterns are specially interesting as they account constant fluctuations in data, i.e. they capture situations in which all the values in the pattern move up or down for one dimension maintaining the range amplitude for all the dimensions. This behaviour is very common in nature, e.g. in the analysis of gene expression data, where a subset of genes might go up or down for a subset of patients or experimental conditions, identifying functionally coherent categories. Boolean reasoning was recently revealed as an appropriate methodology to address the search for constant biclusters. In this work, this direction is extended to search for more general biclusters that include shifting patterns. The mathematical foundations are described in order to associate Boolean concepts with shifting patterns, and the methodology is presented to show that the induction of shifting patterns by means of Boolean reasoning is due to the ability of finding all inclusion--maximal {\delta}-shifting patterns. Experiments with a real dataset show the potential of our approach at finding biclusters with {\delta}-shifting patterns, which have been evaluated with the mean squared residue (MSR), providing an excellent performance at finding results very close to zero.
翻译:生物群集是搜索数据模式的强有力方法,因为它可以由测量不同类型利益模式质量的函数驱动。然而,由于计算的复杂性,搜索空间的探索通常以算法战略为指导,有时引入随机因素,简化计算成本(如贪婪搜索或进化计算)。 变化模式特别有趣,因为它们考虑到数据的不断波动,即它们捕捉模式中的所有值上下移动的情况,一个维系所有层面的幅度振幅的平均值。这种行为在性质上非常常见,例如,在基因表达数据分析中非常常见,对于一组患者或实验条件而言,一组基因可能会上下沉,确定功能一致性类别。最近,布利恩推理被揭示为一种适当的方法,用以解决对固定双组群群的搜索。在这项工作中,这一方向扩大到寻找更一般性的双组群群,包括变化模式。数学基础被描述是为了将布利恩概念与变化模式挂钩,而数学基础被描述为非常常见的。 在基因表达模式的模型分析中,将一组基因群集成的基因群的基因群群集可能会上升或下降。 方法显示,通过BOUILAA级的演变变变式分析能力,以显示BILA级分析方式的演进能力,而发现所有BILLLLLA的演进的能力是找到的推。