C-SHIFT 使共变法正常化的算法 (The C-SHIFT algorithm for normalizing covariances) - 专知论文

会员服务 ·

0

规范化的 · 协方差矩阵 · 协变量偏移 · Performer · 估计/估计量 ·

2021 年 8 月 5 日

The C-SHIFT algorithm for normalizing covariances

翻译：C-SHIFT 使共变法正常化的算法

Evgenia Chunikhina,Paul Logan,Yevgeniy Kovchegov,Anatoly Yambartsev,Debashis Mondal,Andrey Morgun

Omics technologies are powerful tools for analyzing patterns in gene expression data for thousands of genes. Due to a number of systematic variations in experiments, the raw gene expression data is often obfuscated by undesirable technical noises. Various normalization techniques were designed in an attempt to remove these non-biological errors prior to any statistical analysis. One of the reasons for normalizing data is the need for recovering the covariance matrix used in gene network analysis. In this paper, we introduce a novel normalization technique, called the covariance shift (C-SHIFT) method. This normalization algorithm uses optimization techniques together with the blessing of dimensionality philosophy and energy minimization hypothesis for covariance matrix recovery under additive noise (in biology, known as the bias). Thus, it is perfectly suited for the analysis of logarithmic gene expression data. Numerical experiments on synthetic data demonstrate the method's advantage over the classical normalization techniques. Namely, the comparison is made with Rank, Quantile, cyclic LOESS (locally estimated scatterplot smoothing), and MAD (median absolute deviation) normalization methods. We also evaluate the performance of C-SHIFT algorithm on real biological data.

翻译：基因技术是分析数千种基因基因的基因表达数据模式的有力工具。由于实验中的一些系统变化,原始基因表达数据往往被不受欢迎的技术噪音所混淆。在任何统计分析之前,设计了各种正常化技术,试图消除这些非生物错误。数据正常化的原因之一是需要恢复基因网络分析中使用的共变矩阵。在本文件中,我们采用了一种新型的正常化技术,称为常变转换(C-SHIFT)方法。这种正常化算法使用优化技术,加上在添加噪声(生物学中称为偏差)下利用维度理论和能量最小化假设来恢复共变矩阵。因此,它完全适合于分析对数基因表达数据。合成数据中的数值实验表明该方法比典型的正常化技术更有利。也就是说,我们与级、量、量、周期性LOESSS(当地估计的散落平法)和MAD(中度绝对偏差)方法进行了比较。我们还评估了C-SFIA的实性数据性。

0

相关内容

规范化的

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

已删除

将门创投

3+阅读 · 2019年9月4日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

lightgbm algorithm case of kaggle（上）

lightgbm algorithm case of kaggle（上）

R语言中文社区

8+阅读 · 2018年3月20日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Model-Adaptive Interface Generation for Data-Driven Discovery

Arxiv

0+阅读 · 2021年10月5日

Clustering a Mixture of Gaussians with Unknown Covariance

Arxiv

0+阅读 · 2021年10月4日

Approximations of energy minimization in cell-induced phase transitions of fibrous biomaterials: $Γ$-convergence analysis

Arxiv

0+阅读 · 2021年10月3日

Optimal Change-Point Detection with Training Sequences in the Large and Moderate Deviations Regimes

Arxiv

0+阅读 · 2021年10月3日

A Lagged Particle Filter for Stable Filtering of certain High-Dimensional State-Space Models

Arxiv

0+阅读 · 2021年10月2日

Inference on the maximal rank of time-varying covariance matrices using high-frequency data

Arxiv

0+阅读 · 2021年10月1日

Smooth Normalizing Flows

Arxiv

1+阅读 · 2021年10月1日

Distributed Estimation of Sparse Inverse Covariances

Arxiv

0+阅读 · 2021年9月30日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

A three domain covariance framework for EEG/MEG data

Arxiv

3+阅读 · 2014年10月9日

VIP会员

文章信息

相关主题

协方差矩阵

协变量偏移

估计/估计量

相关VIP内容

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型智能体强化学习：全景综述

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

【伯克利博士论文】从推理服务到训练：面向大规模 LLM 智能体的高效系统

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

相关资讯

已删除

将门创投

3+阅读 · 2019年9月4日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

lightgbm algorithm case of kaggle（上）

lightgbm algorithm case of kaggle（上）

R语言中文社区

8+阅读 · 2018年3月20日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Model-Adaptive Interface Generation for Data-Driven Discovery

Arxiv

0+阅读 · 2021年10月5日

Clustering a Mixture of Gaussians with Unknown Covariance

Arxiv

0+阅读 · 2021年10月4日

Approximations of energy minimization in cell-induced phase transitions of fibrous biomaterials: $Γ$-convergence analysis

Arxiv

0+阅读 · 2021年10月3日

Optimal Change-Point Detection with Training Sequences in the Large and Moderate Deviations Regimes

Arxiv

0+阅读 · 2021年10月3日

A Lagged Particle Filter for Stable Filtering of certain High-Dimensional State-Space Models

Arxiv

0+阅读 · 2021年10月2日

Inference on the maximal rank of time-varying covariance matrices using high-frequency data

Arxiv

0+阅读 · 2021年10月1日

Smooth Normalizing Flows

Arxiv

1+阅读 · 2021年10月1日

Distributed Estimation of Sparse Inverse Covariances

Arxiv

0+阅读 · 2021年9月30日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

A three domain covariance framework for EEG/MEG data

Arxiv

3+阅读 · 2014年10月9日

微信扫码咨询专知VIP会员