The Jensen inequality is a widely used tool in a multitude of fields, such as for example information theory and machine learning. It can be also used to derive other standard inequalities such as the inequality of arithmetic and geometric means or the H\"older inequality. In a probabilistic setting, the Jensen inequality describes the relationship between a convex function and the expected value. In this work, we want to look at the probabilistic setting from the reverse direction of the inequality. We show that under minimal constraints and with a proper scaling, the Jensen inequality can be reversed. We believe that the resulting tool can be helpful for many applications and provide a variational estimation of mutual information, where the reverse inequality leads to a new estimator with superior training behavior compared to current estimators.
翻译:Jensen不平等是许多领域广泛使用的工具,例如信息理论和机器学习等。它也可以用来获取其他标准不平等,如计算和几何手段的不平等或H\'older不平等。在概率学的环境下,Jensen不平等描述了曲线函数和预期价值之间的关系。在这项工作中,我们希望从不平等的相反方向来审视概率环境。我们表明,在最低限度的限制和适当规模的情况下,Jensen的不平等可以逆转。我们认为,由此产生的工具对许多应用都有帮助,并且提供了对相互信息的变异估计,在这种情况下,逆向不平等导致一个新的估算者,其培训行为优于目前的估算者。