In this work, we propose an interesting method that aims to approximate an activation function over some domain by polynomials of the presupposing low degree. The main idea behind this method can be seen as an extension of the ordinary least square method and includes the gradient of activation function into the cost function to minimize.
翻译:在这项工作中,我们提出了一种有趣的方法,目的是通过预设的低度多位数来将某些领域的激活功能近似于某个领域的激活功能。 这种方法背后的主要理念可以被视为普通最低平方法的延伸,并将激活功能的梯度纳入成本功能,以最大限度地降低成本功能。