March 27, 2020

TensorFlow 2.2 adds a train_step method to keras.Model, letting developers freely define the model's automated training process

专知会员服务 (Zhuanzhi Member Service)

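tf.keras provides an easy-to-use TF API, and keras.Model is one of its most important parts: it encapsulates a model's parameters and structure, as well as procedures such as training and testing. To let users better customize the training process, TF 2.2 introduces a new extensible interface for this API.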

At the TensorFlow Dev Summit 2020 (TF Dev Summit '20), the presenters introduced train_step, the interface that TF 2.2 adds to keras.Model for customizing the training process.

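In earlier versions, keras.Model in tf.keras already encapsulated the training process, but that encapsulation was too much of a black box. Many developers therefore chose not to use the built-in training of keras.Model and instead called tf.GradientTape and related APIs explicitly to perform backpropagation and parameter updates. Typically, a developer would define a training step like this:

```python
import tensorflow as tf

# mnist_model, loss_object, optimizer and loss_history are assumed to be
# defined elsewhere (a Keras model, a loss, an optimizer and a Python list).
def train_step(images, labels):
    with tf.GradientTape() as tape:
        logits = mnist_model(images, training=True)

        # Add asserts to check the shape of the output.
        tf.debugging.assert_equal(logits.shape, (32, 10))

        loss_value = loss_object(labels, logits)

    loss_history.append(loss_value.numpy().mean())
    grads = tape.gradient(loss_value, mnist_model.trainable_variables)
    optimizer.apply_gradients(zip(grads, mnist_model.trainable_variables))
```

The training process is then scheduled manually with a loop:

```python
def train(epochs):
    for epoch in range(epochs):
        # dataset is assumed to be a batched tf.data.Dataset of (images, labels).
        for (batch, (images, labels)) in enumerate(dataset):
            train_step(images, labels)
        print('Epoch {} finished'.format(epoch))
```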

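keras.Model ships with many convenient features, such as progress display, callback-based TensorBoard logging, and callback-based early stopping. These conveniences are generally available only when training goes through the built-in training mechanism of keras.Model. The manual approach above gives developers full control over training, but it also means they lose some of these built-in conveniences.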

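TF 2.2 introduces a train_step method directly on the keras.Model class. When subclassing keras.Model, developers only need to override the parent's train_step with their own method to define a controllable training process, while still using the built-in scheduling of keras.Model (for example through `fit`) to run training.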
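A minimal sketch of such an override is shown below. It is an illustration under stated assumptions, not the code from the summit talk: the class name `CustomModel`, the toy regression data, the hand-written mean-squared-error loss, and the `loss_tracker` metric are all hypothetical choices made for this example. The loss is computed directly inside `train_step` so that the sketch does not depend on how a particular Keras version wires up the compiled loss.

```python
import numpy as np
import tensorflow as tf


class CustomModel(tf.keras.Model):
    """Hypothetical one-layer regressor with a hand-written training step."""

    def __init__(self):
        super().__init__()
        self.dense = tf.keras.layers.Dense(1)
        # Running mean of the per-batch loss, reported by fit().
        self.loss_tracker = tf.keras.metrics.Mean(name="loss")

    def call(self, inputs, training=False):
        return self.dense(inputs)

    @property
    def metrics(self):
        # Listing the tracker here lets fit() reset it at the start of each epoch.
        return [self.loss_tracker]

    def train_step(self, data):
        # fit() passes each batch here; for array inputs it is an (x, y) tuple.
        x, y = data
        with tf.GradientTape() as tape:
            y_pred = self(x, training=True)
            # Mean squared error, written out explicitly (an assumption of this sketch).
            loss = tf.reduce_mean(tf.square(y - y_pred))
        grads = tape.gradient(loss, self.trainable_variables)
        self.optimizer.apply_gradients(zip(grads, self.trainable_variables))
        self.loss_tracker.update_state(loss)
        # The returned dict feeds the progress bar, History and callbacks.
        return {"loss": self.loss_tracker.result()}


# Toy data: the target is the sum of the four input features.
np.random.seed(0)
x = np.random.rand(64, 4).astype("float32")
y = x.sum(axis=1, keepdims=True)

model = CustomModel()
# No loss is passed to compile() because train_step computes it itself.
model.compile(optimizer=tf.keras.optimizers.SGD(learning_rate=0.05))

# The built-in scheduling, including callbacks, still drives the custom step.
history = model.fit(
    x, y,
    batch_size=16,
    epochs=3,
    verbose=0,
    callbacks=[tf.keras.callbacks.EarlyStopping(monitor="loss", patience=2)],
)
```

Because the gradient logic lives in `train_step` while `fit` still owns the loop, features such as the progress bar and the `EarlyStopping` callback remain usable without writing a manual epoch and batch loop.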
