本文介绍如何使用Tensorflow对MNIST数据集进行手写字识别,涉及到神经网络优化技术:模型的正则化、模型的滑动平均、学习率衰减,也涉及到Tensorflow中的实用化技术:模型的持久化、断点继续训练。
1 | import tensorflow as tf |
搭建模型
1 | # 神经网络结构相关的参数 |
1 | # 神经网络训练相关的参数 |
1 | def get_weight_variable(shape, regularizer): |
训练模型
1 | from tensorflow.examples.tutorials.mnist import input_data |
1 | tf.reset_default_graph() |
Extracting /home/seisinv/data/mnist/train-images-idx3-ubyte.gz
Extracting /home/seisinv/data/mnist/train-labels-idx1-ubyte.gz
Extracting /home/seisinv/data/mnist/t10k-images-idx3-ubyte.gz
Extracting /home/seisinv/data/mnist/t10k-labels-idx1-ubyte.gz
After 1 training step(s), loss on training batch is 7.21735.
After 1001 training step(s), loss on training batch is 0.43611.
After 2001 training step(s), loss on training batch is 0.445638.
After 3001 training step(s), loss on training batch is 0.653738.
After 4001 training step(s), loss on training batch is 0.441901.
After 5001 training step(s), loss on training batch is 0.219952.
After 6001 training step(s), loss on training batch is 0.319431.
After 7001 training step(s), loss on training batch is 0.292837.
After 8001 training step(s), loss on training batch is 0.249832.
After 9001 training step(s), loss on training batch is 0.38989.
After 10001 training step(s), loss on training batch is 0.439101.
After 11001 training step(s), loss on training batch is 0.315771.
After 12001 training step(s), loss on training batch is 0.309541.
After 13001 training step(s), loss on training batch is 0.270133.
After 14001 training step(s), loss on training batch is 0.204996.
After 15001 training step(s), loss on training batch is 0.35835.
After 16001 training step(s), loss on training batch is 0.343041.
After 17001 training step(s), loss on training batch is 0.419665.
After 18001 training step(s), loss on training batch is 0.145532.
After 19001 training step(s), loss on training batch is 0.308465.
After 20001 training step(s), loss on training batch is 0.265333.
After 21001 training step(s), loss on training batch is 0.149025.
After 22001 training step(s), loss on training batch is 0.250994.
After 23001 training step(s), loss on training batch is 0.43688.
After 24001 training step(s), loss on training batch is 0.381845.
After 25001 training step(s), loss on training batch is 0.277657.
After 26001 training step(s), loss on training batch is 0.492095.
After 27001 training step(s), loss on training batch is 0.334551.
After 28001 training step(s), loss on training batch is 0.19148.
After 29001 training step(s), loss on training batch is 0.244014.
测试模型
1 | # EVAL_INTERVAL_SECS = 10 |
1 | tf.reset_default_graph() |
Extracting /home/seisinv/data/mnist/train-images-idx3-ubyte.gz
Extracting /home/seisinv/data/mnist/train-labels-idx1-ubyte.gz
Extracting /home/seisinv/data/mnist/t10k-images-idx3-ubyte.gz
Extracting /home/seisinv/data/mnist/t10k-labels-idx1-ubyte.gz
INFO:tensorflow:Restoring parameters from model/model_nn_mnist.ckpt-29001
After 29001 training setp(s), validation accuracy = 0.9292
参考资料
- 郑泽宇、梁博文和顾思宇,Tensorflow: 实战Google深度学习框架(第二版)