mnistpyfile - juedaiyuer/researchNote GitHub Wiki
mnist文件读取程序笔记
import gzip
import sys
from six.moves import cPickle
def load_data(path='mnist.pkl.gz'):
# path = get_file(path, origin='https://s3.amazonaws.com/img-datasets/mnist.pkl.gz')
path = r'/home/juedaiyuer/Downloads/MNIST_data/mnist.pkl'
if path.endswith('.gz'):
f = gzip.open(path, 'rb')
else:
f = open(path, 'rb')
f = open(path, 'rb')
data = cPickle.load(f)
f.close()
return data # (X_train, y_train), (X_test, y_test)
# the data, shuffled and split between train and test sets
#(X_train, y_train), (X_test, y_test) = mnist.load_data()
(x_train, y_train), (x_test, y_test) = load_data()
使用python的cPickle库中的load函数,可以读取pkl文件的内容
以mnist为例,inf的类型为元祖tuple,他又包含了三个元祖,分别对应训练集,验证集,测试集。每个元祖中又包含两个numpy.ndarray,分别对应训练数据和label数据。训练数据的组成是由50000个含有784个元素的列表组成,每个列表代表一张图片。label数据集是由50000个元素组成的一维numpy.ndarray向量
pkl文件是python里面保存文件的一种格式