700字范文 > 运行caffe识别数字的模型mnist

运行caffe识别数字的模型mnist

时间：2024-04-14 06:21:07

相关推荐

运行caffe识别数字的模型mnist

原始引用地址: [运行caffe识别数字的模型mnist](/caffe_mnist.html)

time: .5.17 23:28

## mnist是啥？

mnist是一个运用神经网络识别数字的模型，可以识别数字0到9.

### 获取mnist模块与数据

对于mnist（就是数字识别）例子，参考以下地址，获取数据训练相关数据：

</gathered/examples/mnist.html>

准备数据：

```

cd $CAFFE_ROOT

./data/mnist/get_mnist.sh

./examples/mnist/create_mnist.sh

```

### 运用多核，加速训练

caffe在纯CPU模式下，使用多核运行

</b876144622/article/details/80009877>

1. sudo apt-get install -y libopenblas-dev

2、修改caffe目录下的Makefile.config文件，将BLAS: =atlas修改为BLAS: =open

3、再编译caffe，首先make clean，清除之前的编译结果，再依次执行make all -j16， make test -j16， make runtest -j16，编译caffe。-j16是指用16个核并行编译caffe，可以大大加快编译速度。

4、编译完成后，执行训练前，需要export OPENBLAS_NUM_THREADS=4, 即使用4个核进行训练

### 开始训练

因为编译时，选择使用cpu，使用要更改文件：

examples/mnist/lenet_solver.prototxt

把：solver_mode: GPU 改为：solver_mode: CPU

训练命令：

```

export PYTHONPATH=/home/xy/works/caffe/python/

time ./examples/mnist/train_lenet.sh #vm用时17m36s j1900 42m55

```

### 进行图片识别

#### 用软件手写一个图片

（注意：识别的图片，**一定要是黑底，数字用白色写**）

安装画图软件gimp：

```

sudo apt-get install gimp

```

gimp 创建27*27的bmp图：

1. 新建file -> new 选width 27 height27 单位为像素，底色选黑

2. 画图：选画笔，左边size ,可以选2，size太大，27*27的画布不够画的。

3. 导出：file->export as, 格式先jpg，

#### 识别手写的图片

在运行python/calssify前要运行安装 protobuf：

```sudo pip install protobuf```

在识别图片前，需要对classify进行更改：

```

git diff python/classify.py

diff --git a/python/classify.py b/python/classify.py

index 4544c51..446c55e 100755

--- a/python/classify.py

+++ b/python/classify.py

@@ -105,9 +105,9 @@ def main(argv):

# Make classifier.

classifier = caffe.Classifier(args.model_def, args.pretrained_model,

-image_dims=image_dims, mean=mean,

-input_scale=args.input_scale, raw_scale=args.raw_scale,

-channel_swap=channel_swap)

+image_dims=None, mean=None,

+input_scale=None, raw_scale=None,

+channel_swap=None)

# Load numpy array (.npy), directory glob (*.jpg), or image file.

args.input_file = os.path.expanduser(args.input_file)

@@ -116,11 +116,11 @@ def main(argv):

inputs = np.load(args.input_file)

elif os.path.isdir(args.input_file):

print("Loading folder: %s" % args.input_file)

- inputs =[caffe.io.load_image(im_f)

+ inputs =[caffe.io.load_image(im_f, False)

for im_f in glob.glob(args.input_file + '/*.' + args.ext)]

else:

print("Loading file: %s" % args.input_file)

- inputs = [caffe.io.load_image(args.input_file)]

+ inputs = [caffe.io.load_image(args.input_file, False)]

print("Classifying %d inputs." % len(inputs))

@@ -131,6 +131,7 @@ def main(argv):

# Save

print("Saving results into %s" % args.output_file)

+ print(predictions)

np.save(args.output_file, predictions)

git diff python/caffe/classifier.py

diff --git a/python/caffe/classifier.py b/python/caffe/classifier.py

index 64d804be..65b0d881 100644

--- a/python/caffe/classifier.py

+++ b/python/caffe/classifier.py

@@ -69,18 +69,18 @@ class Classifier():

for ix, in_ in enumerate(inputs):

input_[ix] = caffe.io.resize_image(in_, self.image_dims)

- if oversample:

-# Generate center, corner, and mirrored crops.

-input_ = caffe.io.oversample(input_, self.crop_dims)

- else:

-# Take center crop.

-center = np.array(self.image_dims) / 2.0

-crop = np.tile(center, (1, 2))[0] + np.concatenate([

- -self.crop_dims / 2.0,

- self.crop_dims / 2.0

-])

-crop = crop.astype(int)

-input_ = input_[:, crop[0]:crop[2], crop[1]:crop[3], :]

+ #if oversample:

+ # # Generate center, corner, and mirrored crops.

+ # input_ = caffe.io.oversample(input_, self.crop_dims)

+ #else:

+ # # Take center crop.

+ # center = np.array(self.image_dims) / 2.0

+ # crop = np.tile(center, (1, 2))[0] + np.concatenate([

+ # -self.crop_dims / 2.0,

+ # self.crop_dims / 2.0

+ # ])

+ # crop = crop.astype(int)

+ # input_ = input_[:, crop[0]:crop[2], crop[1]:crop[3], :]

# Classify

caffe_in = np.zeros(np.array(input_.shape)[[0, 3, 1, 2]],

@@ -91,8 +91,8 @@ class Classifier():

predictions = out[self.outputs[0]]

# For oversampling, average predictions across crops.

- if oversample:

-predictions = predictions.reshape((len(predictions) // 10, 10, -1))

-predictions = predictions.mean(1)

+ #if oversample:

+ # predictions = predictions.reshape((len(predictions) // 10, 10, -1))

+ # predictions = predictions.mean(1)

return predictions

```

最后，真正到运行命令的时候了：

```

使用命令计算图片：

python python/classify.py --model_def examples/mnist/lenet.prototxt --pretrained_model examples/mnist/lenet_iter_10000.caffemodel --center_only --images_dim 28,28 /home/user/Desktop/2.jpg FOO

```

更改上在py程序后，会输出以下内容：

```

Saving results into FOO

[[2.63039285e-10 1.69570372e-10 1.00000000e+00 3.37297190e-10

1.04435086e-16 6.86246951e-15 1.50223258e-14 4.68932055e-12

6.54263449e-11 1.28875165e-14]]

```

由于输入的是2.jpg，所以第2个位置（从0开始）的概率最大，几乎是1.可以分别手写0到9图片，进行测试。

我分别在vm(i53230), j1900（真机）, i737**m(真机)进行测试，cpu运行过，识别率还可以，速度感觉都比较慢在1s以上吧。

[首页](/index.html)

本内容不代表本网观点和政治立场，如有侵犯你的权益请联系我们处理。

网友评论

网友评论仅供其表达个人看法，并不表明网站立场。