2018 06 - PaddlePaddle/continuous_evaluation GitHub Wiki

2018年6月值班日志

20180629

20180628

下面是相邻的几个 commit 单独 debug vgg16模型的结果,commit 按时间先后排序

commmit status
bc28cf613f9e PASS
a2e43ae5ce69 PASS
19e877ffdb4f FAIL

对应 issue: https://github.com/PaddlePaddle/continuous_evaluation/issues/88

20180607

[13:37:35]	-----------  Configuration Arguments -----------
[13:37:35]	batch_size: 64
[13:37:35]	data_format: NCHW
[13:37:35]	data_set: flowers
[13:37:35]	device: GPU
[13:37:35]	gpu_id: 0
[13:37:35]	infer_only: False
[13:37:35]	iterations: 80
[13:37:35]	log_dir: ./
[13:37:35]	model: resnet_imagenet
[13:37:35]	pass_num: 3
[13:37:35]	skip_batch_num: 5
[13:37:35]	use_cprof: False
[13:37:35]	use_fake_data: False
[13:37:35]	use_nvprof: False
[13:37:35]	------------------------------------------------
[13:37:35]	del memory.txt
[13:37:37]	/usr/local/lib/python2.7/dist-packages/paddle/fluid/average.py:42: Warning: The WeightedAverage is deprecated, please use fluid.metrics.Accuracy instead.
[13:37:37]	  (self.__class__.__name__), Warning)
[13:38:28]	Pass:0, Loss:5.186864, Train Accuray:0.063672, Test Accuray:0.047059, Handle Images Duration: 38.863647
[13:38:28]	
[13:39:17]	Pass:1, Loss:3.916058, Train Accuray:0.148828, Test Accuray:0.118627, Handle Images Duration: 41.423943
[13:39:17]	
[13:40:06]	Pass:2, Loss:3.371016, Train Accuray:0.205469, Test Accuray:0.134314, Handle Images Duration: 41.409965
[13:40:06]	
[13:40:06]	Total examples: 15040, total time: 121.69756
[13:40:06]	123.58506 examples/sec, 0.51786 sec/batch 
[13:40:06]	
[13:40:07]	*** Aborted at 1528292407 (unix time) try "date -d @1528292407" if you are using GNU date ***
[13:40:07]	PC: @                0x0 (unknown)
[13:40:07]	*** SIGSEGV (@0x58) received by PID 36315 (TID 0x7f12c45c7700) from PID 88; stack trace: ***
[13:40:07]	    @     0x7f134cafa390 (unknown)
[13:40:07]	    @           0x4bc5bb PyEval_EvalFrameEx
[13:40:07]	    @           0x4b9ab6 PyEval_EvalCodeEx
[13:40:07]	    @           0x4d55f3 (unknown)
[13:40:07]	    @           0x4a577e PyObject_Call
[13:40:07]	    @           0x4bed3d PyEval_EvalFrameEx
[13:40:07]	    @           0x4c136f PyEval_EvalFrameEx
[13:40:07]	    @           0x4c136f PyEval_EvalFrameEx
[13:40:07]	    @           0x4b9ab6 PyEval_EvalCodeEx
[13:40:07]	    @           0x4d54b9 (unknown)
[13:40:07]	    @           0x4eebee (unknown)
[13:40:07]	    @           0x4a577e PyObject_Call
[13:40:07]	    @           0x4c5e10 PyEval_CallObjectWithKeywords
[13:40:07]	    @           0x589172 (unknown)
[13:40:07]	    @     0x7f134caf06ba start_thread
[13:40:07]	    @     0x7f134c82641d clone
[13:40:07]	    @                0x0 (unknown)
[13:40:07]	./run.xsh: line 13: 36315 Segmentation fault      (core dumped) FLAGS_benchmark=true FLAGS_fraction_of_gpu_memory_to_use=0.0 python model.py --device=GPU --batch_size=64 --data_set=flowers --model=resnet_imagenet --pass_num=3 --gpu_id=$cudaid

后面版本该错误未出现, 应该是ce当时环境问题

20180608

  • 值班人: 卫科

20180609

  • 值班人: 董志宏

20180610

20180611

20180612

20180613

20180614

  • 值班人:巩伟宝
  • 问题:环境问题,找不到wheel包, 解决人:严春伟
12:00:59][Step 2/3] Requirement 'python/dist/*.whl' looks like a filename, but the file does not exist
[12:00:59][Step 2/3] *.whl is not a valid wheel filename.
[12:00:59][Step 2/3] You are using pip version 9.0.3, however version 10.0.1 is available.

20180615

[12:37:50]Evaluate [0ddc5d86319f33e560afe274ce17f038cdfe498a] successed!
[12:37:50]updating baseline
[12:37:50]current kpi imikolov_20_pass_duration_card4_factor.txt better than history by 0.465918, update baseline
[12:37:51]current kpi train_pass_duration_kpi_card4_factor.txt better than history by 0.665026, update baseline
[12:37:51]update github baseline
[12:37:51]To [email protected]:PaddlePaddle/paddle-ce-latest-kpis.git
[12:37:51] ! [rejected]        master -> master (fetch first)
[12:37:51]error: failed to push some refs to '[email protected]:PaddlePaddle/paddle-ce-latest-kpis.git'

20180619

20180620

  • 值班人 于洋

20180621

20180622

20180625

20180626

20180627

⚠️ **GitHub.com Fallback** ⚠️