SRCNN: Image Super Resolution Using Deep Convolutional Networks - Deepest-Project/Greedy-Survey GitHub Wiki

Resources

arxiv

Youtube, TensorFlow KR ๋…ผ๋ฌธ์ฝ๊ธฐ ๋ชจ์ž„

Abstract

  • SISR(Single-Image Super-Resolution) ๋ฌธ์ œ์— ์ตœ์ดˆ๋กœ ๋”ฅ๋Ÿฌ๋‹์„ ์ ์šฉํ•˜์˜€๋‹ค.
  • ์ด์ „์˜ State-of-art ๋ฐฉ๋ฒ•์ด์—ˆ๋˜ Sparse-coding Based SR์ด CNN์˜ ๊ด€์ ์œผ๋กœ๋„ ํ•ด์„ํ•  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์˜€๋‹ค.
  • ๋‹ค์–‘ํ•œ ์ƒ‰์ƒ ์ฑ„๋„, Kernel Size์— ๋Œ€ํ•ด ์‹คํ—˜ํ–ˆ์œผ๋ฉฐ ์ƒ‰์ƒ ์ฑ„๋„์€ RGB์ผ๋•Œ ๊ฐ€์žฅ ์ข‹๊ฒŒ, Kernel Size๋Š” Performance์™€ Time์ด Compromiseํ•˜๋Š” ๊ฒฐ๊ณผ๋ฅผ ๋ณด์˜€๋‹ค.

Introduction

Prior Knowledge

PSNR: Peak Signal-to-noise ratio

ํ†ต์ƒ์ ์œผ๋กœ ์‹ ํ˜ธ ๋Œ€ ์žก์Œ ๋น„์˜ ์˜๋ฏธ๋ฅผ ๊ฐ€์ง€๊ณ  ์žˆ์ง€๋งŒ, Image Restoration์—์„œ๋Š” ๋‹ค๋ฅธ ์˜๋ฏธ๋ฅผ ๊ฐ€์ง„๋‹ค. ๋ถ„๋ชจ ๋ถ€๋ถ„์˜ MSE๊ฐ€ {์›๋ณธ ์ด๋ฏธ์ง€ - ๋ณต์› ์ด๋ฏธ์ง€}์˜ L2 Loss๋กœ ๋“ค์–ด๊ฐ€๋ฉฐ, Loss(๋ถ„๋ชจ)๊ฐ€ ์ž‘์œผ๋ฉด PSNR์ด ์ปค์ง€๊ธฐ ๋•Œ๋ฌธ์— PSNR์€ Image Restoration์˜ ํ’ˆ์งˆ์„ ์ธก์ •ํ•˜๋Š” ์ฒ™๋„ ์ค‘ ํ•˜๋‚˜์ด๋‹ค.

Bicubic Interpolation

Image Upsampling(์ด๋ฏธ์ง€ ์‚ฌ์ด์ฆˆ ํ‚ค์šฐ๊ธฐ)๋ฅผ ํ•  ๋•Œ ์“ฐ์ด๋Š” ๋ฐฉ๋ฒ• ์ค‘ ํ•˜๋‚˜. Wikipedia
SRCNN, VDSR ๋“ฑ pre-upsampling ๊ธฐ๋ฐ˜์˜ SISR ๊ธฐ๋ฒ•์—์„œ ์ฃผ๋กœ ์‚ฌ์šฉ๋˜์—ˆ์œผ๋‚˜, Interpolation์€ ๊ทธ ๋น„์šฉ์ด ์ ์ง€ ์•Š๊ณ , ๊ธฐ์กด์˜ ์ด๋ฏธ์ง€์— ๋Œ€ํ•œ ์ถ”๊ฐ€์ ์ธ ์ •๋ณด๋ฅผ ์ „ํ˜€ ์ œ๊ณตํ•˜์ง€ ์•Š๋Š”๋‹ค๋Š” ์ง€์  ๋•Œ๋ฌธ์— ESPCN ๋“ฑ์˜ post-upsampling SISR ๊ธฐ๋ฒ•๋“ค์ด ๋‚˜์˜ค๊ธฐ๋„ ํ•˜์˜€๋‹ค.

Sparse Coding

(https://bskyvision.com/177)

Experiments and Results

์ฐธ๊ณ  - Github TesorFlow Code(https://github.com/tegg89/SRCNN-Tensorflow)

Experiments

Architecture of SRCNN

1. Patch Extraction Input Image๋ฅผ ๊ฐ™์€ Size(๋…ผ๋ฌธ์—์„œ๋Š” 33 * 33)์˜ Patch๋กœ ์กฐ๊ฐ๋‚ธ๋‹ค.

2. Patch Representation Conv1 - Relu1 Layer๋ฅผ ๊ฑฐ์ณ ๋‘๊ป˜ n_1์˜ feature map์„ ์ƒ์„ฑํ•œ๋‹ค.

3. Non-linear Mapping Conv2 - Relu2 Layer๋ฅผ ๊ฑฐ์ณ ๋‘๊ป˜ n_2์˜ feature map์„ ์ƒ์„ฑํ•˜๋ฉฐ, ์ด๋Š” ๋ชจ๋ธ์— nonlinearity๋ฅผ ๋ชจ๋ธ์— ๋ถ€์—ฌํ•˜๋Š” ์—ญํ• ์„ ํ•œ๋‹ค.

4. Reconstruction Conv3 Layer๋ฅผ ๊ฑฐ์ณ Restored Image๋ฅผ ์ƒ์„ฑํ•œ๋‹ค.

Loss Function: image

์‹ค์ œ๋กœ model์„ evaluateํ•  ๋•Œ๋Š” ๊ฐ๊ฐ์˜ patch๋ฅผ mergeํ•˜์—ฌ ํ•˜๋‚˜์˜ ์ด๋ฏธ์ง€๋กœ ํ•ฉ์น˜๋Š” ๊ณผ์ •์ด ์ˆ˜๋ฐ˜๋œ๋‹ค(๋”ฐ๋ผ์„œ Zero Padding์„ ํ•˜์ง€ ์•Š์„ ๊ฒฝ์šฐ output์ด HR์ด๋ฏธ์ง€์—์„œ ํ…Œ๋‘๋ฆฌ๋ฅผ ์ž˜๋ผ๋‚ธ ํ˜•ํƒœ๊ฐ€ ๋œ๋‹ค).

Results

backprops-PSNR Graph and Reconstruction of SRCNN, SC, Bicubic PSNR of SRCNN via different Color Channels

Discussion

Sparse-Coding Based Method์™€์˜ ๋น„๊ต

Sparse Coding Based SR ๋…ผ๋ฌธ arxiv์„ ์ •๋…ํ•˜๊ธฐ ์–ด๋ ค์šด ๊ด€๊ณ„๋กœ, Sparse Coding ์ž์ฒด์™€ SRCNN์„ ๋น„๊ตํ•ด๋ณธ๋‹ค.

Sparse Coding SR in a view of SRCNN

(https://bskyvision.com/177)
์—์„œ ์„ค๋ช…ํ•œ Sparse coding(DMOS๋ฅผ ์˜ˆ์ธกํ•˜๋Š” ๋ฐฉ๋ฒ•)๊ณผ SRCNN์„ ๋น„๊ตํ•œ๋‹ค.

  • Representation ์€ ์ƒˆ๋กœ์šด Image์˜ ํŠน์„ฑ์„ ๋ฝ‘์•„๋‚ด์„œ ์ƒˆ๋กœ์šด ์—ด๋ฒกํ„ฐ๋ฅผ ๋งŒ๋“œ๋Š” ๊ฒƒ์— ํ•ด๋‹นํ•œ๋‹ค.
  • Non-linear mapping ์€ ๊ฐ atom์— ํ•ด๋‹นํ•˜๋Š” DMOS ๊ฐ’์„ ๋Œ€์‘์‹œํ‚ค๋Š” ๊ณผ์ •์— ํ•ด๋‹นํ•œ๋‹ค.
  • Reconstruction์€ ๊ณ„์ˆ˜๋“ค์„ DMOS๊ฐ’์— ๊ฐ๊ฐ ๊ณฑํ•ด์„œ ์ตœ์ข… ๊ฒฐ๊ณผ(Prediction)์„ ๋งŒ๋“ค์–ด๋‚ด๋Š” ๊ฒƒ์— ํ•ด๋‹นํ•œ๋‹ค.

์งˆ๋ฌธ

Super Resolution ๋ฌธ์ œ์—์„œ Confidence Map์„ ์–ด๋–ป๊ฒŒ ๊ทธ๋ฆด ์ˆ˜ ์žˆ์„๊นŒ?