[논문리뷰] EfficientNet - penny4860/study-note GitHub Wiki

1. 정리

Compound Scaling
- ConvNet의 크기(scale)를 키우는 방법은 3가지가 있다.
  - resolution / depth / width(filter 숫자)
- 기존의 연구들은 Network의 크기를 키우기위해 1가지 방식만을 고려.
- 본 논문에서는 3가지 요소각각의 fixed scaling coefficient를 정하고, 이들을 동시에 scale up.
- 직관적 설명
  - resolution이 커지면
  - 더 큰 receptive field가 필요하다. --> depth를 키워야 함.
  - 더 많은 pattern을 capture해야함 --> width를 키워야 함.
EfficientNet
- NAS로 baseline network를 찾고
- compound scaling으로 모델의 complexity를 높임.
- ImageNet에서 SOTA
- 다른 classification task에서도 SOTA의 성능을 보임

ConvNet Accuracy
- ImageNet Task를 타겟으로 Convnet 모델을 개발
- 다른 classification task에서도 높은 성능을 보임
- 다른 CV task (object detection)에서도 높은 성능을 보임.
ConvNet Efficiency
- squeezenet 계열
- mobilenet 계열
- shufflenet 계열
- NAS로 찾은 모델
  - mnasnet
  - proxylessnas
model scaling
- ConvNet의 스케일을 어떻게 높일것인가?

1st Observation
- 3가지(depth, width, resolution) 중에서 1개라도 늘리면 성능이 좋아짐
- 너무 늘릴경우 성능향상이 saturation / diminish 된다.

2nd Observation
- 네트워크를 scaling 할떄는 3가지 dimenstion을 골고루 늘려야함.
compound scaling method
- theta : compound scaling coefficient
  - 모델의 크기를 결정
- alpha, beta, gamma :
  - 결정된 resource를 어떻게 배분할지를 결정