[Paper Review] MultiGrain: a unified image embedding for classes and instances - penny4860/study-note GitHub Wiki

1. Summary

Summarize R-MAC
How negative pairs are selected during batch sampling for the margin loss (Eq. (4))
- check reference 45
- check the code
- what role does the pdf formula play when selecting the negative j?
Study the l2-norm ==> PCA ==> l2-norm pipeline.
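One possible reading of the pdf's role, assuming Eq. (4) follows the distance-weighted sampling of "Sampling Matters in Deep Embedding Learning": for l2-normalized embeddings in n dimensions, pairwise distances concentrate around sqrt(2), with density q(d) proportional to d^(n-2) * (1 - d^2/4)^((n-3)/2). Sampling negatives with probability proportional to min(cap, 1/q(d)) counteracts that concentration, so negatives are spread over distances instead of almost all coming from d ≈ sqrt(2). The sketch below is illustrative only; the function names and the `log_cap` value are hypothetical and not taken from the paper's code.

```python
import math

def log_q(d, n):
    """Log of the unnormalized density of the distance d between two points
    drawn uniformly on the unit sphere in n dimensions (requires 0 < d < 2)."""
    return (n - 2) * math.log(d) + 0.5 * (n - 3) * math.log(1.0 - 0.25 * d * d)

def negative_sampling_probs(distances, n, log_cap=10.0):
    """Probability of picking each candidate negative,
    proportional to min(cap, 1 / q(d)), computed in log space."""
    log_w = [min(log_cap, -log_q(d, n)) for d in distances]
    shift = max(log_w)  # subtract the max before exp for numerical stability
    w = [math.exp(v - shift) for v in log_w]
    total = sum(w)
    return [v / total for v in w]

# Three candidate negatives at distances 0.5, 1.0 and 1.41 from the anchor.
probs = negative_sampling_probs([0.5, 1.0, 1.41], n=128)
print(probs)
```

In this toy case the two closer candidates hit the cap and get equal probability, while the candidate near sqrt(2), where most random pairs fall, is strongly down-weighted.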
- CNN features off-the-shelf: An astounding baseline for recognition
- Aggregating deep convolutional features for image retrieval
- Particular object retrieval with integral max-pooling of CNN activations
Paper review: Fine-tuning CNN Image Retrieval with No Human Annotation
- alpha-QE
Paper review: Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking
Image Embeddings
- MultiGrain: a unified image embedding for classes and instances
- Sampling Matters in Deep Embedding Learning
  - improves on contrastive / triplet losses
- Unifying Deep Local and Global Features for Image Search
- https://jobs.zalando.com/tech/blog/shop-look-deep-learning/?gh_src=4n3gxh1
- dataset
  - street2shop
  - deep fashion
  - modanet : ebay
    - object detection
    - segmentation
  - open-images

Pooling in the classification task
- uses Global Average Pooling: gives global translation invariance
Pooling in the retrieval task
- needs more localized information than the classification task.
- types
  - CAM-based
  - R-MAC
  - GeM : Generalized Mean Pooling
    - computes a p-norm per channel (normalized by the number of spatial positions, i.e. a generalized mean).
    - p : exponent parameter
      - p = 1 : average pooling
      - p = inf : max pooling
      - p > 1 : increases the contrast between the pixels of the feature tensor.
Pooling used in the paper
- GeM is used for both classification and retrieval
- a different p value is applied depending on the input resolution
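The behaviour of GeM for different p can be checked with a minimal pure-Python sketch. It assumes the feature map is given as a list of channels, each a flat list of non-negative activations; `gem_pool` and the `eps` clamp are illustrative choices, not the paper's implementation.

```python
def gem_pool(feature_map, p=3.0, eps=1e-6):
    """Generalized mean pooling: one scalar per channel.

    feature_map: list of channels, each a flat list of activations.
    Activations are clamped to eps so that fractional powers stay defined.
    """
    pooled = []
    for channel in feature_map:
        clamped = [max(v, eps) for v in channel]
        mean_pow = sum(v ** p for v in clamped) / len(clamped)
        pooled.append(mean_pow ** (1.0 / p))
    return pooled

channel = [0.0, 1.0, 2.0, 5.0]
print(gem_pool([channel], p=1.0))    # close to the average, 2.0
print(gem_pool([channel], p=3.0))    # between the average and the max
print(gem_pool([channel], p=100.0))  # approaches the max, 5.0
```

As p grows, the largest activations dominate the pooled value, which is the contrast effect noted above.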

Eq. (5)

Proposes a batch construction method that also trains the retrieval task while using only an image classification dataset
Repeated Augmentation
1. sample |B| / m images
2. apply m augmentations to each image
  - this yields |B| images
  - an original image and its augmented samples are used as positive pairs
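The two steps above can be sketched as follows. This is a minimal illustration, not the authors' sampler: `repeated_augmentation_batch`, `positive_pairs`, and `toy_augment` are hypothetical names, the augmentation is a stand-in, and rounding |B| / m down is an assumption (how any leftover slots are filled is not stated in these notes). Copies that share a source image are treated as positives.

```python
import random
from itertools import combinations

def repeated_augmentation_batch(dataset_size, batch_size, m, augment, rng):
    """Return a list of (source_index, augmented_sample) pairs.

    floor(batch_size / m) distinct images are drawn without replacement,
    and each one is augmented m times.
    """
    n_unique = batch_size // m
    sources = rng.sample(range(dataset_size), n_unique)
    return [(idx, augment(idx, rng)) for idx in sources for _ in range(m)]

def positive_pairs(batch):
    """Batch positions (a, b) whose samples come from the same source image."""
    return [(a, b) for a, b in combinations(range(len(batch)), 2)
            if batch[a][0] == batch[b][0]]

def toy_augment(idx, rng):
    """Stand-in augmentation: tag the image index with a random flip flag."""
    return (idx, rng.random() < 0.5)

rng = random.Random(0)
batch = repeated_augmentation_batch(dataset_size=10_000, batch_size=512, m=3,
                                    augment=toy_augment, rng=rng)
# samples in the batch, distinct source images, positive pairs
print(len(batch), len({src for src, _ in batch}), len(positive_pairs(batch)))
# prints: 510 170 510
```

With a batch size of 512 and m = 3, this gives 170 source images and 3 positive pairs per image, all obtained from class-labelled data without any instance-level annotation.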

ResNet-50
learning-rate
- [0.2, 0.02, 0.002, 0.0002]
batch
- batch size : 512, m : 3
- each sampled image is augmented 3 times to fill the batch of 512.
  - about 170 instances are sampled (512 / 3 ≈ 170)

Experiments compare the p=1 and p=3 settings
Retrieval performance was better with p=3.
- the contrast of the convolutional feature map increases.
- this has the effect of focusing on local information

margin loss hyperparameters
- uses the settings from "Sampling matters in deep embedding learning."
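For reference, the margin loss in that paper has the form max(0, alpha + y * (D - beta)), where D is the distance between two embeddings and y is +1 for a positive pair and -1 for a negative pair. A minimal sketch follows; the defaults alpha = 0.2 and beta = 1.2 are the values I recall from that paper and are not stated in these notes, so treat them as assumptions. In the original work beta can also be learned, while it is kept fixed here.

```python
def margin_loss(distance, is_positive, alpha=0.2, beta=1.2):
    """Margin loss for one pair of embeddings.

    Positive pairs are penalized when distance > beta - alpha,
    negative pairs when distance < beta + alpha.
    alpha and beta defaults are assumed values, see the note above.
    """
    y = 1.0 if is_positive else -1.0
    return max(0.0, alpha + y * (distance - beta))

print(margin_loss(0.5, is_positive=True))    # close positive pair: no loss
print(margin_loss(1.3, is_positive=True))    # positive pair too far apart
print(margin_loss(1.5, is_positive=False))   # distant negative pair: no loss
print(margin_loss(1.1, is_positive=False))   # negative pair too close
```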