DeepFM: A Factorization Machine based Neural Network for CTR Prediction - penny4860/study-note GitHub Wiki

1. 정리

Feature Interaction을 모델링하는 방법
- 기존의 방식
  - Manually include pairwise feature interactions
  - Feature engineering 필요한 단점이 있음.
- Factorization Machine
  - Feature 끼리의 latent vector를 내적하는 방식
  - 성능이 좋으나 high order modeling이 안되는 문제가 있음.
- Wide & Deep learning
  - low order interaction은 wide part로 , high order interaction은 deep part로 모델링
  - wide part의 구현에는 여전이 feature engineering에 의존
DeepFM의 contributions
1. low order, high order의 feature interaction을 1개의 network로 모델링
2. 효율적인 학습
  - wide/deep part가 같은 input을 공유
3. 성능

y_hat = sigmoid(y_FM + y_DNN)
- y_hat : user가 item을 클릭할 확률
- y_FM : scalar
- y_DNN : scalar
Feature interaction 모델링 order
- order-1
  - 여러 feature의 linear combination으로 모델링
  - w1*x1 + w2*x2 + ...
- order-2
  - 임베딩 vector의 내적으로 모델링
- higher order
  - feature vector를 concat & MLP