Attalos Image and Text Performance - Lab41/attalos GitHub Wiki
Assumptions:
- Using Inception Features
- Evaluated using attalos.evaluation.evaluation.py
Dataset Codes
Code | Name |
---|---|
IA | IAPRTC-12 |
ESP | ESP Game |
VG | Visual Genome |
Single Corpus
Approach | Train | Test | wordvecs | Precision | Recall | F1 | Network | Epochs |
---|---|---|---|---|---|---|---|---|
MSE | IA | IA | Glove | 0.2131 | 0.1306 | 0.1619 | 2048,1024,200 | 240 |
Negative Sampling - Fixed | IA | IA | w2v | 0.282 | 0.216 | 0.163 | 2048,1024,200 | 240 |
Negative Sampling - Fixed | IA | IA | Glove | 0.3077 | 0.3486 | 0.3269 | 2048,1024,200 | 240 |
Negative Sampling - JointOpt | IA | IA | w2v | 0.3980 | 0.2650 | 200,200 | 240 | |
Negative Sampling - JointOpt | IA | IA | w2v | 0.4120 | 0.28995 | 0.340209 | 2048,1024,200 | 240 |
Negative Sampling - JointOpt | IA | IA | w2v | 0.3347 | 0.40065 | 0.3647 | 2048,1024,200 | 240 |
Negative Sampling - Fixed | VG | VG | w2v | 0.04567 | 0.02462 | 0.01552 | 2048,1024,200 | 240 |
Negative Sampling - JointOpt | VG | VG | w2v | 0.1494 | 0.05013 | 0.751 | 2048,1024,200 | 240 |
DenseCap | VG | VG | w2v | 0.133 | 0.149 | 0.140 | 1-Hot | |
Fast Zero Tag | IA | IA | Glove | 0.2708 | 0.2119 | 0.2165 | 2048,1024,200 | 240 |
Word Distribution Vectors | IA | IA | w2v | 0.4644 | 0.3081 | 0.3455 | 2048,1168,1168,1168 | 220 |
MSE | ESP | ESP | Glove | 0.3762 | 0.1967 | 0.2186 | 2048,1024,200 | 240 |
Negative Sampling | ESP | ESP | Glove | 0.3933 | 0.2721 | 0.2665 | 2048,1024,200 | 240 |
Fast Zero Tag | ESP | ESP | Glove | 0.4017 | 0.2047 | 0.2437 | 2048,1024,200 | 240 |
Word Distribution Vectors | ESP | ESP |
Multi Corpus
Approach | Train | Test | Precision | Recall | F1 | Network Description | Epochs |
---|---|---|---|---|---|---|---|
MSE | IA | ESP | 0.0891 | 0.0969 | 0.0579 | 2048,1024,200 | 240 |
Negative Sampling-Fixed | IA | ESP | 0.1008 | 0.0501 | 0.0359 | 2048,1024,200 | 240 |
NegSamp-FixWV/OptUnseenWV | |||||||
NegaSamp-OptW | IA | ESP | 0.12206 | 0.07297 | 0.068568 | 200,200 | 240 |
NegSamp-OptWV | IA | ESP | 0.1256 | 0.9195 | 0.1062 | 4096,2048 | 5k,noBN |
Fast Zero Tag | IA | ESP | 0.0641 | 0.0764 | 0.0501 | 2048,1024,200 | 240 |
MSE | VG | ESP | |||||
Negative Sampling - Fixed | VG | ESP | 0.0745 | 0.052711 | 0.061745 | 2048,1024,200 | 500 |
Negative Sampling - JointOpt | VG | ESP | 0.11821 | 0.1399 | 0.12814 | 2048,1024,200 | 240 |
Fast Zero Tag | VG | ESP | 0.0812 | 0.0854 | 0.0631 | 2048,1024,200 | 240 |
DenseCap | VG | ESP | 0.090 | 0.170 | 0.118 | 4096,4096,1000,~4600 | 1.6M images |