-
Ghostnet on MS1mV3
-
Constant lr decay
-
ReLU, with_pointwise, E
-
TT_ghostnet_pointwise_E_arc_emb512_dr04_wd5e4_bs512_ms1m_hist
- 0.995833 | 0.953571 | 0.959
-
PReLU, GDC, droupout 0.4
-
TT_ghostnet_prelu_GDC_arc_emb512_dr04_wd5e4_bs512_ms1m_hist
- 0.995333 | 0.957714 | 0.956
-
Cosine lr decay, PReLU
-
on batch, first_decay_step 16
-
sgdw 5e-4
- TT_ghostnet_prelu_GDC_arc_emb512_dr0_sgdw_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_fixed_hist
- 0.996167 | 0.961286 | 0.966833
-
sgd, l2 5e-4, apply_to_batch_normal=False
- TT_ghostnet_prelu_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_fixed_hist
- 0.997167 | 0.959429 | 0.969333
-
sgd, l2 5e-4, restarts 3
- use_bias True, scale True
- TT_ghostnet_prelu_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_restart_3_hist
- 0.997 | 0.959714 | 0.962833
- use_bias False, scale True
- TT_ghostnet_prelu_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_restart_3_bias_false_hist
- 0.9965 | 0.960571 | 0.97
- arc + triplet 64
- TT_ghostnet_prelu_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_restart_3_bias_false_E48_arc_trip_hist
- 0.997333 | 0.970857 | 0.9705
-
first_decay_step 7
-
on epoch
- sgd, l2 1e-3, apply_to_batch_normal=False
- TT_ghostnet_prelu_GDC_arc_emb512_dr0_sgd_l2_1e3_bs1024_ms1m_bnm09_bne1e5_cos7_epoch_hist
- 0.9965 | 0.965 | 0.97
- sgd, l2 5e-4, apply_to_batch_normal=False
- TT_ghostnet_prelu_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos7_epoch_hist
- 0.996833 | 0.962429 | 0.969
-
on batch, sgd, l2 1e-3
- image_per_class=0
- TT_ghostnet_prelu_GDC_arc_emb512_dr0_sgd_l2_1e3_bs1024_ms1m_bnm09_bne1e5_cos7_batch_hist
- 0.997167 | 0.959857 | 0.968667
- image_per_class=4
- TT_ghostnet_prelu_GDC_arc_emb512_dr0_sgd_l2_1e3_bs1024_ms1m_bnm09_bne1e5_cos7_batch_image_4_hist
- 0.996333 | 0.959714 | 0.968000
-
float16
-
PReLU
-
init 0
- TT_ghostnet_prelu_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_restart_3_bias_false_hist
- 0.995333 | 0.957714 | 0.969
-
init 0.25
- TT_ghostnet_prelu_25_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_fixed_float16_hist
- 0.996667 | 0.960429 | 0.966833
-
swish
-
image_per_class 10
- TT_ghostnet_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_fixed_ipc10_float16_hist
- 0.9955 | 0.961857 | 0.971167
-
Randaug
- TT_ghostnet_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_fixed_randaug_float16_hist
- 0.9965 | 0.958 | 0.960667
-
keep_lr_as_min 0
- TT_ghostnet_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_fixed_kam0_float16_hist
- 0.9965 | 0.962 | 0.968333
-
SGD LookAhead
- TT_ghostnet_swish_GDC_arc_emb512_dr0_sgd_LH_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_float16_hist
- 0.995833 | 0.948429 | 0.956167
-
Basic
- TT_ghostnet_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_fixed_float16_hist
- 0.996833 | 0.959857 | 0.969
- E50, lr 0.025
- SGD
- TT_ghostnet_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_E50_sgd_lr25e3_float16_hist
- 0.997667 | 0.962571 | 0.9705
- LookAhead
- TT_ghostnet_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_E50_LH_sgd_lr25e3_float16_hist
- 0.997833 | 0.964857 | 0.970833
- SAM
- TT_ghostnet_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_E50_SAM_sgd_lr25e3_float16_hist
- 0.997833 | 0.963571 | 0.97
- SAM + LookAhead
- TT_ghostnet_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_E50_SAM_LH_sgd_lr25e3_float16_hist
- 0.997667 | 0.963857 | 0.970667
-
first_strides=1
-
PReLU, se use PReLU
-
TT_ghostnet_strides_1_prelu_25_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_fixed_float16_hist
- 0.997833 | 0.978286 | 0.98
-
PReLU, se use ReLU
-
TT_ghostnet_strides_1_prelu_25_se_relu_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_fixed_float16_hist
- 0.9975 | 0.978429 | 0.976333
-
Botnet50 on MS1mV3
-
relu
-
Conv use_bias=True, shortcut act relu
-
TT_botnet50_relu_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_restart_3_bias_false_hist
- 0.998 | 0.978 | 0.978167
-
Conv use_bias=False, shortcut act relu
-
TT_botnet50_relu_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_restart_3_bias_false_conv_no_bias_hist
- 0.997833 | 0.981143 | 0.978833
-
Conv use_bias=False, shortcut act none
-
random 0
- TT_botnet50_relu_shortcut_act_none_GDC_arc_emb512_cos16_batch_restart_3_bias_false_conv_no_bias_tmul_2_hist
- 0.9985 | 0.980286 | 0.979667
-
randaug 100
- TT_botnet50_relu_shortcut_act_none_GDC_arc_emb512_cos16_batch_restart_3_bias_false_conv_no_bias_tmul_2_randaug_hist
- 0.997667 | 0.981857 | 0.979333
-
PreLU, init 0
-
Conv use_bias=False, shortcut act none, random 0
-
TT_botnet50_prelu_shortcut_act_none_GDC_arc_emb512_bs768_cos16_batch_restart_2_bias_false_conv_no_bias_tmul_2_random0_hist
- 0.997833 | 0.978571 | 0.978
-
swish
-
Conv use_bias=False, shortcut act none, random 0
-
GDC
- use_bias False
- TT_botnet50_swish_shortcut_act_none_GDC_arc_emb512_cos16_batch_restart_2_bias_false_conv_no_bias_tmul_2_random0_hist
- 0.9985 | 0.984571 | 0.979833
-
E, dropout 0.4
- use_bias False, E17
- TT_botnet50_swish_shortcut_act_none_E_dr04__arc_emb512_cos16_batch_restart_2_bias_false_conv_no_bias_tmul_2_random0_hist
- 0.998 | 0.981571 | 0.979
- use_bias True, E17
- TT_botnet50_swish_shortcut_act_none_E_dr04__arc_emb512_cos16_batch_restart_2_bias_true_conv_no_bias_tmul_2_random0_hist
- 0.997667 | 0.983143 | 0.9785
-
Resnet on MS1mV3
-
resnet50v2
-
pad_same_conv_no_bias
-
GDC
- relu
- TT_resnet50v2_pad_same_conv_no_bias_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e4_cos16_float16_hist
- 0.998 | 0.980143 | 0.976667
- swish
- TT_resnet50v2_swish_pad_same_conv_no_bias_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e4_cos16_float16_hist
- 0.997333 | 0.977286 | 0.977
-
swish, E
- first_conv_k7_stride_2
- TT_resnet50v2_swish_pad_same_conv_no_bias_E_arc_emb512_dr04_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e4_cos16_float16_hist
- 0.998 | 0.979 | 0.977667
- first_conv_k3_stride_1
- TT_resnet50v2_swish_pad_same_first_conv_k3_stride_1_conv_no_bias_E_arc_emb512_dr04_sgd_l2_5e4_bs384_ms1m_bnm09_bne1e4_cos16_hist
- 0.9985 | 0.988571 | 0.9835
-
resnet101v2
-
basic, relu, DC
-
TT_resnet101v2_pad_same_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e5_cos16_batch_fixed_float16_hist
- 0.998167 | 0.984 | 0.9785
-
pad_same_conv_no_bias
-
relu, GDC
- TT_resnet101v2_pad_same_conv_no_bias_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e4_cos16_float16_hist
- 0.997833 | 0.983429 | 0.979167
-
swish, E
- first_conv_k7_stride_2
- TT_resnet101v2_swish_pad_same_conv_no_bias_E_arc_emb512_dr04_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e4_cos16_float16_hist
- 0.998167 | 0.982714 | 0.980333
- first_conv_k3_stride_1
- TT_resnet101v2_swish_pad_same_first_conv_k3_stride_1_conv_no_bias_E_arc_emb512_dr04_sgd_l2_5e4_bs384_ms1m_bnm09_bne1e4_cos16_hist
- 0.9985 | 0.989143 | 0.9845
-
r50
-
swish, E, ms1m
-
TT_r50_swish_E_arc_emb512_dr04_sgd_l2_5e4_bs1024_ms1m_bnm09_bne1e4_cos16_hist
- 0.998333 | 0.989714 | 0.984167
-
swish, E, ms1m_cleaned
-
basic
- TT_r50_swish_E_arc_emb512_dr04_sgd_l2_5e4_bs1024_ms1m_cleaned_bnm09_bne1e4_cos16_hist
- 0.998333 | 0.989571 | 0.984333
-
SD (1, 0.8)
- TT_r50_SD_swish_E_arc_emb512_dr04_sgd_l2_5e4_bs1024_ms1m_cleaned_bnm09_bne1e4_cos16_hist
- 0.9985 | 0.989714 | 0.983667
-
se_r50
- random 0
- TT_se_r50_swish_E_arc_emb512_dr04_sgd_l2_5e4_bs1024_ms1m_cleaned_bnm09_bne1e4_cos16_hist
- 0.998333 | 0.989714 | 0.984
- randaug 100
- TT_se_r50_swish_E_arc_emb512_dr04_sgd_l2_5e4_bs1024_ms1m_cleaned_randaug_100_bnm09_bne1e4_cos16_hist
-
se_r50, SD (1, 0.8)
- TT_se_r50_SD_swish_E_arc_emb512_dr04_sgd_l2_5e4_bs1024_ms1m_cleaned_bnm09_bne1e4_cos16_hist
-
efficientnetV2 on MS1mV3
-
efv2_s
-
early defined one
-
Stochastic Depth 0
- TT_early_efv2_s_add_GDC_arc_emb512_dr0_sgd_l2_5e4_bs512_ms1m_cos16_hist
- 0.997833 | 0.960429 | 0.9785
-
Stochastic Depth 0.8
- TT_early_efv2_s_sd08_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_cos16_hist
- 0.997667 | 0.981 | 0.981
-
Stochastic Depth (1, 0.8)
- TT_early_efv2_s_sd_1_08_GDC_arc_emb512_dr0_sgd_l2_5e4_bs1024_ms1m_cos16_hist
- 0.998167 | 0.974857 | 0.982167
-
Official image21k, SD 0, E17
-
TT_efv2_s_swish_E_arc_emb512_dr04_sgd_l2_5e4_bs512_ms1m_bnm09_bne1e4_cos16_hist
- 0.997333 | 0.975571 | 0.977833
-
efv2_b0
-
sgdw 5e-4
-
TT_efv2_b0_swish_GDC_arc_emb512_dr0_sgdw_wd_5e4_bs512_ms1m_cos16_batch_float16_hist
- 0.995833 | 0.953429 | 0.964
-
SGD, l2 5e-4
-
bnm 0.99, bne 1e-3, MS1mV3
- TT_efv2_b0_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs512_ms1m_cos16_batch_float16_hist
- 0.997167 | 0.974857 | 0.976
-
bnm 0.9, bne 1e-4
- ms1m_cleaned
- TT_efv2_b0_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs512_ms1m_cleaned_bnm09_bne1e4_cos16_batch_float16_hist
- 0.9975 | 0.975714 | 0.976333
- MS1mV3
- TT_efv2_b0_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs512_ms1m_bnm09_bne1e4_cos16_batch_float16_hist
- 0.997167 | 0.975429 | 0.975
-
finetune efv2_b0, E50 --> E17
-
MS1mV3
-
arcface
- SD 0
- TT_efv2_b0_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs512_ms1m_bnm09_bne1e4_cos16_batch_float16_E50_arc_base_hist
- 0.997333 | 0.972 | 0.9755
- SD (1, 0.8)
- TT_efv2_b0_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs512_ms1m_bnm09_bne1e4_cos16_batch_float16_E50_arc_SD_hist
- 0.997167 | 0.974 | 0.976
- random 2
- randaug 100, no shear
- randaug 100, no shear, cutout
- cutout only
-
arc + triplet 64
- TT_efv2_b0_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs512_ms1m_bnm09_bne1e4_cos16_batch_float16_E50_arc_trip64_hist
- 0.997667 | 0.981143 | 0.9785
-
curricular
- TT_efv2_b0_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs512_ms1m_bnm09_bne1e4_cos16_batch_float16_E50_curr_hist
- 0.997 | 0.973429 | 0.976333
-
ms1m_cleaned
-
arc + triplet 64
- random 0
- TT__efv2_b0_swish_*_E50_arc_trip64_hist
- 0.998 | 0.982714 | 0.977833
- randaug 100
- TT__efv2_b0_swish_*_E50_arc_trip64_randaug_100_hist
- 0.997833 | 0.981714 | 0.974167
- random 2
- TT__efv2_b0_swish_*_E50_arc_trip64_random_2_hist
- 0.997833 | 0.983429 | 0.977833
- randaug 100, no shear
- TT__efv2_b0_swish_*_E50_arc_trip64_randaug_100_no_shear_hist
- randaug 100, no shear, cutout
- TT__efv2_b0_swish_*_E50_arc_trip64_randaug_100_no_shear_cutout_hist
-
curricular + triplet 64
- TT_efv2_b0_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs512_ms1m_cleaned_bnm09_bne1e4_cos16_batch_float16_E50_curr_trip64_hist
- 0.997833 | 0.983286 | 0.977333
-
curricular
- SD (1, 0.8)
- TT_efv2_b0_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs512_ms1m_cleaned_bnm09_bne1e4_cos16_batch_float16_E50_curr_SD_hist
- 0.9975 | 0.973857 | 0.975167
- SD 0
- TT_efv2_b0_swish_GDC_arc_emb512_dr0_sgd_l2_5e4_bs512_ms1m_cleaned_bnm09_bne1e4_cos16_batch_float16_E50_curr_hist
- 0.998167 | 0.975857 | 0.976167