configs/trainers/CATEX/vit_b16_ep50.yaml (25 lines of code) (raw):

DATALOADER: TRAIN_X: BATCH_SIZE: 128 # 1024 128 TEST: BATCH_SIZE: 100 # 100 NUM_WORKERS: 32 INPUT: SIZE: (224, 224) INTERPOLATION: "bicubic" PIXEL_MEAN: [0.48145466, 0.4578275, 0.40821073] PIXEL_STD: [0.26862954, 0.26130258, 0.27577711] TRANSFORMS: ["random_resized_crop", "random_flip", "normalize"] OPTIM: NAME: "sgd" LR: 0.002 MAX_EPOCH: 50 # 50 LR_SCHEDULER: "cosine" WARMUP_EPOCH: 1 WARMUP_TYPE: "constant" WARMUP_CONS_LR: 1e-5 TRAIN: PRINT_FREQ: 5 MODEL: BACKBONE: NAME: "ViT-B/16"