For training, we train our YOLOvx series for 300 epochs on COCO.
For data augmentation, we use large-scale jitter (LSJ), Mosaic, and Mixup, following the setting of YOLOX, but we remove the rotation transformation used in YOLOX's strong augmentation.
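As a quick illustration of the Mixup augmentation mentioned above (a minimal sketch following the standard Mixup formulation with a Beta-distributed mixing ratio; the function name and plain-list image representation are ours, not this repository's implementation):

```python
import random

def mixup(img_a, img_b, alpha=1.0):
    # Blend two images pixel-wise with a random ratio lam ~ Beta(alpha, alpha).
    # Images are nested lists of floats here; a real pipeline would use tensors.
    lam = random.betavariate(alpha, alpha)
    blended = [[lam * pa + (1.0 - lam) * pb for pa, pb in zip(row_a, row_b)]
               for row_a, row_b in zip(img_a, img_b)]
    # The labels of both images are kept and weighted by lam and (1 - lam).
    return blended, lam
```

In detection pipelines the same blending weight is also used to scale the loss contribution of each image's boxes.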
For the optimizer, we use AdamW with weight decay 0.05 and a base learning rate of 0.001 / 64 per image, scaled linearly with the total batch size.
For the learning rate scheduler, we use a linear decay schedule.
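The learning-rate settings above can be sketched as follows (a minimal illustration; the function names are ours, and we assume the linear schedule decays from the scaled base rate toward zero over training):

```python
def scaled_lr(batch_size, base_lr_per_image=0.001 / 64):
    # Effective learning rate grows linearly with the total batch size,
    # so batch size 64 recovers the base rate of 0.001.
    return base_lr_per_image * batch_size

def linear_decay(step, total_steps, init_lr):
    # Linearly decay the learning rate from init_lr to zero.
    return init_lr * (1.0 - step / total_steps)

lr = scaled_lr(batch_size=64)        # 0.001 at batch size 64
mid_lr = linear_decay(150, 300, lr)  # value halfway through 300 epochs
```

Under this linear-scaling rule, halving the batch size (as forced by limited memory) halves the effective learning rate as well.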
Due to our limited computing resources, we cannot train YOLOvx-X with a batch size of 128.