Image Classification on iNaturalist 2019

Evaluation Metric

Top-1 Accuracy
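
Top-1 accuracy is the percentage of images for which the model's single highest-scoring class matches the ground-truth label. A minimal sketch of the computation follows; the function name `top1_accuracy` and the toy scores are illustrative assumptions, not taken from any of the listed papers or from the benchmark's own evaluation code.

```python
def top1_accuracy(scores, labels):
    """Return top-1 accuracy as a percentage.

    scores: list of per-class score lists, one per image (hypothetical input format).
    labels: list of integer ground-truth class indices, same length as scores.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not scores:
        raise ValueError("at least one example is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the highest-scoring class (first index wins on ties).
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy example with 4 images and 3 classes (made-up numbers).
toy_scores = [
    [0.1, 0.7, 0.2],
    [0.8, 0.1, 0.1],
    [0.3, 0.3, 0.4],
    [0.2, 0.5, 0.3],
]
toy_labels = [1, 0, 2, 0]
print(top1_accuracy(toy_scores, toy_labels))
```

The values in the results table are percentages of this kind, as reported by the respective papers.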

Results

Performance of each model on this benchmark

| Model Name | Top-1 Accuracy | Paper Title | Repository |
|---|---|---|---|
| LeViT-192 | 70.8 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference | - |
| CeiT-S | 78.9 | Incorporating Convolution Designs into Visual Transformers | - |
| CeiT-S (384 finetune resolution) | 82.7 | Incorporating Convolution Designs into Visual Transformers | - |
| ResNet50 (A2) | 75.0 | ResNet strikes back: An improved training procedure in timm | - |
| RDNet-T (224 res, IN-1K pretrained) | 81.2 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | - |
| Conviformer-B | 82.85 | Conviformers: Convolutionally guided Vision Transformer | - |
| MixMIM-L | 83.9 | MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers | - |
| MAE (ViT-H, 448) | 88.3 | Masked Autoencoders Are Scalable Vision Learners | - |
| RDNet-S (224 res, IN-1K pretrained) | 82.9 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | - |
| Hiera-H (448px) | 88.5 | Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | - |
| RDNet-L (224 res, IN-1K pretrained) | 83.7 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | - |
| LeViT-256 | 72.3 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference | - |
| LeViT-128 | 68.4 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference | - |
| CeiT-T | 72.8 | Incorporating Convolution Designs into Visual Transformers | - |
| ResMLP-12 | 71.0 | ResMLP: Feedforward networks for image classification with data-efficient training | - |
| CeiT-T (384 finetune resolution) | 77.9 | Incorporating Convolution Designs into Visual Transformers | - |
| RDNet-B (224 res, IN-1K pretrained) | 83.5 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | - |
| LeViT-384 | 74.3 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference | - |
| LeViT-128S | 66.5 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference | - |
| CaiT-M-36 U 224 | 81.8 | - | - |