[FEATURE] Add ViT weights: RADIO #2177

seefun · 2024-05-14T12:01:53Z

The code and model weights for the paper [CVPR 2024] AM-RADIO: Agglomerative Vision Foundation Model - Reduce All Domains Into One have been released by Nvidia.

RADIO, a new vision foundation model (in practice, a new set of pretrained ViT weights), excels across visual domains and serves as a superior replacement for vision backbones. It integrates CLIP variants, DINOv2, and SAM through distillation while preserving their unique features, such as text grounding and segmentation correspondence.

NightMachinery · 2024-05-22T02:09:41Z

Does RADIO have ImageNet-1k heads?

seefun added the enhancement (New feature or request) label on May 14, 2024
