Introduction

This repository contains the official implementation of the following papers:

CoTNet Contextual transformer networks for visual recognition, TPAMI 2022
Wave-ViT Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning, ECCV 2022
Dual-ViT Dual Vision Transformer

Getting Started

For Image Classification, please see classification.
For Object Detection and Instance Segmentation, please see object_detection.
For Semantic Segmentation, please see semantic_segmentation.

Citation

CoTNet

@article{cotnet2022,
  title={Contextual transformer networks for visual recognition},
  author={Li, Yehao and Yao, Ting and Pan, Yingwei and Mei, Tao},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2022},
  publisher={IEEE}
}

Wave-ViT

@inproceedings{wavevit2022,
    title     = {Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning},
    author    = {Yao, Ting and Pan, Yingwei and Li, Yehao and Ngo, Chong-Wah and Mei, Tao},
    booktitle = {Proceedings of the European conference on computer vision (ECCV)},
    year      = {2022},
}

Dual-ViT

@article{dualvit2022,
  title={Dual Vision Transformer},
  author={Yao, Ting and Li, Yehao and Pan, Yingwei and Wang, Yu and Zhang, Xiao-Ping and Mei, Tao},
  journal={arXiv preprint arXiv:2207.04976},
  year={2022}
}

Acknowledgements

Thanks the contribution of timm, pvt and volo.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
classification		classification
images		images
object_detection		object_detection
semantic_segmentation		semantic_segmentation
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

classification

classification

images

images

object_detection

object_detection

semantic_segmentation

semantic_segmentation

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Introduction

Getting Started

Citation

Acknowledgements

About

Releases

Packages

Languages

License

YehLi/ImageNetModel

Folders and files

Latest commit

History

Repository files navigation

Introduction

Getting Started

Citation

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Languages