PVT v2: Improved Baselines with Pyramid Vision Transformer
Paper
• 2106.13797 • Published
Converted TIMM image classification model for LiteRT.
@article{wang2021pvtv2,
title={Pvtv2: Improved baselines with pyramid vision transformer},
author={Wang, Wenhai and Xie, Enze and Li, Xiang and Fan, Deng-Ping and Song, Kaitao and Liang, Ding and Lu, Tong and Luo, Ping and Shao, Ling},
journal={Computational Visual Media},
volume={8},
number={3},
pages={1--10},
year={2022},
publisher={Springer}
}
Base model
timm/pvt_v2_b0.in1k