Dynamic Token Pruning in Plain Vision Transformers for Semantic Segmentation