RightProduct
Kronecker Attention Networks
An Attention Free Transformer
Transformer with Fourier Integral Attentions
Linear Complexity Randomized Self-attention Mechanism
UFO-ViT: High Performance Linear Vision Transformer without Softmax
XCiT: Cross-Covariance Image Transformers
SimpleTRON: Simple Transformer with O(N) Complexity
A Dot Product Attention Free Transformer
On Learning the Transformer Kernel
Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization