This is an official pytorch implementation of our paper "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization". In this paper, we ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results