[ACL‘20] Highway Transformer: A Gated Transformer.
pytorch transformer language-model gated-attention transformer-xl highway-transformer gating-transformer
-
Updated
Dec 5, 2021 - Python