On the Relationship Between Self-Attention And Convolutional Layers January 10, 2020 https://arxiv.org/pdf/1911.03584 Fullscreen Dark Mode