News

RetNet: Retentive Network: A Successor to Transformer for Large Language Models, Arxiv, 2023 (Microsoft).[][GPViT: "GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation" ...