AI-Papers
What Are Variable-Width Transformers (><former)? Cutting LLM FLOPs by 22% with an ×-Shaped Design