#transformers 1 item May 8 Structural Origin of Attention Sink: Variance Discrepancy, Super Neurons, and a Fix research