#ssm
2 items

Jun 3
Do Language Models Need Sleep? Offline Recurrence as Memory Consolidation for Improved Inference
Google / CMU research

Jun 11
llama.cpp b9589–b9592: CUDA SSM Sync Fix and Mamba Memory Optimization
tools