AI
AI Digest
EN RU
Home Archive About RSS

#mech-interp

2 items

  • 18 мая Judge Circuits: Mechanistic Explanation of LLM-as-Judge Format Inconsistency research
  • 11 июн Anatomy of Post-Training: Using Interpretability to Audit and Fix Preference Data research

ai-digest.kerby.pro

© 2026 Alexei Lukin · CC BY 4.0

RSS · JSON Feed · About