#scalable-oversight 1 item May 8 Automated Weak-to-Strong Researcher: AI Agents Outperform Humans on Alignment Research Anthropic research