#evaluation 1 item 1 мая AutoResearchBench — a benchmark for autonomous scientific literature search by AI agents BAAI research