Rocket debris damages apartment building in Russian city

February 9, 2026 · 杨勇 · Source: dev门户

EU could not accept Zelensky's threats against Orbán · 14:59

Merz abruptly changed his rhetoric during a meeting in China · 09:25


There have long been hints that innate immunity can last longer in certain circumstances. The most-studied example is the Bacillus Calmette-Guérin tuberculosis vaccine, which is given to some 100 million newborns every year. Epidemiological and clinical studies have shown that it can decrease infant mortality from other infections, suggesting that the cross-protection could last months. But the phenomenon was inconsistent and the mechanism mysterious.

"KPop Demon Hunters still has a lot to show": sequel to be released in 2029

via the game

Abstract: Large language model (LLM)-powered agents have demonstrated strong capabilities in automating software engineering tasks such as static bug fixing, as evidenced by benchmarks like SWE-bench. However, in the real world, the development of mature software is typically predicated on complex requirement changes and long-term feature iterations, a process that static, one-shot repair paradigms fail to capture. To bridge this gap, we propose SWE-CI, the first repository-level benchmark built upon the Continuous Integration loop, aiming to shift the evaluation paradigm for code generation from static, short-term functional correctness toward dynamic, long-term maintainability. The benchmark comprises 100 tasks, each corresponding on average to an evolution history spanning 233 days and 71 consecutive commits in a real-world code repository. SWE-CI requires agents to systematically resolve these tasks through dozens of rounds of analysis and coding iterations. SWE-CI provides valuable insights into how well agents can sustain code quality throughout long-term evolution.
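To make the contrast with one-shot repair concrete, the following is a minimal sketch of what an evaluation loop over an evolution history might look like. It is not the SWE-CI harness: the names `Step`, `run_ci_loop`, `toy_agent`, and `max_rounds` are hypothetical, the codebase is modeled as a plain dictionary, and the pass/fail record it returns is an illustrative stand-in rather than the benchmark's actual metric. The point it shows is only that the agent's earlier output is carried forward into every later step.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical types for illustration; none of these names come from SWE-CI.
Codebase = Dict[str, str]                    # file path -> file contents
TestSuite = Callable[[Codebase], bool]       # True when the CI run passes
Agent = Callable[[Codebase, str], Codebase]  # returns an updated codebase


@dataclass
class Step:
    requirement: str  # requirement change introduced at this point in history
    tests: TestSuite  # CI checks that must pass after the change


def run_ci_loop(agent: Agent, codebase: Codebase, history: List[Step],
                max_rounds: int = 3) -> List[bool]:
    """Walk the history in order, carrying the agent's own code forward."""
    results = []
    for step in history:
        passed = False
        for _ in range(max_rounds):
            # Each round lets the agent revise the code it produced earlier.
            codebase = agent(codebase, step.requirement)
            if step.tests(codebase):
                passed = True
                break
        results.append(passed)
    return results


def toy_agent(codebase: Codebase, requirement: str) -> Codebase:
    """Stand-in agent: records the requirement as an implemented feature."""
    updated = dict(codebase)
    updated[requirement] = "implemented"
    return updated


history = [
    Step("feature_a", lambda cb: "feature_a" in cb),
    # The later step also checks the earlier feature, so regressions would fail.
    Step("feature_b", lambda cb: "feature_a" in cb and "feature_b" in cb),
]
outcomes = run_ci_loop(toy_agent, {}, history)
```

Because later steps re-check earlier behavior against the same evolving codebase, an agent that patches one step carelessly pays for it downstream, which is the property a static, single-commit evaluation cannot observe.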
