Key Conclusions

Selected Signals

1. Evaluating alignment of behavioral dispositions in LLMs

Google Research has turned behavioral alignment from a concept into reproducible, verifiable evaluation engineering.


2. Copilot organization custom instructions are generally available

GitHub has extended organization-level custom instructions across the full Copilot pipeline.


3. Claude Code auto mode: a safer way to skip permissions

Anthropic has replaced manual approval with input probing and an output classifier.


Signal Technique

Observations