周报

技术趋势

本周 AI 领域最突出的特征是一种"同步加速"：资本、模型、基础设施和研究同时进入新的量级。OpenAI 宣布了史上最大规模的 1100 亿美元融资，NVIDIA 以 300 亿美元直接入股，Anthropic 刚刚完成 300 亿美元 G 轮——三天内流入 AI 头部公司的资本超过 1400 亿美元。与此同时，Qwen3.5-397B、Claude Sonnet 4.6、Gemini 3.1 Pro 三款旗舰模型在同一周内发布，形成了一场罕见的三方对决。但真正值得关注的变化发生在水面之下。微软、Cloudflare、GitHub、HuggingFace 在同一周内集中发布 Agent 基础设施框架，标志着行业重心正从"更强的模型"转向"更可靠的 Agent 系统"。与此形成尖锐对照的是，五篇安全研究论文从几何、结构、模态三个维度共同揭示了当前 LLM 安全对齐的根本性脆弱。在 Agent 即将大规模部署的节点上，这一矛盾格外刺眼。

文章详情

周报

2026-W15 (April 5-11) marked a cognitive shift in AI engineering: the orchestration infrastructure built around models — what the industry now calls the "harness" — moved from backstage to center stage. OpenAI disclosed a million-line zero-human-code experiment. Meta built a code pre-computation engine with 50+ agents. A Claude Code source leak exposed the sophistication of this architecture. All three point to the same conclusion: the 2026 AI engineering race is no longer about models — it is about everything around them.

文章详情

周报

If one word captures this week in AI, it's "engineering." Coding agents had a collective awakening. Internal architectures got laid bare, engineering methodology got codified, toolchains proliferated, and model-layer catch-up intensified. Coding agents have officially entered the era of systematic engineering discipline. Meanwhile, agent memory discourse — sparked by Karpathy's personal Wiki experiment — rippled through academia and the open-source community, making "how should agents persist knowledge" the week's most debated question.

文章详情

周报

技术趋势

Week 13 of 2026 (March 22–28) surfaced three parallel but interconnected narratives in AI. The first is a concentrated burst of multi-agent orchestration tooling. Cline Kanban, Scion, DeerFlow 2.0, and several others all shipped in the same week, marking an industry-wide pivot from "single-agent capability" to "engineering multi-agent collaboration."

Two technical threads dominate Week 11 of 2026 (March 8–14) in recommendation system research. First, generative recommendation (GR) is undergoing full-stack optimization — transitioning from "making it work" to "making it work well, fast, and fairly" — Netflix/Meta's exponential reward-weighted SFT addresses post-training alignment, LinkedIn's causal attention reformulation halves sequence length, Kuaishou's FP8 quantization reduces OneRec-V2 inference latency by 49%, and Alibaba's differentiable geometric indexing eliminates long-tail bias at its root. Five papers advance GR's industrial maturity across five dimensions. Second, LLM-based recommendation is shifting from "single-pass inference" toward an agentic paradigm — Meta's VRec inserts verification steps into reasoning chains, Meituan's RecPilot replaces traditional recommendation lists with a multi-agent framework, USTC's TriRec introduces tri-party coordination for the first time, and RUC/JD's RecThinker enables autonomous tool invocation.

All revisions applied. Here's a summary of changes:

本周共收录 23 篇推荐系统相关论文，其中 5 分论文 5 篇，4 分 10 篇，3 分 8 篇，整体质量出色。Generative Recommendation（生成式推荐）是本周最显著的技术主线，6 篇论文直接聚焦于此，涵盖 Semantic ID 编码、受限解码优化、广告场景部署和多任务统一框架。另一条主线是 LLM 与推荐系统的融合范式——"LLM-as-Rec"（LLM 作为推荐骨干）与"LLM-for-Rec"（LLM 辅助推荐）两条路径本周都有重要进展。工业部署论文占比极高（6 篇含 Online A/B 测试），来自 AliExpress、快手、Apple App Store 等一线平台。

文章详情

1 2