Zhihu Frontier @ZhihuFrontier X Profile

Zhihu Frontier

@ZhihuFrontier

Followers

248

Following

27

Media

24

Statuses

38

🚀Bringing China's AI & tech trends, voices, and perspectives to the global stage. ⚡️Powered by Zhihu, China's leading knowledge platform.

Beijing

Joined June 2025

Don't wanna be here? Send us removal request.

Zhihu Frontier

@ZhihuFrontier

1 month

🚀 Zhihu Frontier is now live. We're here to bring China's AI & tech trends, voices, and perspectives to the global stage — powered by Zhihu Inc., China's leading knowledge platform. 🧭 Exploring the frontiers of technology and ideas. 📌 From developer insights to deep tech

6

0

9

Zhihu Frontier

@ZhihuFrontier

3 days

📢 ICML 2025 | From Language to Vision: VARSR unlocks a new paradigm for Image Super-Resolution. Recent advances show autoregressive models (AR) - successful in NLP - are now thriving in vision tasks (like DALL·E, GPT-4o). Compared to Diffusion Models, AR methods better capture

0

1

3

Zhihu Frontier

@ZhihuFrontier

3 days

🧠 Zhihu contributor & @Kimi_Moonshot senior researcher @Jianlin_S shares insights on K2’s architecture & optimization:.🔥One key breakthrough? Solving the MaxLogit explosion with a combo of Muon + QK-Clip.(Technical details see previous post👇).🧩 More reflections on model.

Zhihu Frontier

@ZhihuFrontier

6 days

🚀 Zhihu contributor & @Kimi_Moonshot senior researcher @Jianlin_S drops a new piece:."QK-Clip: Taking Muon Further on the Scaleup Journey". While scaling Muon to 100B+ params, a new bottleneck hit: MaxLogit explosion 💥.Enter QK-Clip - a post-hoc fix to Q/K weights (unlike

0

10

Zhihu Frontier

@ZhihuFrontier

4 days

🧠Zhihu contributor & @Kimi_Moonshot dev Dylan shares his thoughts on building Kimi K2:. Why RL? Because compute may be infinite, but data is not. RL improves data efficiency - that's why we invest in scaling test-time compute. Why large models? Why Muon optimizer?.→ It's all

2

7

44

Zhihu Frontier

@ZhihuFrontier

5 days

🤖 Zhihu contributor & @Kimi_Moonshot RL Lead @RotekSong shares how they pushed Kimi K2 toward better general-purpose Agent abilities - by scaling up tool-use data. In short: a fully automated agent data factory 🏭 that simulates end-to-end workflows to filter out high-quality

2

12

48

Zhihu Frontier

@ZhihuFrontier

6 days

🚀 Zhihu contributor & @Kimi_Moonshot senior researcher @Jianlin_S drops a new piece:."QK-Clip: Taking Muon Further on the Scaleup Journey". While scaling Muon to 100B+ params, a new bottleneck hit: MaxLogit explosion 💥.Enter QK-Clip - a post-hoc fix to Q/K weights (unlike

0

3

37

Zhihu Frontier

@ZhihuFrontier

6 days

🎉 Great to see @Kimi_Moonshot infra dev 刘少伟 sharing why Kimi K2's config "looks the way it does" - from an inference perspective 🤖. 🧩 Constraints:.Inherits DSv3 structure.Adjust internal model parameters to fit needs.Training & inference cost ≈ DSv3.🎯 Goal: Lower loss.

Kimi.ai

@Kimi_Moonshot

6 days

Some thoughts on the decisions behind Kimi K2's architecture - from our infra staff.

0

1

8

Zhihu Frontier

@ZhihuFrontier

7 days

🚀🔥 @Kimi_Moonshot drops the K2 model — now #1 trending on Hugging Face!.💡 Devs break it down, tech folks dive deep. 👀📎 Full discussion & hands-on reviews on Zhihu:.#Kimi #K2Model #LLM #HuggingFace #AI.

Hugging Face

@huggingface

8 days

Kimi K2 is number one trending on HF, congrats!

0

2

Zhihu Frontier

@ZhihuFrontier

9 days

🚀 Kimi releases its first trillion-parameter open-source agentic model — K2!.What's new? What's powerful? What's next? 🤖🔥.Full discussions, hands-on reviews & benchmarks now live on Zhihu:.👉 Welcome to join the convo! 🧠💬.Try it now at.

Kimi.ai

@Kimi_Moonshot

9 days

🚀 Hello, Kimi K2! Open-Source Agentic Model!.🔹 1T total / 32B active MoE model.🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models.🔹Strong in coding and agentic tasks.🐤 Multimodal & thought-mode not supported for now. With Kimi K2, advanced agentic intelligence

0

1

3

Zhihu Frontier

@ZhihuFrontier

9 days

🧠 Zhihu contributor @langfengq shared his thinkings about open-sourced verl-agent - an RL framework designed to train reasoning-capable LLM agents!. It extends veRL, and unlike methods that simply concatenate full interaction history, verl-agent treats each step as an

1

6

Zhihu Frontier

@ZhihuFrontier

10 days

💬 How do people view Grok 4, the new-gen model from Musk’s xAI? What are its standout features?. 🔗Dive into discussions from Chinese tech community Zhihu users here:.#Grok4 #xAI #AI.

xAI

@xai

10 days

Introducing Grok 4, the world's most powerful AI model. Watch the livestream now:

0

1

Zhihu Frontier

@ZhihuFrontier

12 days

Zhihu contributor 周国睿 has spent the past year exploring a big Q:.👉 Can we take recommender systems to the next level?. Based on hands-on work with OneRec, they found E2E recsys might be the future. Here are his key thoughts:.🔍How big should a rec model be? .⚙️How to scale

5

0

9

Zhihu Frontier

@ZhihuFrontier

13 days

🧠 Zhihu contributor @jackbai_jkb shares insights from his paper "Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction". How to do multi-step training on a model post-trained in single-step environments? They found the ideal training setup should be:.⚙️ no

6

0

13

Zhihu Frontier

@ZhihuFrontier

16 days

「Tech Insight」.🔥 #HackerNews Top Post: "The new skill in AI is not prompting, it's context engineering" — thoughts?. 💬 Zhihu contributor Navis Li:.What really makes or breaks your AI app is what kind of "ingredients" you feed it. Context Engineering = crafting the entire

0

4

Zhihu Frontier

@ZhihuFrontier

17 days

🚀 What's the hype about @Zai_org open-source GLM‑4.1V‑9B‑Thinking model?. 🧑‍💻 Zhihu contributor 社恐打工仔:.Pretty slick! Fast inference⚡. Just 9B params yet beats many bigger models. Why? It thinks, not just predicts. 🔥. 🧠 Zhihu contributor Mirabella:.Breakthrough design -

0

3

Zhihu Frontier

@ZhihuFrontier

17 days

🎙️ AI Insight Talk | Code Bench Special Live!. 🗓️ July 3rd, 8:30–9:30 PM (CST).🔗 Co-hosted by @huggingface × @OpenMMLab × @MaaSAI42 × Zhihu × 机智流 & more!. Topics:.🔍 CPRet – Is the model "memorizing" or truly understanding? Exposing performance inflation on similar problems

0

1

Zhihu Frontier

@ZhihuFrontier

18 days

「Developer Talk」.🧠 In the RL Scaling era, what kind of RL framework do we need?.🚀 To answer that, THUDM open-sourced their self-developed framework slime: 👨‍💻 RL infra @Zai_org & Zhihu contributor @朱小霖 breaks it down:.🧩 Believes LLM + RL = final

0

1

5

Zhihu Frontier

@ZhihuFrontier

18 days

🔥 Baidu open-sourced the ERNIE 4.5 series!.🤖 What does it mean for China's AI ecosystem?.📊 Reviews & Discussions on Zhihu:.#AI #LLM #OpenSource #MoE.

Baidu Inc.

@Baidu_Inc

20 days

The ERNIE 4.5 series is now officially open source. This family of models includes 10 variants—from MoE models with 47B and 3B active parameters, the largest having 424B total parameters, to a 0.3B dense model—all available now to the global AI community for open research and

0

1

Zhihu Frontier

@ZhihuFrontier

19 days

冯一尘 mentioned two important things for building an Agent:.1️⃣ Agents that can think long 🤔.2️⃣ End-to-end Reinforcement Learning 💡.Another Kimi’s dev Flood Sung explains why "long thinking" matters, and bonus discovery during training:.📈As performance improves, token usage.

Zhihu Frontier

@ZhihuFrontier

19 days

「AI Frontier」.🧪 @Kimi_Moonshot 's first Agent - Kimi-Researcher - entered limited rollout on June 20. It generates long-form, source-traceable research reports. 👨‍💻 Kimi engineer 冯一尘 personally responded on Zhihu:.We're not building a search tool. We're training an AI Agent

0

1

Zhihu Frontier

@ZhihuFrontier

19 days

「AI Frontier」.🧪 @Kimi_Moonshot 's first Agent - Kimi-Researcher - entered limited rollout on June 20. It generates long-form, source-traceable research reports. 👨‍💻 Kimi engineer 冯一尘 personally responded on Zhihu:.We're not building a search tool. We're training an AI Agent

0

1

Zhihu Frontier

@ZhihuFrontier

23 days

「Tech Insight」.🧠 Zhihu contributor 许华哲Harry shares his take on Embodied Intelligence:. He reflects on common failure modes in this field:.• Chasing the "coolest task" at all costs.• Building virtual worlds and hoping digital solves everything.• Dumping massive data into

1

0

3