arjunkocher Profile Banner
Arjun Profile
Arjun

@arjunkocher

Followers
4K
Following
4K
Media
459
Statuses
6K

AIR𝘦𝘴𝘦𝘢𝘳𝘤𝘩

Joined September 2009
Don't wanna be here? Send us removal request.
@arjunkocher
Arjun
1 day
Much awaited Kimi-K2 Technical Report is Out Now!. Kimi K2 is 1T-parameter open-weight MoE model built for agentic intelligence. Using MuonClip optimizer and a 15.5T-token high-quality dataset, Kimi K2 achieves stable, scalable pre-training. Post-training combines large-scale
Tweet media one
Tweet media two
1
1
29
@arjunkocher
Arjun
3 hours
RT @Kimi_Moonshot: We're building the Kimi Discord community! . If you're passionate about AI, love Kimi, or want to help shape the future….
discord.com
Community server for Kimi AI. | 389 members
0
15
0
@arjunkocher
Arjun
1 day
RT @Teknium1: Little do people know but interstellar, on work for the Hermes 3 function calling capabilities - built an almost identical pi….
0
3
0
@arjunkocher
Arjun
7 days
0
0
2
@arjunkocher
Arjun
7 days
Mixture of Raytraced Experts. —. - MRE replaces fixed top-k MoE gating with a dynamic, stochastic raytracing mechanism. - firing ray probabilistically activates a sequence of experts using a routing net, like a poisson walk on a softmax graph. - no load balancing, hard top-k,
Tweet media one
1
2
37
@arjunkocher
Arjun
8 days
Nous released their Hermes 3 dataset. --.- 1m samples.- uncensored sota for its time across llama-3 (8b, 70b, 405b).- dense in-prompt adherence, roleplay, subjective/objective tasks.- rich tool use, structured outputs, api-like call patterns.- early agentic traces: xml-tagged
Tweet media one
2
3
36
@arjunkocher
Arjun
12 days
RT @teortaxesTex: Oh.
Tweet media one
0
2
0
@arjunkocher
Arjun
12 days
@Kimi_Moonshot
Kimi.ai
12 days
🚀 Hello, Kimi K2! Open-Source Agentic Model!.🔹 1T total / 32B active MoE model.🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models.🔹Strong in coding and agentic tasks.🐤 Multimodal & thought-mode not supported for now. With Kimi K2, advanced agentic intelligence
Tweet media one
0
0
1
@arjunkocher
Arjun
12 days
The 1 Trillion param Open-Source Agentic Model from Kimi Moonshot is live. Kimi K-2 ✌🏻. (based on MuonClip Optimizer). get started here:
Tweet media one
Tweet media two
1
0
16
@arjunkocher
Arjun
15 days
RT @teortaxesTex: Claim. Source: “revealed in a dream”. I do sometimes get leaks in dreams, and Arjun is Indian so maybe it's a Ramanujan s….
0
1
0
@arjunkocher
Arjun
15 days
Kimi’s next drop gonna be spicy 🥵
Tweet media one
1
0
24
@arjunkocher
Arjun
22 days
0
0
3
@arjunkocher
Arjun
22 days
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning.via Multi-Agent Multi-Turn Reinforcement Learning. —.adversarial selfplay can yield cognitive dividends in LLMs trained for general reasoning without explicit labels or human reward shaping. instead of finetuning on
Tweet media one
Tweet media two
1
3
38
@arjunkocher
Arjun
23 days
0
0
1
@arjunkocher
Arjun
23 days
Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search. —.AB‑MCTS (Adaptive Branching Monte Carlo Tree Search) inference-time LLMs strategy. repeated sampling w iterative refinement, guided by external feedback. each node uses bayesian posterior
Tweet media one
2
1
17
@arjunkocher
Arjun
23 days
Ovis-U1, a 3B parameter unified MLLM. built on a RoPE-based diffusion visual decoder and a CLS-token-based refiner. integrates 3 core capabilities .- image comprehension.- text-to-image synthesis.- and fine-grained editing.all within a single architecture. performance wise, .it
Tweet media one
Tweet media two
1
4
29
@arjunkocher
Arjun
23 days
if money could buy good AI.siri would have been the best model. a lesson in there.
0
0
6