Ce Zhang @ce_zhang X Profile

Ce Zhang

@ce_zhang

Followers

3K

Following

2K

Media

134

Statuses

729

CTO @ Together @togethercompute Neubauer Associate Professor @UChicago

https://t.co/hoNt2PH4dF

San Francisco

Joined September 2016

Don't wanna be here? Send us removal request.

Ce Zhang

@ce_zhang

2 years

A 7B model beyond Transformer architecture that matches / sometimes outperforms, the strongest 7B Transformer! Thanks @Hessian_AI & @Teknium1 @theemozilla @NousResearch for the collaboration. Play with it here https://t.co/nTU6ZX3Pfn and give us feedback!

Together AI

@togethercompute

2 years

Announcing StripedHyena 7B — an open source model using an architecture that goes beyond Transformers achieving faster performance and longer context. It builds on the lessons learned in past year designing efficient sequence modeling architectures. https://t.co/UGLnfz0Dma

0

4

32

AK

@_akhaliq

8 days

Nvidia presents TiDAR Think in Diffusion, Talk in Autoregression

24

176

2K

Together AI

@togethercompute

1 month

We're excited to host Apriel-1.5-15b-Thinker by @ServiceNow's SLAM labs on Together AI! 👉15B parameters, fits on single GPU 👉On par with Deepseek-R1-0528 and Mistral-Medium-1.2 on the Artificial Analysis Intelligence Index Built by @SathwikTejaswi @ServiceNowRSRCH

1

8

Together AI

@togethercompute

2 months

Breaking: @VFSGlobal x Together AI announce strategic partnership. We’re partnering with VFS Global to scale secure, responsible, and high-performance AI solutions for global mobility. Millions of visa applications. 160+ countries. One mission: faster, more transparent, and

3

2

11

Together AI

@togethercompute

2 months

The Washington Post processes 1.79 billion tokens every month powering "Ask The Post AI" They needed reliable inference without vendor lock-in. Fixed costs. Full model ownership. Together AI's Dedicated Endpoints delivered.

1

8

Hassan

@nutlope

2 months

Announcing ReceiptHero – an app to help people track their finances! It'll take in any receipts you have, extract the total $, and categorize it for you (dining, groceries, utilities, ect). 100% free & open source. Powered by llama 4 on @togethercompute.

16

32

538

Hassan

@nutlope

2 months

I'm building a realtime video analysis app! It takes screenshots every 500ms, sends it to llama 4 on @togethercompute, and streams back the results. I want to extend it to be able to perform actions too (record my screen & send me a text when a video finishes for example).

39

33

438

Together AI

@togethercompute

2 months

Together Instant Clusters, offering ready to use, self-service NVIDIA GPUs, are now Generally Available 🚀

2

4

24

Together AI

@togethercompute

3 months

Building AI agents for complex engineering tasks ≠ building chatbots 🧵 Most AI agents today excel at short, simple tasks. But automating multi-day engineering workflows? That’s a whole different game. At Together AI, we learned this the hard way while optimizing LLM

4

20

68

Together AI

@togethercompute

3 months

Therapeutic AI isn't just "helpful" AI 🧠 @slingshotai_inc built a psychology foundation model that knows when to push back, stay silent, or offer new perspectives. And now - 50,000+ people are getting specialized mental health support.

2

4

27

Together AI

@togethercompute

4 months

🤖OpenAI's open models are here. gpt-oss models just landed on Together AI. Achieves near-parity with o4- mini, trained using o3 techniques. Build anything, deploy anywhere🔥

13

24

112

Drishan Arora

@drishanarora

4 months

A small update - we had more traffic than anticipated. However, the endpoints are now scalable on Together AI for all models, including the 671B MoE. Test out the model here: https://t.co/Od1NXYVBxU (A huge thanks to the folks at @togethercompute for making this happen so

together.ai

671B mixture-of-experts model matching Deepseek R1 performance, 60% shorter reasoning chains, approaching o3 and Claude 4 capabilities

Drishan Arora

@drishanarora

4 months

Today, we are releasing 4 hybrid reasoning models of sizes 70B, 109B MoE, 405B, 671B MoE under open license. These are some of the strongest LLMs in the world, and serve as a proof of concept for a novel AI paradigm - iterative self-improvement (AI systems improving themselves).

4

14

82

Together AI

@togethercompute

4 months

🛡️ VirtueGuard is LIVE on Together AI 🚀 AI security and safety model that screens input and output for harmful content: ⚡ Under 10ms 𝗿𝗲𝘀𝗽𝗼𝗻𝘀𝗲 🎯 𝟴𝟵% 𝗮𝗰𝗰𝘂𝗿𝗮𝗰𝘆 vs 76% (AWS Bedrock) 🧠 𝗖𝗼𝗻𝘁𝗲𝘅𝘁-𝗮𝘄𝗮𝗿𝗲 - adapts to your policies, not just keywords 👇

4

5

24

Together AI

@togethercompute

4 months

We built an open source voice note taking app using our fast Whisper implementation! Check it out -> usewhisper.io https://t.co/t2mMWs4LqS

6

4

59

Hassan

@nutlope

4 months

We now have the fastest speeds for DeepSeek R1 – up to 330 tokens/sec running on B200s! Here it is in action – video is not sped up!

Together AI

@togethercompute

4 months

Together AI Sets a New Bar: Fastest Inference for DeepSeek-R1-0528 We’ve upgraded the Together Inference Engine to run on @NVIDIA Blackwell GPUs—and the results speak for themselves: 📈 Highest known serverless throughput: 334 tokens/sec 🏃‍Fastest time to first answer token:

8

77

Together AI

@togethercompute

4 months

Together AI Sets a New Bar: Fastest Inference for DeepSeek-R1-0528 We’ve upgraded the Together Inference Engine to run on @NVIDIA Blackwell GPUs—and the results speak for themselves: 📈 Highest known serverless throughput: 334 tokens/sec 🏃‍Fastest time to first answer token:

7

14

106

Together AI

@togethercompute

4 months

Kimi K2 is now available on https://t.co/NO0aADGvEz for free!

Together AI

@togethercompute

4 months

🚨MAJOR DROP: Kimi K2 just landed on Together AI 🚀 An open-source 1T parameter model that beats proprietary LLMs in creativity, coding, and tool use while delivering 60-70% cost savings. Built for agents. Priced for scale. 👇

1

9

46

Together AI

@togethercompute

4 months

We just launched a new "dictate" feature on Together Chat powered by our new Whisper model! The video is not sped up – it's really that fast!

4

3

31

Together AI

@togethercompute

4 months

🚀 We just launched speech-to-text APIs designed for real-time applications. Our Whisper V3 Large deployment delivers transcription 15x faster than OpenAI while maintaining full accuracy. Sub-second processing that actually keeps up with conversation speed ⚡

4

52

Together AI

@togethercompute

5 months

Announcing DeepSWE 🤖: our fully open-sourced, SOTA software engineering agent trained purely with RL on top of Qwen3-32B. DeepSWE achieves 59% on SWEBench-Verified with test-time scaling (and 42.2% Pass@1), topping the SWEBench leaderboard for open-weight models. Built in

8

81

496

Together AI

@togethercompute

5 months

Together AI’s first GB200 cluster built by Dell!

Michael Dell 🇺🇸

@MichaelDell

5 months

Good morning

3

10

107