avian_io Profile Banner
Avian.io Profile
Avian.io

@avian_io

Followers
263
Following
33
Media
2
Statuses
41

Worlds Fastest Inference for DeepSeek R1 at 351 tokens per second. Dedicated endpoints + Serverless APIs available.

New York, USA
Joined July 2022
Don't wanna be here? Send us removal request.
@avian_io
Avian.io
9 months
4X DeepSeek R1 671B Inference: As we prepare for DeepSeek R2, Avian presents 351 output tokens per second on DeepSeek R1 per user on NVIDIA B200 at FP4 precision. Presented to you in collaboration with @Vultr Cloud GPU, who provided the NVIDIA HGX B200 hardware. Keep your
2
4
13
@Vultr
Vultr
9 months
💥 GenAI inference just hit warp speed: @nvidia Blackwell hits 351 tokens/sec on DeepSeek R1 (FP4), and @avian_io hits 351/sec per user on Vultr Cloud GPU with the B200. High-throughput inference is here – and Llama 4 Maverick is next. #GenAI #AIinference #NVIDIA #CloudGPU
0
2
3
@avian_io
Avian.io
10 months
Learn how https://t.co/hZeVXLVPLP achieved a world record 303 tokens per second on DeepSeek R1 using TensorRT-LLM and NVIDIA Blackwell B200 in our technical blog https://t.co/Vz6IOQxKi1
1
22
71
@NVIDIAAIDev
NVIDIA AI Developer
10 months
NVIDIA Blackwell sets a new benchmark with 303 Tokens/s for DeepSeek R1 in FP4 precision. 👀 🎉 Huge congrats to NVIDIA Inception partner @avian_io on this impressive achievement which showcases major ecosystem breakthrough, leveraging NVIDIA Blackwell and our open accelerated
@ArtificialAnlys
Artificial Analysis
10 months
NVIDIA Blackwell can achieve 303 output tokens/s for DeepSeek R1 in FP4 precision, per our benchmarking of an Avian API endpoint Artificial Analysis benchmarked DeepSeek R1 on an @avian_io private API endpoint. Running DeepSeek R1 in FP4 precision on NVIDIA Blackwell, their
5
15
105
@ArtificialAnlys
Artificial Analysis
10 months
NVIDIA Blackwell can achieve 303 output tokens/s for DeepSeek R1 in FP4 precision, per our benchmarking of an Avian API endpoint Artificial Analysis benchmarked DeepSeek R1 on an @avian_io private API endpoint. Running DeepSeek R1 in FP4 precision on NVIDIA Blackwell, their
5
14
195