Ekagra Ranjan Profile
Ekagra Ranjan

@EkagraRanjan

Followers
150
Following
310
Media
3
Statuses
28

LLM Inference & Efficiency @cohere • Ex-@Microsoft • Open Source @PyTorch • Intern @IiscNLP (MALL Lab, IISc), @IITKgp • B. Tech @IITGuwahati • Machine Learning

Joined March 2019
Don't wanna be here? Send us removal request.
@EkagraRanjan
Ekagra Ranjan
5 months
RT @ArtificialAnlys: Cohere has launched Command A, a 111B parameter dense model that represents a huge leap from their previous Command R/….
0
23
0
@EkagraRanjan
Ekagra Ranjan
1 year
Forgot to mention it the plot. The unit of time is in seconds.
0
0
5
@EkagraRanjan
Ekagra Ranjan
1 year
Outlines is a fantastic open source project. It uses a novel idea to prebuild an FSM and pay the cost upfront and not during sampling. We used Outlines as a starting point and heavily replaced, reimplemented and optimized multiple components to arrive at a much faster solution.
2
0
9
@EkagraRanjan
Ekagra Ranjan
1 year
The above benchmark was done a JSON schema with random keys and values as string.
2
0
8
@EkagraRanjan
Ekagra Ranjan
1 year
Time to build JSON schema is proportinal to the tokenizer size. 256k is the tokenizer used for Cohere and Outlines. gpt-4o-mini was used for OpenAI so probably tokenizer size is 200k if not more as per tiktoken for gpt-4o.
1
1
10
@EkagraRanjan
Ekagra Ranjan
1 year
Building artefacts for JSON Schema is expensive to do on scale in production for LLM inference. Probably why OpenAI had "JSON mode" for a year but not "JSON schema mode" until @cohere released it first :).* @cohere is >10x faster than OpenAI.* @cohere is >45x faster than Outlines
Tweet media one
Tweet media two
1
21
100
@EkagraRanjan
Ekagra Ranjan
1 year
RT @sarahookr: Congrats to Sudip's team who released this a few weeks ago :). structured outputs are now in vogue -- and for good reason. A….
0
5
0
@EkagraRanjan
Ekagra Ranjan
1 year
RT @DeanCarignan: Exciting to see more models supporting structured outputs. A big win for all developers who need to integrate LLMs into s….
0
3
0
@EkagraRanjan
Ekagra Ranjan
1 year
RT @cohere: Response_format=json_object is here!. Command R models now support Structured Outputs for JSON. This forces the model to genera….
0
24
0
@EkagraRanjan
Ekagra Ranjan
1 year
The thing I have been working on hard in the last few months at Cohere is finally out!!.
@nickfrosst
Nick Frosst
1 year
@cohere just shipped json schema sampling! . Now not only can you guarantee that the model returns valid json, you can actually ensure it returns json with a specific format!. Big win for people actually building with LLMs :) .
Tweet media one
4
4
78
@EkagraRanjan
Ekagra Ranjan
1 year
RT @Teknium1: Pretty incredible, grats @cohere.
0
16
0
@EkagraRanjan
Ekagra Ranjan
1 year
RT @virattt: Cmd R+ beats Sonnet at financial RAG. I initially assumed these models were equivalent due to pricing. However, command r+ wa….
0
38
0
@EkagraRanjan
Ekagra Ranjan
1 year
RT @cohere: Today, we’re introducing Command R+: a state-of-the-art RAG-optimized LLM designed to tackle enterprise-grade workloads and spe….
0
192
0
@EkagraRanjan
Ekagra Ranjan
1 year
RT @aidangomez: ⌘-R. Introducing Command-R, a model focused on scalability, RAG, and Tool Use. We've also released the weights for research….
Tweet card summary image
cohere.com
Command R is a scalable generative model targeting RAG and Tool Use to enable production-scale AI for enterprise.
0
182
0
@EkagraRanjan
Ekagra Ranjan
3 years
RT @KaustubhPriye: We are doing research on healthcare and Looking to connect with folks who have spent more than 50k out of their own pock….
0
4
0
@EkagraRanjan
Ekagra Ranjan
5 years
I have a p-value joke, but I am not confident about it.
@docmilanfar
Peyman Milanfar
5 years
I have a mean joke, but it doesn't meet expectations.
0
0
5
@EkagraRanjan
Ekagra Ranjan
5 years
I have a joke about reading papers, but you want the Github repo first.
@EmilyRiederer
Emily Riederer
5 years
I have a joke about proofs, but it’s left as an exercise for the reader.
0
0
2
@EkagraRanjan
Ekagra Ranjan
5 years
I have a deep learning joke, but it was already proposed by someone in the 1980s.
@kareem_carr
Kareem Carr, Statistics Person
5 years
I have a deep learning joke but it has a lot of layers to it.
0
0
2
@EkagraRanjan
Ekagra Ranjan
5 years
I have a Graph joke, but you won't be able to connect it.
@ChrSzegedy
Christian Szegedy
5 years
I have a computer vision joke, but don't see the point of it.
1
0
1
@EkagraRanjan
Ekagra Ranjan
5 years
I have an RNN joke, but I have an RNN joke.
@kareem_carr
Kareem Carr, Statistics Person
5 years
I have a deep learning joke but it has a lot of layers to it.
0
1
2