Ekagra Ranjan @EkagraRanjan X Profile

Ekagra Ranjan

@EkagraRanjan

Followers

150

Following

310

Media

3

Statuses

28

LLM Inference & Efficiency @cohere • Ex-@Microsoft • Open Source @PyTorch • Intern @IiscNLP (MALL Lab, IISc), @IITKgp • B. Tech @IITGuwahati • Machine Learning

Joined March 2019

Don't wanna be here? Send us removal request.

Ekagra Ranjan

@EkagraRanjan

5 months

RT @ArtificialAnlys: Cohere has launched Command A, a 111B parameter dense model that represents a huge leap from their previous Command R/….

0

23

0

Ekagra Ranjan

@EkagraRanjan

1 year

Forgot to mention it the plot. The unit of time is in seconds.

0

5

Ekagra Ranjan

@EkagraRanjan

1 year

Outlines is a fantastic open source project. It uses a novel idea to prebuild an FSM and pay the cost upfront and not during sampling. We used Outlines as a starting point and heavily replaced, reimplemented and optimized multiple components to arrive at a much faster solution.

2

0

9

Ekagra Ranjan

@EkagraRanjan

1 year

The above benchmark was done a JSON schema with random keys and values as string.

2

0

8

Ekagra Ranjan

@EkagraRanjan

1 year

Time to build JSON schema is proportinal to the tokenizer size. 256k is the tokenizer used for Cohere and Outlines. gpt-4o-mini was used for OpenAI so probably tokenizer size is 200k if not more as per tiktoken for gpt-4o.

1

10

Ekagra Ranjan

@EkagraRanjan

1 year

Building artefacts for JSON Schema is expensive to do on scale in production for LLM inference. Probably why OpenAI had "JSON mode" for a year but not "JSON schema mode" until @cohere released it first :).* @cohere is >10x faster than OpenAI.* @cohere is >45x faster than Outlines

1

21

100

Ekagra Ranjan

@EkagraRanjan

1 year

RT @sarahookr: Congrats to Sudip's team who released this a few weeks ago :). structured outputs are now in vogue -- and for good reason. A….

0

5

0

Ekagra Ranjan

@EkagraRanjan

1 year

RT @DeanCarignan: Exciting to see more models supporting structured outputs. A big win for all developers who need to integrate LLMs into s….

0

3

0

Ekagra Ranjan

@EkagraRanjan

1 year

RT @cohere: Response_format=json_object is here!. Command R models now support Structured Outputs for JSON. This forces the model to genera….

0

24

0

Ekagra Ranjan

@EkagraRanjan

1 year

The thing I have been working on hard in the last few months at Cohere is finally out!!.

Nick Frosst

@nickfrosst

1 year

@cohere just shipped json schema sampling! . Now not only can you guarantee that the model returns valid json, you can actually ensure it returns json with a specific format!. Big win for people actually building with LLMs :) .

4

78

Ekagra Ranjan

@EkagraRanjan

1 year

RT @Teknium1: Pretty incredible, grats @cohere.

0

16

0

Ekagra Ranjan

@EkagraRanjan

1 year

RT @virattt: Cmd R+ beats Sonnet at financial RAG. I initially assumed these models were equivalent due to pricing. However, command r+ wa….

0

38

0

Ekagra Ranjan

@EkagraRanjan

1 year

RT @cohere: Today, we’re introducing Command R+: a state-of-the-art RAG-optimized LLM designed to tackle enterprise-grade workloads and spe….

0

192

0

Ekagra Ranjan

@EkagraRanjan

1 year

RT @aidangomez: ⌘-R. Introducing Command-R, a model focused on scalability, RAG, and Tool Use. We've also released the weights for research….

cohere.com

Command R is a scalable generative model targeting RAG and Tool Use to enable production-scale AI for enterprise.

0

182

0

Ekagra Ranjan

@EkagraRanjan

3 years

RT @KaustubhPriye: We are doing research on healthcare and Looking to connect with folks who have spent more than 50k out of their own pock….

0

4

0

Ekagra Ranjan

@EkagraRanjan

5 years

I have a p-value joke, but I am not confident about it.

Peyman Milanfar

@docmilanfar

5 years

I have a mean joke, but it doesn't meet expectations.

0

5

Ekagra Ranjan

@EkagraRanjan

5 years

I have a joke about reading papers, but you want the Github repo first.

Emily Riederer

@EmilyRiederer

5 years

I have a joke about proofs, but it’s left as an exercise for the reader.

0

2

Ekagra Ranjan

@EkagraRanjan

5 years

I have a deep learning joke, but it was already proposed by someone in the 1980s.

Kareem Carr, Statistics Person

@kareem_carr

5 years

I have a deep learning joke but it has a lot of layers to it.

0

2

Ekagra Ranjan

@EkagraRanjan

5 years

I have a Graph joke, but you won't be able to connect it.

Christian Szegedy

@ChrSzegedy

5 years

I have a computer vision joke, but don't see the point of it.

1

0

1

Ekagra Ranjan

@EkagraRanjan

5 years

I have an RNN joke, but I have an RNN joke.

Kareem Carr, Statistics Person

@kareem_carr

5 years

I have a deep learning joke but it has a lot of layers to it.

0

1

2