Philip Monk @pcmonk X Profile

Philip Monk

@pcmonk

Followers

2K

Following

2K

Media

73

Statuses

2K

A man alive, walking on two legs about the world. Infra lead @essential_ai

https://t.co/X6lS8oeI5x

San Francisco, CA

Joined January 2012

Don't wanna be here? Send us removal request.

ollama

@ollama

4 days

.@essential_ai's rnj-1 model is now on Ollama! ollama run rnj-1 8B parameter, open-weight dense model trained from scratch. The model is optimized for code and STEM with capabilities on par with other state of the art open-weight models. Let's go! 🚀🚀🚀

9

33

244

Philip Monk

@pcmonk

7 days

It's open weights and a very convenient size to run locally, btw. I get 20 tok/s on an M3 mac with llama.cpp.

0

6

Philip Monk

@pcmonk

7 days

It's been a blast to lead the infrastructure effort to train this model. I'm excited to see it out in the world!

Ashish Vaswani

@ashVaswani

7 days

We are beyond thrilled to share our first flagship models, Rnj-1 base and instruct 8B parameter models. Rnj-1 is the culmination of 10 months of hard work by a phenomenal team, dedicated to advancing American SOTA OSS AI. Lots of wins with Rnj-1. 1. SWE bench performance close

1

0

13

Essential AI

@essential_ai

7 days

Today, we’re excited to introduce Rnj-1, @essential_ai's first open model; a world-class 8B base + instruct pair, built with scientific rigor, intentional design, and a belief that the advancement and equitable distribution of AI depend on building in the open. We bring

37

153

1K

Philip Monk

@pcmonk

18 days

You all have it so easy today with your petaflop gpus. In my day we had *floppy disks* that could only handle a few hundred kiloflops/s

1

0

Philip Monk

@pcmonk

2 months

It finally happened: I ran into a bug that rust would have caught

Philip Monk

@pcmonk

4 months

The things that are hard about ml infra are not things that rust solves

0

2

Essential AI

@essential_ai

3 months

[1/2] We at Essential are driven by mission to advance fundamental research guided by first principles, rigor and sharing research openly.

1

10

31

Philip Monk

@pcmonk

4 months

of course, PL guys are often wrong

0

3

Philip Monk

@pcmonk

4 months

If you're a PL guy who engages seriously with the problem the answer you come to is Jax and/or torch.compile, not rust

1

0

6

Philip Monk

@pcmonk

4 months

The things that are hard about ml infra are not things that rust solves

mattparlmer 🪐 🌷

@mattparlmer

4 months

The fact that Python is the standard for machine learning is a serious indictment of the field’s engineering standards

3

0

8

Philip Monk

@pcmonk

5 months

Protip: function in the presence of uncertainty in your own mind. Your tweets will be worse but you will be more aligned with reality

Emmett Shear

@eshear

5 months

Complete certainty is impossible. So for any belief it’s always possible to be wrong. The Sun might not rise tomorrow. Yet at some point, in order to function we must round the chance of error to zero on many beliefs. On what principled basis might we decide to do this?

0

1

Essential AI

@essential_ai

6 months

[1/5] 🚀 Meet Essential-Web v1.0, a 24-trillion-token pre-training dataset with rich metadata built to effortlessly curate high-performing datasets across domains and use cases!

12

54

301

Philip Monk

@pcmonk

7 months

We've always been reaching for the glory of the previous generation

1

0

2

Philip Monk

@pcmonk

7 months

Leaving aside the main point of the post, I think people underestimate the degree to which the previous generation was influenced by stories of the one before that. There are more total stories now, but the number that affects any particular person is probably not any higher

dax

@thdxr

7 months

that jony and sam video has me thinking about something - i'll try to explain it the previous generation of silicon valley there were not yet too many stories of silicon valley people did the things they were trying to do, went through twists and turns and then later the

1

0

3

Philip Monk

@pcmonk

7 months

This was a super interesting project to work on. It showed up when we started using muon at larger scales.

Essential AI

@essential_ai

7 months

Muon is a serious competitor to AdamW, but it's tricky to scale up. Our infra team has made fundamental advancements in parallelizing Muon on large scale distributed clusters. We're extremely happy with the result and it's now a part of our pretraining pipeline. 🔗Check out

1

0

5

Philip Monk

@pcmonk

7 months

Update: it doesn't actually know the score

0

4

Philip Monk

@pcmonk

7 months

Avoiding spoilers is pretty difficult in general, but I was pleasantly surprised this just works

1

0

4

Philip Monk

@pcmonk

8 months

I don't know who needs to hear it, but for 100s of years there has been a significant tribe that believes basically this. Something in their worldview hits inf or nan and they start believing the end of the world is in the next 10 years. Reject it, don't let it make you impotent.

1

3

22

Philip Monk

@pcmonk

8 months

I'm more pro-civilization than anything else. But even though the Louvre is a valuable part of our civilization, it's much less valuable than what generally efficient allocation of capital gives us.

0

1

Philip Monk

@pcmonk

8 months

This and the related thread is a good example of precisely where I differ from a lot of trads. I don't actually think the Colossus or Louvre was/is more interesting than what finance and tech has brought us.

Simon Sarris

@simonsarris

8 months

I guess my own problem is that I think the most interesting things in the world were a result of *inefficient* allocation of capital. You don't build the Colossus of Rhodes or the Louvre - or almost everything inside of the Louvre - by expecting a return on investment.

1

0

3