
Simon Willison
@simonw
Followers
94K
Following
60K
Media
3K
Statuses
56K
Creator @datasetteproj, co-creator Django. PSF board. Hangs out with @natbat. He/Him. Mastodon: https://t.co/t0MrmnJW0K Bsky: https://t.co/OnWIyhX4CH
Joined November 2006
Here's the spiciest detail from the new o1 system card:
The updated OpenAI o1 system card builds on prior safety work, detailing robustness evals, red teaming insights, and safety improvements using Instruction Hierarchy. It maintains a "medium" risk rating based on testing with an expanded suite of evaluations, reflecting it is safe.
87
436
5K
To test this out for yourself, open a tab on a Google site and paste this into the Chrome DevTools console:. chrome.runtime.sendMessage('nkeimhogjdpnpccoofpliimaahmaaome', {method: 'cpu.getInfo'}, response => {console.log(JSON.stringify(response, null, 2));});
So, Google Chrome gives all *.google.com sites full access to system / tab CPU usage, GPU usage, and memory usage. It also gives access to detailed processor information, and provides a logging backchannel. This API is not exposed to other sites - only to *.google.com.
62
682
4K
@d_feldman It's got me to the point where I can read Spanish language news articles and understand ~80% of them - spoken Spanish is much harder.
21
28
2K
I tried this out yesterday and it's incredible: download a 4GB binary, child 755 it and now you have a full LLM and the software needed to run it ready to go, with multiplied operating system platforms supported by that single file.
I spent the last month building llamafile which is the fattest executable file format ever. It lets you turn LLM weights into runnable llama.cpp binaries using cosmo libc. Blog post:
14
179
2K
Honestly, at this point If you give me a programming interview and don't let me use AI assistance you won't get a very realistic idea of what I'm actually capable of.
Coding interviews should be using ai. If you disagree you literally don’t understand what ai is for. Sure you can use it to fill gaps & find easy answers. More importantly you can use it to accelerate and do much bigger things. Failing to select for this skill will ruin tech co’s.
71
130
2K
OK this is really cool: Nico built an OpenAI action/GPT which breaks packages from PyPI up into <9.5MB chunks and returns them to ChatGPT in a way that lets it save them to disk. and then pip install them!. I didn't know actions could do that, docs here:
@ChatGPTapp If you want to try pip installing a package, here's the link
4
41
367
This is really grim, if not entirely unexpected: apparently the Instagram mobile app injects additional JavaScript into every page that's loaded using the in-app embedded browser - here's the tool @KrauseFx built to track changes made to the DOM when loading a page
22
599
2K
Hidden at the bottom of this announcement:. "By switching to the new gpt-4o-2024-08-06, developers save 50% on inputs ($2.50/1M input tokens) and 33% on outputs ($10.00/1M output tokens) compared to gpt-4o-2024-05-13.". That's a pretty substantial price decrease!.
Introducing Structured Outputs! A huge leap beyond JSON mode, solving a major challenge for developers. If your app relies on a specific JSON format to drive the UI, our models now match your schema—every time. No more missing keys or hallucinated enums!.
15
48
626
Whoa. runs a full Debian VM entirely in your browser via WebAssembly. and it ships with working Perl, Python, Ruby and Node.js!.
We have made a server-less virtual Linux environment that runs unmodified Debian binaries in the browser. This is powered by CheerpX, a WebAssembly virtualization platform. Feel free to play with it and report bugs:.
25
355
1K
Notes on how I ran Facebook's 7B LLaMA model on my 64GB M2 MacBook Pro using llama.cpp by @ggerganov. It's genuinely possible to run a LLM that's hinting towards the performance of GPT3 on your own hardware now!. I thought that was still a few years away.
37
233
1K
Corollary to this: you can have an outsized impact on the world by being one of the few people who DO publish fresh information online on your own web pages. Blog like it's 2005!.
A decade ago I felt like I could find anything on the Web. Now I feel like I can barely find anything. People just don't put information on web pages anymore.
29
166
1K
PostgreSQL 14 adds new syntax for accessing JSON data:. SELECT *.FROM shirts .WHERE details['attributes']['color'] = '"neon yellow"'. I like this so much more than the -> operators, which stubbornly refuse to stick in my head.
Y'all this new JSON subscript syntax in Postgres 14 is sweet. Super excited to see Postgres just getting better bit by bit -
22
234
1K
If you haven't tried Claude yet it's a absolutely worth spending time with - I lean on it a lot for working with longer documents, since it can handle 100,000 tokens (GPT-4 is only 8,000) at a time. Plus you can upload PDFs to it - I've used it with 100+ page documents.
We’re rolling out access to to more people around the world. Starting today, users in 95 countries can talk to Claude and get help with their professional or day-to-day tasks. You can find the list of supported countries here:
55
88
1K
TIL that Claude Code increases the "thinking" budget to maximum if you tell it to "ultrathink", which is a step up from "megathink", which is a step up from "think" - via
It’s definitely worth reading this post for anyone using Claude code. TIL that the word “ultrathink” will result in maximum thinking . Lots of other great tips in here .
34
86
1K
These papers are fascinating, but my favorite thing about them is they aren't PDFs! They're glorious mobile-friendly web pages with interactive diagrams. I hope everyone else who publishes papers takes note, this us a much better way to share research
For more, read our papers:. On the Biology of a Large Language Model contains an interactive explanation of each case study: Circuit Tracing explains our technical approach in more depth:
20
115
1K
Love this idea that the reason voice assistants don't seem to stick for most people is that they're actually command line interfaces, but even less discoverable because they don't provide any visible feedback at all.
@daviddlow @charlesarthur @benedictevans I've droned on endlessly about how you can't expect normal people to use the command line. That's what Alexa is. If you don't say the *precise* invocation correctly, you get an error. And because there's no display, you have to remember dozens of different commands. It's too hard.
25
213
941
Love that we live in a time where "your software got lazier" is a legit piece of feedback.
we've heard all your feedback about GPT4 getting lazier! we haven't updated the model since Nov 11th, and this certainly isn't intentional. model behavior can be unpredictable, and we're looking into fixing it 🫡.
19
74
1K
I really like Drew's framework dividing current AI use-cases into Gods (human replacement), Interns (assistants you delegate closely-reviewed tasks to) and Cogs (smaller tools that can more reliably serve a single purpose, like Whisper) - more notes here
15
110
1K
Looks like the reason that letter only had 500 signatures out of 770 might be that the rest of the company were asleep.
About 650 / 770 signed at this moment. As people start waking up, more will come. All the efforts started after 1:30 AM, 500+ within two hours and all of this after 2 crazy days with very little sleep.
16
50
941
Maybe the solution to the Fermi paradox is that significantly advanced civilizations discover crypto currencies and then furiously burn through all available energy sources until they go extinct
Uhhh. about bitcoin. it's actually ruining the planet. The bitcoin computer network currently uses as much electricity as Denmark. In 18 months, it will use as much as the entire United States. Something's gotta give. This simply can’t continue.
28
401
821
@moorehn @dancow @sewellchan @guardian I'm fascinated by their use of the term "build" - they talk about building a lot, took me a while to realize that their version of building is funneling money into speculative investments and convincing others to do the same.
19
26
845