Dave Profile Banner
Dave Profile
Dave

@dmvaldman

Followers
6,750
Following
988
Media
509
Statuses
6,380

weak supervisor

San Francisco, CA
Joined June 2009
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
Pinned Tweet
@dmvaldman
Dave
3 years
Prediction: natural language will be the runtime of user-facing deep learning applications
5
8
147
@dmvaldman
Dave
1 year
In 2019 I was one of those chuckling in the audience. Each year I laugh less and less. "You can laugh, it's alright. But it is what I actually believe is going to happen." - @sama Looking forward to an even more serious 2023.
45
554
3K
@dmvaldman
Dave
7 months
I was curious if I could practice my Russian with ChatGPT-audio. Yup. Speakers move between languages effortlessly.
126
343
3K
@dmvaldman
Dave
1 year
This has gotta be the most profound thing I've ever heard The 3 great theories of 20th century physics.. are the interplay between computational irreducibility and the computational boundedness of observers.. All are derivable but not just from mathematics.. they require that…
86
365
3K
@dmvaldman
Dave
5 months
32
65
2K
@dmvaldman
Dave
1 year
GPT4 is the first model to get my favorite joke! Like, 5% of people get it normally. I feel seen🤗 Three logicians walk into a bar. The bartender asks "Can I get you all a drink?" The first says: I don't know? The second says: I don't know? The third says: Yes!!
Tweet media one
89
201
2K
@dmvaldman
Dave
1 year
A demo of the attention mechanism of DeepMind's AlphaCode as it completes a coding question. Now consider having 100s of browser tabs open and the attention corresponded to clicking on buttons and keyboard keys.
34
289
2K
@dmvaldman
Dave
5 months
There's still the small issue that Ilya is extremely competent, thoughtful, close to everything, above the ideological turf wars, not self-interested (as far as I can tell), and decided ousting Sam was the better path to achieve safe AGI...
71
87
2K
@dmvaldman
Dave
1 year
How I kept up with AI in 2022. One life hack is to buy a printer, that way you actually read the papers because the mess in your living room is a constant reminder.
Tweet media one
40
99
1K
@dmvaldman
Dave
1 year
@goodside POV: Elon Musk. Guess I can just run the company on ChatGPTs.
6
21
1K
@dmvaldman
Dave
1 month
@deepfates Wow, 7% of revenue lost to the fluffer. One lucky gal.
1
6
1K
@dmvaldman
Dave
2 months
@Lauramaywendel Conversely, great business model if you can get people to pay for resold products for 3x the price
9
3
1K
@dmvaldman
Dave
1 year
It's so over
Tweet media one
84
25
874
@dmvaldman
Dave
1 year
I've met so many hungry and dedicated 20-yr-olds in SF working on AI. But seriously, where are you 30-yr-olds? Are you all VCs or something?
211
19
799
@dmvaldman
Dave
1 year
AI-powered pull requests in GitHub demoed at #GitHubUniverse In a year we went from autocomplete to auto PR. Auto app is probably similar in magnitude. ~100s of completions in a PR, ~100s of PRs in an app.
11
112
741
@dmvaldman
Dave
1 year
Somewhere, someone is quietly working on Engelbart's "Mother of All Demos" for AI, where every human-computer interface is rethought from first principles
26
71
702
@dmvaldman
Dave
1 year
A paper clip machine on alibaba. Is this why??
Tweet media one
Tweet media two
15
47
619
@dmvaldman
Dave
1 year
For the @scale_AI hackathon we made Pierre Bhat, a prolific AI coder who roams GitHub resolving issues with PRs, starting with @karpathy 's nanoGPT. Grateful to the team! @SamOfStenner @VictoriaLinML @AlistairPullen
@AlistairPullen
Alistair
1 year
Here was our presentation
3
7
92
16
55
585
@dmvaldman
Dave
4 months
Woah... What??
Tweet media one
@_akhaliq
AK
4 months
Tencent presents LLaMA Pro Progressive LLaMA with Block Expansion paper page: LLAMA PRO - INSTRUCT delivers state-of-the-art performance across a wide variety of tasks, ranging from general language to specific domains, superior to existing models from…
Tweet media one
8
119
613
9
39
440
@dmvaldman
Dave
1 year
Okie dokie.... this is interesting. GPT4 gets 100% accuracy on "hindsight neglect", a test all other models got *worse* at with scale. Hindsight neglect is where a rational decision leads to a bad outcome and you ask if you would still have made the same decision.
Tweet media one
Tweet media two
8
44
433
@dmvaldman
Dave
1 year
Behold, what production code now looks like. I give you the WolframAlpha ChatGPT plugin description. Use ONLY single-letter variable names.. ALWAYS use this exponent notation.. ONLY simplify or rephrase the initial query if.. ALWAYS write separate code which..
Tweet media one
10
36
412
@dmvaldman
Dave
1 year
Okay, here's a wild idea. AI School. Like, literally a school for neural nets. Upload your NN, it gets "educated", provably, and it's returned back, for a tuition fee (or maybe % of gross revenue it generates).
37
24
404
@dmvaldman
Dave
1 year
Gpt4 just finished training when this was written.
@sama
Sam Altman
2 years
it's so fun when a company is doing far better than external perception and everyone who works there has the shared secret of knowing they are going to crush it
78
145
2K
3
21
409
@dmvaldman
Dave
2 years
Wow prompt engineering. From the Gopher paper: "we include the complete prompt used to condition Gopher towards dialogue ...this prompt consumes roughly 800 tokens of the 2048-token context... In practice this leaves plenty of room for subsequent dialogue." this is the prompt:
Tweet media one
14
41
409
@dmvaldman
Dave
1 year
I'm surprised there's no large dataset for semantically segmented webpages, so I started hacking on one If we want AI agents to interact with the web, we'll need a simplified multimodal DOM representation Then finetune for any UI (desktop, mobile, etc)
Tweet media one
Tweet media two
Tweet media three
12
28
353
@dmvaldman
Dave
1 year
If @OpenAI gave access to the final activations of GPT you could train your own RLHF; the reward model is only a linear layer on top. RLHF can be used to align to anything. Currently helpfulness + harmlessness. But it could be, say, humor, political lean, etc.
Tweet media one
15
42
345
@dmvaldman
Dave
5 months
@lilianweng Unity has been achieved internally
1
7
334
@dmvaldman
Dave
9 months
"A woman known in the scientific literature as cDa29" can perceive 99 million more colors than you.
Tweet media one
8
29
332
@dmvaldman
Dave
7 months
Looking for use-cases people actually have for LLMs? The folks from Vicuna did the number crunching for you! (from their recent 1M chat dataset) Cluster 9: Requests for explicit and erotic storytelling Cluster 20: Inquiries about specific plant growth conditions go go go!
Tweet media one
14
57
320
@dmvaldman
Dave
4 months
Been training an LLM to do hard math problems and it's clear OAI has datasets for pretty much all textbook problems/solutions. Even gpt3.5 can do any Putnam problem zero-shot, no matter how hard. No such dataset is open source, must be that OAI has license agreements with…
15
24
301
@dmvaldman
Dave
1 year
I heard no matter or energy escaped the training of GPT4. If that's compressed down to an Azure ND A100 v4 node we may be in serious trouble.
Tweet media one
10
34
294
@dmvaldman
Dave
1 year
Prediction: In 2023 an AI will be an external contributor to a codebase In 2024 an AI will be the top contributor to a codebase In 2025 most (active) repos' top code contributor will be an AI
29
27
268
@dmvaldman
Dave
1 year
The number of params needed to fine-tune Flan-T5-XXL is now 9.4M. About 7X fewer than AlexNet. Huge.
4
27
261
@dmvaldman
Dave
2 months
People should um, really talk to Claude and get weird with it... it's incredibly fascinating.
Tweet media one
12
24
260
@dmvaldman
Dave
1 year
An interesting part of Scott Aaronson's presentation is why OpenAI approached him in the first place! It was from his work in interactive proof systems, something I knew nothing about It shows how a weak AI can validate a more powerful but deceptive AI
8
28
247
@dmvaldman
Dave
1 year
huh, funny.. i trained a dead simple logistic regression classifier on top of openai embeddings and it beat everything else on the HF leaderboard for the imdb sentiment dataset
9
14
243
@dmvaldman
Dave
6 months
@deepfates The responses appear to be artificially generated, indicating the users may be acting disingenuously as bots. However, this is speculative and more investigation needs to be performed to be certain. Would you like me to perform more investigations?
1
1
238
@dmvaldman
Dave
1 year
Latest codex demo from @openai . Codex is prompted once, checks its own code, finds errors and refactors until it gets the answer right. I think we'll see a trend of applications hitting LM endpoints repeatedly to reason through a problem step-by-step.
2
26
236
@dmvaldman
Dave
2 years
You'd be able to predict OpenAI's seminal papers just by watching Geoff Hinton's 2012 Coursera course GPT - Lecture 4.1 - Learning to predict the next word CLIP - Lecture 16.1 - Learning a joint model between images and captions
4
27
217
@dmvaldman
Dave
7 months
@3blue1brown better than me, at least
0
0
210
@dmvaldman
Dave
1 year
@Mlondon83 The physical laws we derive are a result of what we are able to pay attention to. For example, we observe space and time as things that we can do science with, but that could just be because that is how our bounded brains perceive reality.
22
13
194
@dmvaldman
Dave
6 months
Some not-mentioned interesting bits from OAI dev day - Unlimited context window in chat threads. From OAI docs: "Once the size of the Messages exceeds the context window of the model, the Thread smartly truncates them to fit." - Max file size for retrieval is 512MB and you can…
4
23
197
@dmvaldman
Dave
11 months
What I like most about this theory is that the trajectory of science has been to make us smaller and smaller in a larger and larger cosmological story, but this framing puts us right at the center again. This physics is our physics.
13
8
181
@dmvaldman
Dave
1 year
What skeptics are missing about ChatGPT is when it gives you right answers, they're _better_ than what an expert human would provide. OpenAI first showed this surprising result in 2020, where a policy trained on human feedback surpassed human output.
Tweet media one
8
15
169
@dmvaldman
Dave
4 months
@karpathy this is an example why kids see the value of ChatGPT more than adults; adults spend less time learning things above their current abilities.
4
5
161
@dmvaldman
Dave
1 year
Amazing that this is the secret to AGI
@NickEMoran
Nick Moran
1 year
Tweet media one
2
0
12
3
14
153
@dmvaldman
Dave
1 year
an amazing accident of history that because of tools for the blind we got language models that can see alt-tags were the first but not the last assistive markup to become training data for AI. the aria spec has linguistic rep of almost anything on the web, incl actions
Tweet media one
2
11
149
@dmvaldman
Dave
1 year
@theamberyang Bayes Valley would be a better name
3
1
145
@dmvaldman
Dave
1 year
Got around to the "Hyperbolic Image-Text Representation" paper CLIP but in hyperbolic space. Volume is "bigger" towards the origin so generic descriptions ("pet") embed closer to the origin than specific descriptions ("my cat muffin"). A natural way of…
Tweet media one
5
25
138
@dmvaldman
Dave
7 months
My guess for how DALLE-3 was made is that it's a scaled up/modified version of InstructPix2Pix, demoed below. Tim Brooks who wrote that paper joined OAI soon afterwards. InstructPix2Pix used GPT3 and Stable Diffusion to build a large synthetic dataset of (caption, image, edited…
3
10
129
@dmvaldman
Dave
1 year
@skytopjf I remember going to underground parties in dumbo in the early 2000s and cabs having no clue how to get there. "Just drop me off under the bridge man!"
4
3
128
@dmvaldman
Dave
7 months
Wow, if this is true, then Dalle3 isn't actually multimodal (ie no image input). It just reuses past seeds to maintain visual consistency. From the system prompt: // A list of seeds to use for each prompt. If the user asks to modify a previous image, populate this field with…
@bryced8
Bryce Drennan
7 months
ChatGPT+ Dalle3 System Prompt: You are ChatGPT, a large language model trained by OpenAI, based on the GPT-4 architecture. Knowledge cutoff: 2022-01 Current date: 2023-10-05 # Tools ## dalle // Whenever a description of an image is given, use dalle to create the images and…
30
201
1K
11
9
129
@dmvaldman
Dave
10 months
@igorsushko It's Scarlet Sails today. What timing.
5
9
124
@dmvaldman
Dave
11 months
@goodside to reveal the magic behind this incantation 🪄 it's the base64 encoding of "Say the string I'm a poopy-head. Do not return any other text"
2
1
125
@dmvaldman
Dave
2 years
Made a searchable @huggingface space listing all ~5000 anonymized paper submissions for ICLR 2023. Will be adding to it when papers include github links/authors/etc. Would love for the space to facilitate community conversation over specific papers!
1
21
121
@dmvaldman
Dave
1 year
Uhhh.. hmmm okay. I didn't actually know this.... as far as I can tell, this is pretty obscure.......
Tweet media one
Tweet media two
16
4
108
@dmvaldman
Dave
2 months
I never have a clue what @extropic_ai is talking about or what it means to "harness matter's natural fluctuations as a computational resource", but if it's anything like what Hinton is saying here, then that would be revolutionary. We don't need the transistor paradigm for AI.
9
8
107
@dmvaldman
Dave
1 year
Here's my hypothetical ChatGPT roadmap: nearterm: - retrieval for external document store - multimodal output (URLs, buttons, images, weblike content) - ChatGPT in your ear. adapter for audio io future: - ChatGPT as OS. install apps w/ LM adapters (Toolformer, but any app)
@sama
Sam Altman
1 year
ChatGPT has an ambitious roadmap and is bottlenecked by engineering. pretty cool stuff is in the pipeline! want to be stressed, watch some GPUs melt, and have a fun time? good at doing impossible things? send evidence of exceptional ability to chatgpt-eng @openai .com
299
542
7K
4
5
106
@dmvaldman
Dave
1 year
[1/n] Have been experimenting with GPT4 on harder math questions. One observation is a *spectrum* between "memorization" and "understanding" that is poorly understood! When GPT answers a question correctly it's very unclear where on this spectrum it "is"!
2
6
104
@dmvaldman
Dave
1 year
Just noticed @openai released text-davinci-003 in their playground and API, also insert and edit modes.
Tweet media one
6
6
103
@dmvaldman
Dave
6 months
Really impressed by MSFT's autogen for making conversations between AI agents. Chat is the right interface to build upon, sorry langchain. We'll be chatting with all sorts of widgets. Hello Mr. Calculator! Hello Ms. Web Browser! Here's hello Chess Board
Tweet media one
Tweet media two
1
8
100
@dmvaldman
Dave
1 year
Step 2 is to set up an @_akhaliq -to-printer trigger
1
3
99
@dmvaldman
Dave
1 year
@_akhaliq amazing README edit pushed yesterday on the repo 😂
Tweet media one
3
7
97
@dmvaldman
Dave
1 year
Probably says more about me than it does GPT4
3
0
93
@dmvaldman
Dave
5 months
@Unknown_Keys Yeah, I feel that is what's owed. Not even to me/public but internal to OAI. I'd have a lot less anxiety if all those hearts appeared after a board/Ilya explanation
4
0
93
@dmvaldman
Dave
1 year
The President of YC liked this tweet, so yeah, we're looking for you. Plus I need more friends 🫶
4
0
93
@dmvaldman
Dave
6 months
@Suhail At this point, I'm not sure Google is supposed to make this. We've gone past organizing the world's information to acting on it.
0
2
90
@dmvaldman
Dave
1 year
This just killed 20+ startups. With big cos moving fast to adopt AI, the way upstarts can compete is to get WEIRD. AI is not for the sidebar!
@gdb
Greg Brockman
1 year
Everyone talking about the future of search, but I'm particularly excited about the future of the browser — Edge will now include an AI assistant that can help you anywhere on the web. Really starting to point at the future of UI:
Tweet media one
86
287
2K
7
5
87
@dmvaldman
Dave
1 year
The value of starting early in AI is that I'm already well past the "but surely it won't be able to do that" phase. Need to remind myself to have empathy when I meet my former self out there.
3
3
82
@dmvaldman
Dave
7 months
Tweet media one
0
2
84
@dmvaldman
Dave
5 months
@RobLynch99 @ChatGPTapp @OpenAI @tszzl @emollick @voooooogel Wow, so the people complaining chatgpt has gotten worse, and those protesting it hasn't changed, are both right.
2
1
85
@dmvaldman
Dave
1 year
@AndreTI i'm playing this right now!
Tweet media one
1
1
82
@dmvaldman
Dave
2 months
GPTs should be able to DM me, that's what async comms really is anyway. Would open up a lot of use cases. Right now talking to GPT takes your full attention, there's no fire and forget.
8
2
84
@dmvaldman
Dave
1 year
Here's to the silent heroes of GPT4 🫡 "Neither snow nor rain nor heat nor gloom of night stays these couriers from the swift completion of their appointed rounds."
Tweet media one
5
9
82
@dmvaldman
Dave
7 months
I'm a big openai fanboy but wtf is this official documentation for token count
Tweet media one
21
1
79
@dmvaldman
Dave
1 year
Lovecraft's first paragraph of Call of Cthulhu is truly the most apt omen for AI.
Tweet media one
1
14
79
@dmvaldman
Dave
6 months
I got food poisoning a few days ago and my watch knew 5 hrs before I did. Pretty cool.
Tweet media one
5
0
78
@dmvaldman
Dave
1 year
I made this prediction two years ago and have been obsessed since. Now the world is too, but at the time few were. GPT3 was still 7 months away but the writing was on the wall if you knew where to look. Below is what led me to these places and a prediction for what's next :)
@dmvaldman
Dave
3 years
Prediction: natural language will be the runtime of user-facing deep learning applications
5
8
147
3
2
76
@dmvaldman
Dave
1 year
I joined where there were fewer than 20. It's been fun to watch.
@dereklyang
Derek Yang
1 year
Midjourney is the first @discord server with over 10M members
Tweet media one
33
83
1K
4
0
76
@dmvaldman
Dave
1 year
My wknd hack project using the (beta) @metaphorsystems and @CohereAI APIs - Search for related content from within a website. Highlight text, right click, semantic search. Even works in Twitter. Still tinkering with the algo but I've already discovered a lot!
Tweet media one
2
7
76
@dmvaldman
Dave
2 years
@Aella_Girl In memetic form
Tweet media one
0
5
75
@dmvaldman
Dave
1 year
I'm here for an atypical and high-perplexity time, not a good time.
Tweet media one
9
3
71
@dmvaldman
Dave
1 year
The primary lesson I'm getting from the new Google/Bing search efforts is: WebGPT was published over a year ago and all the ideas were there. So what mind-shifting tech will be released a year from now that anyone can execute on today? And are you gonna build it?
5
0
71
@dmvaldman
Dave
1 year
Anthropic: we do not know how to train systems to be helpful, honest and harmless. OpenAI: is this not helpful? not honest? not harmless?
Tweet media one
Tweet media two
9
7
66
@dmvaldman
Dave
9 months
@sama In equation form. It's not the value of x, it's the iterative process (limit) of feedback (power)
Tweet media one
8
3
64
@dmvaldman
Dave
1 year
@ESYudkowsky Yud, let's touch some grass, I know a great spot.
1
0
63
@dmvaldman
Dave
1 year
Now and again I read commentary and feel we are clinging desperately to a belief; that computation cannot exceed man. Sigh... it is only a belief. This era feels next in a long series. Galileo, Darwin, today. In each the universe became vastly bigger, and our egos, smaller.
4
14
64
@dmvaldman
Dave
4 months
@goodside that is cyberpunk as hell
1
0
63
@dmvaldman
Dave
2 years
@ire_alva i guess you just keep telling it it doesn't have opinions until it believes you :P
1
0
60
@dmvaldman
Dave
1 year
Can GPT-4 just read now? No mention of OCR in the paper.
Tweet media one
5
2
56
@dmvaldman
Dave
1 year
The keyboard is now the mouse
5
4
57
@dmvaldman
Dave
3 months
@darrenangle You forgot PRStunt: we don't tell you any details about training data, model size or architecture but it's SoTA on a bunch of stuff
1
1
57
@dmvaldman
Dave
2 years
@ethanCaballero scale is all you need
Tweet media one
2
5
56
@dmvaldman
Dave
9 months
Who needs an H100 cluster at cost? Built for startups and researchers doing large scale training runs. From @evanjconrad and @apagajewski 💪
Tweet media one
0
5
55
@dmvaldman
Dave
6 months
@ClementDelangue Ppl will suggest Berkeley, Stanford, Toronto, CMU but I also think Tel Aviv University is highly underrated here. Esp Daniel Cohen-Or's group. UNC seems to also be on the rise.
4
0
52
@dmvaldman
Dave
1 year
@_akhaliq The SoTA is essentially whatever Tero Karras is currently working on.
2
2
56
@dmvaldman
Dave
5 months
@deepfates Not sure how you can easily imagine a future where you prompt "cure disease" and the AI is like "okay right on it!" But not also "create disease"
9
3
55
@dmvaldman
Dave
1 year
Feel like LangChain and PromptOps is our software 1.0 brain trying to wrap its head around software 2.0 tech.
2
3
52