
Daniel Fein
@DanielFein7
Followers
321
Following
585
Media
16
Statuses
441
There is a group of people quietly sabotaging all frontier language models…. People with bad taste. We show that we can get human-preference reward models to do better by finding and removing bad examples
1
1
5
Data is the most important determinant of model behavior. Just like we are conscious of what we show our kids, we should be conscious of what we show our models. Shoutout Gabriela Aránguiz-Dias and CS361 by @aiprof_mykel. Read the paper!!.
arxiv.org
Language models are commonly fine-tuned via reinforcement learning to alter their behavior or elicit new capabilities. Datasets used for these purposes, and particularly human preference datasets,...
0
0
2
Most hard problems do not have verifiable answers. Happy to share this work on AI for creative writing to push forward how we think about rewarding models to perform well in divergent domains.
Introducing LitBench, the first standardized benchmark for creative writing verifiers! We use Reddit’s r/WritingPrompts to label human preferences across 50k story-pairs, and see how LLM-as-a-judge, Generative RMs, and Bradley-Terry RMs stack up.
0
1
5
RT @sebbrusso: seeing as im graduating in 2 weeks, I wanted to share a piece i wrote about leaving stanford
0
1
0
RT @tszzl: a common cope among the classes blessed to work on or with ai, but we are not blessed for long. there is no conceptual divide be….
0
69
0
I made an app!. It’s called Receipts and it gives you all the data you could ever want about your text messages. It also has some really cool AI features like conversation simulation and text recommendations. Shoutout @kabirjolly_ for working through tough battles to make this
3
0
7
RT @deliprao: If this were a science paper, you would expect a country that picks its science workforce at random as a “weak baseline” and….
0
111
0
RT @therecount: Fox News’ Peter Doocy uses all his time at the White House press briefing to ask about an assessment that “literally everyo….
0
238
0
@bing and @satyanadella have google in checkmate. With 3% of search marketshare it’s easy to put out a 150B parameter language model, whatever it costs - Bing isn’t a major profit center anyway.
1
0
0
One of the underrated design innovations of chatGPT is the removal of the traditional left/right chat interface that unnecessarily wastes a bunch of space for the response.
6/.You will be able to interact with Bing the same way you interact with ChatGPT today - as a chatbot. Imagine search and chat and conversation converged into one…. It is hard to fathom where this can go!
0
0
0
How is this real. “Altman and OpenAI's chief scientist, Ilya Sutskever, said the move to focus on large language models is the best way for the company to reach AGI, or adjusted gross income.”.
businessinsider.com
OpenAI CEO Sam Altman became a household name after the release of OpenAI's groundbreaking AI model, ChatGPT.
1
1
3