JessicaHullman Profile Banner
Jessica Hullman Profile
Jessica Hullman

@JessicaHullman

Followers
8K
Following
8K
Media
145
Statuses
4K

Ginni Rometty Prof @NorthwesternCS | Fellow @IPRatNU | AI beliefs uncertainty decisions metascience | Somewhere betw theory & practice | Blogger @statmodeling

Chicago, IL
Joined June 2013
Don't wanna be here? Send us removal request.
@JessicaHullman
Jessica Hullman
5 months
Explainable AI has long frustrated me by lacking a clear theory of what explanations should do. Improve use of a model for what? How? Given a task what's max effect explanation can have? It's complicated bc most methods are functions of features & prediction but not true state 1/
5
10
69
@zhonghaohe
Zhonghao He โœˆ๏ธ NeurIPS
4 hours
Sycophancy. Groupthink. Test-time Inverse Scaling. What if one unsupervised metric could detect all these reasoning failures? ๐Ÿคฏ ๐ŸฉตPresenting our NeurIPS'25: Martingale Score๐Ÿฉต We propose a statistical test for "Belief Entrenchment" in LLMs - no labels required. Link to paper:
1
3
10
@JessicaHullman
Jessica Hullman
4 days
A 'pragmatic intepretability' turn sounds a lot like our argument/framework for evaluating explanation methods--Time to replace task-agnostic fortunetelling w/concrete decision problem specs + theoretic & empirical evidence of expected performance boost https://t.co/z9XREAdjnt
Tweet card summary image
arxiv.org
Modern methods for explainable machine learning are designed to describe how models map inputs to outputs--without deep consideration of how these explanations will be used in practice. This paper...
@NeelNanda5
Neel Nanda
7 days
The GDM mechanistic interpretability team has pivoted to a new approach: pragmatic interpretability Our post details how we now do research, why now is the time to pivot, why we expect this way to have more impact and why we think other interp researchers should follow suit
0
1
13
@JessicaHullman
Jessica Hullman
7 days
Science without interpretation or ability to reproduce results. Good luck with that!
@tdietterich
Thomas G. Dietterich
7 days
On @arxiv, we are receiving more and more submissions with short sections and many bulleted lists. I'm curious what my fellow researchers think about this style of research paper. Should this become our standard practice, or does this style omit too many details?
2
0
14
@colin_fraser
Colin Fraser
9 days
No. If this came out in the pre-AI era, almost no one would notice it. People largely donโ€™t care about visual art for its own sake. The reason this is getting so much attention has almost nothing to do with the work itself.
@chatgpt21
Chris
11 days
Apparently this video has all of X in a frenzy. If it had come out before the AI era, people would be fawning over it as great art, but now they are so clicker trained that any mention of AI sends them into a verbiage frenzy and they anoint anything AI related as slop.
5
7
287
@JessicaHullman
Jessica Hullman
9 days
Deeply disappointed in my institution
@Phil_Lewis_
philip lewis
9 days
Northwestern University has agreed to pay $75 million as a part of a deal with the Trump administration to restore hundreds of millions in federal funding https://t.co/rZGdjYCVoi
0
0
10
@JessicaHullman
Jessica Hullman
11 days
He has a jokey way of not letting people off the hook for misconduct. Eg, this made me laugh, from a stats conversation earlier this week: โ€œActually, that reminds me of something my advisor Don Rubin used to say, when he wasnโ€™t emailing Jeff Epstein, about how the likelihood โ€ฆโ€
0
0
2
@JessicaHullman
Jessica Hullman
11 days
Thanksgiving psychoanalysis of Epstein: some ppl are really good at creating elite seeming intellectual vibe networks, even if the takes one gets access to are mostly vacuous. (I mean, just look at AI lately) I love Gelmanโ€™s refusal to let these things go. They really get to him
@StatModeling
Andrew Gelman et al.
11 days
Larry Summers, Ken Starr, Jeffrey Epstein, and everyone else https://t.co/ACZpAWjw71
1
2
7
@JessicaHullman
Jessica Hullman
13 days
New Pew report confirms whatโ€™s seemed obvious to me for awhile: X is not a place one goes to interact w/women โ€œNever in the history of modern social media has one gender so decisively abandoned a platformโ€ฆ X male-female imbalance is less extreme only than late-2010s Redditโ€ 1/2
1
7
22
@StatModeling
Andrew Gelman et al.
14 days
Some thoughts on empirical distributions of z-scores https://t.co/TCOLMOs6Wy
0
4
23
@JessicaHullman
Jessica Hullman
13 days
Trying to imagine a world in which I willingly put my name on papers I've barely read. I really can't. Even a few sloppy sentencesn in a paper I'm on sticks with me. When PIs stop believing the devil is in the details, some aspect of research conscience has been lost
@PKUWZP
Zhipeng(Jason Z) Wang ๐Ÿ‡บ๐Ÿ‡ฆ
14 days
@thegautamkamath I think itโ€™s fine as long as you support the team and provide intellectual input/guidance, which is vague and do not have a clear criteria. Many full professor has over 50 papers from their team. Itโ€™s totally reasonable to put their names in the paper.
2
1
42
@evavivalt
Eva Vivalt
14 days
๐Ÿšจ New working paper! How well do people predict the results of studies? @sdellavi and I leverage data from the first 100 studies to have been posted on the SSPP, containing 1,482 key questions, on whichย over 50,000 forecasts were placed. Some surprising results below.... ๐Ÿงต๐Ÿ‘‡
6
65
217
@wes_deng
Wesley Hanwen Deng
18 days
Hello colleagues and friends! โœจPlease help repost and share โœจ: I am officially on the job market for ๐˜๐—ฒ๐—ป๐˜‚๐—ฟ๐—ฒ ๐˜๐—ฟ๐—ฎ๐—ฐ๐—ธ ๐—ฎ๐˜€๐˜€๐—ถ๐˜€๐˜๐—ฎ๐—ป๐˜ ๐—ฝ๐—ฟ๐—ผ๐—ณ๐—ฒ๐˜€๐˜€๐—ผ๐—ฟ and ๐—ถ๐—ป๐—ฑ๐˜‚๐˜€๐˜๐—ฟ๐˜† ๐—ฟ๐—ฒ๐˜€๐—ฒ๐—ฎ๐—ฟ๐—ฐ๐—ต ๐—ฟ๐—ผ๐—น๐—ฒ๐˜€ where I can apply my research to real-world AI safety challenges!!
2
35
100
@ben_golub
Ben Golub
23 days
The grade inflation in economics job market recommendation letters is getting out of hand. Stars and superstars and megastars. I predict that, by 2027, at least one candidate will be described as the black hole at the center of our galaxy.
20
19
471
@michael_nielsen
Michael Nielsen
26 days
This is a very insightful thread
@paulnovosad
Paul Novosad
27 days
Econ seminar culture is built on the assumption that the audience knows something that the speaker doesn't, and that the speaker values that information. A very important thing the audience knows and the speaker doesn't: Is the speaker making any sense at all? If nobody has any
1
3
35
@JessicaHullman
Jessica Hullman
1 month
ICE showed up to kidnap lawn workers and nannies today in my sleepy Chicago North Shore neighborhood. So disgusted
0
0
1
@2plus2make5
Emma Pierson
1 month
We don't fully understand the preferences human feedback encodes, so training on it can be risky. We propose a method to automatically discover these preferences! We identify unsafe, contradictory, and subjective preferences, and improve model safety, eval, and personalization.
@rajivmovva
Raj Movva
1 month
๐Ÿ“ฃNEW PAPER! What's In My Human Feedback? (WIMHF) ๐Ÿ”ฆ Human feedback can induce unexpected/harmful changes to LLMs, like overconfidence or sycophancy. How can we forecast these behaviors ahead of time? Using SAEs, WIMHF automatically extracts these signals from preference data.
2
11
90
@JessicaHullman
Jessica Hullman
1 month
Our lab has produced multiple R1 tenure-track faculty ๐ŸŽ“, won lots of paper awards ๐Ÿ† & developed widely used techniques for uncertainty quantification & communication Apply to Northwestern Phd in CS https://t.co/5l7ATDIt4i or Technology & Social Behavior https://t.co/L8fypqClWl
0
0
1
@JessicaHullman
Jessica Hullman
1 month
๐Ÿง โš™๏ธ Interested in decision theory+cogsci meets AI? Want to create methods to rigorously design/evaluate human-AI workflows? I'm recruiting PhDs to work on ๐ŸŽฏ Stat foundations of multiagent collab ๐ŸŒซ๏ธ Uncertainty & meta-cognition ๐Ÿ”Ž Interpretability ๐Ÿ’ฌ LLMs in behavioral sci
1
3
14
@JessicaHullman
Jessica Hullman
1 month
Coming off a week-long JID-induced energy surge, I wonder how productive I'd be if I just followed the tour
0
0
5