Chris Donahue Profile
Chris Donahue

@chrisdonahuey

Followers
5K
Following
6K
Media
100
Statuses
1K

GenAI for *human* creativity in music + more. Assistant prof at CMU CSD, ๐ŸŽผ G-CLef lab. Part time Google DeepMind, Magenta (views my own)

Pittsburgh, PA
Joined January 2012
Don't wanna be here? Send us removal request.
@chrisdonahuey
Chris Donahue
3 years
Excited to share SingSong, a system which can generate instrumental accompaniments to pair with input vocals! ๐Ÿ“„ https://t.co/1mRUaXvqVy ๐Ÿ”Š https://t.co/8RGezPu5YQ Work co-led by myself, @antoine_caillon, and @ada_rob as part of @GoogleMagenta and the broader MusicLM project ๐Ÿงต
138
722
3K
@chrisdonahuey
Chris Donahue
7 days
See https://t.co/Xj92707LZU to ๐Ÿ—ณ๏ธ vote on your favorite, or to download a comprehensive data release for the preferences we've collected so far Thanks @yonghyunk1m @SonyAI_global @arena !
Tweet card summary image
huggingface.co
0
1
3
@chrisdonahuey
Chris Donahue
7 days
๐ŸŽตMusic Arena โš”๏ธ was accepted to the NeurIPS 2025 Creativity Track, and we've released a big update to celebrate! Includes new models from @SonautoAI and @elevenlabsio. Also, Music Arena is now available as a ๐Ÿค— @huggingface space and dataset!
1
8
40
@chrisdonahuey
Chris Donahue
1 month
Thanks to blog post co-authors (@yonghyunk1m, Nathan Pruyne), all our collaborators (@iamwaynechi @ml_angelopoulos @infwinston @koichi__saito @shinjiw_at_cmu @mittu1204), and others (@sonyai_global, @lmarena_ai, @riffusionai_ )!
1
0
4
@OpenledgerHQ
OpenLedger
1 day
Mainnet countdown is here ๐Ÿ™
57
38
182
@chrisdonahuey
Chris Donahue
1 month
We will be continuously updating Music Arena with new systems! Please reach out if you are interested in evaluating your music generation model on our platform
1
0
3
@chrisdonahuey
Chris Donahue
1 month
We collect natural language feedback in addition to binary prefs. Users tend to comment on both generation quality and prompt adherence. Sentiment analysis on this feedback is correlated with win rates, though also reveals new system-specific strengths and weaknesses!
1
0
3
@chrisdonahuey
Chris Donahue
1 month
Most of our users write their own prompts, as opposed to using one of our built in suggestions. User prompts emphasize genres, instruments, and modes, and most are very short (median length 7).
1
0
3
@ThatKyleFisher
KingofKyle
17 hours
Ending EBT isn't about the money
11
0
5
@chrisdonahuey
Chris Donahue
1 month
We also observe a weak (ฯ=0.082) but significant (p=0.012) correlation between the amount of time a user spends listening to a pair of outputs, and the overall "difficulty" of the comparison (codified by negative absolute difference in Arena score)
1
0
3
@chrisdonahuey
Chris Donahue
1 month
We collect nuanced listening behavior on Music Arena, revealing new insights E.g.: listening behavior differs dramatically between the 1st and 2nd tracks a user observes. Users listen to the 1st at length, then decide their preference after only the first few seconds of the 2nd
1
2
8
@chrisdonahuey
Chris Donahue
1 month
We aim for *comprehensive* transparency of the Music Arena platform. To this end, this first update comes paired with a comprehensive data release, to be updated on a rolling basis. โš”๏ธ https://t.co/00HvKj3zPx โญ๏ธ https://t.co/9f3mvsfco2
Tweet card summary image
github.com
Contribute to gclef-cmu/music-arena development by creating an account on GitHub.
1
0
6
@chrisdonahuey
Chris Donahue
1 month
Music Arena went into public beta on July 28. In the first ~month of use, we collected 1051 votes on 1420 battles. We compile two leaderboards for instrumental-only (2/3 of votes, first tweet) and w/ vocals (1/3 of votes, below)
1
0
2
@WaterMinder_app
WaterMinder App
16 days
Imagine AI that cares about your health. WaterMinderโ€™s AI Gulp Detection is like having a hydration coach in your pocket. Sip water โ†’ App detects it โ†’ Earn Rewards. Download now and see it work in real time.
4
6
31
@chrisdonahuey
Chris Donahue
1 month
Sharing our initial leaderboard and open data release for ๐ŸŽถMusic Arenaโš”๏ธ! Music is subjective and multi-dimensional. A key goal of Music Arena is to provide insights beyond binary preferences! ๐Ÿงต
2
21
87
@chrisdonahuey
Chris Donahue
1 month
Thanks to our team Yichen (Will) Huang, @zacknovack @Koichi__Saito @jiatongshi @_shinjiwatanabe @mittu1204 @jwthickstun and @SonyAI_global for the support! So don't be sad about your music gen evals, get MAD! ๐ŸŽบ๐Ÿ˜ก๐ŸŽบ https://t.co/2U2Oz4sHpS
Tweet card summary image
github.com
Contribute to i-need-sleep/mad development by creating an account on GitHub.
0
0
4
@chrisdonahuey
Chris Donahue
1 month
To facilitate robust + reliable music gen eval, we release MAD as a drop-in replacement (w/MIT license!) for metrics like FAD/MMD, and MusicPrefs on HF for further eval research and preference modeling! https://t.co/2U2Oz4sHpS https://t.co/p9GJpLZbOl
Tweet card summary image
huggingface.co
1
0
1
@chrisdonahuey
Chris Donahue
1 month
We find that MAD has much stronger rank correlation (๐œ=0.62) with human preferences than FAD (๐œ=0.14)!
1
0
0
@QuasarMarkets
Quasar Markets
7 hours
QM LIVE: @qmbigbeat and @cryptohondo with YOUR MORNING LOOK AT THE MARKETS
1
0
12
@chrisdonahuey
Chris Donahue
1 month
Second, we measure correlation between automatic metrics and MusicPrefs: a dataset of pairwise human prefs for text-to-music that we collect via MTurk. https://t.co/p9GJpLZJDT
1
0
0
@chrisdonahuey
Chris Donahue
1 month
We find that FAD lacks sensitivity to important desiderata like musicality. We propose an alternative based on our meta evaluation findings: FAD: *VGGish* embeddings + *Frechet Distance* divergence โฌ‡๏ธ MAD: *MERT* embeddings + *MAUVE* divergence
1
0
0
@chrisdonahuey
Chris Donahue
1 month
We meta evaluate reference-based music evaluation metrics in two stages. First, we systematically search over a design space of embeddings and divergences (inclusive of FAD), using synthetic datasets capturing sensitivity to specific desiderata (e.g., musicality, diversity).
1
0
0
@chrisdonahuey
Chris Donahue
1 month
The capabilities of music generation models continue to improve, but progress on evaluation lags behind. Popular automatic metrics such as FAD were developed for different tasks (music enhancement) and their relevance to music gen is unclear. So how can we do better?
1
0
0
@THINCfdn
THINC Foundation
6 days
Concerned about biases and politics in your children's education? Be an advocate with these 6 simple steps. Follow THINC to learn more about what you can do as a parent.
1
2
6
@chrisdonahuey
Chris Donahue
1 month
Eval for music generation is notoriously ill-defined, but no fear! Presenting MAD, a new metric for music quality with stronger alignment to human preferences. Appearing at ISMIR this week! โญ: https://t.co/2U2Oz4sHpS ๐Ÿ“–: https://t.co/GxMDGmbSt1 ๐Ÿ”Š: https://t.co/gpw7OrEfz0 ๐Ÿงต
1
6
43