
Manvi Agarwal
@ManviTweeteth
Followers
194
Following
1K
Media
42
Statuses
289
PhD student @telecomparis | Converts chocolate and coffee to code | Calcuttan | (re)tweets ∈ funny stuff ∪ science-y stuff
Paris, France
Joined December 2015
Monsieur le Président, j’ai Choosé France for Science (France m’a Choosé for Science, merci MSCA fellowship) et ça passe bien (there are many amazing laboratoires dans ce pays) but sans mesures concrètes pour améliorer la bureaucratie scientifique, il est inutile d'en parler.
Ici en France, la recherche est une priorité, l’innovation une culture, la science un horizon sans limite. Chercheurs, chercheuses du monde entier, choisissez la France, choisissez l’Europe ! Je vous donne rendez-vous le 5 mai. →
1
1
4
3️⃣ We explain why structural information in PE has proven to be empirically successful. We go back to 1️⃣ and use the content-context connection to show that the higher is the mutual information between data and its positional representation, the better is task performance.
0
0
2
2️⃣ We introduce a new positional encoding method - RoPEPool - that can model causality. How does RoPEPool compare to RoPE and F-StrIPE? Our analysis with a toy example says: RoPEPool isn’t just different, it’s also richer in terms of expressivity.
1
0
1
1️⃣ We show how different families of positional encoding - rotation-based (RoPE) and random fourier features-based (F-StrIPE) - can be compared using kernel methods. It’s not just vibes - we characterize precisely how queries and keys are affected by positional information.
1
0
1
🚨 We just submitted a follow-up to this work, now available as a preprint:
hal.science
While music remains a challenging domain for generative models like Transformers, a two-pronged approach has recently proved successful: inserting musically-relevant structural information into the...
ICASSP 2025 has begun! I will be presenting "F-StrIPE: Fast Structure-Informed Positional Encoding for Symbolic Music Generation" on April 11 in the lecture session "Machine learning for speech, audio and music processing II" at 2 PM. More details below 🧵
1
1
2
With these two interventions, we obtain better performance at lower cost! 🚀 Curious? Check out the companion webpage:
0
0
1
In our paper, we show that stochastic positional encoding is, in fact, a noisy version of a well-known kernel approximation technique: Random Fourier Features. We also show how prior knowledge (e.g. related to musical structure) can be used in such linear-complexity Transformers.
1
0
1
However, there was a piece missing: how do you handle relative positional encoding in these linear-complexity transformers? 🤔 Enter Stochastic Positional Encoding! It brings relative positional information back into the picture without going back to quadratic cost.
1
0
1
Luckily, there's a solution: you can think of attention as a kernel function and use kernel approximation techniques to reduce the cost from quadratic to linear. ⚡ This was the idea used by Performers, for example.
1
0
1
Transformers are powerful, but there's a problem: their cost grows quadratically with sequence length. 📈 This makes it really hard to apply them to lengthy sequences, like music, where long-term connections carry important information.
1
0
1
ICASSP 2025 has begun! I will be presenting "F-StrIPE: Fast Structure-Informed Positional Encoding for Symbolic Music Generation" on April 11 in the lecture session "Machine learning for speech, audio and music processing II" at 2 PM. More details below 🧵
1
1
5
Did Qin Shi Huang know when he united China under a Legalist bureaucratic system 2200 years ago, that he would start a process that would end up in me filling out forms in triplicate from Calcutta to Paris, instead of doing science? FML.
0
1
2
Just explained to general disbelief of my Indian parents how @Sorbonne_Univ_ hasn't yet reimbursed me ~1200€ for over 4 years, & that I will not be paid my December salary in time due to bureacratic incompetence. They thought such incompetence was just an Indian particularity!
1
1
5
I will be in London next week visiting @c4dm ! If you’re around, come say hi 😄 In other news, we’re looking for a Master’s student to carry forward some of this work that I’ll be talking about at next week’s seminar 👇(advert in following tweet)
Next Tuesday, 3/12 at 2pm, we will host a seminar by Manvi Agarwal @ManviTweeteth on 'Fast Structure-informed Positional Encoding for Music Generation'. More info at: https://t.co/30SmSpUzBS
1
1
7
Thrilled to join @DartmouthCS as Assistant Professor in Jan 2025! I’m seeking 1-2 PhD students to join in Fall 2025. Application is by December 15th; please feel free to reach out with any questions. More details here:
10
40
221
👩🎓@SonyCSLMusic has an open 3-year PhD position starting in spring 2025! We are excited to invite applicants who meet most of the following requirements: - Master's degree - Strong background in (generative) deep learning - Experience with deep learning projects - Publications
1
8
38