
Marco Ciccone @ ICML π¨π¦
@mciccone_AI
Followers
881
Following
19K
Media
53
Statuses
1K
Postdoctoral Fellow @VectorInst - Collaborative, Decentralized, Modular ML - Competition chair @NeurIPSConf 2021, 2022, 2023 - PhD @polimi ex @NVIDIA @NNAISENSE
Toronto, Canada
Joined April 2015
π¨ Life update π¨ I moved to Toronto π¨π¦and joined @VectorInst as a Postdoctoral Fellow to work with @colinraffel and his lab on collaborative, decentralized, and modular machine learning to democratize ML model development. Exciting times ahead! πͺΏ
12
4
104
RT @ffabffrasca: 6/. @JoshSouthern13 and I be at #ICML2025, poster session Tuesday β stop by and chat if you're around!. I would also bβ¦.
0
1
0
RT @Ar_Douillard: I'll discuss distributed learning on Saturday, July 12. First, I'll cover current methods needing high bandwidth, then nβ¦.
0
12
0
Even crazier is that the rmsprop algorithm was introduced by @geoffreyhinton in his neural networks course, and we cite his slides - so it's fine!
@samsja19 @kellerjordan0 even crazy part is there's no muon paper.
1
0
2
Fantastic work from @allen_ai - asynchronous training of MoEs on private datasets and a domain-aware router. Akin to cross-silo FL, but no synchronization and no fear of heterogeneity anymore on LLMs. Nice, clean, and of course modular!.
The bottleneck in AI isn't just compute - it's access to diverse, high-quality data, much of which is locked away due to privacy, legal, or competitive concerns. What if there was a way to train better models collaboratively, without actually sharing your data? . Introducing
0
2
18
RT @roydanroy: The Department of Statistical Sciences in the Faculty of Arts and Science at the University of Toronto invites applicationsβ¦.
0
11
0
π¨ Missed our #ICLR2025 workshop on modularity & collaborative learning? If you are interested in these topics, get in touch! .
1
1
2
Heading to #ICML2025 in Vancouver π¨π¦ July 13β19!. πPing me if you want to discuss research and collabs on how to build better LLMs via:. β’ Model merging & routing.β’ Continual & modular learning.β’ Distributed and Collaborative Learning
1
5
14
Check the full thread π§΅here
Do you feel FL research is stuck with methods that do not work well in realistic scenarios? π€. π«΅We got you!.Introducing πGeneralized Heavy-Ball Momentum (GHBM)π, accepted at TMLR:.the FL algorithm with both SOTA theoretical guarantees and much better empirical results. π§΅1/9
0
1
2
Early inΒ his Phd, @RickZack96 showed me some experiments of a preliminary version of GHBM applied to a different problem, and I remember pushing him to be ambitious and turn it into an actual FL algorithm.
What began as a hunch became a path Iβm proud ofβthis journey through FL & optimization has been rich with growth and purpose. Deep thanks to my PhD advisor @masone_carlo, and to Sai Praneeth Karimireddy & @mciccone_AIβyour guidance lit the way. The best is yet to come.
1
0
0
Faster local optimization by recycling past outer gradients! . Incredibly proud of this work and collaboration. We have a bunch of ideas in the pipeline, looking forward to the next one with @RickZack96!.
Do you feel FL research is stuck with methods that do not work well in realistic scenarios? π€. π«΅We got you!.Introducing πGeneralized Heavy-Ball Momentum (GHBM)π, accepted at TMLR:.the FL algorithm with both SOTA theoretical guarantees and much better empirical results. π§΅1/9
0
2
3
Despite the impressive quality of video generation, what I don't understand about #Veo3 is how it is possible to create such models without infringing on copyright. Did they make a deal with the movie industry?.
0
0
2
Sadly, I won't be able to travel and attend our Modularity workshop :( I'm sure I'll miss many amazing discussions and talks. Luckily @Ar_Douillard and @derylucio will be there to hold down the fort! Proud to be part of the organizing team - can't wait to meet you all next time!.
0
0
6