MSalvaris Profile Banner
Mathew Salvaris Profile
Mathew Salvaris

@MSalvaris

Followers
436
Following
6K
Media
3
Statuses
475

Machine Learning @Microsoft - - ex @iRobot Neuroscience @UCL - - PhD Computer Science Machine Learning. Avid snowboarder and climber

Joined November 2011
Don't wanna be here? Send us removal request.
@pavandavuluri
Pavan Davuluri
9 months
Optimizing LLMs for edge devices can be a challenge — especially when balancing model size and reasoning quality. Our team is pushing the boundaries of what’s possible, introducing new techniques to fit large-scale language models on PCs with limited memory and maintain
Tweet card summary image
arxiv.org
We introduce DeltaLLM, a new post-training compression technique to reduce the memory footprint of LLMs. We propose an alternative way of structuring LLMs with weight sharing between layers in...
0
2
14
@BostonDynamics
Boston Dynamics
1 year
Why build a humanoid robot? Because the world is designed for humans, including all the best Halloween costumes!
248
786
5K
@alexandertmai
Alexander Mai
1 year
Our new paper performs exact volume rendering at 30FPS@720p, giving us the highest detail 3D-consistent NeRF! Paper: https://t.co/CRtvXC69s1 Website: https://t.co/EbVmedJp0U
14
75
493
@rogerio_bonatti
Rogerio Bonatti
1 year
AI assistants have changed the way we use computers to work and search for information. As LLMs become more powerful, what’s next? Agents. Excited to introduce Windows Agent Arena, a benchmark for evaluating AI models that can reason, plan and act to solve tasks on your PC.
7
53
181
@goodfellow_ian
Ian Goodfellow
1 year
It’s always good to temper one’s optimism for empirically validated defenses against adversarial examples, but this is the most promising one I’ve heard of in several years. Definitely worth reading this explainer thread
@stanislavfort
Stanislav Fort
1 year
✨🎨🏰Super excited to share our new paper Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness Inspired by biology we 1) get adversarial robustness + interpretability for free, 2) turn classifiers into generators & 3) design attacks on vLLMs 1/12
4
26
257
@hardmaru
hardmaru
1 year
I thought the interview with @SchmidhuberAI on @MLStreetTalk was really well done. Full of inspiration and creative energy. 🔥 For those who think ChatGPT and LLM variants have solved “AGI”, I recommend watching the full video. Looking forward to Part 2! https://t.co/BC38DcP8QU
9
58
367
@jon_barron
Jon Barron
1 year
The legendary Ross Girshick just posted his CVPR workshop slides about the 1.5 decades he spent ~solving object detection as it relates to the ongoing LLM singularity. Excellent read, highly recommended.
drive.google.com
7
137
740
@hardmaru
hardmaru
1 year
Language is primarily a tool for communication rather than thought https://t.co/8V9zPoMDjK “Language is a defining characteristic of our species, but the function, or functions, that it serves has been debated for centuries. Here we bring recent evidence from neuroscience and
Tweet card summary image
nature.com
Nature - Evidence from neuroscience and related fields suggests that language and thought processes operate in distinct networks in the human brain and that language is optimized for communication...
87
320
1K
@cusp_ai
CuspAI
1 year
🚀We're excited to emerge from stealth and announce our $30M Seed financing round. A huge thank you to our incredible investor group including @HoxtonVentures, @BasisSet, @lightspeedvp, @northzoneVC, @localglobevc, Touring Capital, Giant Ventures, @FjLabs, @ZeroPrimeVC & Tiferes
7
54
310
@MLStreetTalk
Machine Learning Street Talk
1 year
Refreshing take from @cohere co-founder @nickfrosst that we should focus on solving real-world business problems with large language models rather than "AGI". Just dropped on MLST.
15
29
295
@OfficialLoganK
Logan Kilpatrick
1 year
Reminder: no one has cracked AGI yet, not the frontier labs, not Ilya, no one. Everyone is playing the same iterative game, looking N steps ahead, and trying to guesstimate what happens next.
83
58
758
@fchollet
François Chollet
1 year
I'm partnering with @mikeknoop to launch ARC Prize: a $1,000,000 competition to create an AI that can adapt to novelty and solve simple reasoning problems. Let's get back on track towards AGI. Website: https://t.co/wNsM3IQgEI ARC Prize on @kaggle: https://t.co/Lhsh1RiWKq
80
532
3K
@mejia_petit
Nicolas Mejia Petit
2 years
Why isn’t everyone talking about this??? Deepspeed devs literally just created a datatype FP6 with full tensor core support on the a100’s. (Since nvidia left us stranded with int4/8) It is SO smart just reading through the kernel, my god.
@rohanpaul_ai
Rohan Paul
2 years
LLaMA-70b inferencing using only a single GPU and achieving 1.69x-2.65x higher normalized inference throughput than the FP16 baseline. with Six-bit quantization (FP6) 🔥 Deepspeed has just recently released this Paper and also integrated the FP6 quantization - "FP6-LLM:
9
102
627
@ylecun
Yann LeCun
2 years
My tirade against AI doomers in Davos. https://t.co/SURHmhWywq
34
90
553
@DrJimFan
Jim Fan
2 years
Today may be the ImageNet moment for robotics. RT-X: the largest open-source robot dataset ever compiled, across 33 institutes, 22 robot hardware, 527 skills, and 1M episodes. Why is robotics lagging so far behind NLP, vision, and other AI domains? Data scarcity is the main
28
333
1K
@chrisalbon
Chris Albon
2 years
It feels so weird to me that people talk about open source as reckless and a risk when it comes to AI model. Open source has been responsible for the last 20 years of technological revolution. Builders sharing their free knowledge to the world to benefit us all. But suddenly
83
269
2K
@MelMitchell1
Melanie Mitchell
2 years
Really interesting talk by VC legend Bill Gurley: https://t.co/2FntcDjrLF V. cogent (& 🔥) abt regulatory capture & the industry / govt revolving door. I'm in favor of some kind of gov regulation of AI, but it has to avoid those traps. I'm also big fan of open source.
7
15
79
@ChrisJReiser
Christian Reiser
2 years
In less than an hour I am going to present our paper MERF at #SIGGRAPH2023 in Petree Hall D. MERF allows you to interactively explore large scenes on a laptop in the browser. Check out our web demo: https://t.co/a4uOxriY1D By the way we have also released the entire code now!
creiser.github.io
Project page for MERF: Memory-Efficient Radiance Fields for Real-time View Synthesis in Unbounded Scenes.
3
33
118
@ariskonstant
Aris Konstantinidis
2 years
Στην Cohere έχουμε ξεκινήσει ένα πρότζεκτ για να εκπαιδεύσουμε πολύγλωσσα μοντέλα. Για τα ελληνικά δεν έχουμε βρει ακόμα πρεσβευτές (ambassadors). Αν θέλεις να βοηθήσεις να διασφαλίσουμε ότι τα επόμενα LLMs θα μιλούν άπταιστα ελληνικά, γίνε ambassador ή κάνε retweet/share!
@Cohere_Labs
Cohere Labs
2 years
Our European Sprint for the Aya project is this weekend, June 17-18.🌍Join us to help ensure every European language is included in the development of language AI! More info here: https://t.co/x3g0FyGAjA Register to participate here:
5
44
61