Mathew Salvaris @MSalvaris X Profile

Mathew Salvaris

@MSalvaris

Followers

436

Following

6K

Media

3

Statuses

475

Machine Learning @Microsoft - ex @iRobot Neuroscience @UCL - PhD Computer Science Machine Learning. Avid snowboarder and climber

Joined November 2011


Pavan Davuluri

@pavandavuluri

9 months

Optimizing LLMs for edge devices can be a challenge — especially when balancing model size and reasoning quality. Our team is pushing the boundaries of what’s possible, introducing new techniques to fit large-scale language models on PCs with limited memory and maintain

arxiv.org

We introduce DeltaLLM, a new post-training compression technique to reduce the memory footprint of LLMs. We propose an alternative way of structuring LLMs with weight sharing between layers in...

0

2

14

Boston Dynamics

@BostonDynamics

1 year

Why build a humanoid robot? Because the world is designed for humans, including all the best Halloween costumes!

248

786

5K

Alexander Mai

@alexandertmai

1 year

Our new paper performs exact volume rendering at 30FPS@720p, giving us the highest detail 3D-consistent NeRF! Paper: https://t.co/CRtvXC69s1 Website: https://t.co/EbVmedJp0U

14

75

493

Rogerio Bonatti

@rogerio_bonatti

1 year

AI assistants have changed the way we use computers to work and search for information. As LLMs become more powerful, what’s next? Agents. Excited to introduce Windows Agent Arena, a benchmark for evaluating AI models that can reason, plan and act to solve tasks on your PC.

7

53

181

Ian Goodfellow

@goodfellow_ian

1 year

It’s always good to temper one’s optimism for empirically validated defenses against adversarial examples, but this is the most promising one I’ve heard of in several years. Definitely worth reading this explainer thread

Stanislav Fort

@stanislavfort

1 year

✨🎨🏰Super excited to share our new paper Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness Inspired by biology we 1) get adversarial robustness + interpretability for free, 2) turn classifiers into generators & 3) design attacks on vLLMs 1/12

4

26

257

hardmaru

@hardmaru

1 year

I thought the interview with @SchmidhuberAI on @MLStreetTalk was really well done. Full of inspiration and creative energy. 🔥 For those who think ChatGPT and LLM variants have solved “AGI”, I recommend watching the full video. Looking forward to Part 2! https://t.co/BC38DcP8QU

9

58

367

Jon Barron

@jon_barron

1 year

The legendary Ross Girshick just posted his CVPR workshop slides about the 1.5 decades he spent ~solving object detection as it relates to the ongoing LLM singularity. Excellent read, highly recommended.

drive.google.com

7

137

740

hardmaru

@hardmaru

1 year

Language is primarily a tool for communication rather than thought https://t.co/8V9zPoMDjK “Language is a defining characteristic of our species, but the function, or functions, that it serves has been debated for centuries. Here we bring recent evidence from neuroscience and

nature.com

Nature - Evidence from neuroscience and related fields suggests that language and thought processes operate in distinct networks in the human brain and that language is optimized for communication...

87

320

1K

CuspAI

@cusp_ai

1 year

🚀We're excited to emerge from stealth and announce our $30M Seed financing round. A huge thank you to our incredible investor group including @HoxtonVentures, @BasisSet, @lightspeedvp, @northzoneVC, @localglobevc, Touring Capital, Giant Ventures, @FjLabs, @ZeroPrimeVC & Tiferes

7

54

310

Machine Learning Street Talk

@MLStreetTalk

1 year

Refreshing take from @cohere co-founder @nickfrosst that we should focus on solving real-world business problems with large language models rather than "AGI". Just dropped on MLST.

15

29

295

Logan Kilpatrick

@OfficialLoganK

1 year

Reminder: no one has cracked AGI yet, not the frontier labs, not Ilya, no one. Everyone is playing the same iterative game, looking N steps ahead, and trying to guesstimate what happens next.

83

58

758

François Chollet

@fchollet

1 year

I'm partnering with @mikeknoop to launch ARC Prize: a $1,000,000 competition to create an AI that can adapt to novelty and solve simple reasoning problems. Let's get back on track towards AGI. Website: https://t.co/wNsM3IQgEI ARC Prize on @kaggle: https://t.co/Lhsh1RiWKq

80

532

3K

Nicolas Mejia Petit

@mejia_petit

2 years

Why isn’t everyone talking about this??? DeepSpeed devs literally just created a datatype FP6 with full tensor core support on the A100s. (Since NVIDIA left us stranded with int4/8) It is SO smart just reading through the kernel, my god.

Rohan Paul

@rohanpaul_ai

2 years

LLaMA-70b inference using only a single GPU, achieving 1.69x-2.65x higher normalized inference throughput than the FP16 baseline, with six-bit quantization (FP6) 🔥 DeepSpeed has just released this paper and also integrated the FP6 quantization - "FP6-LLM:

9

102

627

Yann LeCun

@ylecun

2 years

My tirade against AI doomers in Davos. https://t.co/SURHmhWywq

34

90

553

Jim Fan

@DrJimFan

2 years

Today may be the ImageNet moment for robotics. RT-X: the largest open-source robot dataset ever compiled, across 33 institutes, 22 robot hardware, 527 skills, and 1M episodes. Why is robotics lagging so far behind NLP, vision, and other AI domains? Data scarcity is the main

28

333

1K

Chris Albon

@chrisalbon

2 years

It feels so weird to me that people talk about open source as reckless and a risk when it comes to AI models. Open source has been responsible for the last 20 years of technological revolution. Builders sharing their free knowledge to the world to benefit us all. But suddenly

83

269

2K

Melanie Mitchell

@MelMitchell1

2 years

Really interesting talk by VC legend Bill Gurley: https://t.co/2FntcDjrLF V. cogent (& 🔥) abt regulatory capture & the industry / govt revolving door. I'm in favor of some kind of gov regulation of AI, but it has to avoid those traps. I'm also big fan of open source.

7

15

79

Christian Reiser

@ChrisJReiser

2 years

In less than an hour I am going to present our paper MERF at #SIGGRAPH2023 in Petree Hall D. MERF allows you to interactively explore large scenes on a laptop in the browser. Check out our web demo: https://t.co/a4uOxriY1D By the way we have also released the entire code now!

creiser.github.io

Project page for MERF: Memory-Efficient Radiance Fields for Real-time View Synthesis in Unbounded Scenes.

3

33

118

Melanie Mitchell

@MelMitchell1

2 years

How do we know how smart AI systems are? | Science

science.org

In 1967, Marvin Minsky, a founder of the field of artificial intelligence (AI), made a bold prediction: “Within a generation…the problem of creating ‘artificial intelligence’ will be substantially...

33

196

623

Aris Konstantinidis

@ariskonstant

2 years

At Cohere we have started a project to train multilingual models. For Greek we have not yet found ambassadors. If you want to help ensure that the next LLMs speak fluent Greek, become an ambassador or retweet/share!

Cohere Labs

@Cohere_Labs

2 years

Our European Sprint for the Aya project is this weekend, June 17-18.🌍Join us to help ensure every European language is included in the development of language AI! More info here: https://t.co/x3g0FyGAjA Register to participate here:

5

44

61