MMLab@NTU
@MMLabNTU
2K Followers · 213 Following · 12 Media · 79 Statuses
Multimedia Laboratory @NTUsg, affiliated with S-Lab. Large Multimodal Models, Computer Vision, Image Processing, Computer Graphics, Deep Learning
Singapore
Joined May 2021
Congratulations to Ziqi and Ziwei! Grateful for the opportunity to work with so many gifted students at @MMLabNTU. Their passion and creativity continue to inspire us! Their achievements are listed here: https://t.co/GMvhTMUl09
Freshly picked: #NTUsg PhD student Huang Ziqi has been selected as one of 21 global recipients of the 2025 Apple Scholars in AIML PhD Fellowship, a prestigious programme that supports emerging leaders in AI and machine learning through funding, mentorship, and…
RT @ccloy: Congrats to Yuekun @YuekunDai and @ziangcao_, both from @MMLabNTU, for winning the prestigious Google PhD Fellowship! Yuekun…
Introducing Thinking with Camera, a unified multimodal model that integrates camera-centric spatial intelligence to interpret and create scenes from arbitrary viewpoints. Project Page: https://t.co/KxwIpuDUBg Code: https://t.co/DO52LFyL9m
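As a rough illustration of what camera-centric conditioning can involve (a sketch under assumed OpenCV conventions, not the paper's actual code), the snippet below converts camera intrinsics and extrinsics into per-pixel ray origins and directions, one common way to give a model an explicit notion of viewpoint:

```python
import torch

def camera_rays(K, R, t, H, W):
    """Per-pixel ray origins and directions in world coordinates.

    Assumes the OpenCV convention x_cam = R @ x_world + t with 3x3
    intrinsics K. A generic illustration of camera conditioning,
    not Thinking-with-Camera's actual implementation.
    """
    # Pixel grid sampled at pixel centers.
    v, u = torch.meshgrid(
        torch.arange(H, dtype=torch.float32) + 0.5,
        torch.arange(W, dtype=torch.float32) + 0.5,
        indexing="ij",
    )
    pix = torch.stack([u, v, torch.ones_like(u)], dim=-1)  # (H, W, 3)

    # Back-project pixels to camera-frame directions, rotate to world.
    dirs = pix @ torch.linalg.inv(K).T                     # (H, W, 3)
    dirs = dirs @ R                                        # rowwise R^T @ d
    dirs = dirs / dirs.norm(dim=-1, keepdim=True)

    # Camera center in world coordinates: C = -R^T @ t.
    origin = (-R.T @ t).expand(H, W, 3)
    return origin, dirs
```

Such ray maps (or their Plücker embeddings) can be fed to a backbone alongside image tokens so the model knows where each pixel's viewing ray sits in the scene.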
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
Join us at #ICCV2025 for the Mobile Intelligent Photography & Imaging (MIPI) Workshop! Leading keynotes: Profs. @songhan_mit, Michal Irani, Boxin Shi, and @MingHsuanYang, on intelligent photography and efficient GenAI. Oct 20, 8:50am–12:30pm HST. https://t.co/CqdCqzdsY1
Congratulations to @liuziwei7 of @MMLabNTU, recipient of the Young Scientist Award, recognised for his impactful contributions to computer vision and generative AI.
Congrats to #NTUsg Prof Ng Geok Ing on the Singapore President's Technology Award 2025. A pioneer in Gallium Nitride (#GaN), found in fast chargers, EVs, satellites & defence, he built Singapore's global standing in this field and led the creation of the national GaN centre. We also…
Congrats to Weichen (https://t.co/f6HHqV96CK) and Mutian (https://t.co/APOYvyVfQg)!
#ICCV2025 Congrats to Weichen (https://t.co/3EHLciKwgP) and Mutian (https://t.co/eL1sdcXIuo) on being selected as outstanding reviewers @ICCVConference
Video is already a tough modality for reasoning. Egocentric video? Even tougher: it is longer, messier, and harder. How do we tackle these extremely long, information-dense sequences without exhausting GPU memory or hitting API limits? We introduce Ego-R1: a framework…
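The tweet is truncated, and Ego-R1's actual framework is what the paper describes; purely to make the stated memory problem concrete, here is a minimal hierarchical pattern that bounds peak memory: process the video in fixed-size windows, keep only compact text summaries, and reason over those. The summarize/answer callables are hypothetical placeholders:

```python
from typing import Callable, Iterable, List

def reason_over_long_video(
    frames: Iterable,                          # lazily decoded frame stream
    summarize: Callable[[list], str],          # hypothetical per-window captioner
    answer: Callable[[List[str], str], str],   # hypothetical text-only reasoner
    question: str,
    window: int = 64,
) -> str:
    """Never hold more than `window` frames in memory at once.

    A generic two-stage sketch, not Ego-R1's actual method.
    """
    summaries: List[str] = []
    buf: list = []
    for frame in frames:
        buf.append(frame)
        if len(buf) == window:
            summaries.append(summarize(buf))   # compress window to text
            buf.clear()                        # raw frames can be freed
    if buf:
        summaries.append(summarize(buf))
    # Second stage: reason over compact summaries, not raw pixels.
    return answer(summaries, question)
```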
CVPR 2025 Tutorial: From Video Generation to World Model. Hosted by MMLab@NTU × Kuaishou, etc.
June 11 | Nashville. https://t.co/YcQ6pb30R0 Video is just the start. World modeling is the goal. #CVPR2025 #WorldModel
Aero-1-Audio is out on Hugging Face. Trained in <24h on just 16×H100 GPUs. Handles 15+ min audio seamlessly. Outperforms bigger models like Whisper, Qwen-2-Audio & commercial services from ElevenLabs/Scribe.
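The tweet does not show usage. As a hedged sketch of how long audio is commonly handled with Hugging Face's ASR pipeline (the Whisper checkpoint below is only a stand-in so the snippet runs; Aero-1-Audio's actual repo ID and loading code may differ):

```python
from transformers import pipeline

# Stand-in checkpoint so this runs as-is; swap in the Aero-1-Audio
# repo ID once you have it. chunk_length_s / stride_length_s are the
# pipeline's standard knobs for long inputs.
asr = pipeline(
    "automatic-speech-recognition",
    model="openai/whisper-small",
    chunk_length_s=30,   # split long audio into 30 s windows
    stride_length_s=5,   # overlap windows to avoid cutting words
)

result = asr("meeting_recording.wav")  # works for 15+ minute files
print(result["text"])
```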
We release Harmon: a unified framework for multimodal understanding & generation with a shared visual encoder (vs. the decoupled Janus/-Pro). SOTA on GenEval, MJHQ, WISE. Strong understanding performance. Paper: https://t.co/RFhEl9NEN7 Code:
github.com
[ICCV 2025] Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation - wusize/Harmon
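The design point worth noting is the shared visual encoder feeding both tasks, versus Janus-style decoupled encoders. A toy sketch of that wiring (our simplification with placeholder modules and dimensions, not the released architecture):

```python
import torch
import torch.nn as nn

class SharedEncoderUnifiedModel(nn.Module):
    """One visual encoder feeds both the understanding head and the
    generation head, the wiring Harmon's tweet describes. Modules and
    sizes here are placeholders, not the released code."""

    def __init__(self, dim: int = 768, vocab: int = 32000):
        super().__init__()
        self.encoder = nn.TransformerEncoder(            # shared encoder
            nn.TransformerEncoderLayer(dim, nhead=8, batch_first=True),
            num_layers=4,
        )
        self.understand_head = nn.Linear(dim, vocab)     # text-side logits
        self.generate_head = nn.Linear(dim, dim)         # image-side features

    def forward(self, patch_embeds: torch.Tensor, mode: str):
        h = self.encoder(patch_embeds)   # same representation for both tasks
        if mode == "understand":
            return self.understand_head(h)
        return self.generate_head(h)

model = SharedEncoderUnifiedModel()
x = torch.randn(2, 256, 768)             # (batch, patches, dim) embeddings
logits = model(x, mode="understand")
feats = model(x, mode="generate")
```

The pitch of sharing is that understanding and generation shape a single visual representation rather than two separate ones.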
Meet Harmon, a unified model for both image generation and understanding! Trained with a shared masked autoregressive encoder, it sets new benchmarks on GenEval & MJHQ30K. Try the live demo now on Hugging Face: https://t.co/PX7OkVaZbx Paper:
We turned our method, rejected by CVPR and ECCV, into the iOS app "Cutcha". EdgeSAM, our fast Segment Anything Model, runs at over 30 FPS on an iPhone 14. Enjoy intuitive one-touch object selection and precise editing, all processed locally on your device. No cloud needed!
Attention, all photography and imaging enthusiasts! Join us at the Third MIPI Workshop at #CVPR2024! Location: Arch 213. Time: 08:30 AM - 12:10 PM. Website: https://t.co/3x06T1AvaF Don't miss out on an exciting lineup of speakers: Lei Zhang: How Far Are We From…
Upcoming AI talk: LLaVA, a Vision-and-Language Approach to Computer Vision in the Wild, by Chunyuan Li @ChunyuanLi. More info: https://t.co/ap7S1osxAm Subscribe: https://t.co/m7NoJNciLe
(1/2) We are actively seeking PhD candidates from various countries to foster diversity in our research group at Nanyang Technological University. Know someone interested in a PhD with us? Please refer them to our team. Thanks for supporting diversity in academia!
Our study introduces "Upscale-A-Video", a text-guided latent diffusion framework for video upscaling. It ensures temporal coherence locally & globally, balancing fidelity and quality. Project page: https://t.co/3UkaXXyMCC GitHub: https://t.co/irQRuPHxED Video:
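The paper's propagation modules are the real mechanism; as a generic baseline for the temporal-coherence idea only (our assumption, not Upscale-A-Video's method), one can denoise overlapping temporal windows of latents and average the overlaps so adjacent windows agree:

```python
import torch

def denoise_with_overlap(latents, denoise, window=8, overlap=2):
    """Run a denoiser over overlapping temporal windows of a
    (T, C, H, W) latent video and average overlapping frames.
    A generic consistency baseline; `denoise` is a hypothetical
    stand-in for a latent-diffusion denoising call.
    """
    T = latents.shape[0]
    out = torch.zeros_like(latents)
    counts = torch.zeros(T, 1, 1, 1)
    start, step = 0, window - overlap
    while start < T:
        end = min(start + window, T)
        out[start:end] += denoise(latents[start:end])
        counts[start:end] += 1
        if end == T:
            break
        start += step
    return out / counts                # averaged where windows overlap
```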
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM. Project page: https://t.co/ydz7rZ78sS GitHub: https://t.co/VJS0YQHJI0 Hugging Face:
huggingface.co
Excited to share our latest work: "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM". Supercharged SAM for edge devices! #EdgeSAM is a faster, optimized version of SAM, now tailored for edge devices. We've reimagined SAM's ViT-based image encoder…
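From the tweet we only know the recipe's shape: a light encoder distilled from SAM's ViT, with prompts kept in the loop. A hedged sketch of what one such distillation step could look like (interfaces are placeholders, not EdgeSAM's real API):

```python
import torch
import torch.nn.functional as F

def distill_step(teacher, student, decoder, image, optimizer, n_points=4):
    """One prompt-in-the-loop distillation step, sketched: rather than
    matching encoder features alone, sample point prompts, decode masks
    from teacher and student embeddings with a frozen decoder, and
    supervise the student on the teacher's masks. All module interfaces
    here are hypothetical placeholders.
    """
    with torch.no_grad():
        t_embed = teacher(image)              # heavy ViT image encoder
    s_embed = student(image)                  # light on-device encoder

    # Random point prompts in normalized image coordinates.
    points = torch.rand(n_points, 2, device=image.device)

    with torch.no_grad():
        t_masks = decoder(t_embed, points)    # teacher masks as soft targets
    s_masks = decoder(s_embed, points)

    loss = F.binary_cross_entropy_with_logits(s_masks, t_masks.sigmoid())
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```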