Chen Change Loy
@ccloy
Followers
3K
Following
2K
Media
128
Statuses
955
President's Chair Professor @NTUsg Director of @MMLabNTU Computer vision and deep learning
Singapore
Joined November 2010
🚀Thrilled to announce that Synvo AI has joined the NVIDIA Inception program, a significant step forward in accelerating our mission to build context-aware AI solutions! #NVIDIAInception #DeepTech #Entrepreneurship
0
5
8
EdgeTAM now supported by @thedroneforge what should we do with this model and an autonomous drone?
EdgeTAM, real-time segment tracker by Meta is now in @huggingface transformers with Apache-2.0 license 🔥 > 22x faster than SAM2, processes 16 FPS on iPhone 15 Pro Max with no quantization > supports single/multiple/refined point prompting, bounding box prompts
31
84
903
EdgeTAM, real-time segment tracker by Meta is now in @huggingface transformers with Apache-2.0 license 🔥 > 22x faster than SAM2, processes 16 FPS on iPhone 15 Pro Max with no quantization > supports single/multiple/refined point prompting, bounding box prompts
26
157
2K
Due to an unexpected system issues prior to the abstract submission deadline, we are extending author registration and abstract submission by 24h until Nov 07, 2025 (Anywhere on Earth). We are working to restore access as quickly as possible. Please note, however, that the
1
2
21
Thinking with Camera A Unified Multimodal Model for Camera-Centric Understanding and Generation
3
33
238
🔥One-Stop Training Engine for Unified Models🔥 ⚡️LMMs-Engine⚡️ is a lean and flexible unified model training engine built for hacking at scale * Support multimodal inputs and outputs, from AR, diffusion and linear models, to unified models like BAGEL 🏠 https://t.co/x2CW8XZlRu
Throughout my journey in developing multimodal models, I’ve always wanted a framework that lets me plug & play modality encoders/decoders on top of an auto-regressive LLM. I want to prototype fast, try new architectures, and have my demo files scale effortlessly — with full
6
34
186
Congrats to Yuekun Dai @YuekunDai and Ziang Cao @ziangcao_ , both from @MMLabNTU , for winning the prestigious Google PhD Fellowship! Yuekun: https://t.co/MDgugkfTEw Ziang: https://t.co/Jn63YiYG57
@NTUsg
🎉 We're excited to announce the 2025 Google PhD Fellows! @GoogleOrg is providing over $10 million to support 255 PhD students across 35 countries, fostering the next generation of research talent to strengthen the global scientific landscape. Read more: https://t.co/0Pvuv6hsgP
1
3
39
Thinking with Camera (Puffin) just dropped. This AI doesn't just see a picture, it reasons like a director. It predicts lens/pose, guides shots, and generates scenes across views. Simple breakdown:
12
6
56
Introducing 𝐓𝐡𝐢𝐧𝐤𝐢𝐧𝐠 𝐰𝐢𝐭𝐡 𝐂𝐚𝐦𝐞𝐫𝐚📸, a unified multimodal model that integrates camera-centric spatial intelligence to interpret and create scenes from arbitrary viewpoints. Project Page: https://t.co/KxwIpuDUBg Code: https://t.co/DO52LFyL9m
14
33
147
📸Join us at #ICCV2025 for the Mobile Intelligent Photography & Imaging (MIPI) Workshop! ✨Leading keynotes: Profs. @songhan_mit, Michal Irani, Boxin Shi, and @MingHsuanYang - on intelligent photography and efficient GenAI. 🗓Oct 20, 8:50am–12:30pm HST 🔗 https://t.co/CqdCqzdsY1
1
10
28
Congrats to @liuziwei7 !!
🏆 Congrats to #NTUsg Prof Ng Geok Ing on the 🇸🇬 President’s Technology Award 2025. A pioneer in Gallium Nitride (#GaN) – found in fast chargers, EVs, satellites & defence – he built 🇸🇬’s global standing in this field and led the creation of the national GaN centre. 👏 We also
1
1
31
Big thanks to the World Top-Performing Incubator Conference 顶尖孵化器大会 for inviting our Co-founder & CTO Dr. @JingkangY to this inspiring event! 🚀💡#Innovation #Startups #WTIC2025
2
8
26
STream3R Scalable Sequential 3D Reconstruction with Causal Transformer
4
14
110
Lan and Luo et al., "STREAM3R: Scalable Sequential 3D Reconstruction with Causal Transformer" Yep, another streaming feed-forward 3D estimator. This time, with Dust3R backbone. Architecture is now getting pretty close to LLMs :) Are these going to become 3D GPT?
1
23
150
New from S-Lab, Nanyang Technological University & SenseTime Research: Next Visual Granularity Generation (NVG)! This novel framework progressively refines images from global layout to fine details, offering fine-grained control over generation. It outperforms the VAR series in
4
11
66
STream3R reformulates dense 3D reconstruction into a sequential registration task with causal attention. Just tried 3D reconstruction on a #GrokImagine video using #STream3R🫡! Check out STream3R on our GitHub for more👨💻: https://t.co/sj4fRe3vts
Grok Imagine prompt: A lone swordsman in a tattered cloak battles a massive sand serpent in a desert coliseum at midday, his blade flashing as dust clouds swirl around him. The arena is surrounded by crumbling stone pillars and a blazing sun overhead. Harsh sunlight cast...
0
2
9