Chen Change Loy

@ccloy

Followers
3K
Following
2K
Media
128
Statuses
955

President's Chair Professor @NTUsg. Director of @MMLabNTU. Computer vision and deep learning.

Singapore
Joined November 2010
@synvoAI
Synvo AI
14 days
🚀Thrilled to announce that Synvo AI has joined the NVIDIA Inception program, a significant step forward in accelerating our mission to build context-aware AI solutions! #NVIDIAInception #DeepTech #Entrepreneurship
0
5
8
@chesterzelaya
chester
17 days
EdgeTAM now supported by @thedroneforge what should we do with this model and an autonomous drone?
@mervenoyann
merve
17 days
EdgeTAM, real-time segment tracker by Meta is now in @huggingface transformers with Apache-2.0 license 🔥 > 22x faster than SAM2, processes 16 FPS on iPhone 15 Pro Max with no quantization > supports single/multiple/refined point prompting, bounding box prompts
31
84
903
@mervenoyann
merve
17 days
EdgeTAM, real-time segment tracker by Meta is now in @huggingface transformers with Apache-2.0 license 🔥 > 22x faster than SAM2, processes 16 FPS on iPhone 15 Pro Max with no quantization > supports single/multiple/refined point prompting, bounding box prompts
26
157
2K
@ccloy
Chen Change Loy
18 days
Due to unexpected system issues prior to the abstract submission deadline, we are extending author registration and abstract submission by 24h until Nov 07, 2025 (Anywhere on Earth). We are working to restore access as quickly as possible. Please note, however, that the
1
2
21
@CVPR
#CVPR2026
20 days
Have a question for a #CVPR2026 organizer? Use the form. Form: https://t.co/JnShhBC6V2
0
4
3
@_akhaliq
AK
1 month
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
3
33
238
@liuziwei7
Ziwei Liu
29 days
🔥One-Stop Training Engine for Unified Models🔥 ⚡️LMMs-Engine⚡️ is a lean and flexible unified model training engine built for hacking at scale * Support multimodal inputs and outputs, from AR, diffusion and linear models, to unified models like BAGEL 🏠 https://t.co/x2CW8XZlRu
@BoLi68567011
Brian Bo Li
1 month
Throughout my journey in developing multimodal models, I’ve always wanted a framework that lets me plug & play modality encoders/decoders on top of an auto-regressive LLM. I want to prototype fast, try new architectures, and have my demo files scale effortlessly — with full
6
34
186
@ccloy
Chen Change Loy
1 month
Congrats to Yuekun Dai @YuekunDai and Ziang Cao @ziangcao_ , both from @MMLabNTU , for winning the prestigious Google PhD Fellowship! Yuekun: https://t.co/MDgugkfTEw Ziang: https://t.co/Jn63YiYG57 @NTUsg
@Googleorg
Google.org
1 month
🎉 We're excited to announce the 2025 Google PhD Fellows! @GoogleOrg is providing over $10 million to support 255 PhD students across 35 countries, fostering the next generation of research talent to strengthen the global scientific landscape. Read more: https://t.co/0Pvuv6hsgP
1
3
39
@minchoi
Min Choi
1 month
Thinking with Camera (Puffin) just dropped. This AI doesn't just see a picture, it reasons like a director. It predicts lens/pose, guides shots, and generates scenes across views. Simple breakdown:
12
6
56
@CVPR
#CVPR2026
1 month
The #CVPR2026 submission portal is now open!
2
17
102
@KangLiao929
Kang Liao
1 month
Introducing 𝐓𝐡𝐢𝐧𝐤𝐢𝐧𝐠 𝐰𝐢𝐭𝐡 𝐂𝐚𝐦𝐞𝐫𝐚📸, a unified multimodal model that integrates camera-centric spatial intelligence to interpret and create scenes from arbitrary viewpoints. Project Page: https://t.co/KxwIpuDUBg Code: https://t.co/DO52LFyL9m
14
33
147
@ShangchenZhou
Shangchen Zhou
2 months
📸Join us at #ICCV2025 for the Mobile Intelligent Photography & Imaging (MIPI) Workshop! ✨Leading keynotes: Profs. @songhan_mit, Michal Irani, Boxin Shi, and @MingHsuanYang - on intelligent photography and efficient GenAI. 🗓Oct 20, 8:50am–12:30pm HST 🔗 https://t.co/CqdCqzdsY1
1
10
28
@ccloy
Chen Change Loy
2 months
Congrats to @liuziwei7 !!
@NTUsg
NTU Singapore
2 months
🏆 Congrats to #NTUsg Prof Ng Geok Ing on the 🇸🇬 President’s Technology Award 2025. A pioneer in Gallium Nitride (#GaN) – found in fast chargers, EVs, satellites & defence – he built 🇸🇬’s global standing in this field and led the creation of the national GaN centre. 👏 We also
1
1
31
@synvoAI
Synvo AI
2 months
Big thanks to the World Top-Performing Incubator Conference 顶尖孵化器大会 for inviting our Co-founder & CTO Dr. @JingkangY to this inspiring event! 🚀💡#Innovation #Startups #WTIC2025
2
8
26
@_akhaliq
AK
3 months
STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer
4
14
110
@kwangmoo_yi
Kwang Moo Yi
3 months
Lan and Luo et al., "STREAM3R: Scalable Sequential 3D Reconstruction with Causal Transformer" Yep, another streaming feed-forward 3D estimator. This time, with Dust3R backbone. Architecture is now getting pretty close to LLMs :) Are these going to become 3D GPT?
1
23
150
@HuggingPapers
DailyPapers
3 months
New from S-Lab, Nanyang Technological University & SenseTime Research: Next Visual Granularity Generation (NVG)! This novel framework progressively refines images from global layout to fine details, offering fine-grained control over generation. It outperforms the VAR series in
4
11
66
@TheYihangLuo
Yihang Luo
3 months
STream3R reformulates dense 3D reconstruction into a sequential registration task with causal attention. Just tried 3D reconstruction on a #GrokImagine video using #STream3R🫡! Check out STream3R on our GitHub for more👨‍💻: https://t.co/sj4fRe3vts
@elonmusk
Elon Musk
3 months
Grok Imagine prompt: A lone swordsman in a tattered cloak battles a massive sand serpent in a desert coliseum at midday, his blade flashing as dust clouds swirl around him. The arena is surrounded by crumbling stone pillars and a blazing sun overhead. Harsh sunlight cast...
0
2
9
@_akhaliq
AK
3 months
Next Visual Granularity Generation
2
8
70