Robin Courant
@robin_courant
Followers: 61 · Following: 66 · Media: 5 · Statuses: 31
Happy to present “E.T. the Exceptional Trajectories: Text-to-Camera-Trajectory Generation with Character Awareness” (ECCV 2024), with @nico_dufour, @xiwang92, @MarcChristie4, and @VickyKalogeiton. Paper: https://t.co/eNdGW9EOSz Webpage: https://t.co/UqrE6FlXyp
Text-to-Image models don't need 3 training stages anymore! 🤯 Our new MIRO method integrates human alignment directly into pretraining. 19x faster convergence ⚡ and 370x less compute than FLUX-dev. Train once, align to many rewards. The era of multi-stage training is over!
Introducing Chapter-Llama [#CVPR2025], a framework for video chaptering using Large Language Models! Check it out: Paper: https://t.co/1KhPsgZYUN Project: https://t.co/68GevYyznx Code: https://t.co/MysWVlewRm Demo: https://t.co/zKmL6v3PKU
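For intuition, here is a minimal sketch of the general recipe (a language model reads a timestamped transcript and emits chapter boundaries); `complete` is a hypothetical text-completion function, not Chapter-Llama's actual interface:

```python
# Hedged sketch of LLM-based video chaptering: feed a timestamped
# transcript to a language model and parse chapter boundaries from its
# reply. `complete` is a hypothetical text-completion function.

def chapter_video(transcript_lines, complete):
    prompt = (
        "Segment this video into chapters. For each chapter, output one "
        "line formatted as 'HH:MM:SS - title'.\n\n"
        + "\n".join(transcript_lines)  # e.g. "00:01:23 welcome back everyone"
    )
    chapters = []
    for line in complete(prompt).splitlines():
        if " - " in line:
            start, title = line.split(" - ", 1)
            chapters.append((start.strip(), title.strip()))
    return chapters
```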
Masked Diffusion Models (MDMs) are a hot topic in generative AI 🔥: powerful, but slow due to multiple sampling steps. We (@Polytechnique and @Inria) introduce Di[M]O, a novel approach to distill MDMs into a one-step generator without sacrificing quality.
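As a rough illustration of one-step distillation (not Di[M]O's actual objective or API), a student can be trained to match, in a single forward pass from a fully masked input, the token distribution that the multi-step teacher produces:

```python
import torch
import torch.nn.functional as F

# Hedged sketch of one-step distillation for masked diffusion models.
# `teacher.sample_logits`, `student`, and `mask_id` are hypothetical
# stand-ins, not Di[M]O's actual interfaces.

def distill_step(student, teacher, batch_size, seq_len, mask_id):
    masked = torch.full((batch_size, seq_len), mask_id, dtype=torch.long)
    with torch.no_grad():
        target_logits = teacher.sample_logits(masked)  # slow multi-step teacher
    student_logits = student(masked)                   # one-step student pass
    # Match the student's one-step distribution to the teacher's output.
    return F.kl_div(F.log_softmax(student_logits, dim=-1),
                    F.softmax(target_logits, dim=-1),
                    reduction="batchmean")
```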
1/13 Introducing our latest work on improving relative camera pose regression with a novel pre-training approach, Alligat0R (https://t.co/Mi6iy5rQ1A)!
@GBourmaud @VincentLepetit2
🚨 News! 🚨 We have released the models from our latest paper "How far can we go with ImageNet for text-to-image generation?" Check out the models on HuggingFace: 🤗 https://t.co/jaNyoNDN6u... https://t.co/gH6gct7lUA
🎥 AKiRa provides control over camera motion and optics (focal length, distortion, aperture) in video diffusion, enabling cinematic effects like fisheye, focus shifts, and dolly zoom. Paper: https://t.co/0ajalZXZ3a Project Page: https://t.co/uGqwhFPWLK 🧵
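The dolly zoom is a nice example of why optics control matters: a pinhole camera renders a subject at distance d with focal length f at a size proportional to f/d, so the effect keeps f/d constant while the camera moves. A small sketch of that math (illustration only, not AKiRa's interface):

```python
import numpy as np

# A dolly zoom pulls the camera back while zooming in, so the subject's
# image size (proportional to f / d for a pinhole camera) stays fixed.

def dolly_zoom_focals(f0_mm, d0_m, distances_m):
    """Per-frame focal length that keeps the subject's image size fixed."""
    return f0_mm * np.asarray(distances_m) / d0_m

# Camera starts 2 m from the subject at 24 mm and retreats to 6 m over
# 48 frames; the lens must zoom from 24 mm to 72 mm to compensate.
distances = np.linspace(2.0, 6.0, num=48)
focals = dolly_zoom_focals(f0_mm=24.0, d0_m=2.0, distances_m=distances)
print(focals[0], focals[-1])  # 24.0 72.0
```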
Guessing where an image was taken is a hard and often ambiguous problem. Introducing diffusion-based geolocation: we predict global locations by refining random guesses into trajectories across the Earth's surface! Paper, code, and demo: https://t.co/pNRFZk9NYP
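Conceptually, diffusion-based sampling of a location looks something like the sketch below; `denoiser` is a hypothetical trained network and the Euler-style update is deliberately crude (real methods typically operate on the sphere):

```python
import torch

# Conceptual sketch only: `denoiser` is a hypothetical network that
# predicts the noise on a 2D location given the noisy location, the
# timestep, and an image embedding. Plain (lat, lon) is used for brevity.

@torch.no_grad()
def sample_location(denoiser, image_emb, steps=50):
    x = torch.randn(2)  # random initial (lat, lon) guess
    for t in reversed(range(1, steps + 1)):
        eps = denoiser(x, torch.tensor(t), image_emb)
        x = x - eps / steps                      # crude Euler-style update
        if t > 1:
            x = x + torch.randn_like(x) / steps  # keep sampling stochastic
    return x  # the successive values of x trace a trajectory over the globe
```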
I am presenting our paper MusicGen-Style, “Audio Conditioning for Music Generation via Discrete Bottleneck Features”, at @ISMIRConf this afternoon. The code and model weights are available at https://t.co/tSvrr446v3. You can now play with it!
github.com: Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable...
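The repository's MusicGen API makes trying the model straightforward. A hedged usage sketch, where the 'facebook/musicgen-style' checkpoint name is an assumption based on the paper's model name:

```python
# Usage sketch built on audiocraft's MusicGen API; the checkpoint name
# 'facebook/musicgen-style' is an assumption, not a confirmed identifier.
from audiocraft.models import MusicGen
from audiocraft.data.audio import audio_write

model = MusicGen.get_pretrained('facebook/musicgen-style')
model.set_generation_params(duration=8)                     # seconds of audio
wavs = model.generate(['lo-fi hip hop with a warm piano'])  # text prompt
audio_write('sample', wavs[0].cpu(), model.sample_rate)     # writes sample.wav
```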
#ECCV2024 Oct 2 (PM): E.T. the Exceptional Trajectories: Text-to-Camera-Trajectory Generation with Character Awareness. @robin_courant, @nico_dufour, @xiwang92, @MarcChristie4, and @VickyKalogeiton. pdf: https://t.co/ehKsij3e2V webpage: https://t.co/CbimUjiGhR
(1/8) 🎬 Introducing the Short Film Dataset (SFD), a long video QA benchmark with 1k short films and 5k questions. Why another videoQA dataset?
- Story-level QAs
- Publicly available videos
- Minimal data leakage
- Long temporal-context questions
https://t.co/FJQzIRgDxV
We now have a first version of the 512x512 model in the demo! Still training, but we will release the weights soon!
🚨 We updated the demo; you can try your favorite text2image prompts: https://t.co/pBv4RHEd6l Be gentle: it's a tiny model by image-generation standards (300M params), trained from scratch on a ridiculously small dataset (20M img+txt pairs), so it doesn't have high-def capabilities.
Very happy to announce that my paper “Audio Conditioning for Music Generation via Discrete Bottleneck Features”, with @honualx, @adiyossLC, @jadecopet, and Axel Roebel, has been accepted at ISMIR 2024. Paper: https://t.co/2KwG6Bk1jH Sample: https://t.co/Dkom70Eoie Code: soon
Paper: https://t.co/eNdGW9EOSz Webpage: https://t.co/UqrE6FlXyp Code: https://t.co/h2PxDUXEYn Dataset: https://t.co/1AFQWtQoU5 Demo: huggingface.co
Additionally, we train CLaTr, a Contrastive Language-Trajectory embedding, to facilitate the evaluation of camera trajectory generation models.
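A contrastive language-trajectory embedding in the CLIP mold can be sketched as below; the two encoders are hypothetical stand-ins for CLaTr's actual architecture:

```python
import torch
import torch.nn.functional as F

# Minimal sketch of a CLIP-style contrastive objective between camera
# trajectories and captions; `traj_encoder` and `text_encoder` are
# hypothetical stand-ins, not CLaTr's actual design.

def contrastive_loss(traj_encoder, text_encoder, trajs, captions, temp=0.07):
    z_traj = F.normalize(traj_encoder(trajs), dim=-1)     # (B, D)
    z_text = F.normalize(text_encoder(captions), dim=-1)  # (B, D)
    logits = z_traj @ z_text.t() / temp                   # (B, B) similarities
    targets = torch.arange(len(logits), device=logits.device)
    # Symmetric InfoNCE: each trajectory matches its own caption, and vice versa.
    return (F.cross_entropy(logits, targets)
            + F.cross_entropy(logits.t(), targets)) / 2
```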
DIRECTOR exhibits high controllability and diversity, is character-aware, and handles complex input conditions.
We demonstrate the potential of our dataset with DIRECTOR, a camera trajectory diffusion model that leverages both character trajectories and captions.
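In the abstract, one conditional-diffusion training step for such a model looks like the following sketch; the model signature and the cosine noise schedule are illustrative assumptions, not DIRECTOR's actual design:

```python
import torch
import torch.nn.functional as F

# Hedged sketch of a conditional-diffusion training step for camera
# trajectories, conditioned on character trajectories and a caption
# embedding. All interfaces here are hypothetical stand-ins.

def training_step(model, cam_traj, char_traj, text_emb, T=1000):
    t = torch.randint(0, T, (cam_traj.shape[0],))         # random timesteps
    noise = torch.randn_like(cam_traj)
    alpha_bar = torch.cos(t / T * torch.pi / 2) ** 2      # toy cosine schedule
    a = alpha_bar.view(-1, 1, 1)                          # cam_traj is (B, L, D)
    noisy = a.sqrt() * cam_traj + (1 - a).sqrt() * noise  # forward diffusion
    pred = model(noisy, t, char_traj, text_emb)           # conditioned denoiser
    return F.mse_loss(pred, noise)                        # noise-prediction loss
```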
We introduce a camera trajectory dataset called Exceptional Trajectories (E.T.), extracted from real movies. E.T. includes camera and character trajectories along with textual captions.