Robin Courant Profile
Robin Courant

@robin_courant

Followers
61
Following
66
Media
5
Statuses
31

Joined August 2021
Don't wanna be here? Send us removal request.
@robin_courant
Robin Courant
1 year
Happy to present E.T. the Exceptional Trajectories: Text-to-Camera-Trajectory Generation with Character Awareness. ECCV2024 with @nico_dufour, @xiwang92, @MarcChristie4 and @VickyKalogeiton Paper: https://t.co/eNdGW9EOSz Webpage: https://t.co/UqrE6FlXyp
2
5
16
@nico_dufour
Nicolas DUFOUR
17 days
Text-to-Image models don't need 3 training stages anymore! ๐Ÿคฏ Our new MIRO method integrates human alignment directly into pretraining. 19x faster convergence โšก 370x less compute than FLUX-dev ๐Ÿ“‰ Train once, align to many rewards. The era of multi-stage training is over!
1
14
31
@Lucas__Ventura
Lucas Ventura
8 months
Introducing Chapter-Llama [#CVPR2025], a framework for ๐ฏ๐ข๐๐ž๐จ ๐œ๐ก๐š๐ฉ๐ญ๐ž๐ซ๐ข๐ง๐  using Large Language Models! ๐ŸŽฌ๐Ÿฆ™ Check it out: ๐Ÿ“„ Paper: https://t.co/1KhPsgZYUN ๐Ÿ”— Project: https://t.co/68GevYyznx ๐Ÿ’ป Code: https://t.co/MysWVlewRm ๐Ÿค— Demo: https://t.co/zKmL6v3PKU
4
37
200
@yuanzhi_zhu
Yuanzhi
8 months
Masked Diffusion Models (MDMs) are a hot topic in generative AI ๐Ÿ”ฅ โ€” powerful but slow due to multiple sampling steps. We @Polytechnique and @Inria introduce Di[M]O โ€” a novel approach to distill MDMs into a one-step generator without sacrificing quality.
2
30
195
@thibaut_loiseau
Thibaut Loiseau
8 months
1/13 ๐ŸŠ Introducing our latest work on improving relative camera pose regression with a novel pre-training approach Alligat0R ( https://t.co/Mi6iy5rQ1A)! @GBourmaud @VincentLepetit2
2
8
12
@lucasdegeorge
Lucas
9 months
๐Ÿšจ News! ๐Ÿšจ We have released the models from our latest paper "How far can we go with ImageNet for text-to-image generation?" Check out the models on HuggingFace: ๐Ÿค— https://t.co/jaNyoNDN6u... ๐Ÿ“œ https://t.co/gH6gct7lUA
1
3
5
@xiwang92
Xi WANG
11 months
๐ŸŽฅ AKiRa provides control over camera motion and optics (focal length, distortion, aperture) in video diffusion, enabling cinematic effects like fisheye, focus shifts, and dolly zoom. ๐Ÿ“„ Paper: https://t.co/0ajalZXZ3a ๐Ÿ‘‰ Project Page: https://t.co/uGqwhFPWLK ๐Ÿงต๐Ÿ‘‡
2
17
52
@nico_dufour
Nicolas DUFOUR
11 months
๐ŸŒ Guessing where an image was taken is a hard, and often ambiguous problem. Introducing diffusion-based geolocationโ€”we predict global locations by refining random guesses into trajectories across the Earth's surface! ๐Ÿ—บ๏ธ Paper, code, and demo: https://t.co/pNRFZk9NYP
6
37
153
@simonrouard
Simon Rouard
1 year
I am presenting our paper MusicGen-Style โ€œAudio Conditioning for Music Generation via Discrete Bottleneck Featuresโ€ at @ISMIRConf this afternoon. The code as well as the weights of the model are available on https://t.co/tSvrr446v3. You can now play with it!
Tweet card summary image
github.com
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable...
1
10
104
@nico_dufour
Nicolas DUFOUR
1 year
We are in Milan ๐Ÿ‡ฎ๐Ÿ‡น to present ๐ŸŽฅ E.T. the Exceptional Trajectories: Text-to-camera-trajectory generation with character awareness. ๐Ÿ“ Come see our poster #227 this afternoon at #ECCV2024! ๐Ÿš€ Introducing new dataset, diffusion model, and evaluation metric for camera generation!
1
3
19
@antoine_guedon
Antoine Guรฉdon
1 year
#ECCV2024 #Blender If you're interested in high-quality Gaussian Splatting representations that can be edited and animated in Blender without a single line of code, Please come to our Oral Presentation of our ECCV2024 paper Gaussian Frosting on Oct2 at 1:40PM! Thread (1/n) ๐Ÿงต
1
21
118
@ImagineEnpc
Imagine-ENPC
1 year
#ECCV2024 Oct 2 (PM) E.T. the Exceptional Trajectories: Text-to-Camera-Trajectory Generation with Character Awareness. @robin_courant, @nico_dufour, @xiwang92, @MarcChristie4 and @VickyKalogeiton pdf: https://t.co/ehKsij3e2V webpage: https://t.co/CbimUjiGhR
1
2
13
@ridouaneg_
rid
1 year
(1/8) ๐ŸŽฌ Introducing the Short Film Dataset (SFD), a long video QA benchmark with 1k short films and 5k questions. Why another videoQA dataset? ๐Ÿ“– Story-level QAs ๐ŸŽฅ Publicly available videos ๐Ÿ”’ Minimal data leakage โณ Long temporal context questions https://t.co/FJQzIRgDxV
2
12
24
@nico_dufour
Nicolas DUFOUR
1 year
We now have a first version of the 512x512 model in the demo! Still training but we will release weights soon!
@david_picard
David Picard
1 year
๐ŸŽจWe updated the demo, you can try your favorite text2image prompts: https://t.co/pBv4RHEd6l Be gentle: it's a tiny model by image generation standards (300M params) trained from scratch on a ridiculously small dataset (20M img+txt pairs), so it doesn't have high def capabilities
0
2
9
@simonrouard
Simon Rouard
1 year
Very happy to announce that my paper โ€œAudio Conditioning for Music Generation via Discrete Bottleneck Featuresโ€œ done with @honualx @adiyossLC @jadecopet and Axel Roebel has been accepted at ISMIR24. Paper: https://t.co/2KwG6Bk1jH Sample: https://t.co/Dkom70Eoie Code: soon
2
23
97
@robin_courant
Robin Courant
1 year
Additionally, we train CLaTr, a Contrastive Language-Trajectory embedding, to facilitate the evaluation of camera trajectory generation models.
1
0
1
@robin_courant
Robin Courant
1 year
DIRECTOR exhibits high controllability and diversity, is character-aware, and handles complex input conditions.
1
0
1
@robin_courant
Robin Courant
1 year
We demonstrate the potential of our dataset with DIRECTOR, a camera trajectory diffusion model that leverages both character trajectories and captions.
1
0
1
@robin_courant
Robin Courant
1 year
We introduce a camera trajectory dataset called Exceptional Trajectories (E.T.), extracted from real movies. E.T. includes camera and character trajectories along with textual captions.
1
0
1