apolinario (multimodal.art) Profile
apolinario (multimodal.art)

@multimodalart

Followers
10,201
Following
378
Media
596
Statuses
2,214

ML for Art and Creativity, working @HuggingFace (apolinario @multimodal .art)

Joined July 2021
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
Pinned Tweet
@multimodalart
apolinario (multimodal.art)
5 months
Excited to introduce LEDITS++, a novel way to edit real images with precision ✏️ - Multiple edits ✂️🔁 - Automagic free masking 🪄🎭 - 🆕 DPM-Solver fast inversion 🔀⚡ 🤗 Try it: 🔗 Project: 📝 Paper
8
98
429
@multimodalart
apolinario (multimodal.art)
2 years
Google just announced a DALLE-2-like model: Imagen For now no code, just demo site: And paper:
Tweet media one
18
141
941
@multimodalart
apolinario (multimodal.art)
1 year
I hacked @huggingface Spaces to build an open source @gradio Dreambooth Training UI that allows you to train a model for less than US$0.80 🐱‍💻 (you can also use it locally for free):
30
115
838
@multimodalart
apolinario (multimodal.art)
11 days
My favorite part is that it works really well with out-of-the-distribution garments
Tweet media one
@multimodalart
apolinario (multimodal.art)
11 days
Testing out the new virtual try-on pipeline on @huggingface , IDM-VTON ▶️
Tweet media one
Tweet media two
Tweet media three
7
19
169
19
91
818
@multimodalart
apolinario (multimodal.art)
2 years
1 week of Stable Diffusion A creative explosion is unfolding with Stable Diffusion,s showing the power of open source as state of the art! We curated 23+ applications this week: new features, workflow integrations, UIs; run on Win, CPU, AMD, M1 and more!
9
150
626
@multimodalart
apolinario (multimodal.art)
2 years
After some, uh, developments yesterday: - Stable Diffusion v1-5 is out by @runwayml - Fine-tuned image decoder (VAE) out by @StabilityAI Magic of open source🧙 collaboration continues no matter what, here's the Best Available Stable Diffusion™ notebook:
Tweet media one
10
102
624
@multimodalart
apolinario (multimodal.art)
2 years
Very exciting 'breaking' news! CompVis (research group behind VQGAN) have just released a new 1.45B parameter model to its Latent Diffusion model: From the released image it seems like it has an unprecedented text-synthesis capacity. More to follow soon
Tweet media one
13
123
623
@multimodalart
apolinario (multimodal.art)
8 months
Thanks @angrypenguinPNG for merging my PR to add high resolution to the Illusion Diffusion Space 📺🌀 It's now as fast, double the resolution and has crispy details - go play ▶️
18
95
524
@multimodalart
apolinario (multimodal.art)
2 years
Google just announced "Parti" - a text-to-image model co-developed with "Imagen" "Parti" doesn't use diffusion models - rather it scales up Transformer + VQGAN architectures like DALL-E 1 and its open source replicas (dalle-pytorch, ruDALLE, DALL-E Mini)
Tweet media one
7
102
525
@multimodalart
apolinario (multimodal.art)
1 year
ControlNet is cool, but what if you could have MORE control? 🤯 With MultiDiffusion Region Control you can 🎛️ draw masks ✏️ and give a specific prompt for each mask 📜 The @gradio demo is just out on @huggingface 🤗 - kudos to the author @omerbartal !
9
103
451
@multimodalart
apolinario (multimodal.art)
4 months
Less than 1 minute guide on how to train your own LoRA with LoRA Ease 🧞‍♂️⚡ Train high-quality LoRAs on objects 📦, faces 😊, styles 🎨 or characters 🧑‍🎤 effortlessly and super cheap ༄ ▶️
16
96
429
@multimodalart
apolinario (multimodal.art)
4 months
You can now finally create your own stock photo smiling while eating salad in seconds 👨‍🎤🥗 IP-Apdater-FaceID Plus was silently released last week - it's first inference technique time face really captures my likeness 🥸🦚 ▶️
8
77
431
@multimodalart
apolinario (multimodal.art)
2 years
It's out! 🥳 Browse visually the Stable Diffusion Concepts Library - and use more than 100+ community taught concepts in your prompt directly on the same UI! Colab with Gradio UI:
Tweet media one
Tweet media two
7
70
429
@multimodalart
apolinario (multimodal.art)
1 year
How to train your own ControlNet? 🥅 We wrote a guide, ranging from deciding which controls to use 🎛️, how to prepare your dataset, all the way to gpus going brrr 🔥 (with an unexpected trip to the uncanny valley 👀) From me and @pcuenq with ❤️
10
93
423
@multimodalart
apolinario (multimodal.art)
8 months
Iterated with @angrypenguinPNG on some enhancements to their Illusion Diffusion Space, @MrUgleh -inspired QR ControlNet patterns 🌀 ▶️
Tweet media one
13
61
395
@multimodalart
apolinario (multimodal.art)
8 months
Upgraded the TokenFlow demo to an A100! And defaults changed - the edits should be ~2.5x faster
8
55
385
@multimodalart
apolinario (multimodal.art)
1 year
This was drawn by GPT-4
Tweet media one
16
26
376
@multimodalart
apolinario (multimodal.art)
1 year
The first large scale open source DALL-E 2 replication is here🧙 Karlo is an unCLIP model trained by #KakaoBrain I'm having fun playing with it on 🤗 @huggingface Spaces: Model card: GitHub:
Tweet media one
13
80
376
@multimodalart
apolinario (multimodal.art)
9 months
Introducing LoRA the Explorer 🔎: browse the coolest SDXL LoRAs, play with them online ▶️, use locally 💿 (...and no need to dodge semi-naked waifus 🚫) Join the fun 🕺
6
83
360
@multimodalart
apolinario (multimodal.art)
2 years
The Stable Diffusion Multi Inpainting Spaces is out! On it you can do both: Inpainting by masking the image (with the newest @Gradio masking) or inpainting with words, your choice!
8
63
355
@multimodalart
apolinario (multimodal.art)
2 years
🧨 diffusers 0.5.0 now supports JAX for super fast #stablediffusion inference on TPUs You can generate 8 images in ~8s on Colab Free using TPU 🚀
Tweet media one
2
77
356
@multimodalart
apolinario (multimodal.art)
2 years
I'm super thrilled to announce that our assemble of the Latent Diffusion LAION-400M text-to-image model is now available on @huggingface 🤗, democratizing even further the access to text-to-image ai art! Thank you for all the help @osanseviero !
11
80
354
@multimodalart
apolinario (multimodal.art)
2 years
I'm delighted to announce I've joined @huggingface as a ML Art Engineer 🤗, to help make AI art even more accessible, easy to use and to develop for! This tech is going to empower human expression and creativity in unprecedented ways - and building it openly feels the right way!
Tweet media one
29
30
354
@multimodalart
apolinario (multimodal.art)
3 months
Text-to-3D and Image-to-3D in 7 seconds 🤯 💨 That's LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation 🧊 And it's open source ✨ Try it ▶️
38
75
315
@multimodalart
apolinario (multimodal.art)
8 months
ControlNets are cool, but T2I-Adapters are 94% smaller 🤏 , and way faster 💨 Today TencentARC released 6 T2I Adapters for SDXL: depth, canny, lineart, openpose, and... DOODLY! Come play:
Tweet media one
3
62
328
@multimodalart
apolinario (multimodal.art)
1 year
The MarioGPT @huggingface Spaces demo is now playable! 🕹️ Now you can play the levels you generate - hopefully you're better than me 😂
7
59
313
@multimodalart
apolinario (multimodal.art)
5 months
Meta just released a new collection their open access "Seamless" translation models 🔊 They do speech-to-text, text-to-speech, speech-to-speech, text-to-text 💬🔄📝 The Expressive model keeps speech rate, pauses and style 🗣️ 📁 Models and demos:
6
76
308
@multimodalart
apolinario (multimodal.art)
1 year
The diffusers 🧨 library just did a release incorporating ControlNet, it runs so fast! 🏎️‍💨 Blog: Colab:
Tweet media one
3
61
302
@multimodalart
apolinario (multimodal.art)
2 years
Collaborative new concepts on #StableDiffusion 🎨 1. Teach Stable Diffusion new concepts 👩‍🏫(add to the public library if you wish): (or browse the library to pick one🧤 ) 2. Run with the learned concepts 🖼️
Tweet media one
4
60
284
@multimodalart
apolinario (multimodal.art)
1 year
Stable Diffusion 2 by @StabilityAI is out with new 5 models 👽 You can try now the 768x768 model (the largest one released) on @huggingface Spaces
Tweet media one
9
44
278
@multimodalart
apolinario (multimodal.art)
4 months
Happy Public Domain day! 🎉 To celebrate Steamboat Willie finally joining the public domain, I created a @huggingface dataset with all frames of the 1928 short 🐭📜 ▶️
Tweet media one
Tweet media two
7
51
275
@multimodalart
apolinario (multimodal.art)
2 years
Breaking news: OpenAI open sourced their CLIP ViT-L/14 @336px ! I'll hook it soon to many generation systems, stay tuned!
6
34
273
@multimodalart
apolinario (multimodal.art)
2 years
Ok - I just quickly assembled the LAION-400M trained Latent Diffusion CFG TTI model to a Google Colab, you can try it yourself: "A mecha robot holding a sign that reads: 'This is weird'"
Tweet media one
@multimodalart
apolinario (multimodal.art)
2 years
Very exciting 'breaking' news! CompVis (research group behind VQGAN) have just released a new 1.45B parameter model to its Latent Diffusion model: From the released image it seems like it has an unprecedented text-synthesis capacity. More to follow soon
Tweet media one
13
123
623
40
53
262
@multimodalart
apolinario (multimodal.art)
2 years
🎅 Ho-ho-ho! Today a bunch of ICLR 2023 papers dropped! This is a conference with blind submission, authors are anonymous till review A lot of multimodal AI: text-to-video (yes, another one), text-to-3D, another 'teach-diffusion-new-concepts', texto-to-audio... and more! 🧵
4
57
256
@multimodalart
apolinario (multimodal.art)
2 years
OPEN TO EVERYBODY! I optimized the Latent Diffusion LAION-400M Colab RAM usage and now it should run on free non-Pro accounts. And fast! 8 images in 20 seconds on a P4 GPU Google Drive support and VRAM optimizations by @RiversHaveWings were also added
Tweet media one
@multimodalart
apolinario (multimodal.art)
2 years
Ok - I just quickly assembled the LAION-400M trained Latent Diffusion CFG TTI model to a Google Colab, you can try it yourself: "A mecha robot holding a sign that reads: 'This is weird'"
Tweet media one
40
53
262
22
46
253
@multimodalart
apolinario (multimodal.art)
2 years
Stable Diffusion model card is up, and the weights are available for academic and research purposes first This is the first step ahead of a full public release which should be coming soon! 🤩 #StableDiffusion
4
51
253
@multimodalart
apolinario (multimodal.art)
5 months
Stable Video Diffusion is an amazing (and chonky 🐼) new model by @StabilityAI - if you can't run it locally, you can now play with it on @huggingface Spaces 🤗 ▶️
2
41
253
@multimodalart
apolinario (multimodal.art)
2 years
This week's updates were not only made of Dall-E 2! We also got: - Latent Diffusion LAION 400M (an open model!) - KNN Diffusion paper (promising new approach to text-to-image) - 3 new exciting TEXT-to-VIDEO models! and more! Check out our weekly update:
4
47
248
@multimodalart
apolinario (multimodal.art)
2 years
And the Spaces for the Stable Diffusion Concepts Library is out! Navigate 250+ community taught object and styles with Textual Inversion and use them in your prompts!
Tweet media one
3
47
246
@multimodalart
apolinario (multimodal.art)
2 years
Yesterday OpenCLIP released the first LAION-2B trained perceptor! a ViT-B/32 CLIP that suprasses OpenAI's ViT-B/32 quite significantly:
Tweet media one
3
35
246
@multimodalart
apolinario (multimodal.art)
2 years
DALL-E Flow is an awesome new tool by @JinaAI_ 's @hxiao Like Centipede Diffusion it is a mix of models: It generates images from with DALL-E Mega, refines and creates variations with Latent Diffusion, ranks the best with CLIP and upscales the results
11
41
237
@multimodalart
apolinario (multimodal.art)
2 years
Following the full open source release of Stable Diffusion, the @huggingface Spaces for it is out🤗 Stable Diffusion is a state-of-the-art text-to-image model that was released today by @StabilityAI #stablediffusion
Tweet media one
4
64
232
@multimodalart
apolinario (multimodal.art)
1 year
InstructPix2Pix by Tim Brooks allows you to write natural language instructions to edit images ✏️🖼️ We are getting closer and closer to "photoshop with words"! 🎨 Play with it now on @huggingface Spaces
Tweet media one
7
42
225
@multimodalart
apolinario (multimodal.art)
1 year
Since VQGAN+CLIP times, we've been learning to prompt with @openai CLIP knowledge (incl. SDv1, conditioned on OAI CLIP) Stable Diffusion 2 breaks that 💥 with LAION-trained CLIP, "trending on artstation", "greg rutkowski" don't work; we're all learning to prompt again! 👶
15
23
222
@multimodalart
apolinario (multimodal.art)
2 years
MindsEye - an open source interface to 'pilot' AI art models without using code - is now available to everyone Check it out, share it around and let me know what you think! Colab: Discord: Guide and FAQ:
17
36
197
@multimodalart
apolinario (multimodal.art)
2 years
Introducing Majesty Diffusion👑 Dango233 princesses were crowned queens, and Majesty Diffusion is born! Two colabs are being released with new plenty of new features - but I need your help with one thing, come with me🧶
Tweet media one
8
50
194
@multimodalart
apolinario (multimodal.art)
17 days
PAG (Perturbed-Attention Guidance) is not getting nearly the attention it deserves, I've adapted it to work on SDXL with diffusers 🧨 ...and it DELIVERS! 🤯 Try it here ▶️ thanks to KU-CVLAB researchers: Donghoon Ahn Hyoungwon Cho et. al ❤️
@OpenCVUniverse
OpenCV University
1 month
Recent studies reveal that the quality of samples from diffusion models relies on techniques like CG and CFG, yet these fall short in unconditional generation and tasks like image restoration. This research paper introduces Perturbed-Attention Guidance (PAG), a novel method…
Tweet media one
1
3
28
7
54
197
@multimodalart
apolinario (multimodal.art)
2 years
I've released MindsEye Lite👁️🧠: a UI that runs multiple text-to-image models without Colabs or logins - directly on Hugging Face Spaces Run Diffusion, DALLE replicas, VQGAN+CLIP. Try it out and consider sending it to someone that tried used AI art yet!
3
42
176
@multimodalart
apolinario (multimodal.art)
3 months
🚨 A new text to image model by @StabilityAI is out! It's Stable Cascade 💧 an iteration on the Würstchen architecture by @dome_271 & @pabloppp I made a demo for it:
21
43
170
@multimodalart
apolinario (multimodal.art)
7 months
Introducing LoRA Roulette 🎲 Two custom models are loaded at random every refresh 🔄 - can you find a fun way to combine them? 🎨 ▶️
6
31
166
@multimodalart
apolinario (multimodal.art)
3 months
Introducing ✨ LoRA Studio ✨ a dedicated UI by @enzostvs for LoRAs hosted on @huggingface 🤗 browse and generate images with fun models 🎉 (and safe models, no need to worry if your mom or your colleague enter the room while you are browsing 😳 🔞) ▶️
12
42
166
@multimodalart
apolinario (multimodal.art)
2 months
Have you tried OOT-Diffusion? 👕 A state of the art diffusion virtual try-on that just works with any person and any clothes ✨ - fully open source 💥 Official demo by Yuhao Xu: ▶️
3
45
168
@multimodalart
apolinario (multimodal.art)
2 years
This is fun! A new leap! You show the model 3-5 images of what you want, it 'learns' what it is and now you use it on your prompts! And the approach is be pluggable to different models (here they applied it to Latent Diffusion) Code is not yet out - excited for it!
Tweet media one
@_akhaliq
AK
2 years
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion abs: “Textual Inversions”, operates by inverting the concepts into new pseudo-words within the textual embedding space of a pre-trained text-to-image model
Tweet media one
10
170
852
4
19
162
@multimodalart
apolinario (multimodal.art)
7 months
Generate 4 SDXL images in < 5s seconds, no queue, for free! 🏎️💨 (I did NOT speed up this video!) ▶️
3
28
159
@multimodalart
apolinario (multimodal.art)
11 days
Testing out the new virtual try-on pipeline on @huggingface , IDM-VTON ▶️
Tweet media one
Tweet media two
Tweet media three
7
19
169
@multimodalart
apolinario (multimodal.art)
23 days
Two days ago, @stabilityai quietly released CosXL and CosXL Edit, fine-tuned SDXL models that can produce full color range images ⬛⬜ You can now try them out on @huggingface ! 🕹️ ▶️
6
36
147
@multimodalart
apolinario (multimodal.art)
2 months
SDXL Lightning is a new distilled SDXL model by ByteDance: LCM+progressive distillation+adversarial objective ⚡️ They have a 1, 2, 4 and 8 step variations, below I'll test the prompt: "An unicorn plush toy on the beach" 🦄 for every step 🧵 (my favorite is 4 steps 🦶)
Tweet media one
4
21
140
@multimodalart
apolinario (multimodal.art)
6 months
The @huggingface Hub now has `model templates`: instead of a blank `/new` page: a page tailored towards uploading a specific kind of model 📙🎨 The first model template is one of the most requested: SD LoRAs! Share it with your fine-tuner friends 🤗
7
27
140
@multimodalart
apolinario (multimodal.art)
1 year
DeepFloyd IF is here! 💫 Demo Fits on Colab Free with diffusers 🧨 👩‍💻 GitHub:
Tweet media one
7
36
141
@multimodalart
apolinario (multimodal.art)
1 year
fast & longer text-to-video with 🧨 diffusers you maybe saw fun junky text-to-video from the ModelScope's research model lately with diffusers you can control how long the video is - and fit it on smol VRAM GPUs, including free colab. Try out here:
Tweet media one
6
33
136
@multimodalart
apolinario (multimodal.art)
2 years
MindsEye - an open source interface to 'pilot' AI art models without using code - is now available to everyone Check it out, share it around and let me know what you think! Colab: Discord: Guide and FAQ:
12
26
129
@multimodalart
apolinario (multimodal.art)
7 months
Don't keep calm: the first Latent Consistency Model is out 🚀! It can generate good images in just 1, 2, 4, 8 blazing fast steps ⚡ (this video is not sped up) (Distilled from a SD1.5 fine-tune in 32 A100/h) 💎
@arankomatsuzaki
Aran Komatsuzaki
7 months
Latent Consistency Models: Synthesizing High-Resolution Images with Few-step Inference Achieves SotA text-to-image generation performance with few-step inference proj: abs:
Tweet media one
1
27
150
8
28
133
@multimodalart
apolinario (multimodal.art)
2 years
This week was🔥, we got: DALL-E Flow Dall-E Mega reaching 50% of training Centipede Diffusion added inpainting OpenCLIP LAION-400M ViT-B/16+ released CLIP-Forge (a text-to-3D shape model) and more! Check it out in our multimodal ai art weekly news:
3
20
135
@multimodalart
apolinario (multimodal.art)
8 months
Würtschen: a new, trained from scratch high res (1024x1024) model by @dome39931447 Inference is at a fraction of SDXL. And trained with 6x less compute than SD1.4 Quality trade-offs 🤔? Try it for yourself! PS: this video is not sped up!
4
33
136
@multimodalart
apolinario (multimodal.art)
6 months
Is it a LoRA or a Latent Consistency Model? 🤔 Well, both! 🔄 Just hook the LCM LoRA to SDXL or SD1.5 and boom! Now it can do inference in 4-8 steps 🤯 📚
3
28
134
@multimodalart
apolinario (multimodal.art)
4 months
Top 10 trending in text-to-image on the @huggingface Hub: 1️⃣ SDXL Turbo by @StabilityAI 🏎️💨 2️⃣ DPO SDXL by @meihuadang ⚙️⚡ 3️⃣ Playground v2 by @Suhail 's Playground 🛝 4️⃣ Stable Diffusion v1-5 by @runwayml 🎨 5️⃣ Stable Diffusion XL by @StabilityAI 🌌🔍 6️⃣ OpenDalle by…
Tweet media one
5
33
133
@multimodalart
apolinario (multimodal.art)
1 month
Introducing Face-to-All👨‍🎤, a diffusers 🧨 workflow inspired by @fofrAI amazing Face-to-Many ComfyUI workflow Input a face, any style LoRA and get a stylized portrait Colab with code: Thanks @Haofan_Wang for merging our img2img pipeline to InstantID!
Tweet media one
7
30
132
@multimodalart
apolinario (multimodal.art)
2 years
With the explosion #StableDiffusion use-cases 🖼️, it's impossible for 🧨 diffusers maintainers to keep up 🥵 But with your help they don't have to! With community pipelines, the community jumps in 🤝 implementing cool use-cases or papers Check it out!
Tweet media one
1
27
128
@multimodalart
apolinario (multimodal.art)
1 year
I was today years old when I learned that now @GoogleColab localhost links now have an internal reverse proxy! 🤯 No more ngrok needed!
Tweet media one
6
12
128
@multimodalart
apolinario (multimodal.art)
2 months
Introducing Grog 🖖 @Gradio 🤝 @replicate 's Cog Grog automagically creates a Gradio UI for any Cog/Replicate 🪄 Use: - Locally: UI and backend in your machine 🖥️ - Replicate API: local UI, API backend 🌐 - Dockerfile: cloud or @huggingface Spaces
4
26
125
@multimodalart
apolinario (multimodal.art)
2 years
Kind of stealthily Microsoft released "Improved VQ-Diffusion" - a follow-up on their technique that combines a VQ-VAE with diffusion They released the code, the weights and a new VQ-VAE trained I'm running the first experiments:
Tweet media one
Tweet media two
3
17
123
@multimodalart
apolinario (multimodal.art)
2 years
diffusers 🧨v0.4.0 introduces (among many other amazing features) negative prompts to Stable Diffusion! Now you can, keeping the same seed, remove specific objects, colors or concepts from your output 🟪🏙
Tweet media one
@psuraj28
Suraj Patil
2 years
🧨diffusers 0.4.0 is out: Better, faster stronger! 🚀35% faster #stablediffusion in fp16 ✨New scheduler API ➖negative prompts in #stablediffusion pipeline 🧹No more use_auth_token if you are logged in to hub 🤩Community pipelines More in release notes
3
57
330
0
19
123
@multimodalart
apolinario (multimodal.art)
2 years
Latent Majesty Diffusion 1.6, by @dango233max and me is out New stuff: better defaults, fixed inpainting, @laion_ai models, defaults lib on @huggingface and way more Pics: Hot Pot King, Lego Burger, Tamagotchi Ghost Wanderer, a business angel
Tweet media one
Tweet media two
Tweet media three
Tweet media four
5
37
120
@multimodalart
apolinario (multimodal.art)
7 months
Got DALL-E 3 via Bing, and there's a game-changer aspect that no one is talking about prompt: "the line to the first feijoada restaurant in Tokyo" 🫘🗼 but do you see the 2nd line? It almost reads "serving authentic brazil cuisine" That's mind blowing! 🤯 Yup, Imagen, IF,…
Tweet media one
7
9
121
@multimodalart
apolinario (multimodal.art)
2 years
This week on multimodal ai art news: - Dall-E Mega sneak peak (try it yourself!) - CLIP-GEN code released (further exploration needed) - New StyleGAN XL 1024px model out - Flamingo Visual Language Model announced And more! Check it out:
4
18
119
@multimodalart
apolinario (multimodal.art)
2 months
Found it! It was WonderJourney! (the first frame is the input image)
@multimodalart
apolinario (multimodal.art)
2 months
I saw a model that from a single image generated a long flythorugh video from it - but I don't rememebr the name and can't find it anymore, does anyone remember/have a name to it?
3
0
6
0
25
122
@multimodalart
apolinario (multimodal.art)
1 year
the problem of @ylecun with autoregressive models
Tweet media one
1
12
118
@multimodalart
apolinario (multimodal.art)
5 months
I've updated the LEDITS++ demo to use @Gradio 4.0 and with that we got faster performance and the webcam component for free 📸 ▶️
3
25
120
@multimodalart
apolinario (multimodal.art)
1 year
Google is soon releasing a suite of generative AI APIs and no/low code interfaces for text generation with PaLM But besides the text generation aspects, it seems they are also sneaking in an Imagen service/API release!
Tweet media one
3
16
118
@multimodalart
apolinario (multimodal.art)
1 year
We're kicking off the Control Stable Diffusion Community Sprint! 🧨 @googlecloud is kindly providing @huggingface with free TPU-v4 for you to train ways to Control Stable Diffusion (with ControlNet or otherwise) 🚀 Join us here!
Tweet media one
0
24
118
@multimodalart
apolinario (multimodal.art)
5 months
Ramping up the Stable Video Diffusion with 🧨 diffusers! folks from @diffuserslib just merged compatibility with SVD - I updated my demo to use it - uses less VRAM 🤏, runs faster 🏃🏻⚡️ with torch.compile(), which means smaller queues for the demo ▶️
3
21
115
@multimodalart
apolinario (multimodal.art)
2 years
Sneak peak of MindEye Generator: a user interface to run multiple models (starting with Disco Diffusion) in one place Platform agnostic, no need for a powerful GPU (as you're be able to execute it from a Google Colab) Will be starting a beta-testing soon, stay tuned!
14
11
115
@multimodalart
apolinario (multimodal.art)
2 years
The Stable Diffusion Collaborative library of textual inversion trained concepts was offline for a bit after a member of the community thought it would be funny to delete it... Now it is back online 🔥 & protected while still open for collaboration 🤗
Tweet media one
5
9
112
@multimodalart
apolinario (multimodal.art)
7 months
I'm having so much fun with the Ikea Instructions LoRA
Tweet media one
@ostrisai
Ostris
7 months
New Stable Diffusion XL LoRA, Ikea Instructions. SDXL does an amazingly hilarious job at coming up with how to make things. Special thanks to @multimodalart and @huggingface for the GPU grant!! HF -> Civitai ->
Tweet media one
Tweet media two
Tweet media three
Tweet media four
13
80
491
3
14
115
@multimodalart
apolinario (multimodal.art)
2 years
GLID-3: mindblowingly good photorealistic images CLIPMatrix: text-guided 3D mesh stylization LAION 5B: a 5B image-text pair dataset 4 new CLIP-like models That's just a small glimpse of what happened on the last 7 days! Check out our weekly update:
4
23
114
@multimodalart
apolinario (multimodal.art)
2 years
Last 2 weeks in multimodal AI art: First big text-to-video model out (CogVideo), more out on text-to-3D, 'image editing with text' getting better - and a bunch of community trained diffusion models 🧵
2
11
112
@multimodalart
apolinario (multimodal.art)
7 months
Big release that got under the radar 📡📟: last week @ml6team released Fondant-25M: dataset of image-text pairs with a @creativecommons license And that's just the tip: they are working on a 500M one 🤯 Blog: Dataset on 🤗
1
26
112
@multimodalart
apolinario (multimodal.art)
2 years
Happy birthday @ak92501 ! Thank you for keeping us updated with the state of the art on many different fronts in the field of Machine Learning! Your curation is amazing and became a de-facto benchmark in the industry!
Tweet media one
@_akhaliq
AK
2 years
It’s that time of year again
Tweet media one
15
3
175
0
5
111
@multimodalart
apolinario (multimodal.art)
1 year
Sidney can draw! 🖌️ I asked Bing chat/Sidney to illustrate the Waluigi Effect and Unaligned AGI as an SVG image Rule: it can't use the <img> tag (otherwise it tries to cheat with a hallucinated URLs) These are the things it came up with:
Tweet media one
6
13
111
@multimodalart
apolinario (multimodal.art)
17 days
I'm having a lot of fun with the Pokemon Trainer Sprite LoRA by @wizadwongsa 🐭🐦 It turns any prompt into a Pokemon trainer, any! (can you guess them all?) ▶️
Tweet media one
Tweet media two
Tweet media three
Tweet media four
7
8
110
@multimodalart
apolinario (multimodal.art)
2 years
ˢᵒᵒⁿ
Tweet media one
3
9
108
@multimodalart
apolinario (multimodal.art)
8 months
DALL-E 3 in one of my most challenging prompts “The inauguration of a wormhole portal between Shanghai and New York. New York is inside the portal and Shanghai outside of it”
@willdepue
will depue
8 months
Tweet media one
Tweet media two
6
3
241
5
10
105
@multimodalart
apolinario (multimodal.art)
11 months
Working from the @huggingface coworking space in Barcelona 🤗
Tweet media one
Tweet media two
5
4
101
@multimodalart
apolinario (multimodal.art)
28 days
The Face-to-All demo is here! Customize your face with any LoRA in the @huggingface Spaces demo 👩‍🎤 ▶️
@multimodalart
apolinario (multimodal.art)
1 month
Introducing Face-to-All👨‍🎤, a diffusers 🧨 workflow inspired by @fofrAI amazing Face-to-Many ComfyUI workflow Input a face, any style LoRA and get a stylized portrait Colab with code: Thanks @Haofan_Wang for merging our img2img pipeline to InstantID!
Tweet media one
7
30
132
4
25
100
@multimodalart
apolinario (multimodal.art)
2 months
Now it's very easy to create a jupyterlab instance as a @huggingface Space 👽 It's free for CPU spaces, but if you have a cool GPU project (fine-tuning, testing), we provide grants ✍ ▶️
Tweet media one
9
15
99
@multimodalart
apolinario (multimodal.art)
1 year
Trying the "Image Mixer" model by @Buntworthy out!
Tweet media one
@Buntworthy
Justin Pinkney
1 year
📢🌀 Released my "Image Mixer" model! Mix up concepts in multiple images and words to generate novel pictures! Try it on @huggingface spaces here:
Tweet media one
31
182
949
2
10
98
@multimodalart
apolinario (multimodal.art)
10 months
From removing your glasses to turning you into taylor swift, LEDIS approaches you to a photoshop with words using AI! ✏️✨
4
18
98
@multimodalart
apolinario (multimodal.art)
2 years
Whoa, this is promising! Centipede Diffusion combines the our Latent Diffusion Colab and Disco Diffusion by @gandamu_ml , @Somnai_dreams and @zippy731 in a clever way (basically Disco is used as an upscaler for Latent!) Will try it and report back soon!
Tweet media one
@proximasan
proxima centauri b
2 years
Centipede Diffusion (Disco Diffusion 5.2 + Latent Diffusion LAION-400M model) 😲
14
14
116
11
10
96
@multimodalart
apolinario (multimodal.art)
7 months
Drag and drop your pic and see IDEFICS-80B attempt to make a meme of it ▶️
5
22
97
@multimodalart
apolinario (multimodal.art)
2 years
OpenAI on DALL-E 2: "We used advanced techniques to prevent photorealistic generations of real individuals’ faces, including those of public figures." People on our Discord server with Majesty Diffusion:
Tweet media one
4
5
91