AI at Alibaba International
@AI_AlibabaInt
Followers
236
Following
234
Media
47
Statuses
120
As the #AI Business team of Alibaba International Digital Commerce Group (AIDC) , we work on fundation modles to revolutionize digital commerce
Joined September 2024
🚀 Ovis-Image (7B) is live on ModelScope! ✅Delivers frontier-level text rendering—on par with 20B-class models like Qwen-Image and even competitive with GPT-4o on text-heavy tasks. ✅Sharp, layout-aware output for posters, banners, logos, UI mocks, and infographics. ✅Runs fast
1
4
21
We released Marco-Voice months ago,now the pre-trained model weights and an online demo for Marco-Voice are LIVE! Try the Demo: https://t.co/yTBpDCS1KW Find the Project on GitHub: https://t.co/b3CLrkCgIn Download the Models on Hugging Face:
huggingface.co
1
2
7
☕️A coffee takes 5 minutes. A full video now takes even less. Introducing Pixelle-Video — ⚡️AI Fully Automated Short Video Engine. Built fully on ComfyUI workflows & backend. Input an idea → get narration, images, layout, TTS, all in one pipeline.Try Pixelle-Video and share
2
3
29
🎯Introducing Marco Search Agent — an open-source project Towards Real-world & Challenging Agentic Search! @AI_AlibabaInt First of all, we release two agent benchmarks: 🔥 DeepWideSearch — Benchmarking Search Agents on Depth and Width in Information-Seeking 🔥 HSCodeComp —
5
4
11
Wan animate is available on Pixelle MCP🎉 Try to create your kongfu cat video here: https://t.co/uNSvXoBwGz Prompt: Generate a motion transfer video based on this video and cat image using wan animate #AlibabaWan
github.com
An Open-Source Multimodal AIGC Solution based on ComfyUI + MCP + LLM https://pixelle.ai - AIDC-AI/Pixelle-MCP
🎬 Tried the new wan2.2 animate @Alibaba_Wan and ended up with a Taoist master training alongside his cat 🐱💃 Not sure who’s teaching who, but the sync is wild 😂Cooked this up in Pixelle-MCP ⚡ — our own open-source project (yep, also from the Alibaba family 💡).Testing new
0
2
5
Thanks @slatornews for featuring Marco-Voice! 🗣️ Pushing boundaries in TTS with unified voice cloning & emotion control. Check out our work and join us in advancing expressive speech synthesis! https://t.co/xTfwBQ8pCJ
#AI #SpeechTech #MarcoVoice
slator.com
Alibaba’s Marco-Voice pairs voice cloning with controllable emotion for more natural and expressive synthetic speech.
Alibaba unveils Marco-Voice, a new text-to-speech system that combines #voicecloning 🗣️ and emotional #speechsynthesis, 😐😄😠😢😮 delivering more natural and expressive synthetic #speech in Mandarin and English. @Chenyang_Lyu @wangly0229 @AlibabaGroup
https://t.co/Q2wXtU2dez
0
0
5
Game-changing integration! Pixelle-MCP now powered by Google's Nano-Banana for seamless image editing. Here's a simple prompt creates stunning product photography with perfect style consistency! 🎯 Get started: https://t.co/uNSvXoAYR1
#AI #ComfyUI #PixelleMCP
github.com
An Open-Source Multimodal AIGC Solution based on ComfyUI + MCP + LLM https://pixelle.ai - AIDC-AI/Pixelle-MCP
prompt:Using the banana tool, create a photorealistic image highlighting a gold pendant necklace held by a woman. The pendant features a relief pattern from [Sample Image] and hangs from a polished gold chain. The background is a softly blurred neutral beige tone, using a
0
0
7
Join @runninglsy as we unpack #Ovis multimodal #LLM: • Model architecture • Training strategies • Performance on benchmarks • Latest open-source updates Learn how we built this powerful #MLLM!
🔥China’s Open-source VLMs boom—Intern-S1, MiniCPM-V-4, GLM-4.5V, Step3, OVIS 🧐Join the AI Insight Talk with @huggingface, @OpenCompassX, @ModelScope2022 and @ZhihuFrontier 🚀Tech deep-dives & breakthroughs 🚀Roundtable debates ⏰Aug 21, 5 AM PDT 📺Live: https://t.co/brweSm4yT5
0
1
1
🚨 AI Insight Talk: HF Papers Live|VLM Session 📅 Aug 21, 8:00 PM (GMT+8) — Live 🔥China’s Open-source VLMs boom—Intern-S1, MiniCPM-V-4, GLM-4.5V, Step3, OVIS 👨🔬Devs from Shanghai AI Lab, @OpenBMB , @Zai_org , @AlibabaGroup and @StepFun_ai will share their
0
0
4