Adina Yakup Profile Banner
Adina Yakup Profile
Adina Yakup

@AdeenaY8

Followers
4,257
Following
733
Media
127
Statuses
1,140

@huggingface 🤗 | AI Research - Contributing to Chinese ML community.

Joined April 2023
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
@AdeenaY8
Adina Yakup
3 months
Open-Sora 1.2 is out🔥 Open-Sora is an initiative dedicated to efficiently producing high-quality video in open-source way , released by @HPCAITech 👏 Model: Demo: ✨ Video compression network ✨Rectifie-flow training ✨More data
9
112
445
@AdeenaY8
Adina Yakup
11 months
Today released their open source model Yi-34B on the Hub👏🚀 ✨ The FIRST Chinese model to top the Open LLM Leaderboard 💪 Better performance than Falcon-180B and Llama2 70B on pre-training 🇨🇳 Supports both English and Chinese
9
64
365
@AdeenaY8
Adina Yakup
21 days
Impressive work from the Chinese community 🚀 Mini-Omni 🔥 An open multimodal large language model with real-time speech and audio conversation abilities. Model: Demo: Paper: ( you can directly communicate
6
53
248
@AdeenaY8
Adina Yakup
7 months
OpenCodeInterpreter💻 A family of open-source code systems for generating, executing, & refining code🔄 ✨Their 7b models hits 90% accuracy on HumanEval ✨ SC2 series based on StarCoder2 and GM-7b Model with gemma-7b Model: Paper:
4
49
177
@AdeenaY8
Adina Yakup
1 year
Chinese Llama 2 is out! 🔥🔥🔥 Try it in 👇
9
50
199
@AdeenaY8
Adina Yakup
8 months
EVA-CLIP-18B🔥A powerful and probably largest open-source CLIP model released by a Chinese research Lab @BAAIBeijing ✨ 18B params ✨ 80.7% zero-shot accuracy on 27 benchmarks with only 6B samples. ✨ Trained on a smaller dataset, more efficient Model:
0
38
144
@AdeenaY8
Adina Yakup
8 months
RWKV-v5 Eagle 7B is out 🔥 ✨ Trained on 1.1 Trillion Tokens across 100+ languages 📄 Apache 2.0 🚀 Outperforms all 7B class models Model: Demo: 💡Check the blog, their response to the question about the muiti-lingual is really cool
@RWKV_AI
RWKV
8 months
All while being - Cleanly licensed Apache 2, under @linuxfoundation (do anything with it!) - The world's greenest 7B model 🌲 (by per token, energy consumption) You can find out more from our full writeup:
3
16
153
1
29
133
@AdeenaY8
Adina Yakup
9 months
China Telecom becomes the first state-owned enterprises to open source their LLM - TeleChat 7B and high quality pre-train dataset🚀 ✨The size of the dataset reaches 270M, which is one of the largest Chinese pre-training datasets released so far.
4
40
173
@AdeenaY8
Adina Yakup
10 months
SeaLLMs - language models optimized for Southeast Asian languages! It supports English 🇬🇧, Vietnamese 🇻🇳, Indonesian 🇮🇩, Thai 🇹🇭, Malay 🇲🇾, Khmer🇰🇭, Lao🇱🇦, Tagalog🇵🇭 and Burmese🇲🇲 Always excited to see an LLM that goes beyond mainstream languages 🤗
2
48
167
@AdeenaY8
Adina Yakup
7 months
The Era of 1-bit LLMs is now the most upvoted paper of all time🤯🚀
Tweet media one
@AdeenaY8
Adina Yakup
7 months
Just checked out today's top-voted paper, Chinese researchers are on fire🔥 ✨ The Era of 1-bit LLMs from UCAS and Microsoft ✨EMO: Emote Portrait Alive from Alibaba Btw, love seeing authors jump into the conversation thread like
Tweet media one
4
10
89
2
20
156
@AdeenaY8
Adina Yakup
20 days
Let's meet Yi-Coder 🔥 Chinese AI unicorn @01AI_Yi just released its first series of code LLMs! Blog: Model: ✨ 1.5B & 9B base and chat ✨ Apache 2.0 ✨ Context window of 128K tokens ✨ Supporting 52 major programming languages
3
40
158
@AdeenaY8
Adina Yakup
1 year
百模大战 🔛 The battle of 100 large models A trendy word in Chinese large model land. It gives you an idea about how this business is growing in China. 🚀 Show case or business case? 🧵
Tweet media one
Tweet media two
Tweet media three
Tweet media four
16
44
151
@AdeenaY8
Adina Yakup
4 months
OpenRLHF 🔥 A high-performance RLHF framework built on Ray, DeepSpeed, and HF Transformers! ✨ User-friendly, compatible with @huggingface models. ✨ 2x performance boost with Ray and Adam Offload. ✨ Distributed training for 70B+ models on multiple GPUs.
3
41
150
@AdeenaY8
Adina Yakup
8 months
BGE-M3 🔥 Highlights: ✨ Multi-Lingual: supports over 100 languages ✨ Multi-Granularity: input texts up to 8192 characters ✨ Multi-Functionality: integrates dense retrieval/ sparse retrieval/ multi-vector retrieval to support different scenarios
@BAAIBeijing
BAAI
8 months
BAAI releases BGE-M3, a new member to BGE model series. M3 stands for Multi-linguality (100+ languages), Multi-granularities (input length up to 8192), Multi-Functionality (unification of dense, lexical, multi-vec retrieval). 🔥
7
49
213
4
17
146
@AdeenaY8
Adina Yakup
7 months
AutoMathText: A 200GB dataset of mathematical texts open sourced on @huggingface 📊🚀 ✨ Multi-source : arXiv/programming code/web pages ✨ Filtered and processed to adapte Math reasoning ✨ Selected by Qwen 72B Paper:
1
37
145
@AdeenaY8
Adina Yakup
2 months
Just few hours after Shanghai AI lab's two releases, here comes CogVideoX 🔥🚀 a SOTA open video generation model made by @thukeg from the Chinese community 🤯 Model: Demo: And it's only the first day of the week!!😎
3
29
124
@AdeenaY8
Adina Yakup
11 months
Found this multi-lingual fine-tuned version of Zephry-7B in the Hub🔥 🔠 Supports multiple languages: Chinese, Japanese, Korean, English, French, German etc. and cross-language tasks such as translation. 🧠 Strong cognitive abilities.
0
27
120
@AdeenaY8
Adina Yakup
7 months
DeepSeek-VL🐬 An open access VL model designed for real-world vision and language understanding applications 📺🚀 ✨ 1.3B & 7B base and chat ✨ Support commercial use in limited scenarios Paper: Model:
1
18
121
@AdeenaY8
Adina Yakup
26 days
🔥 Just dropped: Qwen2-VL by @Alibaba_Qwen 🚀 ✨ 2B & 7B both under Apache 2.0 ✨ Smart agents for mobile & robot ops ✨ SoTA in image & 20min+ video comprehension ✨ Multilingual: English, Chinese, Japanese, Arabic etc.
2
27
116
@AdeenaY8
Adina Yakup
4 months
This is THE MODEL you don't want to miss!! Qwen 2 ⚡️ POWERFUL open model from @alibaba_cloud is now available on @huggingface Hub 🚀 Model: Demo: ✨ 0.5B / 1.5B / 7B / 57B-A14B / 72B ✨ Apache 2.0 ✨ Support 27 languages ✨ Great
4
24
109
@AdeenaY8
Adina Yakup
4 months
GLM-4-9B 🔥 Chinese model with open access from ZHIPU AI @thukeg , now available on the @huggingface hub🚀 Model: ✨ 9B base ( 8k ) & chat ( 128k and 1M ) , 4V-9B MML ✨ Function call comparable to GPT-4 ✨ All Tools function: enabling smart use of web
3
27
103
@AdeenaY8
Adina Yakup
1 year
YES, WE DID IT AGAIN IN PARIS!! 🥐🤗 #WoodstockAI Can you believe the biggest open source community event in Europe was organised by our @huggingface team in only 2 weeks? 🤯 🧵
2
24
102
@AdeenaY8
Adina Yakup
3 months
DeepSeek-Coder-V2 ⚡️ an powerful MoE code language model with open access is now available on the @huggingface hub 🚀🔥 Model: ✨ 16B & 236B parameters ✨ 128k context length ✨ Code & math skill are between GPT-4o & GPT-4Turbo. ✨ Free commercial use
2
23
101
@AdeenaY8
Adina Yakup
4 months
ConvLLaVA 💻 A visual encoder design replacing Vision Transformer (ViT) in language-vision models (LMM) from Alibaba and Tsinghua University. Model: Paper: ✨ Uses the hierarchical ConvNeXt backbone as the visual encoder for LMM.
0
21
94
@AdeenaY8
Adina Yakup
5 months
Alibaba just released Qwen1.5 - 110b on @huggingface hub🎉 Model: Demo: ✨ The largest one in the Qwen1.5 series ✨ Context length 32K tokens ✨ Multilingual: Chinese, English, French, Korean, Japanese, Vietnamese, Arabic etc.
0
26
96
@AdeenaY8
Adina Yakup
11 days
GOT-OCR2.0 🔥 a 580M end-to-end OCR-2.0 model released by StepFun 阶跃星辰 is now available on the @huggingface Model: Paper: ✨ While others are releasing powerful models, StepFun, a new player in China's OS community is opening
3
38
133
@AdeenaY8
Adina Yakup
25 days
InstantX team from the Chinese community has been making a lot of new moves in open source👇 ✨ Partnered with @ShakkerAI_Team , release FLUX.1-CN-Union & FLUX.1-CN-Depth. ✨ Built a new demo: ✨ Published new paper about
1
29
90
@AdeenaY8
Adina Yakup
7 months
Just checked out today's top-voted paper, Chinese researchers are on fire🔥 ✨ The Era of 1-bit LLMs from UCAS and Microsoft ✨EMO: Emote Portrait Alive from Alibaba Btw, love seeing authors jump into the conversation thread like
Tweet media one
4
10
89
@AdeenaY8
Adina Yakup
1 month
🎥 New Video-LLMs update from the Chinese community! VideoLLaMA 2-72B released by @AlibabaDAMO 🔥 Model: Demo: Paper: ✨ Join the discussion thread, communicate with the authors on the paper page!
1
22
90
@AdeenaY8
Adina Yakup
3 months
VideoLLaMA2 🦙 A set of Video LLMs from @alibaba_cloud is now available on @huggingface 🚀 Model: Paper: ✨ Spatial-Temporal Mastery: Advanced STC for pinpoint video dynamics capture. ✨ Enhanced Audio Branch: Seamless integration
1
29
88
@AdeenaY8
Adina Yakup
3 months
Most upvoted paper in past two weeks from the Chinese community on Daily Papers📑🚀 ✨ ShareGPT4Video: Improving Video Understanding and Generation with Better Captions ✨Depth Anything V2 ✨Autoregressive Model Beats Diffusion:
1
24
89
@AdeenaY8
Adina Yakup
7 months
China has released the <Basic Security Requirements for Generative AI Services>📖setting standards for data, model safety, and security protocols, including assessment guidelines🔍🧵
Tweet media one
7
29
83
@AdeenaY8
Adina Yakup
3 months
New dataset from the Chinese community 🥳 OpenVid-1M 🔥a high-quality text-to-video dataset with 1 million text-video pairs, from @ByteDanceOSS and @NanjingUnivers1 Dataset: Paper: ( most upvoted paper of the day)
0
25
83
@AdeenaY8
Adina Yakup
4 months
Most upvoted paper of May from the Chinese community on Daily Papers 📜🚀 ✨ StoryDiffusion: Consistent self-attention for long-range image and video generation @ByteDanceOSS ✨ ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal
8
19
82
@AdeenaY8
Adina Yakup
21 days
喜报🎉 @huggingface 终于拿下了!! .co 还是会继续延用,输入之后会自动跳转到 .com 🫡
Tweet media one
5
2
81
@AdeenaY8
Adina Yakup
11 months
After the BIGGEST UPDATE (added 3 new evals🚀) of Open LLM Leaderboard, Yi-34B is now the top model of the leaderboard across ALL SIZES/MODELS 🤯
Tweet media one
@AdeenaY8
Adina Yakup
11 months
Today released their open source model Yi-34B on the Hub👏🚀 ✨ The FIRST Chinese model to top the Open LLM Leaderboard 💪 Better performance than Falcon-180B and Llama2 70B on pre-training 🇨🇳 Supports both English and Chinese
9
64
365
1
10
81
@AdeenaY8
Adina Yakup
2 months
Last Monday, @AIatMeta released SAM2. The community shared Medical-SAM2 just a few days later🔥 kudos to Jiayuan Zhu, Yanli Qi and @JundeMorsenWu Dataset: Paper:
1
18
77
@AdeenaY8
Adina Yakup
2 months
Tuesday's here, and we've got an exciting update for THE MiniCPM! 🔥 MiniCPM-V 2.6🚀 is the latest end-side MLLM from @OpenBMB 💫 Communicate with authors on the paper page: Model: ✨ Built on SigLip-400M & Qwen2-7B with 8B
0
27
76
@AdeenaY8
Adina Yakup
2 months
Alibaba's LLMs are entering the Southeast Asian market👀🚀 SeaLLMs is now on V3 🦭 This is a family of OPEN LLMs tailored for Southeast Asia Languages, made by @AlibabaGroup 🔥 Model: Demo: Paper:
1
22
76
@AdeenaY8
Adina Yakup
4 months
What a week in the Chinese open-source community! 🤯 Check the thread 🧵 ✨ Yi 1.5 from @01AI_Yi ✨ DeepSeek V2 / V2 lite from @deepseek_ai ✨ MiniCPM-Llama3-V 2.5 from @OpenBMB ✨ CogVLM2 from @thukeg ✨ Hunyuan DiT from @TencentGlobal ✨ Lumina-T2X FROM @opengvlab
2
11
75
@AdeenaY8
Adina Yakup
6 months
Congrats LiblibAI 🎊🚀 The largest AI image creation platform in China, with millions of creators and 100 millions AI artworks contributed, is now become the FIRST and ONLY AI community officially registered under China's AI service regulations🥳
Tweet media one
7
16
74
@AdeenaY8
Adina Yakup
3 months
MeshAnything🔥converts any 3D representation into Artist-Created Meshes. Super cool work from the Chinese language community ✨🚀 Paper: Demo:
@dylan_ebert_
dylan
3 months
This is a huge deal for Generative 3D. MeshAnything was just released, and is a major leap in terms of mesh topology. This is the beginning of Generative 3D in real-world 3D applications.
21
83
545
0
11
74
@AdeenaY8
Adina Yakup
3 months
ANOLE🦎 an OPEN multimodal model that natively generates images and text without needing stable diffusion from GAIR🔥 Model: Paper: ✨ Native integration: no need for adapters, seamlessly aligning visual and language models. ✨
1
21
73
@AdeenaY8
Adina Yakup
28 days
🔥 Big update on the SOTA text-to-video model from the Chinese community! - @ChatGLM from Tsinghua just released CogVideoX 5B - CogVideoX 2B now supports Apacha 2.0 🙌 Paper: Model: Demo: ✨ CogVideoX
1
17
72
@AdeenaY8
Adina Yakup
18 days
DeepSeek-V2.5 🚀 an OPEN model combining general and coding capabilities just released by Chinese AI unicorn @deepseek_ai . ✨ Combine DeepSeek-V2-Chat & DeepSeek-Coder-V2 ✨ Enhanced writing, instruction-following and human preference alignment
2
18
71
@AdeenaY8
Adina Yakup
3 months
Just discovered an amazing organisation on the @huggingface Hub with over 1,000 researchers in Japan collaborating on open-source LLMs. 🌟 It's inspiring to see global support and contributions to open source in so many ways!🌍🚀
0
18
43
@AdeenaY8
Adina Yakup
4 months
Open Chinese LLM Leaderboard 开放中文大语言模型榜单🏆 from @BAAIBeijing now available on @huggingface hub! ✨ Based on the Eleuther AI Language Model Evaluation Harness ✨ Evaluates on 7 key benchmarks, with all English datasets translated to Chinese
1
21
65
@AdeenaY8
Adina Yakup
8 months
Chinese community is keep releasing interesting models on the Hub🔥🧵 ✨Orion-14B: supports Japanese and Korean ✨Qwen 72B ✨Yi Vision Language Model
2
13
63
@AdeenaY8
Adina Yakup
4 months
Super cool demo!! 🤩 #DreamGaussian4D Paper: Demo:
0
18
61
@AdeenaY8
Adina Yakup
2 months
The open models released by the Chinese community this week are truly remarkable 🔥 Here are some highlights: ✨ CogVideoX 2b from @thukeg - Zhipu "Sora" ✨ Qwen 2 - Math from @Alibaba_Qwen - For advanced math problem-solving.
0
13
62
@AdeenaY8
Adina Yakup
8 months
Bunny🐰is on the Hub! A family of lightweight but powerful multimodal models released by Chinese research lab @BAAIBeijing 🔥 ✨Bunny-3B (SigLIP + Phi-2) outperforms even 13B models!
0
16
60
@AdeenaY8
Adina Yakup
6 months
Beihang University of China released the tech paper of LlamaFactory🦙🌟 Demo: Tech report:
1
12
59
@AdeenaY8
Adina Yakup
7 months
For those overwhelmed by 500 daily new papers on arXiv and not sure where to start, here are some tips💡 📑 Check out Daily Paper() by @_akhaliq for daily digest or subscribe for inbox delivery 📩 🤖 Ask librarian-bot for paper recommendations 🌟 Start
@vanstriendaniel
Daniel van Strien
7 months
Very nice to see the authors of a paper ask librarian-bot for a paper recommendation. I hope it was helpful, @vicgalle_ ! You can find similar papers for a @huggingface paper by commenting ` @librarian -bot recommend`.
Tweet media one
2
4
22
1
9
59
@AdeenaY8
Adina Yakup
9 months
DeepSeekMoE 16B : a new MoE with two innovative strategies, just released by @deepseek_ai 🔥 📊 16.4B parameters 🏋️ Trained on a 2T token dataset ♻️ 40% more efficient than DeekSeek 7B and LLaMA2 7B 💻 Deployed on a single GPU without quantization
2
7
58
@AdeenaY8
Adina Yakup
3 months
Depth Anything 2 🔥 A monocular depth estimation model from HKU and TikTok 🚀 Model: Demo: Paper: ✨ Enhancing depth prediction with synthetic images, larger teacher models, and pseudo-labeled real images.
0
18
57
@AdeenaY8
Adina Yakup
5 months
Near 500 paper claimed in Space ICLR 2024🎉 Here are some tips if you want to engage more with the community 🤗 💡 Join the discussion threads below each paper. 💡 Start a conversation with authors by clicking "@" next to the author's profile photo on
2
17
58
@AdeenaY8
Adina Yakup
5 months
We've prepared everything for @iclr_conf at Space ICLR2024 👀 📑 All accepted papers 💬 Discussions with authors 🧠 Code/Models/Datasets/Demos related to the paper 🙋 Upvotes by the community ❓Is there anything else to add? #ICLR2024 @_akhaliq
Tweet media one
1
17
57
@AdeenaY8
Adina Yakup
7 months
Yi-9B 🚀 is now on the @huggingface hub 🤗 ✨ Strong coding and math skills ✨ Excellent bilingual ability in Chinese and English ✨ Developer friendly, can run on consumer GPUs ✨ Apache 2.0, mail required for free commercial use
2
9
56
@AdeenaY8
Adina Yakup
5 months
SeaLLM-7B-v2.5🔥 Latest update of SeaLLMs, released by @AlibabaGroup on @huggingface Hub! Model: Demo: ✨ Support Vietnamese 🇻🇳, Indonesian 🇮🇩, Thai 🇹🇭, Malay 🇲🇾, Khmer🇰🇭, Lao🇱🇦, Tagalog🇵🇭 and Burmese🇲🇲 ✨ Great on math, common
0
12
53
@AdeenaY8
Adina Yakup
1 month
HF sticker is on vacation🏝️
Tweet media one
3
1
55
@AdeenaY8
Adina Yakup
28 days
HF sticker’s back after an epic journey 🏖️🛥️🪂🏰🚵 So, what was the coolest paper last week on while the sticker was away? 👀
Tweet media one
1
7
54
@AdeenaY8
Adina Yakup
4 months
ChatTTS 💬 A text-to-speech model designed for dialogue scenarios like LLM assistants. Model: Demo: ✨ OS version on @huggingface is pre-trained with 40,000 hours of data. ✨Support English and Chinese. ✨Supports multiple
2
12
54
@AdeenaY8
Adina Yakup
11 days
XVERSE 元象 has released one of the largest MoE models from the Chinese community 🇨🇳 👉 XVERSE-MoE-A36B ✨ 255B total parameters, with 36B actively used during inference ✨ Supports 40 languages, including English, Chinese, Russian, and Spanish ✨ Apache
3
18
61
@AdeenaY8
Adina Yakup
3 months
Open LLM Leaderboard 2⃣️is out!!🚀🏆 And Qwen2 is 🔥🔥🔥
Tweet media one
@clefourrier
Clémentine Fourrier 🍊 - Lisbon offsite!
3 months
LLM performances have been plateauing... so we decided to make the Open LLM Leaderboard steep again 🏔️ 😈 Introducing the Leaderboard 2️⃣ Expect... - new benchmarks - fairer reporting - cool features (did I hear voting and chat template?) 🧵
13
60
217
2
6
48
@AdeenaY8
Adina Yakup
1 year
Probably the best Chinese Llama2 on the Hub 🔥🦙 Model:
1
14
47
@AdeenaY8
Adina Yakup
3 months
ChronoDepth🔥A new work in video depth estimation! Demo: Paper: ✨Achieves frame-to-frame consistency and spatial accuracy ✨Transforms depth estimation into a conditional generation problem, enhancing learning and generalizability.
0
13
46
@AdeenaY8
Adina Yakup
3 months
An exclusive community for Chinese LLMs is here 💫🚀 We're building this org on the @huggingface hub to keep you in the loop with the latest works from the Chinese language community 🔥 And we'd love for you to join us and help sharpen this community
4
4
47
@AdeenaY8
Adina Yakup
2 months
Mindsearch (思·索)🔍 An open AI Search Engine Framework made by @intern_lm 🔥 Communicate with authors in here: ✨ Apache 2.0 ✨ Gathers and integrates info from 300+ web pages in 3 mins ✨ Accurately handles complex queries by breaking them into
2
9
47
@AdeenaY8
Adina Yakup
1 year
Aquila2 Chat 34b from BAAI is on the hub now!🔥 Really glad to see that Chinese LLM are getting out of the 10b range.
3
10
46
@AdeenaY8
Adina Yakup
4 months
Dive into the world of "Benchmark" and "Arena" with today's Daily Papers on @huggingface 📑🧐 ✨ GenAI Arena: An Open Evaluation Platform for Generative Models ✨ WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild
0
11
45
@AdeenaY8
Adina Yakup
2 months
When you're going to call it a day... Qwen released Qwen2 audio🤯 ✨ 7b base & instruct ✨ Voice chat: use your own voice to give instructions ✨ Audio analysis: including speech, sound, music, etc. ✨ Multilingual: supports more than 8 languages and
@AdeenaY8
Adina Yakup
2 months
The open models released by the Chinese community this week are truly remarkable 🔥 Here are some highlights: ✨ CogVideoX 2b from @thukeg - Zhipu "Sora" ✨ Qwen 2 - Math from @Alibaba_Qwen - For advanced math problem-solving.
0
13
62
0
9
46
@AdeenaY8
Adina Yakup
1 year
Our ML engineer @Jhuaplin at #BAAI in Beijing today🤘 Sharing the experience of open source community of @huggingface 🤗
Tweet media one
Tweet media two
1
4
44
@AdeenaY8
Adina Yakup
2 months
25% of global AI papers have come from the Chinese community🤯 Staying tuned to the latest papers on the Daily Papers page is a great way to keep up with AI developments from the Chinese community. Here are the MOST UPVOTED ones from in the last two weeks🔥🚀
3
8
44
@AdeenaY8
Adina Yakup
13 days
LLaMA-Omni 🦙 a speech-language model built upon Llama-3.1-8B-Instruct release by the Chinese community 🚀🔥 ✨ Low-latency speech interaction with a latency as low as 226ms. ✨ Simultaneous generation of both text and speech responses. ✨ Trained in
1
12
46
@AdeenaY8
Adina Yakup
11 months
ChatGLM 3 was released by THUDM today 🔥 Probably the best model of its size in China. ✨ It supports complex scenarios like: Function call, code interpreter, Agent tasks etc. ✨ Fully OPEN to academic research and FREE commercial use with registration.
2
14
44
@AdeenaY8
Adina Yakup
4 months
New MoE from Chinese community 🚀 Skywork-MoE from Kun Lun tech is now on @huggingface Hub🔥 ✨ 146 billion parameters, 16 experts and 22 billion activated parameters ✨ With two innovative techniques: Gating Logit Normalization, which enhances expert
0
9
42
@AdeenaY8
Adina Yakup
3 months
HunyuanDiT from @TencentGlobal has released version 1.2🔥 along with the Hunyuan-Captioner on @huggingface Model: ✨ Generating high-quality image descriptions from various angles and supporting bilingual
0
15
43
@AdeenaY8
Adina Yakup
4 months
FULLY OPEN SOURCE LLM: MAP-NEO 7B👉Another amazing work by MAP on the @huggingface hub🔥 Model: Github: Paper:
1
13
40
@AdeenaY8
Adina Yakup
11 months
You can play with Whisper Large V3 directly in here 👇
2
4
40
@AdeenaY8
Adina Yakup
4 months
虎头帮 TIGER-LAB🐯 The name caught my attention first, then I realized they were behind all these cool works! ⚔️ GenAI-Arena ⚔️ : Benchmarking Visual Generative Models in the Wild ✨Mantis: Optimized for multi-image reasoning with text/image format
1
6
40
@AdeenaY8
Adina Yakup
3 months
Today's Daily Papers page is really inspiring!! So many authors are engaging with the community in discussion thread❤️
1
10
40
@AdeenaY8
Adina Yakup
1 year
The @huggingface team at BAAI Conference in Beijing 🚀 Looking forward to your talk @Jhuaplin 🤗
Tweet media one
1
10
41
@AdeenaY8
Adina Yakup
2 months
The Chinese community is on fire with the release of open models! 🔥🙀 MeshAnything V2 🚀 A transformer that generates artist-created meshes (AM) based on given shapes made by @NTUsg & @Tsinghua_Uni Model weight: Demo: ✨ Upvote
1
10
40
@AdeenaY8
Adina Yakup
2 months
Tele-FLM-1T 🚀 An open LLM released by @BAAIBeijing and TeleAI with 1T parameters💫 ✨Support Chinese & English ✨Apache 2.0 ✨Cost-effective progressive pre-training. ✨Enhanced with Input/Output scalers, RoPE, RMSNorm, SwiGLU. Model:
2
8
39
@AdeenaY8
Adina Yakup
3 months
CodeGeeX4-ALL-9B🔥 An OPEN multilingual code generation model from the latest CodeGeeX4 series, released by @thukeg 🚀 Github: Model: ✨ 128k context length ✨ Supports Function Call ✨ Trained on GLM-4-9B ✨ Open for academic
1
11
34
@AdeenaY8
Adina Yakup
4 months
CraftsMan✂️ A novel generative 3D modeling system from @hkust 🔥 Demo: Paper: ✨ High quality and efficient generation ✨ Interactive Refinement: Users can interactively refine and customize the generated 3D models. ✨ Multiple
0
10
38
@AdeenaY8
Adina Yakup
2 months
MM-Vet is now on v2 🔥🔥🔥 MM-Vet v2 is a benchmark to evaluate LLMs for integrated capabilities , released by @NUSingapore 🚀⚖️ Paper: ✨ Includes a new capability called "image-text sequence understanding." ✨ Expands the
0
16
37
@AdeenaY8
Adina Yakup
18 days
It’s only 9 AM and the community has already submitted 8 new papers on @huggingface 🚀🔥 Time to dive into some cool AI research 💡
1
4
36
@AdeenaY8
Adina Yakup
3 months
Huggy bandanas are ready for the community!🔥 Drop a 🤗in the thread if you're in for our new bandanas and I'll send you one! Can't wait to see everyone rocking them! 🥳✨🤘
Tweet media one
42
1
37
@AdeenaY8
Adina Yakup
4 months
ShareGPT4Video 📺 a series that helping big video-smart systems and video-making AI understand and create videos better, using detailed captions. Model & Dataset: Paper: ✨ ShareGPT4Video: dataset contains 40K high quality videos ✨
0
15
34
@AdeenaY8
Adina Yakup
3 months
🚀👏Another amazing project by the PKU Yuan Group, who also created Open-sora-Plan and MagicTime! ChronoMagic-Bench 📊 A benchmark for metamorphic evaluation of text-to-time-lapse video generation 🔥🎬 Code: Paper:
1
7
36
@AdeenaY8
Adina Yakup
7 months
OmniLMM-12B & OmniLMM-3B , open access LMMs from a Chinese research lab @OpenBMB 🔥 ✨ OmniLMM-12B: Outperform in benchmarks; Trustworthy behavior; Real-time interaction. 🚀 OmniLMM-3B (MiniCPM-V): Runs on most devices, mobile included; Support both Chinese & English. 🪪 Apache
1
5
36
@AdeenaY8
Adina Yakup
2 months
This is the speed of AI in 2024!!🚀🤯 24 hours after its release, the FIRST Llama 3.1 Chinese fine-tune is already available on the @huggingface hub🔥 Kudos to OpenBuddy!
4
4
36
@AdeenaY8
Adina Yakup
27 days
Two interesting papers on today: ✨Top trending: Diffusion Models Are Real-Time Game Engines ✨Most upvotes: Writing in the Margins: Better Inference Pattern for Long Context Retrieval @ authors in the
1
7
34
@AdeenaY8
Adina Yakup
8 months
New benchmark for Chinese LLM Evaluation: CMMMU ⚖️🚀 A Chinese Massive Multi-discipline Multimodal Understanding Benchmark with 12k manually collected multimodal questions. Code: Paper: Dataset:
1
3
33
@AdeenaY8
Adina Yakup
3 months
HuatuoGPT-Vision 华佗 🩺 An OPEN medical MLLMs released by Shenzhen Research Institute of Big Data and @cuhksz 🔥 Dataset: Model: Paper: ✨ The HuatuoGPT series is already being used in hospitals in
0
6
34
@AdeenaY8
Adina Yakup
8 months
MoE-LLaVA 👇released by PKU-Yuan group. Paper: Model: Dataset: Demo:
@LinBin46984
Bin Lin
8 months
🌋MoE-LLaVA with just 3B activated parameters outperforms the LLaVA-1.5-7B on an average of 9 benchmarks, and the 2.2B version even surpasses the LLaVA-1.5-13B in object hallucination benchmark. 🤗We have open-sourced all data, code, and models. Code:
4
55
300
1
3
33
@AdeenaY8
Adina Yakup
5 months
The discussion thread of Phi-3 tech report is getting long , feel free to join 👉
Tweet media one
0
6
34
@AdeenaY8
Adina Yakup
8 months
All here👀
@_akhaliq
AK
8 months
its over 9000
Tweet media one
7
9
280
1
3
31
@AdeenaY8
Adina Yakup
8 months
⚔️ WildVision Arena ⚔️
Tweet media one
@billyuchenlin
Bill Yuchen Lin 🤖
8 months
Introducing Vision Arena! Inspired by the awesome Chatbot Arena, we built a web demo on @huggingface for testing Vision LMs (GPT-4V, Gemini, Llava, Qwen-VL, etc.). You can easily test two VLMs side by side and vote! It’s still a work-in-progress. Feedbacks are welcome! 🔗
33
109
528
1
3
26