
bleedingedge.ai
@bleedingedgeai
Joined October 2022
New open weights LLM from @MistralAI. params.json:
- hidden_dim / dim = 14336/4096 => 3.5X MLP expand
- n_heads / n_kv_heads = 32/8 => 4X multiquery
- "moe" => mixture of experts 8X top 2 👀
Likely related code: . Oddly absent: an over-rehearsed
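The ratios quoted from params.json can be checked with a few lines of Python. This is only a sketch: the four top-level values come from the post, while the key names under "moe" (`num_experts`, `num_experts_per_tok`) are assumed for illustration and may not match the actual file.

```python
# Values quoted in the post. The two keys nested under "moe" are
# assumed names for illustration, not confirmed by the post.
params = {
    "dim": 4096,
    "hidden_dim": 14336,
    "n_heads": 32,
    "n_kv_heads": 8,
    "moe": {"num_experts": 8, "num_experts_per_tok": 2},
}

# MLP expansion: feed-forward width relative to the model width.
mlp_expand = params["hidden_dim"] / params["dim"]

# Number of query heads that share each key/value head.
q_heads_per_kv_head = params["n_heads"] // params["n_kv_heads"]

# Fraction of experts that run for each token under top-2 routing.
moe = params["moe"]
active_fraction = moe["num_experts_per_tok"] / moe["num_experts"]

print(f"MLP expand: {mlp_expand}x")
print(f"Query heads per KV head: {q_heads_per_kv_head}")
print(f"Experts active per token: {active_fraction:.0%}")
```

Under these assumptions, each token passes through 2 of the 8 expert MLPs, so only a quarter of the expert parameters are used per token.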
mixtral-8x7b-32kseqlen from @MistralAI. Mixture of Experts? 🤔.
magnet:?xt=urn:btih:5546272da9065eddeb6fcd7ffddeef5b75be79a7&dn=mixtral-8x7b-32kseqlen&tr=udp%3A%2F%3A6969%2Fannounce&tr=http%3A%2F%3A80%2Fannounce. RELEASE a6bbd9affe0c2725c1b7410d66833e24.
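For readers unfamiliar with magnet links, here is a minimal sketch of how the fields in the link above can be pulled apart with Python's standard library. The link is copied as it appears in the post, including its tracker entries, which have no host name; the variable names are illustrative only.

```python
from urllib.parse import urlsplit, parse_qs

# Magnet link copied verbatim from the post (tracker hosts are missing there).
magnet = (
    "magnet:?xt=urn:btih:5546272da9065eddeb6fcd7ffddeef5b75be79a7"
    "&dn=mixtral-8x7b-32kseqlen"
    "&tr=udp%3A%2F%3A6969%2Fannounce"
    "&tr=http%3A%2F%3A80%2Fannounce"
)

# Everything after "magnet:?" is an ordinary query string.
fields = parse_qs(urlsplit(magnet).query)

# "xt" (exact topic) carries the BitTorrent info hash as the last URN segment.
info_hash = fields["xt"][0].split(":")[-1]

# "dn" is the display name; "tr" may repeat, one entry per tracker.
display_name = fields["dn"][0]
trackers = fields["tr"]

print(info_hash)
print(display_name)
print(trackers)
```

The info hash identifies the torrent content; the display name is only a label and is where the "8x7b" and "32kseqlen" hints come from.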