@armandjoulin
Armand Joulin
4 months
@abacaj We will look to improve our models in future iterations and any feedback will be appreciated (through DMs?). Mistral's models are amazing and if they work for you, all the best!
1
1
11

Replies

@abacaj
anton
4 months
After trying Gemma for a few hours I can say it won’t replace my mistral 7B models. It’s better than llama 2 but surprisingly not better than mistral. The mistral team really cooked up a model even google can’t top
35
57
885
@A_F_B_Jr
Adalberto jr
4 months
@armandjoulin @abacaj Why did you guys choose a smaller hidden dim and a bigger intermediate dim?
0
0
1