@angeloskath
Angelos Katharopoulos
6 months
@demirbasayyuce @awnihannun Well actually I don’t think you need any of that due to unified memory. Quantizing the Lora example in mlx should work out of the box. Haven’t tried it yet but I don’t see why not.
0
0
4