@demirbasayyuce @awnihannun Well actually I don’t think you need any of that due to unified memory. Quantizing the Lora example in mlx should work out of the box. Haven’t tried it yet but I don’t see why not. Tweet added by Angelos Katharopoulos @angeloskath

Angelos Katharopoulos

6 months

@demirbasayyuce @awnihannun Well actually I don’t think you need any of that due to unified memory. Quantizing the Lora example in mlx should work out of the box. Haven’t tried it yet but I don’t see why not.