Support for Q4_0 and other formats. #3
Comments
There are usually other formats with better perplexity and a smaller file size than Q4_0 and Q5_0. Do you really need them?
I have not heard back from you, but I have now implemented Q4_0 and Q5_0 anyway. Could you test whether it works for your use case?
Sorry, I didn't check this message box. I will test it in a few days and give you feedback. Thanks!
Should be fixed in #4
What does that pull request have to do with this issue?
Ultimately, the gguf package will handle dequantization, so all quantization types will be supported.
Hi, I noticed that some dequantization formats, such as Q4_0 and Q5_0, are not supported yet. Will you support these formats?
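For context on what supporting Q4_0 involves, here is a minimal sketch of dequantizing it in plain Python. It assumes the block layout used by ggml (32 weights per block, stored as a little-endian fp16 scale followed by 16 bytes of packed 4-bit values, each decoded as `(nibble - 8) * scale`). The function name `dequantize_q4_0` is hypothetical and is not this project's or the gguf package's API.

```python
import struct

QK4_0 = 32                      # weights per Q4_0 block
BLOCK_BYTES = 2 + QK4_0 // 2    # fp16 scale (2 bytes) + 16 bytes of packed nibbles


def dequantize_q4_0(raw: bytes) -> list:
    """Expand raw Q4_0 blocks into a flat list of floats.

    Hypothetical helper for illustration; assumes the ggml Q4_0 block layout.
    """
    if len(raw) % BLOCK_BYTES != 0:
        raise ValueError("input length is not a multiple of the Q4_0 block size")

    out = []
    for start in range(0, len(raw), BLOCK_BYTES):
        block = raw[start:start + BLOCK_BYTES]
        (scale,) = struct.unpack("<e", block[:2])  # little-endian half-precision scale
        packed = block[2:]
        # Low nibbles hold weights 0..15 of the block, high nibbles hold weights 16..31.
        low = [(b & 0x0F) - 8 for b in packed]
        high = [(b >> 4) - 8 for b in packed]
        out.extend(q * scale for q in low + high)
    return out


if __name__ == "__main__":
    # One synthetic block: scale 0.5, every byte 0x98 (low nibble 8, high nibble 9).
    block = struct.pack("<e", 0.5) + bytes([0x98] * 16)
    values = dequantize_q4_0(block)
    print(len(values), values[0], values[16])
```

Q5_0 follows the same idea but stores a fifth bit per weight in a separate 4-byte field, so it needs its own decoder.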