Skip to content

Add ability to use Q5_0, Q5_1, and IQ4_NL for quantized K cache #5551

Add ability to use Q5_0, Q5_1, and IQ4_NL for quantized K cache

Add ability to use Q5_0, Q5_1, and IQ4_NL for quantized K cache #5551

Lint

succeeded Mar 20, 2024 in 7s