Skip to content

Add ability to use Q5_0, Q5_1, and IQ4_NL for quantized K cache #9850

Add ability to use Q5_0, Q5_1, and IQ4_NL for quantized K cache

Add ability to use Q5_0, Q5_1, and IQ4_NL for quantized K cache #9850

windows-latest-cmake-cublas (12.2.0, cublas)

succeeded Mar 20, 2024 in 22m 28s