Namespace: LLama
The quantizer to quantize the model.
public static class LLamaQuantizer
Inheritance Object → LLamaQuantizer
Quantize the model.
public static bool Quantize(string srcFileName, string dstFilename, LLamaFtype ftype, int nthread)
srcFileName String
The model file to be quantized.
dstFilename String
The path to save the quantized model.
ftype LLamaFtype
The type of quantization.
nthread Int32
Thread to be used during the quantization. By default it's the physical core number.
Boolean
Whether the quantization is successful.
Quantize the model.
public static bool Quantize(string srcFileName, string dstFilename, string ftype, int nthread)
srcFileName String
The model file to be quantized.
dstFilename String
The path to save the quantized model.
ftype String
The type of quantization.
nthread Int32
Thread to be used during the quantization. By default it's the physical core number.
Boolean
Whether the quantization is successful.