# LLamaModelQuantizeParams

Namespace: LLama.Native

Quantizer parameters used in the native API

```csharp
public struct LLamaModelQuantizeParams
```

Inheritance [Object](https://docs.microsoft.com/en-us/dotnet/api/system.object) → [ValueType](https://docs.microsoft.com/en-us/dotnet/api/system.valuetype) → [LLamaModelQuantizeParams](./llama.native.llamamodelquantizeparams.md)

## Fields

### **nthread**

Number of threads to use for quantizing. If the value is <= 0, the native code uses `std::thread::hardware_concurrency()`.

```csharp
public int nthread;
```

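The fallback rule for `nthread` can be sketched on the managed side as follows. This is only an illustration: `ResolveThreadCount` is a hypothetical helper that is not part of LLamaSharp, and `Environment.ProcessorCount` is used as an assumed .NET stand-in for `std::thread::hardware_concurrency()`. The native library applies the real fallback itself, so calling code does not need this logic.

```csharp
using System;

// Hypothetical helper (not part of LLamaSharp): mirrors the documented rule that
// a value <= 0 means "use the hardware concurrency".
// Assumption: Environment.ProcessorCount approximates std::thread::hardware_concurrency().
static int ResolveThreadCount(int nthread) =>
    nthread <= 0 ? Environment.ProcessorCount : nthread;

Console.WriteLine(ResolveThreadCount(4)); // an explicit positive value is kept
Console.WriteLine(ResolveThreadCount(0)); // falls back to the logical processor count
```

In practice, leaving `nthread` at 0 is enough to let the native code choose the thread count.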
### **ftype**

The `llama_ftype` to quantize the model to.

```csharp
public LLamaFtype ftype;
```

## Properties

### **allow_requantize**

Allow quantizing tensors that are not f32/f16.

```csharp
public bool allow_requantize { get; set; }
```

#### Property Value

[Boolean](https://docs.microsoft.com/en-us/dotnet/api/system.boolean)<br>

### **quantize_output_tensor**

Quantize the `output.weight` tensor.

```csharp
public bool quantize_output_tensor { get; set; }
```

#### Property Value

[Boolean](https://docs.microsoft.com/en-us/dotnet/api/system.boolean)<br>
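A minimal sketch of populating the struct with the members documented above. It assumes the LLamaSharp package is referenced and that the installed version of `LLamaFtype` defines a member named `LLAMA_FTYPE_MOSTLY_Q4_0`; enum member names have varied between releases, so check the version you use.

```csharp
using System;
using LLama.Native;

// Sketch only: fills in the two fields and two properties documented on this page.
var quantizeParams = new LLamaModelQuantizeParams
{
    nthread = 0,                                 // <= 0 lets the native code pick the thread count
    ftype = LLamaFtype.LLAMA_FTYPE_MOSTLY_Q4_0,  // assumed enum member name, verify against your version
    allow_requantize = false,                    // only quantize f32/f16 tensors
    quantize_output_tensor = true,               // also quantize output.weight
};

Console.WriteLine($"nthread={quantizeParams.nthread}, ftype={quantizeParams.ftype}");
```

Because `LLamaModelQuantizeParams` is a struct, assigning it to another variable copies it, so set every member before handing the value to the native API.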