Martin Evans
|
a8ba9f05b3
|
March Binary Update (#565)
* Updated binaries to llama.cpp `3ab8b3a92ede46df88bc5a2dfca3777de4a2b2b6` (build run: https://github.com/SciSharp/LLamaSharp/actions/runs/8118890586)
* Added abort callback
* Added properties to get/set thread count on `LLamaContext`
* Fixed LLamaLogLevel numbering
|
1 year ago |
Martin Evans
|
c7103e86e4
|
Added new file types to quantisation
|
1 year ago |
Martin Evans
|
c002642268
|
- Removed some `unsafe` where it wasn't necessary
- Wrapped some native functions which take (pointer, length) in function which take a `span` instead.
|
1 year ago |
Martin Evans
|
9daf586ba8
|
Assorted cleanup leftover after the huge change in the last PR (comments, syntax style, etc)
|
2 years ago |
Martin Evans
|
b69f4bc40e
|
- Expanded range of supported types in quantizer to match llama.cpp
- Rewritten `LLamaFtype` parsing to support any substring which uniquely matches a single enum variant
|
2 years ago |
Martin Evans
|
2e76b79af6
|
Various minor XML docs fixes
|
2 years ago |
Yaohui Liu
|
9850417a12
|
feat: update quantize native params.
|
2 years ago |
Yaohui Liu
|
6c400e64c2
|
docs: publiash documentation 0.4.
|
2 years ago |
Yaohui Liu
|
f3565d6b2d
|
refactor: rename Quantizer to LLamaQuantizer.
|
2 years ago |