Martin Evans
d7f971fc22
Improved `NativeApi` file a bit:
- Added some more comments
- Modified `llama_tokenize` to not allocate
- Modified `llama_tokenize_native` to take a pointer instead of an array, allowing use with no allocations
- Removed GgmlInitParams (not used)
2 years ago
Martin Evans
841cf88e3b
Merge pull request #96 from martindevans/minor_quantizer_improvements
Minor quantizer improvements
2 years ago
Martin Evans
ce325b49c7
Rewritten comments
2 years ago
Martin Evans
b69f4bc40e
- Expanded range of supported types in quantizer to match llama.cpp
- Rewritten `LLamaFtype` parsing to support any substring which uniquely matches a single enum variant
2 years ago
sa_ddam213
a67ea36dd9
Typo and formatting
2 years ago
sa_ddam213
726987b761
Add native logging output
2 years ago
Martin Evans
acd91341e6
Added lots of comments to all the LLamaFtype variants
2 years ago
Yaohui Liu
ee2a5f064e
Merge branch 'master' of github.com:SciSharp/LLamaSharp into rinne-dev
2 years ago
Yaohui Liu
3a1daa98a3
feat: add the api to get the embedding length of the model.
2 years ago
Martin Evans
270c6d55ef
Merge pull request #88 from martindevans/fix_serialization_nan
Fix serialization error due to NaN
2 years ago
Martin Evans
91bcefc852
comment on IModelParamsExtensions
2 years ago
Martin Evans
9cdc72aa67
Fixed `ToLlamaContextParams` using the wrong parameter for `use_mmap`
2 years ago
Martin Evans
bab3b46f0c
Merge pull request #82 from martindevans/tokenization_cleanup
Utils Cleanup
2 years ago
Martin Evans
b5de3ee5aa
Fixed some final mentions of "mirostate" instead of "mirostat"
2 years ago
Martin Evans
be52737488
Using a nullable float instead of NaN, this should fix the serialization issue reported in #85
2 years ago
sa_ddam213
2d1269cae9
Access to IModelParamsExtensions
2 years ago
Martin Evans
1fceeaf352
Applied fix from #84 (antiprompt does not work in stateless executor)
2 years ago
Yaohui Liu
d609b0e1d5
Merge branch 'master' of github.com:SciSharp/LLamaSharp into rinne-dev
2 years ago
Yaohui Liu
b60c8bd285
fix: antiprompt does not work in stateless executor.
2 years ago
Martin Evans
2b2d3af26b
Moved `Eval` out of `Utils` and into `SafeLLamaContextHandle`
2 years ago
Martin Evans
7fabcc1849
One last `TokenToString` case
2 years ago
Martin Evans
0e5e00e300
Moved `TokenToString` from Utils into `SafeLLamaContextHandle` (thin wrappers around the same method in `SafeLlamaModelHandle`)
2 years ago
Martin Evans
2d811b2603
- Moved `GetLogits` into `SafeLLamaContextHandle`
- Added disposal check into `SafeLLamaContextHandle`
2 years ago
Martin Evans
cd3cf2b77d
- Moved tokenization from `Utils.Tokenize` into `SafeLLamaContextHandle.Tokenize`, one less thing in `Utils`.
- Also refactored it to return an `int[]` instead of an `IEnumerable<int>`, solving the "multiple enumeration" problems at the source!
2 years ago
Martin Evans
73882de591
Merge pull request #81 from martindevans/tensor_splits_array
Improved Tensor Splits
2 years ago
Martin Evans
bd3d8d3dc4
Cleaned up multiple enumeration in FixedSizeQueue
2 years ago
Martin Evans
f2499371ea
Pulled conversion of a `IModelParams` into a `LLamaContextParams` out into an extension method which can be used in other places.
2 years ago
Martin Evans
f1111a9f8b
Using a pin instead of a `fixed` block
2 years ago
Martin Evans
685eb3b9c2
Replaced `nint` with `float[]?` in Model params, which is much more user friendly!
2 years ago
sa_ddam213
e02d0c3617
Merge branch 'master' of https://github.com/SciSharp/LLamaSharp into upstream_master
2 years ago
Rinne
bfe9cc8961
Merge pull request #78 from SciSharp/rinne-dev
feat: update the llama backends.
2 years ago
sa_ddam213
e46646b8db
Merge branch 'master' of https://github.com/SciSharp/LLamaSharp into upstream_master
2 years ago
Yaohui Liu
bb46a990d0
fix: add bug info for native api.
2 years ago
Yaohui Liu
5fe13bd9f7
fix: update the dlls.
2 years ago
sa_ddam213
372894e1d4
Expose some native classes
2 years ago
sa_ddam213
bac9cba01a
InferenceParams abstractions
2 years ago
sa_ddam213
2a04e31b7d
ModelParams abstraction
2 years ago
Yaohui Liu
546ba28a68
fix: ci error caused by branch merge.
2 years ago
Yaohui Liu
fc17e91d1a
feat: add backend for MACOS.
2 years ago
Yaohui Liu
9fcbd16b74
Merge branch 'master' of github.com:SciSharp/LLamaSharp into rinne-dev
2 years ago
Yaohui Liu
2968125daf
feat: update the llama backends.
2 years ago
Martin Evans
fe3bd11dfa
Merge branch 'master' into master
2 years ago
Martin Evans
7ef07104e7
Added queue fix, so that CI can pass
2 years ago
SignalRT
348f2c7d72
Update llama.cpp binaries to 5f631c2 and align the context to that version
It solves the problem with netstandard2 (is it really netstandard2 a thing right now?)
Change context to solve problems.
5f631c2679
2 years ago
Rinne
075b785a4d
Merge branch 'master' into fixed_mirostate_mu
2 years ago
Rinne
c641dbdb83
Merge pull request #69 from martindevans/fixed_mirostat_spelling
Fixed Spelling Mirostate -> Mirostat
2 years ago
Rinne
8d37abd787
Merge pull request #68 from martindevans/sampling_improvements
Fixed Memory pinning in Sampling API
2 years ago
Rinne
1d29b240b2
Merge pull request #64 from martindevans/new_llama_state_loading_mechanism
Low level new loading system
2 years ago
Martin Evans
add3d5528b
Removed `MarshalAs` on array
2 years ago
Martin Evans
2245b84906
Update LLamaContextParams.cs
2 years ago