sa_ddam213
a67ea36dd9
Typo and formatting
2 years ago
sa_ddam213
726987b761
Add native logging output
2 years ago
Martin Evans
270c6d55ef
Merge pull request #88 from martindevans/fix_serialization_nan
Fix serialization error due to NaN
2 years ago
Martin Evans
91bcefc852
comment on IModelParamsExtensions
2 years ago
Martin Evans
9cdc72aa67
Fixed `ToLlamaContextParams` using the wrong parameter for `use_mmap`
2 years ago
Martin Evans
bab3b46f0c
Merge pull request #82 from martindevans/tokenization_cleanup
Utils Cleanup
2 years ago
Martin Evans
b5de3ee5aa
Fixed some final mentions of "mirostate" instead of "mirostat"
2 years ago
Martin Evans
be52737488
Using a nullable float instead of NaN, this should fix the serialization issue reported in #85
2 years ago
sa_ddam213
2d1269cae9
Access to IModelParamsExtensions
2 years ago
Martin Evans
1fceeaf352
Applied fix from #84 (antiprompt does not work in stateless executor)
2 years ago
Yaohui Liu
d609b0e1d5
Merge branch 'master' of github.com:SciSharp/LLamaSharp into rinne-dev
2 years ago
Yaohui Liu
b60c8bd285
fix: antiprompt does not work in stateless executor.
2 years ago
Martin Evans
2b2d3af26b
Moved `Eval` out of `Utils` and into `SafeLLamaContextHandle`
2 years ago
Martin Evans
7fabcc1849
One last `TokenToString` case
2 years ago
Martin Evans
0e5e00e300
Moved `TokenToString` from Utils into `SafeLLamaContextHandle` (thin wrappers around the same method in `SafeLlamaModelHandle`)
2 years ago
Martin Evans
2d811b2603
- Moved `GetLogits` into `SafeLLamaContextHandle`
- Added disposal check into `SafeLLamaContextHandle`
2 years ago
Martin Evans
cd3cf2b77d
- Moved tokenization from `Utils.Tokenize` into `SafeLLamaContextHandle.Tokenize`, one less thing in `Utils`.
- Also refactored it to return an `int[]` instead of an `IEnumerable<int>`, solving the "multiple enumeration" problems at the source!
2 years ago
Martin Evans
73882de591
Merge pull request #81 from martindevans/tensor_splits_array
Improved Tensor Splits
2 years ago
Martin Evans
bd3d8d3dc4
Cleaned up multiple enumeration in FixedSizeQueue
2 years ago
Martin Evans
f2499371ea
Pulled conversion of an `IModelParams` into a `LLamaContextParams` out into an extension method which can be used in other places.
2 years ago
Martin Evans
f1111a9f8b
Using a pin instead of a `fixed` block
2 years ago
Martin Evans
685eb3b9c2
Replaced `nint` with `float[]?` in Model params, which is much more user friendly!
2 years ago
sa_ddam213
e02d0c3617
Merge branch 'master' of https://github.com/SciSharp/LLamaSharp into upstream_master
2 years ago
Rinne
bfe9cc8961
Merge pull request #78 from SciSharp/rinne-dev
feat: update the llama backends.
2 years ago
sa_ddam213
e46646b8db
Merge branch 'master' of https://github.com/SciSharp/LLamaSharp into upstream_master
2 years ago
Yaohui Liu
bb46a990d0
fix: add bug info for native api.
2 years ago
Yaohui Liu
5fe13bd9f7
fix: update the dlls.
2 years ago
sa_ddam213
372894e1d4
Expose some native classes
2 years ago
sa_ddam213
bac9cba01a
InferenceParams abstractions
2 years ago
sa_ddam213
2a04e31b7d
ModelParams abstraction
2 years ago
Yaohui Liu
546ba28a68
fix: ci error caused by branch merge.
2 years ago
Yaohui Liu
fc17e91d1a
feat: add backend for macOS.
2 years ago
Yaohui Liu
9fcbd16b74
Merge branch 'master' of github.com:SciSharp/LLamaSharp into rinne-dev
2 years ago
Yaohui Liu
2968125daf
feat: update the llama backends.
2 years ago
Martin Evans
fe3bd11dfa
Merge branch 'master' into master
2 years ago
Martin Evans
7ef07104e7
Added queue fix, so that CI can pass
2 years ago
SignalRT
348f2c7d72
Update llama.cpp binaries to 5f631c2 and align the context to that version
This solves the problem with netstandard2 (is netstandard2 really still a thing right now?)
Change context to solve problems.
5f631c2679
2 years ago
Rinne
075b785a4d
Merge branch 'master' into fixed_mirostate_mu
2 years ago
Rinne
c641dbdb83
Merge pull request #69 from martindevans/fixed_mirostat_spelling
Fixed Spelling Mirostate -> Mirostat
2 years ago
Rinne
8d37abd787
Merge pull request #68 from martindevans/sampling_improvements
Fixed Memory pinning in Sampling API
2 years ago
Rinne
1d29b240b2
Merge pull request #64 from martindevans/new_llama_state_loading_mechanism
Low level new loading system
2 years ago
Martin Evans
add3d5528b
Removed `MarshalAs` on array
2 years ago
Martin Evans
2245b84906
Update LLamaContextParams.cs
2 years ago
Martin Evans
c64507cb41
Correctly passing through mu value to mirostat instead of resetting it every time.
2 years ago
Rinne
cd015055a8
Merge branch 'master' into more_multi_enumeration_fixes
2 years ago
sa_ddam213
3e252c81f6
LLamaContextParams epsilon and tensor split changes
2 years ago
Martin Evans
36735f7908
Fixed spelling of "mirostat" instead of "mirostate"
2 years ago
Martin Evans
ec49bdd6eb
- Most importantly: Fixed issue in `SamplingApi`, `Memory` was pinned, but never unpinned!
- Moved repeated code to convert `LLamaTokenDataArray` into a `LLamaTokenDataArrayNative` into a helper method.
- Modified all call sites to dispose the `MemoryHandle`
- Saved one copy of the `List<LLamaTokenData>` into a `LLamaTokenData[]` in `LlamaModel`
2 years ago
Martin Evans
6985d3ab60
Added comments on two properties
2 years ago
Martin Evans
c974c8429e
Removed leftover `using`
2 years ago