Martin Evans
98635a0d5a
Fixed decoding of large tokens (over 16 bytes) in streaming text decoder
1 year ago
Martin Evans
402a110a3a
Merge pull request #404 from martindevans/switched_to_LLamaToken_struct
LLamaToken Struct
1 year ago
Martin Evans
1e69e265b6
Moved some native methods to do with creating/destroying resources into their respective handles. There is **no** safe way to call most of these methods; everything must be done through handles.
1 year ago
Martin Evans
82727c4414
Removed collection expressions from test
1 year ago
Martin Evans
2eb52b1630
made casts to/from int explicit, fixed places affected
1 year ago
Martin Evans
42be9b136d
Switched from using raw integers to a `LLamaToken` struct
1 year ago
Martin Evans
4e5e994dda
- directly returning a SafeLlamaModelHandle, instead of an IntPtr which is wrapped in a handle.
- made `llama_backend_init` private. This is called automatically; there is no way it can be used correctly externally.
- made `llama_token_to_piece` safe (Span instead of pointer)
1 year ago
Martin Evans
bac3e43498
Fixed handling of empty spans
1 year ago
Martin Evans
c002642268
- Removed some `unsafe` where it wasn't necessary
- Wrapped some native functions which take (pointer, length) in functions which take a `span` instead.
1 year ago
Martin Evans
f860f88c36
Code cleanup driven by R# suggestions:
- Made `NativeApi` into a `static class` (it's not intended to be instantiated)
- Moved `LLamaTokenType` enum out into a separate file
- Made `LLamaSeqId` and `LLamaPos` into `record struct`, convenient to have equality etc
1 year ago
Martin Evans
2cded1b296
Fixed alignment of value fields in `LLamaModelMetadataOverride`
1 year ago
Martin Evans
6be3f62321
Fixed loading of very large metadata values (over 1kb)
1 year ago
Martin Evans
fb606c2488
Fixed incorrect values
1 year ago
Martin Evans
47e4fcef2a
Fixed GetString on netstandard2
1 year ago
Martin Evans
2a1e1b6183
Removed unused imports
1 year ago
Martin Evans
a2bae178fa
Added a `Metadata` property to `LLamaWeights`
1 year ago
Martin Evans
1b13f7c717
Improved support for AVX512:
- Enabled more features in build process (VBMI and VNNI)
- Added runtime checking for these features
- Improved runtime checking to no longer require dotnet8.0
1 year ago
Martin Evans
c298ab828a
Merge pull request #368 from martindevans/context_set_seed
Context Set Seed
1 year ago
Martin Evans
a3177ab140
Merge pull request #369 from martindevans/rename_llama_sample_temperature
Renamed `llama_sample_temperature` to `llama_sample_temp`
1 year ago
Martin Evans
db7ecf5a43
Added a method to create a clone of a grammar instance
1 year ago
Martin Evans
ea523d2e2a
Renamed `llama_sample_temperature` to `llama_sample_temp`, mirroring the same change made in llama.cpp
1 year ago
Martin Evans
2df3e7617e
Added a method to set the RNG seed on the context
1 year ago
Martin Evans
cedef5e45a
Added the `pure` field to `LLamaModelQuantizeParams` (it's been added to llama.cpp)
1 year ago
Martin Evans
b868b056f7
Added metadata overrides to `IModelParams`
1 year ago
Martin Evans
b22d8b7495
- Added `GroupDisposable` to dispose a collection of items all together
- Renamed `LLamaModelKvOverride` to `LLamaModelMetadataOverride`
1 year ago
Martin Evans
5ad2cd1d3c
Added a comment on the type itself
1 year ago
Martin Evans
b0270b5788
Added comments on GGMLType
1 year ago
Martin Evans
b3e576608b
fixed safe handle
1 year ago
Martin Evans
bab6b65b61
Added a safe handle for LLamaKvCacheView
1 year ago
Martin Evans
439d14a061
Updated binaries:
- build run: https://github.com/SciSharp/LLamaSharp/actions/runs/7196891440
- commit: 9fb13f9584
1 year ago
Martin Evans
835958398c
- Removed the object wrappers and configurable pipeline, they can be better written in code.
- Added BaseSamplingPipeline which provides a base impl of `ISamplingPipeline`
- Added `DefaultSamplingPipeline` which mimics normal llama.cpp sampling
1 year ago
Martin Evans
33358124db
Initial pass at a new sampling pipeline
2 years ago
Rinne
1f97ad874b
Merge pull request #333 from AsakusaRinne/master
feat: allow customized search path for native library loading.
2 years ago
Rinne
ffc347a3f3
resolve comments.
2 years ago
Rinne
b05c3154f4
feat: allow customized search path for native library loading.
2 years ago
Rinne
934358a7b3
Merge branch 'master' of github.com:AsakusaRinne/LLamaSharp into fix_chinese
2 years ago
Rinne
217c67b757
fix: Chinese encoding error.
2 years ago
Martin Evans
a3614f6747
Added `native/` back into path prefix
2 years ago
Martin Evans
77003d763e
Added new symbols from llama.h
2 years ago
Martin Evans
37466956c7
Added new binaries.
- Built by this run: https://github.com/SciSharp/LLamaSharp/actions/runs/6921572568
- commit: `e937066420b79a757bf80e9836eb12b88420a218`
- Rearranged paths
2 years ago
Martin Evans
48c5039054
Improved test coverage. Discovered some issues:
FixedSizeQueue:
- Enqueue would always stop one short of filling the capacity
- Fill would only _replace_ existing items. It was only used in a place where there were no existing items! Removed the method entirely.
LLamaGrammarElement:
- Converted into a `record` struct, removed all of the (now unnecessary) equality stuff.
2 years ago
Martin Evans
c517cc18a2
Merge pull request #304 from martindevans/obsolete_attribute_eval
Added Obsolete markings to all `Eval` overloads
2 years ago
Martin Evans
16ab33ba3c
Added Obsolete markings to all `Eval` overloads
2 years ago
Martin Evans
0e51badb38
Exposed `progress_callback` in `LLamaModelParams` (although not in higher level)
2 years ago
Martin Evans
1970023ef4
Merge pull request #292 from martindevans/dotnet8.0
dotnet8.0
2 years ago
Martin Evans
89fef05362
This commit (5fe721bdbe) accidentally removed a load of stuff that it shouldn't have. Fixed that.
Originally from these PRs:
- https://github.com/SciSharp/LLamaSharp/pull/263
- https://github.com/SciSharp/LLamaSharp/pull/259
2 years ago
Martin Evans
e9f5dbba89
Processing AVX512 branch on all dotnet versions
2 years ago
Martin Evans
e850115b5f
Added dotnet8.0 as a build target
2 years ago
Martin Evans
b44e780b0f
Merge pull request #281 from martindevans/NativeLibraryConfig_improvements
CPU Feature Detection 2
2 years ago
Martin Evans
e3468d04f0
Merge pull request #277 from martindevans/feature/min_p
MinP Sampler
2 years ago