LLamaSharp

Commit Graph

Author	SHA1	Message	Date
Martin Evans	00df7c1516	- Added `LLamaWeights.LoadFromFileAsync`. - Async loading supports cancellation through a `CancellationToken`. If loading is cancelled an `OperationCanceledException` is thrown. If it fails for another reason a `LoadWeightsFailedException` is thrown. - Updated examples to use `LoadFromFileAsync`	1 year ago
Martin Evans	c325ac9127	April 2024 Binary Update (#662) * Updated binaries, using [this build](https://github.com/SciSharp/LLamaSharp/actions/runs/8654672719/job/23733195669) for llama.cpp commit `f7001ccc5aa359fcf41bba19d1c99c3d25c9bcc7`. - Added all new functions. - Moved some functions (e.g. `SafeLlamaModelHandle` specific functions) into `SafeLlamaModelHandle.cs` - Exposed tokens on `SafeLlamaModelHandle` and `LLamaWeights` through a `Tokens` property. As new special tokens are added in the future they can be added here. - Changed all token properties to return nullable tokens, to handle some models not having some tokens. - Fixed `DefaultSamplingPipeline` to handle no newline token in some models. * Moved native methods to more specific locations. - Context specific things have been moved into `SafeLLamaContextHandle.cs` and made private - they're exposed through C# properties and methods already. - Checking that GPU layer count is zero if GPU offload is not supported. - Moved methods for creating default structs (`llama_model_quantize_default_params` and `llama_context_default_params`) into relevant structs. * Removed exception if `GpuLayerCount > 0` when GPU is not supported. * - Added low level wrapper methods for new per-sequence state load/save in `SafeLLamaContextHandle` - Added high level wrapper methods (save/load with `State` object or memory mapped file) in `LLamaContext` - Moved native methods for per-sequence state load/save into `SafeLLamaContextHandle` * Added update and defrag methods for KV cache in `SafeLLamaContextHandle` * Updated submodule to `f7001ccc5aa359fcf41bba19d1c99c3d25c9bcc7` * Passing the sequence ID when saving a single sequence state	1 year ago
Martin Evans	a8ba9f05b3	March Binary Update (#565) * Updated binaries to llama.cpp `3ab8b3a92ede46df88bc5a2dfca3777de4a2b2b6` (build run: https://github.com/SciSharp/LLamaSharp/actions/runs/8118890586) * Added abort callback * Added properties to get/set thread count on `LLamaContext` * Fixed LLamaLogLevel numbering	1 year ago
Martin Evans	15a98b36d8	Updated everything to work with llama.cpp `ce32060198`	1 year ago
Martin Evans	439d14a061	Updated binaries: - build run: https://github.com/SciSharp/LLamaSharp/actions/runs/7196891440 - commit: `9fb13f9584`	1 year ago
Martin Evans	89fef05362	This commit (`5fe721bdbe`) accidentally removed a load of stuff that it shouldn't. Fixed that. Originally from these PRs: - https://github.com/SciSharp/LLamaSharp/pull/263 - https://github.com/SciSharp/LLamaSharp/pull/259	2 years ago
SignalRT	97006a214f	Merge remote-tracking branch 'upstream/master' into RuntimeDetection	2 years ago
Martin Evans	31244ae691	Merge branch 'master' into YaRN_scaling_parameters	2 years ago
SignalRT	5fe721bdbe	Revert "Merge branch 'pr/268' into RuntimeDetection" This reverts commit 091b8d58b3502a99b3bfbec9db457c92cc736beb, reversing changes made to `9b2ca9cf8e`.	2 years ago
Martin Evans	db1bc741b0	Modified `ContextSize` in parameters to be nullable. A null value means autodetect from the model.	2 years ago
Martin Evans	04ee64a6be	Exposed YaRN scaling parameters in IContextParams	2 years ago
SignalRT	46fb472d42	Align with llama.cpp b1488	2 years ago
Martin Evans	bca55eace0	Initial changes to match the llama.cpp changes	2 years ago
Martin Evans	2056078aef	Initial changes required for GGUF support	2 years ago
Martin Evans	a911b77dec	Various minor changes, resolving about 100 ReSharper code quality warnings	2 years ago
zombieguy	45b01d5a78	Improved type conversion Type conversion is now done in the property rather than the utils class and uses the System.Convert class to ensure consistency.	2 years ago
Martin Evans	2830e5755c	- Applied a lot of minor R# code quality suggestions. Lots of unnecessary imports removed. - Deleted `NativeInfo` (internal class, not used anywhere)	2 years ago
zombieguy	10f88ebd0e	Potential fix for .Net Framework issues (#103) * Added a bool to sbyte Utils convertor As an attempt to avoid using any MarshalAs attribute for .Net Framework support this Utils method will take in a bool value and return a 1 for true or 0 for false sbyte. * Changed all bool "MarshalAs" types to sbytes Changed all previous BOOL types with "MarshalAs" attributes to SBYTEs and changed all the setters of them to use the Utils.BoolToSignedByte() convertor method. * Fixed Utils bool convertor & added sbyte to bool Improved the Utils bool convertor just casting an sbyte value to get rid of the unneeded sbyte array and added an sbyte to bool convertor to convert back the way to a C# bool assuming any positive value above 0 is a bool and no bools are packed in the single byte integer. * bool to & from sbyte conversions via properties All 1byte bools are now handled where they "sit", via public properties which perform the conversions to keep all external data able to communicate as it did before.	2 years ago
SignalRT	348f2c7d72	Update llama.cpp binaries to 5f631c2 and align the context to that version It solves the problem with netstandard2 (is it really netstandard2 a thing right now?) Change context to solve problems. `5f631c2679`	2 years ago
Martin Evans	add3d5528b	Removed `MarshalAs` on array	2 years ago
Martin Evans	2245b84906	Update LLamaContextParams.cs	2 years ago
sa_ddam213	3e252c81f6	LLamaContextParams epsilon and tensor split changes	2 years ago
Martin Evans	f16aa58e12	Updated to use the new loading system in llama (llama_state). This new system has split model weights and contexts into two separate things, allowing one set of weights to be shared between many contexts. This change _only_ implements the low level API and makes no effort to update the LlamaSharp higher level abstraction. It is built upon llama `b3f138d`, necessary DLLs are not included in this commit.	2 years ago
Yaohui Liu	9850417a12	feat: update quantize native params.	2 years ago
Yaohui Liu	18c2ff2395	refactor: instruct mode and examples.	2 years ago
Yaohui Liu	1fca06dc7f	fix: n_gpu_layers miss in llama context.	2 years ago
Yaohui Liu	5a79edeb51	feat: add the framework and basic usages.	2 years ago

27 Commits (a0335f67a4cd42a25f6b8920e43d35f0124b15e8)