LLamaSharp

Commit Graph

Author	SHA1	Message	Date
Martin Evans	3ba49754b1	Removed (marked as obsolete) prompting with a string for `Conversation`. Tokenization requires extra parameters (e.g. addBos, special) which require special considersation. For now it's better to tokenize using other tools and pass the tokens directly.	1 year ago
Rinne	6bf010d719	Merge pull request #689 from zsogitbe/master SemanticKernel: Correcting non-standard way of working with PromptExecutionSettings	1 year ago
Rinne	495177fd0f	fix: typos.	1 year ago
Rinne	98909dc2af	Merge pull request #708 from AsakusaRinne/llama3_support Add LLaMA3 chat session example.	1 year ago
Rinne	175b25d4f7	Add LLaMA3 chat session example.	1 year ago
Martin Evans	377ebf3664	- Added `LoadFromFileAsync` method for `LLavaWeights` - Fixed checking for invalid handles in `clip_model_load`	1 year ago
Martin Evans	00df7c1516	- Added `LLamaWeights.LoadFromFileAsync`. - Async loading supports cancellation through a `CancellationToken`. If loading is cancelled an `OperationCanceledException` is thrown. If it fails for another reason a `LoadWeightsFailedException` is thrown. - Updated examples to use `LoadFromFileAsync`	1 year ago
Zoli Somogyi	ab8dd0dfc7	Correcting non-standard way of working with PromptExecutionSettings The extension of PromptExecutionSettings is not only for ChatCompletion, but also for text completion and text embedding.	1 year ago
Zoli Somogyi	156d7bb463	Revert "Standardizing Image Data implementation" This reverts commit `b2423fe6e9`.	1 year ago
Zoli Somogyi	6bd269da60	Revert "Simplifying image handling" This reverts commit `f264024666`.	1 year ago
Zoli Somogyi	f264024666	Simplifying image handling	1 year ago
Zoli Somogyi	b2423fe6e9	Standardizing Image Data implementation	1 year ago
Martin Evans	ccc49eb1e0	BatchedExecutor Save/Load (#681 ) * Added the ability to save and load individual conversations in a batched executor. - New example - Added `BatchedExecutor.Load(filepath)` method - Added `Conversation.Save(filepath)` method - Added new (currently internal) `SaveState`/`LoadState` methods in LLamaContext which can stash some extra binary data in the header * Added ability to save/load a `Conversation` to an in-memory state, instead of to file. * Moved the new save/load methods out to an extension class specifically for the batched executor. * Removed unnecessary spaces	1 year ago
Martin Evans	c325ac9127	April 2024 Binary Update (#662 ) * Updated binaries, using [this build](https://github.com/SciSharp/LLamaSharp/actions/runs/8654672719/job/23733195669) for llama.cpp commit `f7001ccc5aa359fcf41bba19d1c99c3d25c9bcc7`. - Added all new functions. - Moved some functions (e.g. `SafeLlamaModelHandle` specific functions) into `SafeLlamaModelHandle.cs` - Exposed tokens on `SafeLlamaModelHandle` and `LLamaWeights` through a `Tokens` property. As new special tokens are added in the future they can be added here. - Changed all token properties to return nullable tokens, to handle some models not having some tokens. - Fixed `DefaultSamplingPipeline` to handle no newline token in some models. * Moved native methods to more specific locations. - Context specific things have been moved into `SafeLLamaContextHandle.cs` and made private - they're exposed through C# properties and methods already. - Checking that GPU layer count is zero if GPU offload is not supported. - Moved methods for creating default structs (`llama_model_quantize_default_params` and `llama_context_default_params`) into relevant structs. * Removed exception if `GpuLayerCount > 0` when GPU is not supported. * - Added low level wrapper methods for new per-sequence state load/save in `SafeLLamaContextHandle` - Added high level wrapper methods (save/load with `State` object or memory mapped file) in `LLamaContext` - Moved native methods for per-sequence state load/save into `SafeLLamaContextHandle` * Added update and defrag methods for KV cache in `SafeLLamaContextHandle` * Updated submodule to `f7001ccc5aa359fcf41bba19d1c99c3d25c9bcc7` * Passing the sequence ID when saving a single sequence state	1 year ago
jlsantiago	399e81d314	Merge pull request #664 from SignalRT/LLavaResetOnImageChange Llava Initial approach to clear images	1 year ago
Martin Evans	274ab6e578	Merge pull request #663 from martindevans/remove_example_context_size Removed `ContextSize` from most examples	1 year ago
Martin Evans	6b816dd51b	Removed context size from SpeechChat	1 year ago
SignalRT	168f697db6	Clean up and align documentation with the changes in the interface	1 year ago
SignalRT	aa11562f62	Link the llama.cpp reference about reset llava contex	1 year ago
SignalRT	d6890e4ec4	Initial approach to clear images	1 year ago
Martin Evans	64db478578	Removed `ContextSize` from most examples. If it's not set it's retrieved from the model, which is usually what you want!	1 year ago
jlsantiago	8dd9101f8d	Merge pull request #653 from zsogitbe/master Extension LLava with in memory images	1 year ago
Lyrcaxis	b66b49de58	typo fix	1 year ago
Lyrcaxis	c3cddcfafb	Better Title for the SpeechChat example	1 year ago
Zoli Somogyi	f4fad825c7	Simplifying image handling	1 year ago
Lyrcaxis	e9bc6b6726	cr x	1 year ago
Lyrcaxis	8316c2c3c0	addressed change requests	1 year ago
Lyrcaxis	8c94659dbc	naming adjustments & beam sampling	1 year ago
Lyrcaxis	c86d4b9aba	spaces vs tabs	1 year ago
Lyrcaxis	417ed94a46	Example with GPU support	1 year ago
Zoli Somogyi	e991e631f9	Standardizing Image Data implementation	1 year ago
Lyrcaxis	469ec0d68a	minor fixup	1 year ago
Lyrcaxis	9e513204db	Added Whisper.net x LLamaSharp examples for Speech Detection and Speech Chat	1 year ago
Rinne	045850819e	Merge pull request #647 from AsakusaRinne/fix_llava_backend fix: add cuda llava native libraries.	1 year ago
Martin Evans	58107bb5b9	Logging interceptor (#649 ) * - Added `NativeLogConfig` which allows overriding the llama.cpp log callback - Delaying binding of this into llama.cpp until after `NativeLibraryConfig` has loaded * Using the log callback to show loading log messages during loading. * Registering log callbacks before any calls to llama.cpp except `llama_empty_call`, this is specifically selected to be a method that does nothing and is just there for triggering DLL loading. * - Removed much of the complexity of logging from `NativeApi.Load`. It always call whatever log callbacks you have registered. - Removed alternative path for `ILogger` in NativeLibraryConfig, instead it redirects to wrapping it in a delegate. * Saving a GC handle to keep the log callback alive * Removed prefix, logger should already do that. * Buffering up messages until a newline is encountered before passing log message to ILogger. * - Added trailing `\n` to log messages from loading. - Using `ThreadLocal<StringBuilder>` to ensure messages from separate threads don't get mixed together.	1 year ago
Rinne	ec8f832365	fix: add cuda llava native libraries.	1 year ago
Rinne	b9444452eb	docs: refactor the documentations.	1 year ago
SignalRT	bc487decae	Delete default prompt	1 year ago
SignalRT	43677c511c	Change interface to support multiple images and add the capabitlity to render the image in the console	1 year ago
SignalRT	e8732efadd	Example InteractiveExecutor Add an Example and modifications to the interactive executor to enable Llava Models. Just a preview / demo	1 year ago
Rinne	b677cdc6a3	Merge pull request #560 from eublefar/feature/chat-session-state-management Chat session state management	1 year ago
Martin Evans	e2705be6c8	Fixed off by one error in LLamaBatch sampling position (#626 )	1 year ago
eublefar	9440f153da	Make process message method more flexible	1 year ago
Martin Evans	ad682fbebd	`BatchedExecutor.Create()` method (#613 ) Replaced `BatchedExecutor.Prompt(string)` method with `BatchedExecutor.Create()` method. This improves the API in two ways: - A conversation can be created, without immediately prompting it - Other prompting overloads (e.g. prompt with token list) can be used without duplicating all the overloads onto `BatchedExecutor` Added `BatchSize` property to `LLamaContext`	1 year ago
Rinne	e3ecc318ff	Merge pull request #612 from xbotter/deps/sk-1.6.2 Update Semantic Kernel & Kernel Memory Package	1 year ago
Martin Evans	024787225b	`SetDllImportResolver` based loading (#603 ) - Modified library loading to be based on `SetDllImportResolver`. This replaces the built in loading system and ensures there can't be two libraries loaded at once. - llava and llama are loaded separately, as needed. - All the previous loading logic is still used, within the `SetDllImportResolver` - Split out CUDA, AVX and MacOS paths to separate helper methods. - `Description` now specifies if it is for `llama` or `llava`	1 year ago
eublefar	a31391edd7	Polymorphic serialization for executor state and transforms	1 year ago
xbotter	3f2e5c27ff	🔧 Update package references - Update Microsoft.KernelMemory.Core to version 0.34.240313.1 - Update Microsoft.SemanticKernel to version 1.6.2 - Update Microsoft.SemanticKernel.Plugins.Memory to version 1.6.2-alpha - Update Microsoft.KernelMemory.Abstractions to version 0.34.240313.1 - Update Microsoft.SemanticKernel.Abstractions to version 1.6.2	1 year ago
Martin Evans	f0b0bbcbb7	Mutable Logits (#586 ) Modified LLamaBatch to not share tokens with other sequences if logits is true. This ensures that the logit span at the end in used by exactly one sequence - therefore it's safe to mutate. This removes the need for copying _very_ large arrays (vocab size) and simplifies sampling pipelines.	1 year ago
dependabot[bot]	6f03d5ac5c	build(deps): bump Microsoft.SemanticKernel and Microsoft.SemanticKernel.Abstractions (#572 ) Bumps [Microsoft.SemanticKernel](https://github.com/microsoft/semantic-kernel) and [Microsoft.SemanticKernel.Abstractions](https://github.com/microsoft/semantic-kernel). These dependencies needed to be updated together. Updates `Microsoft.SemanticKernel` from 1.4.0 to 1.5.0 - [Release notes](https://github.com/microsoft/semantic-kernel/releases) - [Commits](https://github.com/microsoft/semantic-kernel/compare/dotnet-1.4.0...dotnet-1.5.0) Updates `Microsoft.SemanticKernel.Abstractions` from 1.4.0 to 1.5.0 - [Release notes](https://github.com/microsoft/semantic-kernel/releases) - [Commits](https://github.com/microsoft/semantic-kernel/compare/dotnet-1.4.0...dotnet-1.5.0) --- updated-dependencies: - dependency-name: Microsoft.SemanticKernel dependency-type: direct:production update-type: version-update:semver-minor - dependency-name: Microsoft.SemanticKernel.Abstractions dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	1 year ago

1 2 3 4

192 Commits (3d76ef7b6ab554277bd44df4857a3b2014758fb7)