LLamaSharp

Commit Graph

Author	SHA1	Message	Date
Rinne	ec8f832365	fix: add cuda llava native libraries.	1 year ago
Rinne	b9444452eb	docs: refactor the documentations.	1 year ago
SignalRT	bc487decae	Delete default prompt	1 year ago
SignalRT	43677c511c	Change interface to support multiple images and add the capability to render the image in the console	1 year ago
SignalRT	e8732efadd	Example InteractiveExecutor Add an Example and modifications to the interactive executor to enable Llava Models. Just a preview / demo	1 year ago
Rinne	b677cdc6a3	Merge pull request #560 from eublefar/feature/chat-session-state-management Chat session state management	1 year ago
Martin Evans	e2705be6c8	Fixed off-by-one error in LLamaBatch sampling position (#626)	1 year ago
eublefar	9440f153da	Make process message method more flexible	1 year ago
Martin Evans	ad682fbebd	`BatchedExecutor.Create()` method (#613 ) Replaced `BatchedExecutor.Prompt(string)` method with `BatchedExecutor.Create()` method. This improves the API in two ways: - A conversation can be created, without immediately prompting it - Other prompting overloads (e.g. prompt with token list) can be used without duplicating all the overloads onto `BatchedExecutor` Added `BatchSize` property to `LLamaContext`	1 year ago
Rinne	e3ecc318ff	Merge pull request #612 from xbotter/deps/sk-1.6.2 Update Semantic Kernel & Kernel Memory Package	1 year ago
Martin Evans	024787225b	`SetDllImportResolver` based loading (#603 ) - Modified library loading to be based on `SetDllImportResolver`. This replaces the built in loading system and ensures there can't be two libraries loaded at once. - llava and llama are loaded separately, as needed. - All the previous loading logic is still used, within the `SetDllImportResolver` - Split out CUDA, AVX and MacOS paths to separate helper methods. - `Description` now specifies if it is for `llama` or `llava`	1 year ago
eublefar	a31391edd7	Polymorphic serialization for executor state and transforms	1 year ago
xbotter	3f2e5c27ff	🔧 Update package references - Update Microsoft.KernelMemory.Core to version 0.34.240313.1 - Update Microsoft.SemanticKernel to version 1.6.2 - Update Microsoft.SemanticKernel.Plugins.Memory to version 1.6.2-alpha - Update Microsoft.KernelMemory.Abstractions to version 0.34.240313.1 - Update Microsoft.SemanticKernel.Abstractions to version 1.6.2	1 year ago
Martin Evans	f0b0bbcbb7	Mutable Logits (#586) Modified LLamaBatch to not share tokens with other sequences if logits is true. This ensures that the logit span at the end is used by exactly one sequence, so it is safe to mutate. This removes the need for copying _very_ large arrays (vocab size) and simplifies sampling pipelines.	1 year ago
dependabot[bot]	6f03d5ac5c	build(deps): bump Microsoft.SemanticKernel and Microsoft.SemanticKernel.Abstractions (#572) Bumps [Microsoft.SemanticKernel](https://github.com/microsoft/semantic-kernel) and [Microsoft.SemanticKernel.Abstractions](https://github.com/microsoft/semantic-kernel). These dependencies needed to be updated together. Updates `Microsoft.SemanticKernel` from 1.4.0 to 1.5.0 - [Release notes](https://github.com/microsoft/semantic-kernel/releases) - [Commits](https://github.com/microsoft/semantic-kernel/compare/dotnet-1.4.0...dotnet-1.5.0) Updates `Microsoft.SemanticKernel.Abstractions` from 1.4.0 to 1.5.0 - [Release notes](https://github.com/microsoft/semantic-kernel/releases) - [Commits](https://github.com/microsoft/semantic-kernel/compare/dotnet-1.4.0...dotnet-1.5.0)	1 year ago
eublefar	0763f307ec	Example chat session with preprocessing of chat history and reset operation that resets chat to original point of history without extra processing	1 year ago
Martin Evans	7d84625a67	Classifier Free Guidance (#536 ) * Added a `Guidance` method to `LLamaTokenDataArray` which applies classifier free guidance * Factored out a safer `llama_sample_apply_guidance` method based on spans * Created a guided sampling demo using the batched executor * fixed comment, "classifier free" not "context free" * Rebased onto master and fixed breakage due to changes in `BaseSamplingPipeline` * Asking user for guidance weight * Progress bar in batched fork demo * Improved fork example (using tree display) * Added proper disposal of resources in batched examples * Added some more comments in BatchedExecutorGuidance	1 year ago
dependabot[bot]	364259aabe	build(deps): bump Microsoft.SemanticKernel from 1.1.0 to 1.4.0 (#544) Bumps [Microsoft.SemanticKernel](https://github.com/microsoft/semantic-kernel) from 1.1.0 to 1.4.0. - [Release notes](https://github.com/microsoft/semantic-kernel/releases) - [Commits](https://github.com/microsoft/semantic-kernel/compare/dotnet-1.1.0...dotnet-1.4.0)	1 year ago
dependabot[bot]	e50f30d740	build(deps): bump Microsoft.KernelMemory.Core, System.Text.Json and Microsoft.KernelMemory.Abstractions (#546) Bumps [Microsoft.KernelMemory.Core](https://github.com/microsoft/kernel-memory), [System.Text.Json](https://github.com/dotnet/runtime) and [Microsoft.KernelMemory.Abstractions](https://github.com/microsoft/kernel-memory). These dependencies needed to be updated together. Updates `Microsoft.KernelMemory.Core` from 0.26.240121.1 to 0.29.240219.2 - [Release notes](https://github.com/microsoft/kernel-memory/releases) - [Commits](https://github.com/microsoft/kernel-memory/compare/packages-0.26.240121.1...packages-0.29.240219.2) Updates `System.Text.Json` from 8.0.1 to 8.0.2 - [Release notes](https://github.com/dotnet/runtime/releases) - [Commits](https://github.com/dotnet/runtime/compare/v8.0.1...v8.0.2) Updates `Microsoft.KernelMemory.Abstractions` from 0.26.240104.1 to 0.29.240219.3 - [Release notes](https://github.com/microsoft/kernel-memory/releases) - [Commits](https://github.com/microsoft/kernel-memory/compare/0.26.240104.1...abstractions-0.29.240219.3)	1 year ago
Martin Evans	91a7967869	`ReadOnlySpan<float>` in ISamplingPipeline (#538 ) * - Modified ISamplingPipeline to accept `ReadOnlySpan<float>` of logits directly. This moves responsibility to copy the logits into the pipeline. - Added a flag to `BaseSamplingPipeline` indicating if a logit copy is necessary. Skipping it in most cases. * Fixed `RestoreProtectedTokens` not working if logit processing is skipped * - Implemented a new greedy sampling pipeline (always sample most likely token) - Moved `Grammar` into `BaseSamplingPipeline` - Removed "protected tokens" concept from `BaseSamplingPipeline`. Was introducing a lot of incidental complexity. - Implemented newline logit save/restore in `DefaultSamplingPipeline` (only place protected tokens was used) * Implemented pipelines for mirostat v1 and v2	1 year ago
Martin Evans	74a39188a2	Used `AnsiConsole` in a few more places (#534): - UserSettings, simplifying the validation/re-ask loop down to one call - Program, adding colour to figlet title - Batched examples, showing default prompt - ExampleRunner, resetting state after running an example	1 year ago
Scott W Harden	91ca9d2732	LLamaSharp.Examples: Document Q&A with local storage (#532 ) * LLama.Examples: disable console logging * LLama.Examples: rename titles to signal grouped topics * LLama.Examples: add additional PDF for Q&A * LLama.Examples: improve kernel memory demo multi-document ingestion * LLama.Examples: improve message before resetting to main menu * LLama.Examples: document Q&A with local memory	1 year ago
Scott W Harden	a6394001a1	NativeLibraryConfig: WithLogs(LLamaLogLevel) (#529) Adds a NativeLibraryConfig.WithLogs() overload to let the user indicate the log level (with "info" as the default)	1 year ago
Scott W Harden	06ffe3ac95	LLama.Examples: improve model path prompt (#526 ) * LLama.Examples: RepoUtils.cs → ConsoleLogger.cs * LLama.Examples: Examples/Runner.cs → ExampleRunner.cs * LLama.Examples: delete unused console logger * LLama.Examples: improve splash screen appearance the llama_empty_call() no longer shows configuration information on startup, but it will display it automatically the first time a model is engaged * LLama.Examples: Runner → ExampleRunner * LLama.Examples: improve model path prompt The last used model is stored in a config file and is re-used when a blank path is provided * LLama.Examples: NativeApi.llama_empty_call() at startup * LLama.Examples: reduce console noise when saving model path	1 year ago
Scott W Harden	efa49cc8de	Improve "embeddings" example (#525 ) * Embeddings example: set EmbeddingMode true prevents an exception from being thrown when GetEmbeddings() is called * Embeddings example: improve documentation and styling * docs: improve GetEmbeddings page If EmbeddingMode is not set to true, GetEmbeddings() throws an exception * docs: improve GetEmbeddings page The previous commit `6c9ff3158c` was inaccurate * Embeddings example: improve styling displays the example description after the model is loaded to ensure the text is on the screen at the time the prompt is first requested	1 year ago
Martin Evans	b0acecf080	Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix). Conversations can be "forked", to create a copy of a conversation at a given point. This allows e.g. prompting a conversation with a system prefix just once and then forking it again and again for each individual conversation. Conversations can also be "rewound" to an earlier state. Added two new examples, demonstrating forking and rewinding.	1 year ago
Martin Evans	92b9bbe779	Added methods to `SafeLLamaContextHandle` for KV cache manipulation	1 year ago
Martin Evans	96c26c25f5	Merge pull request #445 from martindevans/stateless_executor_llama_decode Swapped `StatelessExecutor` to use `llama_decode`!	1 year ago
xbotter	90815ae7d8	bump sk & km - bump semantic kernel to 1.1.0 - bump kernel memory to 0.26	1 year ago
Martin Evans	9fe878ae1f	- Fixed example - Growing more than double, if necessary	1 year ago
Martin Evans	a2e29d393c	Swapped `StatelessExecutor` to use `llama_decode`! - Added `logits_i` argument to `Context.ApplyPenalty` - Added a new exception type for `llama_decode` return code	1 year ago
Martin Evans	5b6e82a594	Improved the BatchedDecoding demo: - using less `NativeHandle` - Using `StreamingTokenDecoder` instead of obsolete detokenize method	1 year ago
Martin Evans	99969e538e	- Removed some unused `eval` methods. - Added a `DecodeAsync` overload which runs the work in a task - Replaced some `NativeHandle` usage in `BatchedDecoding` with higher level equivalents. - Made the `LLamaBatch` grow when token capacity is exceeded, removing the need to manage token capacity externally.	1 year ago
Martin Evans	36a9335588	Removed `LLamaBatchSafeHandle` (using unmanaged memory, created by llama.cpp) and replaced it with a fully managed `LLamaBatch`. Modified the `BatchedDecoding` example to use new managed batch.	1 year ago
Martin Evans	42be9b136d	Switched from using raw integers to a `LLamaToken` struct	1 year ago
Martin Evans	a408335c44	Fixed broken build on master (just removing a namespace that no longer exists)	1 year ago
dependabot[bot]	f02b0500b5	build(deps): bump Microsoft.KernelMemory.Core and Microsoft.KernelMemory.Abstractions Bumps [Microsoft.KernelMemory.Core](https://github.com/microsoft/kernel-memory) and [Microsoft.KernelMemory.Abstractions](https://github.com/microsoft/kernel-memory). These dependencies needed to be updated together. Updates `Microsoft.KernelMemory.Core` from 0.18.231209.1-preview to 0.24.231228.5 - [Release notes](https://github.com/microsoft/kernel-memory/releases) - [Commits](https://github.com/microsoft/kernel-memory/compare/dotnet-0.18.231209.1-preview...0.24.231228.5) Updates `Microsoft.KernelMemory.Abstractions` from 0.18.231209.1-preview to 0.24.231228.5 - [Release notes](https://github.com/microsoft/kernel-memory/releases) - [Commits](https://github.com/microsoft/kernel-memory/compare/dotnet-0.18.231209.1-preview...0.24.231228.5)	1 year ago
Martin Evans	f0d7468b22	Merge pull request #356 from xbotter/deps/sk-rc3 bump sk to 1.0.1 & km to 0.18	1 year ago
xbotter	40ac944fb5	Bump sk to 1.0.1	1 year ago
Martin Evans	b868b056f7	Added metadata overrides to `IModelParams`	1 year ago
xbotter	8766fb1b03	Merge branch 'deps/sk-rc3' of https://github.com/xbotter/LLamaSharp into deps/sk-rc3	1 year ago
xbotter	213b4be723	bump sk-1.0.0-rc4	1 year ago
xbotter	ce20b30e06	Merge branch 'SciSharp:master' into deps/sk-rc3	1 year ago
Martin Evans	bab6b65b61	Added a safe handle for LLamaKvCacheView	1 year ago
Rinne	fb75e06293	fix: output prefix of Chinese example.	1 year ago
Rinne	836f071cd0	fix: Chinese example.	1 year ago
xbotter	13a312b4ec	update sk to 1.0.0-rc3 & km to 0.18	1 year ago
Philipp Bauer	f669a4f5a7	Update the Chinese chat sample to use new ChatSession integration	1 year ago
Philipp Bauer	2cc01efdae	Merge branch 'SciSharp:master' into master	1 year ago
Martin Evans	4fc743c9ba	Merge branch 'master' into master	1 year ago

157 Commits (ec8f83236545a1989df2f75da4e1d8d0345b0407)