dependabot[bot]
e50f30d740
build(deps): bump Microsoft.KernelMemory.Core, System.Text.Json and Microsoft.KernelMemory.Abstractions ( #546 )
Bumps [Microsoft.KernelMemory.Core](https://github.com/microsoft/kernel-memory ), [System.Text.Json](https://github.com/dotnet/runtime ) and [Microsoft.KernelMemory.Abstractions](https://github.com/microsoft/kernel-memory ). These dependencies needed to be updated together.
Updates `Microsoft.KernelMemory.Core` from 0.26.240121.1 to 0.29.240219.2
- [Release notes](https://github.com/microsoft/kernel-memory/releases )
- [Commits](https://github.com/microsoft/kernel-memory/compare/packages-0.26.240121.1...packages-0.29.240219.2 )
Updates `System.Text.Json` from 8.0.1 to 8.0.2
- [Release notes](https://github.com/dotnet/runtime/releases )
- [Commits](https://github.com/dotnet/runtime/compare/v8.0.1...v8.0.2 )
Updates `Microsoft.KernelMemory.Abstractions` from 0.26.240104.1 to 0.29.240219.3
- [Release notes](https://github.com/microsoft/kernel-memory/releases )
- [Commits](https://github.com/microsoft/kernel-memory/compare/0.26.240104.1...abstractions-0.29.240219.3 )
---
updated-dependencies:
- dependency-name: Microsoft.KernelMemory.Core
dependency-type: direct:production
update-type: version-update:semver-minor
- dependency-name: System.Text.Json
dependency-type: direct:production
update-type: version-update:semver-patch
- dependency-name: Microsoft.KernelMemory.Abstractions
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
1 year ago
dependabot[bot]
33827a1ba8
build(deps): bump Microsoft.SemanticKernel.Abstractions ( #542 )
Bumps [Microsoft.SemanticKernel.Abstractions](https://github.com/microsoft/semantic-kernel ) from 1.1.0 to 1.4.0.
- [Release notes](https://github.com/microsoft/semantic-kernel/releases )
- [Commits](https://github.com/microsoft/semantic-kernel/compare/dotnet-1.1.0...dotnet-1.4.0 )
---
updated-dependencies:
- dependency-name: Microsoft.SemanticKernel.Abstractions
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
1 year ago
dependabot[bot]
070969b23f
build(deps): bump coverlet.collector from 6.0.0 to 6.0.1 ( #540 )
Bumps [coverlet.collector](https://github.com/coverlet-coverage/coverlet ) from 6.0.0 to 6.0.1.
- [Release notes](https://github.com/coverlet-coverage/coverlet/releases )
- [Commits](https://github.com/coverlet-coverage/coverlet/compare/v6.0.0...v6.0.1 )
---
updated-dependencies:
- dependency-name: coverlet.collector
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
1 year ago
alexhiggins732
3b5e781f1f
Add a link to the newly created Blazor Demo in the documentation, as requested. ( #539 )
1 year ago
Martin Evans
91a7967869
`ReadOnlySpan<float>` in ISamplingPipeline ( #538 )
* - Modified ISamplingPipeline to accept `ReadOnlySpan<float>` of logits directly. This moves responsibility to copy the logits into the pipeline.
- Added a flag to `BaseSamplingPipeline` indicating if a logit copy is necessary. Skipping it in most cases.
* Fixed `RestoreProtectedTokens` not working if logit processing is skipped
* - Implemented a new greedy sampling pipeline (always sample most likely token)
- Moved `Grammar` into `BaseSamplingPipeline`
- Removed "protected tokens" concept from `BaseSamplingPipeline`. Was introducing a lot of incidental complexity.
- Implemented newline logit save/restore in `DefaultSamplingPipeline` (only place protected tokens was used)
* Implemented pipelines for mirostat v1 and v2
1 year ago
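The greedy pipeline mentioned in the commit above always picks the most likely token. As a language-agnostic illustration only (this is not the LLamaSharp implementation, and the function name `greedy_sample` is hypothetical), the idea reduces to an argmax over the logits:

```python
from typing import Sequence


def greedy_sample(logits: Sequence[float]) -> int:
    """Return the index (token id) of the largest logit.

    Hypothetical helper for illustration; it reads the logits
    without copying or modifying them.
    """
    if len(logits) == 0:
        raise ValueError("logits must not be empty")
    best = 0
    for i, value in enumerate(logits):
        # Strict comparison keeps the first index on ties.
        if value > logits[best]:
            best = i
    return best


token = greedy_sample([0.1, 2.5, -1.0])
```

Because greedy selection only reads the logits, a sampler of this kind needs no private copy of them, which is consistent with the commit's note that the copy can be skipped in most cases.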
Martin Evans
74a39188a2
Used `AnsiConsole` in a few more places: ( #534 )
- UserSettings, simplifying the validation/re-ask loop down to one call
- Program, adding colour to figlet title
- Batched examples, showing default prompt
- ExampleRunner, resetting state after running an example
1 year ago
Scott W Harden
91ca9d2732
LLamaSharp.Examples: Document Q&A with local storage ( #532 )
* LLama.Examples: disable console logging
* LLama.Examples: rename titles to signal grouped topics
* LLama.Examples: add additional PDF for Q&A
* LLama.Examples: improve kernel memory demo
multi-document ingestion
* LLama.Examples: improve message before resetting to main menu
* LLama.Examples: document Q&A with local memory
1 year ago
Scott W Harden
a6394001a1
NativeLibraryConfig: WithLogs(LLamaLogLevel) ( #529 )
Adds a NativeLibraryConfig.WithLogs() overload to let the user indicate the log level (with "info" as the default)
1 year ago
Scott W Harden
06ffe3ac95
LLama.Examples: improve model path prompt ( #526 )
* LLama.Examples: RepoUtils.cs → ConsoleLogger.cs
* LLama.Examples: Examples/Runner.cs → ExampleRunner.cs
* LLama.Examples: delete unused console logger
* LLama.Examples: improve splash screen appearance
the llama_empty_call() no longer shows configuration information on startup, but it will display it automatically the first time a model is engaged
* LLama.Examples: Runner → ExampleRunner
* LLama.Examples: improve model path prompt
The last used model is stored in a config file and is re-used when a blank path is provided
* LLama.Examples: NativeApi.llama_empty_call() at startup
* LLama.Examples: reduce console noise when saving model path
1 year ago
Scott W Harden
efa49cc8de
Improve "embeddings" example ( #525 )
* Embeddings example: set EmbeddingMode true
prevents an exception from being thrown when GetEmbeddings() is called
* Embeddings example: improve documentation and styling
* docs: improve GetEmbeddings page
If EmbeddingMode is not set to true, GetEmbeddings() throws an exception
* docs: improve GetEmbeddings page
The previous commit 6c9ff3158c was inaccurate
* Embeddings example: improve styling
displays the example description after the model is loaded to ensure the text is on the screen at the time the prompt is first requested
1 year ago
Martin Evans
3d7bf4287c
Merge pull request #523 from swharden/patch-1
ChatSession: improve exception message
1 year ago
Scott W Harden
4c3077d0f0
ChatSession: improve exception message
The original message contained the word "preceeded" which should be spelled as "preceded"
1 year ago
Martin Evans
ca2b7cf4d5
Merge pull request #520 from SciSharp/dependabot/nuget/xunit.runner.visualstudio-2.5.7
build(deps): bump xunit.runner.visualstudio from 2.5.6 to 2.5.7
1 year ago
dependabot[bot]
7fbf979fe0
build(deps): bump xunit.runner.visualstudio from 2.5.6 to 2.5.7
Bumps [xunit.runner.visualstudio](https://github.com/xunit/visualstudio.xunit ) from 2.5.6 to 2.5.7.
- [Release notes](https://github.com/xunit/visualstudio.xunit/releases )
- [Commits](https://github.com/xunit/visualstudio.xunit/compare/2.5.6...2.5.7 )
---
updated-dependencies:
- dependency-name: xunit.runner.visualstudio
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Martin Evans
39fe4263e5
Merge pull request #521 from SciSharp/dependabot/nuget/Microsoft.AspNetCore.OpenApi-8.0.2
build(deps): bump Microsoft.AspNetCore.OpenApi from 8.0.1 to 8.0.2
1 year ago
Martin Evans
7a20ec6a5f
Merge pull request #522 from SciSharp/dependabot/nuget/xunit-2.7.0
build(deps): bump xunit from 2.6.6 to 2.7.0
1 year ago
dependabot[bot]
939d288384
build(deps): bump xunit from 2.6.6 to 2.7.0
Bumps [xunit](https://github.com/xunit/xunit ) from 2.6.6 to 2.7.0.
- [Commits](https://github.com/xunit/xunit/compare/2.6.6...2.7.0 )
---
updated-dependencies:
- dependency-name: xunit
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
dependabot[bot]
03ec5d9807
build(deps): bump Microsoft.AspNetCore.OpenApi from 8.0.1 to 8.0.2
Bumps [Microsoft.AspNetCore.OpenApi](https://github.com/dotnet/aspnetcore ) from 8.0.1 to 8.0.2.
- [Release notes](https://github.com/dotnet/aspnetcore/releases )
- [Changelog](https://github.com/dotnet/aspnetcore/blob/main/docs/ReleasePlanning.md )
- [Commits](https://github.com/dotnet/aspnetcore/compare/v8.0.1...v8.0.2 )
---
updated-dependencies:
- dependency-name: Microsoft.AspNetCore.OpenApi
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Martin Evans
6230350b97
Merge pull request #515 from martindevans/warnings_cleanup
Warnings Cleanup
1 year ago
Martin Evans
c7d0dc915a
Assorted small changes to clean up some code warnings
1 year ago
Martin Evans
9bc129e252
Merge pull request #512 from martindevans/updated_version
Release 0.10.0
1 year ago
Martin Evans
174f21a385
0.10.0
1 year ago
Martin Evans
633727bb73
Merge pull request #511 from martindevans/fixed_release_minor
Fixed Minor Release Script
1 year ago
Martin Evans
69a74bb053
Commented back in the line that pushes to nuget
1 year ago
Martin Evans
d03c1a9201
Merge pull request #503 from martindevans/batched_executor_again
Introduced a new `BatchedExecutor`
1 year ago
Martin Evans
968e1e464a
Merge pull request #507 from martindevans/normalize_embeddings
Normalize Embeddings
1 year ago
Martin Evans
d47b6afe4d
Normalizing embeddings in `LLamaEmbedder`, as is done in llama.cpp: 2891c8aa9a/examples/embedding/embedding.cpp (L92)
1 year ago
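For readers unfamiliar with embedding normalization, here is a minimal sketch. It assumes the normalization meant is Euclidean (L2) normalization, which the commit message does not state explicitly, and the function name `normalize` is hypothetical rather than part of `LLamaEmbedder`:

```python
import math
from typing import List, Sequence


def normalize(embedding: Sequence[float]) -> List[float]:
    """Scale a vector to unit Euclidean length (assumed L2 norm)."""
    norm = math.sqrt(sum(x * x for x in embedding))
    if norm == 0.0:
        # A zero vector has no direction; return it unchanged.
        return list(embedding)
    return [x / norm for x in embedding]


unit = normalize([3.0, 4.0])
```

With unit-length vectors, the dot product of two embeddings equals their cosine similarity, which is a common reason for normalizing before comparison.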
Martin Evans
a5eba9463f
Merge pull request #505 from SciSharp/dependabot/nuget/Microsoft.NET.Test.Sdk-17.9.0
build(deps): bump Microsoft.NET.Test.Sdk from 17.8.0 to 17.9.0
1 year ago
Martin Evans
e9d9042576
Added `Divide` to `KvAccessor`
1 year ago
Martin Evans
1cc463b9b7
Added a finalizer to `BatchedExecutor`
1 year ago
dependabot[bot]
58b6b927c6
build(deps): bump Microsoft.NET.Test.Sdk from 17.8.0 to 17.9.0
Bumps [Microsoft.NET.Test.Sdk](https://github.com/microsoft/vstest ) from 17.8.0 to 17.9.0.
- [Release notes](https://github.com/microsoft/vstest/releases )
- [Changelog](https://github.com/microsoft/vstest/blob/main/docs/releases.md )
- [Commits](https://github.com/microsoft/vstest/compare/v17.8.0...v17.9.0 )
---
updated-dependencies:
- dependency-name: Microsoft.NET.Test.Sdk
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Martin Evans
0c2cff0e1c
Added a Finalizer for `Conversation` in case it is not correctly disposed.
1 year ago
Martin Evans
949861a581
- Added a `Modify` method to `Conversation`. This grants **temporary** access to directly modify the KV cache.
- Re-implemented `Rewind` as an extension method using `Modify` internally
- Implemented `ShiftLeft`, which shifts everything over except for some starting tokens. This is the same as the `StatelessExecutor` out-of-context handling.
- Starting the batch at epoch 1. This ensures that conversations (starting at zero) are below the current epoch. It also means `0` can always be used as a value guaranteed to be below the current epoch.
1 year ago
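The `ShiftLeft` behaviour described above (shift everything over except for some starting tokens) can be pictured on a plain list. This is a toy sketch under stated assumptions: the real method operates on the KV cache, and the function name `shift_left` and the parameters `keep` and `discard` are hypothetical, not the LLamaSharp signature.

```python
from typing import List, Sequence


def shift_left(tokens: Sequence[int], keep: int, discard: int) -> List[int]:
    """Drop `discard` tokens that follow the first `keep` tokens.

    The first `keep` tokens stay in place; everything after the
    discarded span moves left to fill the gap.
    """
    if keep < 0 or discard < 0 or keep + discard > len(tokens):
        raise ValueError("keep/discard out of range")
    return list(tokens[:keep]) + list(tokens[keep + discard:])


shifted = shift_left([0, 1, 2, 3, 4, 5, 6, 7], keep=2, discard=3)
```

Here the two starting tokens survive, tokens 2 to 4 are dropped, and the remainder slides left, freeing room at the end of the context.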
Martin Evans
ea12ff4e07
Merge pull request #502 from vikramvee/Examples
Updated Examples
1 year ago
Martin Evans
b0acecf080
Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix).
Conversations can be "forked", to create a copy of a conversation at a given point. This allows e.g. prompting a conversation with a system prefix just once and then forking it again and again for each individual conversation. Conversations can also be "rewound" to an earlier state.
Added two new examples, demonstrating forking and rewinding.
1 year ago
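The fork and rewind ideas in the commit above can be modelled with a toy class. This is only an illustration of the concept: `ToyConversation` and its methods are hypothetical and do not mirror the `BatchedExecutor` API. In particular, this toy copies the token list on fork, which says nothing about how the real implementation shares state.

```python
from typing import Iterable, List, Optional


class ToyConversation:
    """A conversation modelled as a growing list of token ids."""

    def __init__(self, tokens: Optional[Iterable[int]] = None) -> None:
        self.tokens: List[int] = list(tokens) if tokens is not None else []

    def prompt(self, new_tokens: Iterable[int]) -> None:
        """Append tokens to this conversation."""
        self.tokens.extend(new_tokens)

    def fork(self) -> "ToyConversation":
        """Create an independent copy sharing the history so far."""
        return ToyConversation(self.tokens)

    def rewind(self, count: int) -> None:
        """Remove the last `count` tokens, returning to an earlier state."""
        if count < 0 or count > len(self.tokens):
            raise ValueError("count out of range")
        del self.tokens[len(self.tokens) - count:]


# Prompt the shared system prefix once, then fork per conversation.
base = ToyConversation()
base.prompt([1, 2, 3])
a = base.fork()
b = base.fork()
a.prompt([10])
b.prompt([20, 21])
b.rewind(1)
```

After these steps `base` still holds only the prefix, while `a` and `b` have diverged from it independently.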
vikramvee
ebd853fede
Updated Examples
1 year ago
Martin Evans
859160d6f7
Merge pull request #501 from martindevans/LLamaPos_inc_dec
Added increment and decrement operators to `LLamaPos`
1 year ago
Martin Evans
90915c5a99
Added increment and decrement operators to `LLamaPos`
1 year ago
Martin Evans
82c471eac4
Merge pull request #500 from martindevans/improved_kv_cache_methods
Small KV Cache Handling Improvements
1 year ago
Martin Evans
a8f9262b7f
Merge pull request #499 from martindevans/better_batch_processing
Using `AddRange` in `LLamaEmbedder`
1 year ago
Martin Evans
c5146bac23
- Exposed KV debug view through `SafeLLamaContextHandle`
- Added `KvCacheSequenceDivide`
- Moved count tokens/cells methods to `SafeLLamaContextHandle`
1 year ago
Martin Evans
744758f110
Using `AddRange` in `LLamaEmbedder`
1 year ago
Martin Evans
5d80a56d11
Merge pull request #496 from martindevans/smaller_unit_test_model
Smaller Unit Test Model
1 year ago
Martin Evans
418345cbaf
limited parallelism of CI jobs
1 year ago
Martin Evans
df38d73c79
Switched to `Q3_K_S` for the unit test model, instead of `Q4`. This is almost 1 GB smaller, and _may_ make the macOS tests less flaky.
1 year ago
Martin Evans
ac7faa0f93
Merge pull request #495 from martindevans/quantise_new_formats
Added new file types to quantisation
1 year ago
Martin Evans
c7103e86e4
Added new file types to quantisation
1 year ago
Martin Evans
17385e12b6
Merge pull request #479 from martindevans/update_binaries_feb_2024
Update binaries feb 2024
1 year ago
Martin Evans
21bdecd049
Merge branch 'update_binaries_feb_2024' of github.com:martindevans/LLamaSharp into update_binaries_feb_2024
1 year ago
Martin Evans
bac40a3b7a
Added new binaries, from this run: https://github.com/SciSharp/LLamaSharp/actions/runs/7792319886
1 year ago