Rinne
609e2968ad
Merge branch 'master' of github.com:SciSharp/LLamaSharp into preview
1 year ago
Martin Evans
c298ab828a
Merge pull request #368 from martindevans/context_set_seed
Context Set Seed
1 year ago
Martin Evans
dc8e5d88f7
Update LLama/LLamaContext.cs
1 year ago
Martin Evans
a3177ab140
Merge pull request #369 from martindevans/rename_llama_sample_temperature
Renamed `llama_sample_temperature` to `llama_sample_temp`
1 year ago
Martin Evans
cbc4c8d9af
Merge pull request #370 from martindevans/copy_grammar
Clone Grammar
1 year ago
Martin Evans
db7ecf5a43
Added a method to create a clone of a grammar instance
1 year ago
Martin Evans
ea523d2e2a
Renamed `llama_sample_temperature` to `llama_sample_temp`, mirroring the same change made in llama.cpp
1 year ago
Martin Evans
2df3e7617e
Added a method to set the RNG seed on the context
1 year ago
Martin Evans
8494fd5769
Merge pull request #367 from martindevans/fix_quantisation_pure_field
Added missing field to LLamaModelQuantizeParams
1 year ago
Martin Evans
cedef5e45a
Added the `pure` field to `LLamaModelQuantizeParams` (it's been added to llama.cpp)
1 year ago
Martin Evans
6650e683f4
Merge pull request #361 from martindevans/update_binaries_dec_2023
Updated Binaries December 2023
1 year ago
Rinne
b79387fd76
Merge pull request #365 from Onkitova/preview
feat: using CUDA while decoupling from the CUDA Toolkit as a hard-dependency
1 year ago
Martin Evans
199b4418c3
Fixed JSON serialization for ModelParams
1 year ago
Martin Evans
6128c18824
Merge pull request #366 from asmirnov82/fix_code_in_readme
Fix documentation to reflect changes in ChatSession API
1 year ago
Aleksei Smirnov
8558d109b4
Fix documentation according to the changes in newer versions
1 year ago
Martin Evans
2f0deeadcd
Implemented serialization for `MetadataOverride`. Deserialization is broken (converter is never called)
1 year ago
AlTonkas
4f1bda18b6
Using CUDA while decoupling from the CUDA Toolkit as a hard-dependency
Possible solution for https://github.com/SciSharp/LLamaSharp/issues/350
Adds an alternative fallback method for detecting the CUDA version supported by the system, so that installing the CUDA Toolkit becomes optional. It uses the output of the command-line tool "nvidia-smi" (preinstalled with the NVIDIA drivers), which also reports the CUDA version supported on the system.
I can only confirm that it works on Windows, but I suppose a similar approach could be used for Linux and macOS as well. I did not touch the code for those two platforms, however.
After that, CUDA can be used simply by placing the NVIDIA libraries from the "bin-win-cublas-cu12.2.0-x64.zip" asset of the original llama.cpp repo in the root folder of the built program, for example: "\LLama.Examples\bin\Debug\net8.0\".
1 year ago
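The fallback described in the commit above (reading the supported CUDA version from the "nvidia-smi" output) can be sketched roughly as follows. This is an illustrative Python sketch, not the C# code from the pull request: the function names `parse_cuda_version` and `detect_cuda_version` are hypothetical, and it assumes the tool's banner contains a "CUDA Version: X.Y" field, as current NVIDIA drivers print.

```python
import re
import subprocess
from typing import Optional, Tuple

# Assumption: the nvidia-smi banner contains a field like "CUDA Version: 12.2".
_CUDA_VERSION = re.compile(r"CUDA Version:\s*(\d+)\.(\d+)")


def parse_cuda_version(smi_output: str) -> Optional[Tuple[int, int]]:
    """Return (major, minor) parsed from nvidia-smi text, or None if absent."""
    match = _CUDA_VERSION.search(smi_output)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def detect_cuda_version() -> Optional[Tuple[int, int]]:
    """Run nvidia-smi and parse its output; None if the tool is unavailable."""
    try:
        result = subprocess.run(
            ["nvidia-smi"],
            capture_output=True,
            text=True,
            timeout=10,
            check=False,
        )
    except (OSError, subprocess.TimeoutExpired):
        # No NVIDIA driver installed, or the tool did not respond in time.
        return None
    return parse_cuda_version(result.stdout)
```

A caller would use the major version to choose which native library build to load, and fall back to a CPU build when `detect_cuda_version()` returns `None`.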
Martin Evans
b868b056f7
Added metadata overrides to `IModelParams`
1 year ago
Martin Evans
b22d8b7495
- Added `GroupDisposable` to dispose a collection of items all together
- Renamed `LLamaModelKvOverride` to `LLamaModelMetadataOverride`
1 year ago
Martin Evans
5ad2cd1d3c
Added a comment on the type itself
1 year ago
Martin Evans
b0270b5788
Added comments on GGMLType
1 year ago
Martin Evans
b3e576608b
fixed safe handle
1 year ago
Martin Evans
bab6b65b61
Added a safe handle for LLamaKvCacheView
1 year ago
Martin Evans
439d14a061
Updated binaries:
- build run: https://github.com/SciSharp/LLamaSharp/actions/runs/7196891440
- commit: 9fb13f9584
1 year ago
Martin Evans
9813a30193
Merge pull request #358 from xbotter/unittest-net8
Upgrade unittest target framework to .NET 8.0
1 year ago
xbotter
340bbbcf48
Move JSON converter for TensorSplitsCollection
1 year ago
Martin Evans
728f7077d8
Merge pull request #360 from martindevans/macos_avx
Added AVX and AVX2 to MacOS x86_64 builds
1 year ago
Martin Evans
0b8422ea7f
Added AVX and AVX2 to MacOS x86_64 builds
1 year ago
Martin Evans
01c7f1b4da
Update LLama/Common/ModelParams.cs
1 year ago
Martin Evans
e6148c952e
Fixed encoding of `Encoding`
1 year ago
xbotter
df66d7e0c6
Upgrade unittest target framework to .net8
1 year ago
Martin Evans
d87d654a34
Merge pull request #348 from martindevans/new_object_based_sampling_pipeline
Custom Sampling Pipelines
1 year ago
Martin Evans
50c1b2d52d
Merge pull request #344 from philippjbauer/master
Rebuilt ChatSession class
1 year ago
Martin Evans
8a2923a7bc
Merge pull request #352 from SciSharp/dependabot/nuget/xunit-2.6.3
build(deps): bump xunit from 2.6.2 to 2.6.3
1 year ago
Rinne
fb75e06293
fix: output prefix of Chinese example.
1 year ago
dependabot[bot]
85dc43dde0
build(deps): bump xunit from 2.6.2 to 2.6.3
Bumps [xunit](https://github.com/xunit/xunit) from 2.6.2 to 2.6.3.
- [Commits](https://github.com/xunit/xunit/compare/2.6.2...2.6.3)
---
updated-dependencies:
- dependency-name: xunit
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Martin Evans
ec6d0e8778
Merge pull request #353 from SciSharp/dependabot/nuget/xunit.runner.visualstudio-2.5.5
build(deps): bump xunit.runner.visualstudio from 2.5.4 to 2.5.5
1 year ago
Rinne
836f071cd0
fix: Chinese example.
1 year ago
dependabot[bot]
8fb4476813
build(deps): bump xunit.runner.visualstudio from 2.5.4 to 2.5.5
Bumps [xunit.runner.visualstudio](https://github.com/xunit/visualstudio.xunit) from 2.5.4 to 2.5.5.
- [Release notes](https://github.com/xunit/visualstudio.xunit/releases)
- [Commits](https://github.com/xunit/visualstudio.xunit/compare/2.5.4...2.5.5)
---
updated-dependencies:
- dependency-name: xunit.runner.visualstudio
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Philipp Bauer
29c5c6e93c
Update the StatefulChatService to use new ChatSession integration
1 year ago
Philipp Bauer
f669a4f5a7
Update the Chinese chat sample to use new ChatSession integration
1 year ago
Philipp Bauer
2cc01efdae
Merge branch 'SciSharp:master' into master
1 year ago
Rinne
3019da9d22
Merge pull request #341 from xbotter/sk/chat-stateless
🔧 Refactor Semantic Kernel chat completion implementation
1 year ago
Martin Evans
835958398c
- Removed the object wrappers and configurable pipeline, they can be better written in code.
- Added BaseSamplingPipeline which provides a base impl of `ISamplingPipeline`
- Added `DefaultSamplingPipeline` which mimics normal llama.cpp sampling
1 year ago
Martin Evans
3afc007499
- Added "protected" logits, instead of the awkward save/load mechanism
- Added an example usage to one of the tests
1 year ago
Martin Evans
b34f72a883
- Added `SamplingPipeline` to inference params which overrides all other options with an entirely custom pipeline.
- Added a `Sample` method to `LLamaContext` which uses a custom pipeline
- Modified all executors to use the custom pipeline if it exists
1 year ago
Martin Evans
4fc743c9ba
Merge branch 'master' into master
1 year ago
xbotter
22a775e1b0
Merge branch 'SciSharp:master' into sk/chat-stateless
1 year ago
Philipp Bauer
422605d980
Re-add ChatSession examples
2 years ago
Philipp Bauer
73d1725954
Modified / updated ChatSession examples
2 years ago