jlsantiago
3b2836eac4
Llava api ( #563 )
* Add llava_binaries, update all binaries to make the test
* Llava API + LlavaTest
Preliminary
* First prototype of Load + Unit Test
* Temporary run test con branch LlavaAPI
* Disable Embed test to review the rest of the test
* Restore Embedding test
* Use BatchThread to eval image embeddings
Test Threads default value to ensure it doesn´t produce problems.
* Rename test file
* Update action versions
* Test only one method, no release embeddings
* Revert "Test only one method, no release embeddings"
This reverts commit 264e176dcc .
* Correct API call
* Only test llava related functionality
* Cuda and Cblast binaries
* Restore build policy
* Changes related with code review
* Add SafeHandles
* Set overwrite to upload-artifact@v4
* Revert to upload-artifact@v3
* revert to upload-artifact@v3
1 year ago
Martin Evans
8ac1634233
Removed `llama_eval`. It is going to be completely removed in the next version of llama.cpp ( #553 )
1 year ago
Martin Evans
a0731db944
Added tests checking that memory is freed properly ( #551 )
1 year ago
dependabot[bot]
070969b23f
build(deps): bump coverlet.collector from 6.0.0 to 6.0.1 ( #540 )
Bumps [coverlet.collector](https://github.com/coverlet-coverage/coverlet ) from 6.0.0 to 6.0.1.
- [Release notes](https://github.com/coverlet-coverage/coverlet/releases )
- [Commits](https://github.com/coverlet-coverage/coverlet/compare/v6.0.0...v6.0.1 )
---
updated-dependencies:
- dependency-name: coverlet.collector
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
1 year ago
dependabot[bot]
7fbf979fe0
build(deps): bump xunit.runner.visualstudio from 2.5.6 to 2.5.7
Bumps [xunit.runner.visualstudio](https://github.com/xunit/visualstudio.xunit ) from 2.5.6 to 2.5.7.
- [Release notes](https://github.com/xunit/visualstudio.xunit/releases )
- [Commits](https://github.com/xunit/visualstudio.xunit/compare/2.5.6...2.5.7 )
---
updated-dependencies:
- dependency-name: xunit.runner.visualstudio
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
dependabot[bot]
939d288384
build(deps): bump xunit from 2.6.6 to 2.7.0
Bumps [xunit](https://github.com/xunit/xunit ) from 2.6.6 to 2.7.0.
- [Commits](https://github.com/xunit/xunit/compare/2.6.6...2.7.0 )
---
updated-dependencies:
- dependency-name: xunit
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Martin Evans
c7d0dc915a
Assorted small changes to clean up some code warnings
1 year ago
Martin Evans
d47b6afe4d
Normalizing embeddings in `LLamaEmbedder`. As is done in llama.cpp: 2891c8aa9a/examples/embedding/embedding.cpp (L92)
1 year ago
dependabot[bot]
58b6b927c6
build(deps): bump Microsoft.NET.Test.Sdk from 17.8.0 to 17.9.0
Bumps [Microsoft.NET.Test.Sdk](https://github.com/microsoft/vstest ) from 17.8.0 to 17.9.0.
- [Release notes](https://github.com/microsoft/vstest/releases )
- [Changelog](https://github.com/microsoft/vstest/blob/main/docs/releases.md )
- [Commits](https://github.com/microsoft/vstest/compare/v17.8.0...v17.9.0 )
---
updated-dependencies:
- dependency-name: Microsoft.NET.Test.Sdk
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Martin Evans
df38d73c79
Switched to `Q3_K_S` for unit test model, instead of `Q4`. This is almost 1gb smaller, and _may_ make the macos tests less flakey.
1 year ago
Martin Evans
15a98b36d8
Updated everything to work with llama.cpp ce32060198
1 year ago
Martin Evans
c9c8cd0d62
- Swapped embeddings generator to use `llama_decode`
- Modified `GetEmbeddings` method to be async
1 year ago
Martin Evans
9b995510d6
Removed all setters in `IModelParams` and `IContextParams`, allowing implementations to be immutable.
1 year ago
Martin Evans
96c26c25f5
Merge pull request #445 from martindevans/stateless_executor_llama_decode
Swapped `StatelessExecutor` to use `llama_decode`!
1 year ago
dependabot[bot]
3d4c3c5509
build(deps): bump xunit from 2.6.5 to 2.6.6
Bumps [xunit](https://github.com/xunit/xunit ) from 2.6.5 to 2.6.6.
- [Commits](https://github.com/xunit/xunit/compare/2.6.5...2.6.6 )
---
updated-dependencies:
- dependency-name: xunit
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Martin Evans
a2e29d393c
Swapped `StatelessExecutor` to use `llama_decode`!
- Added `logits_i` argument to `Context.ApplyPenalty`
- Added a new exception type for `llama_decode` return code
1 year ago
Martin Evans
99969e538e
- Removed some unused `eval` methods.
- Added a `DecodeAsync` overload which runs the work in a task
- Replaced some `NativeHandle` usage in `BatchedDecoding` with higher level equivalents.
- Made the `LLamaBatch` grow when token capacity is exceeded, removing the need to manage token capacity externally.
1 year ago
Martin Evans
1472704e12
Added a test with examples of troublesome strings from 0.9.1
1 year ago
Martin Evans
2ea2048b78
- Added a test for tokenizing just a new line (reproduce issue https://github.com/SciSharp/LLamaSharp/issues/430 )
- Properly displaying `LLamaToken`
- Removed all tokenisation code in `SafeLLamaContextHandle` - just pass it all through to the `SafeLlamaModelHandle`
- Improved `SafeLlamaModelHandle` tokenisation:
- Renting an array, for one less allocation
- Not using `&tokens[0]` to take a pointer to an array, this is redundant and doesn't work on empty arrays
1 year ago
Martin Evans
98635a0d5a
Fixed decoding of large tokens (over 16 bytes) in streaming text decoder
1 year ago
Martin Evans
402a110a3a
Merge pull request #404 from martindevans/switched_to_LLamaToken_struct
LLamaToken Struct
1 year ago
dependabot[bot]
4a6f07943c
build(deps): bump xunit from 2.6.4 to 2.6.5
Bumps [xunit](https://github.com/xunit/xunit ) from 2.6.4 to 2.6.5.
- [Commits](https://github.com/xunit/xunit/compare/2.6.4...2.6.5 )
---
updated-dependencies:
- dependency-name: xunit
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Martin Evans
eebe4cb120
Added a new test (commented out for now) which reproduces the issue reported in #394
1 year ago
Martin Evans
82727c4414
Removed collection expressions from test
1 year ago
Martin Evans
42be9b136d
Switched form using raw integers, to a `LLamaToken` struct
1 year ago
dependabot[bot]
57fdac493a
build(deps): bump xunit.runner.visualstudio from 2.5.5 to 2.5.6
Bumps [xunit.runner.visualstudio](https://github.com/xunit/visualstudio.xunit ) from 2.5.5 to 2.5.6.
- [Release notes](https://github.com/xunit/visualstudio.xunit/releases )
- [Commits](https://github.com/xunit/visualstudio.xunit/compare/2.5.5...2.5.6 )
---
updated-dependencies:
- dependency-name: xunit.runner.visualstudio
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
dependabot[bot]
58e5a84d54
build(deps): bump xunit from 2.6.3 to 2.6.4
Bumps [xunit](https://github.com/xunit/xunit ) from 2.6.3 to 2.6.4.
- [Commits](https://github.com/xunit/xunit/compare/2.6.3...2.6.4 )
---
updated-dependencies:
- dependency-name: xunit
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Martin Evans
f0d7468b22
Merge pull request #356 from xbotter/deps/sk-rc3
bump sk to 1.0.1 & km to 0.18
1 year ago
Martin Evans
a2bae178fa
Added a `Metadata` property to `LLamaWeights`
1 year ago
Martin Evans
db7ecf5a43
Added a method to create a clone of a grammar instance
1 year ago
Martin Evans
199b4418c3
Fixed JSON serialization for ModelParams
1 year ago
Martin Evans
b868b056f7
Added metadata overrides to `IModelParams`
1 year ago
xbotter
8766fb1b03
Merge branch 'deps/sk-rc3' of https://github.com/xbotter/LLamaSharp into deps/sk-rc3
1 year ago
xbotter
213b4be723
bump sk-1.0.0-rc4
1 year ago
xbotter
ce20b30e06
Merge branch 'SciSharp:master' into deps/sk-rc3
1 year ago
Martin Evans
e6148c952e
Fixed encoding of `Encoding`
1 year ago
xbotter
df66d7e0c6
Upgrade unittest target framework to .net8
1 year ago
Martin Evans
d87d654a34
Merge pull request #348 from martindevans/new_object_based_sampling_pipeline
Custom Sampling Pipelines
1 year ago
dependabot[bot]
85dc43dde0
build(deps): bump xunit from 2.6.2 to 2.6.3
Bumps [xunit](https://github.com/xunit/xunit ) from 2.6.2 to 2.6.3.
- [Commits](https://github.com/xunit/xunit/compare/2.6.2...2.6.3 )
---
updated-dependencies:
- dependency-name: xunit
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
xbotter
13a312b4ec
update sk to 1.0.0-rc3 & km to 0.18
1 year ago
dependabot[bot]
8fb4476813
build(deps): bump xunit.runner.visualstudio from 2.5.4 to 2.5.5
Bumps [xunit.runner.visualstudio](https://github.com/xunit/visualstudio.xunit ) from 2.5.4 to 2.5.5.
- [Release notes](https://github.com/xunit/visualstudio.xunit/releases )
- [Commits](https://github.com/xunit/visualstudio.xunit/compare/2.5.4...2.5.5 )
---
updated-dependencies:
- dependency-name: xunit.runner.visualstudio
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Martin Evans
835958398c
- Removed the object wrappers and configurable pipeline, they can be better written in code.
- Added BaseSamplingPipeline which provides a base impl of `ISamplingPipeline`
- Added `DefaultSamplingPipeline` which mimics normal llama.cpp sampling
1 year ago
Martin Evans
3afc007499
- Added "protected" logits, instead of the awkward save/load mechanism
- Added an example usage to one of the tests
1 year ago
dependabot[bot]
6d86219d71
build(deps): bump xunit.runner.visualstudio from 2.5.3 to 2.5.4
Bumps [xunit.runner.visualstudio](https://github.com/xunit/visualstudio.xunit ) from 2.5.3 to 2.5.4.
- [Release notes](https://github.com/xunit/visualstudio.xunit/releases )
- [Commits](https://github.com/xunit/visualstudio.xunit/compare/2.5.3...2.5.4 )
---
updated-dependencies:
- dependency-name: xunit.runner.visualstudio
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
2 years ago
Rinne
cf4edeac55
Merge pull request #315 from futzy314/fix-ai-request-settings
Added a converter similar to the Open AI one
2 years ago
Martin Evans
597188c236
Merge pull request #316 from martindevans/update_binaries_nov
November Binary Update
2 years ago
Ian Foutz
b2bf59d8d5
Unit tests added
2 years ago
dependabot[bot]
41292b4b32
build(deps): bump xunit from 2.6.1 to 2.6.2
Bumps [xunit](https://github.com/xunit/xunit ) from 2.6.1 to 2.6.2.
- [Commits](https://github.com/xunit/xunit/compare/2.6.1...2.6.2 )
---
updated-dependencies:
- dependency-name: xunit
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
2 years ago
Martin Evans
77003d763e
Added new symbols from llama.h
2 years ago
Martin Evans
48c5039054
Improved test coverage. Discovered some issues:
FixedSizeQueue:
- Enqueue would always stop one short of filling the capacity
- Fill would only _replace_ existing items. It was only used in a place where there were not existing items! Removed the method entirely.
LLamaGrammarElement:
- Converted into a `record` struct, removed all of the (now unnecessary) equality stuff.
2 years ago