LLamaSharp

Commit Graph

Author	SHA1	Message	Date
Martin Evans	c9c8cd0d62	- Swapped embeddings generator to use `llama_decode` - Modified `GetEmbeddings` method to be async	1 year ago
Martin Evans	3b08874bec	Merge pull request #468 from jasoncouture/clblast Add CLBLAST native library to native libraries build	1 year ago
Martin Evans	22aba9a671	Merge pull request #473 from martindevans/base_handle_removed_constructor Removed `SafeLLamaHandleBase` Constructor	1 year ago
Martin Evans	2488f74bbd	Merge pull request #472 from martindevans/remove_params_interface_set Removed `IModelParams` and `IContextParams` setters.	1 year ago
Martin Evans	5da2a2f64b	- Removed one of the constructors of `SafeLLamaHandleBase`, which implicitly states that memory is owned. Better to be explicit about this kind of thing! - Also fixed `ToString()` in `SafeLLamaHandleBase`	1 year ago
Martin Evans	9b995510d6	Removed all setters in `IModelParams` and `IContextParams`, allowing implementations to be immutable.	1 year ago
Martin Evans	f9a9aaabca	Merge pull request #471 from jasoncouture/master Fix incorrect event input variable name	1 year ago
Martin Evans	96d7d37f80	Merge pull request #469 from jasoncouture/library_name_fix Fix missing library name prefix for cuda	1 year ago
Jason Couture	52a85c35e6	Add missing CMAKE prefix	1 year ago
Jason Couture	689ddf0d08	Add missing T to defines for CLBLAST	1 year ago
Jason Couture	face505588	Fix syntax error in CLBLAST if statement	1 year ago
Jason Couture	ce5fbf0658	Copy clblast dependencies	1 year ago
Jason Couture	ad3f895eb3	Make build-deps depend on compile-clblast	1 year ago
Jason Couture	2347a7aa09	Add build job for CLBLAST	1 year ago
Jason Couture	34ca5ff6eb	Simplifiy directory creation in build-deps using bash expansion expression, combined with mkdir -p (--parents), allows us to create all of the directories at once. `deps/{avx,avx2,avx512,osx-arm64,osx-x64,cu11.7.1,cu12.1.0,clblast}` expands to: deps/avx deps/avx2 deps/avx512 deps/osx-arm64 deps/osx-x64 deps/cu11.7.1 deps/cu12.1.0 deps/clblast	1 year ago
Jason Couture	9cfbd22499	Fix github variable name I am not on my game today 🤦	1 year ago
Jason Couture	30e448d2d5	Use event input directly GHA doesn't seem to load the value into the enviroment variable first, so the workflow always runs on master.	1 year ago
Martin Evans	afa6cc0ec4	Merge pull request #470 from jasoncouture/specific_commit Checkout specific ref for llamacpp when building native libs	1 year ago
Jason Couture	64cb697bbf	Checkout specific ref for llamacpp when building native libs	1 year ago
Jason Couture	ec59c5bf9e	Fix missing library name prefix for cuda	1 year ago
Martin Evans	0f9742c6d0	Merge pull request #465 from jasoncouture/lib_naming Use llama instead of libllama in `[DllImport]`	1 year ago
Jason Couture	443ce4fff4	While the dllimport changes work, manual path searching needed to be updated	1 year ago
Jason Couture	db7e1e88f8	Use llama instead of libllama in `[DllImport]` This results in windows users not needing to rename the DLL. This allows native llama builds to be dropped in, even on windows. I also took the time to update the documentation, removing references to renaming the files, since the names now match. Fixes #463	1 year ago
Martin Evans	4cfdf064b8	Merge pull request #462 from SciSharp/dependabot/nuget/System.Text.Json-8.0.1 build(deps): bump System.Text.Json from 8.0.0 to 8.0.1	1 year ago
dependabot[bot]	d8eb817bf5	build(deps): bump System.Text.Json from 8.0.0 to 8.0.1 Bumps [System.Text.Json](https://github.com/dotnet/runtime) from 8.0.0 to 8.0.1. - [Release notes](https://github.com/dotnet/runtime/releases) - [Commits](https://github.com/dotnet/runtime/compare/v8.0.0...v8.0.1) --- updated-dependencies: - dependency-name: System.Text.Json dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	1 year ago
Martin Evans	5cf481dc8e	Merge pull request #454 from martindevans/kv_cache_instance_methods kv_cache_instance_methods	1 year ago
Martin Evans	92b9bbe779	Added methods to `SafeLLamaContextHandle` for KV cache manipulation	1 year ago
Martin Evans	8dfd07f67b	Merge pull request #453 from martindevans/fix_bad_merge_nseqmax Fix Master Build Fail	1 year ago
Martin Evans	a690db5d3e	Fixed build error caused by extra unnecessary parameter	1 year ago
Martin Evans	96c26c25f5	Merge pull request #445 from martindevans/stateless_executor_llama_decode Swapped `StatelessExecutor` to use `llama_decode`!	1 year ago
Martin Evans	1bc61472a8	Merge pull request #449 from SciSharp/dependabot/nuget/xunit-2.6.6 build(deps): bump xunit from 2.6.5 to 2.6.6	1 year ago
Martin Evans	d5b3650235	Merge pull request #451 from xbotter/deps/sk_1.1.0 bump sk & km	1 year ago
xbotter	90815ae7d8	bump sk & km - bump semantic kernel to 1.1.0 - bump kernel memory to 0.26	1 year ago
dependabot[bot]	3d4c3c5509	build(deps): bump xunit from 2.6.5 to 2.6.6 Bumps [xunit](https://github.com/xunit/xunit) from 2.6.5 to 2.6.6. - [Commits](https://github.com/xunit/xunit/compare/2.6.5...2.6.6) --- updated-dependencies: - dependency-name: xunit dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	1 year ago
Martin Evans	0074320a31	Merge pull request #447 from martindevans/grow_nseqmax_batch LLamaBatch Grow n_seq_max automatically	1 year ago
Martin Evans	9fe878ae1f	- Fixed example - Growing more than double, if necessary	1 year ago
Martin Evans	9ede1bedc2	Automatically growing batch n_seq_max when exceeded. This means no parameters need to be picked when the batch is created.	1 year ago
Martin Evans	a2e29d393c	Swapped `StatelessExecutor` to use `llama_decode`! - Added `logits_i` argument to `Context.ApplyPenalty` - Added a new exception type for `llama_decode` return code	1 year ago
Martin Evans	892e841da3	Merge pull request #444 from martindevans/batched_sampling_example_cleanup Improved the BatchedDecoding demo	1 year ago
Martin Evans	5b6e82a594	Improved the BatchedDecoding demo: - using less `NativeHandle` - Using `StreamingTokenDecoder` instead of obsolete detokenize method	1 year ago
Martin Evans	250c20bd56	Merge pull request #443 from martindevans/llama_batch_self_grow LLamaBatch Automatically Grow Capacity	1 year ago
Martin Evans	99969e538e	- Removed some unused `eval` methods. - Added a `DecodeAsync` overload which runs the work in a task - Replaced some `NativeHandle` usage in `BatchedDecoding` with higher level equivalents. - Made the `LLamaBatch` grow when token capacity is exceeded, removing the need to manage token capacity externally.	1 year ago
Martin Evans	a0be27d32b	Merge pull request #442 from martindevans/managed_llama_batch Managed `LLamaBatch`	1 year ago
Martin Evans	36a9335588	Removed `LLamaBatchSafeHandle` (using unmanaged memory, created by llama.cpp) and replaced it with a fully managed `LLamaBatch`. Modified the `BatchedDecoding` example to use new managed batch.	1 year ago
Martin Evans	4b11feddef	Merge pull request #436 from SciSharp/dependabot/nuget/Microsoft.AspNetCore.OpenApi-8.0.1 build(deps): bump Microsoft.AspNetCore.OpenApi from 8.0.0 to 8.0.1	1 year ago
Martin Evans	1cb9bcd55c	Merge pull request #440 from martindevans/additional_special_string_tokenizer_tests Extra Tokenization Tests	1 year ago
Martin Evans	1472704e12	Added a test with examples of troublesome strings from 0.9.1	1 year ago
Martin Evans	73172bbaba	Merge pull request #438 from martindevans/cleanup_model_unnecessary_unsafe Model Metadata Loading Cleanup	1 year ago
Martin Evans	ce1d302e7e	Moved some native methods into `SafeLlamaModelHandle`, these methods are all wrapped in safer accessors with no extra costs so there is no need to expose them.	1 year ago
Martin Evans	4ef618012e	Merge pull request #437 from martindevans/check_model_path_exists Check Model Path Exists	1 year ago

1 2 3 4 5 ...

1118 Commits (experimental_cpp) All Branches Search

1118 Commits (experimental_cpp)

All Branches