509 Commits (experimental_cpp)

Author SHA1 Message Date
  jlsantiago 3b2836eac4
Llava api (#563) 1 year ago
  Martin Evans ce4de7d607
llama_decode lock (#595) 1 year ago
  Clovis Henrique Ribeiro d0f79814e9
Added conditional compilation code to progress_callback (in LlamaModelParams struct) so the struct plays nice with legacy .NET Framework 4.8 (#593) 1 year ago
  Martin Evans f0b0bbcbb7
Mutable Logits (#586) 1 year ago
  Martin Evans a8ba9f05b3
March Binary Update (#565) 1 year ago
  dependabot[bot] 4068a6f03b
build(deps): bump System.Text.Json from 8.0.1 to 8.0.2 1 year ago
  Martin Evans defac000ad
Added a `%(RecursiveDir)` element to the props file. This causes files to be copied along with the folder structure rather than dumped into the root. (#561) 1 year ago
  Martin Evans 8ac1634233
Removed `llama_eval`. It is going to be completely removed in the next version of llama.cpp (#553) 1 year ago
  Martin Evans f0e7e7cc0a
Removed `SamplingApi`. It has been marked as Obsolete for a while, replaced by instance methods on `LLamaTokenDataArray` (#552) 1 year ago
  Martin Evans 7d84625a67
Classifier Free Guidance (#536) 1 year ago
  Martin Evans 91a7967869
`ReadOnlySpan<float>` in ISamplingPipeline (#538) 1 year ago
  Scott W Harden a6394001a1
NativeLibraryConfig: WithLogs(LLamaLogLevel) (#529) 1 year ago
  Scott W Harden 4c3077d0f0
ChatSession: improve exception message 1 year ago
  Martin Evans c7d0dc915a Assorted small changes to clean up some code warnings 1 year ago
  Martin Evans 174f21a385 0.10.0 1 year ago
  Martin Evans d03c1a9201
Merge pull request #503 from martindevans/batched_executor_again 1 year ago
  Martin Evans d47b6afe4d Normalizing embeddings in `LLamaEmbedder`. As is done in llama.cpp: 2891c8aa9a/examples/embedding/embedding.cpp (L92) 1 year ago
  Martin Evans e9d9042576 Added `Divide` to `KvAccessor` 1 year ago
  Martin Evans 1cc463b9b7 Added a finalizer to `BatchedExecutor` 1 year ago
  Martin Evans 0c2cff0e1c Added a Finalizer for `Conversation` in case it is not correctly disposed. 1 year ago
  Martin Evans 949861a581 - Added a `Modify` method to `Conversation`. This grants **temporary** access to directly modify the KV cache. 1 year ago
  Martin Evans b0acecf080 Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix). 1 year ago
  Martin Evans 90915c5a99 Added increment and decrement operators to `LLamaPos` 1 year ago
  Martin Evans 82c471eac4
Merge pull request #500 from martindevans/improved_kv_cache_methods 1 year ago
  Martin Evans c5146bac23 - Exposed KV debug view through `SafeLLamaContextHandle` 1 year ago
  Martin Evans 744758f110 Using `AddRange` in `LLamaEmbedder` 1 year ago
  Martin Evans c7103e86e4 Added new file types to quantisation 1 year ago
  Martin Evans 17385e12b6
Merge pull request #479 from martindevans/update_binaries_feb_2024 1 year ago
  Martin Evans bac40a3b7a Added new binaries, from this run: https://github.com/SciSharp/LLamaSharp/actions/runs/7792319886 1 year ago
  Jason Couture c963b051e2 Add nuspec for OpenCL (CLBLAST) 1 year ago
  Martin Evans 765c697f77 Fixed number type 1 year ago
  Martin Evans b2e815d51e Updated all binaries (from this run: https://github.com/SciSharp/LLamaSharp/actions/runs/7746303349) 1 year ago
  Martin Evans 15a98b36d8 Updated everything to work with llama.cpp ce32060198 1 year ago
  Martin Evans c9c8cd0d62 - Swapped embeddings generator to use `llama_decode` 1 year ago
  Martin Evans 22aba9a671
Merge pull request #473 from martindevans/base_handle_removed_constructor 1 year ago
  Martin Evans 5da2a2f64b - Removed one of the constructors of `SafeLLamaHandleBase`, which implicitly states that memory is owned. Better to be explicit about this kind of thing! 1 year ago
  Martin Evans 9b995510d6 Removed all setters in `IModelParams` and `IContextParams`, allowing implementations to be immutable. 1 year ago
  Jason Couture ec59c5bf9e Fix missing library name prefix for cuda 1 year ago
  Jason Couture 443ce4fff4 While the dllimport changes work, manual path searching needed to be updated 1 year ago
  Jason Couture db7e1e88f8 Use llama instead of libllama in `[DllImport]` 1 year ago
  dependabot[bot] d8eb817bf5
build(deps): bump System.Text.Json from 8.0.0 to 8.0.1 1 year ago
  Martin Evans 92b9bbe779 Added methods to `SafeLLamaContextHandle` for KV cache manipulation 1 year ago
  Martin Evans a690db5d3e Fixed build error caused by extra unnecessary parameter 1 year ago
  Martin Evans 96c26c25f5
Merge pull request #445 from martindevans/stateless_executor_llama_decode 1 year ago
  Martin Evans 9fe878ae1f - Fixed example 1 year ago
  Martin Evans 9ede1bedc2 Automatically growing batch n_seq_max when exceeded. This means no parameters need to be picked when the batch is created. 1 year ago
  Martin Evans a2e29d393c Swapped `StatelessExecutor` to use `llama_decode`! 1 year ago
  Martin Evans 5b6e82a594 Improved the BatchedDecoding demo: 1 year ago
  Martin Evans 99969e538e - Removed some unused `eval` methods. 1 year ago
  Martin Evans 36a9335588 Removed `LLamaBatchSafeHandle` (using unmanaged memory, created by llama.cpp) and replaced it with a fully managed `LLamaBatch`. Modified the `BatchedDecoding` example to use new managed batch. 1 year ago