218 Commits (experimental_cpp)

Author SHA1 Message Date
  jlsantiago 3b2836eac4
Llava api (#563) 1 year ago
  Martin Evans ce4de7d607
llama_decode lock (#595) 1 year ago
  Clovis Henrique Ribeiro d0f79814e9
Added conditional compilation code to progress_callback (in LlamaModelParams struct) so the struct plays nice with legacy NET Framework 4.8 (#593) 1 year ago
  Martin Evans f0b0bbcbb7
Mutable Logits (#586) 1 year ago
  Martin Evans a8ba9f05b3
March Binary Update (#565) 1 year ago
  Martin Evans 8ac1634233
Removed `llama_eval`. It is going to be completely removed in the next version of llama.cpp (#553) 1 year ago
  Martin Evans f0e7e7cc0a
Removed `SamplingApi`. it has been marked as Obsolete for a while, replaced by instance methods on `LLamaTokenDataArray` (#552) 1 year ago
  Martin Evans 7d84625a67
Classifier Free Guidance (#536) 1 year ago
  Scott W Harden a6394001a1
NativeLibraryConfig: WithLogs(LLamaLogLevel) (#529) 1 year ago
  Martin Evans c7d0dc915a Assorted small changes to clean up some code warnings 1 year ago
  Martin Evans e9d9042576 Added `Divide` to `KvAccessor` 1 year ago
  Martin Evans 949861a581 - Added a `Modify` method to `Conversation`. This grants **temporary** access to directly modify the KV cache. 1 year ago
  Martin Evans b0acecf080 Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix). 1 year ago
  Martin Evans 90915c5a99 Added increment and decrement operators to `LLamaPos` 1 year ago
  Martin Evans c5146bac23 - Exposed KV debug view through `SafeLLamaContextHandle` 1 year ago
  Martin Evans 15a98b36d8 Updated everything to work with llama.cpp ce32060198 1 year ago
  Martin Evans 5da2a2f64b - Removed one of the constructors of `SafeLLamaHandleBase`, which implicitly states that memory is owned. Better to be explicit about this kind of thing! 1 year ago
  Jason Couture ec59c5bf9e Fix missing library name prefix for cuda 1 year ago
  Jason Couture 443ce4fff4 While the dllimport changes work, manual path searching needed to be updated 1 year ago
  Jason Couture db7e1e88f8 Use llama instead of libllama in `[DllImport]` 1 year ago
  Martin Evans 92b9bbe779 Added methods to `SafeLLamaContextHandle` for KV cache manipulation 1 year ago
  Martin Evans 96c26c25f5
Merge pull request #445 from martindevans/stateless_executor_llama_decode 1 year ago
  Martin Evans 9fe878ae1f - Fixed example 1 year ago
  Martin Evans 9ede1bedc2 Automatically growing batch n_seq_max when exceeded. This means no parameters need to be picked when the batch is created. 1 year ago
  Martin Evans a2e29d393c Swapped `StatelessExecutor` to use `llama_decode`! 1 year ago
  Martin Evans 99969e538e - Removed some unused `eval` methods. 1 year ago
  Martin Evans 36a9335588 Removed `LLamaBatchSafeHandle` (using unmanaged memory, created by llama.cpp) and replaced it with a fully managed `LLamaBatch`. Modified the `BatchedDecoding` example to use new managed batch. 1 year ago
  Martin Evans 1472704e12 Added a test with examples of troublesome strings from 0.9.1 1 year ago
  Martin Evans 73172bbaba
Merge pull request #438 from martindevans/cleanup_model_unnecessary_unsafe 1 year ago
  Martin Evans ce1d302e7e Moved some native methods into `SafeLlamaModelHandle`, these methods are all wrapped in safer accessors with no extra costs so there is no need to expose them. 1 year ago
  Martin Evans 1e86755071 - Removed unnecessary `unsafe` block in model metadata loading 1 year ago
  Martin Evans de2b20aae5 - Added a specific exception for failing to load model weights. 1 year ago
  Martin Evans 096e0e75f8 Check that the model file actually exists immediately before loading it. Improve #395 1 year ago
  Martin Evans 2ea2048b78 - Added a test for tokenizing just a new line (reproduce issue https://github.com/SciSharp/LLamaSharp/issues/430) 1 year ago
  Martin Evans 98635a0d5a Fixed decoding of large tokens (over 16 bytes) in streaming text decoder 1 year ago
  Martin Evans 402a110a3a
Merge pull request #404 from martindevans/switched_to_LLamaToken_struct 1 year ago
  Martin Evans 1e69e265b6 Moved some native methods to do with creating/destroying resources into their respective handles. There is **no** safe way to call most of these methods, everything must be done through through handles. 1 year ago
  Martin Evans 82727c4414 Removed collection expressions from test 1 year ago
  Martin Evans 2eb52b1630 made casts to/from int explicit, fixed places affected 1 year ago
  Martin Evans 42be9b136d Switched form using raw integers, to a `LLamaToken` struct 1 year ago
  Martin Evans 4e5e994dda - directly returning a SafeLlamaModelHandle, instead of an IntPtr which is wrapped in a handle. 1 year ago
  Martin Evans bac3e43498 Fixed handling of empty spans 1 year ago
  Martin Evans c002642268 - Removed some `unsafe` where it wasn't necessary 1 year ago
  Martin Evans f860f88c36 Code cleanup driven by R# suggestions: 1 year ago
  Martin Evans 2cded1b296 Fixed alignment of value fields in `LLamaModelMetadataOverride` 1 year ago
  Martin Evans 6be3f62321 Fixed loading of very large metadata values (over 1kb) 1 year ago
  Martin Evans fb606c2488 Fixed incorrect values 1 year ago
  Martin Evans 47e4fcef2a Fixed GetString on netstandard2 1 year ago
  Martin Evans 2a1e1b6183 Removed unused imports 1 year ago
  Martin Evans a2bae178fa Added a `Metadata` property to `LLamaWeights` 1 year ago