41 Commits (a024d2242e1e984d406a9e5f72c23a6763b490ce)

Author SHA1 Message Date
  Martin Evans a024d2242e It works! 2 years ago
  Martin Evans 36c71abcfb Fixed `LLama.StreamingTokenDecoderLLamaLLama.StreamingTokenDecoderLLamaLLama.StreamingTokenDecoderLLama` spam in all executors except Stateless. 2 years ago
  Martin Evans 51d4411a58 Added two new classes for detokenization tasks: 2 years ago
  Martin Evans efdf3d630c - Removed all `TokenToString` methods (it's never correct to use them, because sometimes one single character may be represented by multiple tokens). 2 years ago
  Martin Evans 9daf586ba8 Assorted cleanup leftover after the huge change in the last PR (comments, syntax style, etc) 2 years ago
  Martin Evans d8434ea9d6
Merge pull request #185 from martindevans/wip_major_api_change 2 years ago
  Martin Evans 1f8c94e386 Added in the `special` parameter to the tokenizer (introduced in https://github.com/ggerganov/llama.cpp/pull/3538) 2 years ago
  Martin Evans 669ae47ef7 - Split parameters into two interfaces 2 years ago
  Martin Evans 9a0a0ae9fe Removed cloning support 2 years ago
  Martin Evans 0d40338692 Fixed out-of-context handling in stateless executor 2 years ago
  Martin Evans ce1fc51163 Added some more native methods 2 years ago
  Martin Evans bca55eace0 Initial changes to match the llama.cpp changes 2 years ago
  Martin Evans 08f1615e60 - Converted LLamaStatelessExecutor to run `Exec` calls inside an awaited task. This unblocks async callers while the model is being evaluated. 2 years ago
  redthing1 b78044347c
fix opaque GetState (fixes #176) 2 years ago
  Martin Evans 466722dcff
Merge pull request #165 from martindevans/better_instruct_antiprompt_checking 2 years ago
  Martin Evans d08a125020 Using the `TokensEndsWithAnyString` extensions for antiprompt checking in instruct executor. Simpler and more efficient. 2 years ago
  Martin Evans bba801f4b7 Added a property to get the KV cache size from a context 2 years ago
  Martin Evans 4dac142bd5
Merge pull request #160 from martindevans/GetState_fix 2 years ago
  Martin Evans 832bf7dbe0 Simplified implementation of `GetState` and fixed a memory leak (`bigMemory` was never freed) 2 years ago
  Martin Evans 4f7b6ffdcc Removed `GenerateResult` method that was only used in one place 2 years ago
  sa_ddam213 949b0cde16
Replace ILLamaLogger for ILogger 2 years ago
  Martin Evans 31287b5e6e Rewritten TokenToSpan/TokenToString to better fit the new way it's done in llama.cpp with a few different options: 2 years ago
  Martin Evans 0c98ae1955 Passing ctx to `llama_token_nl(_ctx)` 2 years ago
  Martin Evans 826c6aaec3 cleaned up higher level code using the sampling API: 2 years ago
  Martin Evans a911b77dec Various minor changes, resolving about 100 ReSharper code quality warnings 2 years ago
  Martin Evans 5a6c6de0dc
Merge pull request #115 from martindevans/model_params_record 2 years ago
  Martin Evans 70be6c7368 Removed `virtual` method in newly sealed class 2 years ago
  Martin Evans ebacdb666d - Moved the lower level state get/set methods onto SafeLLamaContextHandle 2 years ago
  Martin Evans 93f24f8a51 Switched to properly typed `Encoding` property 2 years ago
  Martin Evans 759ae26f36
Merge branch 'master' into grammar_basics 2 years ago
  Martin Evans a9e6f21ab8 - Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring! 2 years ago
  Martin Evans 4738c26299 - Reduced context size of test, to speed it up 2 years ago
  Martin Evans ae8ef17a4a - Added various convenience overloads to `LLamaContext.Eval` 2 years ago
  Martin Evans 64416ca23c - Created a slightly nicer way to create grammar (from `IReadOnlyList<IReadOnlyList<LLamaGrammarElement>>`) 2 years ago
  Martin Evans f5a260926f Renamed `EmbeddingCount` to `EmbeddingSize` in higher level class 2 years ago
  Martin Evans 479ff57853 Renamed `EmbeddingCount` to `EmbeddingSize` 2 years ago
  Martin Evans d0a7a8fcd6 - Cleaned up disposal in LLamaContext 2 years ago
  Martin Evans 4d741d24f2 Marked old `LLamaContext` constructor obsolete 2 years ago
  Martin Evans 20bdc2ec6f - Apply LoRA in `LLamaWeights.LoadFromFile` 2 years ago
  Martin Evans e2fe08a9a2 Added a higher level `LLamaWeights` wrapper around `SafeLlamaModelHandle` 2 years ago
  Martin Evans f3511e390f WIP demonstrating changes to support multi-context. You can see this in use in `TalkToYourself`, along with notes on what still needs improving. 2 years ago