34 Commits (c7d0dc915aae3d38d6adeeb22cfbff5cadfaf770)

Author SHA1 Message Date
  Martin Evans b0acecf080 Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix). 1 year ago
  Martin Evans 92b9bbe779 Added methods to `SafeLLamaContextHandle` for KV cache manipulation 1 year ago
  Martin Evans 96c26c25f5
Merge pull request #445 from martindevans/stateless_executor_llama_decode 1 year ago
  xbotter 90815ae7d8
bump sk & km 1 year ago
  Martin Evans 9fe878ae1f - Fixed example 1 year ago
  Martin Evans a2e29d393c Swapped `StatelessExecutor` to use `llama_decode`! 1 year ago
  Martin Evans 5b6e82a594 Improved the BatchedDecoding demo: 1 year ago
  Martin Evans 99969e538e - Removed some unused `eval` methods. 1 year ago
  Martin Evans 36a9335588 Removed `LLamaBatchSafeHandle` (using unmanaged memory, created by llama.cpp) and replaced it with a fully managed `LLamaBatch`. Modified the `BatchedDecoding` example to use new managed batch. 1 year ago
  Martin Evans 42be9b136d Switched form using raw integers, to a `LLamaToken` struct 1 year ago
  Martin Evans a408335c44 Fixed broken build on master (just removing a namespace that no longer exists) 1 year ago
  Martin Evans f0d7468b22
Merge pull request #356 from xbotter/deps/sk-rc3 1 year ago
  xbotter 40ac944fb5
Bump sk to 1.0.1 1 year ago
  Martin Evans b868b056f7 Added metadata overrides to `IModelParams` 1 year ago
  xbotter 8766fb1b03
Merge branch 'deps/sk-rc3' of https://github.com/xbotter/LLamaSharp into deps/sk-rc3 1 year ago
  xbotter 213b4be723
bump sk-1.0.0-rc4 1 year ago
  xbotter ce20b30e06
Merge branch 'SciSharp:master' into deps/sk-rc3 1 year ago
  Rinne fb75e06293
fix: output prefix of Chinese example. 1 year ago
  Rinne 836f071cd0
fix: Chinese example. 1 year ago
  xbotter 13a312b4ec
update sk to 1.0.0-rc3 & km to 0.18 1 year ago
  Philipp Bauer f669a4f5a7 Update the Chinese chat sample to use new ChatSession integration 1 year ago
  Philipp Bauer 2cc01efdae
Merge branch 'SciSharp:master' into master 1 year ago
  Martin Evans 4fc743c9ba
Merge branch 'master' into master 1 year ago
  Philipp Bauer 422605d980 Re-add ChatSession examples 2 years ago
  Philipp Bauer 73d1725954 Modified / updated ChatSession examples 2 years ago
  xbotter a2b26faa7a
🔧 Refactor chat completion implementation 1 year ago
  Rinne 934358a7b3
Merge branch 'master' of github.com:AsakusaRinne/LLamaSharp into fix_chinese 2 years ago
  Rinne 217c67b757
fix: chinese encoding error. 2 years ago
  xbotter d1e2a4750b
🔧 Update KernelMemory configuration 2 years ago
  Rinne c94aeabc4b
Merge pull request #307 from xbotter/sm-default-config 2 years ago
  xbotter 286904920b
update DefaultInferenceParams in WithLLamaSharpDefaults 2 years ago
  xbotter 1056e13414
fix examples 2 years ago
  Martin Evans 479779e908 Some minor cleanup on example code: 2 years ago
  xbotter 521e36903c
🔀 Remove unused code and update examples 2 years ago