51 Commits (ec8f83236545a1989df2f75da4e1d8d0345b0407)

Author SHA1 Message Date
  Rinne ec8f832365
fix: add cuda llava native libraries. 1 year ago
  Rinne b9444452eb
docs: refactor the documentations. 1 year ago
  SignalRT 43677c511c Change interface to support multiple images and add the capability to render the image in the console 1 year ago
  SignalRT e8732efadd Example InteractiveExecutor 1 year ago
  Rinne b677cdc6a3
Merge pull request #560 from eublefar/feature/chat-session-state-management 1 year ago
  Martin Evans e2705be6c8
Fixed off by one error in LLamaBatch sampling position (#626) 1 year ago
  eublefar 9440f153da Make process message method more flexible 1 year ago
  Martin Evans ad682fbebd
`BatchedExecutor.Create()` method (#613) 1 year ago
  eublefar a31391edd7 Polymorphic serialization for executor state and transforms 1 year ago
  Martin Evans f0b0bbcbb7
Mutable Logits (#586) 1 year ago
  eublefar 0763f307ec Example chat session with preprocessing of chat history and reset operation that resets chat to original point of history without extra processing 1 year ago
  Martin Evans 7d84625a67
Classifier Free Guidance (#536) 1 year ago
  Martin Evans 91a7967869
`ReadOnlySpan<float>` in ISamplingPipeline (#538) 1 year ago
  Martin Evans 74a39188a2
Used `AnsiConsole` in a few more places: (#534) 1 year ago
  Scott W Harden 91ca9d2732
LLamaSharp.Examples: Document Q&A with local storage (#532) 1 year ago
  Scott W Harden 06ffe3ac95
LLama.Examples: improve model path prompt (#526) 1 year ago
  Scott W Harden efa49cc8de
Improve "embeddings" example (#525) 1 year ago
  Martin Evans b0acecf080 Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix). 1 year ago
  Martin Evans 92b9bbe779 Added methods to `SafeLLamaContextHandle` for KV cache manipulation 1 year ago
  Martin Evans 96c26c25f5
Merge pull request #445 from martindevans/stateless_executor_llama_decode 1 year ago
  xbotter 90815ae7d8
bump sk & km 1 year ago
  Martin Evans 9fe878ae1f - Fixed example 1 year ago
  Martin Evans a2e29d393c Swapped `StatelessExecutor` to use `llama_decode`! 1 year ago
  Martin Evans 5b6e82a594 Improved the BatchedDecoding demo: 1 year ago
  Martin Evans 99969e538e - Removed some unused `eval` methods. 1 year ago
  Martin Evans 36a9335588 Removed `LLamaBatchSafeHandle` (using unmanaged memory, created by llama.cpp) and replaced it with a fully managed `LLamaBatch`. Modified the `BatchedDecoding` example to use new managed batch. 1 year ago
  Martin Evans 42be9b136d Switched from using raw integers to a `LLamaToken` struct 1 year ago
  Martin Evans a408335c44 Fixed broken build on master (just removing a namespace that no longer exists) 1 year ago
  Martin Evans f0d7468b22
Merge pull request #356 from xbotter/deps/sk-rc3 1 year ago
  xbotter 40ac944fb5
Bump sk to 1.0.1 1 year ago
  Martin Evans b868b056f7 Added metadata overrides to `IModelParams` 1 year ago
  xbotter 8766fb1b03
Merge branch 'deps/sk-rc3' of https://github.com/xbotter/LLamaSharp into deps/sk-rc3 1 year ago
  xbotter 213b4be723
bump sk-1.0.0-rc4 1 year ago
  xbotter ce20b30e06
Merge branch 'SciSharp:master' into deps/sk-rc3 1 year ago
  Rinne fb75e06293
fix: output prefix of Chinese example. 1 year ago
  Rinne 836f071cd0
fix: Chinese example. 1 year ago
  xbotter 13a312b4ec
update sk to 1.0.0-rc3 & km to 0.18 1 year ago
  Philipp Bauer f669a4f5a7 Update the Chinese chat sample to use new ChatSession integration 1 year ago
  Philipp Bauer 2cc01efdae
Merge branch 'SciSharp:master' into master 1 year ago
  Martin Evans 4fc743c9ba
Merge branch 'master' into master 1 year ago
  Philipp Bauer 422605d980 Re-add ChatSession examples 2 years ago
  Philipp Bauer 73d1725954 Modified / updated ChatSession examples 2 years ago
  xbotter a2b26faa7a
🔧 Refactor chat completion implementation 2 years ago
  Rinne 934358a7b3
Merge branch 'master' of github.com:AsakusaRinne/LLamaSharp into fix_chinese 2 years ago
  Rinne 217c67b757
fix: chinese encoding error. 2 years ago
  xbotter d1e2a4750b
🔧 Update KernelMemory configuration 2 years ago
  Rinne c94aeabc4b
Merge pull request #307 from xbotter/sm-default-config 2 years ago
  xbotter 286904920b
update DefaultInferenceParams in WithLLamaSharpDefaults 2 years ago
  xbotter 1056e13414
fix examples 2 years ago
  Martin Evans 479779e908 Some minor cleanup on example code: 2 years ago