1331 Commits (3d76ef7b6ab554277bd44df4857a3b2014758fb7)
 

Author SHA1 Message Date
  Martin Evans 3d76ef7b6a
Rewritten some examples docs, explaining what these examples show instead of just showing the source code. (#728) 1 year ago
  Martin Evans 3b0b2ab224
Merge pull request #721 from martindevans/kv_cache_view 1 year ago
  Martin Evans 44bd5b311e
Merge pull request #715 from martindevans/llama-templater 1 year ago
  Martin Evans b326624ade Split template out to a field, so it can be changed more easily. 1 year ago
  Martin Evans b25f93b86d
Merge pull request #725 from martindevans/fix_cublas_git_ref 1 year ago
  Martin Evans c585eb5b25 Fixed cublas action always compiling `master` instead of the specific commit 1 year ago
  Rinne d509105da7
ci: add windows benchmark test. (#723) 1 year ago
  Rinne 6f9097f25b
ci: add benchmark test. (#720) 1 year ago
  Martin Evans 2117287da9 Expanded the `LLamaKvCacheView` to make it usable without unsafe. 1 year ago
  Martin Evans 4332ab3813 Changes based on review feedback: 1 year ago
  Martin Evans a0335f67a4 - Added `LLamaTemplate` which efficiently formats a series of messages according to the model template. 1 year ago
  Martin Evans 05100184f4
Merge pull request #719 from martindevans/remove-batched-conversation-prompt-with-string 1 year ago
  Martin Evans 3ba49754b1 Removed (marked as obsolete) prompting with a string for `Conversation`. Tokenization requires extra parameters (e.g. addBos, special) which require special considersation. For now it's better to tokenize using other tools and pass the tokens directly. 1 year ago
  Martin Evans 9906871f84
Merge pull request #714 from ksanman/infinite-context 1 year ago
  ksanchez 0bbbf171ed Refactor executors 1 year ago
  ksanchez 46a9d603f4 Add method to get BOS token. 1 year ago
  ksanchez 61d143d8d8 Implement context shifting in executor base 1 year ago
  Rinne 6bf010d719
Merge pull request #689 from zsogitbe/master 1 year ago
  Zoli Somogyi 54c01d4c2c Making old code obsolete - SemanticKernel: Correcting working with PromptExecutionSettings 1 year ago
  Rinne 0c770a528e
Merge pull request #671 from kidkych/feature/interactive-sk-chatcompletion 1 year ago
  Rinne 16141adcb0
Merge pull request #711 from Norne9/master 1 year ago
  Rinne 7b03e735bb
Merge pull request #709 from AsakusaRinne/format_check_ci 1 year ago
  Norne9 5c60e6d4ca
Merge pull request #1 from Norne9/Norne9-patch-chat-session 1 year ago
  Norne9 ad9bf1cbba
InitializeSessionFromHistoryAsync changed 1 year ago
  Rinne 33d5677c0e Add editorconfig file for code format. 1 year ago
  Rinne f44c8846f5
Merge pull request #710 from AsakusaRinne/typo_check_ci 1 year ago
  Rinne 495177fd0f fix: typos. 1 year ago
  Rinne de31a06a4a ci: add workflow to check the spelling. 1 year ago
  Rinne 98909dc2af
Merge pull request #708 from AsakusaRinne/llama3_support 1 year ago
  Martin Evans 4c078a757c
Merge pull request #703 from martindevans/llava_async_load 1 year ago
  Rinne 175b25d4f7
Add LLaMA3 chat session example. 1 year ago
  Martin Evans 377ebf3664 - Added `LoadFromFileAsync` method for `LLavaWeights` 1 year ago
  Martin Evans 84bb5a36ab
Merge pull request #702 from martindevans/interruptible_async_model_load 1 year ago
  Martin Evans 1ec0fee5ba Added optional `IProgress` parameter to `LoadFromFileAsync` 1 year ago
  Zoli Somogyi 2aa96b206f Adding Response Format - Correcting non-standard way of working with PromptExecutionSettings 1 year ago
  Martin Evans 9867b4c85d Only setting callback if the token can be cancelled. 1 year ago
  Martin Evans 00df7c1516 - Added `LLamaWeights.LoadFromFileAsync`. 1 year ago
  Rinne b47ed9258f
Merge pull request #701 from AsakusaRinne/add_issue_template 1 year ago
  Rinne bcf3ef1e40
Fix typo in issue templates. 1 year ago
  Rinne c6565c3aaf
Merge pull request #700 from AsakusaRinne/add_issue_template 1 year ago
  Rinne d56eb1a5ad
Add issue templates. 1 year ago
  Martin Evans 18586cc43b
Merge pull request #696 from martindevans/safe_handle_constructor_refactor 1 year ago
  Martin Evans e9fd7f96e0
Merge pull request #691 from martindevans/empty_batch_check 1 year ago
  Martin Evans a2f8573831
Merge pull request #698 from martindevans/slightly_safer_quantize_params 1 year ago
  Martin Evans d4f793a7eb Using `is` check instead of `== null` 1 year ago
  Martin Evans ecb359c9e7
- Using more specific `LoadWeightsFailedException` when a llava model fails to load (#697) 1 year ago
  Martin Evans 58ec798bff Modified `llama_model_quantize` to accept argument by `ref` instead of pointer. 1 year ago
  Martin Evans 54dab273cd - Removed unnecessary constructors from safe handles 1 year ago
  Martin Evans 25812762c9 Added checks in `Decode` to skip doing anything if the batch is empty. 1 year ago
  Zoli Somogyi 59a0afdb77 Renaming files to correspond to class names 1 year ago