37 Commits (4fc743c9ba908b85aac24fb8f7a37c78fcdbb10b)

Author SHA1 Message Date
  Martin Evans 89fef05362 This commit (5fe721bdbe) accidentally removed a load of stuff that it shouldn't. Fixed that. 2 years ago
  SignalRT 97006a214f Merge remote-tracking branch 'upstream/master' into RuntimeDetection 2 years ago
  Martin Evans 31244ae691
Merge branch 'master' into YaRN_scaling_parameters 2 years ago
  SignalRT 5fe721bdbe Revert "Merge branch 'pr/268' into RuntimeDetection" 2 years ago
  Martin Evans db1bc741b0 Modified `ContextSize` in parameters to be nullable. A null value means autodetect from the model. 2 years ago
  Udayshankar Ravikumar 4071c1f5fc Updated preprocessor directives 2 years ago
  Udayshankar Ravikumar df310e15da Fixed preprocessor directives 2 years ago
  Martin Evans 04ee64a6be Exposed YaRN scaling parameters in IContextParams 2 years ago
  Udayshankar Ravikumar 1dad1ff834 Enhance framework compatibility 2 years ago
  Martin Evans 529b06b35b - Fixed rope frequency/base to use the values in the model by default, instead of always overriding them by default! 2 years ago
  Martin Evans 321d0b58c4
Merge pull request #202 from martindevans/multi_gpu 2 years ago
  Martin Evans 51d4411a58 Added two new classes for detokenization tasks: 2 years ago
  Martin Evans 6a4cd506bd Added a safe `TensorSplitsCollection` to the params which prevents incorrectly setting the `tensor_splits` collection 2 years ago
  Martin Evans 15db194c17 Added multi GPU support 2 years ago
  Martin Evans 9daf586ba8 Assorted cleanup leftover after the huge change in the last PR (comments, syntax style, etc) 2 years ago
  Martin Evans d8434ea9d6
Merge pull request #185 from martindevans/wip_major_api_change 2 years ago
  Martin Evans b8f0eff080 - Added `GetCharCountImpl` tests, fixed handling of empty strings 2 years ago
  Martin Evans 2a38808bca - Added threads to context params, replaced all thread args with `uint?` 2 years ago
  Martin Evans 4e9b1f8cdc - Split extension methods into separate files 2 years ago
  Martin Evans 669ae47ef7 - Split parameters into two interfaces 2 years ago
  Martin Evans bca55eace0 Initial changes to match the llama.cpp changes 2 years ago
  Martin Evans 08f1615e60 - Converted LLamaStatelessExecutor to run `Exec` calls inside an awaited task. This unblocks async callers while the model is being evaluated. 2 years ago
  Martin Evans fe54f6764f - Added unit tests for extension methods 2 years ago
  Martin Evans d08a125020 Using the `TokensEndsWithAnyString` extensions for antiprompt checking in instruct executor. Simpler and more efficient. 2 years ago
  Martin Evans 77bd090150 Simplified `LLamaInteractExecutor` antiprompt matching by using new extension method 2 years ago
  Martin Evans 614ba40948 - Added a `TokensEndsWithAnyString` extension to `IReadOnlyList<int>` which efficiently checks if a set of tokens ends with one of a set of strings. 2 years ago
  Rinne 4e83e48ad1
Merge pull request #122 from martindevans/gguf 2 years ago
  Martin Evans a70c7170dd - Created a higher level `Grammar` class which is immutable and contains a list of grammar rules. This is the main "entry point" to the grammar system. 2 years ago
  Martin Evans 2056078aef Initial changes required for GGUF support 2 years ago
  Martin Evans cf4754db44 Removed unnecessary parameters from some low level sampler methods 2 years ago
  Martin Evans 4738c26299 - Reduced context size of test, to speed it up 2 years ago
  Martin Evans 91bcefc852 comment on IModelParamsExtensions 2 years ago
  Martin Evans 9cdc72aa67 Fixed `ToLlamaContextParams` using the wrong parameter for `use_mmap` 2 years ago
  sa_ddam213 2d1269cae9 Access to IModelParamsExtensions 2 years ago
  Martin Evans f2499371ea Pulled conversion of a `IModelParams` into a `LLamaContextParams` out into an extension method which can be used in other places. 2 years ago
  Martin Evans 18462beb31 - Removed the `Update` and `GetOrDefault` extension methods (they were unused). 2 years ago
  Yaohui Liu 00d91cf99e
refactor: some parts of code of LLamaModel. 2 years ago