46 Commits (8766fb1b03955b44e511e22df1cef608b1ef6d8b)

Author SHA1 Message Date
  Martin Evans b34f72a883 - Added `SamplingPipeline` to inference params which overrides all other options with an entirely custom pipeline. 1 year ago
  Martin Evans d743516070 - Added support for the MinP sampler 2 years ago
  Martin Evans 7e3cde4c13 Moved helper methods into `LLamaBatchSafeHandle` 2 years ago
  Martin Evans ccb8afae46 Cleaned up stateless executor as preparation for changing it to use the new batched decoding system. 2 years ago
  Martin Evans a03fe003de Fixed decoding of text "accumulating" over time (never properly clearing buffer) 2 years ago
  Martin Evans 51d4411a58 Added two new classes for detokenization tasks: 2 years ago
  Martin Evans efdf3d630c - Removed all `TokenToString` methods (it's never correct to use them, because sometimes one single character may be represented by multiple tokens). 2 years ago
  Martin Evans f1e5a8f995 - Passing the `ILogger` through to every call of `CreateContext` 2 years ago
  sa_ddam213 4ec9aed47a
Revert LLamasSharp project changes 2 years ago
  sa_ddam213 b4b4000342
Merge branch 'master' into upstream_master 2 years ago
  Martin Evans d8434ea9d6
Merge pull request #185 from martindevans/wip_major_api_change 2 years ago
  Martin Evans efb0664df0 - Added new binaries 2 years ago
  sa_ddam213 9b8de007dc Propagate ILogger 2 years ago
  Martin Evans 669ae47ef7 - Split parameters into two interfaces 2 years ago
  Martin Evans 0d40338692 Fixed out-of-context handling in stateless executor 2 years ago
  Martin Evans d58fcbbd13 Fixed antiprompt checking 2 years ago
  Martin Evans 08f1615e60 - Converted LLamaStatelessExecutor to run `Exec` calls inside an awaited task. This unblocks async callers while the model is being evaluated. 2 years ago
  Martin Evans 3f80190f85 Minimal changes required to remove non-async inference. 2 years ago
  Martin Evans 77bd090150 Simplified `LLamaInteractExecutor` antiprompt matching by using new extension method 2 years ago
  Martin Evans 614ba40948 - Added a `TokensEndsWithAnyString` extension to `IReadOnlyList<int>` which efficiently checks if a set of tokens ends with one of a set of strings. 2 years ago
  Martin Evans 93f24f8a51 Switched to properly typed `Encoding` property 2 years ago
  Martin Evans 759ae26f36
Merge branch 'master' into grammar_basics 2 years ago
  Martin Evans a9e6f21ab8 - Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring! 2 years ago
  Martin Evans e7b217f462 Fixed out of context logic 2 years ago
  Martin Evans 4738c26299 - Reduced context size of test, to speed it up 2 years ago
  Martin Evans 64416ca23c - Created a slightly nicer way to create grammar (from `IReadOnlyList<IReadOnlyList<LLamaGrammarElement>>`) 2 years ago
  Martin Evans f3511e390f WIP demonstrating changes to support multi-context. You can see this in use in `TalkToYourself`, along with notes on what still needs improving. 2 years ago
  Martin Evans 270c6d55ef
Merge pull request #88 from martindevans/fix_serialization_nan 2 years ago
  Martin Evans be52737488 Using a nullable float instead of NaN, this should fix the serialization issue reported in #85 2 years ago
  Martin Evans 1fceeaf352 Applied fix from #84 (antiprompt does not work in stateless executor) 2 years ago
  Yaohui Liu d609b0e1d5
Merge branch 'master' of github.com:SciSharp/LLamaSharp into rinne-dev 2 years ago
  Yaohui Liu b60c8bd285
fix: antiprompt does not work in stateless executor. 2 years ago
  Martin Evans 2b2d3af26b Moved `Eval` out of `Utils` and into `SafeLLamaContextHandle` 2 years ago
  Martin Evans 7fabcc1849 One last `TokenToString` case 2 years ago
  Martin Evans 0e5e00e300 Moved `TokenToString` from Utils into `SafeLLamaContextHandle` (thin wrappers around the same method in `SafeLlamaModelHandle`) 2 years ago
  sa_ddam213 bac9cba01a InferenceParams abstractions 2 years ago
  Martin Evans c64507cb41 Correctly passing through mu value to mirostate instead of resetting it every time. 2 years ago
  Martin Evans ad28a5acdb
Merge branch 'master' into fix_multiple_enumeration 2 years ago
  Rinne 4d7d4f2bfe
Merge pull request #59 from saddam213/master 2 years ago
  sa_ddam213 3fec7a63c7 Add Instruct and Stateless support 2 years ago
  Martin Evans f3fa73de2b Implemented a new `LlamaModel.State` handle which internally stores the state as natively allocated memory. This allows it to exceed the 2GB limit on C# arrays. 2 years ago
  Martin Evans 7cf1f8ac28 Fixed multiple cases where an `IEnumerable<T>` was enumerated multiple times. 2 years ago
  Yaohui Liu 6c400e64c2
docs: publiash documentation 0.4. 2 years ago
  Marcel b911b2548b move interfaces into abstractions folder 2 years ago
  Yaohui Liu 3bf74ec9b9
feat: add chat session for refactored code. 2 years ago
  Yaohui Liu 908b79e855
feat: add stateless executor. 2 years ago