9 Commits (18b15184ea6dceba07b9e6a6a73594e823714d40)

Author SHA1 Message Date
  Martin Evans b7379b7124 Moved spinner out to an extension, so it can easily be used in other examples 2 years ago
  Martin Evans 08f1615e60 - Converted LLamaStatelessExecutor to run `Exec` calls inside an awaited task. This unblocks async callers while the model is being evaluated. 2 years ago
  Martin Evans 3f80190f85 Minimal changes required to remove non-async inference. 2 years ago
  Martin Evans 29df14cd9c Converted ModelParams into a `record` class. This has several advantages: 2 years ago
  Martin Evans a9e6f21ab8 - Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring! 2 years ago
  Martin Evans ae8ef17a4a - Added various convenience overloads to `LLamaContext.Eval` 2 years ago
  Martin Evans 02a46fc363 Updated demos to use the new loading/multi context system 2 years ago
  Martin Evans f3511e390f WIP demonstrating changes to support multi-context. You can see this in use in `TalkToYourself`, along with notes on what still needs improving. 2 years ago
  Yaohui Liu 2eb2d6df83 test: add 9 examples of the new version. 2 years ago