14 Commits (04ee64a6be28d9cace256bcc7def02890d2bfe38)

Author SHA1 Message Date
  Martin Evans 04ee64a6be Exposed YaRN scaling parameters in IContextParams 2 years ago
  Martin Evans 529b06b35b - Fixed rope frequency/base to use the values in the model by default, instead of always overriding them by default! 2 years ago
  Martin Evans 6a4cd506bd Added a safe `TensorSplitsCollection` to the params which prevents incorrectly setting the `tensor_splits` collection 2 years ago
  sa_ddam213 b4b4000342
Merge branch 'master' into upstream_master 2 years ago
  Martin Evans 2a38808bca - Added threads to context params, replaced all thread args with `uint?` 2 years ago
  sa_ddam213 c9108f8311 Add service for managing Models and Model Contexts 2 years ago
  Martin Evans 669ae47ef7 - Split parameters into two interfaces 2 years ago
  Martin Evans bca55eace0 Initial changes to match the llama.cpp changes 2 years ago
  Martin Evans 2056078aef Initial changes required for GGUF support 2 years ago
  Martin Evans 93f24f8a51 Switched to properly typed `Encoding` property 2 years ago
  Martin Evans a9e6f21ab8 - Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring! 2 years ago
  Martin Evans 2c933c57a1 Fixed ModelOptions in Web project 2 years ago
  sa_ddam213 2a04e31b7d ModelParams abstraction 2 years ago
  sa_ddam213 d9fbd56f10 Strongly type connection status 2 years ago