Martin Evans
|
e47431ed80
|
Modified `TensorSplitsCollection` so it accepts any number of splits, as long as it doesn't exceed the number of supported devices
|
2 years ago |
Martin Evans
|
f621ec67e8
|
Fixed serialization
|
2 years ago |
Martin Evans
|
768747c652
|
spelling
|
2 years ago |
Martin Evans
|
b4e7f64e76
|
Added System.Text.Json serialization for `TensorSplitsCollectionConverter`
|
2 years ago |
Martin Evans
|
6a4cd506bd
|
Added a safe `TensorSplitsCollection` to the params which prevents incorrectly setting the `tensor_splits` collection
|
2 years ago |
Martin Evans
|
9daf586ba8
|
Assorted cleanup leftover after the huge change in the last PR (comments, syntax style, etc)
|
2 years ago |
Martin Evans
|
2a38808bca
|
- Added threads to context params, replaced all thread args with `uint?`
- Replaced all binaries
|
2 years ago |
Martin Evans
|
669ae47ef7
|
- Split parameters into two interfaces
- params contains a list of loras, instead of just one
|
2 years ago |
Martin Evans
|
bca55eace0
|
Initial changes to match the llama.cpp changes
|
2 years ago |
Martin Evans
|
b47977300a
|
Removed one more unused parameter
|
2 years ago |
Martin Evans
|
a1b0349561
|
Removed `ModelAlias` property (unused)
|
2 years ago |
Martin Evans
|
2056078aef
|
Initial changes required for GGUF support
|
2 years ago |
Martin Evans
|
a911b77dec
|
Various minor changes, resolving about 100 ReSharper code quality warnings
|
2 years ago |
Martin Evans
|
93f24f8a51
|
Switched to properly typed `Encoding` property
|
2 years ago |
Martin Evans
|
a9e6f21ab8
|
- Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring!
- Passing encoding in the `IModelParams`, which reduces how often encoding needs to be passed around
|
2 years ago |
Martin Evans
|
685eb3b9c2
|
Replaced `nint` with `float[]?` in Model params, which is much more user friendly!
|
2 years ago |
sa_ddam213
|
2a04e31b7d
|
ModelParams abstraction
|
2 years ago |