Martin Evans
93f24f8a51
Switched to properly typed `Encoding` property
2 years ago
Martin Evans
759ae26f36
Merge branch 'master' into grammar_basics
2 years ago
Martin Evans
a9e6f21ab8
- Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring!
- Passing encoding in the `IModelParams`, which reduces how often encoding needs to be passed around
2 years ago
Martin Evans
e7b217f462
Fixed out of context logic
2 years ago
Martin Evans
4738c26299
- Reduced context size of test, to speed it up
- Removed some unnecessary `ToArray` calls
- Initial pass on LLamaStatelessExecutor, the context overflow management is broken but I think I found where it's ported from
2 years ago
Martin Evans
64416ca23c
- Created a slightly nicer way to create grammar (from `IReadOnlyList<IReadOnlyList<LLamaGrammarElement>>`)
- Integrated grammar into sampling
- Added a test for the grammar sampling
2 years ago
Martin Evans
f3511e390f
WIP demonstrating changes to support multi-context. You can see this in use in `TalkToYourself`, along with notes on what still needs improving.
The biggest single change is renaming `LLamaModel` to `LLamaContext`
2 years ago
Martin Evans
270c6d55ef
Merge pull request #88 from martindevans/fix_serialization_nan
Fix serialization error due to NaN
2 years ago
Martin Evans
be52737488
Using a nullable float instead of NaN, this should fix the serialization issue reported in #85
2 years ago
Martin Evans
1fceeaf352
Applied fix from #84 (antiprompt does not work in stateless executor)
2 years ago
Yaohui Liu
d609b0e1d5
Merge branch 'master' of github.com:SciSharp/LLamaSharp into rinne-dev
2 years ago
Yaohui Liu
b60c8bd285
fix: antiprompt does not work in stateless executor.
2 years ago
Martin Evans
2b2d3af26b
Moved `Eval` out of `Utils` and into `SafeLLamaContextHandle`
2 years ago
Martin Evans
7fabcc1849
One last `TokenToString` case
2 years ago
Martin Evans
0e5e00e300
Moved `TokenToString` from Utils into `SafeLLamaContextHandle` (thin wrappers around the same method in `SafeLlamaModelHandle`)
2 years ago
sa_ddam213
bac9cba01a
InferenceParams abstractions
2 years ago
Martin Evans
c64507cb41
Correctly passing through mu value to mirostate instead of resetting it every time.
2 years ago
Martin Evans
ad28a5acdb
Merge branch 'master' into fix_multiple_enumeration
2 years ago
Rinne
4d7d4f2bfe
Merge pull request #59 from saddam213/master
Instruct & Stateless web example implemented
2 years ago
sa_ddam213
3fec7a63c7
Add Instruct and Stateless support
2 years ago
Martin Evans
f3fa73de2b
Implemented a new `LlamaModel.State` handle which internally stores the state as natively allocated memory. This allows it to exceed the 2GB limit on C# arrays.
2 years ago
Martin Evans
7cf1f8ac28
Fixed multiple cases where an `IEnumerable<T>` was enumerated multiple times.
2 years ago
Yaohui Liu
6c400e64c2
docs: publiash documentation 0.4.
2 years ago
Marcel
b911b2548b
move interfaces into abstractions folder
2 years ago
Yaohui Liu
3bf74ec9b9
feat: add chat session for refactored code.
2 years ago
Yaohui Liu
908b79e855
feat: add stateless executor.
2 years ago