Martin Evans
e89ca5cc17
Fixed a few minor warnings
2 years ago
Martin Evans
d8434ea9d6
Merge pull request #185 from martindevans/wip_major_api_change
Major llama.cpp API Change
2 years ago
Daniel Vaughan
f64a54c9c8
Support SemanticKernel 1.0.0-beta1
2 years ago
Martin Evans
669ae47ef7
- Split parameters into two interfaces
- params contains a list of loras, instead of just one
2 years ago
Martin Evans
bca55eace0
Initial changes to match the llama.cpp changes
2 years ago
Martin Evans
b7379b7124
Moved spinner out to an extension, so it can easily be used in other examples
2 years ago
Martin Evans
08f1615e60
- Converted LLamaStatelessExecutor to run `Exec` calls inside an awaited task. This unblocks async callers while the model is being evaluated.
- Added a "spinner" to the `StatelessModeExecute` demo, which spins while waiting for the next token (demonstrating that it's not blocked).
2 years ago
Haiping Chen
45426aefa9
Fix CodingAssistant InferAsync.
2 years ago
Haiping
9f9903c711
Merge pull request #173 from martindevans/async_only
Remove non-async
2 years ago
Martin Evans
3f80190f85
Minimal changes required to remove non-async inference.
2 years ago
Regenhardt Marlon
0049dcfb66
Coding Assistant: Improve user information and link to source paper
2 years ago
Regenhardt Marlon
71f4e2ae4a
Coding Assistant: Use GGUF model
2 years ago
Marlon Regenhardt
2386be58c4
Example: Add code assistant using Code Llama-7b-Instruct as default
2 years ago
Tim Miller
94a395240a
Bump example, readme
2 years ago
Tim Miller
bea1ca9f8d
Update
2 years ago
Tim Miller
35266d8112
Remove MemorySkill sample
2 years ago
Tim Miller
521f068d64
Add Embedding for Semantic Kernel
2 years ago
Rinne
1533ee7dbf
Merge pull request #138 from drasticactions/semantic-kernel
Enable Semantic kernel support
2 years ago
Tim Miller
d4a57fffef
README, Cleanup
2 years ago
Tim Miller
2bde188c64
Change Namespace
2 years ago
Tim Miller
a81edacbfb
Remove embedding for now
2 years ago
Tim Miller
98bfbe1d50
Update
2 years ago
Tim Miller
9a1d6f99f2
Add Semantic Kernel support
2 years ago
Martin Evans
a70c7170dd
- Created a higher level `Grammar` class which is immutable and contains a list of grammar rules. This is the main "entry point" to the grammar system.
- Made all the mechanics of grammar parsing (GBNFGrammarParser, ParseState) internal. Just call `Grammar.Parse("whatever")`.
- Added a `GrammarRule` class which validates elements on construction (this allows constructing grammar without parsing GBNF).
- It should be impossible for a `GrammarRule` to represent an invalid rule.
2 years ago
Mihai
71f02e08c2
Add grammar example
2 years ago
Martin Evans
5a6c6de0dc
Merge pull request #115 from martindevans/model_params_record
ModelParams record class
2 years ago
Martin Evans
829f32b27d
- Added `Obsolete` attributes to the entire `OldVersion` namespace, so they can be removed in the future
- Minor changes to cleanup some of the compiler warnings
2 years ago
Martin Evans
29df14cd9c
Converted ModelParams into a `record` class. This has several advantages:
- Equality, hashing etc all implemented automatically
- Default values are defined in just one place (the properties) instead of the constructor as well
- Added test to ensure that serialization works properly
2 years ago
Martin Evans
2830e5755c
- Applied a lot of minor R# code quality suggestions. Lots of unnecessary imports removed.
- Deleted `NativeInfo` (internal class, not used anywhere)
2 years ago
Martin Evans
a45d9089e1
Fixed demos
2 years ago
Martin Evans
a9e6f21ab8
- Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring!
- Passing encoding in the `IModelParams`, which reduces how often encoding needs to be passed around
2 years ago
Martin Evans
ae8ef17a4a
- Added various convenience overloads to `LLamaContext.Eval`
- Converted `SafeLLamaContextHandle` to take a `ReadOnlySpan` for Eval, narrower type better represents what's really needed
2 years ago
Martin Evans
02a46fc363
Updated demos to use the new loading/multi context system
2 years ago
Martin Evans
e2fe08a9a2
Added a higher level `LLamaWeights` wrapper around `SafeLlamaModelHandle`
2 years ago
Martin Evans
f31bdf6b93
Using the right context for Bob
2 years ago
Martin Evans
f3511e390f
WIP demonstrating changes to support multi-context. You can see this in use in `TalkToYourself`, along with notes on what still needs improving.
The biggest single change is renaming `LLamaModel` to `LLamaContext`
2 years ago
Yaohui Liu
1062fe1a7e
feat: upgrade the native libraries.
2 years ago
Yaohui Liu
eed96248b5
docs: add the documentation with mkdocs.
2 years ago
Yaohui Liu
2eb2d6df83
test: add 9 examples of the new version.
2 years ago
Yaohui Liu
bdbd6aa824
feat: add transforms for chat session.
2 years ago
Yaohui Liu
3bf74ec9b9
feat: add chat session for refactored code.
2 years ago
Yaohui Liu
908b79e855
feat: add stateless executor.
2 years ago
Yaohui Liu
e603a09137
fix: state loading and saving not working.
2 years ago