Martin Evans
466722dcff
Merge pull request #165 from martindevans/better_instruct_antiprompt_checking
better_instruct_antiprompt_checking
2 years ago
Martin Evans
d08a125020
Using the `TokensEndsWithAnyString` extensions for antiprompt checking in instruct executor. Simpler and more efficient.
2 years ago
Martin Evans
bba801f4b7
Added a property to get the KV cache size from a context
2 years ago
SignalRT
c41e448d0e
ggml-metal.metal MUST be copied to output folder
Metal depends on this file to execute, and MacOS llama.cpp defaults is now METAL.
2 years ago
SignalRT
096293a026
MacOS Remove Metal as is the current default
See on Mac OS enable Metal by default #2901
2 years ago
Martin Evans
b47977300a
Removed one more unused parameter
2 years ago
Martin Evans
a1b0349561
Removed `ModelAlias` property (unused)
2 years ago
Martin Evans
4dac142bd5
Merge pull request #160 from martindevans/GetState_fix
`GetState()` fix
2 years ago
Martin Evans
832bf7dbe0
Simplified implementation of `GetState` and fixed a memory leak (`bigMemory` was never freed)
2 years ago
Martin Evans
4f7b6ffdcc
Removed `GenerateResult` method that was only used in one place
2 years ago
sa_ddam213
09d8f434f2
Extract LLamaLogLevel, Remove Logger class
2 years ago
sa_ddam213
949b0cde16
Replace ILLamaLogger for ILogger
2 years ago
sa_ddam213
70b36f8996
Add Microsoft.Extensions.Logging.Abstractions, update any required deps
2 years ago
Martin Evans
d3b8ee988c
Beam Search ( #155 )
* Added the low level bindings to beam search.
2 years ago
Martin Evans
a09aa86324
Merge pull request #153 from martindevans/fix_savestate_OpenOrCreate
Changed `OpenOrCreate` to `Create`
2 years ago
Martin Evans
f366aa3abe
Changed `OpenOrCreate` to `Create` to fix #151
2 years ago
Martin Evans
77bd090150
Simplified `LLamaInteractExecutor` antiprompt matching by using new extension method
2 years ago
Martin Evans
614ba40948
- Added a `TokensEndsWithAnyString` extension to `IReadOnlyList<int>` which efficiently checks if a set of tokens ends with one of a set of strings.
- Minimal amount of characters converted
- Allocation free
- Added `TokensToSpan` to `SafeLlamaModelHandle` which converts as many tokens as possible into a character span
- Allocation free
2 years ago
Martin Evans
d79a6556a1
Removed 3 unused properties of `InferenceParams`
2 years ago
Martin Evans
6a842014ac
Removed duplicate `llama_sample_classifier_free_guidance` method
2 years ago
Martin Evans
4a53cdc56b
Merge pull request #142 from SciSharp/rinne-dev
refactor: remove old version files.
2 years ago
Martin Evans
33035c82bf
- Removed `LLamaNewlineTokens` from `InteractiveExecutorState`. This is always set in the constructor from the context, so there's no point serializing it.
2 years ago
Yaohui Liu
18294a725e
refactor: remove old version files.
2 years ago
Martin Evans
8f58a40fb9
Added Linux dependency loading
2 years ago
Martin Evans
dd4957471f
Changed paths to match what the GitHub build action produces
2 years ago
Martin Evans
756a1ad0ba
Added a new way to load dependencies, performing CPU feature detection
2 years ago
Martin Evans
025741a73e
Fixed My Name
The D is for my middle name 😄
2 years ago
Yaohui Liu
20b5363601
fix: remove the history commit of embedding length property.
2 years ago
Yaohui Liu
3a847623ab
docs: update the docs to follow new version.
2 years ago
Yaohui Liu
ca6624edb3
Merge branch 'master' of github.com:SciSharp/LLamaSharp into rinne-dev
2 years ago
Rinne
4e83e48ad1
Merge pull request #122 from martindevans/gguf
Add GGUF support
2 years ago
Martin Evans
97349d93be
Merge branch 'gguf' of github.com:martindevans/LLamaSharp into gguf
2 years ago
Martin Evans
bcf06e2652
Added some comments on various native methods
2 years ago
Martin Evans
af680ac2d7
Created a hierarchy of exceptions for grammar format issues. This allows the base catch-all exception to be caught for general handling, or more specific exceptions to be caught for more specific handling.
2 years ago
Rinne
1533ee7dbf
Merge pull request #138 from drasticactions/semantic-kernel
Enable Semantic kernel support
2 years ago
Tim Miller
326c802be7
Have weights generate context
2 years ago
Tim Miller
3bca3b632e
New line
2 years ago
Tim Miller
9a1d6f99f2
Add Semantic Kernel support
2 years ago
Martin Evans
a70c7170dd
- Created a higher level `Grammar` class which is immutable and contains a list of grammar rules. This is the main "entry point" to the grammar system.
- Made all the mechanics of grammar parsing (GBNFGrammarParser, ParseState) internal. Just call `Grammar.Parse("whatever")`.
- Added a `GrammarRule` class which validates elements on construction (this allows constructing grammar without parsing GBNF).
- It should be impossible for a `GrammarRule` to represent an invalid rule.
2 years ago
SignalRT
fb007e5921
Changes to compile in VS Mac + change model to llama2
This commit includes changes to compile en VS Mac + changest to use llama2 not codellama.
It includes MacOS binaries in memory and metal
2 years ago
Mihai
24d3e1bfa8
Address PR review comment
2 years ago
Mihai
60790c5aac
Address code review comments (create custom exception, move printing to the ParseState class, rethrow error).
2 years ago
Mihai
2ae1891c13
Bug fixes after running tests.
SymbolIds is now SortedDictionary (although I'm not sure it really needs to be) because the test was failing due to expected value being in another order. The C++ data structure if SymbolIds is std::map<std::string, uint32_t> so the items are ordered by key.
2 years ago
Mihai
0bd495276b
Add initial tests + fix bugs. Still WIP since the test is failing.
2 years ago
Mihai
0f373fcc6d
Finish grammar_parser translation from C++ to C#
2 years ago
Mihai
3c919b56fe
Use ReadOnlySpan everywhere instead of ReadOnlyMemeory and instead of returning tuple, reference the ReadOnlySpan.
2 years ago
Mihai
8b4ec6d973
Address PR change requests
2 years ago
Mihai
7f31276bdf
[WIP] Translating the GrammarParser
2 years ago
Martin Evans
c9d08b943e
Added binaries for CUDA+Linux
2 years ago
Martin Evans
6711a59d0f
Included Linux deps
2 years ago