Martin Evans
|
328022b13d
|
Fixed merge conflicts
|
2 years ago |
Martin Evans
|
7ec318aab5
|
Added logging to embedder too
|
2 years ago |
Martin Evans
|
f1e5a8f995
|
- Passing the `ILogger` through to every call of `CreateContext`
- Passing `ILogger` into executors
|
2 years ago |
sa_ddam213
|
4ec9aed47a
|
Revert LLamasSharp project changes
|
2 years ago |
sa_ddam213
|
b4b4000342
|
Merge branch 'master' into upstream_master
# Conflicts:
# LLama.Web/Common/ModelOptions.cs
# LLama.Web/Services/ConnectionSessionService.cs
# LLama/LLamaStatelessExecutor.cs
# LLama/LLamaWeights.cs
|
2 years ago |
Martin Evans
|
e89ca5cc17
|
Fixed a few minor warnings
|
2 years ago |
Martin Evans
|
9daf586ba8
|
Assorted cleanup leftover after the huge change in the last PR (comments, syntax style, etc)
|
2 years ago |
Martin Evans
|
d8434ea9d6
|
Merge pull request #185 from martindevans/wip_major_api_change
Major llama.cpp API Change
|
2 years ago |
Martin Evans
|
1f8c94e386
|
Added in the `special` parameter to the tokenizer (introduced in https://github.com/ggerganov/llama.cpp/pull/3538)
|
2 years ago |
Martin Evans
|
efb0664df0
|
- Added new binaries
- Fixed stateless executor out-of-context handling
- Fixed token tests
|
2 years ago |
Martin Evans
|
b8f0eff080
|
- Added `GetCharCountImpl` tests, fixed handling of empty strings
- Added ifdef to remove `Deconstruct` extension on everything except `NETSTANDARD2_0`
|
2 years ago |
Martin Evans
|
45118520fa
|
- Improved coverage of `GBNFGrammarParser` up to 96%
- Covered text transforms
- Removed unnecessary non-async transforms
|
2 years ago |
Martin Evans
|
2a38808bca
|
- Added threads to context params, replaced all thread args with `uint?`
- Replaced all binaries
|
2 years ago |
Martin Evans
|
4e9b1f8cdc
|
- Split extension methods into separate files
|
2 years ago |
sa_ddam213
|
9b8de007dc
|
Propagate ILogger
|
2 years ago |
Martin Evans
|
669ae47ef7
|
- Split parameters into two interfaces
- params contains a list of loras, instead of just one
|
2 years ago |
Martin Evans
|
9a0a0ae9fe
|
Removed cloning support
|
2 years ago |
Martin Evans
|
0d40338692
|
Fixed out-of-context handling in stateless executor
|
2 years ago |
Martin Evans
|
b306ac23dd
|
Added `Decode` method to `SafeLLamaContextHandle`
|
2 years ago |
Martin Evans
|
9e958e896b
|
safe handle for batch
|
2 years ago |
Martin Evans
|
ce1fc51163
|
Added some more native methods
|
2 years ago |
Martin Evans
|
bca55eace0
|
Initial changes to match the llama.cpp changes
|
2 years ago |
Martin Evans
|
d58fcbbd13
|
Fixed antiprompt checking
|
2 years ago |
Martin Evans
|
08f1615e60
|
- Converted LLamaStatelessExecutor to run `Exec` calls inside an awaited task. This unblocks async callers while the model is being evaluated.
- Added a "spinner" to the `StatelessModeExecute` demo, which spins while waiting for the next token (demonstrating that it's not blocked).
|
2 years ago |
Martin Evans
|
fe54f6764f
|
- Added unit tests for extension methods
- Removed unused `AddRangeSpan` extension
|
2 years ago |
Haiping
|
79fa74d59c
|
Merge pull request #177 from redthing1/fix/context-getstate
fix opaque GetState (fixes #176)
|
2 years ago |
redthing1
|
b78044347c
|
fix opaque GetState (fixes #176)
|
2 years ago |
Haiping
|
e1af7a96da
|
Merge pull request #175 from redthing1/feat/inferenceparams_record
make InferenceParams a record so we can use `with`
|
2 years ago |
redthing1
|
296ba607de
|
make InferenceParams a record so we can use with
|
2 years ago |
Haiping
|
10678a83d6
|
Merge pull request #65 from martindevans/alternative_dependency_loading
CPU Feature Detection
|
2 years ago |
Haiping
|
f134c5af59
|
Merge pull request #163 from SignalRT/DefaultMetal
MacOS default build now is metal llama.cpp #2901
|
2 years ago |
Martin Evans
|
3f80190f85
|
Minimal changes required to remove non-async inference.
|
2 years ago |
Martin Evans
|
b1e9d8240d
|
Merge pull request #149 from martindevans/removed_unused_inference_params
Removed unused properties of `InferenceParams` & `ModelParams`
|
2 years ago |
Martin Evans
|
daf09eae64
|
Skipping tokenization of empty strings (saves allocating an empty array every time)
|
2 years ago |
Martin Evans
|
466722dcff
|
Merge pull request #165 from martindevans/better_instruct_antiprompt_checking
better_instruct_antiprompt_checking
|
2 years ago |
Martin Evans
|
d08a125020
|
Using the `TokensEndsWithAnyString` extensions for antiprompt checking in instruct executor. Simpler and more efficient.
|
2 years ago |
Martin Evans
|
bba801f4b7
|
Added a property to get the KV cache size from a context
|
2 years ago |
SignalRT
|
c41e448d0e
|
ggml-metal.metal MUST be copied to output folder
Metal depends on this file to execute, and MacOS llama.cpp defaults is now METAL.
|
2 years ago |
SignalRT
|
096293a026
|
MacOS Remove Metal as is the current default
See on Mac OS enable Metal by default #2901
|
2 years ago |
Martin Evans
|
b47977300a
|
Removed one more unused parameter
|
2 years ago |
Martin Evans
|
a1b0349561
|
Removed `ModelAlias` property (unused)
|
2 years ago |
Martin Evans
|
4dac142bd5
|
Merge pull request #160 from martindevans/GetState_fix
`GetState()` fix
|
2 years ago |
Martin Evans
|
832bf7dbe0
|
Simplified implementation of `GetState` and fixed a memory leak (`bigMemory` was never freed)
|
2 years ago |
Martin Evans
|
4f7b6ffdcc
|
Removed `GenerateResult` method that was only used in one place
|
2 years ago |
sa_ddam213
|
09d8f434f2
|
Extract LLamaLogLevel, Remove Logger class
|
2 years ago |
sa_ddam213
|
949b0cde16
|
Replace ILLamaLogger for ILogger
|
2 years ago |
sa_ddam213
|
70b36f8996
|
Add Microsoft.Extensions.Logging.Abstractions, update any required deps
|
2 years ago |
Martin Evans
|
d3b8ee988c
|
Beam Search (#155)
* Added the low level bindings to beam search.
|
2 years ago |
Martin Evans
|
a09aa86324
|
Merge pull request #153 from martindevans/fix_savestate_OpenOrCreate
Changed `OpenOrCreate` to `Create`
|
2 years ago |
Martin Evans
|
f366aa3abe
|
Changed `OpenOrCreate` to `Create` to fix #151
|
2 years ago |