Martin Evans
ec49bdd6eb
- Most importantly: fixed an issue in `SamplingApi` where `Memory` was pinned but never unpinned!
- Moved repeated code to convert `LLamaTokenDataArray` into a `LLamaTokenDataArrayNative` into a helper method.
- Modified all call sites to dispose the `MemoryHandle`.
- Avoided one copy of the `List<LLamaTokenData>` into a `LLamaTokenData[]` in `LlamaModel`.
2 years ago
Rinne
4d7d4f2bfe
Merge pull request #59 from saddam213/master
Instruct & Stateless web example implemented
2 years ago
Rinne
66d6b00b49
Merge pull request #57 from martindevans/larger_states
Larger states
2 years ago
Rinne
c5e8b3eba2
Merge pull request #56 from martindevans/memory_mapped_save_loading_and_saving
Memory Mapped LoadState/SaveState
2 years ago
Rinne
dee9afc471
Merge pull request #55 from martindevans/removed_dictionary_extensions
Cleaned up unnecessary extension methods
2 years ago
Rinne
d17fa991cc
Merge pull request #53 from martindevans/xml_docs_fixes
XML docs fixes
2 years ago
Rinne
ae98fa19b1
Merge pull request #52 from martindevans/docs_spelling_and_grammar
Documentation Spelling/Grammar
2 years ago
sa_ddam213
3fec7a63c7
Add Instruct and Stateless support
2 years ago
sa_ddam213
a32a5e4ffe
Decouple connectionId from ModelSession
2 years ago
sa_ddam213
d9fbd56f10
Strongly type connection status
2 years ago
sa_ddam213
ef8cf0b283
Add RequestVerificationToken logic for ajax prefilter, Tidy up js cancel logic
2 years ago
sa_ddam213
e574d89a40
Send prompt on Enter key
2 years ago
Rinne
ac7f1865ee
Merge pull request #51 from fwaris/master
fix breaking change in llama.cpp; bind to latest version llama.cpp to…
2 years ago
Rinne
36ad09790c
Merge branch 'master' into master
2 years ago
Rinne
98825d8a9b
Merge pull request #48 from saddam213/master
Basic ASP.NET Core website example
2 years ago
Rinne
1b0523f630
Merge branch 'master' into master
2 years ago
Rinne
098d5b1544
Merge pull request #47 from SignalRT/master
MacOS metal support
2 years ago
SignalRT
e5d885050e
Align llama.cpp binaries
2 years ago
Martin Evans
f3fa73de2b
Implemented a new `LlamaModel.State` handle which internally stores the state as natively allocated memory. This allows it to exceed the 2GB limit on C# arrays.
2 years ago
Martin Evans
4d72420a04
Replaced `SaveState` and `LoadState` implementations. These new implementations map the file into memory and then pass the pointer directly into the native API. This improves things in two ways:
- A C# array cannot exceed 2,147,483,591 bytes. In my own use of LLamaSharp I encountered this limit.
- This saves an extra copy of the entire state data into a C# `byte[]`, so it should be faster.
This does _not_ fix some other places where `GetStateData` is used. I'll look at those in a separate PR.
2 years ago
Martin Evans
18462beb31
- Removed the `Update` and `GetOrDefault` extension methods (they were unused).
- Renamed `DictionaryExtensions` to `KeyValuePairExtensions`, since nothing in that file extends dictionary any more!
2 years ago
Martin Evans
2e76b79af6
Various minor XML docs fixes
2 years ago
Martin Evans
b39805dfcc
Fixed some spelling and grammar mistakes in the documentation.
2 years ago
Faisal Waris
17838bba49
fix breaking change in llama.cpp; bind to latest version llama.cpp to support new quantization method
2 years ago
sa_ddam213
a139423581
Move session management to service, Use ILLamaExecutor in session to make more versatile, scroll bug
2 years ago
SignalRT
a5c089e7b1
Update llama.cpp libraries
Keep update binaries
2 years ago
sa_ddam213
1ec59e120a
Move session management to service, Infer cancel support
2 years ago
sa_ddam213
fd215dce84
Update Readme
2 years ago
sa_ddam213
21b685649f
Add Readme
2 years ago
sa_ddam213
0c6b213b0d
Basic ASP.NET Core website example, Interactive started
2 years ago
SignalRT
56a37a0d7d
Update to latest llama.cpp
Adapt the interface change in llama_backend_init
2 years ago
unknown
dba866ffcf
Update API method name
2 years ago
SignalRT
b1019ae46f
Update the latest llama.cpp metal libraries
2 years ago
SignalRT
fb9e38d3e8
Update llama.cpp
Update with all new changes
2 years ago
jlsantiago
8de49beb35
Merge branch 'SciSharp:master' into master
2 years ago
Rinne
a53ede191e
Merge pull request #39 from xbotter/webapi-example
update webapi example
2 years ago
xbotter
a074385568
add stream example
2 years ago
xbotter
16f2cb9c4e
add history chat example
2 years ago
xbotter
2fe4ccfb58
update webapi example
2 years ago
SignalRT
37975f405f
Libraries with MacOS Metal Support
Add metal libraries and ggml-metal.metal helper
2 years ago
Yaohui Liu
f836a2c4bf
docs: update readme.
2 years ago
Yaohui Liu
e3bcfaa85b
docs: update readme to match v0.4.1.
2 years ago
Rinne
9c5bca5f6a
Merge pull request #36 from SignalRT/master
Update libllama.dylib
2 years ago
SignalRT
2fde2020a5
Update libllama.dylib
Align llama.cpp MacOS Dynamic Link Library
2 years ago
Rinne
dbfe8256cf
Merge pull request #32 from drasticactions/runtime-targets
Remove native libraries from LLama.csproj and replace it with a targets file.
2 years ago
Rinne
0269af8c17
Merge branch 'master' into runtime-targets
2 years ago
Yaohui Liu
1062fe1a7e
feat: upgrade the native libraries.
2 years ago
Yaohui Liu
40fb9afdf6
docs: update gitignore file.
2 years ago
Yaohui Liu
565cf2438f
docs: remove the site files.
2 years ago
Yaohui Liu
05194b03b8
docs: fix the typo in the documentation.
2 years ago