LLamaSharp

Commit Graph

Author	SHA1	Message	Date
Martin Evans	829f32b27d	- Added `Obsolete` attributes to the entire `OldVersion` namespace, so they can be removed in the future - Minor changes to cleanup some of the compiler warnings	2 years ago
zombieguy	45b01d5a78	Improved type conversion Type conversion is now done in the property rather than the utils class and uses the System.Convert class to ensure consistency.	2 years ago
Martin Evans	2830e5755c	- Applied a lot of minor R# code quality suggestions. Lots of unnecessary imports removed. - Deleted `NativeInfo` (internal class, not used anywhere)	2 years ago
Martin Evans	4b7d718551	Added native symbol for CFG	2 years ago
Martin Evans	759ae26f36	Merge branch 'master' into grammar_basics	2 years ago
Martin Evans	a9e6f21ab8	- Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring! - Passing encoding in the `IModelParams`, which reduces how often encoding needs to be passed around	2 years ago
Martin Evans	ae8ef17a4a	- Added various convenience overloads to `LLamaContext.Eval` - Converted `SafeLLamaContextHandle` to take a `ReadOnlySpan` for Eval, narrower type better represents what's really needed	2 years ago
Martin Evans	64416ca23c	- Created a slightly nicer way to create grammar (from `IReadOnlyList<IReadOnlyList<LLamaGrammarElement>>`) - Integrated grammar into sampling - Added a test for the grammar sampling	2 years ago
Martin Evans	0294bb1303	Some of the basics of the grammar API	2 years ago
Rinne	62331852bc	Merge pull request #90 from martindevans/proposal_multi_context Multi Context	2 years ago
zombieguy	10f88ebd0e	Potential fix for .Net Framework issues (#103 ) * Added a bool to sbyte Utils convertor As an attempt to avoid using any MarshalAs attribute for .Net Framework support this Utils method will take in a bool value and return a 1 for true or 0 for false sbyte. * Changed all bool "MarshalAs" types to sbytes Changed all previous BOOL types with "MarshalAs" attributes to SBYTEs and changed all the setters of them to use the Utils.BoolToSignedByte() convertor method. * Fixed Utils bool convertor & added sbyte to bool Improved the Utils bool convertor just casting an sbyte value to get rid of the unneeded sbyte array and added an sbyte to bool convertor to convert back the way to a C# bool assuming any positive value above 0 is a bool and no bools are packed in the single byte integer. * bool to & from sbyte conversions via properties All 1byte bools are now handled where they "sit", via public properties which perform the conversions to keep all external data able to communicate as it did before.	2 years ago
Martin Evans	6c84accce8	Added `llama_sample_classifier_free_guidance` method from native API	2 years ago
Martin Evans	479ff57853	Renamed `EmbeddingCount` to `EmbeddingSize`	2 years ago
Martin Evans	d0a7a8fcd6	- Cleaned up disposal in LLamaContext - sealed some classes not intended to be extended	2 years ago
Martin Evans	f3511e390f	WIP demonstrating changes to support multi-context. You can see this in use in `TalkToYourself`, along with notes on what still needs improving. The biggest single change is renaming `LLamaModel` to `LLamaContext`	2 years ago
Martin Evans	d7f971fc22	Improved `NativeApi` file a bit: - Added some more comments - Modified `llama_tokenize` to not allocate - Modified `llama_tokenize_native` to take a pointer instead of an array, allowing use with no allocations - Removed GgmlInitParams (not used)	2 years ago
Martin Evans	841cf88e3b	Merge pull request #96 from martindevans/minor_quantizer_improvements Minor quantizer improvements	2 years ago
Martin Evans	ce325b49c7	Rewritten comments	2 years ago
sa_ddam213	726987b761	Add native logging output	2 years ago
Martin Evans	acd91341e6	Added lots of comments to all the LLamaFtype variants	2 years ago
Martin Evans	2b2d3af26b	Moved `Eval` out of `Utils` and into `SafeLLamaContextHandle`	2 years ago
Martin Evans	0e5e00e300	Moved `TokenToString` from Utils into `SafeLLamaContextHandle` (thin wrappers around the same method in `SafeLlamaModelHandle`)	2 years ago
Martin Evans	2d811b2603	- Moved `GetLogits` into `SafeLLamaContextHandle` - Added disposal check into `SafeLLamaContextHandle`	2 years ago
Martin Evans	cd3cf2b77d	- Moved tokenization from `Utils.Tokenize` into `SafeLLamaContextHandle.Tokenize`, one less thing in `Utils`. - Also refactored it to return an `int[]` instead of an `IEnumerable<int>`, solving the "multiple enumeration" problems at the source!	2 years ago
Rinne	bfe9cc8961	Merge pull request #78 from SciSharp/rinne-dev feat: update the llama backends.	2 years ago
Yaohui Liu	bb46a990d0	fix: add bug info for native api.	2 years ago
sa_ddam213	372894e1d4	Expose some native classes	2 years ago
SignalRT	348f2c7d72	Update llama.cpp binaries to 5f631c2 and align the context to that version It solves the problem with netstandard2 (is it really netstandard2 a thing right now?) Change context to solve problems. `5f631c2679`	2 years ago
Rinne	8d37abd787	Merge pull request #68 from martindevans/sampling_improvements Fixed Memory pinning in Sampling API	2 years ago
Martin Evans	add3d5528b	Removed `MarshalAs` on array	2 years ago
Martin Evans	2245b84906	Update LLamaContextParams.cs	2 years ago
sa_ddam213	3e252c81f6	LLamaContextParams epsilon and tensor split changes	2 years ago
Martin Evans	ec49bdd6eb	- Most importantly: Fixed issue in `SamplingApi`, `Memory` was pinned, but never unpinned! - Moved repeated code to convert `LLamaTokenDataArray` into a `LLamaTokenDataArrayNative` into a helper method. - Modified all call sites to dispose the `MemoryHandle` - Saved one copy of the `List<LLamaTokenData>` into a `LLamaTokenData[]` in `LlamaModel`	2 years ago
Martin Evans	6985d3ab60	Added comments on two properties	2 years ago
Martin Evans	c974c8429e	Removed leftover `using`	2 years ago
Martin Evans	afb9d24f3a	Added model `Tokenize` method	2 years ago
Martin Evans	369c915afe	Added TokenToString conversion on model handle	2 years ago
Martin Evans	b721072aa5	Exposed some extra model properties on safe handle	2 years ago
Martin Evans	44b1e93609	Moved LoRA loading into `SafeLlamaModelHandle`	2 years ago
Martin Evans	c95b14d8b3	- Fixed null check - Additional comments	2 years ago
Martin Evans	f16aa58e12	Updated to use the new loading system in llama (llama_state). This new system has split model weights and contexts into two separate things, allowing one set of weights to be shared between many contexts. This change _only_ implements the low level API and makes no effort to update the LlamaSharp higher level abstraction. It is built upon llama `b3f138d`, necessary DLLs are not included in this commit.	2 years ago
Rinne	c5e8b3eba2	Merge pull request #56 from martindevans/memory_mapped_save_loading_and_saving Memory Mapped LoadState/SaveState	2 years ago
Rinne	d17fa991cc	Merge pull request #53 from martindevans/xml_docs_fixes XML docs fixes	2 years ago
Rinne	1b0523f630	Merge branch 'master' into master	2 years ago
Martin Evans	4d72420a04	Replaced `SaveState` and `LoadState` implementations. These new implementations map the file into memory and then pass the pointer directly into the native API. This improves things in two ways: - A C# array cannot exceed 2,147,483,591 bytes. In my own use of LlamaSharp I encountered this limit. - This saves an extra copy of the entire state data into a C# `byte[]`, so it should be faster. This does _not_ fix some other places where `GetStateData` is used. I'll look at those in a separate PR.	2 years ago
Martin Evans	2e76b79af6	Various minor XML docs fixes	2 years ago
SignalRT	56a37a0d7d	Update to lates llama.cpp Adapt the interface change in llama_backend_init	2 years ago
unknown	dba866ffcf	Update API method name	2 years ago
Yaohui Liu	1062fe1a7e	feat: upgrade the native libraries.	2 years ago
Yaohui Liu	9850417a12	feat: update quantize native params.	2 years ago

1 2

67 Commits (829f32b27dcf99bb535ab7c47ab132e1d68ddff3)