LLamaSharp

Commit Graph

Author	SHA1	Message	Date
Martin Evans	c325ac9127	April 2024 Binary Update (#662 ) * Updated binaries, using [this build](https://github.com/SciSharp/LLamaSharp/actions/runs/8654672719/job/23733195669) for llama.cpp commit `f7001ccc5aa359fcf41bba19d1c99c3d25c9bcc7`. - Added all new functions. - Moved some functions (e.g. `SafeLlamaModelHandle` specific functions) into `SafeLlamaModelHandle.cs` - Exposed tokens on `SafeLlamaModelHandle` and `LLamaWeights` through a `Tokens` property. As new special tokens are added in the future they can be added here. - Changed all token properties to return nullable tokens, to handle some models not having some tokens. - Fixed `DefaultSamplingPipeline` to handle no newline token in some models. * Moved native methods to more specific locations. - Context specific things have been moved into `SafeLLamaContextHandle.cs` and made private - they're exposed through C# properties and methods already. - Checking that GPU layer count is zero if GPU offload is not supported. - Moved methods for creating default structs (`llama_model_quantize_default_params` and `llama_context_default_params`) into relevant structs. * Removed exception if `GpuLayerCount > 0` when GPU is not supported. * - Added low level wrapper methods for new per-sequence state load/save in `SafeLLamaContextHandle` - Added high level wrapper methods (save/load with `State` object or memory mapped file) in `LLamaContext` - Moved native methods for per-sequence state load/save into `SafeLLamaContextHandle` * Added update and defrag methods for KV cache in `SafeLLamaContextHandle` * Updated submodule to `f7001ccc5aa359fcf41bba19d1c99c3d25c9bcc7` * Passing the sequence ID when saving a single sequence state	1 year ago
Zoli Somogyi	91e5a3f543	Code optimization	1 year ago
Zoli Somogyi	127c3edd44	KernelMemory update with adding the use of already loaded model When using KernelMemory one may have already loaded a model which can then be used with this extension instead of loading the model again.	1 year ago
Kenneth Tang	9e4109f774	Unable to load the model onto multiple GPUs (#617 )	1 year ago
Zoli Somogyi	f578fcafa3	KernelMemory EmbeddingMode bug correction	1 year ago
xbotter	13a312b4ec	update sk to 1.0.0-rc3 & km to 0.18	1 year ago
xbotter	d1e2a4750b	🔧 Update KernelMemory configuration - Update LLamaSharpTextEmbeddingGeneration and LLamaSharpTextGeneration - Add Microsoft.KernelMemory.Core package reference - Update Microsoft.KernelMemory.Abstractions package reference	2 years ago
xbotter	286904920b	update DefaultInferenceParams in WithLLamaSharpDefaults	2 years ago
Yaohui Liu	46f01bbc94	feat(kernel-memory): avoid loading model twice.	2 years ago
xbotter	a3a6dc3a1b	fix comments	2 years ago
xbotter	3de9d34093	update package to KernelMemory	2 years ago
xbotter	a49438e6e6	🔧 Add Semantic Memory SDK to LLama.Examples Added a project reference to LLama.KernelMemory in the LLama.Examples.csproj file. 🔧 Add KernelMemory class and update TestRunner Added a new class KernelMemory to the LLama.Examples.NewVersion namespace, which includes a Run method that demonstrates the usage of Semantic Kernel Memory. The class imports a sample PDF document and asks a question to the memory, displaying the answer and relevant sources. Updated the TestRunner class in the same namespace to include an option (choice 15) to run the KernelMemory example. 🔧 Fix typo in LLamaSharpTextEmbeddingGeneration class Fixed a typo in the LLamaSharpTextEmbeddingGeneration class where the LlamaSharpConfig variable was incorrectly named as LlamaSharpConfig. Added XML documentation for the LLamaSharpTextEmbeddingGeneration constructor, Dispose method, and GenerateEmbeddingsAsync method. Summary: - Added project reference to LLama.KernelMemory and LLama.SemanticKernel in LLama.Examples.csproj - Added KernelMemory class to demonstrate Semantic Kernel Memory usage - Updated TestRunner class to include option for running KernelMemory example - Fixed typo in LLamaSharpTextEmbeddingGeneration class - Added XML documentation for constructor, Dispose method, and GenerateEmbeddingsAsync method in LLamaSharpTextEmbeddingGeneration class	2 years ago

12 Commits (b1f3987fae88fa85a9655c12869b9505f58c8d3e)