LLamaSharp

Commit Graph

Author	SHA1	Message	Date
Rinne	0773e68111	fix doc ci	1 year ago
Rinne	8be9e8ae70	fix doc ci	1 year ago
Rinne	cb81295524	fix doc ci	1 year ago
Rinne	76be973689	fix doc ci	1 year ago
Rinne	53d68ca075	fix ci	1 year ago
Rinne	9b0850f066	fix ci	1 year ago
Rinne	66eb09b816	fix ci	1 year ago
Rinne	4f56e7297f	fix ci	1 year ago
Rinne	47bf9a8f66	fix ci	1 year ago
Rinne	ae24be0215	fix ci	1 year ago
Rinne	e63fa778fa	trigger workflow on doc_ci branch.	1 year ago
Rinne	c90f97021b	fix ci	1 year ago
Rinne	23a5cb5355	disable build ci.	1 year ago
Rinne	f5f0ab4502	ci: initialize the doc deployment ci.	1 year ago
Rinne	b9cec018c0	temporarily disable build ci.	1 year ago
Rinne	b9444452eb	docs: refactor the documentations.	1 year ago
Rinne	7e53bac1f0	docs: update README.	1 year ago
Rinne	156f369e33	Merge pull request #609 from SignalRT/LlavaExecutor Feature: LLava executor	1 year ago
Rinne	5e17e0f7c7	Merge pull request #630 from zsogitbe/master KernelMemory update with adding the use of already loaded model	1 year ago
Zoli Somogyi	91e5a3f543	Code optimization	1 year ago
Zoli Somogyi	127c3edd44	KernelMemory update with adding the use of already loaded model When using KernelMemory one may have already loaded a model which can then be used with this extension instead of loading the model again.	1 year ago
Rinne	9ee6ae3319	Merge pull request #615 from ChengYen-Tang/master [LLama.KernelMemory] Fixed System.ArgumentException: EmbeddingMode must be true & #617	1 year ago
SignalRT	bc487decae	Delete default prompt	1 year ago
SignalRT	43677c511c	Change interface to support multiple images and add the capabitlity to render the image in the console	1 year ago
SignalRT	2d9a114f66	Include comments and include some checks	1 year ago
SignalRT	f66044fba2	Restore CI on master	1 year ago
SignalRT	8907adcd8e	Clean up duplicate property	1 year ago
SignalRT	4c013aefd4	Disable Llava Embed test on CI	1 year ago
SignalRT	e8732efadd	Example InteractiveExecutor Add an Example and modifications to the interactive executor to enable Llava Models. Just a preview / demo	1 year ago
SignalRT	59a3323c94	Temporary change to compile on current branch	1 year ago
SignalRT	df6a207e95	Revert "Try only to add cublas for the moment" This reverts commit `5fda26c610`.	1 year ago
SignalRT	6589878314	Try only to add cublas for the moment	1 year ago
SignalRT	23a2df7aff	Add Cuda llava_shared library	1 year ago
Rinne	b677cdc6a3	Merge pull request #560 from eublefar/feature/chat-session-state-management Chat session state management	1 year ago
Martin Evans	e2705be6c8	Fixed off by one error in LLamaBatch sampling position (#626 )	1 year ago
Martin Evans	a2d3a847dd	Disabled LLava tests, they're too slow and are crashing CI (#625 )	1 year ago
Martin Evans	91d72e7465	Keeping track of positions where logits will be generated in a batch and what sequence those logits are associated with. (#624 )	1 year ago
eublefar	b8cd5b7ee5	loadTransforms flag for LoadSession methods	1 year ago
eublefar	9440f153da	Make process message method more flexible	1 year ago
Kenneth Tang	e4c2f57e43	Merge branch 'SciSharp:master' into master	1 year ago
Martin Evans	268f3a6b07	BatchedExecutor Fixed Forking (#621 ) * Previously when a conversation was forked this would result in both the parent and the child sharing exactly the same logits. Since sampling is allowed to modify logits this could lead to issues in sampling (e.g. one conversation is sampled and overwrites logits to be all zero, second conversation is sampled and generates nonsense). Fixed this by setting a "forked" flag, logits are copied if this flag is set. Flag is cleared next time the conversation is prompted so this extra copying only happens once after a fork occurs. * Removed finalizer from `BatchedExecutor`. This class does not directly own any unmanaged resources so it is not necessary.	1 year ago
Kenneth Tang	9e4109f774	Unable to load the model onto multiple GPUs (#617 )	1 year ago
Kenneth Tang	6216197196	Merge branch 'SciSharp:master' into master	1 year ago
Martin Evans	ad682fbebd	`BatchedExecutor.Create()` method (#613 ) Replaced `BatchedExecutor.Prompt(string)` method with `BatchedExecutor.Create()` method. This improves the API in two ways: - A conversation can be created, without immediately prompting it - Other prompting overloads (e.g. prompt with token list) can be used without duplicating all the overloads onto `BatchedExecutor` Added `BatchSize` property to `LLamaContext`	1 year ago
Kenneth Tang	3fda708eaa	Fix System.ArgumentException: EmbeddingMode must be true	1 year ago
Rinne	e3ecc318ff	Merge pull request #612 from xbotter/deps/sk-1.6.2 Update Semantic Kernel & Kernel Memory Package	1 year ago
xbotter	a019b5cc24	📝 Update LLamaSharpChatCompletion and LLama.Unittest - Updated LLamaSharpChatCompletion class in LLama.SemanticKernel/ChatCompletion/LLamaSharpChatCompletion.cs - Changed the type of the "_model" field from "StatelessExecutor" to "ILLamaExecutor" - Updated the constructor to accept an "ILLamaExecutor" parameter instead of a "StatelessExecutor" parameter - Updated LLamaSharpChatCompletion class in LLama.SemanticKernel/LLamaSharp.SemanticKernel.csproj - Updated LLama.Unittest project in LLama.Unittest/LLama.Unittest.csproj - Added a "PackageReference" for "Moq" version 4.20.70 - Added ExtensionMethodsTests class in LLama.Unittest/SemanticKernel/ExtensionMethodsTests.cs - Added tests for the "ToLLamaSharpChatHistory" and "ToLLamaSharpInferenceParams" extension methods - Added LLamaSharpChatCompletionTests class in LLama.Unittest/SemanticKernel/LLamaSharpChatCompletionTests.cs - Added tests for the LLamaSharpChatCompletion class ℹ️ The LLamaSharpChatCompletion class in the LLama.SemanticKernel project has been updated to use the ILLamaExecutor interface instead of the StatelessExecutor class. This change allows for better abstraction and flexibility in the implementation of the LLamaSharpChatCompletion class. The LLamaSharpChatCompletion class is responsible for providing chat completion functionality in the LLamaSharp project. The LLama.Unittest project has also been updated to include tests for the LLamaSharpChatCompletion class and the extension methods used by the class.	1 year ago
Martin Evans	024787225b	`SetDllImportResolver` based loading (#603 ) - Modified library loading to be based on `SetDllImportResolver`. This replaces the built in loading system and ensures there can't be two libraries loaded at once. - llava and llama are loaded separately, as needed. - All the previous loading logic is still used, within the `SetDllImportResolver` - Split out CUDA, AVX and MacOS paths to separate helper methods. - `Description` now specifies if it is for `llama` or `llava`	1 year ago
eublefar	d88f9e1199	Return null executor state if it's serialized in an old way	1 year ago
eublefar	00c873a197	Avoid saving empty context state in binary format, it smh messes with the llama.cpp	1 year ago

1 2 3 4 5 ...

1174 Commits (0773e68111e8f23bd6916fc38f53b0f0224d7a50) All Branches Search

1174 Commits (0773e68111e8f23bd6916fc38f53b0f0224d7a50)

All Branches