100 Commits (18b15184ea6dceba07b9e6a6a73594e823714d40)

Author SHA1 Message Date
  Martin Evans e89ca5cc17 Fixed a few minor warnings 2 years ago
  Martin Evans 9daf586ba8 Assorted cleanup leftover after the huge change in the last PR (comments, syntax style, etc) 2 years ago
  Martin Evans 1f8c94e386 Added in the `special` parameter to the tokenizer (introduced in https://github.com/ggerganov/llama.cpp/pull/3538) 2 years ago
  Martin Evans 2a38808bca - Added threads to context params, replaced all thread args with `uint?` 2 years ago
  Martin Evans 9a0a0ae9fe Removed cloning support 2 years ago
  Martin Evans 0d40338692 Fixed out-of-context handling in stateless executor 2 years ago
  Martin Evans b306ac23dd Added `Decode` method to `SafeLLamaContextHandle` 2 years ago
  Martin Evans 9e958e896b safe handle for batch 2 years ago
  Martin Evans ce1fc51163 Added some more native methods 2 years ago
  Martin Evans bca55eace0 Initial changes to match the llama.cpp changes 2 years ago
  Haiping 10678a83d6
Merge pull request #65 from martindevans/alternative_dependency_loading 2 years ago
  Martin Evans daf09eae64 Skipping tokenization of empty strings (saves allocating an empty array every time) 2 years ago
  Martin Evans bba801f4b7 Added a property to get the KV cache size from a context 2 years ago
  sa_ddam213 09d8f434f2
Extract LLamaLogLevel, Remove Logger class 2 years ago
  Martin Evans d3b8ee988c
Beam Search (#155) 2 years ago
  Martin Evans 614ba40948 - Added a `TokensEndsWithAnyString` extension to `IReadOnlyList<int>` which efficiently checks if a set of tokens ends with one of a set of strings. 2 years ago
  Martin Evans 6a842014ac Removed duplicate `llama_sample_classifier_free_guidance` method 2 years ago
  Martin Evans 8f58a40fb9 Added Linux dependency loading 2 years ago
  Martin Evans dd4957471f Changed paths to match what the GitHub build action produces 2 years ago
  Martin Evans 756a1ad0ba Added a new way to load dependencies, performing CPU feature detection 2 years ago
  Rinne 4e83e48ad1
Merge pull request #122 from martindevans/gguf 2 years ago
  Martin Evans bcf06e2652 Added some comments on various native methods 2 years ago
  Martin Evans a70c7170dd - Created a higher level `Grammar` class which is immutable and contains a list of grammar rules. This is the main "entry point" to the grammar system. 2 years ago
  Mihai 0bd495276b Add initial tests + fix bugs. Still WIP since the test is failing. 2 years ago
  Martin Evans 2022b82947 Added binaries generated by this action: https://github.com/SciSharp/LLamaSharp/actions/runs/6002797872/job/16279896150 2 years ago
  Martin Evans 31287b5e6e Rewritten TokenToSpan/TokenToString to better fit the new way it's done in llama.cpp with a few different options: 2 years ago
  Martin Evans 0c98ae1955 Passing ctx to `llama_token_nl(_ctx)` 2 years ago
  Martin Evans 6ffa28f964 Removed `LLAMA_MAX_DEVICES` (not used) 2 years ago
  Martin Evans 2056078aef Initial changes required for GGUF support 2 years ago
  Martin Evans cf4754db44 Removed unnecessary parameters from some low level sampler methods 2 years ago
  Martin Evans f70525fec2 Two small improvements to the native sampling API: 2 years ago
  Martin Evans a911b77dec Various minor changes, resolving about 100 ReSharper code quality warnings 2 years ago
  Martin Evans ebacdb666d - Moved the lower level state get/set methods onto SafeLLamaContextHandle 2 years ago
  Martin Evans 829f32b27d - Added `Obsolete` attributes to the entire `OldVersion` namespace, so they can be removed in the future 2 years ago
  zombieguy 45b01d5a78 Improved type conversion 2 years ago
  Martin Evans 2830e5755c - Applied a lot of minor R# code quality suggestions. Lots of unnecessary imports removed. 2 years ago
  Martin Evans 4b7d718551 Added native symbol for CFG 2 years ago
  Martin Evans 759ae26f36
Merge branch 'master' into grammar_basics 2 years ago
  Martin Evans a9e6f21ab8 - Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring! 2 years ago
  Martin Evans ae8ef17a4a - Added various convenience overloads to `LLamaContext.Eval` 2 years ago
  Martin Evans 64416ca23c - Created a slightly nicer way to create grammar (from `IReadOnlyList<IReadOnlyList<LLamaGrammarElement>>`) 2 years ago
  Martin Evans 0294bb1303 Some of the basics of the grammar API 2 years ago
  Rinne 62331852bc
Merge pull request #90 from martindevans/proposal_multi_context 2 years ago
  zombieguy 10f88ebd0e
Potential fix for .Net Framework issues (#103) 2 years ago
  Martin Evans 6c84accce8 Added `llama_sample_classifier_free_guidance` method from native API 2 years ago
  Martin Evans 479ff57853 Renamed `EmbeddingCount` to `EmbeddingSize` 2 years ago
  Martin Evans d0a7a8fcd6 - Cleaned up disposal in LLamaContext 2 years ago
  Martin Evans f3511e390f WIP demonstrating changes to support multi-context. You can see this in use in `TalkToYourself`, along with notes on what still needs improving. 2 years ago
  Martin Evans d7f971fc22 Improved `NativeApi` file a bit: 2 years ago
  Martin Evans 841cf88e3b
Merge pull request #96 from martindevans/minor_quantizer_improvements 2 years ago