154 Commits (refactor_v1.0)

Author SHA1 Message Date
  Martin Evans 835958398c - Removed the object wrappers and configurable pipeline, they can be better written in code. 1 year ago
  Martin Evans 33358124db Initial pass at a new sampling pipeline 1 year ago
  Rinne 1f97ad874b
Merge pull request #333 from AsakusaRinne/master 2 years ago
  Rinne ffc347a3f3
resolve comments. 2 years ago
  Rinne b05c3154f4
feat: allow customized search path for native library loading. 2 years ago
  Rinne 934358a7b3
Merge branch 'master' of github.com:AsakusaRinne/LLamaSharp into fix_chinese 2 years ago
  Rinne 217c67b757
fix: chinese encoding error. 2 years ago
  Martin Evans a3614f6747 Added `native/` back into path prefix 2 years ago
  Martin Evans 77003d763e Added new symbols from llama.h 2 years ago
  Martin Evans 37466956c7 Added new binaries. 2 years ago
  Martin Evans 48c5039054 Improved test coverage. Discovered some issues: 2 years ago
  Martin Evans c517cc18a2
Merge pull request #304 from martindevans/obsolete_attribute_eval 2 years ago
  Martin Evans 16ab33ba3c Added Obsolete markings to all `Eval` overloads 2 years ago
  Martin Evans 0e51badb38 Exposed `progress_callback` in `LLamaModelParams` (although not in higher level) 2 years ago
  Martin Evans 1970023ef4
Merge pull request #292 from martindevans/dotnet8.0 2 years ago
  Martin Evans 89fef05362 This commit (5fe721bdbe) accidentally removed a load of stuff that it shouldn't. Fixed that. 2 years ago
  Martin Evans e9f5dbba89 Processing AVX512 branch on all dotnet versions 2 years ago
  Martin Evans e850115b5f Added dotnet8.0 as a build target 2 years ago
  Martin Evans b44e780b0f
Merge pull request #281 from martindevans/NativeLibraryConfig_improvements 2 years ago
  Martin Evans e3468d04f0
Merge pull request #277 from martindevans/feature/min_p 2 years ago
  Martin Evans a9d1f6cb47 - Renamed `NativeLibraryConfig.Default` to `NativeLibraryConfig.Instance`. It's not default any more as soon as you call `WithX`! 2 years ago
  Rinne da6718c387
docs: adjust some descriptions. 2 years ago
  Yaohui Liu d7675f7936
Merge branch 'master' of github.com:AsakusaRinne/LLamaSharp into cuda_detection 2 years ago
  Martin Evans d743516070 - Added support for the MinP sampler 2 years ago
  Yaohui Liu cb5fb210b1
feat: optimize apis for cuda feature detection. 2 years ago
  SignalRT 97006a214f Merge remote-tracking branch 'upstream/master' into RuntimeDetection 2 years ago
  Yaohui Liu bbbfbd20b5
fix: cannot load library under some conditions. 2 years ago
  Martin Evans 31244ae691
Merge branch 'master' into YaRN_scaling_parameters 2 years ago
  SignalRT 7691f83516 Test build and nuget packages 2 years ago
  Yaohui Liu d03e1dbe30
feat: support cuda feature detection. 2 years ago
  SignalRT 5fe721bdbe Revert "Merge branch 'pr/268' into RuntimeDetection" 2 years ago
  SignalRT 200011e186 Revert "Merge feat: add detection template for cuda and avx. #268" 2 years ago
  SignalRT b4b3ea9d99 Merge feat: add detection template for cuda and avx. #268 2 years ago
  Yaohui Liu b893c6f609
feat: add detection template for cuda and avx. 2 years ago
  Martin Evans db1bc741b0 Modified `ContextSize` in parameters to be nullable. A null value means autodetect from the model. 2 years ago
  Martin Evans 04ee64a6be Exposed YaRN scaling parameters in IContextParams 2 years ago
  SignalRT 46fb472d42 Align with llama.cpp b1488 2 years ago
  Martin Evans a03fdc4818 Using a reference to an array instead of pointer arithmetic. This means it will benefit from bounds checking on the array. 2 years ago
  Martin Evans 08c29d52c5 Slightly refactored `SafeLLamaGrammarHandle.Create` to solve CodeQL warning about pointer arithmetic. 2 years ago
  Martin Evans b6d242193e Debugging slowdown by removing some things: 2 years ago
  Martin Evans 51c292ebd8 Added a safe method for `llama_get_logits_ith` 2 years ago
  Martin Evans 7e3cde4c13 Moved helper methods into `LLamaBatchSafeHandle` 2 years ago
  Martin Evans c7fdb9712c Added binaries, built from `6961c4bd0b` 2 years ago
  Martin Evans e81b3023d5 Rewritten sampling API to be accessed through the `LLamaTokenDataArray` object 2 years ago
  Martin Evans 3c5547b2b7 Reduced some uses of `NativeApi` in `BatchedDecoding` by adding some helper methods 2 years ago
  Martin Evans a024d2242e It works! 2 years ago
  Martin Evans 8cd81251b4 initial setup 2 years ago
  Martin Evans 321d0b58c4
Merge pull request #202 from martindevans/multi_gpu 2 years ago
  Martin Evans a03fe003de Fixed decoding of text "accumulating" over time (never properly clearing buffer) 2 years ago
  Martin Evans 51d4411a58 Added two new classes for detokenization tasks: 2 years ago