361 Commits (f68aa777f12f190923d55d2ff6f40cd891db2d94)

Author SHA1 Message Date
  Martin Evans e3468d04f0 Merge pull request #277 from martindevans/feature/min_p 2 years ago
  Rinne da6718c387 docs: adjust some descriptions. 2 years ago
  Yaohui Liu d7675f7936 Merge branch 'master' of github.com:AsakusaRinne/LLamaSharp into cuda_detection 2 years ago
  Martin Evans d743516070 - Added support for the MinP sampler 2 years ago
  Yaohui Liu 4d2c5f1003 build: change nuget configuration for cuda detection. 2 years ago
  Yaohui Liu cb5fb210b1 feat: optimize apis for cuda feature detection. 2 years ago
  SignalRT 97006a214f Merge remote-tracking branch 'upstream/master' into RuntimeDetection 2 years ago
  Yaohui Liu bbbfbd20b5 fix: cannot load library under some conditions. 2 years ago
  Martin Evans 31244ae691 Merge branch 'master' into YaRN_scaling_parameters 2 years ago
  SignalRT 7691f83516 Test build and nuget packages 2 years ago
  Yaohui Liu d03e1dbe30 feat: support cuda feature detection. 2 years ago
  SignalRT fb95bbb4e0 Merge remote-tracking branch 'upstream/master' into RuntimeDetection 2 years ago
  SignalRT 5fe721bdbe Revert "Merge branch 'pr/268' into RuntimeDetection" 2 years ago
  SignalRT 200011e186 Revert "Merge feat: add detection template for cuda and avx. #268" 2 years ago
  Rinne 47e016743e Merge pull request #266 from philippjbauer/master 2 years ago
  SignalRT b4b3ea9d99 Merge feat: add detection template for cuda and avx. #268 2 years ago
  SignalRT 091b8d58b3 Merge branch 'pr/268' into RuntimeDetection 2 years ago
  Yaohui Liu b893c6f609 feat: add detection template for cuda and avx. 2 years ago
  Philipp Bauer d2b544afb8 Improved method return description 2 years ago
  Philipp Bauer 6ea40d1546 Use full history only when the ChatSession runs the first time 2 years ago
  SignalRT 0edbd92530 Change nuget backend packages 2 years ago
  Martin Evans db1bc741b0 Modified `ContextSize` in parameters to be nullable. A null value means autodetect from the model. 2 years ago
  Udayshankar Ravikumar 4071c1f5fc Updated preprocessor directives 2 years ago
  Philipp Bauer a288e7c02b Prevent duplication of user prompts / chat history in ChatSession. 2 years ago
  SignalRT b67198c6ef MacOS Intel Disable METAL 2 years ago
  Udayshankar Ravikumar df310e15da Fixed preprocessor directives 2 years ago
  SignalRT e64b9057d7 Merge branch 'RuntimeDetection' of https://github.com/SignalRT/LLamaSharp into RuntimeDetection 2 years ago
  SignalRT d1244332ed MacOS Runtime detection and clasification 2 years ago
  Martin Evans 04ee64a6be Exposed YaRN scaling parameters in IContextParams 2 years ago
  Udayshankar Ravikumar 1dad1ff834 Enhance framework compatibility 2 years ago
  SignalRT e1a89a8b0a Added all binaries from this run: https://github.com/SciSharp/LLamaSharp/actions/runs/6762323560 2 years ago
  Martin Evans 11d8c55db7 Added all binaries from this run: https://github.com/SciSharp/LLamaSharp/actions/runs/6762323560 (132d25b8a6) 2 years ago
  SignalRT 46fb472d42 Align with llama.cpp b1488 2 years ago
  Martin Evans a03fdc4818 Using a reference to an array instead of pointer arithmetic. This means it will benefit from bounds checking on the array. 2 years ago
  Martin Evans 08c29d52c5 Slightly refactored `SafeLLamaGrammarHandle.Create` to solve CodeQL warning about pointer arithmetic. 2 years ago
  Yaohui Liu 0e139d4ee2 fix: add arm binaries to cpu nuspec. 2 years ago
  Yaohui Liu 7ee27d2f99 fix: binary not copied on MAC platform. 2 years ago
  Martin Evans db8f3980ea New binaries from this commit: 207b51900e 2 years ago
  Martin Evans b6d242193e Debugging slowdown by removing some things: 2 years ago
  Martin Evans 529b06b35b - Fixed rope frequency/base to use the values in the model by default, instead of always overriding them by default! 2 years ago
  Martin Evans dcc82e582e Fixed `Eval` on platforms < dotnet 5 2 years ago
  Martin Evans 51c292ebd8 Added a safe method for `llama_get_logits_ith` 2 years ago
  Martin Evans 7e3cde4c13 Moved helper methods into `LLamaBatchSafeHandle` 2 years ago
  Martin Evans ccb8afae46 Cleaned up stateless executor as preparation for changing it to use the new batched decoding system. 2 years ago
  Martin Evans c786fb0ec8 Using `IReadOnlyList` instead of `IEnumerable` in `IInferenceParams` 2 years ago
  Martin Evans c7fdb9712c Added binaries, built from `6961c4bd0b` 2 years ago
  Martin Evans e81b3023d5 Rewritten sampling API to be accessed through the `LLamaTokenDataArray` object 2 years ago
  Martin Evans 3c5547b2b7 Reduced some uses of `NativeApi` in `BatchedDecoding` by adding some helper methods 2 years ago
  Martin Evans b38e3f6fe2 binaries (avx512) 2 years ago
  Martin Evans a024d2242e It works! 2 years ago