9 Commits (a67ea36dd9cf91c93d5a2fa4e6a82b7d4cb15b89)

Author SHA1 Message Date
  SignalRT 348f2c7d72 Update llama.cpp binaries to 5f631c2 and align the context to that version 2 years ago
  Martin Evans add3d5528b Removed `MarshalAs` on array 2 years ago
  Martin Evans 2245b84906
Update LLamaContextParams.cs 2 years ago
  sa_ddam213 3e252c81f6 LLamaContextParams epsilon and tensor split changes 2 years ago
  Martin Evans f16aa58e12 Updated to use the new loading system in llama (llama_state). This new system has split model weights and contexts into two separate things, allowing one set of weights to be shared between many contexts. 2 years ago
  Yaohui Liu 9850417a12
feat: update quantize native params. 2 years ago
  Yaohui Liu 18c2ff2395
refactor: instruct mode and examples. 2 years ago
  Yaohui Liu 1fca06dc7f
fix: n_gpu_layers miss in llama context. 2 years ago
  Yaohui Liu 5a79edeb51
feat: add the framework and basic usages. 2 years ago