14 Commits (1d29b240b264dffef3ba31efd411d0875ab5dcc9)

Author SHA1 Message Date
  sa_ddam213 3e252c81f6 LLamaContextParams epsilon and tensor split changes 2 years ago
  Martin Evans 44b1e93609 Moved LoRA loading into `SafeLlamaModelHandle` 2 years ago
  Martin Evans f16aa58e12 Updated to use the new loading system in llama (llama_state). This new system has split model weights and contexts into two separate things, allowing one set of weights to be shared between many contexts. 2 years ago
  Yaohui Liu 1062fe1a7e feat: upgrade the native libraries. 2 years ago
  Yaohui Liu 3bf74ec9b9 feat: add chat session for refactored code. 2 years ago
  Yaohui Liu 264fb9a706 refactor: LLamaModel and LLamaExecutor. 2 years ago
  Yaohui Liu 3a62f087fe fix: encoding error when using other languages. 2 years ago
  Yaohui Liu e21589afa6 fix: n_gpu_layers not work in latest commit. 2 years ago
  Yaohui Liu 18c2ff2395 refactor: instruct mode and examples. 2 years ago
  Yaohui Liu d6bd1b7107 fix: add check for model file path. 2 years ago
  Yaohui Liu afedd3c949 fix: errors when input is not English or too long. 2 years ago
  Yaohui Liu 1fca06dc7f fix: n_gpu_layers miss in llama context. 2 years ago
  Yaohui Liu fce10f3c4f feat: add ChatSession. 2 years ago
  Yaohui Liu d6a7997e46 feat: add gpt model. 2 years ago