7 Commits (96bf214427a9b1f9f0ac90fdb069763e58e24910)

Author SHA1 Message Date
  Martin Evans e2705be6c8
Fixed off by one error in LLamaBatch sampling position (#626) 1 year ago
  Martin Evans ad682fbebd
`BatchedExecutor.Create()` method (#613) 1 year ago
  Martin Evans 7d84625a67
Classifier Free Guidance (#536) 1 year ago
  Martin Evans 91a7967869
`ReadOnlySpan<float>` in ISamplingPipeline (#538) 1 year ago
  Martin Evans 74a39188a2
Used `AnsiConsole` in a few more places: (#534) 1 year ago
  Scott W Harden 06ffe3ac95
LLama.Examples: improve model path prompt (#526) 1 year ago
  Martin Evans b0acecf080
Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix). 1 year ago