7 Commits (b1f3987fae88fa85a9655c12869b9505f58c8d3e)

Author SHA1 Message Date
  Martin Evans e2705be6c8
Fixed off by one error in LLamaBatch sampling position (#626) 1 year ago
  Martin Evans ad682fbebd
`BatchedExecutor.Create()` method (#613) 1 year ago
  Martin Evans 7d84625a67
Classifier Free Guidance (#536) 1 year ago
  Martin Evans 91a7967869
`ReadOnlySpan<float>` in ISamplingPipeline (#538) 1 year ago
  Martin Evans 74a39188a2
Used `AnsiConsole` in a few more places: (#534) 1 year ago
  Scott W Harden 06ffe3ac95
LLama.Examples: improve model path prompt (#526) 1 year ago
  Martin Evans b0acecf080 Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix). 1 year ago