2 Commits (06ffe3ac95389d18a2e8ca87bd5494c479b205c7)

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Scott W Harden | 06ffe3ac95 | LLama.Examples: improve model path prompt (#526) | 1 year ago |
| Martin Evans | b0acecf080 | Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix). | 1 year ago |