8 Commits (b2f2549692c8a64eee78bb3df5b59d4c969a9991)

Author SHA1 Message Date
  Martin Evans 3ba49754b1 Removed (marked as obsolete) prompting with a string for `Conversation`. Tokenization requires extra parameters (e.g. addBos, special) which require special considersation. For now it's better to tokenize using other tools and pass the tokens directly. 2 years ago
  Martin Evans 00df7c1516 - Added `LLamaWeights.LoadFromFileAsync`. 2 years ago
  Martin Evans ad682fbebd
`BatchedExecutor.Create()` method (#613) 2 years ago
  Martin Evans 7d84625a67
Classifier Free Guidance (#536) 2 years ago
  Martin Evans 91a7967869
`ReadOnlySpan<float>` in ISamplingPipeline (#538) 2 years ago
  Martin Evans 74a39188a2
Used `AnsiConsole` in a few more places: (#534) 2 years ago
  Scott W Harden 06ffe3ac95
LLama.Examples: improve model path prompt (#526) 2 years ago
  Martin Evans b0acecf080 Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix). 2 years ago