11 Commits (54dab273cdb671e01da21e2b1d374bba767dc244)

Author SHA1 Message Date
  Martin Evans ccc49eb1e0 BatchedExecutor Save/Load (#681) 2 years ago
  Martin Evans 268f3a6b07 BatchedExecutor Fixed Forking (#621) 2 years ago
  Martin Evans ad682fbebd `BatchedExecutor.Create()` method (#613) 2 years ago
  Martin Evans f0b0bbcbb7 Mutable Logits (#586) 2 years ago
  Martin Evans a8ba9f05b3 March Binary Update (#565) 2 years ago
  Martin Evans c7d0dc915a Assorted small changes to clean up some code warnings 2 years ago
  Martin Evans e9d9042576 Added `Divide` to `KvAccessor` 2 years ago
  Martin Evans 1cc463b9b7 Added a finalizer to `BatchedExecutor` 2 years ago
  Martin Evans 0c2cff0e1c Added a Finalizer for `Conversation` in case it is not correctly disposed. 2 years ago
  Martin Evans 949861a581 - Added a `Modify` method to `Conversation`. This grants **temporary** access to directly modify the KV cache. 2 years ago
  Martin Evans b0acecf080 Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix). 2 years ago