237 Commits (84bb5a36aba609d20992a9712cbda4c19f762033)

Author SHA1 Message Date
  Martin Evans 00df7c1516 - Added `LLamaWeights.LoadFromFileAsync`. 2 years ago
  Martin Evans 18586cc43b Merge pull request #696 from martindevans/safe_handle_constructor_refactor 2 years ago
  Martin Evans e9fd7f96e0 Merge pull request #691 from martindevans/empty_batch_check 2 years ago
  Martin Evans a2f8573831 Merge pull request #698 from martindevans/slightly_safer_quantize_params 2 years ago
  Martin Evans d4f793a7eb Using `is` check instead of `== null` 2 years ago
  Martin Evans ecb359c9e7 - Using more specific `LoadWeightsFailedException` when a llava model fails to load (#697) 2 years ago
  Martin Evans 58ec798bff Modified `llama_model_quantize` to accept argument by `ref` instead of pointer. 2 years ago
  Martin Evans 54dab273cd - Removed unnecessary constructors from safe handles 2 years ago
  Martin Evans 25812762c9 Added checks in `Decode` to skip doing anything if the batch is empty. 2 years ago
  Martin Evans 3c76440957 - Added tests for generating embeddings with generative model and embedding model 2 years ago
  Martin Evans c325ac9127 April 2024 Binary Update (#662) 2 years ago
  Martin Evans 58107bb5b9 Logging interceptor (#649) 2 years ago
  evolcano 353412923f Merge branch 'master' of https://github.com/SciSharp/LLamaSharp 2 years ago
  evolcano 9d091c0316 Add path to find llama.dll for MAUI 2 years ago
  SignalRT 2d9a114f66 Include comments and include some checks 2 years ago
  SignalRT e8732efadd Example InteractiveExecutor 2 years ago
  Martin Evans e2705be6c8 Fixed off by one error in LLamaBatch sampling position (#626) 2 years ago
  Martin Evans 91d72e7465 Keeping track of positions where logits will be generated in a batch and what sequence those logits are associated with. (#624) 2 years ago
  Martin Evans 024787225b `SetDllImportResolver` based loading (#603) 2 years ago
  jlsantiago 3b2836eac4 Llava api (#563) 2 years ago
  Martin Evans ce4de7d607 llama_decode lock (#595) 2 years ago
  Clovis Henrique Ribeiro d0f79814e9 Added conditional compilation code to progress_callback (in LlamaModelParams struct) so the struct plays nice with legacy NET Framework 4.8 (#593) 2 years ago
  Martin Evans f0b0bbcbb7 Mutable Logits (#586) 2 years ago
  Martin Evans a8ba9f05b3 March Binary Update (#565) 2 years ago
  Martin Evans 8ac1634233 Removed `llama_eval`. It is going to be completely removed in the next version of llama.cpp (#553) 2 years ago
  Martin Evans f0e7e7cc0a Removed `SamplingApi`. it has been marked as Obsolete for a while, replaced by instance methods on `LLamaTokenDataArray` (#552) 2 years ago
  Martin Evans 7d84625a67 Classifier Free Guidance (#536) 2 years ago
  Scott W Harden a6394001a1 NativeLibraryConfig: WithLogs(LLamaLogLevel) (#529) 2 years ago
  Martin Evans c7d0dc915a Assorted small changes to clean up some code warnings 2 years ago
  Martin Evans e9d9042576 Added `Divide` to `KvAccessor` 2 years ago
  Martin Evans 949861a581 - Added a `Modify` method to `Conversation`. This grants **temporary** access to directly modify the KV cache. 2 years ago
  Martin Evans b0acecf080 Created a new `BatchedExecutor` which processes multiple "Conversations" in one single inference batch. This is faster, even when the conversations are unrelated, and is much faster if the conversations share some overlap (e.g. a common system prompt prefix). 2 years ago
  Martin Evans 90915c5a99 Added increment and decrement operators to `LLamaPos` 2 years ago
  Martin Evans c5146bac23 - Exposed KV debug view through `SafeLLamaContextHandle` 2 years ago
  Martin Evans 15a98b36d8 Updated everything to work with llama.cpp ce32060198 2 years ago
  Martin Evans 5da2a2f64b - Removed one of the constructors of `SafeLLamaHandleBase`, which implicitly states that memory is owned. Better to be explicit about this kind of thing! 2 years ago
  Jason Couture ec59c5bf9e Fix missing library name prefix for cuda 2 years ago
  Jason Couture 443ce4fff4 While the dllimport changes work, manual path searching needed to be updated 2 years ago
  Jason Couture db7e1e88f8 Use llama instead of libllama in `[DllImport]` 2 years ago
  Martin Evans 92b9bbe779 Added methods to `SafeLLamaContextHandle` for KV cache manipulation 2 years ago
  Martin Evans 96c26c25f5 Merge pull request #445 from martindevans/stateless_executor_llama_decode 2 years ago
  Martin Evans 9fe878ae1f - Fixed example 2 years ago
  Martin Evans 9ede1bedc2 Automatically growing batch n_seq_max when exceeded. This means no parameters need to be picked when the batch is created. 2 years ago
  Martin Evans a2e29d393c Swapped `StatelessExecutor` to use `llama_decode`! 2 years ago
  Martin Evans 99969e538e - Removed some unused `eval` methods. 2 years ago
  Martin Evans 36a9335588 Removed `LLamaBatchSafeHandle` (using unmanaged memory, created by llama.cpp) and replaced it with a fully managed `LLamaBatch`. Modified the `BatchedDecoding` example to use new managed batch. 2 years ago
  Martin Evans 1472704e12 Added a test with examples of troublesome strings from 0.9.1 2 years ago
  Martin Evans 73172bbaba Merge pull request #438 from martindevans/cleanup_model_unnecessary_unsafe 2 years ago
  Martin Evans ce1d302e7e Moved some native methods into `SafeLlamaModelHandle`, these methods are all wrapped in safer accessors with no extra costs so there is no need to expose them. 2 years ago
  Martin Evans 1e86755071 - Removed unnecessary `unsafe` block in model metadata loading 2 years ago