You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
SignalRT 348f2c7d72 Update llama.cpp binaries to 5f631c2 and align the context to that version 2 years ago
..
GgmlInitParams.cs feat: support loading and saving state. 2 years ago
LLamaContextParams.cs Update llama.cpp binaries to 5f631c2 and align the context to that version 2 years ago
LLamaFtype.cs feat: update quantize native params. 2 years ago
LLamaModelQuantizeParams.cs Various minor XML docs fixes 2 years ago
LLamaTokenData.cs refactor: LLamaModel and LLamaExecutor. 2 years ago
LLamaTokenDataArray.cs - Most importantly: Fixed issue in `SamplingApi`, `Memory` was pinned, but never unpinned! 2 years ago
NativeApi.Quantize.cs Various minor XML docs fixes 2 years ago
NativeApi.Sampling.cs - Most importantly: Fixed issue in `SamplingApi`, `Memory` was pinned, but never unpinned! 2 years ago
NativeApi.cs Added model `Tokenize` method 2 years ago
NativeInfo.cs feat: add the framework and basic usages. 2 years ago
SafeLLamaContextHandle.cs Updated to use the new loading system in llama (llama_state). This new system has split model weights and contexts into two separate things, allowing one set of weights to be shared between many contexts. 2 years ago
SafeLLamaHandleBase.cs - Fixed null check 2 years ago
SafeLlamaModelHandle.cs Added comments on two properties 2 years ago
SamplingApi.cs - Most importantly: Fixed issue in `SamplingApi`, `Memory` was pinned, but never unpinned! 2 years ago

C#/.NET上易用的LLM高性能推理框架,支持LLaMA和LLaVA系列模型。

C# Text Metal JavaScript HTML+Razor other