* Added the ability to save and load individual conversations in a batched executor.
- New example
- Added `BatchedExecutor.Load(filepath)` method
- Added `Conversation.Save(filepath)` method
- Added new (currently internal) `SaveState`/`LoadState` methods in `LLamaContext` which can stash some extra binary data in the header
* Added ability to save/load a `Conversation` to an in-memory state, instead of to file.
* Moved the new save/load methods out to an extension class specifically for the batched executor.
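A minimal usage sketch of the new save/load surface, assuming the extension methods are in scope and that `Conversation.Save` and `BatchedExecutor.Load` take a file path as described above; any other names (model path, prompt text, method signatures not listed above) are placeholders rather than a confirmed API.

```csharp
using System.Threading.Tasks;
using LLama;
using LLama.Batched;
using LLama.Common;

// Sketch only: signatures other than Conversation.Save(filepath) and
// BatchedExecutor.Load(filepath) are assumptions for illustration.
public static class ConversationPersistenceExample
{
    public static async Task SaveAndReloadAsync(string modelPath, string statePath)
    {
        var parameters = new ModelParams(modelPath);
        using var weights = LLamaWeights.LoadFromFile(parameters);
        using var executor = new BatchedExecutor(weights, parameters);

        // Create a conversation, prompt it and run one inference step.
        var conversation = executor.Create();
        conversation.Prompt("Write a haiku about autumn.");
        await executor.Infer();

        // Persist just this conversation. The file header can carry extra
        // binary data via the (internal) LLamaContext SaveState/LoadState.
        conversation.Save(statePath);

        // Restore it later into the same (or a fresh) batched executor.
        var restored = executor.Load(statePath);
    }
}
```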
* Removed unnecessary spaces
* Updated binaries, using [this build](https://github.com/SciSharp/LLamaSharp/actions/runs/8654672719/job/23733195669) for llama.cpp commit `f7001ccc5aa359fcf41bba19d1c99c3d25c9bcc7`.
- Added all new functions.
- Moved some functions (e.g. `SafeLlamaModelHandle` specific functions) into `SafeLlamaModelHandle.cs`
- Exposed tokens on `SafeLlamaModelHandle` and `LLamaWeights` through a `Tokens` property. As new special tokens are added in the future, they can be exposed here.
- Changed all token properties to return nullable tokens, to handle models which do not define certain tokens (see the sketch below).
- Fixed `DefaultSamplingPipeline` to handle no newline token in some models.
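As an illustration, a hedged sketch of consuming the new nullable token properties; the `Tokens` property name comes from the notes above, while individual member names such as `Newline` are assumptions.

```csharp
using System;
using LLama;
using LLama.Common;

// Sketch: read special tokens through the Tokens property and cope with
// models that do not define some of them (the properties are nullable).
// The "Newline" member name is an assumption for illustration.
var parameters = new ModelParams("model.gguf");
using var weights = LLamaWeights.LoadFromFile(parameters);

var newline = weights.Tokens.Newline;   // nullable: may be null for some models
if (newline is null)
{
    // This is the case DefaultSamplingPipeline now has to handle.
    Console.WriteLine("Model defines no newline token.");
}
```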
* Moved native methods to more specific locations.
- Context-specific things have been moved into `SafeLLamaContextHandle.cs` and made private; they're already exposed through C# properties and methods.
- Checking that GPU layer count is zero if GPU offload is not supported.
- Moved methods for creating default structs (`llama_model_quantize_default_params` and `llama_context_default_params`) into relevant structs.
* Removed exception if `GpuLayerCount > 0` when GPU is not supported.
* Added low level wrapper methods for the new per-sequence state load/save in `SafeLLamaContextHandle`
- Added high level wrapper methods (save/load with `State` object or memory mapped file) in `LLamaContext`
- Moved native methods for per-sequence state load/save into `SafeLLamaContextHandle`
* Added update and defrag methods for KV cache in `SafeLLamaContextHandle`
* Updated submodule to `f7001ccc5aa359fcf41bba19d1c99c3d25c9bcc7`
* Passing the sequence ID when saving a single sequence state
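A hedged sketch of what calling the per-sequence save/load might look like from `LLamaContext`; the method names and overloads shown are assumptions mirroring the notes above, not a confirmed API.

```csharp
using LLama;
using LLama.Common;
using LLama.Native;

// Sketch only: SaveState/LoadState overloads taking a sequence id are
// assumptions based on the description above.
var parameters = new ModelParams("model.gguf");
using var weights = LLamaWeights.LoadFromFile(parameters);
using var context = weights.CreateContext(parameters);

// The sequence whose state we want to persist (explicit conversion assumed).
var sequence = (LLamaSeqId)0;

// Save the state of this single sequence to a (memory mapped) file,
// passing the sequence id as described above.
context.SaveState("sequence0.state", sequence);

// ... later, restore it into a context created from the same model ...
context.LoadState("sequence0.state", sequence);
```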
* Added `NativeLogConfig` which allows overriding the llama.cpp log callback
- Delaying binding of this into llama.cpp until after `NativeLibraryConfig` has loaded
* Using the log callback to show log messages during library loading.
* Registering log callbacks before any calls to llama.cpp except `llama_empty_call`; this method is specifically chosen because it does nothing and exists only to trigger DLL loading.
* Removed much of the complexity of logging from `NativeApi.Load`. It now always calls whatever log callbacks you have registered.
- Removed the alternative `ILogger` path in `NativeLibraryConfig`; instead the `ILogger` is wrapped in a delegate.
* Saving a GC handle to keep the log callback alive
* Removed the prefix; the logger should already do that.
* Buffering messages until a newline is encountered before passing the log message to the `ILogger`.
* Added a trailing `\n` to log messages from loading.
- Using `ThreadLocal<StringBuilder>` to ensure messages from separate threads don't get mixed together.
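A small sketch of wiring a managed logger into the native log callback; `NativeLogConfig` is named above, but the exact registration method shown is an assumption.

```csharp
using LLama.Native;
using Microsoft.Extensions.Logging;

// Sketch: register a managed logger early, before any real llama.cpp call.
// Registration is deferred internally until NativeLibraryConfig has loaded
// the library, and messages are buffered per-thread until a newline arrives.
using var factory = LoggerFactory.Create(builder => builder.AddConsole());
var logger = factory.CreateLogger("llama.cpp");

// Method name assumed from the description above; it may differ.
NativeLogConfig.llama_log_set(logger);

// Subsequent calls (including model loading) now report progress
// through the registered callback instead of stderr.
```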
This commit was originally made by lcarrere in https://github.com/SciSharp/LLamaSharp/issues/180 .
I have confirmed this modification works on my Windows 11 laptop, and made this commit at the request of AsakusaRinne.
* Previously, when a conversation was forked, the parent and the child shared exactly the same logits. Since sampling is allowed to modify logits, this could cause problems (e.g. one conversation is sampled and overwrites the logits with zeros, then the second conversation is sampled and generates nonsense). Fixed by setting a "forked" flag: logits are copied whenever the flag is set. The flag is cleared the next time the conversation is prompted, so the extra copy only happens once after a fork.
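Roughly, the idea looks like the following simplified sketch; field and method names are illustrative, not the actual LLamaSharp implementation.

```csharp
using System;

// Simplified sketch of the copy-on-fork idea described above.
class ConversationSketch
{
    private bool _forked;
    private float[] _logits = Array.Empty<float>();

    public ConversationSketch Fork()
    {
        var child = (ConversationSketch)MemberwiseClone();
        // Parent and child now reference the same logits buffer, so mark
        // both as forked to force a copy before sampling can mutate them.
        _forked = true;
        child._forked = true;
        return child;
    }

    public float[] GetLogits()
    {
        if (_forked)
        {
            // Copy once, so a sampler mutating this buffer cannot corrupt
            // the sibling conversation's logits.
            _logits = (float[])_logits.Clone();
        }
        return _logits;
    }

    public void Prompt(string text)
    {
        // Prompting gives this conversation fresh logits of its own, so the
        // flag is cleared here and the extra copy happens at most once.
        _forked = false;
        // ... tokenize `text` and queue it for inference ...
    }
}
```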
* Removed finalizer from `BatchedExecutor`. This class does not directly own any unmanaged resources so it is not necessary.
Replaced the `BatchedExecutor.Prompt(string)` method with `BatchedExecutor.Create()`. This improves the API in two ways (see the sketch after this list):
- A conversation can be created, without immediately prompting it
- Other prompting overloads (e.g. prompt with token list) can be used without duplicating all the overloads onto `BatchedExecutor`
Added `BatchSize` property to `LLamaContext`
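A short sketch of the new flow under these changes; `Create()`, the string `Prompt` overload and the `BatchSize` property come from the notes above, while the model path, prompt text and the `executor.Context` access path are assumptions.

```csharp
using System;
using LLama;
using LLama.Batched;
using LLama.Common;

// Sketch: Create() returns an un-prompted conversation, and prompting goes
// through Conversation's own overloads (string shown here; a token-list
// overload is implied above).
var parameters = new ModelParams("model.gguf");
using var weights = LLamaWeights.LoadFromFile(parameters);
using var executor = new BatchedExecutor(weights, parameters);

// Previously executor.Prompt("...") created *and* prompted in one call.
var conversation = executor.Create();
conversation.Prompt("Tell me a story about a robot.");

// The new BatchSize property exposes how many tokens fit in a single batch.
Console.WriteLine($"Batch size: {executor.Context.BatchSize}");
```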
- Modified library loading to be based on `SetDllImportResolver`. This replaces the built-in loading system and ensures there can't be two libraries loaded at once (see the sketch after this list).
- llava and llama are loaded separately, as needed.
- All the previous loading logic is still used, within the `SetDllImportResolver`
- Split out CUDA, AVX and MacOS paths to separate helper methods.
- `Description` now specifies if it is for `llama` or `llava`
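For reference, a generic sketch of the `SetDllImportResolver` pattern this loading is built on; the library names and selection logic are illustrative, not the exact LLamaSharp resolver.

```csharp
using System;
using System.Reflection;
using System.Runtime.InteropServices;

// Generic sketch: every [DllImport] in this assembly is routed through one
// resolver, so llama and llava can be resolved lazily and loaded only once.
static class NativeLoadingSketch
{
    public static void Install()
    {
        NativeLibrary.SetDllImportResolver(typeof(NativeLoadingSketch).Assembly, Resolve);
    }

    private static IntPtr Resolve(string libraryName, Assembly assembly, DllImportSearchPath? searchPath)
    {
        // The real resolver picks CUDA / AVX / MacOS specific paths here.
        if (libraryName == "llama" || libraryName == "llava")
            return NativeLibrary.Load(SelectBinaryPath(libraryName));

        // IntPtr.Zero falls back to the default loading behaviour.
        return IntPtr.Zero;
    }

    private static string SelectBinaryPath(string libraryName)
    {
        // Placeholder for the CUDA/AVX/MacOS selection described above.
        return libraryName;
    }
}
```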
* Add llava binaries and update all binaries for the tests
* Llava API + LlavaTest (preliminary)
* First prototype of Load + Unit Test
* Temporarily run tests on the LlavaAPI branch
* Disable the Embed test to review the rest of the tests
* Restore Embedding test
* Use BatchThread to eval image embeddings
Tested the Threads default value to ensure it doesn't cause problems.
* Rename test file
* Update action versions
* Test only one method, no release embeddings
* Revert "Test only one method, no release embeddings"
This reverts commit 264e176dcc.
* Correct API call
* Only test llava related functionality
* CUDA and CLBlast binaries
* Restore build policy
* Changes related to code review
* Add SafeHandles
* Set overwrite to upload-artifact@v4
* Revert to upload-artifact@v3
* revert to upload-artifact@v3
* Added a lock object to `SafeLlamaModelHandle` which all calls to `llama_decode` (in `SafeLLamaContextHandle`) take first. This prevents two contexts from running inference on the same model at the same time, which seems to be unsafe in llama.cpp.
* Modified the lock to be global over _all_ inferences. This seems to be necessary (at least with the CUDA backend).
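Illustratively, the locking described in these two items amounts to something like the following sketch; the names are illustrative, and the real lock lives in `SafeLlamaModelHandle` and is taken around `llama_decode` in `SafeLLamaContextHandle`.

```csharp
using System;

// Sketch of the global-lock idea: all decode calls, across all contexts and
// all models, serialize on one shared lock object. Names are illustrative.
static class GlobalInferenceLockSketch
{
    // A single lock shared by every context; with the CUDA backend even
    // separate models apparently cannot run inference concurrently.
    private static readonly object GlobalInferenceLock = new();

    public static int Decode(Func<int> nativeDecode)
    {
        // e.g. nativeDecode = () => NativeApi.llama_decode(ctx, batch)
        lock (GlobalInferenceLock)
            return nativeDecode();
    }
}
```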