LLamaSharp

Commit Graph

Author	SHA1	Message	Date
Martin Evans	d58fcbbd13	Fixed antiprompt checking	2 years ago
Martin Evans	08f1615e60	Converted LLamaStatelessExecutor to run `Exec` calls inside an awaited task. This unblocks async callers while the model is being evaluated. Added a "spinner" to the `StatelessModeExecute` demo, which spins while waiting for the next token (demonstrating that it's not blocked).	2 years ago
Martin Evans	fe54f6764f	Added unit tests for extension methods. Removed unused `AddRangeSpan` extension.	2 years ago
Haiping	79fa74d59c	Merge pull request #177 from redthing1/fix/context-getstate fix opaque GetState (fixes #176)	2 years ago
redthing1	b78044347c	fix opaque GetState (fixes #176)	2 years ago
Haiping	e1af7a96da	Merge pull request #175 from redthing1/feat/inferenceparams_record make InferenceParams a record so we can use `with`	2 years ago
redthing1	296ba607de	make InferenceParams a record so we can use with	2 years ago
Haiping	10678a83d6	Merge pull request #65 from martindevans/alternative_dependency_loading CPU Feature Detection	2 years ago
Haiping	f134c5af59	Merge pull request #163 from SignalRT/DefaultMetal MacOS default build now is metal llama.cpp #2901	2 years ago
Martin Evans	3f80190f85	Minimal changes required to remove non-async inference.	2 years ago
Martin Evans	b1e9d8240d	Merge pull request #149 from martindevans/removed_unused_inference_params Removed unused properties of `InferenceParams` & `ModelParams`	2 years ago
Martin Evans	daf09eae64	Skipping tokenization of empty strings (saves allocating an empty array every time)	2 years ago
Martin Evans	466722dcff	Merge pull request #165 from martindevans/better_instruct_antiprompt_checking better_instruct_antiprompt_checking	2 years ago
Martin Evans	d08a125020	Using the `TokensEndsWithAnyString` extensions for antiprompt checking in instruct executor. Simpler and more efficient.	2 years ago
Martin Evans	bba801f4b7	Added a property to get the KV cache size from a context	2 years ago
SignalRT	c41e448d0e	ggml-metal.metal MUST be copied to output folder. Metal depends on this file to execute, and the MacOS llama.cpp default is now Metal.	2 years ago
SignalRT	096293a026	MacOS: Remove Metal as it is the current default. See "on Mac OS enable Metal by default" #2901	2 years ago
Martin Evans	b47977300a	Removed one more unused parameter	2 years ago
Martin Evans	a1b0349561	Removed `ModelAlias` property (unused)	2 years ago
Martin Evans	4dac142bd5	Merge pull request #160 from martindevans/GetState_fix `GetState()` fix	2 years ago
Martin Evans	832bf7dbe0	Simplified implementation of `GetState` and fixed a memory leak (`bigMemory` was never freed)	2 years ago
Martin Evans	4f7b6ffdcc	Removed `GenerateResult` method that was only used in one place	2 years ago
sa_ddam213	09d8f434f2	Extract LLamaLogLevel, Remove Logger class	2 years ago
sa_ddam213	949b0cde16	Replace ILLamaLogger for ILogger	2 years ago
sa_ddam213	70b36f8996	Add Microsoft.Extensions.Logging.Abstractions, update any required deps	2 years ago
Martin Evans	d3b8ee988c	Beam Search (#155): Added the low level bindings to beam search.	2 years ago
Martin Evans	a09aa86324	Merge pull request #153 from martindevans/fix_savestate_OpenOrCreate Changed `OpenOrCreate` to `Create`	2 years ago
Martin Evans	f366aa3abe	Changed `OpenOrCreate` to `Create` to fix #151	2 years ago
Martin Evans	77bd090150	Simplified `LLamaInteractExecutor` antiprompt matching by using new extension method	2 years ago
Martin Evans	614ba40948	Added a `TokensEndsWithAnyString` extension to `IReadOnlyList<int>` which efficiently checks if a set of tokens ends with one of a set of strings (minimal amount of characters converted, allocation free). Added `TokensToSpan` to `SafeLlamaModelHandle` which converts as many tokens as possible into a character span (allocation free).	2 years ago
Martin Evans	d79a6556a1	Removed 3 unused properties of `InferenceParams`	2 years ago
Martin Evans	6a842014ac	Removed duplicate `llama_sample_classifier_free_guidance` method	2 years ago
Martin Evans	4a53cdc56b	Merge pull request #142 from SciSharp/rinne-dev refactor: remove old version files.	2 years ago
Martin Evans	33035c82bf	Removed `LLamaNewlineTokens` from `InteractiveExecutorState`. This is always set in the constructor from the context, so there's no point serializing it.	2 years ago
Yaohui Liu	18294a725e	refactor: remove old version files.	2 years ago
Martin Evans	8f58a40fb9	Added Linux dependency loading	2 years ago
Martin Evans	dd4957471f	Changed paths to match what the GitHub build action produces	2 years ago
Martin Evans	756a1ad0ba	Added a new way to load dependencies, performing CPU feature detection	2 years ago
Martin Evans	025741a73e	Fixed My Name. The D is for my middle name 😄	2 years ago
Yaohui Liu	20b5363601	fix: remove the history commit of embedding length property.	2 years ago
Yaohui Liu	3a847623ab	docs: update the docs to follow new version.	2 years ago
Yaohui Liu	ca6624edb3	Merge branch 'master' of github.com:SciSharp/LLamaSharp into rinne-dev	2 years ago
Rinne	4e83e48ad1	Merge pull request #122 from martindevans/gguf Add GGUF support	2 years ago
Martin Evans	97349d93be	Merge branch 'gguf' of github.com:martindevans/LLamaSharp into gguf	2 years ago
Martin Evans	bcf06e2652	Added some comments on various native methods	2 years ago
Martin Evans	af680ac2d7	Created a hierarchy of exceptions for grammar format issues. This allows the base catch-all exception to be caught for general handling, or more specific exceptions to be caught for more specific handling.	2 years ago
Rinne	1533ee7dbf	Merge pull request #138 from drasticactions/semantic-kernel Enable Semantic kernel support	2 years ago
Tim Miller	326c802be7	Have weights generate context	2 years ago
Tim Miller	3bca3b632e	New line	2 years ago
Tim Miller	9a1d6f99f2	Add Semantic Kernel support	2 years ago

264 Commits (6571e982e1dbdbbd3deaf4690da5e7f4e8bf20a3)