Martin Evans
d58fcbbd13
Fixed antiprompt checking
2 years ago
Martin Evans
08f1615e60
- Converted LLamaStatelessExecutor to run `Exec` calls inside an awaited task. This unblocks async callers while the model is being evaluated.
- Added a "spinner" to the `StatelessModeExecute` demo, which spins while waiting for the next token (demonstrating that it's not blocked).
2 years ago
Martin Evans
fe54f6764f
- Added unit tests for extension methods
- Removed unused `AddRangeSpan` extension
2 years ago
Haiping
79fa74d59c
Merge pull request #177 from redthing1/fix/context-getstate
fix opaque GetState (fixes #176 )
2 years ago
redthing1
b78044347c
fix opaque GetState ( fixes #176 )
2 years ago
Haiping
e1af7a96da
Merge pull request #175 from redthing1/feat/inferenceparams_record
make InferenceParams a record so we can use `with`
2 years ago
redthing1
296ba607de
make InferenceParams a record so we can use with
2 years ago
Haiping
10678a83d6
Merge pull request #65 from martindevans/alternative_dependency_loading
CPU Feature Detection
2 years ago
Haiping
f134c5af59
Merge pull request #163 from SignalRT/DefaultMetal
MacOS default build now is metal llama.cpp #2901
2 years ago
Martin Evans
3f80190f85
Minimal changes required to remove non-async inference.
2 years ago
Martin Evans
b1e9d8240d
Merge pull request #149 from martindevans/removed_unused_inference_params
Removed unused properties of `InferenceParams` & `ModelParams`
2 years ago
Martin Evans
daf09eae64
Skipping tokenization of empty strings (saves allocating an empty array every time)
2 years ago
Martin Evans
466722dcff
Merge pull request #165 from martindevans/better_instruct_antiprompt_checking
better_instruct_antiprompt_checking
2 years ago
Martin Evans
d08a125020
Using the `TokensEndsWithAnyString` extensions for antiprompt checking in instruct executor. Simpler and more efficient.
2 years ago
Martin Evans
bba801f4b7
Added a property to get the KV cache size from a context
2 years ago
SignalRT
c41e448d0e
ggml-metal.metal MUST be copied to output folder
Metal depends on this file to execute, and MacOS llama.cpp defaults is now METAL.
2 years ago
SignalRT
096293a026
MacOS Remove Metal as is the current default
See on Mac OS enable Metal by default #2901
2 years ago
Martin Evans
b47977300a
Removed one more unused parameter
2 years ago
Martin Evans
a1b0349561
Removed `ModelAlias` property (unused)
2 years ago
Martin Evans
4dac142bd5
Merge pull request #160 from martindevans/GetState_fix
`GetState()` fix
2 years ago
Martin Evans
832bf7dbe0
Simplified implementation of `GetState` and fixed a memory leak (`bigMemory` was never freed)
2 years ago
Martin Evans
4f7b6ffdcc
Removed `GenerateResult` method that was only used in one place
2 years ago
sa_ddam213
09d8f434f2
Extract LLamaLogLevel, Remove Logger class
2 years ago
sa_ddam213
949b0cde16
Replace ILLamaLogger for ILogger
2 years ago
sa_ddam213
70b36f8996
Add Microsoft.Extensions.Logging.Abstractions, update any required deps
2 years ago
Martin Evans
d3b8ee988c
Beam Search ( #155 )
* Added the low level bindings to beam search.
2 years ago
Martin Evans
a09aa86324
Merge pull request #153 from martindevans/fix_savestate_OpenOrCreate
Changed `OpenOrCreate` to `Create`
2 years ago
Martin Evans
f366aa3abe
Changed `OpenOrCreate` to `Create` to fix #151
2 years ago
Martin Evans
77bd090150
Simplified `LLamaInteractExecutor` antiprompt matching by using new extension method
2 years ago
Martin Evans
614ba40948
- Added a `TokensEndsWithAnyString` extension to `IReadOnlyList<int>` which efficiently checks if a set of tokens ends with one of a set of strings.
- Minimal amount of characters converted
- Allocation free
- Added `TokensToSpan` to `SafeLlamaModelHandle` which converts as many tokens as possible into a character span
- Allocation free
2 years ago
Martin Evans
d79a6556a1
Removed 3 unused properties of `InferenceParams`
2 years ago
Martin Evans
6a842014ac
Removed duplicate `llama_sample_classifier_free_guidance` method
2 years ago
Martin Evans
4a53cdc56b
Merge pull request #142 from SciSharp/rinne-dev
refactor: remove old version files.
2 years ago
Martin Evans
33035c82bf
- Removed `LLamaNewlineTokens` from `InteractiveExecutorState`. This is always set in the constructor from the context, so there's no point serializing it.
2 years ago
Yaohui Liu
18294a725e
refactor: remove old version files.
2 years ago
Martin Evans
8f58a40fb9
Added Linux dependency loading
2 years ago
Martin Evans
dd4957471f
Changed paths to match what the GitHub build action produces
2 years ago
Martin Evans
756a1ad0ba
Added a new way to load dependencies, performing CPU feature detection
2 years ago
Martin Evans
025741a73e
Fixed My Name
The D is for my middle name 😄
2 years ago
Yaohui Liu
20b5363601
fix: remove the history commit of embedding length property.
2 years ago
Yaohui Liu
3a847623ab
docs: update the docs to follow new version.
2 years ago
Yaohui Liu
ca6624edb3
Merge branch 'master' of github.com:SciSharp/LLamaSharp into rinne-dev
2 years ago
Rinne
4e83e48ad1
Merge pull request #122 from martindevans/gguf
Add GGUF support
2 years ago
Martin Evans
97349d93be
Merge branch 'gguf' of github.com:martindevans/LLamaSharp into gguf
2 years ago
Martin Evans
bcf06e2652
Added some comments on various native methods
2 years ago
Martin Evans
af680ac2d7
Created a hierarchy of exceptions for grammar format issues. This allows the base catch-all exception to be caught for general handling, or more specific exceptions to be caught for more specific handling.
2 years ago
Rinne
1533ee7dbf
Merge pull request #138 from drasticactions/semantic-kernel
Enable Semantic kernel support
2 years ago
Tim Miller
326c802be7
Have weights generate context
2 years ago
Tim Miller
3bca3b632e
New line
2 years ago
Tim Miller
9a1d6f99f2
Add Semantic Kernel support
2 years ago