Martin Evans
7ebff89f68
Merge pull request #101 from martindevans/llama_sample_classifier_free_guidance
llama_sample_classifier_free_guidance
2 years ago
Martin Evans
6c84accce8
Added `llama_sample_classifier_free_guidance` method from native API
2 years ago
Martin Evans
8e8cf1404d
Merge pull request #100 from martindevans/Logger_comments
Logger Comments
2 years ago
Martin Evans
afe559ef1c
Added comments to `Logger` and fixed some nullability warnings
2 years ago
Martin Evans
8fbb7e8eae
Merge pull request #99 from martindevans/native_api_improvements
Improved `NativeApi` file a bit
2 years ago
Martin Evans
d7f971fc22
Improved `NativeApi` file a bit:
- Added some more comments
- Modified `llama_tokenize` to not allocate
- Modified `llama_tokenize_native` to take a pointer instead of an array, allowing use with no allocations
- Removed GgmlInitParams (not used)
2 years ago
Martin Evans
841cf88e3b
Merge pull request #96 from martindevans/minor_quantizer_improvements
Minor quantizer improvements
2 years ago
Martin Evans
9b4d0e3bdd
Merge pull request #95 from saddam213/logging
Add native logging output
2 years ago
Martin Evans
ce325b49c7
Rewritten comments
2 years ago
Martin Evans
b69f4bc40e
- Expanded range of supported types in quantizer to match llama.cpp
- Rewritten `LLamaFtype` parsing to support any substring which uniquely matches a single enum variant
2 years ago
sa_ddam213
a67ea36dd9
Typo and formatting
2 years ago
sa_ddam213
726987b761
Add native logging output
2 years ago
Martin Evans
acd91341e6
Added lots of comments to all the LLamaFtype variants
2 years ago
Martin Evans
270c6d55ef
Merge pull request #88 from martindevans/fix_serialization_nan
Fix serialization error due to NaN
2 years ago
Martin Evans
f612275dc0
Merge pull request #89 from martindevans/fix_use_mmap
Fixed `ToLlamaContextParams` using the wrong parameter for `use_mmap`
2 years ago
Martin Evans
91bcefc852
comment on IModelParamsExtensions
2 years ago
Martin Evans
9cdc72aa67
Fixed `ToLlamaContextParams` using the wrong parameter for `use_mmap`
2 years ago
Martin Evans
bab3b46f0c
Merge pull request #82 from martindevans/tokenization_cleanup
Utils Cleanup
2 years ago
Martin Evans
5b0d3ba610
Merge pull request #86 from saddam213/upstream_master
Access to IModelParamsExtensions
2 years ago
Martin Evans
b5de3ee5aa
Fixed some final mentions of "mirostate" instead of "mirostat"
2 years ago
Martin Evans
be52737488
Using a nullable float instead of NaN, this should fix the serialization issue reported in #85
2 years ago
sa_ddam213
2d1269cae9
Access to IModelParamsExtensions
2 years ago
Martin Evans
ad0aed5cdf
Merge branch 'master' into tokenization_cleanup
2 years ago
Martin Evans
1fceeaf352
Applied fix from #84 (antiprompt does not work in stateless executor)
2 years ago
Rinne
250c024402
Merge pull request #84 from SciSharp/rinne-dev
fix: antiprompt does not work in stateless executor.
2 years ago
Yaohui Liu
d609b0e1d5
Merge branch 'master' of github.com:SciSharp/LLamaSharp into rinne-dev
2 years ago
Yaohui Liu
b60c8bd285
fix: antiprompt does not work in stateless executor.
2 years ago
Martin Evans
2b2d3af26b
Moved `Eval` out of `Utils` and into `SafeLLamaContextHandle`
2 years ago
Martin Evans
7fabcc1849
One last `TokenToString` case
2 years ago
Martin Evans
0e5e00e300
Moved `TokenToString` from Utils into `SafeLLamaContextHandle` (thin wrappers around the same method in `SafeLlamaModelHandle`)
2 years ago
Martin Evans
2d811b2603
- Moved `GetLogits` into `SafeLLamaContextHandle`
- Added disposal check into `SafeLLamaContextHandle`
2 years ago
Martin Evans
cd3cf2b77d
- Moved tokenization from `Utils.Tokenize` into `SafeLLamaContextHandle.Tokenize`, one less thing in `Utils`.
- Also refactored it to return an `int[]` instead of an `IEnumerable<int>`, solving the "multiple enumeration" problems at the source!
2 years ago
Martin Evans
73882de591
Merge pull request #81 from martindevans/tensor_splits_array
Improved Tensor Splits
2 years ago
Martin Evans
a9880d5665
Merge pull request #83 from martindevans/FixedSizeQueue_size_check
Cleaned up multiple enumeration in FixedSizeQueue
2 years ago
Martin Evans
bd3d8d3dc4
Cleaned up multiple enumeration in FixedSizeQueue
2 years ago
Martin Evans
f2499371ea
Pulled conversion of a `IModelParams` into a `LLamaContextParams` out into an extension method which can be used in other places.
2 years ago
Martin Evans
f1111a9f8b
Using a pin instead of a `fixed` block
2 years ago
Martin Evans
2c933c57a1
Fixed ModelOptions in Web project
2 years ago
Martin Evans
685eb3b9c2
Replaced `nint` with `float[]?` in Model params, which is much more user friendly!
2 years ago
Martin Evans
18b1df6762
Merge pull request #79 from saddam213/upstream_master
ModelParams & InferenceParams abstractions
2 years ago
sa_ddam213
e02d0c3617
Merge branch 'master' of https://github.com/SciSharp/LLamaSharp into upstream_master
2 years ago
Rinne
486c317887
docs: update README.md
2 years ago
Rinne
4d3cdbcd66
docs: add info about prerelease v0.4.2.
2 years ago
Rinne
bfe9cc8961
Merge pull request #78 from SciSharp/rinne-dev
feat: update the llama backends.
2 years ago
Yaohui Liu
1b15d28fc9
ci: remove MACOS ci pipelines.
2 years ago
sa_ddam213
e46646b8db
Merge branch 'master' of https://github.com/SciSharp/LLamaSharp into upstream_master
2 years ago
Martin Evans
5e15077766
Merge pull request #80 from saddam213/ExposeNative
Expose some native classes
2 years ago
Yaohui Liu
bb46a990d0
fix: add bug info for native api.
2 years ago
Yaohui Liu
e64c05126b
ci: update ci content.
2 years ago
Yaohui Liu
f16db72841
ci: add debug info.
2 years ago