Martin Evans
de2b20aae5
- Added a specific exception for failing to load model weights.
- Checking if model is readable
2 years ago
Martin Evans
096e0e75f8
Check that the model file actually exists immediately before loading it. Improve #395
2 years ago
Martin Evans
3c6af909dd
Merge pull request #434 from martindevans/stateless_eos_check
Added a check for EOS token in LLamaStatelessExecutor
2 years ago
Martin Evans
f160fbd6d1
Added a check for EOS token in LLamaStatelessExecutor
2 years ago
Martin Evans
2ea2048b78
- Added a test for tokenizing just a new line (reproduce issue https://github.com/SciSharp/LLamaSharp/issues/430 )
- Properly displaying `LLamaToken`
- Removed all tokenisation code in `SafeLLamaContextHandle` - just pass it all through to the `SafeLlamaModelHandle`
- Improved `SafeLlamaModelHandle` tokenisation:
- Renting an array, for one less allocation
- Not using `&tokens[0]` to take a pointer to an array, this is redundant and doesn't work on empty arrays
2 years ago
Martin Evans
98635a0d5a
Fixed decoding of large tokens (over 16 bytes) in streaming text decoder
2 years ago
Martin Evans
402a110a3a
Merge pull request #404 from martindevans/switched_to_LLamaToken_struct
LLamaToken Struct
2 years ago
Steven Kennedy
988f2fa302
Reverted Net8.0
2 years ago
Steven Kennedy
cf2e9e35f8
Updating the GpuLayerCount to mirror the Python Port of Llama.cpp
2 years ago
Martin Evans
9573e2c9ba
Merge pull request #417 from martindevans/safe_handle_initialisation
Safer Handle Initialisation
2 years ago
Martin Evans
1e69e265b6
Moved some native methods to do with creating/destroying resources into their respective handles. There is **no** safe way to call most of these methods, everything must be done through through handles.
2 years ago
Martin Evans
8416cb866a
Improved exceptions in IModelParams for unknown KV override types.
2 years ago
Martin Evans
82727c4414
Removed collection expressions from test
2 years ago
Martin Evans
2eb52b1630
made casts to/from int explicit, fixed places affected
2 years ago
Martin Evans
42be9b136d
Switched form using raw integers, to a `LLamaToken` struct
2 years ago
Martin Evans
48ef3bb080
Added runtime checks that UseMemoryLock and UseMemorymap are actually supported.
2 years ago
Martin Evans
4e5e994dda
- directly returning a SafeLlamaModelHandle, instead of an IntPtr which is wrapped in a handle.
- made `llama_backend_init` private. This is automatically called, there is no way it can correctly be used externally.
- made `llama_token_to_piece` safe (Span instead of pointer)
2 years ago
Martin Evans
bac3e43498
Fixed handling of empty spans
2 years ago
Martin Evans
3925545147
Fixed LLamaExecutorBase.cs
2 years ago
Martin Evans
c002642268
- Removed some `unsafe` where it wasn't necessary
- Wrapped some native functions which take (pointer, length) in function which take a `span` instead.
2 years ago
Martin Evans
f860f88c36
Code cleanup driven by R# suggestions:
- Made `NativeApi` into a `static class` (it's not intended to be instantiated)
- Moved `LLamaTokenType` enum out into a separate file
- Made `LLamaSeqId` and `LLamaPos` into `record struct`, convenient to have equality etc
2 years ago
Martin Evans
2cded1b296
Fixed alignment of value fields in `LLamaModelMetadataOverride`
2 years ago
Martin Evans
3fc0f34cbe
Fixed some issues which were causing metadata overrides not to work (mostly importantly, converting the key was failing so all keys were null bytes and thus ignored).
2 years ago
Martin Evans
6be3f62321
Fixed loading of very large metadata values (over 1kb)
2 years ago
Martin Evans
f0d7468b22
Merge pull request #356 from xbotter/deps/sk-rc3
bump sk to 1.0.1 & km to 0.18
2 years ago
xbotter
211ce12bf5
LLamaEmbedder exposes the Context
2 years ago
Martin Evans
fb606c2488
Fixed incorrect values
2 years ago
Martin Evans
47e4fcef2a
Fixed GetString on netstandard2
2 years ago
Martin Evans
2a1e1b6183
Removed unused imports
2 years ago
Martin Evans
a2bae178fa
Added a `Metadata` property to `LLamaWeights`
2 years ago
Martin Evans
7139281e4d
Merge pull request #373 from martindevans/enhanced_avx512
Improved support for AVX512
2 years ago
Martin Evans
1b13f7c717
Improved support for AVX512:
- Enabled more features in build process (VBMI and VNNI)
- Added runtime checking for this features
- Improved runtime checking to no longer require dotnet8.0
2 years ago
Martin Evans
82d84afaea
Resetting the custom sampling pipeline in the stateless executor
2 years ago
Martin Evans
c298ab828a
Merge pull request #368 from martindevans/context_set_seed
Context Set Seed
2 years ago
Martin Evans
dc8e5d88f7
Update LLama/LLamaContext.cs
2 years ago
Martin Evans
a3177ab140
Merge pull request #369 from martindevans/rename_llama_sample_temperature
Renamed `llama_sample_temperature` to `llama_sample_temp`
2 years ago
Martin Evans
db7ecf5a43
Added a method to create a clone of a grammar instance
2 years ago
Martin Evans
ea523d2e2a
Renamed `llama_sample_temperature` to `llama_sample_temp`, Mirroring the same change made in llama.cpp
2 years ago
Martin Evans
2df3e7617e
Added a method to set the RNG seed on the context
2 years ago
Martin Evans
cedef5e45a
Added the `pure` field to `LLamaModelQuantizeParams` (it's been added to llama.cpp)
2 years ago
Martin Evans
199b4418c3
Fixed JSON serialization for ModelParams
2 years ago
Martin Evans
2f0deeadcd
Implemented serialization for `MetadataOverride`. Deserialization is broken (converter is never called)
2 years ago
Martin Evans
b868b056f7
Added metadata overrides to `IModelParams`
2 years ago
xbotter
ce20b30e06
Merge branch 'SciSharp:master' into deps/sk-rc3
2 years ago
Martin Evans
b22d8b7495
- Added `GroupDisposable` to dispose a collection of items all together
- Renamed `LLamaModelKvOverride` to `LLamaModelMetadataOverride`
2 years ago
Martin Evans
5ad2cd1d3c
Added a comment on the type itself
2 years ago
Martin Evans
b0270b5788
Added comments on GGMLType
2 years ago
Martin Evans
b3e576608b
fixed safe handle
2 years ago
Martin Evans
bab6b65b61
Added a safe handle for LLamaKvCacheView
2 years ago
Martin Evans
439d14a061
Updated binaries:
- build run: https://github.com/SciSharp/LLamaSharp/actions/runs/7196891440
- commit: 9fb13f9584
2 years ago