Martin Evans
a2bae178fa
Added a `Metadata` property to `LLamaWeights`
2 years ago
Rinne
4635185767
Merge pull request #378 from asmirnov82/fix_docs
Fix broken references in docs
2 years ago
Aleksei Smirnov
1c88bd5a74
fix broken references in docs (after commit "docs: update the docs to follow new version")
2 years ago
Martin Evans
7139281e4d
Merge pull request #373 from martindevans/enhanced_avx512
Improved support for AVX512
2 years ago
Martin Evans
a0232a8534
Merge pull request #372 from martindevans/custom_pipeline_stateless_reset
Reset Custom Sampling Pipeline
2 years ago
Martin Evans
1b13f7c717
Improved support for AVX512:
- Enabled more features in build process (VBMI and VNNI)
- Added runtime checking for this features
- Improved runtime checking to no longer require dotnet8.0
2 years ago
Martin Evans
82d84afaea
Resetting the custom sampling pipeline in the stateless executor
2 years ago
Martin Evans
c298ab828a
Merge pull request #368 from martindevans/context_set_seed
Context Set Seed
2 years ago
Martin Evans
dc8e5d88f7
Update LLama/LLamaContext.cs
2 years ago
Martin Evans
a3177ab140
Merge pull request #369 from martindevans/rename_llama_sample_temperature
Renamed `llama_sample_temperature` to `llama_sample_temp`
2 years ago
Martin Evans
cbc4c8d9af
Merge pull request #370 from martindevans/copy_grammar
Clone Grammar
2 years ago
Martin Evans
db7ecf5a43
Added a method to create a clone of a grammar instance
2 years ago
Martin Evans
ea523d2e2a
Renamed `llama_sample_temperature` to `llama_sample_temp`, Mirroring the same change made in llama.cpp
2 years ago
Martin Evans
2df3e7617e
Added a method to set the RNG seed on the context
2 years ago
Martin Evans
8494fd5769
Merge pull request #367 from martindevans/fix_quantisation_pure_field
Added missing field to LLamaModelQuantizeParams
2 years ago
Martin Evans
cedef5e45a
Added the `pure` field to `LLamaModelQuantizeParams` (it's been added to llama.cpp)
2 years ago
Martin Evans
6650e683f4
Merge pull request #361 from martindevans/update_binaries_dec_2023
Updated Binaries December 2023
2 years ago
Martin Evans
199b4418c3
Fixed JSON serialization for ModelParams
2 years ago
Martin Evans
6128c18824
Merge pull request #366 from asmirnov82/fix_code_in_readme
Fix documentation to reflect changes in ChatSession API
2 years ago
Aleksei Smirnov
8558d109b4
Fix documentation according to the changes in newer versions
2 years ago
Martin Evans
2f0deeadcd
Implemented serialization for `MetadataOverride`. Deserialization is broken (converter is never called)
2 years ago
Martin Evans
b868b056f7
Added metadata overrides to `IModelParams`
2 years ago
Martin Evans
b22d8b7495
- Added `GroupDisposable` to dispose a collection of items all together
- Renamed `LLamaModelKvOverride` to `LLamaModelMetadataOverride`
2 years ago
Martin Evans
5ad2cd1d3c
Added a comment on the type itself
2 years ago
Martin Evans
b0270b5788
Added comments on GGMLType
2 years ago
Martin Evans
b3e576608b
fixed safe handle
2 years ago
Martin Evans
bab6b65b61
Added a safe handle for LLamaKvCacheView
2 years ago
Martin Evans
439d14a061
Updated binaries:
- build run: https://github.com/SciSharp/LLamaSharp/actions/runs/7196891440
- commit: 9fb13f9584
2 years ago
Martin Evans
9813a30193
Merge pull request #358 from xbotter/unittest-net8
Upgrade unittest target framework to .NET 8.0
2 years ago
xbotter
340bbbcf48
Move JSON converter for TensorSplitsCollection
2 years ago
Martin Evans
728f7077d8
Merge pull request #360 from martindevans/macos_avx
Added AVX and AVX2 to MacOS x86_64 builds
2 years ago
Martin Evans
0b8422ea7f
Added AVX and AVX2 to MacOS x86_64 builds
2 years ago
Martin Evans
01c7f1b4da
Update LLama/Common/ModelParams.cs
2 years ago
Martin Evans
e6148c952e
Fixed encoding of `Encoding`
2 years ago
xbotter
df66d7e0c6
Upgrade unittest target framework to .net8
2 years ago
Martin Evans
d87d654a34
Merge pull request #348 from martindevans/new_object_based_sampling_pipeline
Custom Sampling Pipelines
2 years ago
Martin Evans
50c1b2d52d
Merge pull request #344 from philippjbauer/master
Rebuilt ChatSession class
2 years ago
Martin Evans
8a2923a7bc
Merge pull request #352 from SciSharp/dependabot/nuget/xunit-2.6.3
build(deps): bump xunit from 2.6.2 to 2.6.3
2 years ago
Rinne
fb75e06293
fix: output prefix of Chinese example.
2 years ago
dependabot[bot]
85dc43dde0
build(deps): bump xunit from 2.6.2 to 2.6.3
Bumps [xunit](https://github.com/xunit/xunit ) from 2.6.2 to 2.6.3.
- [Commits](https://github.com/xunit/xunit/compare/2.6.2...2.6.3 )
---
updated-dependencies:
- dependency-name: xunit
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
2 years ago
Martin Evans
ec6d0e8778
Merge pull request #353 from SciSharp/dependabot/nuget/xunit.runner.visualstudio-2.5.5
build(deps): bump xunit.runner.visualstudio from 2.5.4 to 2.5.5
2 years ago
Rinne
836f071cd0
fix: Chinese example.
2 years ago
dependabot[bot]
8fb4476813
build(deps): bump xunit.runner.visualstudio from 2.5.4 to 2.5.5
Bumps [xunit.runner.visualstudio](https://github.com/xunit/visualstudio.xunit ) from 2.5.4 to 2.5.5.
- [Release notes](https://github.com/xunit/visualstudio.xunit/releases )
- [Commits](https://github.com/xunit/visualstudio.xunit/compare/2.5.4...2.5.5 )
---
updated-dependencies:
- dependency-name: xunit.runner.visualstudio
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
2 years ago
Philipp Bauer
29c5c6e93c
Update the StatefulChatService to use new ChatSession integration
2 years ago
Philipp Bauer
f669a4f5a7
Update the Chinese chat sample to use new ChatSession integration
2 years ago
Philipp Bauer
2cc01efdae
Merge branch 'SciSharp:master' into master
2 years ago
Rinne
3019da9d22
Merge pull request #341 from xbotter/sk/chat-stateless
🔧 Refactor Semantic Kernel chat completion implementation
2 years ago
Martin Evans
835958398c
- Removed the object wrappers and configurable pipeline, they can be better written in code.
- Added BaseSamplingPipeline which provides a base impl of `ISamplingPipeline`
- Added `DefaultSamplingPipeline` which mimics normal llama.cpp sampling
2 years ago
Martin Evans
3afc007499
- Added "protected" logits, instead of the awkward save/load mechanism
- Added an example usage to one of the tests
2 years ago
Martin Evans
b34f72a883
- Added `SamplingPipeline` to inference params which overrides all other options with an entirely custom pipeline.
- Added a `Sample` method to `LLamaContext` which uses a custom pipeline
- Modified all executors to use the custom pipeline if it exists
2 years ago