Martin Evans
|
ea12ff4e07
|
Merge pull request #502 from vikramvee/Examples
Updated Examples
|
1 year ago |
vikramvee
|
ebd853fede
|
Updated Examples
|
1 year ago |
Martin Evans
|
859160d6f7
|
Merge pull request #501 from martindevans/LLamaPos_inc_dec
Added increment and decrement operators to `LLamaPos`
|
2 years ago |
Martin Evans
|
90915c5a99
|
Added increment and decrement operators to `LLamaPos`
|
2 years ago |
Martin Evans
|
82c471eac4
|
Merge pull request #500 from martindevans/improved_kv_cache_methods
Small KV Cache Handling Improvements
|
2 years ago |
Martin Evans
|
a8f9262b7f
|
Merge pull request #499 from martindevans/better_batch_processing
Using `AddRange` in `LLamaEmbedder`
|
2 years ago |
Martin Evans
|
c5146bac23
|
- Exposed KV debug view through `SafeLLamaContextHandle`
- Added `KvCacheSequenceDivide`
- Moved count tokens/cells methods to `SafeLLamaContextHandle`
|
2 years ago |
Martin Evans
|
744758f110
|
Using `AddRange` in `LLamaEmbedder`
|
2 years ago |
Martin Evans
|
5d80a56d11
|
Merge pull request #496 from martindevans/smaller_unit_test_model
Smaller Unit Test Model
|
2 years ago |
Martin Evans
|
418345cbaf
|
Limited parallelism of CI jobs
|
2 years ago |
Martin Evans
|
df38d73c79
|
Switched to `Q3_K_S` for the unit test model, instead of `Q4`. This is almost 1 GB smaller, and _may_ make the macOS tests less flaky.
|
2 years ago |
Martin Evans
|
ac7faa0f93
|
Merge pull request #495 from martindevans/quantise_new_formats
Added new file types to quantisation
|
2 years ago |
Martin Evans
|
c7103e86e4
|
Added new file types to quantisation
|
2 years ago |
Martin Evans
|
17385e12b6
|
Merge pull request #479 from martindevans/update_binaries_feb_2024
Update binaries feb 2024
|
2 years ago |
Martin Evans
|
21bdecd049
|
Merge branch 'update_binaries_feb_2024' of github.com:martindevans/LLamaSharp into update_binaries_feb_2024
|
2 years ago |
Martin Evans
|
bac40a3b7a
|
Added new binaries, from this run: https://github.com/SciSharp/LLamaSharp/actions/runs/7792319886
|
2 years ago |
Martin Evans
|
0e2521cff4
|
Merge pull request #493 from jasoncouture/remove_sha256_check
Disable SHA256 check
|
2 years ago |
Jason Couture
|
a101224c34
|
Disable SHA256 check
|
2 years ago |
Martin Evans
|
0592164dc3
|
Merge pull request #489 from jasoncouture/clblast_nuget
Create nuspec for OpenCL
|
2 years ago |
Jason Couture
|
1f45bae2cf
|
Update compile.yml
Fix SHA256 hash path
|
2 years ago |
Jason Couture
|
c963b051e2
|
Add nuspec for OpenCL (CLBLAST)
|
2 years ago |
Martin Evans
|
d468df08d6
|
Merge pull request #487 from jasoncouture/clblast_linux
CLBlast for Linux
|
2 years ago |
Martin Evans
|
d4f3f642c3
|
Merge pull request #488 from jasoncouture/concurrency_limit
Only allow one build in parallel per ref
|
2 years ago |
Jason Couture
|
bfe3ad50aa
|
Only allow one build in parallel per ref
|
2 years ago |
Jason Couture
|
f7a6eaa49f
|
Cancel previous builds when a new build starts, because this build is expensive.
|
2 years ago |
Jason Couture
|
277175af4d
|
CLBlast for Linux
This builds CLBlast support for Linux and copies the CLBlast shared
library to the artifacts on both Windows and Linux.
|
2 years ago |
Martin Evans
|
7dbaed2d3b
|
Update README.md
|
2 years ago |
Martin Evans
|
dfac029dde
|
Merge pull request #485 from zsogitbe/master
KernelMemory EmbeddingMode bug correction
|
2 years ago |
Zoli Somogyi
|
f578fcafa3
|
KernelMemory EmbeddingMode bug correction
|
2 years ago |
Martin Evans
|
765c697f77
|
Fixed number type
|
2 years ago |
Martin Evans
|
b2e815d51e
|
Updated all binaries (from this run: https://github.com/SciSharp/LLamaSharp/actions/runs/7746303349)
|
2 years ago |
Martin Evans
|
2323988cc7
|
Merge pull request #478 from martindevans/fixed_artifact_paths
Updated download-artifact to v4
|
2 years ago |
Martin Evans
|
b5674ead97
|
Updated download-artifact to v4
|
2 years ago |
Martin Evans
|
15a98b36d8
|
Updated everything to work with llama.cpp ce32060198
|
2 years ago |
Martin Evans
|
48798837fb
|
Merge pull request #477 from martindevans/updated_cuda_toolkit
Updated compile.yml to use `Jimver/cuda-toolkit@v0.2.14`
|
2 years ago |
Martin Evans
|
2df7e35c81
|
Updated compile.yml to use `Jimver/cuda-toolkit@v0.2.14`
|
2 years ago |
Martin Evans
|
72687ca230
|
Merge pull request #476 from martindevans/upload_artifact_v4
Updated compile.yml to use `actions/upload-artifact@v4`
|
2 years ago |
Martin Evans
|
d8ed875a2a
|
Updated compile.yml to use `actions/upload-artifact@v4`
|
2 years ago |
Martin Evans
|
faa342ca13
|
Merge pull request #475 from martindevans/checkout_v4
Updated compile.yml to use checkout v4
|
2 years ago |
Martin Evans
|
ae40d993c4
|
Updated compile.yml to use checkout v4
|
2 years ago |
Martin Evans
|
3523c51fa3
|
Merge pull request #474 from martindevans/embeddings_generator_decode
Swapped `GetEmbeddings` to `llama_decode`
|
2 years ago |
Martin Evans
|
c9c8cd0d62
|
- Swapped embeddings generator to use `llama_decode`
- Modified `GetEmbeddings` method to be async
|
2 years ago |
Martin Evans
|
3b08874bec
|
Merge pull request #468 from jasoncouture/clblast
Add CLBLAST native library to native libraries build
|
2 years ago |
Martin Evans
|
22aba9a671
|
Merge pull request #473 from martindevans/base_handle_removed_constructor
Removed `SafeLLamaHandleBase` Constructor
|
2 years ago |
Martin Evans
|
2488f74bbd
|
Merge pull request #472 from martindevans/remove_params_interface_set
Removed `IModelParams` and `IContextParams` setters.
|
2 years ago |
Martin Evans
|
5da2a2f64b
|
- Removed one of the constructors of `SafeLLamaHandleBase`, which implicitly states that memory is owned. Better to be explicit about this kind of thing!
- Also fixed `ToString()` in `SafeLLamaHandleBase`
|
2 years ago |
Martin Evans
|
9b995510d6
|
Removed all setters in `IModelParams` and `IContextParams`, allowing implementations to be immutable.
|
2 years ago |
Martin Evans
|
f9a9aaabca
|
Merge pull request #471 from jasoncouture/master
Fix incorrect event input variable name
|
2 years ago |
Martin Evans
|
96d7d37f80
|
Merge pull request #469 from jasoncouture/library_name_fix
Fix missing library name prefix for CUDA
|
2 years ago |
Jason Couture
|
52a85c35e6
|
Add missing CMAKE prefix
|
2 years ago |