LLamaSharp

Commit Graph

Author	SHA1	Message	Date
Martin Evans	2830e5755c	- Applied a lot of minor R# code quality suggestions. Lots of unnecessary imports removed. - Deleted `NativeInfo` (internal class, not used anywhere)	2 years ago
Martin Evans	a45d9089e1	Fixed demos	2 years ago
Martin Evans	a9e6f21ab8	- Creating and destroying contexts in the stateless executor, saving memory. It now uses zero memory when not inferring! - Passing encoding in the `IModelParams`, which reduces how often encoding needs to be passed around	2 years ago
Martin Evans	ae8ef17a4a	- Added various convenience overloads to `LLamaContext.Eval` - Converted `SafeLLamaContextHandle` to take a `ReadOnlySpan` for Eval, narrower type better represents what's really needed	2 years ago
Martin Evans	02a46fc363	Updated demos to use the new loading/multi context system	2 years ago
Martin Evans	e2fe08a9a2	Added a higher level `LLamaWeights` wrapper around `SafeLlamaModelHandle`	2 years ago
Martin Evans	f31bdf6b93	Using the right context for Bob	2 years ago
Martin Evans	f3511e390f	WIP demonstrating changes to support multi-context. You can see this in use in `TalkToYourself`, along with notes on what still needs improving. The biggest single change is renaming `LLamaModel` to `LLamaContext`	2 years ago
Rinne	0269af8c17	Merge branch 'master' into runtime-targets	3 years ago
Yaohui Liu	1062fe1a7e	feat: upgrade the native libraries.	3 years ago
Yaohui Liu	eed96248b5	docs: add the documentations with mkdocs.	3 years ago
Yaohui Liu	2eb2d6df83	test: add 9 examples of the new version.	3 years ago
Tim Miller	bbd2650cf0	Include runtime targets file	3 years ago
Yaohui Liu	f3565d6b2d	refactor: rename Quantizer to LLamaQuantizer.	3 years ago
Yaohui Liu	a3b8186f20	feat: support save and load chat session.	3 years ago
Yaohui Liu	bdbd6aa824	feat: add transforms for chat session.	3 years ago
Yaohui Liu	3bf74ec9b9	feat: add chat session for refactored code.	3 years ago
Yaohui Liu	908b79e855	feat: add stateless executor.	3 years ago
Yaohui Liu	e603a09137	fix: state loading and saving not working.	3 years ago
Yaohui Liu	5679e08718	feat: add ILLamaExecutor.InferAsync.	3 years ago
Yaohui Liu	aaa0cba9f8	test: update the examples after refactoring.	3 years ago
Yaohui Liu	a545ab7eda	test: remove dumplicated space of example output.	3 years ago
Yaohui Liu	56c56b9c51	refactor: drop LLamaModelV1.	3 years ago
Yaohui Liu	18c2ff2395	refactor: instruct mode and examples.	3 years ago
Yaohui Liu	6ffcb5306b	refactor: use official api of quantization instead.	3 years ago
Yaohui Liu	0958bbac2c	feat: add get-embedding api to LLamaModel.	3 years ago
Haiping Chen	21c36cbf80	Added WebAPI.	3 years ago
Yaohui Liu	33067f990f	feat: run quantization in csharp.	3 years ago
Yaohui Liu	648b14fd0f	docs: revise examples.	3 years ago
Yaohui Liu	856d6549de	build: add linux support.	3 years ago
Yaohui Liu	8978410eab	docs: add examples for running LLamaSharp.	3 years ago

31 Commits (634bd3feedeacce23252bdbfa8fa53a48d57dd60)