Yaohui Liu
|
cb5fb210b1
|
feat: optimize apis for cuda feature detection.
|
2 years ago |
Martin Evans
|
bca55eace0
|
Initial changes to match the llama.cpp changes
|
2 years ago |
Yaohui Liu
|
18294a725e
|
refactor: remove old version files.
|
2 years ago |
Martin Evans
|
2830e5755c
|
- Applied a lot of minor R# code quality suggestions. Lots of unnecessary imports removed.
- Deleted `NativeInfo` (internal class, not used anywhere)
|
2 years ago |
Yaohui Liu
|
2eb2d6df83
|
test: add 9 examples of the new version.
|
2 years ago |
Yaohui Liu
|
a3b8186f20
|
feat: support save and load chat session.
|
2 years ago |
Yaohui Liu
|
bdbd6aa824
|
feat: add transforms for chat session.
|
2 years ago |
Yaohui Liu
|
3bf74ec9b9
|
feat: add chat session for refactored code.
|
2 years ago |
Yaohui Liu
|
908b79e855
|
feat: add stateless executor.
|
2 years ago |
Yaohui Liu
|
e603a09137
|
fix: state loading and saving not working.
|
2 years ago |
Yaohui Liu
|
5679e08718
|
feat: add ILLamaExecutor.InferAsync.
|
2 years ago |
Yaohui Liu
|
aaa0cba9f8
|
test: update the examples after refactoring.
|
2 years ago |
Yaohui Liu
|
56c56b9c51
|
refactor: drop LLamaModelV1.
|
2 years ago |
Yaohui Liu
|
18c2ff2395
|
refactor: instruct mode and examples.
|
2 years ago |
Yaohui Liu
|
6ffcb5306b
|
refactor: use official api of quantization instead.
|
2 years ago |
Yaohui Liu
|
0958bbac2c
|
feat: add get-embedding api to LLamaModel.
|
2 years ago |
Haiping Chen
|
21c36cbf80
|
Added WebAPI.
|
2 years ago |
Yaohui Liu
|
33067f990f
|
feat: run quantization in csharp.
|
2 years ago |
Yaohui Liu
|
648b14fd0f
|
docs: revise examples.
|
2 years ago |
Yaohui Liu
|
856d6549de
|
build: add linux support.
|
2 years ago |
Yaohui Liu
|
8978410eab
|
docs: add examples for running LLamaSharp.
|
2 years ago |