Martin Evans
d47b6afe4d
Normalizing embeddings in `LLamaEmbedder`. As is done in llama.cpp: 2891c8aa9a/examples/embedding/embedding.cpp (L92)
1 year ago
Martin Evans
c9c8cd0d62
- Swapped embeddings generator to use `llama_decode`
- Modified `GetEmbeddings` method to be async
1 year ago
Martin Evans
9b995510d6
Removed all setters in `IModelParams` and `IContextParams`, allowing implementations to be immutable.
1 year ago
Martin Evans
48c5039054
Improved test coverage. Discovered some issues:
FixedSizeQueue:
- Enqueue would always stop one short of filling the capacity
- Fill would only _replace_ existing items. It was only used in a place where there were not existing items! Removed the method entirely.
LLamaGrammarElement:
- Converted into a `record` struct, removed all of the (now unnecessary) equality stuff.
2 years ago
Martin Evans
e89ca5cc17
Fixed a few minor warnings
2 years ago
Martin Evans
0c98ae1955
Passing ctx to `llama_token_nl(_ctx)`
2 years ago
Martin Evans
634bd3feed
Loosened requirements even more!
2 years ago
Martin Evans
9587699a60
loosened requirements even further for CI
2 years ago
Martin Evans
bc70358b3e
expanded epsilon enough to allow CI to pass
2 years ago
Martin Evans
08501db155
cleaned up basic test a bit
2 years ago
Martin Evans
498ed109c1
Fixed dot product based test
2 years ago
Martin Evans
35e48691ea
Added a known correct value to the unit tests
2 years ago
Martin Evans
dff308e31b
Added a test for the LLamaEmbedder
2 years ago