|
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261 |
- # Inference Parameters
-
- Different from `LLamaModel`, when using an exeuctor, `InferenceParams` is passed to the `Infer` method instead of constructor. This is because executors only define the ways to run the model, therefore in each run, you can change the settings for this time inference.
-
-
- # InferenceParams
-
- Namespace: LLama.Common
-
- ```csharp
- public class InferenceParams
- ```
-
- Inheritance [Object](https://docs.microsoft.com/en-us/dotnet/api/system.object) → [InferenceParams]()
-
- ## Properties
-
- ### **TokensKeep**
-
- number of tokens to keep from initial prompt
-
- ```csharp
- public int TokensKeep { get; set; }
- ```
-
- #### Property Value
-
- [Int32](https://docs.microsoft.com/en-us/dotnet/api/system.int32)<br>
-
- ### **MaxTokens**
-
- how many new tokens to predict (n_predict), set to -1 to inifinitely generate response
- until it complete.
-
- ```csharp
- public int MaxTokens { get; set; }
- ```
-
- #### Property Value
-
- [Int32](https://docs.microsoft.com/en-us/dotnet/api/system.int32)<br>
-
- ### **LogitBias**
-
- logit bias for specific tokens
-
- ```csharp
- public Dictionary<int, float> LogitBias { get; set; }
- ```
-
- #### Property Value
-
- [Dictionary<Int32, Single>](https://docs.microsoft.com/en-us/dotnet/api/system.collections.generic.dictionary-2)<br>
-
- ### **AntiPrompts**
-
- Sequences where the model will stop generating further tokens.
-
- ```csharp
- public IEnumerable<string> AntiPrompts { get; set; }
- ```
-
- #### Property Value
-
- [IEnumerable<String>](https://docs.microsoft.com/en-us/dotnet/api/system.collections.generic.ienumerable-1)<br>
-
- ### **PathSession**
-
- path to file for saving/loading model eval state
-
- ```csharp
- public string PathSession { get; set; }
- ```
-
- #### Property Value
-
- [String](https://docs.microsoft.com/en-us/dotnet/api/system.string)<br>
-
- ### **InputSuffix**
-
- string to suffix user inputs with
-
- ```csharp
- public string InputSuffix { get; set; }
- ```
-
- #### Property Value
-
- [String](https://docs.microsoft.com/en-us/dotnet/api/system.string)<br>
-
- ### **InputPrefix**
-
- string to prefix user inputs with
-
- ```csharp
- public string InputPrefix { get; set; }
- ```
-
- #### Property Value
-
- [String](https://docs.microsoft.com/en-us/dotnet/api/system.string)<br>
-
- ### **TopK**
-
- 0 or lower to use vocab size
-
- ```csharp
- public int TopK { get; set; }
- ```
-
- #### Property Value
-
- [Int32](https://docs.microsoft.com/en-us/dotnet/api/system.int32)<br>
-
- ### **TopP**
-
- 1.0 = disabled
-
- ```csharp
- public float TopP { get; set; }
- ```
-
- #### Property Value
-
- [Single](https://docs.microsoft.com/en-us/dotnet/api/system.single)<br>
-
- ### **TfsZ**
-
- 1.0 = disabled
-
- ```csharp
- public float TfsZ { get; set; }
- ```
-
- #### Property Value
-
- [Single](https://docs.microsoft.com/en-us/dotnet/api/system.single)<br>
-
- ### **TypicalP**
-
- 1.0 = disabled
-
- ```csharp
- public float TypicalP { get; set; }
- ```
-
- #### Property Value
-
- [Single](https://docs.microsoft.com/en-us/dotnet/api/system.single)<br>
-
- ### **Temperature**
-
- 1.0 = disabled
-
- ```csharp
- public float Temperature { get; set; }
- ```
-
- #### Property Value
-
- [Single](https://docs.microsoft.com/en-us/dotnet/api/system.single)<br>
-
- ### **RepeatPenalty**
-
- 1.0 = disabled
-
- ```csharp
- public float RepeatPenalty { get; set; }
- ```
-
- #### Property Value
-
- [Single](https://docs.microsoft.com/en-us/dotnet/api/system.single)<br>
-
- ### **RepeatLastTokensCount**
-
- last n tokens to penalize (0 = disable penalty, -1 = context size) (repeat_last_n)
-
- ```csharp
- public int RepeatLastTokensCount { get; set; }
- ```
-
- #### Property Value
-
- [Int32](https://docs.microsoft.com/en-us/dotnet/api/system.int32)<br>
-
- ### **FrequencyPenalty**
-
- frequency penalty coefficient
- 0.0 = disabled
-
- ```csharp
- public float FrequencyPenalty { get; set; }
- ```
-
- #### Property Value
-
- [Single](https://docs.microsoft.com/en-us/dotnet/api/system.single)<br>
-
- ### **PresencePenalty**
-
- presence penalty coefficient
- 0.0 = disabled
-
- ```csharp
- public float PresencePenalty { get; set; }
- ```
-
- #### Property Value
-
- [Single](https://docs.microsoft.com/en-us/dotnet/api/system.single)<br>
-
- ### **Mirostat**
-
- Mirostat uses tokens instead of words.
- algorithm described in the paper https://arxiv.org/abs/2007.14966.
- 0 = disabled, 1 = mirostat, 2 = mirostat 2.0
-
- ```csharp
- public MiroStateType Mirostat { get; set; }
- ```
-
- #### Property Value
-
- [MiroStateType]()<br>
-
- ### **MirostatTau**
-
- target entropy
-
- ```csharp
- public float MirostatTau { get; set; }
- ```
-
- #### Property Value
-
- [Single](https://docs.microsoft.com/en-us/dotnet/api/system.single)<br>
-
- ### **MirostatEta**
-
- learning rate
-
- ```csharp
- public float MirostatEta { get; set; }
- ```
-
- #### Property Value
-
- [Single](https://docs.microsoft.com/en-us/dotnet/api/system.single)<br>
-
- ### **PenalizeNL**
-
- consider newlines as a repeatable token (penalize_nl)
-
- ```csharp
- public bool PenalizeNL { get; set; }
- ```
-
- #### Property Value
-
- [Boolean](https://docs.microsoft.com/en-us/dotnet/api/system.boolean)<br>
|