- Passing encoding in the `IModelParams`, which reduces how often encoding needs to be passed around
The biggest single change is renaming `LLamaModel` to `LLamaContext`