How is a deployment token different from a model token?
Deployment tokens control access to the inference service endpoint when authentication is enabled for a deployment. They are ideal for production environments due to their controlled access. Deployment tokens can be created and managed via the CLI or UI and can have a set expiration date and time.
Model tokens authenticate and interact with all inference services deployed by a specific model. They are particularly useful during development, as they remain valid with the packaged model even when deployments are created or deleted. Model tokens are generated via the CLI using aioli model token <MODEL_NAME>
and do not expire. To learn how to create a model token, visit Interact with a deployment.