Quick start
The fastest path is to deploy a model image that bundles runtime + model using docker compose. For more information about deploying in production environments check out deployment.Install dependencies
Getting started requires either Docker or Podman. We also recommend the RamaLama CLI for a streamlined experience.
- Install Podman or Docker
- (Optional) Install RamaLama CLI
Create docker-compose.yaml
Create a
docker-compose.yaml using a model image which bundles both the runtime and model together into a single runnable container.
