Quick start
The fastest path is to deploy a model image that bundles runtime + model using docker compose. For more information about deploying in production environments check out deployment.1
Install dependencies
Getting started requires either Docker or Podman. We also recommend the RamaLama CLI for a streamlined experience.
- Install Podman or Docker
- (Optional) Install RamaLama CLI
2
Create docker-compose.yaml
Create a
docker-compose.yaml using a model image which bundles both the runtime and model together into a single runnable container.3
Start the stack
4
Get chatting
