Planned Highlights
- Request volume and token usage by model and time range
- Latency percentiles and error rates for each deployment
- Cost and quota insights to track spend and limits
- Exportable reports for audits and internal sharing
Usage and performance analytics for RamaLama Cloud (coming soon).