Tutorials

  • Tutorial
Why Serverless Compute Partners Are Now More Important Than Ever
  • Tutorial
Deploying a global scale, AI voice agent with 500ms latency.
  • Tutorial
Deploying Ultravox on Cerebrium for Ultra-low Latency Voice Applications
  • Tutorial
Building a Real-time Coding Assistant
  • Tutorial
Creating a realtime AI Commentator with Cerebrium, LiveKit and Cartesia
  • Tutorial
Overcoming Transcription Challenges for Multilingual AI voice agents
  • Tutorial
ML apps at scale: ASGI support now available on Cerebrium
  • Tutorial
An Alternative to OpenAI Realtime API for Voice Capabilities
  • Tutorial
Benchmarking vLLM, SGLang and TensorRT for Llama 3.1 API
  • Tutorial
Cerebrium supports HIPAA compliance: A guide for health applications
  • Tutorial
How to Build a Real-Time AI Avatar for Training and Coaching
  • Tutorial
Building a Real-Time Shopping Assistant: Turn Live Video into Instant Purchases
  • Tutorial
Using Codestral to Summarize, Correct and Auto-Approve Pull Requests
  • Tutorial
Getting better price-performance, latency, and availability on AWS Trn1/Inf2 instances
  • Tutorial
Running Llama 3 8B with TensorRT-LLM on Serverless GPUs
Next