Overview
Start here. What I do, results, and where to go next.
Last updated
Start here. What I do, results, and where to go next.
Freelance AWS Machine Learning Engineer. I build cost-efficient ML systems.
São Paulo, Brazil (remote) contact@antoniovfranco.com Website · GitHub · Medium · LinkedIn
AWS ML architecture.
Cost optimization for training and inference.
PEFT fine-tuning: LoRA, QLoRA, QDoRA.
Production MLOps: CI/CD, monitoring, retraining.
40-60% AWS cost reduction on ML workloads.
5,000+ RPS inference with p95 under 100ms.
Processing cost under 0.15 USD per document.
85% infra savings with multi-adapter serving.
Last updated: Jan 2026.
Last updated