Start here. What I do, results, and where to go next.
Freelance AWS Machine Learning Engineer. I build cost-efficient ML systems.
São Paulo, Brazil (remote) contact@antoniovfranco.com Websitearrow-up-right · GitHubarrow-up-right · Mediumarrow-up-right · LinkedInarrow-up-right
AWS ML architecture.
Cost optimization for training and inference.
PEFT fine-tuning: LoRA, QLoRA, QDoRA.
Production MLOps: CI/CD, monitoring, retraining.
40-60% AWS cost reduction on ML workloads.
5,000+ RPS inference with p95 under 100ms.
Processing cost under 0.15 USD per document.
85% infra savings with multi-adapter serving.
AWS ML architecture on AWS
AWS cost optimization for ML
Parameter-efficient fine-tuning (LoRA, QLoRA, QDoRA)
MLOps and production ML systems
Services and engagement model
Client case studies
Debugging and problem solving
Skills and tooling
Writing and open source
Last updated: Jan 2026.
Last updated 1 month ago