# Overview

Freelance AWS Machine Learning Engineer. I build cost-efficient ML systems.

São Paulo, Brazil (remote)\
<contact@antoniovfranco.com>\
[Website](https://antoniovfranco.com/) · [GitHub](https://github.com/AntonioVFranco) · [Medium](https://medium.com/@AntonioVFranco) · [LinkedIn](https://linkedin.com/in/antoniovfranco)

### What I do

* AWS ML architecture.
* Cost optimization for training and inference.
* PEFT fine-tuning: LoRA, QLoRA, QDoRA.
* Production MLOps: CI/CD, monitoring, retraining.

### Selected outcomes

* 40-60% AWS cost reduction on ML workloads.
* 5,000+ RPS inference with p95 latency under 100 ms.
* Processing cost under 0.15 USD per document.
* 85% infra savings with multi-adapter serving.

### Deep dives

* [ML architecture on AWS](https://antoniovfranco.gitbook.io/antoniovfranco-docs/deep-dives/aws-ml-architecture-on-aws)
* [AWS cost optimization for ML](https://antoniovfranco.gitbook.io/antoniovfranco-docs/deep-dives/aws-cost-optimization-for-ml)
* [Parameter-efficient fine-tuning (LoRA, QLoRA, QDoRA)](https://antoniovfranco.gitbook.io/antoniovfranco-docs/deep-dives/parameter-efficient-fine-tuning-lora-qlora-qdora)
* [MLOps and production ML systems](https://antoniovfranco.gitbook.io/antoniovfranco-docs/deep-dives/mlops-and-production-ml-systems)

### Portfolio

* [Services and engagement model](https://antoniovfranco.gitbook.io/antoniovfranco-docs/portfolio/services-and-engagement-model)
* [Client case studies](https://antoniovfranco.gitbook.io/antoniovfranco-docs/portfolio/client-case-studies)
* [Debugging and problem solving](https://antoniovfranco.gitbook.io/antoniovfranco-docs/portfolio/debugging-and-problem-solving)
* [Skills and tooling](https://antoniovfranco.gitbook.io/antoniovfranco-docs/portfolio/skills-and-tooling)
* [Writing and open source](https://antoniovfranco.gitbook.io/antoniovfranco-docs/portfolio/writing-and-open-source)

*Last updated: Jan 2026.*
