Available for work --:--:-- PT

Julian Vargas.

AI Engineer — production RAG and agent systems, and the infrastructure they run on.

Most engineers can call an LLM API. Fewer can deploy, observe, secure, and evaluate what they ship. I build production RAG and agent systems — grounding, evaluation, and observability first — and the infrastructure they run on.

Open to remote AI Engineer, AI Platform, and LLMOps roles.

Explore the work Resume

01 / Work 2024 — Present

01 Shipped · v0.1.0

2-layerTenant isolation · Postgres RLS
5Pluggable model protocols
12Architecture decisions documented

ForgeDocs AI

Agentic document Q&A that cites its sources — hybrid RAG with inline [n] citations, database-enforced multi-tenancy, an eval floor in CI, and honest abstention instead of hallucination.

Python
FastAPI
LangGraph
Postgres · pgvector
Hybrid RAG · RRF
Row-Level Security

Deep dive View source

02 Live

~3 minProvision time
$6/moAlways-on cost
3CI workflows · 1 review gate

terraform-homelab

The repo that ships this site: a hardened $6/mo VPS, GitOps plan/apply pipeline with a production review gate, and every piece of the infrastructure in code. You're looking at the artifact.

Terraform
GitHub Actions
cloud-init
Vultr
Cloudflare
Caddy

Deep dive View source

03 Running 24/7

4Alert streams
24/7Uptime since deploy
2Channels (push · pull)

monitoring-platform

Prometheus + Grafana + a custom Python exporter watching live game servers 24/7 — Discord pushes for events that need a human, Prometheus pulls for trends.

Python
Prometheus
Grafana
Docker Compose

Deep dive View source

04 Running 24/7

5–10 GbpsDefense ceiling, single VPS
4,631Reputation CIDRs (FireHOL + Spamhaus)
~153 MBRAM at idle
5Containers, 1 VPS

halo-ce-command-center

Layered DDoS defense, live player telemetry, and Discord ops for the Halo CE servers I host — five containers on one VPS, running around the clock.

Python
Lua (SAPP)
iptables · ipset
Prometheus
Docker Compose

Deep dive View source

05 Shipped · v0.1.0

5Lifecycle commands
2Cloud providers

infra-automator

One CLI — up | harden | deploy | status | destroy — owning the full lifecycle of a small cloud footprint across two providers, with Terraform underneath and Ansible hardening.

Python
Click
Terraform
Ansible

Deep dive View source

06 Shipped · v0.1.0

~90sCluster rebuild time
20 MiBContainer image
2 podsnginx replicas

k3s-homelab

This site again, deployed a second way: single-node k3s, a Helm chart, cert-manager TLS — rebuilt from nothing in ~90 seconds.

Kubernetes
k3s
Helm
cert-manager

Deep dive View source

02 / About

This site is the artifact, not the brochure — the repo that builds it is on the work list above.

I'm a self-taught engineer who builds production LLM systems — retrieval, agents, structured extraction — and the infrastructure they run on. The part I care about most is whether the thing actually works outside the demo: knowing when an answer is grounded versus guessed, catching the regressions before users do, watching how the whole thing degrades when something upstream breaks.

The infra work is the other half. I provision, harden, and operate real services in code — VPS, DNS, TLS, monitoring — and try to write down what I chose, what I weighed, and the parts I'd do differently next time. Most of what's on this site I run on my own boxes.

Bilingual (EN/ES), US/Mexican dual citizen, based in California on Pacific Time. I work remote, async-first. I move quickly when something clicks and I'd rather ship a small thing that works than plan a big thing that doesn't.

03 / Stack

AI / LLM Systems

LangGraph (multi-agent)
Hybrid RAG · RRF
pgvector + BM25
Cross-encoder rerank
Pydantic extraction
RAGAS-shaped evals
LLM-as-judge
Langfuse / tracing
Ollama (local-first)

Backends & Infrastructure

Python · FastAPI
Next.js · TypeScript
Postgres / Supabase
Terraform
Ansible · cloud-init
Docker · Compose
Cloudflare · Vultr
Caddy · Let's Encrypt

Observability & Security

Prometheus
Grafana
Postgres RLS
UFW · fail2ban
Linux · systemd
GitHub Actions (CI)
pytest · ruff · mypy

04 / Contact

Let's talk.

julivnexe@gmail.com

github.com/julivnexe linkedin.com/in/julivnexe Download resume (PDF)

California · Pacific Time · Open to remote, anywhere