● live/api.dalesai.com/serving inference now

I design, build, and operate private AI infrastructure.This site runs on it.

DalesAI is a complete AI platform: six open models, authentication, billing, and usage dashboards, all running on dedicated hardware I own in Gilbert, Arizona. It's not a product for sale. It's proof of what I can build for your business.

6 modelsserved from own hardware
Full stackauth / billing / dashboards
Private by designdata never leaves the box
Zero cloud APIs100% self-hosted

01 / The Platform

Everything a real AI product needs, running in production.

Most "AI consultants" resell someone else's API. I built the whole thing. When I build for you, I know every layer because I've operated every layer.

Client requestHTTPS
Secure edge tunnelZERO OPEN PORTS · TLS
API gatewayOPENAI-COMPATIBLE · AUTH · METERING
Llama 3.1 70B+ DEEPSEEK R1
Qwen / GemmaVISION ENABLED
Mistral Small+ MODEL WARMER
Apple Silicon · 128GB unified memoryM-SERIES GPU · ZERO CLOUD

fig. 01 · request path, edge to silicon · live simulation

Inference

Six open models, locally served

Llama 3.1 70B, DeepSeek R1, Qwen, Gemma, and Mistral on Apple Silicon, with vision support and a model warmer keeping responses fast.

API layer

OpenAI-compatible gateway

A unified API in front of every model. Anything built for OpenAI works against private infrastructure with one URL change.

Accounts

Auth & usage dashboards

Email-code login, per-user token tracking, and live usage dashboards backed by a production database.

Billing

Stripe subscriptions & metering

Plan-based token limits, webhooks, and automated provisioning: the full path from checkout to working API key.

Security

Hardened, tunneled, monitored

Zero open ports to the public internet, locked-down CORS, security headers, secrets out of source.

Operations

Self-healing services

Every component runs as a managed service that survives reboots and restarts itself. Built to run unattended.

02 / What I Build

AI systems for businesses that do real work.

Watch the phone run both: a missed call becoming a booked appointment, then a private assistant answering from local files. No human touches either one.

/01

AI receptionists & lead follow-up

Missed-call text-back, appointment booking, and instant lead response for service businesses, so inquiries at 9 PM become customers instead of voicemails.

/02

Private AI assistants

Internal chat and document assistants running on dedicated hardware, for businesses that want AI without sending customer data to a third-party cloud.

/03

Automation & integration

Connecting AI to the tools you already use: CRMs, calendars, payment systems, email. Webhooks and APIs that just work.

/04

Full product builds

Auth, billing, dashboards, and deployment: the same production stack this platform runs on, built for your idea.

fig. 02 · missed-call text-back, as the customer sees it

03 / About

Built by one person. Operated every day.

I'm Dale, a builder based in Gilbert, Arizona. I'm not a classically trained engineer, and I think that's an advantage: I build with modern AI tools the way your business will actually use them, and I've shipped every piece of this platform myself: the inference servers, the security hardening, the billing, the deployments.

If you want slideware, hire a consultancy. If you want a working system from someone who runs his own, email me.

04 / Work With Me

Have a project? Let's scope it.

Tell me what you're trying to automate or build. I'll reply with an honest read on whether it's worth doing, and a fixed price if it is.

Replies within one business day