Routine steps
Classification, extraction, validation, routing, and repair do not always need the most expensive model in the stack.
This is the pre-sale proof surface for teams that need burst capacity without runaway idle cost. It shows what the USD 99 diagnostic would inspect before recommending scaling, load testing, monitoring, alerts, or auto-recovery changes.
Flev workflows can route routine, structured steps to local or private small models while keeping stronger models available for complex reasoning.
Teams should be able to see which model handled which step, why fallback exists, and who can approve changes.
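The routing idea above can be sketched in a few lines. This is a minimal illustration, not Flev's implementation: the model labels (`small-local`, `strong-hosted`), the `RouteDecision` record, and the `small_model_ok` flag are all hypothetical names chosen for the example. The point it shows is that each decision carries the step, the model, and the reason, so the "which model handled which step" question has a recorded answer.

```python
from dataclasses import dataclass

# Step names come from the text above; model labels are hypothetical placeholders.
ROUTINE_STEPS = {"classification", "extraction", "validation", "routing", "repair"}


@dataclass(frozen=True)
class RouteDecision:
    step: str
    model: str
    reason: str


def choose_model(step: str, small_model_ok: bool = True) -> RouteDecision:
    """Pick a model tier for one workflow step and record why."""
    if step in ROUTINE_STEPS and small_model_ok:
        return RouteDecision(step, "small-local", "routine structured step")
    if step in ROUTINE_STEPS:
        # Fallback path: the small model is unavailable or its output was rejected.
        return RouteDecision(step, "strong-hosted", "fallback from small model")
    return RouteDecision(step, "strong-hosted", "complex reasoning step")


# A tiny audit trail: one routine step, one fallback, one complex step.
audit_log = [
    choose_model("extraction"),
    choose_model("validation", small_model_ok=False),
    choose_model("planning"),
]
for decision in audit_log:
    print(f"{decision.step}: {decision.model} ({decision.reason})")
```

Who may change `ROUTINE_STEPS` or the fallback rule is a process question the sketch does not answer; it only makes the decision visible.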
Better Call evidence shows tool-call accuracy improving from 73.4% to 83.8% on 3,625 granite4.1:3b BFCL v4 cases.
The first pass stays read-only: collect facts, identify missing measurements, and decide which small tests prove whether the system can safely reach 500 RPS.
Separate steady-state, burst windows, cacheable paths, authenticated paths, background jobs, and webhook or ingest spikes.
Map Railway services, process types, concurrency limits, health checks, restart behavior, and idle-cost controls.
Review Cloudflare caching, rate limits, origin shielding, Postgres connection limits, slow queries, pool sizing, and migration risk.
A useful 500 RPS answer should not be a guess. It should show which signals prove readiness, which gaps block confidence, and which change needs explicit approval.
| Area | What to inspect | Acceptance signal |
|---|---|---|
| Load test | Run a staged read-only test against safe endpoints with ramp, hold, and burst phases. | P95 latency, error rate, CPU/memory, and queue depth remain inside the agreed threshold at 500 RPS. |
| Postgres | Check connection pool usage, slow queries, lock waits, index coverage, and burst behavior under representative reads/writes. | No pool exhaustion, lock pileups, or query plan regressions during the target burst. |
| Railway | Review service sizing, autoscaling or manual scale plan, health checks, restart policy, and cost at idle. | Burst path is documented, rollback is clear, and idle cost does not require overprovisioning. |
| Cloudflare | Check cache rules, bypass paths, rate limits, WAF noise, origin response headers, and observability. | Cacheable traffic stays at the edge and protected origin paths remain measurable. |
| Recovery | Review alert thresholds, synthetic checks, deployment rollback, and auto-recovery limits. | Operators know which failures self-heal, which page a human, and which require rollback approval. |
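The ramp, hold, and burst phases named in the load-test row can be written down as data before any tool is chosen. The sketch below is illustrative only: the phase durations and the intermediate 50 and 300 RPS levels are assumptions made for the example, and only the 500 RPS target comes from the text. It produces a per-second target rate that a load generator of your choice could follow.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Phase:
    name: str
    seconds: int
    start_rps: int
    end_rps: int


def schedule(phases):
    """Return the target RPS for each second, interpolated linearly within a phase."""
    targets = []
    for phase in phases:
        for t in range(phase.seconds):
            frac = t / (phase.seconds - 1) if phase.seconds > 1 else 1.0
            targets.append(round(phase.start_rps + frac * (phase.end_rps - phase.start_rps)))
    return targets


# Assumed shape: durations and the 50/300 RPS levels are placeholders to agree on.
PLAN = [
    Phase("ramp", 300, 50, 300),
    Phase("hold", 300, 300, 300),
    Phase("burst", 60, 500, 500),
]

targets = schedule(PLAN)
print(len(targets), "seconds planned, peak", max(targets), "RPS")
```

Keeping the plan as plain data makes it easy to review and approve before anything is sent to a real endpoint.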
This is the shape of the first paid answer: short enough to read quickly, specific enough for an engineer to challenge, and explicit about what is not proven yet.
Postgres connection pressure and cache-bypass routes are checked before increasing Railway process count. Scaling app workers first can make the database failure arrive faster.
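The reasoning above is mostly arithmetic, and a rough sketch makes it concrete. The numbers here are assumptions for illustration, not measurements from any real system: each app worker is assumed to hold its own connection pool, and `reserved` stands for connections kept back for migrations, admin sessions, or background jobs.

```python
def connection_budget(workers: int, pool_size_per_worker: int,
                      max_connections: int, reserved: int = 0) -> dict:
    """Compare worst-case connection demand with what Postgres can accept."""
    demand = workers * pool_size_per_worker
    available = max_connections - reserved
    return {
        "demand": demand,
        "available": available,
        "headroom": available - demand,
        "fits": demand <= available,
    }


# Assumed figures: 10 connections per worker, max_connections of 100, 10 reserved.
before = connection_budget(workers=4, pool_size_per_worker=10,
                           max_connections=100, reserved=10)
after = connection_budget(workers=12, pool_size_per_worker=10,
                          max_connections=100, reserved=10)

print("before scaling:", before)
print("after scaling: ", after)
```

With these assumed figures, tripling the workers turns 50 spare connections into a shortfall of 30, which is why the pool is checked first. A connection pooler in front of Postgres changes the arithmetic and would need its own limits in the model.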
Start with a read-only ramp on public cacheable routes and one authenticated representative route. Stop when P95, 5xx rate, pool usage, or origin CPU crosses the agreed guardrail.
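The stop rule above can be expressed as a small check over periodic measurements. This is a sketch under stated assumptions: the threshold values in `GUARDRAILS` are placeholders for whatever is agreed with the team, and the `Sample` fields are hypothetical names for the four signals the text lists.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    p95_ms: float          # 95th percentile latency in milliseconds
    error_5xx_rate: float  # fraction of responses that were 5xx
    pool_usage: float      # fraction of the Postgres pool in use
    origin_cpu: float      # fraction of origin CPU in use


# Placeholder limits; the real values are the agreed guardrails.
GUARDRAILS = {"p95_ms": 400.0, "error_5xx_rate": 0.01,
              "pool_usage": 0.80, "origin_cpu": 0.85}


def breached(sample: Sample, guardrails: dict = GUARDRAILS) -> list:
    """Return the names of every guardrail this sample exceeds."""
    return [name for name, limit in guardrails.items()
            if getattr(sample, name) > limit]


def first_stop(samples):
    """Return (index, breached names) for the first bad sample, or None."""
    for index, sample in enumerate(samples):
        hits = breached(sample)
        if hits:
            return index, hits
    return None


samples = [
    Sample(p95_ms=180, error_5xx_rate=0.000, pool_usage=0.40, origin_cpu=0.35),
    Sample(p95_ms=260, error_5xx_rate=0.002, pool_usage=0.65, origin_cpu=0.55),
    Sample(p95_ms=390, error_5xx_rate=0.004, pool_usage=0.86, origin_cpu=0.70),
]
print(first_stop(samples))
```

In this made-up run the test would stop at the third sample because pool usage crossed its limit while latency was still inside the threshold, which is the kind of early signal the ramp is meant to surface.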
No production cache-rule change, Railway scale change, migration, or auto-recovery action runs inside the diagnostic without explicit approval.
If the evidence supports it, the next scope can implement the load-test script, Cloudflare rules, Railway scale plan, Postgres pool tuning, and alert thresholds.
The quickest scope confirmation comes from a short, concrete message. These bullets are enough to decide whether the USD 99 diagnostic fits.
The packet is intentionally narrow: it makes the next scaling decision safer, but it does not pretend to be an unlimited infrastructure migration.
In scope: one service or app surface, one target traffic shape, one evidence table, one load-test plan, one runbook, and one paid-scope recommendation.
Out of scope: production mutation, full redesign, long-running monitoring implementation, database migration, incident response retainer, or guaranteed 500 RPS capacity.
Interest, intake, or a draft invoice is not counted as income. The experiment only counts cleared payment evidence.
Send the failing or scaling path, the target burst shape, and what would count as safe enough to proceed. We will confirm whether it fits the fixed USD 99 diagnostic before any payment request.