Agentic
stack review.
An architecture advisory for Savvy Loans.
01 the brief
What we reviewed, and against what.
We read the architecture Ramee sent over, set it against the conversations we've had with the Savvy team, and built our recommendation from that and what we've learned.
02 the bottom line
You've built the loop.
You're still building the harness.
It's a thoughtful, honest pilot. It's also roughly 3,000 lines of bespoke harness, plus a roadmap that is almost all undifferentiated infrastructure: sandboxing, audit logs, dedupe, a spend guard, auth, least‑privilege IAM.
Our recommendation: don't hand‑build that roadmap. Use AWS‑managed agent infrastructure for sandboxing, isolation, identity, memory and audit, and route the model through Bedrock so inference stays in AWS.
03 credit where due
What's genuinely strong. Keep these.
- ✓The swappable seam is real.Tools and prompts have zero dependency on the loop, proven by a live CLI to LangGraph migration.
- ✓Honesty about risk.The spec lists verified issues, not a sales pitch. Rare, and it earns trust.
- ✓Human in the loop, to start.The right posture for a lender today, with a clear path to dial it down to the minimum as trust is earned.
- ✓Structured completion.
finishandsubmit_triageseparate what the agent decided from whether it's done. - ✓Real tests.The test suite drives the actual graph, not mocks.
- ✓Clean AWS hygiene.Terraform, OIDC with no static keys, internal‑only ALB, Secrets Manager.
04 risk register
Five things we'd push on.
-
01
high
No evals
The tests check the wiring, not the judgement. Nothing measures whether an agent makes the right call, so quality can't be tracked or trusted.
-
02
high
Thin observability
Transcripts are discarded and there's no run audit log. When an agent does the wrong thing, there's little to debug with, and nothing to show a regulator.
-
03
high
Single point of failure
One task, no autoscaling. Every deploy is downtime and any crash stops all work.
-
04
strategic
Building harness, not agents
Concurrency, memory, dedupe, audit, scaling. Real work, but none of it differentiates Savvy. It's the team's time going into plumbing.
-
05
cost
Unbounded cost
The top model runs on every poll, with no spend guard and no model tiering. Spend grows with traffic, uncapped.
05 a worked example
Input you don't control, tools you do.
This isn't a flaw in what they've built today. It's the failure mode to design against. As autonomy grows, the trust boundary has to move from the host to the tool. Worth keeping front of mind while building, and a big reason to put the agents inside a managed, isolated runtime.
06 how we got here
The options, and our recommendation.
| Option | Stays in AWS | Harness to own | Isolation & audit | Speed to ship |
|---|---|---|---|---|
| Status quoLangGraph + direct API, on ECS | No | High | DIY | Built, but stuck |
| Just move to Bedrockmodel layer only | Yes | High | DIY | Fast, partial |
| Bedrock + AgentCorelift the agents onto the managed runtime | Yes | Low | Managed | Fast, durable |
| Re-platform from scratchnew framework, new infra | Depends | High | Varies | Slow |
07 the framework question
LangGraph, or something more native?
08 what we're proposing
The target stack, layer by layer.
09 target architecture
One event, into a live agent.
10 the path
Staged, low-risk, no rewrite to start.
-
1
first
Lift
Containerise the existing LangGraph agents and run them on AgentCore, unchanged. Point the model at Bedrock. Residency solved, isolation gained, the singleton retired.
-
2
next
Wire
Move state to AgentCore Memory, tools behind the Gateway, guardrails into Policy, traces to CloudWatch. Most of the roadmap becomes config.
-
3
later
Evolve
Where a framework switch earns its keep, rewrite that agent to Strands or the Claude Agent SDK. Per agent, measured, never big-bang.
The honest counter-argument: LangGraph 1.0 is stable, and its explicit, auditable control flow is something a regulated lender may prefer. AgentCore also deepens the AWS commitment. Both are fine here: residency and speed are the brief, and the runtime keeps the framework reversible. Switch on evidence, not fashion.
Claude on Bedrock.
Agents on AgentCore.
The rest is yours to build.
~/savvy-loans/agentic-stack/2026