AI was supposed to take my job ...

The PoC: Banking Sentinel

A customer support agent for ROGERVINAS bank built with Strands Agents.

3 mock accounts, 5 transactions each
Tools: freeze/unfreeze card, list transactions, open/check/list disputes
Chat UI served by FastAPI
Structured response: answer + suggested_actions
Session memory with FileSessionManager
Configurable model: Gemini, Bedrock, Ollama

The agent is intentionally simple — no RAG, no external service calls. The goal is to keep the agent logic minimal so the focus stays on observability and evaluations.

AI was supposed to take my job ...

Instead it gave me a new one: Evaluations

Try it yourself

Why Langfuse?

Testing classic apps vs AI apps

The solution: traces + evaluations

What this PoC covers

The PoC: Banking Sentinel

The PoC: Banking Sentinel

Langfuse Tracing

Langfuse Tracing

Offline Evaluations — Strands Evals

Offline Evaluations — Strands Evals

Offline Evaluations — Langfuse Experiments

Offline Evaluations — Langfuse Experiments

Online Evaluations — LLM-as-judge

Online Evaluations — LLM-as-judge

External Evaluations

External Evaluations

Annotation Queues

Annotation Queues

Annotation Queues - trace

Prompt Management

Prompt Management

Prompt Management

CI/CD

AI didn't take my job. It gave me a new one.