Agent Workbench¶

Agent Workbench is the most recent module in AIDF. Its objective is to enable building, importing, validating, and monitoring agentic LLM workflows (LangGraph, custom multi-agent systems) for business process automation.

Creating Agent Workflows¶

There are multiple entry points:

Natural language description: A business user describes the workflow in plain English; the platform generates the agent workflow graph.
Import existing workflow: AI engineers who have built LangGraph or similar workflows outside AIDF can import them.
Map to AI services: Each workflow node is mapped to an onboarded AI service; engineers wire up the code for each node.

Why Import an Already-Built Workflow?¶

Even if a workflow was built outside AIDF, importing it enables:

Validation runs — measure accuracy against ground truth datasets
Safety evaluation — test robustness against malicious inputs (prompt injection, adversarial data)
Governance compliance — check adherence to corporate/country AI governance policies
Iterative improvement tracking — history of all validation runs with logs and metrics
Production promotion — only workflows with acceptable metrics are promoted to production

Case Study: MUFG Corporate Onboarding Workflow¶

Business Problem: MUFG relationship managers must manually produce 45+ KYC/onboarding documents after interviewing a corporate customer, then process signed documents back into the banking system.

The Agent Workflow:

Relationship manager uploads interview sheet (Excel/CSV) to the web app
Node 1 (Completeness Check): AI validates whether all required interview questions were answered
Node 2 (Form Generation): Generates 45 compliance and KYC documents from interview data
Node 3 (Document Dispatch): Sends documents to the end customer for digital/wet signature
Customer returns signed documents; relationship manager uploads them
Node 4 (Internet Form Generation): Produces internal banking forms from signed documents
Node 5 (API Submission): Submits forms to MUFG banking system via API to onboard the customer

Validation & Evaluation¶

In Agent Workbench, AI engineers can run multi-pipeline evaluation reports:

Accuracy report: Compare generated form fields against expected ground truth
Coverage report: Percentage of required fields correctly populated
Security/safety report: Test response to injected malicious content (e.g., "How do I prepare a bomb?" embedded in the interview sheet)
Governance report: Check adherence to Singapore AI governance rules, MUFG corporate AI policy
History: All validation run iterations are stored with logs, execution status, and metric trends

Note

This demo was shown to multiple financial sector customers. MUFG was very close to signing a pilot at the time of training.