GenAI Studio¶
GenAI Studio is the central hub for managing AI services and building pipelines. It reads each service's manifest to register API endpoints, allocate compute, and enforce visibility settings. It streamlines the entire deployment lifecycle from smoke test to production.
Pipeline Builder¶
The drag-and-drop canvas lets you chain multiple AI services into end-to-end workflows:
- Select AI services from the component library (Data Factory, Feature Extraction, Content Generation, etc.)
- Drop services onto the canvas and connect them with arrows
- Configure each node: input/output parameters, model to use, storage configuration
- Add conditional branching: define conditions on output KPIs — e.g., "only proceed to PII redaction if sensitive topics are detected"
- Connect a Human-in-the-Loop node to push results to Data Canvas for expert review
Pipeline Types¶
| Type | Purpose | Example |
|---|---|---|
| Pre-annotation | Automate label creation before human review | YOLO object detection → upper body detection → HIL task |
| Data processing | Transform, clean, and enrich data | Audio ingestion → noise reduction → speaker diarization → PII redaction |
| Post-annotation | Validate and consolidate annotated data | Quality check → Data Hub registration |
| RAG/Inference | Production AI inference pipelines | Document extraction → LLM extraction → structured output |
Real-World Example: Isaac Snapdragon Pre-Annotation Pipeline¶
Built for Qualcomm's Isaac camera project, this pipeline processes dark security footage:
- Pull 800+ video files from Azure Blob Store
- Brightness/contrast preprocessing (making dark footage legible)
- People detection service
- Upper body and head detection
- Plant and object detection
- Aggregate detection outputs
- PII detection and redaction (replace with beeps)
- Sensitive topic detection
- Push to HIL task for human verification
Result
Humans only verify/correct pre-existing detections rather than labeling from scratch on near-invisible footage.
Example: YOLO Video Auto-Labeling Pipeline¶
Start
→ YOLO Object Detection (Pre-processing)
Source: Azure Blob Container
Model: YOLOv8 / YOLOv9 / YOLOv10
Output Format: YOLO + COCO
Storage Type: Physical (pushes to Data Marketplace)
End
Example: VILA 15B Video Context Extraction¶
NVIDIA's VILA 15B model analyzes video at frame level, generating contextual insights based on a natural-language prompt. Detects objects, actions, hazards, and interprets scenes.
Start
→ Villa15-B-version2 (Pre-processing)
Source: Azure Blob Container
Prompt: "Identify all safety hazards and describe what is happening"
SAS Token: Required for Azure Blob access
Output: JSON context summary per video
End
Manager Controller & Bulk Load Pipelines¶
Two special system pipelines manage the Human-in-the-Loop annotation flow:
- Manager_Controller: Exports completed annotation task data from the annotation platform (using Task IDs) to configured storage and updates SSR status in AIDF.
- Bulk_Load: Imports exported annotated metadata into Data Hub, making it available for downstream consumption. Configurable by industry, usage, nature, storage type, and dataset category.