Prerequisites for Agent Factory on Hybrid Manager v1.4.0 (LTS)

Enabling Agent Factory

Before you can use Agent Factory in Hybrid Manager, you need to enable it during the installation of HM.

See Creating other necessary secrets for the Agent Factory-specific enablement steps.

Infrastructure Requirements

Kubernetes Cluster Foundation

Agent Factory requires a properly configured Hybrid Manager Kubernetes cluster with sufficient resources for AI workloads. The cluster must support GPU scheduling and have appropriate node groups configured for different workload types.

Cluster Requirements

  • Kubernetes 1.27 or later with GPU device plugin support
  • NVIDIA GPU operator installed for GPU node management
  • Sufficient CPU and memory for orchestration components
  • Network policies supporting service mesh communication

GPU Node Configuration

GPU resources are essential for model serving operations. Node configuration must align with model requirements and expected workload characteristics.

GPU Node Requirements

  • NVIDIA GPUs with CUDA 12.1+ support
  • GPU nodes labeled with nvidia.com/gpu=true
  • GPU taint nvidia.com/gpu for dedicated scheduling
  • Sufficient GPU memory for target model sizes

For per-model GPU requirements and cloud node recommendations, see GPU recommendations.

Registry Configuration

Internet-connected deployments

For clusters with internet access, no additional image mirroring is required. Ensure your cluster has egress to nvcr.io so HM can pull NVIDIA NIM images during model deployment.

Air-gapped deployments

For environments without internet access, use the hub guides to prepare images and deployment assets in advance, and configure private registry access in Hybrid Manager.

Model image migration

Prepare and mirror required model images to your private registry following these hub references:

Model registry updates

Update default model references to point to your private registry. Work with your Model Library configuration and HM API/console paths.

Profile caching (NIM)

Some NVIDIA NIM models use runtime profiles that must be available locally for offline operation. Follow NVIDIA’s documentation for profile discovery and caching strategies.

Next Steps

With prerequisites satisfied, proceed to:

  1. Review GPU recommendations to plan your node groups.
  2. Deploy your first model: Deploy with HM UI.
  3. Build your first AI flow: Getting started.

For architecture context, see Agent Factory architecture.