Version: Latest

Ascend vNPU User Guide

Introduction

Volcano supports two vNPU modes for sharing Ascend devices:

1. MindCluster mode

Description:

The initial version of MindCluster—the official Ascend cluster scheduling add-on—required custom modifications and recompilation of Volcano. Furthermore, it was limited to Volcano release1.7 and release1.9, which complicated its use and restricted access to newer Volcano features.

To address this, we have integrated its core scheduling logic for Ascend vNPU into Volcano's native device-share plugin, which is designed specifically for scheduling and sharing heterogeneous resources like GPUs and NPUs. This integration provides seamless access to vNPU capabilities through the procedure below, while maintaining full compatibility with the latest Volcano features.

Use case:

vNPU cluster for Ascend 310 series
with support for more chip types to come

2. HAMi mode

Description:

This mode is developed by a third-party community 'HAMi', which is the developer of volcano-vgpu feature. It supports vNPU feature for both Ascend 310 and Ascend 910. It also supports managing heterogeneous Ascend cluster(Cluster with multiple Ascend types, i.e. 910A,910B2,910B3,310P)

Use case:

NPU and vNPU cluster for Ascend 910 series
NPU and vNPU cluster for Ascend 310 series
Heterogeneous Ascend cluster

Installation

To enable vNPU scheduling, the following components must be set up based on the selected mode:

Prerequisites:

Kubernetes >= 1.16
Volcano >= 1.14
ascend-docker-runtime (for HAMi Mode)

Install Volcano:

Follow instructions in Volcano Installer Guide

Follow instructions in Volcano Installer Guide

Install ascend-device-plugin and third-party components

In this step, you need to select different ascend-device-plugin based on the vNPU mode you selected. MindCluster mode requires additional components from Ascend to be installed.

MindCluster Mode

Install Third-Party Components

Follow the official Ascend documentation to install the following components:

NodeD
Ascend Device Plugin
Ascend Docker Runtime
ClusterD
Ascend Operator

Note: Skip the installation of ascend-volcano mentioned in the document above, as we have already installed the native Volcano from the Volcano community in the Prerequisites part.

Configuration Adjustment for Ascend Device Plugin:

When installing ascend-device-plugin, you must set the presetVirtualDevice parameter to "false" in the device-plugin-310P-volcano-v{version}.yaml file to enable dynamic virtualization of 310P:

...
args:
  [
    "device-plugin",
    "-useAscendDocker=true",
    "-volcanoType=true",
    "-presetVirtualDevice=false",
    "-logFile=/var/log/mindx-dl/devicePlugin/devicePlugin.log",
    "-logLevel=0",
  ]
...

For detailed information, please consult the official Ascend MindCluster documentation.

Scheduler Config Update

kind: ConfigMap
apiVersion: v1
metadata:
  name: volcano-scheduler-configmap
  namespace: volcano-system
data:
  volcano-scheduler.conf: |
    actions: "enqueue, allocate, backfill"
    tiers:
    - plugins:
      - name: predicates
      - name: deviceshare
        arguments:
          deviceshare.AscendMindClusterVNPUEnable: true   # enable ascend vnpu
    configurations:
    ...
    - name: init-params
      arguments: {"grace-over-time":"900","presetVirtualDevice":"false"}  # to enable dynamic virtualization, presetVirtualDevice need to be set false

HAMi mode

Label the Node with `ascend=on`

kubectl label node {ascend-node} ascend=on

Deploy `hami-scheduler-device` ConfigMap

kubectl apply -f https://raw.githubusercontent.com/Project-HAMi/ascend-device-plugin/refs/heads/main/ascend-device-configmap.yaml

Deploy ascend-device-plugin

kubectl apply -f https://raw.githubusercontent.com/Project-HAMi/ascend-device-plugin/refs/heads/main/ascend-device-plugin.yaml

For more information, refer to the ascend-device-plugin documentation.

Scheduler Config Update

kind: ConfigMap
apiVersion: v1
metadata:
  name: volcano-scheduler-configmap
  namespace: volcano-system
data:
  volcano-scheduler.conf: |
    actions: "enqueue, allocate, backfill"
    tiers:
    - plugins:
      - name: predicates
      - name: deviceshare
        arguments:
          deviceshare.AscendHAMiVNPUEnable: true   # enable ascend vnpu
          deviceshare.SchedulePolicy: binpack  # scheduling policy. binpack / spread
          deviceshare.KnownGeometriesCMNamespace: kube-system
          deviceshare.KnownGeometriesCMName: hami-scheduler-device

Note: You may notice that, 'volcano-vgpu' has its own GeometriesCMName and GeometriesCMNamespace, which means if you want to use both vNPU and vGPU in a same volcano cluster, you need to merge the configMap from both sides and set it here.

Usage

Usage is different depending on the mode you selected

MindCluster mode

apiVersion: batch.volcano.sh/v1alpha1
kind: Job
metadata:
  name: mindx-dls
  namespace: vnpu
  labels:
    ring-controller.atlas: ascend-310P
spec:
  minAvailable: 1
  schedulerName: volcano
  policies:
    - event: PodEvicted
      action: RestartJob
  plugins:
    ssh: []
    env: []
    svc: []
  maxRetry: 3
  queue: default
  tasks:
    - name: "default-test"
      replicas: 1
      template:
        metadata:
          labels:
            app: infers
            ring-controller.atlas: ascend-310P
            vnpu-dvpp: "null"
            vnpu-level: low
        spec:
          schedulerName: volcano
          containers:
            - name: resnet50infer
              image: swr.cn-south-1.myhuaweicloud.com/ascendhub/mindie:2.1.RC1-300I-Duo-py311-openeuler24.03-lts
              imagePullPolicy: IfNotPresent
              securityContext:
                privileged: false
              command: ["/bin/bash", "-c", "tail -f /dev/null"]
              resources:
                requests:
                  huawei.com/npu-core: 8
                limits:
                  huawei.com/npu-core: 8
          nodeSelector:
            host-arch: huawei-arm

The supported Ascend chips and their ResourceNames are shown in the following table:

ChipName	JobLabel and TaskLabel	ResourceName
310P3	ring-controller.atlas: ascend-310P	huawei.com/npu-core

Description of Labels in the Virtualization Task YAML

Key	Value	Description
vnpu-level	low	Low configuration (default). Selects the lowest-configuration "virtualized instance template."
	high	Performance-first. When cluster resources are sufficient, the scheduler will choose the highest-configured virtualized instance template possible. When most physical NPUs in the cluster are already in use and only a few AI Cores remain on each device, the scheduler will allocate templates that match the remaining AI Core count rather than forcing high-profile templates. For details, refer to the table below.
vnpu-dvpp	yes	The Pod uses DVPP.
	no	The Pod does not use DVPP.
	null	Default value. DVPP usage is not considered.
ring-controller.atlas	ascend-310P	Indicates that the task uses products from the Atlas inference series.

Effect of DVPP and Level Configurations

Product Model	Requested AI Core Count	vnpu-dvpp	vnpu-level	Downgrade	Selected Template
Atlas Inference Series (8 AI Cores)	1	`null`	Any value	–	`vir01`
	2	`null`	`low` / other values	–	`vir02_1c`
	2	`null`	`high`	No	`vir02`
	2	`null`	`high`	Yes	`vir02_1c`
	4	`yes`	`low` / other values	–	`vir04_4c_dvpp`
	4	`no`	`low` / other values	–	`vir04_3c_ndvpp`
	4	`null`	`low` / other values	–	`vir04_3c`
	4	`yes`	`high`	–	`vir04_4c_dvpp`
	4	`no`	`high`	–	`vir04_3c_ndvpp`
	4	`null`	`high`	No	`vir04`
	4	`null`	`high`	Yes	`vir04_3c`
	8 or multiples of 8	Any value	Any value	–	–

Notice

For chip virtualization (non-full card usage), the value of vnpu-dvpp must strictly match the corresponding value listed in the above table. Any other values will cause the task to fail to be dispatched.

For detailed information, please consult the official Ascend MindCluster documentation.

HAMi mode

apiVersion: v1
kind: Pod
metadata:
  name: ascend-pod
spec:
  schedulerName: volcano
  containers:
    - name: ubuntu-container
      image: swr.cn-south-1.myhuaweicloud.com/ascendhub/ascend-pytorch:24.0.RC1-A2-1.11.0-ubuntu20.04
      command: ["sleep"]
      args: ["100000"]
      resources:
        limits:
          huawei.com/Ascend310P: "1"
          huawei.com/Ascend310P-memory: "4096"

The supported Ascend chips and their ResourceNames are shown in the following table:

ChipName	ResourceName	ResourceMemoryName
910A	huawei.com/Ascend910A	huawei.com/Ascend910A-memory
910B2	huawei.com/Ascend910B2	huawei.com/Ascend910B2-memory
910B3	huawei.com/Ascend910B3	huawei.com/Ascend910B3-memory
910B4	huawei.com/Ascend910B4	huawei.com/Ascend910B4-memory
910B4-1	huawei.com/Ascend910B4-1	huawei.com/Ascend910B4-1-memory
310P3	huawei.com/Ascend310P	huawei.com/Ascend310P-memory

Hami vNPU scene memory allocation restrictions

When a Pod requests a single vNPU device, the memory can be configured to any value, and the memory request of the job will automatically align with the closest sharding strategy
- Example: Single card memory is 65536, virtualization templates are 1/4 (16384), 1/2 (32768)
  - Pod requests 1 vNPU device and requests 1024 memory, so the actual allocated memory is 16384
  - Pod requests 1 vNPU device with a requested memory of 20480, resulting in an actual allocated memory of 32768
When a Pod requests multiple vNPU devices, the memory resource request can be left unspecified or filled with the maximum value, The memory allocated to Pod is the actual memory of the entire card

Introduction​

1. MindCluster mode​

2. HAMi mode​

Installation​

Install Volcano:​

Install ascend-device-plugin and third-party components​

MindCluster Mode​

Install Third-Party Components​

Scheduler Config Update​

HAMi mode​

Label the Node with ascend=on​

Deploy hami-scheduler-device ConfigMap​

Deploy ascend-device-plugin​

Scheduler Config Update​

Usage​

MindCluster mode​

HAMi mode​

Hami vNPU scene memory allocation restrictions​

Introduction

1. MindCluster mode

2. HAMi mode

Installation

Install Volcano:

Install ascend-device-plugin and third-party components

MindCluster Mode

Install Third-Party Components

Scheduler Config Update

HAMi mode

Label the Node with `ascend=on`

Deploy `hami-scheduler-device` ConfigMap

Deploy ascend-device-plugin

Scheduler Config Update

Usage

MindCluster mode

HAMi mode

Hami vNPU scene memory allocation restrictions