GPU Affine Transformation

This example mirrors the CPU affine example but keeps all streams in GPU shared memory.

Computation

output_vector = transform_matrix @ input_vector + offset_vector

Files

  • examples/gpu_affine_transformation/pipeline.yaml

  • examples/gpu_affine_transformation/run_example.py

Run it

python examples/gpu_affine_transformation/run_example.py

What it demonstrates

  • GPU-backed shared-memory streams

  • CUDA-backed tensor exchange between processes

  • parity between CPU and GPU kernel kinds for the same topology

  • numeric verification against a host-side reference

Requirements

  • installed gpu extra or equivalent PyTorch dependency

  • compatible CUDA runtime and device

When to use it

Use this example after the CPU affine example when you want to confirm that the GPU runtime path is working correctly on your machine.