GPU Affine Transformation

This example mirrors the CPU affine example but keeps all streams in GPU shared memory.

Computation

output_vector = transform_matrix @ input_vector + offset_vector

Files

examples/gpu_affine_transformation/pipeline.yaml
examples/gpu_affine_transformation/run_example.py

Run it

python examples/gpu_affine_transformation/run_example.py

What it demonstrates

GPU-backed shared-memory streams
CUDA-backed tensor exchange between processes
parity between CPU and GPU kernel kinds for the same topology
numeric verification against a host-side reference

Requirements

installed gpu extra or equivalent PyTorch dependency
compatible CUDA runtime and device

When to use it

Use this example after the CPU affine example when you want to confirm that the GPU runtime path is working correctly on your machine.