test(e2e): run gpu workloads from manifest#1709
Conversation
5cc2d92 to
efe4d25
Compare
5a84bca to
1c8f7b7
Compare
|
Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually. Contributors can view more details about this message here. |
de40d64 to
8426fac
Compare
1c8f7b7 to
c5182b1
Compare
|
🌿 Preview your docs: https://nvidia-preview-pr-1709.docs.buildwithfern.com/openshell |
032f133 to
55ed9ce
Compare
|
Label |
|
Label |
Signed-off-by: Evan Lezar <elezar@nvidia.com>
Signed-off-by: Evan Lezar <elezar@nvidia.com>
Signed-off-by: Evan Lezar <elezar@nvidia.com>
Signed-off-by: Evan Lezar <elezar@nvidia.com>
Signed-off-by: Evan Lezar <elezar@nvidia.com>
Signed-off-by: Evan Lezar <elezar@nvidia.com>
2f36b22 to
386d638
Compare
Signed-off-by: Evan Lezar <elezar@nvidia.com>
Summary
This PR adds manifest-driven GPU workload execution tests on top of the workload image artifacts from #1484. It keeps the existing GPU device-selection coverage, adds workload execution coverage under the umbrella
gputarget, and documents how to build workload images locally before running the GPU e2e suite.This branch is now rebased on the local e2e stabilization fixes from #1935, so the Docker GPU test path also includes the supervisor-image and host SSH linker-environment fixes needed for local Nix/devenv runs.
Related Issue
Closes #1472
Changes
openshell sandbox create --gpu --from <image> -- <command>and enforce declaredpassorfailexpectations.e2e/gpu/images/.build/workloads.yamlby default, withOPENSHELL_E2E_WORKLOAD_MANIFESTavailable for external manifests.serde_yamlto the e2e crate for manifest parsing.Testing
mise run pre-commitpassesValidation status:
mise run e2e:docker:gpumise run pre-commitwas run after rebasing ontomain; Rust format/check/clippy, markdown lint, Python format, license checks, and docs checks completed successfully.mise run pre-commitcurrently fails inhelm:lintbecause the local chart dependency directory is missing thepostgresqldependency. This is unrelated to the GPU workload changes.GPU validation commands for future runs:
mise run e2e:workloads:buildmise run e2e:docker:gpuNotes:
mise run e2e:workloads:buildbefore runningmise run e2e:docker:gpulocally.OPENSHELL_E2E_WORKLOAD_MANIFEST=/abs/path/to/workloads.yaml.Checklist