[Klaud Cold] Update gptoss-fp4-h100-vllm vLLM image to v0.22.0#1605
[Klaud Cold] Update gptoss-fp4-h100-vllm vLLM image to v0.22.0#1605functionstackx wants to merge 1 commit into
Conversation
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
0d1d98c to
360ab8b
Compare
|
Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers. If additional help is needed, PR authors can reach out to core maintainers over Slack. |
3 similar comments
|
Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers. If additional help is needed, PR authors can reach out to core maintainers over Slack. |
|
Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers. If additional help is needed, PR authors can reach out to core maintainers over Slack. |
|
Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers. If additional help is needed, PR authors can reach out to core maintainers over Slack. |
|
see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26696227016 |
There was a problem hiding this comment.
LGTM — straightforward vLLM image version bump with matching changelog entry.
Extended reasoning...
Overview
This PR bumps the container image pin for the gptoss-fp4-h100-vllm benchmark recipe from vllm/vllm-openai:v0.21.0 to v0.22.0 in .github/configs/nvidia-master.yaml, and adds a corresponding entry in perf-changelog.yaml linking back to this PR.
Security risks
None. This is a benchmark config change pinning a different upstream vLLM image tag for an H100 FP4 recipe. No auth, crypto, permissions, or application logic is touched.
Level of scrutiny
Low. Mechanical version bump touching a single YAML key plus an append-only changelog entry. The full-sweep-enabled label indicates the PR will be exercised by the automated sweep, which is the appropriate validation for a recipe image bump.
Other factors
The bug hunting system found no issues. The changelog entry follows the established format used by adjacent entries in the file, and the PR scope matches the description exactly.
|
see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26696228511 |
Summary
Update vLLM image from v0.21.0 to v0.22.0
Recipes touched:
gptoss-fp4-h100-vllmTest plan
🤖 Generated with Claude Code
Note
Low Risk
Only updates a container image pin and changelog for an H100 FP4 vLLM benchmark recipe; no application or infra logic changes.
Overview
Bumps the
gptoss-fp4-h100-vllmbenchmark recipe innvidia-master.yamlfromvllm/vllm-openai:v0.21.0tov0.22.0, and normalizes theimagefield indentation under that config key.Documents the change in
perf-changelog.yamlwith a short entry tied to PR #1605.Reviewed by Cursor Bugbot for commit 360ab8b. Bugbot is set up for automated code reviews on this repo. Configure here.