Reconstruction Notes¶
This page records a practical validation path for reproducing a working
vllm-omni-metax environment from scratch.
Goal¶
Validate that:
vllm-metaxprovides a healthy MetaX backendvllm-omnican discover an external platform plugin- the
0.20.0runtime patch layer is active when needed vllm-omni-metaxactivates and routes stage device visibility correctly
Recommended reconstruction flow¶
- Start from a clean Linux environment with MACA installed.
- Verify raw device visibility through the MetaX runtime.
- Install and validate
vllm-metax. - Install
vllm-omni. - Install
vllm-omni-metaxin editable mode. - Verify plugin activation with a one-line Python probe.
- Run a small
Qwen3-OmniorQwen-Image-Edit-2511workload. - Move to larger multimodal pipelines only after the model-specific path is stable.
Minimal checks¶
Check backend runtime¶
python -c "from vllm_metax.utils import import_pymxsml; m=import_pymxsml(); m.nvmlInit(); print(m.nvmlDeviceGetCount()); m.nvmlShutdown()"
Check plugin activation¶
python -c "from vllm_omni_metax.plugin import metax_omni_platform_plugin; print(metax_omni_platform_plugin())"
Check forced activation path¶
export VLLM_OMNI_METAX_FORCE=1
python -c "from vllm_omni_metax.plugin import metax_omni_platform_plugin; print(metax_omni_platform_plugin())"
Compare with patches disabled¶
export VLLM_OMNI_METAX_DISABLE_PATCHES=1
python -c "from vllm_omni_metax.plugin import metax_omni_platform_plugin; print(metax_omni_platform_plugin())"
Common failure points¶
vllm-metaxis installed but its runtime libraries are not visible inLD_LIBRARY_PATH.- The Python environment contains mismatched
vllm-omniandvllm-omni-metaxversions. Qwen3-OmniorQwen-Image-Edit-2511reaches a rotary code path that still assumes CUDA-only flash-attn imports.- Device visibility is filtered unexpectedly by container runtime settings.
- Debugging uses force-enable and then forgets to return to the default auto-detection path.
Expected steady state¶
In a stable environment:
- MetaX devices are visible through
pymxsml. - The plugin resolves to
MetaxOmniPlatformwithout force mode. - The runtime patch layer is available for the affected Omni rotary path.
- Omni workers use the standard GPU worker path.
- Stage device control remains aligned with
MACA_VISIBLE_DEVICES.