The vllm/modelexecutor/weightutils.py implements hfmodelweightsiterator to load the model checkpoint, which is downloaded from huggingface. It use torch.load function and weightsonly parameter is default value False. There is a security warning on https://pytorch.org/docs/stable/generated/torch.load.html, when torch.load load a malicious pickle data it will execute arbitrary code during unpickling.
This vulnerability can be exploited to execute arbitrary codes and OS commands in the victim machine who fetch the pretrained repo remotely.
Note that most models now use the safetensors format, which is not vulnerable to this issue.
{
"severity": "HIGH",
"github_reviewed": true,
"cwe_ids": [
"CWE-502"
],
"nvd_published_at": "2025-01-27T18:15:41Z",
"github_reviewed_at": "2025-01-27T20:50:30Z"
}