Actually i was getting the same error while using the qwen2.5 model for inference but there was one thing i overlooked by mistake and it was very silly i forgot to edit the pod container size large enough to fit the model weights, after i corrected that it RAN without any error.
I hope this helps .