i made a guide may be it could help https://github.com/Granddyser/windows-llama-cpp-python-cuda-guide