I also ran into same issue and below are my observations.
There are different versions which is very confusing -
Client API Version which is tied to Azure AI api client/library service. Also note there are 3 versions control plane, data plane for authentication and finally inference
Model version which is linked to llm model. This changes as per selected llm model
You have to use latest inference client api version in the code but not llm model version !!