I just found a python library that do exactly that, the name is athena_bridge.
I have tried it and it works quite well, allow you to run Pyspark code using Python + Athena.
existing data in S3.
https://github.com/AlvaroMF83/athena_bridge
https://pypi.org/project/athena-bridge/
Best regards.