79260036

Date: 2024-12-07 05:54:30
Score: 3
Natty:
Report link

Seems like I missed to set the following environment variables,

PYSPARK_PYTHON=path\python
PYSPARK_DRIVER_PYTHON=path\python

After setting above variables in the env, everything works fine.

but I still wonder why only the DF created using spark.createDataFrame() failed, but the one created using spark.read() worked when the above env variables are missing. Please let me know.

Reasons:
  • RegEx Blacklisted phrase (2.5): Please let me know
  • Has code block (-0.5):
  • Self-answer (0.5):
  • Low reputation (0.5):
Posted by: Aravindan vaithialingam