to load images into a DataFrame without issue us this please from pyspark.ml.image import ImageSchema imagesDF = ImageSchema.readImages("/path/to/imageFolder") labledImageDF = imagesDF.withColumn("label", lit(0)) from pyspark.sql.functions import *