79817744

Date: 2025-11-12 12:45:20
Score: 0.5
Natty:
Report link

I faced the same issue, and setting mapreduce.input.fileinputformat.list-status.num-threads helped.
For 50,000 XML files it was taking 13 minutes, but with this property set to 50 it took 15 seconds. All of this happens in the driver.
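
For anyone unsure how to pass the property, here is a minimal sketch that only builds the command-line arguments. The helper names are hypothetical. The `-D` form assumes the job parses Hadoop generic options, and the `spark.hadoop.` prefix form assumes the job runs on Spark (the answer only says "driver", so that part is a guess).

```python
from typing import List

# Property named in the answer above.
LIST_STATUS_THREADS = "mapreduce.input.fileinputformat.list-status.num-threads"


def _check(num_threads: int) -> int:
    # A thread count below 1 makes no sense for a listing pool.
    if num_threads < 1:
        raise ValueError("num_threads must be at least 1")
    return num_threads


def hadoop_generic_option(num_threads: int) -> List[str]:
    """Hypothetical helper: '-D key=value' for jobs that parse generic options."""
    return ["-D", f"{LIST_STATUS_THREADS}={_check(num_threads)}"]


def spark_submit_option(num_threads: int) -> List[str]:
    """Hypothetical helper: '--conf spark.hadoop.<key>=<value>', assuming Spark."""
    return ["--conf", f"spark.hadoop.{LIST_STATUS_THREADS}={_check(num_threads)}"]


if __name__ == "__main__":
    # 50 is the value reported in the answer, not a general recommendation.
    print(" ".join(hadoop_generic_option(50)))
    print(" ".join(spark_submit_option(50)))
```

The value that works best probably depends on the file system and the number of input paths, so treat 50 as a starting point to measure against rather than a fixed setting.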

Reasons:
  • Low length (0.5):
  • Has code block (-0.5):
  • Low reputation (0.5):
Posted by: Roobal Jindal