79489380

Date: 2025-03-06 12:40:12
Score: 2.5
Natty:
Report link

I ended up building the missing functionality for arbitrary stateful functions in the PySpark API myself, at first using Delta tables as a means of keeping the state information. This worked fine, but in order to speed things up to sub-second processing for our cases, we ended up with a more production-ready version that used a Redis cache in the background.
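
To illustrate the general idea (this is not the actual implementation described above), here is a minimal sketch of keyed state kept in an external key-value store. Everything in it is an assumption made for illustration: the `InMemoryStateStore` class stands in for a Redis client by exposing only `get` and `set`, and `update_state` and `process_batch` are hypothetical names with a made-up state shape (a running count and total per key). In a real streaming job, something like `process_batch` could be called from `foreachBatch` with the events of each micro-batch.

```python
import json
from typing import Dict, Iterable, Optional, Tuple


class InMemoryStateStore:
    """Hypothetical stand-in for an external store such as Redis."""

    def __init__(self) -> None:
        self._data: Dict[str, str] = {}

    def get(self, key: str) -> Optional[str]:
        return self._data.get(key)

    def set(self, key: str, value: str) -> None:
        self._data[key] = value


def update_state(state: Optional[dict], value: float) -> dict:
    """Example state transition: running count and total per key."""
    if state is None:
        state = {"count": 0, "total": 0.0}
    return {"count": state["count"] + 1, "total": state["total"] + value}


def process_batch(events: Iterable[Tuple[str, float]], store) -> Dict[str, dict]:
    """Fold a batch of (key, value) events into the state held in `store`.

    Each key's state is read from the store at most once per batch and
    written back once after all events have been applied.
    """
    touched: Dict[str, dict] = {}
    for key, value in events:
        if key in touched:
            state = touched[key]
        else:
            raw = store.get(key)
            state = json.loads(raw) if raw is not None else None
        touched[key] = update_state(state, value)
    for key, state in touched.items():
        store.set(key, json.dumps(state))
    return touched


# Two consecutive batches: state for key "a" carries over between them.
store = InMemoryStateStore()
process_batch([("a", 1.0), ("b", 2.0), ("a", 3.0)], store)
result = process_batch([("a", 5.0)], store)
```

Because the store is only touched through `get` and `set`, the same function could in principle work against a Delta-backed or Redis-backed store; the sketch ignores concurrency, state expiry, and failure recovery, which a production version would need to handle.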

Reasons:
  • No code block (0.5):
  • Unregistered user (0.5):
  • Single line (0.5):
  • Low reputation (1):
Posted by: joris_van_agtmaal