79262835

Date: 2024-12-08 16:18:09
Score: 2
Natty:
Report link

Issue was default timeout for real-time Inference endpoint being 60 seconds.
Seems like missing the timeout threshold caused the request to repeat for some reason (docs)

Switching to async inference endpoint solved it as request takes ~2m

Reasons:
  • Low length (0.5):
  • No code block (0.5):
  • Self-answer (0.5):
  • Low reputation (0.5):
Posted by: Eyal Solomon