Your issue sounds like systematic accuracy degradation over the course of a bulk-processing run, and switching models was a good first step to rule out model-specific behavior. Here are a few potential causes and mitigation strategies:
Hidden Throttling or Rate Limiting
If the provider starts queuing or throttling your requests partway through a long run, responses can come back delayed, truncated, or incomplete, which shows up as a gradual accuracy drop rather than hard errors.

Token Usage and Context Carryover
If each request appends to a shared conversation or running context instead of starting fresh, later items are processed against a longer, noisier prompt than earlier ones (a minimal stateless-request sketch follows these causes).

Prompt Compression Due to Model Memory Constraints
Once accumulated context approaches the model's context window, earlier instructions can be truncated or compressed away, so the model effectively stops following your original prompt late in the run.

Concept Drift or Model Adaptation Over Time
A stateless API model does not learn from your requests within a run, but a provider silently updating or swapping the underlying model during a long job can shift behavior mid-batch.

Server-Side Caching Issues
Cached responses for similar or repeated inputs can be stale or mismatched, which looks like degradation when many near-duplicate products are processed in sequence.
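
One concrete way to rule out context carryover is to make every request fully stateless, i.e. build the prompt from scratch for each product instead of appending to a running conversation. A minimal sketch, assuming the OpenAI Python SDK (v1+); the model name, system prompt, and classify() wrapper are illustrative placeholders, not your tool's actual code:

```python
# Minimal sketch: send each product as an independent, stateless request so no
# context accumulates between items. Assumes the OpenAI Python SDK (v1+);
# the model name and system prompt are placeholders.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

SYSTEM_PROMPT = "Classify the product into one of the allowed categories."

def classify(product_text: str) -> str:
    # Fresh messages list on every call -- nothing from previous items is
    # carried over, so request #5000 sees the same context as request #1.
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model name
        messages=[
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": product_text},
        ],
        temperature=0,
    )
    return response.choices[0].message.content.strip()
```

If your current pipeline keeps one long conversation open instead, the growing context alone can explain why accuracy looks fine early in the run and drops later.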
Next Steps
Run a small batch of 100-500 requests with different inter-request (throttling) delays to see whether accuracy stays stable (see the diagnostic sketch after this list).
Test with different API keys or different inference servers.
Implement session resets if applicable.
Shuffle product inputs randomly to check for caching effects.
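
To make those checks repeatable, a small diagnostic harness that shuffles the inputs, applies a configurable inter-request delay, and reports accuracy per batch is usually enough. A rough sketch, assuming you have a classify() function (such as the stateless one above) and a list of (product_text, expected_label) pairs to score against:

```python
# Illustrative diagnostic harness: shuffle the inputs, add a configurable
# inter-request delay, and log accuracy per batch so you can see whether
# quality drops by position in the run. `classify` and `labeled_items` are
# assumed to exist already; nothing here is tied to a specific provider.
import random
import time

def run_diagnostic(labeled_items, classify, batch_size=100, delay_s=0.5, seed=42):
    items = list(labeled_items)
    random.Random(seed).shuffle(items)  # break any ordering/caching correlation

    for start in range(0, len(items), batch_size):
        batch = items[start:start + batch_size]
        correct = 0
        for product_text, expected in batch:
            predicted = classify(product_text)
            correct += (predicted == expected)
            time.sleep(delay_s)  # vary this (e.g. 0, 0.5, 2.0) across runs
        print(f"batch {start // batch_size + 1}: "
              f"accuracy {correct / len(batch):.2%} at delay {delay_s}s")
```

Run it a few times with different delay values and with shuffling turned on and off: accuracy that only degrades at zero delay points toward throttling, while accuracy that recovers when inputs are shuffled points toward caching or input-ordering effects.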