I had a support session with AWS engineer, and he said that the quota is 100 requests per second, even though you have in service quota a quota 10k for invokeEndpoint (not asyncEndpoint).
This quota is not populated anywhere, not in service quota, nor public documentation.