To understand this, is it right to assume that at partition level partition key concept is used to separate messages for each client?
In that case using quotas makes sense to use at the producer level. That is what being advised as mentioned in above post by the term rate of messages produced(throttle). I wondering to handle this scenario, there must be extra coding required to delay in producing the messages? Is that what being suggested?