Currently, we are only considering "retry-after-ms" when throttling requests.
But as with 429 errors, we can also receive "retry-after" header (which would provide the time value in seconds) ,should we not consider that header too in our logic?
Azure documentation ref. :

https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/provisioned-throughput