Big scale is not an easy job. For a simple explain how production works, I wrote a blog post to explain a request lifecycle. I also work for SerpApi, and we scale our API to serve billion requests. Hope this help.