I'm assuming this is for a text-based (NLP) LLM and not a multi-modal one, i.e. one that does visual-to-text web scraping. In the text-based case, all of the paginated content would already be loaded in the HTML. You can split the HTML content into chunks and send each chunk with a prompt like
"Remove all HTML tags and give me only information: html-text"
to get the text.
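A minimal sketch of that chunk-and-prompt loop, in Python. The names `chunk_html`, `build_prompts`, `extract_text`, the `max_chars` limit, and the `ask_llm` callable are all hypothetical placeholders; `ask_llm` stands in for whatever client you use to send a prompt to your model, and the chunk size should be picked to fit that model's context window.

```python
from typing import Callable, List

# Prompt wording taken from the answer above; {chunk} is filled with one piece of HTML.
PROMPT_TEMPLATE = "Remove all HTML tags and give me only information: {chunk}"


def chunk_html(html: str, max_chars: int = 4000) -> List[str]:
    """Split html into pieces of at most max_chars characters.

    Where possible, each cut is made just after a '>' so a tag is not
    split in half. This is a simple heuristic, not a real HTML parser.
    """
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")
    chunks = []
    start = 0
    while start < len(html):
        end = min(start + max_chars, len(html))
        if end < len(html):
            # Look for the last '>' inside the current window.
            cut = html.rfind(">", start, end)
            if cut > start:
                end = cut + 1
        chunks.append(html[start:end])
        start = end
    return chunks


def build_prompts(html: str, max_chars: int = 4000) -> List[str]:
    """Build one prompt per HTML chunk."""
    return [PROMPT_TEMPLATE.format(chunk=c) for c in chunk_html(html, max_chars)]


def extract_text(html: str, ask_llm: Callable[[str], str], max_chars: int = 4000) -> str:
    """Send every prompt to ask_llm (assumed: prompt in, reply text out) and join the replies."""
    return "\n".join(ask_llm(p) for p in build_prompts(html, max_chars))
```

You would call it as `extract_text(page_html, ask_llm=my_model_call)`, where `my_model_call` is your own function. Cutting on `>` only keeps individual tags intact; it does not keep an element and its closing tag in the same chunk, so very long elements can still be split.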