It depends on the nature of the images in your PDF. If they mainly contain text in a structured form, you can use pytesseract (a Python wrapper around the Tesseract OCR engine). If the images are not plain structured text, e.g. charts, diagrams, or comparison tables, you will likely need a completely different approach to process them. A common one is to train a model, or use a pre-trained one, to extract the useful textual features from the image.
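
For the text-heavy case, a minimal sketch with pytesseract might look like the following. It assumes the image has already been extracted from the PDF and saved to a file, and that the Tesseract binary, `pytesseract`, and Pillow are installed. The function names `ocr_image` and `clean_ocr_text` are made up for illustration, and the grayscale conversion is just one common preprocessing choice, not a requirement.

```python
import re


def clean_ocr_text(raw: str) -> str:
    """Collapse runs of spaces/tabs and drop blank lines from OCR output."""
    lines = (re.sub(r"[ \t]+", " ", line).strip() for line in raw.splitlines())
    return "\n".join(line for line in lines if line)


def ocr_image(path: str, lang: str = "eng") -> str:
    """Run Tesseract on one image file and return the cleaned text.

    Hypothetical helper: `path` is an image already extracted from the PDF.
    """
    # Imported lazily so clean_ocr_text works without the OCR dependencies.
    from PIL import Image
    import pytesseract

    with Image.open(path) as img:
        # Grayscale is an optional preprocessing step (an assumption here).
        raw = pytesseract.image_to_string(img.convert("L"), lang=lang)
    return clean_ocr_text(raw)


if __name__ == "__main__":
    import sys

    # Usage: python ocr_sketch.py page_image.png
    if len(sys.argv) > 1:
        print(ocr_image(sys.argv[1]))
```

This only covers images that are essentially text. For charts or diagrams the OCR output will usually be fragmentary, which is where the model-based approach comes in.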