79340369

Date: 2025-01-08 18:31:57
Score: 0.5
Natty:
Report link

It depends on the nature of the images in your pdf, if they mainly contain text in a structured form, you can use pytesseract, in case where the image is not structured eg. Charts, Digrams, Comparison Tables, you might need a complete new approach to process these, a common approach is to train or use a pre-trained model to extract the useful textual features from the image.

Reasons:
  • Whitelisted phrase (-1.5): you can use
  • No code block (0.5):
  • Single line (0.5):
  • Low reputation (1):
Posted by: Mayyar