79555675

Date: 2025-04-04 15:10:43
Score: 1.5
Natty:
Report link

Consider using Tesseract OCR https://github.com/tesseract-ocr/tesseract and/ or PdfPig

You can install the packages using NuGet.

PdfPig can do text extraction and Tesseract can do both text and images if you modify the method to include images. Then you can post your extracted text and images in a view.

Our implementation isn't .Net based but has been pretty effective in processing our content, which is primarily text based, but does include some imagery.

Reasons:
  • No code block (0.5):
  • Low reputation (1):
Posted by: Ben