79220021

Date: 2024-11-24 12:14:06
Score: 0.5
Natty:
Report link

There is an issue when working on this particular sample pdf but when I used a similar pdf report I was able to crop it based on boundaries.

This is what I used:

pages[0].find_tables()[0].bbox

output:

(25.19059366666667, 125.0, 569.773065, 269.64727650000003)

# this shows the part that I want to get rid off
p0.crop((25.19059366666667, 125.0, 569.773065, 269.64727650000003)).to_image().debug_tablefinder()

# below taking y0 value from where top table ends (269.64) to almost bottom of page 840 
# x0 from leftmost part (0) of page and x1 as (590) to almost right end of page

p0.crop((0, 269.0, 590, 840)).to_image()
Reasons:
  • RegEx Blacklisted phrase (1): I want
  • Long answer (-0.5):
  • Has code block (-0.5):
  • Self-answer (0.5):
Posted by: ViSa