79278566

Date: 2024-12-13 14:04:17
Score: 1.5

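This question was asked a long time ago, but for posterity, and in case you still want an answer: PaliGemma's segmentation outputs are special "soft" tokens. Each detected object is emitted as four location tokens for the bounding box followed by a run of segmentation tokens, and the segmentation tokens are codebook indices produced by a separate visual encoder described here.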

Google has an example here that shows how to parse them into meaningful coordinates.
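
For a rough idea of what that parsing involves, here is a minimal sketch in plain Python. It is not Google's code. It assumes the decoded text looks like four `<locNNNN>` tokens (y_min, x_min, y_max, x_max, each binned into 0..1023) followed by sixteen `<segNNN>` tokens and a label, with objects separated by `;`. The function name `parse_segmentation` and the divide-by-1024 normalization are my own assumptions, so check them against the official example.

```python
import re

# Assumed format per object:
#   <locYMIN><locXMIN><locYMAX><locXMAX> + 16 x <segNNN> + " label"
_OBJECT_RE = re.compile(
    r"((?:<loc\d{4}>){4})\s*((?:<seg\d{3}>){16})\s*([^;<]*)"
)


def parse_segmentation(text, width, height):
    """Return one dict per object with a pixel box, seg codes and label."""
    results = []
    for match in _OBJECT_RE.finditer(text):
        locs = [int(v) for v in re.findall(r"<loc(\d{4})>", match.group(1))]
        codes = [int(v) for v in re.findall(r"<seg(\d{3})>", match.group(2))]
        # Assumption: location bins are normalized by dividing by 1024.
        y_min, x_min, y_max, x_max = (v / 1024 for v in locs)
        results.append({
            # Box in pixels as (x_min, y_min, x_max, y_max).
            "box": (x_min * width, y_min * height,
                    x_max * width, y_max * height),
            # Codebook indices; turning these into a mask needs the
            # mask decoder weights, which this sketch does not include.
            "codes": codes,
            "label": match.group(3).strip(),
        })
    return results


if __name__ == "__main__":
    sample = ("<loc0256><loc0128><loc0768><loc0896>"
              + "".join(f"<seg{i:03d}>" for i in range(16))
              + " cat")
    print(parse_segmentation(sample, width=1024, height=512))
```

This only gets you the bounding box and the raw codebook indices. To get an actual pixel mask you still have to run the indices through the mask decoder from the linked example and paste the result into the box.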

Reasons:
  • Low length (0.5):
  • No code block (0.5):
  • Low reputation (0.5):
Posted by: Chris Collier