79408395

Date: 2025-02-03 10:01:55
Score: 1
Natty:
Report link

It seems that the poor performance in real-world images is mainly due to overexposure on the keyboard and background interference (it looks like the training data has a very uniform background). In practical applications, you may first apply white balance to the image to address the overexposure issue. As for the background, you can initially use Grounding DINO + SAM2 to detect the keyboard area and then use your trained model for detection.

Regarding the model itself, the training phase seems to have performed quite well. Adding real-world data could enhance its robustness. Additionally, you might consider fine-tuning a pre-trained model like YOLO (https://docs.ultralytics.com/zh/tasks/segment/). Wishing you success in your development!

Reasons:
  • Long answer (-0.5):
  • No code block (0.5):
  • Low reputation (1):
Posted by: 方紹宇