79250166

Date: 2024-12-04 07:15:15
Score: 0.5
Natty:
Report link

1st approach is not a good choice because leveraging the [CLS] token embedding directly might not be the best approach, in case if the BERT was fine tuned for a task other than similarity matching.

Consider taking average or pooling (passing through another dense layer) will work.

Reasons:
  • Long answer (-0.5):
  • No code block (0.5):
  • Low reputation (0.5):
Posted by: XGB