79655515

Date: 2025-06-06 07:13:07
Score: 2.5
Natty:
Report link

Although Sentence-BERT evaluates semantic similarity better than BLEU does, it is not sensitive enough to surface-level errors such as spelling mistakes and word-order problems. Based on the paper "Evaluation of Metrics Performance", I think the best evaluation combines BLEU with Sentence-BERT.
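
One possible way to combine the two signals is a weighted average. The sketch below is only an illustration under my own assumptions: `simple_bleu` is a simplified single-reference BLEU with add-one smoothing (not the exact formula from any library), `combined_score` and its default `weight=0.5` are hypothetical choices, and the semantic score is expected to be computed separately (for example, as the cosine similarity of Sentence-BERT embeddings) and passed in.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams of a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def simple_bleu(candidate, reference, max_n=4):
    """Simplified sentence-level BLEU (one reference, add-one smoothing)."""
    cand, ref = candidate.split(), reference.split()
    if not cand:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts, ref_counts = ngrams(cand, n), ngrams(ref, n)
        # Clipped matches: each n-gram counts at most as often as in the reference.
        overlap = sum((cand_counts & ref_counts).values())
        total = sum(cand_counts.values())
        log_precisions.append(math.log((overlap + 1) / (total + 1)))
    # Brevity penalty for candidates shorter than the reference.
    bp = 1.0 if len(cand) >= len(ref) else math.exp(1 - len(ref) / len(cand))
    return bp * math.exp(sum(log_precisions) / max_n)


def combined_score(bleu, semantic_sim, weight=0.5):
    """Weighted average of a BLEU score and a semantic similarity score.

    `weight` is an assumed tuning knob, not a value taken from the paper.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must be between 0 and 1")
    return weight * bleu + (1 - weight) * semantic_sim


reference = "the cat sat on the mat"
swapped = "the mat sat on the cat"  # same words, different order

bleu_swapped = simple_bleu(swapped, reference)
# 0.9 stands in for a Sentence-BERT cosine similarity computed elsewhere.
print(combined_score(bleu_swapped, 0.9))
```

Here the BLEU term drops for the reordered sentence even if the embedding similarity stays high, which is the surface-level sensitivity the combination is meant to add.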

Reasons:
  • Low length (0.5):
  • No code block (0.5):
  • Single line (0.5):
  • Low reputation (1):
Posted by: Jung-Yi Tsai