Add comprehensive tests for BLEU and lexical metrics including edge cases, batch scoring, and aggregate statistics.