Table 2 Comparison of similarity search error rates on all 200 FLORES languages and limited to the intersection of 98 languages on which each model has been trained

From: Joint speech and text machine translation for up to 100 languages

Model

Overall

Intersection

xsim

xsim++

xsim

xsim++

(n = 200)

(n = 200)

(n = 98)

(n = 98)

SONAR

1.4

15.2

0.1

9.3

LASER3

5.1

36.4

1.1

27.5

LaBSE

10.7

36.1

1.5

15.4

  1. The best results are in bold.