Extended Data Table 2 Comparison between Mistral-v1-7b and Mistral-v2-7b accuracies

From: An evaluation framework for clinical use of large language models in patient interaction tasks

  1. Mean accuracy and adjusted P value for difference in mean accuracies for Mistral-v1-7b and Mistral-v2-7b.