Prompt-Averaged Model Performance by Scenario

Heatmap of Model Performance by Scenario

Table of Model Performance
