davidpomerenke commited on
Commit
3a1f6aa
·
verified ·
1 Parent(s): f81f302

Upload from GitHub Actions: added disclaimer

Browse files
Files changed (1) hide show
  1. frontend/src/App.js +7 -3
frontend/src/App.js CHANGED
@@ -478,6 +478,10 @@ function App () {
478
  </a>
479
  , plus community-submitted models.
480
  </p>
 
 
 
 
481
  <h4>✍️ Authors</h4>
482
  <p>
483
  languagebench is a collaboration between
@@ -489,15 +493,15 @@ function App () {
489
  >
490
  Data Lab
491
  </a>
492
- , the BMZ-Initiative{' '}
493
  <a
494
  href='https://www.bmz-digital.global/en/overview-of-initiatives/fair-forward/'
495
  target='_blank'
496
  rel='noopener noreferrer'
497
  >
498
- Fair Forward
499
  </a>{' '}
500
- (implemented by GIZ), and the{' '}
501
  <a
502
  href='https://www.dfki.de/en/web/research/research-departments/multilinguality-and-language-technology/ee-team'
503
  target='_blank'
 
478
  </a>
479
  , plus community-submitted models.
480
  </p>
481
+ <h4>⚠️ Note on interpretation</h4>
482
+ <p>
483
+ Results are currently based on a sample of 10 sentences per language and task to keep computation affordable. For this reason, we report confidence intervals and recommend treating small differences between models with caution. In future iterations, we plan to add more benchmark datasets and richer visualisations, with large-scale evaluations across many more prompts and tasks as a longer-term goal.
484
+ </p>
485
  <h4>✍️ Authors</h4>
486
  <p>
487
  languagebench is a collaboration between
 
493
  >
494
  Data Lab
495
  </a>
496
+ (<a href='mailto:davidpomerenke@mailbox.org'>David Pomerenke</a>), the BMZ-Initiative{' '}
497
  <a
498
  href='https://www.bmz-digital.global/en/overview-of-initiatives/fair-forward/'
499
  target='_blank'
500
  rel='noopener noreferrer'
501
  >
502
+ GIZ Fair Forward
503
  </a>{' '}
504
+ (<a href='mailto:jonas.nothnagel@gmail.com'>Jonas Nothnagel</a>), and the{' '}
505
  <a
506
  href='https://www.dfki.de/en/web/research/research-departments/multilinguality-and-language-technology/ee-team'
507
  target='_blank'