MSEB Leaderboard

This is a leaderboard for MSEB: Massive Speech Embedding Benchmark.

For more information, see the MSEB GitHub repository.

Tasks:882
Task Types:12
Languages:37

Leaderboard Results

Rank Encoder Name classification (mean) clustering (mean) reasoning (mean) reranking (mean) retrieval (mean) segmentation (mean) transcription (mean)
1 gemini 0.8339 N/A 0.0399 0.9963 0.7258 N/A 0.3594
2 perch 0.5756 0.3917 N/A N/A N/A N/A N/A
3 gecko 0.4437 N/A 0.0762 0.9697 0.5070 N/A N/A
4 clap 0.4322 0.4119 N/A N/A N/A N/A N/A
5 elevenlabs N/A N/A N/A N/A N/A N/A 0.2883
6 gemma N/A N/A 0.5341 N/A N/A N/A N/A
7 gpt N/A N/A N/A 0.2350 0.6551 N/A 0.3240
8 hubert N/A 0.6014 N/A N/A N/A N/A N/A
9 raw N/A 0.5983 N/A N/A N/A N/A N/A
10 wav2vec2 N/A 0.5990 N/A N/A N/A N/A N/A
11 whisper N/A N/A N/A N/A N/A 0.4018 0.3300

classification

Metric clap gecko gemini perch
BirdsetHSNClassification/ebird_classification (mAP)
(Classification, und) [?]
N/A N/A N/A 0.5271
BirdsetNBPClassification/ebird_classification (mAP)
(Classification, und) [?]
N/A N/A N/A 0.6583
BirdsetPOWClassification/ebird_classification (mAP)
(Classification, und) [?]
N/A N/A N/A 0.5413
FSD50KTestClassification/classification (mAP)
(Classification, und) [?]
0.4322 N/A N/A N/A
SpeechMassiveArSaIntentClassification/intent_classification (Accuracy)
(IntentClassification, ar-SA) [?]
N/A 0.3937 0.7666 N/A
SpeechMassiveDeDeIntentClassification/intent_classification (Accuracy)
(IntentClassification, de-DE) [?]
N/A 0.4590 0.8440 N/A
SpeechMassiveEsEsIntentClassification/intent_classification (Accuracy)
(IntentClassification, es-ES) [?]
N/A 0.4610 0.8373 N/A
SpeechMassiveFrFrIntentClassification/intent_classification (Accuracy)
(IntentClassification, fr-FR) [?]
N/A 0.4593 0.8473 N/A
SpeechMassiveHuHuIntentClassification/intent_classification (Accuracy)
(IntentClassification, hu-HU) [?]
N/A 0.4153 0.8312 N/A
SpeechMassiveKoKrIntentClassification/intent_classification (Accuracy)
(IntentClassification, ko-KR) [?]
N/A 0.4469 0.8373 N/A
SpeechMassiveNlNlIntentClassification/intent_classification (Accuracy)
(IntentClassification, nl-NL) [?]
N/A 0.4627 0.8534 N/A
SpeechMassivePlPlIntentClassification/intent_classification (Accuracy)
(IntentClassification, pl-PL) [?]
N/A 0.4553 0.8410 N/A
SpeechMassivePtPtIntentClassification/intent_classification (Accuracy)
(IntentClassification, pt-PT) [?]
N/A 0.4610 0.8362 N/A
SpeechMassiveRuRuIntentClassification/intent_classification (Accuracy)
(IntentClassification, ru-RU) [?]
N/A 0.4546 0.8473 N/A
SpeechMassiveTrTrIntentClassification/intent_classification (Accuracy)
(IntentClassification, tr-TR) [?]
N/A 0.4328 0.8329 N/A
SpeechMassiveViVnIntentClassification/intent_classification (Accuracy)
(IntentClassification, vi-VN) [?]
N/A 0.4233 0.8319 N/A

clustering

Metric clap hubert perch raw wav2vec2
BirdsetClusteringHSN/clustering (VMeasure)
(Clustering, en-US) [?]
0.1907 0.0495 0.2947 0.0401 N/A
BirdsetClusteringNBP/clustering (VMeasure)
(Clustering, en-US) [?]
0.5196 N/A 0.5265 N/A N/A
BirdsetClusteringPOW/clustering (VMeasure)
(Clustering, en-US) [?]
0.2605 0.1089 0.3539 0.1441 N/A
FSD50KTestClustering/sound_event (VMeasure)
(Clustering, und) [?]
0.6768 N/A N/A N/A N/A
SVQClustering/speaker_age (VMeasure)
(Clustering, en-US)
N/A N/A N/A 0.0604 N/A
SVQClustering/speaker_gender (VMeasure)
(Clustering, en-US)
N/A N/A N/A 0.0079 N/A
SVQClustering/speaker_id (VMeasure)
(Clustering, en-US)
N/A N/A N/A 0.3682 N/A
SVQClusteringArEg/speaker_age (VMeasure)
(Clustering, ar-EG) [?]
N/A 0.8482 N/A 0.9007 0.8868
SVQClusteringArEg/speaker_gender (VMeasure)
(Clustering, ar-EG) [?]
N/A 0.0018 N/A 0.0403 0.0018
SVQClusteringArEg/speaker_id (VMeasure)
(Clustering, ar-EG) [?]
N/A 0.8330 N/A 0.9359 0.9359
SVQClusteringArXGulf/speaker_age (VMeasure)
(Clustering, ar-x-gulf) [?]
N/A 0.8868 N/A 0.9007 0.8482
SVQClusteringArXGulf/speaker_gender (VMeasure)
(Clustering, ar-x-gulf) [?]
N/A 0.0018 N/A 0.4208 0.0986
SVQClusteringArXGulf/speaker_id (VMeasure)
(Clustering, ar-x-gulf) [?]
N/A 0.8069 N/A 0.8482 0.8482
SVQClusteringArXLevant/speaker_age (VMeasure)
(Clustering, ar-x-levant) [?]
N/A 0.9007 N/A 0.7796 0.9007
SVQClusteringArXLevant/speaker_gender (VMeasure)
(Clustering, ar-x-levant) [?]
N/A 0.5291 N/A 0.4084 0.5291
SVQClusteringArXLevant/speaker_id (VMeasure)
(Clustering, ar-x-levant) [?]
N/A 0.9007 N/A 0.9176 0.9359
SVQClusteringArXMaghrebi/speaker_age (VMeasure)
(Clustering, ar-x-maghrebi) [?]
N/A 0.8056 N/A 0.7745 0.8056
SVQClusteringArXMaghrebi/speaker_gender (VMeasure)
(Clustering, ar-x-maghrebi) [?]
N/A 0.0371 N/A 0.0986 0.0371
SVQClusteringArXMaghrebi/speaker_id (VMeasure)
(Clustering, ar-x-maghrebi) [?]
N/A 0.9359 N/A 0.9690 0.9690
SVQClusteringBnBd/speaker_age (VMeasure)
(Clustering, bn-BD) [?]
N/A 0.6555 N/A 0.6628 0.7208
SVQClusteringBnBd/speaker_gender (VMeasure)
(Clustering, bn-BD) [?]
N/A 1.0000 N/A 1.0000 1.0000
SVQClusteringBnBd/speaker_id (VMeasure)
(Clustering, bn-BD) [?]
N/A 0.7745 N/A 0.7330 0.8530
SVQClusteringBnIn/speaker_age (VMeasure)
(Clustering, bn-IN) [?]
N/A 0.6421 N/A 0.8238 0.6421
SVQClusteringBnIn/speaker_gender (VMeasure)
(Clustering, bn-IN) [?]
N/A 1.0000 N/A 1.0000 1.0000
SVQClusteringBnIn/speaker_id (VMeasure)
(Clustering, bn-IN) [?]
N/A 0.8530 N/A 0.8787 0.8631
SVQClusteringEnAu/speaker_age (VMeasure)
(Clustering, en-AU) [?]
N/A 0.8482 N/A 0.9007 0.8228
SVQClusteringEnAu/speaker_gender (VMeasure)
(Clustering, en-AU) [?]
N/A 0.0206 N/A 0.3037 0.0206
SVQClusteringEnAu/speaker_id (VMeasure)
(Clustering, en-AU) [?]
N/A 0.8868 N/A 0.9007 0.9690
SVQClusteringEnGb/speaker_age (VMeasure)
(Clustering, en-GB) [?]
N/A 0.6284 N/A 0.8369 0.7621
SVQClusteringEnGb/speaker_gender (VMeasure)
(Clustering, en-GB) [?]
N/A 0.2174 N/A 0.0018 0.0986
SVQClusteringEnGb/speaker_id (VMeasure)
(Clustering, en-GB) [?]
N/A 0.9114 N/A 0.7621 0.7796
SVQClusteringEnIn/speaker_age (VMeasure)
(Clustering, en-IN) [?]
N/A 0.7905 N/A 0.7448 0.7145
SVQClusteringEnIn/speaker_gender (VMeasure)
(Clustering, en-IN) [?]
N/A 0.5291 N/A 0.3063 0.2895
SVQClusteringEnIn/speaker_id (VMeasure)
(Clustering, en-IN) [?]
N/A 0.9007 N/A 0.8631 0.9007
SVQClusteringEnPh/speaker_age (VMeasure)
(Clustering, en-PH) [?]
N/A 0.8482 N/A 0.8631 0.9176
SVQClusteringEnPh/speaker_gender (VMeasure)
(Clustering, en-PH) [?]
N/A 0.1097 N/A 0.1097 0.1097
SVQClusteringEnPh/speaker_id (VMeasure)
(Clustering, en-PH) [?]
N/A 0.8868 N/A 0.9359 0.9690
SVQClusteringEnUs/speaker_age (VMeasure)
(Clustering, en-US) [?]
N/A 0.6993 N/A 0.6355 0.7189
SVQClusteringEnUs/speaker_gender (VMeasure)
(Clustering, en-US) [?]
N/A 0.1214 N/A 0.5061 0.1117
SVQClusteringEnUs/speaker_id (VMeasure)
(Clustering, en-US) [?]
N/A 0.8069 N/A 0.9505 0.8069
SVQClusteringFiFi/speaker_age (VMeasure)
(Clustering, fi-FI) [?]
N/A 0.5421 N/A 0.5340 0.6189
SVQClusteringFiFi/speaker_gender (VMeasure)
(Clustering, fi-FI) [?]
N/A 0.4208 N/A 0.3037 0.0812
SVQClusteringFiFi/speaker_id (VMeasure)
(Clustering, fi-FI) [?]
N/A 0.9359 N/A 0.9690 0.8868
SVQClusteringGuIn/speaker_age (VMeasure)
(Clustering, gu-IN) [?]
N/A 0.6387 N/A 0.6057 0.5455
SVQClusteringGuIn/speaker_gender (VMeasure)
(Clustering, gu-IN) [?]
N/A 0.1263 N/A 0.2746 0.0000
SVQClusteringGuIn/speaker_id (VMeasure)
(Clustering, gu-IN) [?]
N/A 0.7624 N/A 0.7919 0.7330
SVQClusteringHiIn/speaker_age (VMeasure)
(Clustering, hi-IN) [?]
N/A 0.8158 N/A 0.8238 0.7740
SVQClusteringHiIn/speaker_gender (VMeasure)
(Clustering, hi-IN) [?]
N/A 0.6190 N/A 0.0063 0.2020
SVQClusteringHiIn/speaker_id (VMeasure)
(Clustering, hi-IN) [?]
N/A 0.7433 N/A 0.8721 0.8525
SVQClusteringIdId/speaker_age (VMeasure)
(Clustering, id-ID) [?]
N/A 0.7624 N/A 0.7796 0.9176
SVQClusteringIdId/speaker_gender (VMeasure)
(Clustering, id-ID) [?]
N/A 0.2866 N/A 0.1097 0.0812
SVQClusteringIdId/speaker_id (VMeasure)
(Clustering, id-ID) [?]
N/A 0.9007 N/A 0.8482 0.8069
SVQClusteringJaJp/speaker_age (VMeasure)
(Clustering, ja-JP) [?]
N/A 0.7538 N/A 0.5843 0.7433
SVQClusteringJaJp/speaker_gender (VMeasure)
(Clustering, ja-JP) [?]
N/A 0.1010 N/A 0.0403 0.1010
SVQClusteringJaJp/speaker_id (VMeasure)
(Clustering, ja-JP) [?]
N/A 0.7039 N/A 0.6628 0.6478
SVQClusteringKnIn/speaker_age (VMeasure)
(Clustering, kn-IN) [?]
N/A 0.5611 N/A 0.7547 0.6421
SVQClusteringKnIn/speaker_gender (VMeasure)
(Clustering, kn-IN) [?]
N/A 0.2342 N/A 0.1650 0.2155
SVQClusteringKnIn/speaker_id (VMeasure)
(Clustering, kn-IN) [?]
N/A 0.7796 N/A 0.8693 0.7796
SVQClusteringKoKr/speaker_age (VMeasure)
(Clustering, ko-KR) [?]
N/A 0.8093 N/A 0.7621 0.7145
SVQClusteringKoKr/speaker_gender (VMeasure)
(Clustering, ko-KR) [?]
N/A 0.0087 N/A 0.1263 0.0063
SVQClusteringKoKr/speaker_id (VMeasure)
(Clustering, ko-KR) [?]
N/A 0.7796 N/A 0.8482 0.8787
SVQClusteringMlIn/speaker_age (VMeasure)
(Clustering, ml-IN) [?]
N/A 0.7556 N/A 0.7624 0.7740
SVQClusteringMlIn/speaker_gender (VMeasure)
(Clustering, ml-IN) [?]
N/A 0.0371 N/A 0.0290 0.2746
SVQClusteringMlIn/speaker_id (VMeasure)
(Clustering, ml-IN) [?]
N/A 0.8868 N/A 0.9359 0.9359
SVQClusteringMrIn/speaker_age (VMeasure)
(Clustering, mr-IN) [?]
N/A 0.7796 N/A 0.8583 0.7145
SVQClusteringMrIn/speaker_gender (VMeasure)
(Clustering, mr-IN) [?]
N/A 0.4024 N/A 0.2811 0.1470
SVQClusteringMrIn/speaker_id (VMeasure)
(Clustering, mr-IN) [?]
N/A 0.7145 N/A 0.8414 0.7624
SVQClusteringRuRu/speaker_age (VMeasure)
(Clustering, ru-RU) [?]
N/A 0.8369 N/A 0.8069 0.8069
SVQClusteringRuRu/speaker_gender (VMeasure)
(Clustering, ru-RU) [?]
N/A 0.1471 N/A 0.2866 0.0573
SVQClusteringRuRu/speaker_id (VMeasure)
(Clustering, ru-RU) [?]
N/A 0.9229 N/A 0.8631 0.9690
SVQClusteringSw/speaker_age (VMeasure)
(Clustering, sw) [?]
N/A 0.8056 N/A 0.7239 0.8210
SVQClusteringSw/speaker_gender (VMeasure)
(Clustering, sw) [?]
N/A 0.0290 N/A 0.0371 0.0290
SVQClusteringSw/speaker_id (VMeasure)
(Clustering, sw) [?]
N/A 0.8937 N/A 0.8228 0.9176
SVQClusteringTaIn/speaker_age (VMeasure)
(Clustering, ta-IN) [?]
N/A 0.7239 N/A 0.7239 0.4926
SVQClusteringTaIn/speaker_gender (VMeasure)
(Clustering, ta-IN) [?]
N/A 0.1263 N/A 0.2746 0.0290
SVQClusteringTaIn/speaker_id (VMeasure)
(Clustering, ta-IN) [?]
N/A 0.8937 N/A 0.8093 0.8937
SVQClusteringTeIn/speaker_age (VMeasure)
(Clustering, te-IN) [?]
N/A 0.7919 N/A 0.7905 0.7624
SVQClusteringTeIn/speaker_gender (VMeasure)
(Clustering, te-IN) [?]
N/A 0.0063 N/A 0.0063 0.0063
SVQClusteringTeIn/speaker_id (VMeasure)
(Clustering, te-IN) [?]
N/A 0.9690 N/A 0.9690 0.9690
SVQClusteringUrIn/speaker_age (VMeasure)
(Clustering, ur-IN) [?]
N/A 0.6955 N/A 0.8369 0.8203
SVQClusteringUrIn/speaker_gender (VMeasure)
(Clustering, ur-IN) [?]
N/A 0.0371 N/A 0.0986 0.0063
SVQClusteringUrIn/speaker_id (VMeasure)
(Clustering, ur-IN) [?]
N/A 0.8228 N/A 0.9316 0.8203
SVQClusteringUrPk/speaker_age (VMeasure)
(Clustering, ur-PK) [?]
N/A 0.8530 N/A 0.8530 0.7621
SVQClusteringUrPk/speaker_gender (VMeasure)
(Clustering, ur-PK) [?]
N/A 0.0371 N/A 0.2746 0.0371
SVQClusteringUrPk/speaker_id (VMeasure)
(Clustering, ur-PK) [?]
N/A 0.8787 N/A 0.7796 0.8203

reasoning

Metric gecko gemini gemma
SVQArEgSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, ar-EG) [?]
0.0000 0.0320 0.4112
SVQArEgSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, ar-EG) [?]
0.1533 0.0719 0.5728
SVQArXGulfSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, ar-x-gulf) [?]
0.0000 0.0324 0.4071
SVQArXGulfSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, ar-x-gulf) [?]
0.1526 0.0712 0.5736
SVQArXLevantSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, ar-x-levant) [?]
0.0000 0.0335 0.4109
SVQArXLevantSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, ar-x-levant) [?]
0.3418 0.0720 0.5713
SVQArXMaghrebiSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, ar-x-maghrebi) [?]
0.0000 0.0318 0.4035
SVQArXMaghrebiSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, ar-x-maghrebi) [?]
0.1509 0.0718 0.5732
SVQBnBdSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, bn-BD) [?]
0.0000 0.0010 0.5471
SVQBnBdSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, bn-BD) [?]
0.1676 0.0792 0.6469
SVQBnInSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, bn-IN) [?]
0.0000 0.0011 0.5511
SVQBnInSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, bn-IN) [?]
0.1769 0.0845 0.6585
SVQEnAuSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, en-AU) [?]
0.1375 0.0606 0.6006
SVQEnGbSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, en-GB) [?]
0.1388 0.0605 0.6058
SVQEnInSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, en-IN) [?]
0.1361 0.0568 0.5981
SVQEnPhSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, en-PH) [?]
0.1379 0.0619 0.6014
SVQEnUsSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, en-US) [?]
0.1395 0.0619 0.6087
SVQFiFiSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, fi-FI) [?]
0.0000 0.0290 0.3884
SVQFiFiSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, fi-FI) [?]
0.1934 0.0710 0.6080
SVQGuInSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, gu-IN) [?]
0.0000 0.0054 0.5520
SVQHiInSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, hi-IN) [?]
0.0112 0.0000 0.4576
SVQIdIdSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, id-ID) [?]
0.1873 0.0963 0.5972
SVQJaJpSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, ja-JP) [?]
0.0000 0.0277 0.4292
SVQKnInSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, kn-IN) [?]
0.0018 0.0037 0.5447
SVQKoKrSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, ko-KR) [?]
0.0000 0.0192 0.4226
SVQKoKrSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, ko-KR) [?]
0.1407 0.0529 0.5844
SVQMlInSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, ml-IN) [?]
0.0016 0.0010 0.5474
SVQMrInSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, mr-IN) [?]
0.0016 0.0028 0.5285
SVQRuRuSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, ru-RU) [?]
0.0000 0.0288 0.4044
SVQRuRuSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, ru-RU) [?]
0.1095 0.0651 0.5618
SVQSwSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, sw) [?]
0.1345 0.0723 0.6049
SVQTaInSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, ta-IN) [?]
0.0000 0.0039 0.5229
SVQTeInSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, te-IN) [?]
0.0015 0.0022 0.5246
SVQTeInSpanInLangReasoning/span_reasoning_in_lang (GmeanF1)
(SpanInLangReasoning, te-IN) [?]
0.1276 0.0628 0.6267
SVQUrInSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, ur-IN) [?]
0.0000 0.0045 0.4862
SVQUrPkSpanCrossLangReasoning/span_reasoning_cross_lang (GmeanF1)
(SpanCrossLangReasoning, ur-PK) [?]
0.0000 0.0051 0.4929

reranking

Metric gecko gemini gpt
SVQArEgQueryReranking/query_reranking (MAP)
(QueryReranking, ar-EG) [?]
0.9993 0.9990 0.1724
SVQArEgQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, ar-EG) [?]
N/A N/A 0.1659
SVQArEgQueryReranking/query_reranking:clean (MAP)
(QueryReranking, ar-EG) [?]
N/A N/A 0.1757
SVQArEgQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, ar-EG) [?]
N/A N/A 0.1746
SVQArEgQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, ar-EG) [?]
N/A N/A 0.1737
SVQArXGulfQueryReranking/query_reranking (MAP)
(QueryReranking, ar-x-gulf) [?]
0.9992 0.9989 0.1761
SVQArXGulfQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, ar-x-gulf) [?]
N/A N/A 0.1730
SVQArXGulfQueryReranking/query_reranking:clean (MAP)
(QueryReranking, ar-x-gulf) [?]
N/A N/A 0.1858
SVQArXGulfQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, ar-x-gulf) [?]
N/A N/A 0.1773
SVQArXGulfQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, ar-x-gulf) [?]
N/A N/A 0.1683
SVQArXLevantQueryReranking/query_reranking (MAP)
(QueryReranking, ar-x-levant) [?]
0.9994 0.9990 0.1764
SVQArXLevantQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, ar-x-levant) [?]
N/A N/A 0.1757
SVQArXLevantQueryReranking/query_reranking:clean (MAP)
(QueryReranking, ar-x-levant) [?]
N/A N/A 0.1841
SVQArXLevantQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, ar-x-levant) [?]
N/A N/A 0.1748
SVQArXLevantQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, ar-x-levant) [?]
N/A N/A 0.1705
SVQArXMaghrebiQueryReranking/query_reranking (MAP)
(QueryReranking, ar-x-maghrebi) [?]
0.9993 0.9990 0.1594
SVQArXMaghrebiQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, ar-x-maghrebi) [?]
N/A N/A 0.1795
SVQArXMaghrebiQueryReranking/query_reranking:clean (MAP)
(QueryReranking, ar-x-maghrebi) [?]
N/A N/A 0.1478
SVQArXMaghrebiQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, ar-x-maghrebi) [?]
N/A N/A 0.1540
SVQArXMaghrebiQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, ar-x-maghrebi) [?]
N/A N/A 0.1555
SVQBnBdQueryReranking/query_reranking (MAP)
(QueryReranking, bn-BD) [?]
0.9911 0.9920 0.0211
SVQBnBdQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, bn-BD) [?]
N/A N/A 0.0187
SVQBnBdQueryReranking/query_reranking:clean (MAP)
(QueryReranking, bn-BD) [?]
N/A N/A 0.0257
SVQBnBdQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, bn-BD) [?]
N/A N/A 0.0227
SVQBnBdQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, bn-BD) [?]
N/A N/A 0.0168
SVQBnInQueryReranking/query_reranking (MAP)
(QueryReranking, bn-IN) [?]
0.9896 0.9911 0.0170
SVQBnInQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, bn-IN) [?]
N/A N/A 0.0170
SVQBnInQueryReranking/query_reranking:clean (MAP)
(QueryReranking, bn-IN) [?]
N/A N/A 0.0146
SVQBnInQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, bn-IN) [?]
N/A N/A 0.0184
SVQBnInQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, bn-IN) [?]
N/A N/A 0.0179
SVQEnAuQueryReranking/query_reranking (MAP)
(QueryReranking, en-AU) [?]
1.0000 1.0000 0.6870
SVQEnAuQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, en-AU) [?]
N/A N/A 0.6624
SVQEnAuQueryReranking/query_reranking:clean (MAP)
(QueryReranking, en-AU) [?]
N/A N/A 0.7011
SVQEnAuQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, en-AU) [?]
N/A N/A 0.7048
SVQEnAuQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, en-AU) [?]
N/A N/A 0.6803
SVQEnGbQueryReranking/query_reranking (MAP)
(QueryReranking, en-GB) [?]
1.0000 1.0000 0.6669
SVQEnGbQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, en-GB) [?]
N/A N/A 0.6319
SVQEnGbQueryReranking/query_reranking:clean (MAP)
(QueryReranking, en-GB) [?]
N/A N/A 0.6931
SVQEnGbQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, en-GB) [?]
N/A N/A 0.6903
SVQEnGbQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, en-GB) [?]
N/A N/A 0.6513
SVQEnInQueryReranking/query_reranking (MAP)
(QueryReranking, en-IN) [?]
1.0000 1.0000 0.6978
SVQEnInQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, en-IN) [?]
N/A N/A 0.6593
SVQEnInQueryReranking/query_reranking:clean (MAP)
(QueryReranking, en-IN) [?]
N/A N/A 0.7131
SVQEnInQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, en-IN) [?]
N/A N/A 0.7155
SVQEnInQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, en-IN) [?]
N/A N/A 0.7027
SVQEnPhQueryReranking/query_reranking (MAP)
(QueryReranking, en-PH) [?]
1.0000 1.0000 0.6827
SVQEnPhQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, en-PH) [?]
N/A N/A 0.6772
SVQEnPhQueryReranking/query_reranking:clean (MAP)
(QueryReranking, en-PH) [?]
N/A N/A 0.7026
SVQEnPhQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, en-PH) [?]
N/A N/A 0.6945
SVQEnPhQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, en-PH) [?]
N/A N/A 0.6568
SVQEnUsQueryReranking/query_reranking (MAP)
(QueryReranking, en-US) [?]
1.0000 1.0000 0.6712
SVQEnUsQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, en-US) [?]
N/A N/A 0.6341
SVQEnUsQueryReranking/query_reranking:clean (MAP)
(QueryReranking, en-US) [?]
N/A N/A 0.6973
SVQEnUsQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, en-US) [?]
N/A N/A 0.6547
SVQEnUsQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, en-US) [?]
N/A N/A 0.6990
SVQFiFiQueryReranking/query_reranking (MAP)
(QueryReranking, fi-FI) [?]
1.0000 1.0000 0.4992
SVQFiFiQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, fi-FI) [?]
N/A N/A 0.4207
SVQFiFiQueryReranking/query_reranking:clean (MAP)
(QueryReranking, fi-FI) [?]
N/A N/A 0.5522
SVQFiFiQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, fi-FI) [?]
N/A N/A 0.4970
SVQFiFiQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, fi-FI) [?]
N/A N/A 0.5040
SVQGuInQueryReranking/query_reranking (MAP)
(QueryReranking, gu-IN) [?]
0.9991 0.9974 0.0148
SVQGuInQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, gu-IN) [?]
N/A N/A 0.0205
SVQGuInQueryReranking/query_reranking:clean (MAP)
(QueryReranking, gu-IN) [?]
N/A N/A 0.0125
SVQGuInQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, gu-IN) [?]
N/A N/A 0.0095
SVQGuInQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, gu-IN) [?]
N/A N/A 0.0166
SVQHiInQueryReranking/query_reranking (MAP)
(QueryReranking, hi-IN) [?]
0.9989 0.9993 0.0472
SVQHiInQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, hi-IN) [?]
N/A N/A 0.0473
SVQHiInQueryReranking/query_reranking:clean (MAP)
(QueryReranking, hi-IN) [?]
N/A N/A 0.0501
SVQHiInQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, hi-IN) [?]
N/A N/A 0.0468
SVQHiInQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, hi-IN) [?]
N/A N/A 0.0446
SVQIdIdQueryReranking/query_reranking (MAP)
(QueryReranking, id-ID) [?]
1.0000 1.0000 0.6047
SVQIdIdQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, id-ID) [?]
N/A N/A 0.5585
SVQIdIdQueryReranking/query_reranking:clean (MAP)
(QueryReranking, id-ID) [?]
N/A N/A 0.6309
SVQIdIdQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, id-ID) [?]
N/A N/A 0.5988
SVQIdIdQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, id-ID) [?]
N/A N/A 0.6157
SVQJaJpQueryReranking/query_reranking (MAP)
(QueryReranking, ja-JP) [?]
0.9997 0.9996 0.0285
SVQJaJpQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, ja-JP) [?]
N/A N/A 0.0327
SVQJaJpQueryReranking/query_reranking:clean (MAP)
(QueryReranking, ja-JP) [?]
N/A N/A 0.0249
SVQJaJpQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, ja-JP) [?]
N/A N/A 0.0276
SVQJaJpQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, ja-JP) [?]
N/A N/A 0.0286
SVQKnInQueryReranking/query_reranking (MAP)
(QueryReranking, kn-IN) [?]
0.9997 0.9998 0.0232
SVQKnInQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, kn-IN) [?]
N/A N/A 0.0254
SVQKnInQueryReranking/query_reranking:clean (MAP)
(QueryReranking, kn-IN) [?]
N/A N/A 0.0217
SVQKnInQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, kn-IN) [?]
N/A N/A 0.0208
SVQKnInQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, kn-IN) [?]
N/A N/A 0.0245
SVQKoKrQueryReranking/query_reranking (MAP)
(QueryReranking, ko-KR) [?]
0.9996 0.9998 0.0233
SVQKoKrQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, ko-KR) [?]
N/A N/A 0.0226
SVQKoKrQueryReranking/query_reranking:clean (MAP)
(QueryReranking, ko-KR) [?]
N/A N/A 0.0236
SVQKoKrQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, ko-KR) [?]
N/A N/A 0.0229
SVQKoKrQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, ko-KR) [?]
N/A N/A 0.0240
SVQMlInQueryReranking/query_reranking (MAP)
(QueryReranking, ml-IN) [?]
0.9964 0.9958 0.0138
SVQMlInQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, ml-IN) [?]
N/A N/A 0.0158
SVQMlInQueryReranking/query_reranking:clean (MAP)
(QueryReranking, ml-IN) [?]
N/A N/A 0.0144
SVQMlInQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, ml-IN) [?]
N/A N/A 0.0071
SVQMlInQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, ml-IN) [?]
N/A N/A 0.0167
SVQMrInQueryReranking/query_reranking (MAP)
(QueryReranking, mr-IN) [?]
0.2942 0.9862 0.0408
SVQMrInQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, mr-IN) [?]
N/A N/A 0.0420
SVQMrInQueryReranking/query_reranking:clean (MAP)
(QueryReranking, mr-IN) [?]
N/A N/A 0.0405
SVQMrInQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, mr-IN) [?]
N/A N/A 0.0406
SVQMrInQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, mr-IN) [?]
N/A N/A 0.0402
SVQRuRuQueryReranking/query_reranking (MAP)
(QueryReranking, ru-RU) [?]
0.9999 0.9999 0.1401
SVQRuRuQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, ru-RU) [?]
N/A N/A 0.1285
SVQRuRuQueryReranking/query_reranking:clean (MAP)
(QueryReranking, ru-RU) [?]
N/A N/A 0.1407
SVQRuRuQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, ru-RU) [?]
N/A N/A 0.1381
SVQRuRuQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, ru-RU) [?]
N/A N/A 0.1531
SVQSwQueryReranking/query_reranking (MAP)
(QueryReranking, sw) [?]
1.0000 0.9998 0.4191
SVQSwQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, sw) [?]
N/A N/A 0.4064
SVQSwQueryReranking/query_reranking:clean (MAP)
(QueryReranking, sw) [?]
N/A N/A 0.4414
SVQSwQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, sw) [?]
N/A N/A 0.4100
SVQSwQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, sw) [?]
N/A N/A 0.4188
SVQTaInQueryReranking/query_reranking (MAP)
(QueryReranking, ta-IN) [?]
0.9806 0.9779 0.0189
SVQTaInQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, ta-IN) [?]
N/A N/A 0.0103
SVQTaInQueryReranking/query_reranking:clean (MAP)
(QueryReranking, ta-IN) [?]
N/A N/A 0.0168
SVQTaInQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, ta-IN) [?]
N/A N/A 0.0189
SVQTaInQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, ta-IN) [?]
N/A N/A 0.0284
SVQTeInQueryReranking/query_reranking (MAP)
(QueryReranking, te-IN) [?]
0.9699 0.9747 0.0251
SVQTeInQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, te-IN) [?]
N/A N/A 0.0256
SVQTeInQueryReranking/query_reranking:clean (MAP)
(QueryReranking, te-IN) [?]
N/A N/A 0.0250
SVQTeInQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, te-IN) [?]
N/A N/A 0.0237
SVQTeInQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, te-IN) [?]
N/A N/A 0.0262
SVQUrInQueryReranking/query_reranking (MAP)
(QueryReranking, ur-IN) [?]
0.9983 0.9977 0.0475
SVQUrInQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, ur-IN) [?]
N/A N/A 0.0517
SVQUrInQueryReranking/query_reranking:clean (MAP)
(QueryReranking, ur-IN) [?]
N/A N/A 0.0479
SVQUrInQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, ur-IN) [?]
N/A N/A 0.0441
SVQUrInQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, ur-IN) [?]
N/A N/A 0.0465
SVQUrPkQueryReranking/query_reranking (MAP)
(QueryReranking, ur-PK) [?]
0.9982 0.9979 0.0434
SVQUrPkQueryReranking/query_reranking:background_speech (MAP)
(QueryReranking, ur-PK) [?]
N/A N/A 0.0517
SVQUrPkQueryReranking/query_reranking:clean (MAP)
(QueryReranking, ur-PK) [?]
N/A N/A 0.0438
SVQUrPkQueryReranking/query_reranking:media_noise (MAP)
(QueryReranking, ur-PK) [?]
N/A N/A 0.0405
SVQUrPkQueryReranking/query_reranking:traffic_noise (MAP)
(QueryReranking, ur-PK) [?]
N/A N/A 0.0375

retrieval

Metric gecko gemini gpt
SVQArEgDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, ar-EG) [?]
0.3081 0.6160 0.5694
SVQArEgDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, ar-EG) [?]
N/A 0.6188 0.5677
SVQArEgDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, ar-EG) [?]
N/A 0.6132 0.5702
SVQArEgDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, ar-EG) [?]
N/A 0.6137 0.5730
SVQArEgDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, ar-EG) [?]
N/A 0.6182 0.5667
SVQArEgDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, ar-EG) [?]
0.6476 0.7312 0.5023
SVQArEgDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, ar-EG) [?]
N/A 0.6118 0.5026
SVQArEgDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, ar-EG) [?]
N/A 0.6469 0.5058
SVQArEgDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, ar-EG) [?]
N/A 0.6104 0.4972
SVQArEgDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, ar-EG) [?]
N/A 0.6061 0.5038
SVQArEgPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, ar-EG) [?]
0.4511 0.7392 0.7473
SVQArEgPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, ar-EG) [?]
N/A 0.7441 0.7450
SVQArEgPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, ar-EG) [?]
N/A 0.7360 0.7514
SVQArEgPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, ar-EG) [?]
N/A 0.7351 0.7494
SVQArEgPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, ar-EG) [?]
N/A 0.7416 0.7436
SVQArEgPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, ar-EG) [?]
0.7034 0.8491 0.8217
SVQArEgPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, ar-EG) [?]
N/A 0.8498 0.8223
SVQArEgPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, ar-EG) [?]
N/A 0.8538 0.8240
SVQArEgPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, ar-EG) [?]
N/A 0.8474 0.8192
SVQArEgPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, ar-EG) [?]
N/A 0.8453 0.8214
SVQArXGulfDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, ar-x-gulf) [?]
0.3096 0.6363 0.5726
SVQArXGulfDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, ar-x-gulf) [?]
N/A 0.6329 0.5703
SVQArXGulfDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, ar-x-gulf) [?]
N/A 0.6465 0.5748
SVQArXGulfDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, ar-x-gulf) [?]
N/A 0.6302 0.5749
SVQArXGulfDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, ar-x-gulf) [?]
N/A 0.6354 0.5704
SVQArXGulfDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, ar-x-gulf) [?]
0.6489 0.7322 0.4990
SVQArXGulfDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, ar-x-gulf) [?]
N/A 0.6106 0.4982
SVQArXGulfDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, ar-x-gulf) [?]
N/A 0.6550 0.4940
SVQArXGulfDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, ar-x-gulf) [?]
N/A 0.6105 0.5003
SVQArXGulfDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, ar-x-gulf) [?]
N/A 0.6074 0.5037
SVQArXGulfPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, ar-x-gulf) [?]
0.4516 0.7426 0.7484
SVQArXGulfPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, ar-x-gulf) [?]
N/A 0.7399 0.7492
SVQArXGulfPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, ar-x-gulf) [?]
N/A 0.7427 0.7470
SVQArXGulfPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, ar-x-gulf) [?]
N/A 0.7492 0.7523
SVQArXGulfPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, ar-x-gulf) [?]
N/A 0.7387 0.7452
SVQArXGulfPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, ar-x-gulf) [?]
0.7029 0.8658 0.8219
SVQArXGulfPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, ar-x-gulf) [?]
N/A 0.8647 0.8171
SVQArXGulfPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, ar-x-gulf) [?]
N/A 0.8644 0.8227
SVQArXGulfPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, ar-x-gulf) [?]
N/A 0.8651 0.8217
SVQArXGulfPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, ar-x-gulf) [?]
N/A 0.8690 0.8260
SVQArXLevantDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, ar-x-levant) [?]
0.3066 0.6413 0.5765
SVQArXLevantDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, ar-x-levant) [?]
N/A 0.6400 0.5741
SVQArXLevantDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, ar-x-levant) [?]
N/A 0.6351 0.5772
SVQArXLevantDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, ar-x-levant) [?]
N/A 0.6405 0.5716
SVQArXLevantDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, ar-x-levant) [?]
N/A 0.6507 0.5845
SVQArXLevantDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, ar-x-levant) [?]
0.6470 0.7295 0.4984
SVQArXLevantDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, ar-x-levant) [?]
N/A 0.6159 0.4940
SVQArXLevantDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, ar-x-levant) [?]
N/A 0.6399 0.5166
SVQArXLevantDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, ar-x-levant) [?]
N/A 0.6256 0.4967
SVQArXLevantDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, ar-x-levant) [?]
N/A 0.6211 0.4854
SVQArXLevantPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, ar-x-levant) [?]
0.4543 0.7122 0.7475
SVQArXLevantPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, ar-x-levant) [?]
N/A 0.7175 0.7351
SVQArXLevantPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, ar-x-levant) [?]
N/A 0.7153 0.7526
SVQArXLevantPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, ar-x-levant) [?]
N/A 0.7141 0.7477
SVQArXLevantPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, ar-x-levant) [?]
N/A 0.7126 0.7555
SVQArXLevantPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, ar-x-levant) [?]
0.7038 0.8351 0.8219
SVQArXLevantPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, ar-x-levant) [?]
N/A 0.8340 0.8189
SVQArXLevantPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, ar-x-levant) [?]
N/A 0.8378 0.8215
SVQArXLevantPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, ar-x-levant) [?]
N/A 0.8389 0.8259
SVQArXLevantPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, ar-x-levant) [?]
N/A 0.8291 0.8207
SVQArXMaghrebiDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, ar-x-maghrebi) [?]
0.3060 0.6102 0.5681
SVQArXMaghrebiDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, ar-x-maghrebi) [?]
N/A 0.6015 0.5615
SVQArXMaghrebiDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, ar-x-maghrebi) [?]
N/A 0.6172 0.5815
SVQArXMaghrebiDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, ar-x-maghrebi) [?]
N/A 0.6135 0.5643
SVQArXMaghrebiDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, ar-x-maghrebi) [?]
N/A 0.6090 0.5653
SVQArXMaghrebiDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, ar-x-maghrebi) [?]
0.6501 0.7324 0.4984
SVQArXMaghrebiDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, ar-x-maghrebi) [?]
N/A 0.6104 0.4994
SVQArXMaghrebiDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, ar-x-maghrebi) [?]
N/A 0.6258 0.4954
SVQArXMaghrebiDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, ar-x-maghrebi) [?]
N/A 0.6221 0.5120
SVQArXMaghrebiDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, ar-x-maghrebi) [?]
N/A 0.6127 0.4873
SVQArXMaghrebiPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, ar-x-maghrebi) [?]
0.4479 0.7630 0.7416
SVQArXMaghrebiPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, ar-x-maghrebi) [?]
N/A 0.7670 0.7406
SVQArXMaghrebiPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, ar-x-maghrebi) [?]
N/A 0.7557 0.7359
SVQArXMaghrebiPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, ar-x-maghrebi) [?]
N/A 0.7600 0.7411
SVQArXMaghrebiPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, ar-x-maghrebi) [?]
N/A 0.7694 0.7489
SVQArXMaghrebiPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, ar-x-maghrebi) [?]
0.7015 0.8635 0.8244
SVQArXMaghrebiPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, ar-x-maghrebi) [?]
N/A 0.8629 0.8253
SVQArXMaghrebiPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, ar-x-maghrebi) [?]
N/A 0.8678 0.8234
SVQArXMaghrebiPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, ar-x-maghrebi) [?]
N/A 0.8688 0.8260
SVQArXMaghrebiPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, ar-x-maghrebi) [?]
N/A 0.8549 0.8230
SVQBnBdDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, bn-BD) [?]
0.3257 0.6519 0.5875
SVQBnBdDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, bn-BD) [?]
N/A 0.6582 0.5833
SVQBnBdDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, bn-BD) [?]
N/A 0.6551 0.5907
SVQBnBdDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, bn-BD) [?]
N/A 0.6504 0.5905
SVQBnBdDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, bn-BD) [?]
N/A 0.6403 0.5859
SVQBnBdDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, bn-BD) [?]
0.5933 0.7479 0.2601
SVQBnBdDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, bn-BD) [?]
N/A 0.7023 0.2285
SVQBnBdDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, bn-BD) [?]
N/A 0.6927 0.2872
SVQBnBdDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, bn-BD) [?]
N/A 0.6333 0.2510
SVQBnBdDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, bn-BD) [?]
N/A 0.6101 0.2755
SVQBnBdPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, bn-BD) [?]
0.4703 0.7481 0.7705
SVQBnBdPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, bn-BD) [?]
N/A 0.7472 0.7735
SVQBnBdPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, bn-BD) [?]
N/A 0.7552 0.7718
SVQBnBdPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, bn-BD) [?]
N/A 0.7496 0.7603
SVQBnBdPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, bn-BD) [?]
N/A 0.7387 0.7745
SVQBnBdPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, bn-BD) [?]
0.7053 0.8697 0.6666
SVQBnBdPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, bn-BD) [?]
N/A 0.8724 0.6686
SVQBnBdPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, bn-BD) [?]
N/A 0.8640 0.6643
SVQBnBdPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, bn-BD) [?]
N/A 0.8884 0.6546
SVQBnBdPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, bn-BD) [?]
N/A 0.8546 0.6791
SVQBnInDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, bn-IN) [?]
0.3271 0.6298 0.5983
SVQBnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, bn-IN) [?]
N/A 0.6301 0.5991
SVQBnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, bn-IN) [?]
N/A 0.6320 0.5960
SVQBnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, bn-IN) [?]
N/A 0.6276 0.5979
SVQBnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, bn-IN) [?]
N/A 0.6296 0.6003
SVQBnInDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, bn-IN) [?]
0.5968 0.7505 0.2903
SVQBnInDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, bn-IN) [?]
N/A 0.6810 0.2675
SVQBnInDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, bn-IN) [?]
N/A 0.6845 0.3110
SVQBnInDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, bn-IN) [?]
N/A 0.6798 0.3082
SVQBnInDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, bn-IN) [?]
N/A 0.6525 0.2754
SVQBnInPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, bn-IN) [?]
0.4751 0.7765 0.7697
SVQBnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, bn-IN) [?]
N/A 0.7768 0.7692
SVQBnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, bn-IN) [?]
N/A 0.7787 0.7692
SVQBnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, bn-IN) [?]
N/A 0.7732 0.7698
SVQBnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, bn-IN) [?]
N/A 0.7770 0.7707
SVQBnInPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, bn-IN) [?]
0.7037 0.8660 0.6685
SVQBnInPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, bn-IN) [?]
N/A 0.8673 0.6699
SVQBnInPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, bn-IN) [?]
N/A 0.8737 0.6677
SVQBnInPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, bn-IN) [?]
N/A 0.8649 0.6768
SVQBnInPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, bn-IN) [?]
N/A 0.8582 0.6658
SVQEnAuDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, en-AU) [?]
0.6562 0.8029 0.7659
SVQEnAuDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, en-AU) [?]
N/A 0.8037 0.7664
SVQEnAuDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, en-AU) [?]
N/A 0.8004 0.7688
SVQEnAuDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, en-AU) [?]
N/A 0.8043 0.7688
SVQEnAuDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, en-AU) [?]
N/A 0.8033 0.7596
SVQEnAuPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, en-AU) [?]
0.7157 0.8963 0.8313
SVQEnAuPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, en-AU) [?]
N/A 0.8981 0.8308
SVQEnAuPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, en-AU) [?]
N/A 0.8980 0.8300
SVQEnAuPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, en-AU) [?]
N/A 0.8968 0.8340
SVQEnAuPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, en-AU) [?]
N/A 0.8924 0.8305
SVQEnGbDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, en-GB) [?]
0.6568 0.8054 0.7690
SVQEnGbDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, en-GB) [?]
N/A 0.8045 0.7669
SVQEnGbDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, en-GB) [?]
N/A 0.8049 0.7691
SVQEnGbDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, en-GB) [?]
N/A 0.8069 0.7703
SVQEnGbDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, en-GB) [?]
N/A 0.8054 0.7695
SVQEnGbPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, en-GB) [?]
0.7146 0.8913 0.8298
SVQEnGbPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, en-GB) [?]
N/A 0.8902 0.8307
SVQEnGbPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, en-GB) [?]
N/A 0.8914 0.8315
SVQEnGbPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, en-GB) [?]
N/A 0.8954 0.8299
SVQEnGbPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, en-GB) [?]
N/A 0.8882 0.8270
SVQEnInDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, en-IN) [?]
0.6571 0.7976 0.7623
SVQEnInDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, en-IN) [?]
N/A 0.7928 0.7563
SVQEnInDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, en-IN) [?]
N/A 0.7986 0.7641
SVQEnInDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, en-IN) [?]
N/A 0.7985 0.7673
SVQEnInDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, en-IN) [?]
N/A 0.8004 0.7613
SVQEnInPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, en-IN) [?]
0.7158 0.8674 0.8292
SVQEnInPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, en-IN) [?]
N/A 0.8681 0.8200
SVQEnInPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, en-IN) [?]
N/A 0.8650 0.8323
SVQEnInPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, en-IN) [?]
N/A 0.8690 0.8320
SVQEnInPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, en-IN) [?]
N/A 0.8675 0.8325
SVQEnPhDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, en-PH) [?]
0.6572 0.8049 0.7697
SVQEnPhDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, en-PH) [?]
N/A 0.8028 0.7648
SVQEnPhDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, en-PH) [?]
N/A 0.8078 0.7713
SVQEnPhDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, en-PH) [?]
N/A 0.8048 0.7701
SVQEnPhDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, en-PH) [?]
N/A 0.8042 0.7724
SVQEnPhPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, en-PH) [?]
0.7145 0.8944 0.8331
SVQEnPhPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, en-PH) [?]
N/A 0.9007 0.8338
SVQEnPhPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, en-PH) [?]
N/A 0.8897 0.8282
SVQEnPhPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, en-PH) [?]
N/A 0.8920 0.8327
SVQEnPhPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, en-PH) [?]
N/A 0.8953 0.8377
SVQEnUsDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, en-US) [?]
0.6574 0.7732 0.7647
SVQEnUsDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, en-US) [?]
N/A 0.7722 0.7646
SVQEnUsDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, en-US) [?]
N/A 0.7724 0.7648
SVQEnUsDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, en-US) [?]
N/A 0.7670 0.7662
SVQEnUsDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, en-US) [?]
N/A 0.7812 0.7630
SVQEnUsPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, en-US) [?]
0.7152 0.8941 0.8316
SVQEnUsPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, en-US) [?]
N/A 0.8970 0.8315
SVQEnUsPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, en-US) [?]
N/A 0.8959 0.8297
SVQEnUsPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, en-US) [?]
N/A 0.8920 0.8346
SVQEnUsPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, en-US) [?]
N/A 0.8917 0.8307
SVQFiFiDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, fi-FI) [?]
0.4204 0.6676 0.6379
SVQFiFiDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, fi-FI) [?]
N/A 0.6525 0.6160
SVQFiFiDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, fi-FI) [?]
N/A 0.6676 0.6428
SVQFiFiDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, fi-FI) [?]
N/A 0.6718 0.6423
SVQFiFiDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, fi-FI) [?]
N/A 0.6712 0.6413
SVQFiFiDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, fi-FI) [?]
0.6709 0.7987 0.7525
SVQFiFiDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, fi-FI) [?]
N/A 0.7979 0.7600
SVQFiFiDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, fi-FI) [?]
N/A 0.7999 0.7492
SVQFiFiDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, fi-FI) [?]
N/A 0.8001 0.7551
SVQFiFiDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, fi-FI) [?]
N/A 0.7967 0.7483
SVQFiFiPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, fi-FI) [?]
0.5257 0.8211 0.7720
SVQFiFiPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, fi-FI) [?]
N/A 0.8232 0.7835
SVQFiFiPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, fi-FI) [?]
N/A 0.8166 0.7751
SVQFiFiPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, fi-FI) [?]
N/A 0.8187 0.7653
SVQFiFiPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, fi-FI) [?]
N/A 0.8256 0.7703
SVQFiFiPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, fi-FI) [?]
0.6500 0.8365 0.7547
SVQFiFiPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, fi-FI) [?]
N/A 0.8431 0.7699
SVQFiFiPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, fi-FI) [?]
N/A 0.8368 0.7485
SVQFiFiPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, fi-FI) [?]
N/A 0.8337 0.7539
SVQFiFiPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, fi-FI) [?]
N/A 0.8354 0.7516
SVQGuInDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, gu-IN) [?]
0.2701 0.5849 0.5470
SVQGuInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, gu-IN) [?]
N/A 0.5909 0.5455
SVQGuInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, gu-IN) [?]
N/A 0.5836 0.5538
SVQGuInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, gu-IN) [?]
N/A 0.5863 0.5498
SVQGuInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, gu-IN) [?]
N/A 0.5790 0.5391
SVQGuInPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, gu-IN) [?]
0.4543 0.7256 0.7212
SVQGuInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, gu-IN) [?]
N/A 0.7218 0.7251
SVQGuInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, gu-IN) [?]
N/A 0.7280 0.7224
SVQGuInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, gu-IN) [?]
N/A 0.7287 0.7199
SVQGuInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, gu-IN) [?]
N/A 0.7237 0.7175
SVQHiInDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, hi-IN) [?]
0.3391 0.5941 0.5536
SVQHiInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, hi-IN) [?]
N/A 0.5862 0.5499
SVQHiInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, hi-IN) [?]
N/A 0.5961 0.5572
SVQHiInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, hi-IN) [?]
N/A 0.5983 0.5534
SVQHiInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, hi-IN) [?]
N/A 0.5956 0.5538
SVQHiInPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, hi-IN) [?]
0.4919 0.7149 0.7230
SVQHiInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, hi-IN) [?]
N/A 0.7175 0.7257
SVQHiInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, hi-IN) [?]
N/A 0.7150 0.7234
SVQHiInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, hi-IN) [?]
N/A 0.7129 0.7264
SVQHiInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, hi-IN) [?]
N/A 0.7143 0.7167
SVQIdIdDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, id-ID) [?]
0.6675 0.8347 0.7870
SVQIdIdDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, id-ID) [?]
N/A 0.8465 0.7955
SVQIdIdDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, id-ID) [?]
N/A 0.8292 0.7840
SVQIdIdDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, id-ID) [?]
N/A 0.8307 0.7848
SVQIdIdDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, id-ID) [?]
N/A 0.8365 0.7862
SVQIdIdPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, id-ID) [?]
0.7041 0.8909 0.8180
SVQIdIdPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, id-ID) [?]
N/A 0.8908 0.8225
SVQIdIdPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, id-ID) [?]
N/A 0.8897 0.8148
SVQIdIdPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, id-ID) [?]
N/A 0.8899 0.8130
SVQIdIdPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, id-ID) [?]
N/A 0.8935 0.8234
SVQJaJpDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, ja-JP) [?]
0.2553 0.5799 0.5403
SVQJaJpDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, ja-JP) [?]
N/A 0.5808 0.5369
SVQJaJpDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, ja-JP) [?]
N/A 0.5858 0.5380
SVQJaJpDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, ja-JP) [?]
N/A 0.5791 0.5460
SVQJaJpDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, ja-JP) [?]
N/A 0.5741 0.5401
SVQJaJpPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, ja-JP) [?]
0.4272 0.7520 0.7472
SVQJaJpPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, ja-JP) [?]
N/A 0.7464 0.7454
SVQJaJpPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, ja-JP) [?]
N/A 0.7543 0.7519
SVQJaJpPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, ja-JP) [?]
N/A 0.7542 0.7479
SVQJaJpPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, ja-JP) [?]
N/A 0.7530 0.7436
SVQKnInDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, kn-IN) [?]
0.2876 0.5926 0.5501
SVQKnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, kn-IN) [?]
N/A 0.5942 0.5522
SVQKnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, kn-IN) [?]
N/A 0.5905 0.5537
SVQKnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, kn-IN) [?]
N/A 0.5908 0.5458
SVQKnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, kn-IN) [?]
N/A 0.5946 0.5476
SVQKnInPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, kn-IN) [?]
0.4209 0.7037 0.7221
SVQKnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, kn-IN) [?]
N/A 0.7078 0.7190
SVQKnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, kn-IN) [?]
N/A 0.7026 0.7228
SVQKnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, kn-IN) [?]
N/A 0.7006 0.7189
SVQKnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, kn-IN) [?]
N/A 0.7033 0.7271
SVQKoKrDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, ko-KR) [?]
0.2811 0.5634 0.5388
SVQKoKrDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, ko-KR) [?]
N/A 0.5624 0.5432
SVQKoKrDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, ko-KR) [?]
N/A 0.5697 0.5352
SVQKoKrDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, ko-KR) [?]
N/A 0.5604 0.5399
SVQKoKrDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, ko-KR) [?]
N/A 0.5610 0.5367
SVQKoKrDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, ko-KR) [?]
0.4085 0.6085 0.3348
SVQKoKrDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, ko-KR) [?]
N/A 0.6073 0.3327
SVQKoKrDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, ko-KR) [?]
N/A 0.6009 0.3348
SVQKoKrDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, ko-KR) [?]
N/A 0.6141 0.3320
SVQKoKrDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, ko-KR) [?]
N/A 0.6118 0.3397
SVQKoKrPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, ko-KR) [?]
0.4412 0.7250 0.7343
SVQKoKrPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, ko-KR) [?]
N/A 0.7185 0.7352
SVQKoKrPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, ko-KR) [?]
N/A 0.7251 0.7363
SVQKoKrPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, ko-KR) [?]
N/A 0.7298 0.7338
SVQKoKrPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, ko-KR) [?]
N/A 0.7267 0.7319
SVQKoKrPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, ko-KR) [?]
0.5882 0.7818 0.5458
SVQKoKrPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, ko-KR) [?]
N/A 0.7841 0.5468
SVQKoKrPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, ko-KR) [?]
N/A 0.7818 0.5556
SVQKoKrPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, ko-KR) [?]
N/A 0.7838 0.5521
SVQKoKrPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, ko-KR) [?]
N/A 0.7776 0.5482
SVQMlInDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, ml-IN) [?]
0.3268 0.5948 0.5701
SVQMlInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, ml-IN) [?]
N/A 0.5907 0.5734
SVQMlInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, ml-IN) [?]
N/A 0.5907 0.5715
SVQMlInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, ml-IN) [?]
N/A 0.5998 0.5621
SVQMlInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, ml-IN) [?]
N/A 0.5986 0.5721
SVQMlInPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, ml-IN) [?]
0.4673 0.7703 0.7290
SVQMlInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, ml-IN) [?]
N/A 0.7754 0.7368
SVQMlInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, ml-IN) [?]
N/A 0.7747 0.7235
SVQMlInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, ml-IN) [?]
N/A 0.7640 0.7257
SVQMlInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, ml-IN) [?]
N/A 0.7665 0.7299
SVQMrInDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, mr-IN) [?]
0.3098 0.5894 0.5580
SVQMrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, mr-IN) [?]
N/A 0.5831 0.5561
SVQMrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, mr-IN) [?]
N/A 0.5911 0.5597
SVQMrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, mr-IN) [?]
N/A 0.5910 0.5603
SVQMrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, mr-IN) [?]
N/A 0.5924 0.5559
SVQMrInPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, mr-IN) [?]
0.4502 0.6885 0.7054
SVQMrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, mr-IN) [?]
N/A 0.6838 0.6972
SVQMrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, mr-IN) [?]
N/A 0.7169 0.7061
SVQMrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, mr-IN) [?]
N/A 0.6958 0.7113
SVQMrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, mr-IN) [?]
N/A 0.6930 0.7068
SVQRuRuDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, ru-RU) [?]
0.3777 0.6331 0.6046
SVQRuRuDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, ru-RU) [?]
N/A 0.6336 0.6027
SVQRuRuDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, ru-RU) [?]
N/A 0.6323 0.6111
SVQRuRuDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, ru-RU) [?]
N/A 0.6347 0.6056
SVQRuRuDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, ru-RU) [?]
N/A 0.6316 0.5991
SVQRuRuDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, ru-RU) [?]
0.5660 0.7517 0.3918
SVQRuRuDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, ru-RU) [?]
N/A 0.7165 0.3973
SVQRuRuDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, ru-RU) [?]
N/A 0.7286 0.4046
SVQRuRuDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, ru-RU) [?]
N/A 0.7200 0.4039
SVQRuRuDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, ru-RU) [?]
N/A 0.6583 0.3637
SVQRuRuPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, ru-RU) [?]
0.5313 0.8106 0.7690
SVQRuRuPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, ru-RU) [?]
N/A 0.8115 0.7695
SVQRuRuPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, ru-RU) [?]
N/A 0.8088 0.7692
SVQRuRuPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, ru-RU) [?]
N/A 0.8111 0.7721
SVQRuRuPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, ru-RU) [?]
N/A 0.8113 0.7651
SVQRuRuPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, ru-RU) [?]
0.5751 0.8141 0.7289
SVQRuRuPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, ru-RU) [?]
N/A 0.8126 0.7250
SVQRuRuPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, ru-RU) [?]
N/A 0.8182 0.7281
SVQRuRuPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, ru-RU) [?]
N/A 0.8134 0.7318
SVQRuRuPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, ru-RU) [?]
N/A 0.8123 0.7309
SVQSwDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, sw) [?]
0.5630 0.7004 0.6658
SVQSwDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, sw) [?]
N/A 0.6979 0.6657
SVQSwDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, sw) [?]
N/A 0.6986 0.6676
SVQSwDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, sw) [?]
N/A 0.6989 0.6633
SVQSwDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, sw) [?]
N/A 0.6974 0.6666
SVQSwPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, sw) [?]
0.5308 0.7344 0.6601
SVQSwPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, sw) [?]
N/A 0.7344 0.6582
SVQSwPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, sw) [?]
N/A 0.7320 0.6616
SVQSwPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, sw) [?]
N/A 0.7330 0.6599
SVQSwPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, sw) [?]
N/A 0.7384 0.6609
SVQTaInDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, ta-IN) [?]
0.2799 0.5769 0.5337
SVQTaInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, ta-IN) [?]
N/A 0.5833 0.5491
SVQTaInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, ta-IN) [?]
N/A 0.5706 0.5266
SVQTaInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, ta-IN) [?]
N/A 0.5739 0.5329
SVQTaInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, ta-IN) [?]
N/A 0.5817 0.5296
SVQTaInPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, ta-IN) [?]
0.4278 0.7514 0.6904
SVQTaInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, ta-IN) [?]
N/A 0.7671 0.7080
SVQTaInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, ta-IN) [?]
N/A 0.7499 0.6897
SVQTaInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, ta-IN) [?]
N/A 0.7451 0.6810
SVQTaInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, ta-IN) [?]
N/A 0.7469 0.6866
SVQTeInDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, te-IN) [?]
0.3165 0.6204 0.5707
SVQTeInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, te-IN) [?]
N/A 0.6135 0.5715
SVQTeInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, te-IN) [?]
N/A 0.6299 0.5719
SVQTeInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, te-IN) [?]
N/A 0.6270 0.5729
SVQTeInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, te-IN) [?]
N/A 0.6114 0.5664
SVQTeInDocumentInLangRetrieval/document_retrieval_in_lang (MRR)
(DocumentInLangRetrieval, te-IN) [?]
0.7186 0.7214 0.2586
SVQTeInDocumentInLangRetrieval/document_retrieval_in_lang:background_speech (MRR)
(DocumentInLangRetrieval, te-IN) [?]
N/A 0.6165 0.2639
SVQTeInDocumentInLangRetrieval/document_retrieval_in_lang:clean (MRR)
(DocumentInLangRetrieval, te-IN) [?]
N/A 0.6138 0.2570
SVQTeInDocumentInLangRetrieval/document_retrieval_in_lang:media_noise (MRR)
(DocumentInLangRetrieval, te-IN) [?]
N/A 0.6083 0.2559
SVQTeInDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise (MRR)
(DocumentInLangRetrieval, te-IN) [?]
N/A 0.6127 0.2572
SVQTeInPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, te-IN) [?]
0.4666 0.7689 0.7339
SVQTeInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, te-IN) [?]
N/A 0.7705 0.7353
SVQTeInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, te-IN) [?]
N/A 0.7705 0.7333
SVQTeInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, te-IN) [?]
N/A 0.7671 0.7411
SVQTeInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, te-IN) [?]
N/A 0.7673 0.7262
SVQTeInPassageInLangRetrieval/passage_retrieval_in_lang (MRR)
(PassageInLangRetrieval, te-IN) [?]
0.6768 0.7923 0.7087
SVQTeInPassageInLangRetrieval/passage_retrieval_in_lang:background_speech (MRR)
(PassageInLangRetrieval, te-IN) [?]
N/A 0.7918 0.7060
SVQTeInPassageInLangRetrieval/passage_retrieval_in_lang:clean (MRR)
(PassageInLangRetrieval, te-IN) [?]
N/A 0.7947 0.7044
SVQTeInPassageInLangRetrieval/passage_retrieval_in_lang:media_noise (MRR)
(PassageInLangRetrieval, te-IN) [?]
N/A 0.7925 0.7081
SVQTeInPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise (MRR)
(PassageInLangRetrieval, te-IN) [?]
N/A 0.7901 0.7162
SVQUrInDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, ur-IN) [?]
0.2094 0.5310 0.4997
SVQUrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, ur-IN) [?]
N/A 0.5355 0.5043
SVQUrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, ur-IN) [?]
N/A 0.5359 0.4904
SVQUrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, ur-IN) [?]
N/A 0.5231 0.5013
SVQUrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, ur-IN) [?]
N/A 0.5374 0.5028
SVQUrInPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, ur-IN) [?]
0.3496 0.7194 0.6709
SVQUrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, ur-IN) [?]
N/A 0.7272 0.6743
SVQUrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, ur-IN) [?]
N/A 0.7135 0.6687
SVQUrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, ur-IN) [?]
N/A 0.7149 0.6679
SVQUrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, ur-IN) [?]
N/A 0.7223 0.6727
SVQUrPkDocumentCrossLangRetrieval/document_retrieval_cross_lang (MRR)
(DocumentCrossLangRetrieval, ur-PK) [?]
0.2072 0.5296 0.4984
SVQUrPkDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech (MRR)
(DocumentCrossLangRetrieval, ur-PK) [?]
N/A 0.5405 0.5081
SVQUrPkDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean (MRR)
(DocumentCrossLangRetrieval, ur-PK) [?]
N/A 0.5215 0.5003
SVQUrPkDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise (MRR)
(DocumentCrossLangRetrieval, ur-PK) [?]
N/A 0.5265 0.4945
SVQUrPkDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise (MRR)
(DocumentCrossLangRetrieval, ur-PK) [?]
N/A 0.5297 0.4904
SVQUrPkPassageCrossLangRetrieval/passage_retrieval_cross_lang (MRR)
(PassageCrossLangRetrieval, ur-PK) [?]
0.3506 0.7138 0.6700
SVQUrPkPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech (MRR)
(PassageCrossLangRetrieval, ur-PK) [?]
N/A 0.7073 0.6679
SVQUrPkPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean (MRR)
(PassageCrossLangRetrieval, ur-PK) [?]
N/A 0.7149 0.6714
SVQUrPkPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise (MRR)
(PassageCrossLangRetrieval, ur-PK) [?]
N/A 0.7168 0.6696
SVQUrPkPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise (MRR)
(PassageCrossLangRetrieval, ur-PK) [?]
N/A 0.7164 0.6713

segmentation

Metric whisper
SVQArEgSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, ar-EG) [?]
0.3863
SVQArEgSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, ar-EG) [?]
0.3939
SVQArEgSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, ar-EG) [?]
0.4485
SVQArEgSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, ar-EG) [?]
0.3721
SVQArEgSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, ar-EG) [?]
0.3307
SVQArXGulfSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, ar-x-gulf) [?]
0.4233
SVQArXGulfSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, ar-x-gulf) [?]
0.4441
SVQArXGulfSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, ar-x-gulf) [?]
0.5149
SVQArXGulfSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, ar-x-gulf) [?]
0.3999
SVQArXGulfSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, ar-x-gulf) [?]
0.3315
SVQArXLevantSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, ar-x-levant) [?]
0.4082
SVQArXLevantSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, ar-x-levant) [?]
0.3720
SVQArXLevantSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, ar-x-levant) [?]
0.4408
SVQArXLevantSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, ar-x-levant) [?]
0.4287
SVQArXLevantSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, ar-x-levant) [?]
0.3872
SVQArXMaghrebiSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, ar-x-maghrebi) [?]
0.2716
SVQArXMaghrebiSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, ar-x-maghrebi) [?]
0.3590
SVQArXMaghrebiSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, ar-x-maghrebi) [?]
0.2372
SVQArXMaghrebiSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, ar-x-maghrebi) [?]
0.2384
SVQArXMaghrebiSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, ar-x-maghrebi) [?]
0.2467
SVQBnBdSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, bn-BD) [?]
0.0493
SVQBnBdSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, bn-BD) [?]
0.0587
SVQBnBdSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, bn-BD) [?]
0.0483
SVQBnBdSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, bn-BD) [?]
0.0378
SVQBnBdSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, bn-BD) [?]
0.0489
SVQBnInSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, bn-IN) [?]
0.0536
SVQBnInSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, bn-IN) [?]
0.0529
SVQBnInSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, bn-IN) [?]
0.0543
SVQBnInSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, bn-IN) [?]
0.0536
SVQBnInSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, bn-IN) [?]
0.0535
SVQEnAuSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, en-AU) [?]
0.8276
SVQEnAuSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, en-AU) [?]
0.8309
SVQEnAuSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, en-AU) [?]
0.8779
SVQEnAuSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, en-AU) [?]
0.8465
SVQEnAuSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, en-AU) [?]
0.7551
SVQEnGbSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, en-GB) [?]
0.7577
SVQEnGbSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, en-GB) [?]
0.7761
SVQEnGbSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, en-GB) [?]
0.8050
SVQEnGbSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, en-GB) [?]
0.8032
SVQEnGbSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, en-GB) [?]
0.6425
SVQEnInSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, en-IN) [?]
0.7832
SVQEnInSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, en-IN) [?]
0.7393
SVQEnInSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, en-IN) [?]
0.7913
SVQEnInSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, en-IN) [?]
0.8129
SVQEnInSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, en-IN) [?]
0.7890
SVQEnPhSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, en-PH) [?]
0.7816
SVQEnPhSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, en-PH) [?]
0.7682
SVQEnPhSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, en-PH) [?]
0.8699
SVQEnPhSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, en-PH) [?]
0.8239
SVQEnPhSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, en-PH) [?]
0.6644
SVQEnUsSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, en-US) [?]
0.8090
SVQEnUsSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, en-US) [?]
0.7387
SVQEnUsSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, en-US) [?]
0.8673
SVQEnUsSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, en-US) [?]
0.8051
SVQEnUsSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, en-US) [?]
0.8254
SVQFiFiSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, fi-FI) [?]
0.3770
SVQFiFiSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, fi-FI) [?]
0.2558
SVQFiFiSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, fi-FI) [?]
0.5280
SVQFiFiSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, fi-FI) [?]
0.3304
SVQFiFiSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, fi-FI) [?]
0.3759
SVQGuInSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, gu-IN) [?]
0.1592
SVQGuInSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, gu-IN) [?]
0.1386
SVQGuInSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, gu-IN) [?]
0.1936
SVQGuInSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, gu-IN) [?]
0.1522
SVQGuInSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, gu-IN) [?]
0.1527
SVQHiInSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, hi-IN) [?]
0.2653
SVQHiInSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, hi-IN) [?]
0.2295
SVQHiInSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, hi-IN) [?]
0.2738
SVQHiInSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, hi-IN) [?]
0.3169
SVQHiInSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, hi-IN) [?]
0.2406
SVQIdIdSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, id-ID) [?]
0.6619
SVQIdIdSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, id-ID) [?]
0.6148
SVQIdIdSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, id-ID) [?]
0.7159
SVQIdIdSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, id-ID) [?]
0.6860
SVQIdIdSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, id-ID) [?]
0.6091
SVQJaJpSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, ja-JP) [?]
0.6247
SVQJaJpSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, ja-JP) [?]
0.6378
SVQJaJpSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, ja-JP) [?]
0.6620
SVQJaJpSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, ja-JP) [?]
0.6472
SVQJaJpSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, ja-JP) [?]
0.5525
SVQKnInSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, kn-IN) [?]
0.1255
SVQKnInSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, kn-IN) [?]
0.1259
SVQKnInSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, kn-IN) [?]
0.1380
SVQKnInSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, kn-IN) [?]
0.1295
SVQKnInSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, kn-IN) [?]
0.1093
SVQKoKrSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, ko-KR) [?]
0.5981
SVQKoKrSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, ko-KR) [?]
0.5828
SVQKoKrSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, ko-KR) [?]
0.6149
SVQKoKrSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, ko-KR) [?]
0.6455
SVQKoKrSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, ko-KR) [?]
0.5490
SVQMlInSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, ml-IN) [?]
0.0033
SVQMlInSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, ml-IN) [?]
0.0026
SVQMlInSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, ml-IN) [?]
0.0036
SVQMlInSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, ml-IN) [?]
0.0038
SVQMlInSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, ml-IN) [?]
0.0032
SVQMrInSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, mr-IN) [?]
0.1101
SVQMrInSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, mr-IN) [?]
0.0990
SVQMrInSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, mr-IN) [?]
0.1169
SVQMrInSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, mr-IN) [?]
0.1321
SVQMrInSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, mr-IN) [?]
0.0923
SVQRuRuSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, ru-RU) [?]
0.6260
SVQRuRuSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, ru-RU) [?]
0.6401
SVQRuRuSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, ru-RU) [?]
0.6521
SVQRuRuSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, ru-RU) [?]
0.6301
SVQRuRuSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, ru-RU) [?]
0.5814
SVQTaInSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, ta-IN) [?]
0.2507
SVQTaInSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, ta-IN) [?]
0.2740
SVQTaInSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, ta-IN) [?]
0.2744
SVQTaInSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, ta-IN) [?]
0.2522
SVQTaInSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, ta-IN) [?]
0.2036
SVQTeInSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, te-IN) [?]
0.0784
SVQTeInSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, te-IN) [?]
0.0760
SVQTeInSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, te-IN) [?]
0.1093
SVQTeInSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, te-IN) [?]
0.0855
SVQTeInSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, te-IN) [?]
0.0435
SVQUrInSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, ur-IN) [?]
0.3223
SVQUrInSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, ur-IN) [?]
0.2837
SVQUrInSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, ur-IN) [?]
0.3505
SVQUrInSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, ur-IN) [?]
0.3326
SVQUrInSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, ur-IN) [?]
0.3222
SVQUrPkSalientTermSegmentation/salient_term (NDCG)
(SalientTermSegmentation, ur-PK) [?]
0.3037
SVQUrPkSalientTermSegmentation/salient_term:background_speech (NDCG)
(SalientTermSegmentation, ur-PK) [?]
0.3083
SVQUrPkSalientTermSegmentation/salient_term:clean (NDCG)
(SalientTermSegmentation, ur-PK) [?]
0.3392
SVQUrPkSalientTermSegmentation/salient_term:media_noise (NDCG)
(SalientTermSegmentation, ur-PK) [?]
0.2881
SVQUrPkSalientTermSegmentation/salient_term:traffic_noise (NDCG)
(SalientTermSegmentation, ur-PK) [?]
0.2785

transcription

Metric elevenlabs gemini gpt whisper
SVQArEgSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, ar-EG) [?]
0.2960 0.3487 0.2891 0.3431
SVQArEgSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, ar-EG) [?]
0.3230 0.3627 0.2281 0.3346
SVQArEgSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, ar-EG) [?]
0.2099 0.2800 0.2440 0.2580
SVQArEgSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, ar-EG) [?]
0.3190 0.3525 0.3004 0.3323
SVQArEgSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, ar-EG) [?]
0.3320 0.3999 0.3846 0.4481
SVQArXGulfSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, ar-x-gulf) [?]
0.2675 0.3280 0.2673 0.2849
SVQArXGulfSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, ar-x-gulf) [?]
0.3071 0.3418 0.2610 0.2865
SVQArXGulfSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, ar-x-gulf) [?]
0.1639 0.2749 0.1990 0.1988
SVQArXGulfSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, ar-x-gulf) [?]
0.2586 0.3348 0.2559 0.2854
SVQArXGulfSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, ar-x-gulf) [?]
0.3431 0.3620 0.3551 0.3713
SVQArXLevantSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, ar-x-levant) [?]
0.2681 0.3596 0.2922 0.3091
SVQArXLevantSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, ar-x-levant) [?]
0.2803 0.3572 0.3147 0.3304
SVQArXLevantSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, ar-x-levant) [?]
0.2383 0.3441 0.2558 0.2877
SVQArXLevantSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, ar-x-levant) [?]
0.2502 0.3443 0.2721 0.2828
SVQArXLevantSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, ar-x-levant) [?]
0.3087 0.3977 0.3318 0.3406
SVQArXMaghrebiSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, ar-x-maghrebi) [?]
0.5209 0.4714 0.4526 0.4797
SVQArXMaghrebiSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, ar-x-maghrebi) [?]
0.3681 0.3764 0.3466 0.3707
SVQArXMaghrebiSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, ar-x-maghrebi) [?]
0.6564 0.5319 0.5156 0.5432
SVQArXMaghrebiSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, ar-x-maghrebi) [?]
0.5384 0.5032 0.4819 0.5167
SVQArXMaghrebiSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, ar-x-maghrebi) [?]
0.5280 0.4789 0.4714 0.4936
SVQBnBdSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, bn-BD) [?]
0.0715 0.0869 0.1490 0.4289
SVQBnBdSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, bn-BD) [?]
0.0824 0.1074 0.1456 0.4018
SVQBnBdSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, bn-BD) [?]
0.0557 0.0584 0.1137 0.4038
SVQBnBdSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, bn-BD) [?]
0.0774 0.1054 0.1745 0.4723
SVQBnBdSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, bn-BD) [?]
0.0721 0.0793 0.1754 0.4567
SVQBnInSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, bn-IN) [?]
0.0925 0.0957 0.1326 0.4340
SVQBnInSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, bn-IN) [?]
0.1610 0.1653 0.1253 0.4419
SVQBnInSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, bn-IN) [?]
0.0666 0.0616 0.1178 0.3943
SVQBnInSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, bn-IN) [?]
0.0663 0.0724 0.1194 0.4489
SVQBnInSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, bn-IN) [?]
0.0746 0.0821 0.1670 0.4515
SVQEnAuSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, en-AU) [?]
0.0579 0.0914 0.0614 0.0746
SVQEnAuSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, en-AU) [?]
0.0727 0.0863 0.0608 0.0789
SVQEnAuSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, en-AU) [?]
0.0309 0.0587 0.0322 0.0465
SVQEnAuSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, en-AU) [?]
0.0549 0.0906 0.0540 0.0590
SVQEnAuSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, en-AU) [?]
0.0732 0.1304 0.1164 0.1139
SVQEnGbSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, en-GB) [?]
0.1134 0.1357 0.1062 0.1389
SVQEnGbSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, en-GB) [?]
0.1901 0.1615 0.0948 0.1688
SVQEnGbSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, en-GB) [?]
0.0541 0.0678 0.0604 0.0794
SVQEnGbSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, en-GB) [?]
0.0465 0.0864 0.0604 0.0786
SVQEnGbSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, en-GB) [?]
0.1656 0.2308 0.2138 0.2329
SVQEnInSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, en-IN) [?]
0.0644 0.0873 0.1388 0.1086
SVQEnInSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, en-IN) [?]
0.0946 0.1283 0.1885 0.1788
SVQEnInSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, en-IN) [?]
0.0606 0.0858 0.1200 0.0945
SVQEnInSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, en-IN) [?]
0.0468 0.0670 0.1065 0.0734
SVQEnInSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, en-IN) [?]
0.0559 0.0682 0.1406 0.0879
SVQEnPhSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, en-PH) [?]
0.1433 0.1138 0.0975 0.1149
SVQEnPhSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, en-PH) [?]
0.3756 0.1701 0.1254 0.1667
SVQEnPhSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, en-PH) [?]
0.0337 0.0539 0.0448 0.0491
SVQEnPhSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, en-PH) [?]
0.0525 0.0752 0.0665 0.0771
SVQEnPhSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, en-PH) [?]
0.1104 0.1556 0.1532 0.1663
SVQEnUsSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, en-US) [?]
0.1386 0.1083 0.0634 0.1212
SVQEnUsSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, en-US) [?]
0.3778 0.1795 0.0799 0.2537
SVQEnUsSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, en-US) [?]
0.0358 0.0710 0.0421 0.0533
SVQEnUsSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, en-US) [?]
0.0861 0.0942 0.0597 0.0985
SVQEnUsSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, en-US) [?]
0.0533 0.0879 0.0716 0.0784
SVQFiFiSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, fi-FI) [?]
0.5694 0.8788 0.5629 0.5096
SVQFiFiSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, fi-FI) [?]
0.8118 2.2473 0.7436 0.6787
SVQFiFiSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, fi-FI) [?]
0.3184 0.3688 0.3389 0.3225
SVQFiFiSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, fi-FI) [?]
0.6044 0.6713 0.6380 0.5469
SVQFiFiSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, fi-FI) [?]
0.5917 0.7251 0.5584 0.5216
SVQGuInSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, gu-IN) [?]
0.2404 0.1850 0.5581 0.3203
SVQGuInSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, gu-IN) [?]
0.3340 0.2035 0.6612 0.3424
SVQGuInSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, gu-IN) [?]
0.2087 0.1909 0.5358 0.2676
SVQGuInSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, gu-IN) [?]
0.2116 0.1793 0.4989 0.3249
SVQGuInSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, gu-IN) [?]
0.2074 0.1661 0.5366 0.3462
SVQHiInSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, hi-IN) [?]
0.1174 0.0934 0.1937 0.1787
SVQHiInSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, hi-IN) [?]
0.0956 0.0894 0.1762 0.1909
SVQHiInSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, hi-IN) [?]
0.0993 0.0931 0.1978 0.1682
SVQHiInSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, hi-IN) [?]
0.0646 0.0751 0.1668 0.1397
SVQHiInSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, hi-IN) [?]
0.2109 0.1161 0.2341 0.2163
SVQIdIdSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, id-ID) [?]
0.1281 0.1500 0.1630 0.1892
SVQIdIdSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, id-ID) [?]
0.2005 0.2513 0.2138 0.3250
SVQIdIdSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, id-ID) [?]
0.0971 0.1163 0.1314 0.1268
SVQIdIdSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, id-ID) [?]
0.1136 0.1306 0.1507 0.1534
SVQIdIdSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, id-ID) [?]
0.1240 0.1320 0.1741 0.1957
SVQJaJpSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, ja-JP) [?]
0.6229 1.8754 0.5664 0.6276
SVQJaJpSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, ja-JP) [?]
0.6656 2.0375 0.5419 0.6501
SVQJaJpSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, ja-JP) [?]
0.4167 1.6633 0.4867 0.5667
SVQJaJpSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, ja-JP) [?]
0.4557 2.0067 0.5377 0.5854
SVQJaJpSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, ja-JP) [?]
0.9524 1.7931 0.6991 0.7080
SVQKnInSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, kn-IN) [?]
0.1809 0.1769 0.3033 0.3383
SVQKnInSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, kn-IN) [?]
0.2089 0.1876 0.3264 0.3349
SVQKnInSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, kn-IN) [?]
0.1519 0.1572 0.2380 0.2984
SVQKnInSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, kn-IN) [?]
0.1467 0.1417 0.2305 0.2988
SVQKnInSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, kn-IN) [?]
0.2087 0.2132 0.4022 0.4126
SVQKoKrSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, ko-KR) [?]
0.2179 0.2286 0.1833 0.2122
SVQKoKrSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, ko-KR) [?]
0.4095 0.3077 0.2091 0.2429
SVQKoKrSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, ko-KR) [?]
0.1406 0.1870 0.1617 0.1865
SVQKoKrSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, ko-KR) [?]
0.1569 0.1860 0.1677 0.1934
SVQKoKrSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, ko-KR) [?]
0.1654 0.2343 0.1950 0.2265
SVQMlInSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, ml-IN) [?]
0.1137 0.1135 0.3046 1.1018
SVQMlInSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, ml-IN) [?]
0.0964 0.0868 0.2671 1.1013
SVQMlInSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, ml-IN) [?]
0.1262 0.1146 0.3275 1.0716
SVQMlInSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, ml-IN) [?]
0.1058 0.0989 0.2585 1.0902
SVQMlInSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, ml-IN) [?]
0.1232 0.1484 0.3531 1.1419
SVQMrInSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, mr-IN) [?]
0.1003 0.1182 0.1610 0.3307
SVQMrInSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, mr-IN) [?]
0.1069 0.1434 0.1743 0.3368
SVQMrInSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, mr-IN) [?]
0.0816 0.1021 0.1066 0.3113
SVQMrInSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, mr-IN) [?]
0.0910 0.1105 0.1380 0.3087
SVQMrInSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, mr-IN) [?]
0.1217 0.1167 0.2484 0.3663
SVQRuRuSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, ru-RU) [?]
0.1696 0.1490 0.1140 0.1471
SVQRuRuSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, ru-RU) [?]
0.1605 0.1859 0.0928 0.1400
SVQRuRuSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, ru-RU) [?]
0.0937 0.0775 0.1112 0.0938
SVQRuRuSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, ru-RU) [?]
0.2435 0.1741 0.0947 0.1741
SVQRuRuSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, ru-RU) [?]
0.1805 0.1583 0.1577 0.1804
SVQSwSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, sw) [?]
0.2764 0.1891 0.3557 0.5410
SVQSwSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, sw) [?]
0.1773 0.2276 0.2924 0.4878
SVQSwSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, sw) [?]
0.1119 0.1197 0.2524 0.4403
SVQSwSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, sw) [?]
0.4199 0.2032 0.4351 0.5597
SVQSwSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, sw) [?]
0.3964 0.2059 0.4426 0.6758
SVQTaInSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, ta-IN) [?]
0.1426 0.1678 0.2437 0.2212
SVQTaInSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, ta-IN) [?]
0.1149 0.1495 0.2177 0.2059
SVQTaInSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, ta-IN) [?]
0.1022 0.1413 0.1913 0.1642
SVQTaInSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, ta-IN) [?]
0.2120 0.1786 0.2537 0.2294
SVQTaInSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, ta-IN) [?]
0.1367 0.2012 0.3133 0.2889
SVQTeInSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, te-IN) [?]
0.2321 0.1926 0.3859 0.4129
SVQTeInSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, te-IN) [?]
0.2451 0.2136 0.4015 0.4199
SVQTeInSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, te-IN) [?]
0.1814 0.1215 0.2653 0.3215
SVQTeInSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, te-IN) [?]
0.1793 0.1586 0.3280 0.3640
SVQTeInSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, te-IN) [?]
0.3200 0.2749 0.5464 0.5443
SVQUrInSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, ur-IN) [?]
1.2384 1.2862 1.2591 0.2763
SVQUrInSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, ur-IN) [?]
1.3791 1.2664 0.9796 0.2920
SVQUrInSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, ur-IN) [?]
1.2521 1.1984 1.5011 0.2706
SVQUrInSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, ur-IN) [?]
1.3764 1.4212 1.4886 0.2771
SVQUrInSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, ur-IN) [?]
0.9499 1.2589 1.0616 0.2660
SVQUrPkSpeechTranscription/speech_transcription (WER)
(SpeechTranscription, ur-PK) [?]
1.0966 1.2075 1.0028 0.3152
SVQUrPkSpeechTranscription/speech_transcription:background_speech (WER)
(SpeechTranscription, ur-PK) [?]
1.4406 1.2340 1.0230 0.2926
SVQUrPkSpeechTranscription/speech_transcription:clean (WER)
(SpeechTranscription, ur-PK) [?]
0.7422 1.0672 0.8312 0.2582
SVQUrPkSpeechTranscription/speech_transcription:media_noise (WER)
(SpeechTranscription, ur-PK) [?]
1.2218 1.2457 1.0697 0.3264
SVQUrPkSpeechTranscription/speech_transcription:traffic_noise (WER)
(SpeechTranscription, ur-PK) [?]
0.9775 1.2846 1.0896 0.3854

Task Definitions

Classification

Classification tasks involve assigning one or more predefined labels to an audio segment. These tasks evaluate the model's ability to extract specific features or categories from sound representations, such as intent in speech or species in bioacoustic recordings.

BirdSet Classification

Evaluates the model's ability to identify bird species in soundscapes.

  • Task: Multi-label Classification.
  • Input: 5-second audio segments.
  • Labels: Predict all bird species present in the segment based on eBird codes.

Tasks (See BIRDSET):

  • BirdsetHSNClassification/ebird_classification
  • BirdsetNBPClassification/ebird_classification
  • BirdsetPOWClassification/ebird_classification

FSD50K Classification

Evaluates the model's ability to classify general sound events.

  • Task: Multi-label Classification across 200 diverse categories.
  • Evaluation: Mean Average Precision (mAP) is the primary metric.

Tasks (See FSD50K):

  • FSD50KTestClassification/classification

Speech MASSIVE Classification

Evaluates the model's ability to categorize speech into intents.

  • Task: Intent Classification.
  • Labels: 60 predefined intents (e.g., datetime_query, play_music).
  • Goal: Correctlly identify the user's intent from the audio utterance.

Tasks (See SPEECH_MASSIVE):

  • SpeechMassiveArSaIntentClassification/intent_classification
  • SpeechMassiveDeDeIntentClassification/intent_classification
  • SpeechMassiveEsEsIntentClassification/intent_classification
  • SpeechMassiveFrFrIntentClassification/intent_classification
  • SpeechMassiveHuHuIntentClassification/intent_classification
  • SpeechMassiveKoKrIntentClassification/intent_classification
  • SpeechMassiveNlNlIntentClassification/intent_classification
  • SpeechMassivePlPlIntentClassification/intent_classification
  • SpeechMassivePtPtIntentClassification/intent_classification
  • SpeechMassiveRuRuIntentClassification/intent_classification
  • SpeechMassiveTrTrIntentClassification/intent_classification
  • SpeechMassiveViVnIntentClassification/intent_classification

[Back to top]


Clustering

Clustering tasks evaluate how well sound embeddings group together based on semantic or acoustic similarity without explicit labels during the grouping process. This tests the inherent structure and separability of the embedding space.

BirdSet Clustering

Evaluates the semantic grouping of bird vocalizations in the embedding space.

  • Goal: Group bird calls from the same species together without explicit supervised labels.
  • Evaluation: Primarily uses V-Measure to compare predicted clusters against ground-truth species labels.

Tasks (See BIRDSET):

  • BirdsetClusteringHSN/clustering
  • BirdsetClusteringNBP/clustering
  • BirdsetClusteringPOW/clustering

FSD50K Clustering

Evaluates how well embeddings group by sound event categories.

  • Goal: Cluster audio clips such that clips belonging to the same AudioSet class are grouped together.

Tasks (See FSD50K):

  • FSD50KTestClustering/sound_event

SVQ Clustering

Evaluates how well embeddings group by speaker-related attributes.

  • Attributes: The dataset provides labels for speaker_id, speaker_gender, and speaker_age.
  • Goal: Group audio segments based on these metadata labels without using the labels during embedding generation.

Tasks (See SVQ):

  • SVQClusteringArEg/speaker_age
  • SVQClusteringArEg/speaker_gender
  • SVQClusteringArEg/speaker_id
  • SVQClusteringArXGulf/speaker_age
  • SVQClusteringArXGulf/speaker_gender
  • SVQClusteringArXGulf/speaker_id
  • SVQClusteringArXLevant/speaker_age
  • SVQClusteringArXLevant/speaker_gender
  • SVQClusteringArXLevant/speaker_id
  • SVQClusteringArXMaghrebi/speaker_age
  • SVQClusteringArXMaghrebi/speaker_gender
  • SVQClusteringArXMaghrebi/speaker_id
  • SVQClusteringBnBd/speaker_age
  • SVQClusteringBnBd/speaker_gender
  • SVQClusteringBnBd/speaker_id
  • SVQClusteringBnIn/speaker_age
  • SVQClusteringBnIn/speaker_gender
  • SVQClusteringBnIn/speaker_id
  • SVQClusteringEnAu/speaker_age
  • SVQClusteringEnAu/speaker_gender
  • SVQClusteringEnAu/speaker_id
  • SVQClusteringEnGb/speaker_age
  • SVQClusteringEnGb/speaker_gender
  • SVQClusteringEnGb/speaker_id
  • SVQClusteringEnIn/speaker_age
  • SVQClusteringEnIn/speaker_gender
  • SVQClusteringEnIn/speaker_id
  • SVQClusteringEnPh/speaker_age
  • SVQClusteringEnPh/speaker_gender
  • SVQClusteringEnPh/speaker_id
  • SVQClusteringEnUs/speaker_age
  • SVQClusteringEnUs/speaker_gender
  • SVQClusteringEnUs/speaker_id
  • SVQClusteringFiFi/speaker_age
  • SVQClusteringFiFi/speaker_gender
  • SVQClusteringFiFi/speaker_id
  • SVQClusteringGuIn/speaker_age
  • SVQClusteringGuIn/speaker_gender
  • SVQClusteringGuIn/speaker_id
  • SVQClusteringHiIn/speaker_age
  • SVQClusteringHiIn/speaker_gender
  • SVQClusteringHiIn/speaker_id
  • SVQClusteringIdId/speaker_age
  • SVQClusteringIdId/speaker_gender
  • SVQClusteringIdId/speaker_id
  • SVQClusteringJaJp/speaker_age
  • SVQClusteringJaJp/speaker_gender
  • SVQClusteringJaJp/speaker_id
  • SVQClusteringKnIn/speaker_age
  • SVQClusteringKnIn/speaker_gender
  • SVQClusteringKnIn/speaker_id
  • SVQClusteringKoKr/speaker_age
  • SVQClusteringKoKr/speaker_gender
  • SVQClusteringKoKr/speaker_id
  • SVQClusteringMlIn/speaker_age
  • SVQClusteringMlIn/speaker_gender
  • SVQClusteringMlIn/speaker_id
  • SVQClusteringMrIn/speaker_age
  • SVQClusteringMrIn/speaker_gender
  • SVQClusteringMrIn/speaker_id
  • SVQClusteringRuRu/speaker_age
  • SVQClusteringRuRu/speaker_gender
  • SVQClusteringRuRu/speaker_id
  • SVQClusteringSw/speaker_age
  • SVQClusteringSw/speaker_gender
  • SVQClusteringSw/speaker_id
  • SVQClusteringTaIn/speaker_age
  • SVQClusteringTaIn/speaker_gender
  • SVQClusteringTaIn/speaker_id
  • SVQClusteringTeIn/speaker_age
  • SVQClusteringTeIn/speaker_gender
  • SVQClusteringTeIn/speaker_id
  • SVQClusteringUrIn/speaker_age
  • SVQClusteringUrIn/speaker_gender
  • SVQClusteringUrIn/speaker_id
  • SVQClusteringUrPk/speaker_age
  • SVQClusteringUrPk/speaker_gender
  • SVQClusteringUrPk/speaker_id

[Back to top]


Reasoning

Reasoning tasks (often implemented as Span Retrieval) require the model to identify specific segments within a text document that directly answer a voice query. This tests deeper semantic understanding and the ability to align fine-grained concepts between speech and text.

SVQ Reasoning (Span Retrieval)

The reasoning task requires identifying the exact span of text within a Wikipedia article that answers a voice query.

  • Task Format: Given an audio query and a target document, the model must predict the start and end offsets of the answer span.
  • In-Lang Reasoning: Query and document share the same language.
  • Cross-Lang Reasoning: Query is in a non-English language; the document and target answer span are in English.

Tasks (See SVQ):

  • SVQArEgSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQArEgSpanInLangReasoning/span_reasoning_in_lang
  • SVQArXGulfSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQArXGulfSpanInLangReasoning/span_reasoning_in_lang
  • SVQArXLevantSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQArXLevantSpanInLangReasoning/span_reasoning_in_lang
  • SVQArXMaghrebiSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQArXMaghrebiSpanInLangReasoning/span_reasoning_in_lang
  • SVQBnBdSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQBnBdSpanInLangReasoning/span_reasoning_in_lang
  • SVQBnInSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQBnInSpanInLangReasoning/span_reasoning_in_lang
  • SVQEnAuSpanInLangReasoning/span_reasoning_in_lang
  • SVQEnGbSpanInLangReasoning/span_reasoning_in_lang
  • SVQEnInSpanInLangReasoning/span_reasoning_in_lang
  • SVQEnPhSpanInLangReasoning/span_reasoning_in_lang
  • SVQEnUsSpanInLangReasoning/span_reasoning_in_lang
  • SVQFiFiSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQFiFiSpanInLangReasoning/span_reasoning_in_lang
  • SVQGuInSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQHiInSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQIdIdSpanInLangReasoning/span_reasoning_in_lang
  • SVQJaJpSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQKnInSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQKoKrSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQKoKrSpanInLangReasoning/span_reasoning_in_lang
  • SVQMlInSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQMrInSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQRuRuSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQRuRuSpanInLangReasoning/span_reasoning_in_lang
  • SVQSwSpanInLangReasoning/span_reasoning_in_lang
  • SVQTaInSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQTeInSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQTeInSpanInLangReasoning/span_reasoning_in_lang
  • SVQUrInSpanCrossLangReasoning/span_reasoning_cross_lang
  • SVQUrPkSpanCrossLangReasoning/span_reasoning_cross_lang

[Back to top]


Reranking

Given a set of candidate answers, reranking tasks evaluate a model's ability to re-order them such that the most relevant results appear at the top. This is often used to refine the output of a primary retrieval system.

SVQ Reranking

The reranking task assesses a model's ability to refine a list of candidate answers.

  • Input: A voice query and a set of candidate text answers (e.g., top-K results from a first-stage retrieval system).
  • Goal: Re-order the candidates so that the ground-truth answer appears at rank 1.

Tasks (See SVQ):

  • SVQArEgQueryReranking/query_reranking
  • SVQArEgQueryReranking/query_reranking:background_speech
  • SVQArEgQueryReranking/query_reranking:clean
  • SVQArEgQueryReranking/query_reranking:media_noise
  • SVQArEgQueryReranking/query_reranking:traffic_noise
  • SVQArXGulfQueryReranking/query_reranking
  • SVQArXGulfQueryReranking/query_reranking:background_speech
  • SVQArXGulfQueryReranking/query_reranking:clean
  • SVQArXGulfQueryReranking/query_reranking:media_noise
  • SVQArXGulfQueryReranking/query_reranking:traffic_noise
  • SVQArXLevantQueryReranking/query_reranking
  • SVQArXLevantQueryReranking/query_reranking:background_speech
  • SVQArXLevantQueryReranking/query_reranking:clean
  • SVQArXLevantQueryReranking/query_reranking:media_noise
  • SVQArXLevantQueryReranking/query_reranking:traffic_noise
  • SVQArXMaghrebiQueryReranking/query_reranking
  • SVQArXMaghrebiQueryReranking/query_reranking:background_speech
  • SVQArXMaghrebiQueryReranking/query_reranking:clean
  • SVQArXMaghrebiQueryReranking/query_reranking:media_noise
  • SVQArXMaghrebiQueryReranking/query_reranking:traffic_noise
  • SVQBnBdQueryReranking/query_reranking
  • SVQBnBdQueryReranking/query_reranking:background_speech
  • SVQBnBdQueryReranking/query_reranking:clean
  • SVQBnBdQueryReranking/query_reranking:media_noise
  • SVQBnBdQueryReranking/query_reranking:traffic_noise
  • SVQBnInQueryReranking/query_reranking
  • SVQBnInQueryReranking/query_reranking:background_speech
  • SVQBnInQueryReranking/query_reranking:clean
  • SVQBnInQueryReranking/query_reranking:media_noise
  • SVQBnInQueryReranking/query_reranking:traffic_noise
  • SVQEnAuQueryReranking/query_reranking
  • SVQEnAuQueryReranking/query_reranking:background_speech
  • SVQEnAuQueryReranking/query_reranking:clean
  • SVQEnAuQueryReranking/query_reranking:media_noise
  • SVQEnAuQueryReranking/query_reranking:traffic_noise
  • SVQEnGbQueryReranking/query_reranking
  • SVQEnGbQueryReranking/query_reranking:background_speech
  • SVQEnGbQueryReranking/query_reranking:clean
  • SVQEnGbQueryReranking/query_reranking:media_noise
  • SVQEnGbQueryReranking/query_reranking:traffic_noise
  • SVQEnInQueryReranking/query_reranking
  • SVQEnInQueryReranking/query_reranking:background_speech
  • SVQEnInQueryReranking/query_reranking:clean
  • SVQEnInQueryReranking/query_reranking:media_noise
  • SVQEnInQueryReranking/query_reranking:traffic_noise
  • SVQEnPhQueryReranking/query_reranking
  • SVQEnPhQueryReranking/query_reranking:background_speech
  • SVQEnPhQueryReranking/query_reranking:clean
  • SVQEnPhQueryReranking/query_reranking:media_noise
  • SVQEnPhQueryReranking/query_reranking:traffic_noise
  • SVQEnUsQueryReranking/query_reranking
  • SVQEnUsQueryReranking/query_reranking:background_speech
  • SVQEnUsQueryReranking/query_reranking:clean
  • SVQEnUsQueryReranking/query_reranking:media_noise
  • SVQEnUsQueryReranking/query_reranking:traffic_noise
  • SVQFiFiQueryReranking/query_reranking
  • SVQFiFiQueryReranking/query_reranking:background_speech
  • SVQFiFiQueryReranking/query_reranking:clean
  • SVQFiFiQueryReranking/query_reranking:media_noise
  • SVQFiFiQueryReranking/query_reranking:traffic_noise
  • SVQGuInQueryReranking/query_reranking
  • SVQGuInQueryReranking/query_reranking:background_speech
  • SVQGuInQueryReranking/query_reranking:clean
  • SVQGuInQueryReranking/query_reranking:media_noise
  • SVQGuInQueryReranking/query_reranking:traffic_noise
  • SVQHiInQueryReranking/query_reranking
  • SVQHiInQueryReranking/query_reranking:background_speech
  • SVQHiInQueryReranking/query_reranking:clean
  • SVQHiInQueryReranking/query_reranking:media_noise
  • SVQHiInQueryReranking/query_reranking:traffic_noise
  • SVQIdIdQueryReranking/query_reranking
  • SVQIdIdQueryReranking/query_reranking:background_speech
  • SVQIdIdQueryReranking/query_reranking:clean
  • SVQIdIdQueryReranking/query_reranking:media_noise
  • SVQIdIdQueryReranking/query_reranking:traffic_noise
  • SVQJaJpQueryReranking/query_reranking
  • SVQJaJpQueryReranking/query_reranking:background_speech
  • SVQJaJpQueryReranking/query_reranking:clean
  • SVQJaJpQueryReranking/query_reranking:media_noise
  • SVQJaJpQueryReranking/query_reranking:traffic_noise
  • SVQKnInQueryReranking/query_reranking
  • SVQKnInQueryReranking/query_reranking:background_speech
  • SVQKnInQueryReranking/query_reranking:clean
  • SVQKnInQueryReranking/query_reranking:media_noise
  • SVQKnInQueryReranking/query_reranking:traffic_noise
  • SVQKoKrQueryReranking/query_reranking
  • SVQKoKrQueryReranking/query_reranking:background_speech
  • SVQKoKrQueryReranking/query_reranking:clean
  • SVQKoKrQueryReranking/query_reranking:media_noise
  • SVQKoKrQueryReranking/query_reranking:traffic_noise
  • SVQMlInQueryReranking/query_reranking
  • SVQMlInQueryReranking/query_reranking:background_speech
  • SVQMlInQueryReranking/query_reranking:clean
  • SVQMlInQueryReranking/query_reranking:media_noise
  • SVQMlInQueryReranking/query_reranking:traffic_noise
  • SVQMrInQueryReranking/query_reranking
  • SVQMrInQueryReranking/query_reranking:background_speech
  • SVQMrInQueryReranking/query_reranking:clean
  • SVQMrInQueryReranking/query_reranking:media_noise
  • SVQMrInQueryReranking/query_reranking:traffic_noise
  • SVQRuRuQueryReranking/query_reranking
  • SVQRuRuQueryReranking/query_reranking:background_speech
  • SVQRuRuQueryReranking/query_reranking:clean
  • SVQRuRuQueryReranking/query_reranking:media_noise
  • SVQRuRuQueryReranking/query_reranking:traffic_noise
  • SVQSwQueryReranking/query_reranking
  • SVQSwQueryReranking/query_reranking:background_speech
  • SVQSwQueryReranking/query_reranking:clean
  • SVQSwQueryReranking/query_reranking:media_noise
  • SVQSwQueryReranking/query_reranking:traffic_noise
  • SVQTaInQueryReranking/query_reranking
  • SVQTaInQueryReranking/query_reranking:background_speech
  • SVQTaInQueryReranking/query_reranking:clean
  • SVQTaInQueryReranking/query_reranking:media_noise
  • SVQTaInQueryReranking/query_reranking:traffic_noise
  • SVQTeInQueryReranking/query_reranking
  • SVQTeInQueryReranking/query_reranking:background_speech
  • SVQTeInQueryReranking/query_reranking:clean
  • SVQTeInQueryReranking/query_reranking:media_noise
  • SVQTeInQueryReranking/query_reranking:traffic_noise
  • SVQUrInQueryReranking/query_reranking
  • SVQUrInQueryReranking/query_reranking:background_speech
  • SVQUrInQueryReranking/query_reranking:clean
  • SVQUrInQueryReranking/query_reranking:media_noise
  • SVQUrInQueryReranking/query_reranking:traffic_noise
  • SVQUrPkQueryReranking/query_reranking
  • SVQUrPkQueryReranking/query_reranking:background_speech
  • SVQUrPkQueryReranking/query_reranking:clean
  • SVQUrPkQueryReranking/query_reranking:media_noise
  • SVQUrPkQueryReranking/query_reranking:traffic_noise

[Back to top]


Retrieval

Retrieval tasks evaluate a model's ability to find relevant documents or passages from a large corpus given a voice query. This involves mapping speech and text into a shared embedding space where semantic similarity can be measured, often across different languages.

SVQ Retrieval

The SVQ retrieval task evaluates the model's ability to find relevant Wikipedia content given a voice query.

  • In-Lang Retrieval: The voice query and the Wikipedia content are in the same language. This tests native-language semantic alignment.
  • Cross-Lang Retrieval: The voice query is in a non-English language, while the Wikipedia content is in English. This evaluates the model's ability to map cross-lingual concepts into a shared embedding space.
  • Data: Both document-level and passage-level retrieval variants are supported.

Tasks (See SVQ):

  • SVQArEgDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQArEgDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQArEgDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQArEgDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQArEgDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQArEgDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQArEgDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQArEgDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQArEgDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQArEgDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQArEgPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQArEgPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQArEgPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQArEgPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQArEgPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQArEgPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQArEgPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQArEgPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQArEgPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQArEgPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQArXGulfDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQArXGulfDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQArXGulfDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQArXGulfDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQArXGulfDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQArXGulfDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQArXGulfDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQArXGulfDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQArXGulfDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQArXGulfDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQArXGulfPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQArXGulfPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQArXGulfPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQArXGulfPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQArXGulfPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQArXGulfPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQArXGulfPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQArXGulfPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQArXGulfPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQArXGulfPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQArXLevantDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQArXLevantDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQArXLevantDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQArXLevantDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQArXLevantDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQArXLevantDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQArXLevantDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQArXLevantDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQArXLevantDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQArXLevantDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQArXLevantPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQArXLevantPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQArXLevantPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQArXLevantPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQArXLevantPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQArXLevantPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQArXLevantPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQArXLevantPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQArXLevantPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQArXLevantPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQArXMaghrebiDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQArXMaghrebiDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQArXMaghrebiDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQArXMaghrebiDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQArXMaghrebiDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQArXMaghrebiDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQArXMaghrebiDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQArXMaghrebiDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQArXMaghrebiDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQArXMaghrebiDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQArXMaghrebiPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQArXMaghrebiPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQArXMaghrebiPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQArXMaghrebiPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQArXMaghrebiPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQArXMaghrebiPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQArXMaghrebiPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQArXMaghrebiPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQArXMaghrebiPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQArXMaghrebiPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQBnBdDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQBnBdDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQBnBdDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQBnBdDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQBnBdDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQBnBdDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQBnBdDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQBnBdDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQBnBdDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQBnBdDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQBnBdPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQBnBdPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQBnBdPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQBnBdPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQBnBdPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQBnBdPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQBnBdPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQBnBdPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQBnBdPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQBnBdPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQBnInDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQBnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQBnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQBnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQBnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQBnInDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQBnInDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQBnInDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQBnInDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQBnInDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQBnInPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQBnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQBnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQBnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQBnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQBnInPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQBnInPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQBnInPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQBnInPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQBnInPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQEnAuDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQEnAuDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQEnAuDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQEnAuDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQEnAuDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQEnAuPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQEnAuPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQEnAuPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQEnAuPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQEnAuPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQEnGbDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQEnGbDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQEnGbDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQEnGbDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQEnGbDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQEnGbPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQEnGbPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQEnGbPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQEnGbPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQEnGbPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQEnInDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQEnInDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQEnInDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQEnInDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQEnInDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQEnInPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQEnInPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQEnInPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQEnInPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQEnInPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQEnPhDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQEnPhDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQEnPhDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQEnPhDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQEnPhDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQEnPhPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQEnPhPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQEnPhPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQEnPhPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQEnPhPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQEnUsDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQEnUsDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQEnUsDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQEnUsDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQEnUsDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQEnUsPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQEnUsPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQEnUsPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQEnUsPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQEnUsPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQFiFiDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQFiFiDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQFiFiDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQFiFiDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQFiFiDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQFiFiDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQFiFiDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQFiFiDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQFiFiDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQFiFiDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQFiFiPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQFiFiPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQFiFiPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQFiFiPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQFiFiPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQFiFiPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQFiFiPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQFiFiPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQFiFiPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQFiFiPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQGuInDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQGuInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQGuInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQGuInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQGuInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQGuInPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQGuInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQGuInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQGuInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQGuInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQHiInDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQHiInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQHiInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQHiInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQHiInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQHiInPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQHiInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQHiInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQHiInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQHiInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQIdIdDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQIdIdDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQIdIdDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQIdIdDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQIdIdDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQIdIdPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQIdIdPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQIdIdPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQIdIdPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQIdIdPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQJaJpDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQJaJpDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQJaJpDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQJaJpDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQJaJpDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQJaJpPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQJaJpPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQJaJpPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQJaJpPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQJaJpPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQKnInDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQKnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQKnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQKnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQKnInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQKnInPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQKnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQKnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQKnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQKnInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQKoKrDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQKoKrDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQKoKrDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQKoKrDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQKoKrDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQKoKrDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQKoKrDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQKoKrDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQKoKrDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQKoKrDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQKoKrPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQKoKrPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQKoKrPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQKoKrPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQKoKrPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQKoKrPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQKoKrPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQKoKrPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQKoKrPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQKoKrPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQMlInDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQMlInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQMlInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQMlInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQMlInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQMlInPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQMlInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQMlInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQMlInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQMlInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQMrInDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQMrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQMrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQMrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQMrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQMrInPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQMrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQMrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQMrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQMrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQRuRuDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQRuRuDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQRuRuDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQRuRuDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQRuRuDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQRuRuDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQRuRuDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQRuRuDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQRuRuDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQRuRuDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQRuRuPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQRuRuPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQRuRuPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQRuRuPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQRuRuPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQRuRuPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQRuRuPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQRuRuPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQRuRuPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQRuRuPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQSwDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQSwDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQSwDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQSwDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQSwDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQSwPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQSwPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQSwPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQSwPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQSwPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQTaInDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQTaInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQTaInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQTaInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQTaInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQTaInPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQTaInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQTaInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQTaInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQTaInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQTeInDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQTeInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQTeInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQTeInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQTeInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQTeInDocumentInLangRetrieval/document_retrieval_in_lang
  • SVQTeInDocumentInLangRetrieval/document_retrieval_in_lang:background_speech
  • SVQTeInDocumentInLangRetrieval/document_retrieval_in_lang:clean
  • SVQTeInDocumentInLangRetrieval/document_retrieval_in_lang:media_noise
  • SVQTeInDocumentInLangRetrieval/document_retrieval_in_lang:traffic_noise
  • SVQTeInPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQTeInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQTeInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQTeInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQTeInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQTeInPassageInLangRetrieval/passage_retrieval_in_lang
  • SVQTeInPassageInLangRetrieval/passage_retrieval_in_lang:background_speech
  • SVQTeInPassageInLangRetrieval/passage_retrieval_in_lang:clean
  • SVQTeInPassageInLangRetrieval/passage_retrieval_in_lang:media_noise
  • SVQTeInPassageInLangRetrieval/passage_retrieval_in_lang:traffic_noise
  • SVQUrInDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQUrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQUrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQUrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQUrInDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQUrInPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQUrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQUrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQUrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQUrInPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise
  • SVQUrPkDocumentCrossLangRetrieval/document_retrieval_cross_lang
  • SVQUrPkDocumentCrossLangRetrieval/document_retrieval_cross_lang:background_speech
  • SVQUrPkDocumentCrossLangRetrieval/document_retrieval_cross_lang:clean
  • SVQUrPkDocumentCrossLangRetrieval/document_retrieval_cross_lang:media_noise
  • SVQUrPkDocumentCrossLangRetrieval/document_retrieval_cross_lang:traffic_noise
  • SVQUrPkPassageCrossLangRetrieval/passage_retrieval_cross_lang
  • SVQUrPkPassageCrossLangRetrieval/passage_retrieval_cross_lang:background_speech
  • SVQUrPkPassageCrossLangRetrieval/passage_retrieval_cross_lang:clean
  • SVQUrPkPassageCrossLangRetrieval/passage_retrieval_cross_lang:media_noise
  • SVQUrPkPassageCrossLangRetrieval/passage_retrieval_cross_lang:traffic_noise

[Back to top]


Segmentation

Segmentation tasks involve identifying and boundary-marking specific parts of an audio stream, such as salient terms or keywords. This evaluates the model's temporal precision and its ability to distinguish meaningful units of sound.

SVQ Segmentation

This task focuses on identifying salient terms within a continuous audio stream.

  • Target: The model must identify the timestamps for "salient terms" or keywords in the voice query.
  • Evaluation: Precision and recall of the predicted time-boundaries compared to human-labeled keywords.

Tasks (See SVQ):

  • SVQArEgSalientTermSegmentation/salient_term
  • SVQArEgSalientTermSegmentation/salient_term:background_speech
  • SVQArEgSalientTermSegmentation/salient_term:clean
  • SVQArEgSalientTermSegmentation/salient_term:media_noise
  • SVQArEgSalientTermSegmentation/salient_term:traffic_noise
  • SVQArXGulfSalientTermSegmentation/salient_term
  • SVQArXGulfSalientTermSegmentation/salient_term:background_speech
  • SVQArXGulfSalientTermSegmentation/salient_term:clean
  • SVQArXGulfSalientTermSegmentation/salient_term:media_noise
  • SVQArXGulfSalientTermSegmentation/salient_term:traffic_noise
  • SVQArXLevantSalientTermSegmentation/salient_term
  • SVQArXLevantSalientTermSegmentation/salient_term:background_speech
  • SVQArXLevantSalientTermSegmentation/salient_term:clean
  • SVQArXLevantSalientTermSegmentation/salient_term:media_noise
  • SVQArXLevantSalientTermSegmentation/salient_term:traffic_noise
  • SVQArXMaghrebiSalientTermSegmentation/salient_term
  • SVQArXMaghrebiSalientTermSegmentation/salient_term:background_speech
  • SVQArXMaghrebiSalientTermSegmentation/salient_term:clean
  • SVQArXMaghrebiSalientTermSegmentation/salient_term:media_noise
  • SVQArXMaghrebiSalientTermSegmentation/salient_term:traffic_noise
  • SVQBnBdSalientTermSegmentation/salient_term
  • SVQBnBdSalientTermSegmentation/salient_term:background_speech
  • SVQBnBdSalientTermSegmentation/salient_term:clean
  • SVQBnBdSalientTermSegmentation/salient_term:media_noise
  • SVQBnBdSalientTermSegmentation/salient_term:traffic_noise
  • SVQBnInSalientTermSegmentation/salient_term
  • SVQBnInSalientTermSegmentation/salient_term:background_speech
  • SVQBnInSalientTermSegmentation/salient_term:clean
  • SVQBnInSalientTermSegmentation/salient_term:media_noise
  • SVQBnInSalientTermSegmentation/salient_term:traffic_noise
  • SVQEnAuSalientTermSegmentation/salient_term
  • SVQEnAuSalientTermSegmentation/salient_term:background_speech
  • SVQEnAuSalientTermSegmentation/salient_term:clean
  • SVQEnAuSalientTermSegmentation/salient_term:media_noise
  • SVQEnAuSalientTermSegmentation/salient_term:traffic_noise
  • SVQEnGbSalientTermSegmentation/salient_term
  • SVQEnGbSalientTermSegmentation/salient_term:background_speech
  • SVQEnGbSalientTermSegmentation/salient_term:clean
  • SVQEnGbSalientTermSegmentation/salient_term:media_noise
  • SVQEnGbSalientTermSegmentation/salient_term:traffic_noise
  • SVQEnInSalientTermSegmentation/salient_term
  • SVQEnInSalientTermSegmentation/salient_term:background_speech
  • SVQEnInSalientTermSegmentation/salient_term:clean
  • SVQEnInSalientTermSegmentation/salient_term:media_noise
  • SVQEnInSalientTermSegmentation/salient_term:traffic_noise
  • SVQEnPhSalientTermSegmentation/salient_term
  • SVQEnPhSalientTermSegmentation/salient_term:background_speech
  • SVQEnPhSalientTermSegmentation/salient_term:clean
  • SVQEnPhSalientTermSegmentation/salient_term:media_noise
  • SVQEnPhSalientTermSegmentation/salient_term:traffic_noise
  • SVQEnUsSalientTermSegmentation/salient_term
  • SVQEnUsSalientTermSegmentation/salient_term:background_speech
  • SVQEnUsSalientTermSegmentation/salient_term:clean
  • SVQEnUsSalientTermSegmentation/salient_term:media_noise
  • SVQEnUsSalientTermSegmentation/salient_term:traffic_noise
  • SVQFiFiSalientTermSegmentation/salient_term
  • SVQFiFiSalientTermSegmentation/salient_term:background_speech
  • SVQFiFiSalientTermSegmentation/salient_term:clean
  • SVQFiFiSalientTermSegmentation/salient_term:media_noise
  • SVQFiFiSalientTermSegmentation/salient_term:traffic_noise
  • SVQGuInSalientTermSegmentation/salient_term
  • SVQGuInSalientTermSegmentation/salient_term:background_speech
  • SVQGuInSalientTermSegmentation/salient_term:clean
  • SVQGuInSalientTermSegmentation/salient_term:media_noise
  • SVQGuInSalientTermSegmentation/salient_term:traffic_noise
  • SVQHiInSalientTermSegmentation/salient_term
  • SVQHiInSalientTermSegmentation/salient_term:background_speech
  • SVQHiInSalientTermSegmentation/salient_term:clean
  • SVQHiInSalientTermSegmentation/salient_term:media_noise
  • SVQHiInSalientTermSegmentation/salient_term:traffic_noise
  • SVQIdIdSalientTermSegmentation/salient_term
  • SVQIdIdSalientTermSegmentation/salient_term:background_speech
  • SVQIdIdSalientTermSegmentation/salient_term:clean
  • SVQIdIdSalientTermSegmentation/salient_term:media_noise
  • SVQIdIdSalientTermSegmentation/salient_term:traffic_noise
  • SVQJaJpSalientTermSegmentation/salient_term
  • SVQJaJpSalientTermSegmentation/salient_term:background_speech
  • SVQJaJpSalientTermSegmentation/salient_term:clean
  • SVQJaJpSalientTermSegmentation/salient_term:media_noise
  • SVQJaJpSalientTermSegmentation/salient_term:traffic_noise
  • SVQKnInSalientTermSegmentation/salient_term
  • SVQKnInSalientTermSegmentation/salient_term:background_speech
  • SVQKnInSalientTermSegmentation/salient_term:clean
  • SVQKnInSalientTermSegmentation/salient_term:media_noise
  • SVQKnInSalientTermSegmentation/salient_term:traffic_noise
  • SVQKoKrSalientTermSegmentation/salient_term
  • SVQKoKrSalientTermSegmentation/salient_term:background_speech
  • SVQKoKrSalientTermSegmentation/salient_term:clean
  • SVQKoKrSalientTermSegmentation/salient_term:media_noise
  • SVQKoKrSalientTermSegmentation/salient_term:traffic_noise
  • SVQMlInSalientTermSegmentation/salient_term
  • SVQMlInSalientTermSegmentation/salient_term:background_speech
  • SVQMlInSalientTermSegmentation/salient_term:clean
  • SVQMlInSalientTermSegmentation/salient_term:media_noise
  • SVQMlInSalientTermSegmentation/salient_term:traffic_noise
  • SVQMrInSalientTermSegmentation/salient_term
  • SVQMrInSalientTermSegmentation/salient_term:background_speech
  • SVQMrInSalientTermSegmentation/salient_term:clean
  • SVQMrInSalientTermSegmentation/salient_term:media_noise
  • SVQMrInSalientTermSegmentation/salient_term:traffic_noise
  • SVQRuRuSalientTermSegmentation/salient_term
  • SVQRuRuSalientTermSegmentation/salient_term:background_speech
  • SVQRuRuSalientTermSegmentation/salient_term:clean
  • SVQRuRuSalientTermSegmentation/salient_term:media_noise
  • SVQRuRuSalientTermSegmentation/salient_term:traffic_noise
  • SVQTaInSalientTermSegmentation/salient_term
  • SVQTaInSalientTermSegmentation/salient_term:background_speech
  • SVQTaInSalientTermSegmentation/salient_term:clean
  • SVQTaInSalientTermSegmentation/salient_term:media_noise
  • SVQTaInSalientTermSegmentation/salient_term:traffic_noise
  • SVQTeInSalientTermSegmentation/salient_term
  • SVQTeInSalientTermSegmentation/salient_term:background_speech
  • SVQTeInSalientTermSegmentation/salient_term:clean
  • SVQTeInSalientTermSegmentation/salient_term:media_noise
  • SVQTeInSalientTermSegmentation/salient_term:traffic_noise
  • SVQUrInSalientTermSegmentation/salient_term
  • SVQUrInSalientTermSegmentation/salient_term:background_speech
  • SVQUrInSalientTermSegmentation/salient_term:clean
  • SVQUrInSalientTermSegmentation/salient_term:media_noise
  • SVQUrInSalientTermSegmentation/salient_term:traffic_noise
  • SVQUrPkSalientTermSegmentation/salient_term
  • SVQUrPkSalientTermSegmentation/salient_term:background_speech
  • SVQUrPkSalientTermSegmentation/salient_term:clean
  • SVQUrPkSalientTermSegmentation/salient_term:media_noise
  • SVQUrPkSalientTermSegmentation/salient_term:traffic_noise

[Back to top]


Transcription

Transcription tasks (Automatic Speech Recognition) evaluate the model's ability to convert spoken language into written text. This tests the phonological and linguistic coverage of the sound representations.

SVQ Transcription

A standard speech-to-text (ASR) task evaluating the linguistic accuracy of representations.

  • Goal: Transcribe the input voice query into the corresponding written text in the same language.
  • Metric: Word Error Rate (WER) and Sentence Error Rate (SER).

Tasks (See SVQ):

  • SVQArEgSpeechTranscription/speech_transcription
  • SVQArEgSpeechTranscription/speech_transcription:background_speech
  • SVQArEgSpeechTranscription/speech_transcription:clean
  • SVQArEgSpeechTranscription/speech_transcription:media_noise
  • SVQArEgSpeechTranscription/speech_transcription:traffic_noise
  • SVQArXGulfSpeechTranscription/speech_transcription
  • SVQArXGulfSpeechTranscription/speech_transcription:background_speech
  • SVQArXGulfSpeechTranscription/speech_transcription:clean
  • SVQArXGulfSpeechTranscription/speech_transcription:media_noise
  • SVQArXGulfSpeechTranscription/speech_transcription:traffic_noise
  • SVQArXLevantSpeechTranscription/speech_transcription
  • SVQArXLevantSpeechTranscription/speech_transcription:background_speech
  • SVQArXLevantSpeechTranscription/speech_transcription:clean
  • SVQArXLevantSpeechTranscription/speech_transcription:media_noise
  • SVQArXLevantSpeechTranscription/speech_transcription:traffic_noise
  • SVQArXMaghrebiSpeechTranscription/speech_transcription
  • SVQArXMaghrebiSpeechTranscription/speech_transcription:background_speech
  • SVQArXMaghrebiSpeechTranscription/speech_transcription:clean
  • SVQArXMaghrebiSpeechTranscription/speech_transcription:media_noise
  • SVQArXMaghrebiSpeechTranscription/speech_transcription:traffic_noise
  • SVQBnBdSpeechTranscription/speech_transcription
  • SVQBnBdSpeechTranscription/speech_transcription:background_speech
  • SVQBnBdSpeechTranscription/speech_transcription:clean
  • SVQBnBdSpeechTranscription/speech_transcription:media_noise
  • SVQBnBdSpeechTranscription/speech_transcription:traffic_noise
  • SVQBnInSpeechTranscription/speech_transcription
  • SVQBnInSpeechTranscription/speech_transcription:background_speech
  • SVQBnInSpeechTranscription/speech_transcription:clean
  • SVQBnInSpeechTranscription/speech_transcription:media_noise
  • SVQBnInSpeechTranscription/speech_transcription:traffic_noise
  • SVQEnAuSpeechTranscription/speech_transcription
  • SVQEnAuSpeechTranscription/speech_transcription:background_speech
  • SVQEnAuSpeechTranscription/speech_transcription:clean
  • SVQEnAuSpeechTranscription/speech_transcription:media_noise
  • SVQEnAuSpeechTranscription/speech_transcription:traffic_noise
  • SVQEnGbSpeechTranscription/speech_transcription
  • SVQEnGbSpeechTranscription/speech_transcription:background_speech
  • SVQEnGbSpeechTranscription/speech_transcription:clean
  • SVQEnGbSpeechTranscription/speech_transcription:media_noise
  • SVQEnGbSpeechTranscription/speech_transcription:traffic_noise
  • SVQEnInSpeechTranscription/speech_transcription
  • SVQEnInSpeechTranscription/speech_transcription:background_speech
  • SVQEnInSpeechTranscription/speech_transcription:clean
  • SVQEnInSpeechTranscription/speech_transcription:media_noise
  • SVQEnInSpeechTranscription/speech_transcription:traffic_noise
  • SVQEnPhSpeechTranscription/speech_transcription
  • SVQEnPhSpeechTranscription/speech_transcription:background_speech
  • SVQEnPhSpeechTranscription/speech_transcription:clean
  • SVQEnPhSpeechTranscription/speech_transcription:media_noise
  • SVQEnPhSpeechTranscription/speech_transcription:traffic_noise
  • SVQEnUsSpeechTranscription/speech_transcription
  • SVQEnUsSpeechTranscription/speech_transcription:background_speech
  • SVQEnUsSpeechTranscription/speech_transcription:clean
  • SVQEnUsSpeechTranscription/speech_transcription:media_noise
  • SVQEnUsSpeechTranscription/speech_transcription:traffic_noise
  • SVQFiFiSpeechTranscription/speech_transcription
  • SVQFiFiSpeechTranscription/speech_transcription:background_speech
  • SVQFiFiSpeechTranscription/speech_transcription:clean
  • SVQFiFiSpeechTranscription/speech_transcription:media_noise
  • SVQFiFiSpeechTranscription/speech_transcription:traffic_noise
  • SVQGuInSpeechTranscription/speech_transcription
  • SVQGuInSpeechTranscription/speech_transcription:background_speech
  • SVQGuInSpeechTranscription/speech_transcription:clean
  • SVQGuInSpeechTranscription/speech_transcription:media_noise
  • SVQGuInSpeechTranscription/speech_transcription:traffic_noise
  • SVQHiInSpeechTranscription/speech_transcription
  • SVQHiInSpeechTranscription/speech_transcription:background_speech
  • SVQHiInSpeechTranscription/speech_transcription:clean
  • SVQHiInSpeechTranscription/speech_transcription:media_noise
  • SVQHiInSpeechTranscription/speech_transcription:traffic_noise
  • SVQIdIdSpeechTranscription/speech_transcription
  • SVQIdIdSpeechTranscription/speech_transcription:background_speech
  • SVQIdIdSpeechTranscription/speech_transcription:clean
  • SVQIdIdSpeechTranscription/speech_transcription:media_noise
  • SVQIdIdSpeechTranscription/speech_transcription:traffic_noise
  • SVQJaJpSpeechTranscription/speech_transcription
  • SVQJaJpSpeechTranscription/speech_transcription:background_speech
  • SVQJaJpSpeechTranscription/speech_transcription:clean
  • SVQJaJpSpeechTranscription/speech_transcription:media_noise
  • SVQJaJpSpeechTranscription/speech_transcription:traffic_noise
  • SVQKnInSpeechTranscription/speech_transcription
  • SVQKnInSpeechTranscription/speech_transcription:background_speech
  • SVQKnInSpeechTranscription/speech_transcription:clean
  • SVQKnInSpeechTranscription/speech_transcription:media_noise
  • SVQKnInSpeechTranscription/speech_transcription:traffic_noise
  • SVQKoKrSpeechTranscription/speech_transcription
  • SVQKoKrSpeechTranscription/speech_transcription:background_speech
  • SVQKoKrSpeechTranscription/speech_transcription:clean
  • SVQKoKrSpeechTranscription/speech_transcription:media_noise
  • SVQKoKrSpeechTranscription/speech_transcription:traffic_noise
  • SVQMlInSpeechTranscription/speech_transcription
  • SVQMlInSpeechTranscription/speech_transcription:background_speech
  • SVQMlInSpeechTranscription/speech_transcription:clean
  • SVQMlInSpeechTranscription/speech_transcription:media_noise
  • SVQMlInSpeechTranscription/speech_transcription:traffic_noise
  • SVQMrInSpeechTranscription/speech_transcription
  • SVQMrInSpeechTranscription/speech_transcription:background_speech
  • SVQMrInSpeechTranscription/speech_transcription:clean
  • SVQMrInSpeechTranscription/speech_transcription:media_noise
  • SVQMrInSpeechTranscription/speech_transcription:traffic_noise
  • SVQRuRuSpeechTranscription/speech_transcription
  • SVQRuRuSpeechTranscription/speech_transcription:background_speech
  • SVQRuRuSpeechTranscription/speech_transcription:clean
  • SVQRuRuSpeechTranscription/speech_transcription:media_noise
  • SVQRuRuSpeechTranscription/speech_transcription:traffic_noise
  • SVQSwSpeechTranscription/speech_transcription
  • SVQSwSpeechTranscription/speech_transcription:background_speech
  • SVQSwSpeechTranscription/speech_transcription:clean
  • SVQSwSpeechTranscription/speech_transcription:media_noise
  • SVQSwSpeechTranscription/speech_transcription:traffic_noise
  • SVQTaInSpeechTranscription/speech_transcription
  • SVQTaInSpeechTranscription/speech_transcription:background_speech
  • SVQTaInSpeechTranscription/speech_transcription:clean
  • SVQTaInSpeechTranscription/speech_transcription:media_noise
  • SVQTaInSpeechTranscription/speech_transcription:traffic_noise
  • SVQTeInSpeechTranscription/speech_transcription
  • SVQTeInSpeechTranscription/speech_transcription:background_speech
  • SVQTeInSpeechTranscription/speech_transcription:clean
  • SVQTeInSpeechTranscription/speech_transcription:media_noise
  • SVQTeInSpeechTranscription/speech_transcription:traffic_noise
  • SVQUrInSpeechTranscription/speech_transcription
  • SVQUrInSpeechTranscription/speech_transcription:background_speech
  • SVQUrInSpeechTranscription/speech_transcription:clean
  • SVQUrInSpeechTranscription/speech_transcription:media_noise
  • SVQUrInSpeechTranscription/speech_transcription:traffic_noise
  • SVQUrPkSpeechTranscription/speech_transcription
  • SVQUrPkSpeechTranscription/speech_transcription:background_speech
  • SVQUrPkSpeechTranscription/speech_transcription:clean
  • SVQUrPkSpeechTranscription/speech_transcription:media_noise
  • SVQUrPkSpeechTranscription/speech_transcription:traffic_noise

[Back to top]


Datasets

BirdSet

BirdSet is a large-scale dataset for bioacoustic monitoring, focusing on bird species classification from audio recordings.

Task Description

The primary task is Multi-label Classification. Given a 5-second audio segment, the model must predict all bird species present in the recording.

Configurations

BirdSet includes data from various recording sites and setups, referred to as configurations: - HSN: High Sierra Nevada - NBP: NiBiolas Point - POW: Powdermill Nature Reserve - SSW: Sapsucker Woods - SNE: Sierra Nevada - PER: Peru - NES: Northeast - UHH: Hawaii - XCM: Xeno-Canto Mixed - XCL: Xeno-Canto Low-noise

References

[Back to top]


FSD50K

FSD50K is an open dataset of human-labeled sound events containing 51,197 audio clips totaling 108.3 hours.

Task Description

The task is Multi-label Sound Event Classification. Audio clips are labeled using 200 classes from the AudioSet ontology.

Reference

[Back to top]


Speech MASSIVE

Speech MASSIVE is a multilingual dataset for Speech Intent Classification, derived from the MASSIVE text dataset.

Task Description

The task is Intent Classification. Given an utterance, the model must categorize it into one of 60 predefined intents (e.g., datetime_query, iot_hue_lightchange, play_music).

Coverage

It covers 12 languages and provides a challenging testbed for multilingual speech understanding.

References

[Back to top]


Simple Voice Questions (SVQ) Dataset

Simple Voice Questions (SVQ) is a multilingual dataset designed for evaluating sound representations. It consists of voice queries in multiple languages based on Wikipedia content.

Dataset Characteristics

  • Languages: 17 languages including Arabic, Bengali, English, Finnish, Gujarati, Hindi, Indonesian, Japanese, Kannada, Korean, Malayalam, Marathi, Russian, Swahili, Tamil, Telugu, and Urdu.
  • Environments: To test robustness, queries are provided in four environments:
    • clean: High-quality recording.
    • media_noise: With background music or other media.
    • traffic_noise: With street and vehicle noise.
    • background_speech: With other people talking in the background.

References

[Back to top]