CLEF 2020 LNCS Springer Proceedings Available -

The CLEF 2020 Proceedings are available online as Spring LNCS Proceedings 12260 at:

Swahili and Somali Query Translations of CLEF Bilingual Dataset Available to Researchers

Center for Intelligent Information Retrieval (CIIR) researchers within the University of Massachusetts Amherst College of Information and Computer Sciences are providing a dataset that consists of Swahili and Somali queries translated from the CLEF 2000-2003 Campaign for Bilingual Ad-Hoc Retrieval Tracks (


For researching on low-resource languages, the CIIR has produced an extension of 200 queries by translating all four years of bilingual queries (2000-2003) into Swahili and Somali, with topic set IDs of C001-C200 corresponding to the other languages that exist in the CLEF data.  They used a translation organization to translate the title and description of the English queries from that topic set into Swahili and Somali languages. Somali is in the Afro-Asiatic language family, and Swahili is in the Niger-Congo language family. Both are mostly spoken in Africa.


More information can be found in their paper, “Simulating CLIR Translation Resource Scarcity using High-resource Languages,” by authors Hamed Bonab, James Allan, and Ramesh Sitaraman in the Proceedings of ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR 2019).


The dataset and paper can be downloaded at:

The CLEF Book on lessons learned in 20 years of research -

The book celebrating 20 years of CLEF activities

Information Retrieval Evaluation in a Changing World.  Lessons Learned from 20 Years of CLEF
is published by Springer and it is available at
CLEF 2019 LNCS Springer Proceedings Available -

The CLEF 2019 Proceedings are available online as Spring LNCS Proceedings 10456 at:

CLEF 2019 Working Notes Available -

The CLEF 2019 Working Notes are available online as CEUR-WS Proceedings 2380 at:
Showing 1 - 5 of 35 results.
Items per Page 5
of 7