XMOD-base fine-tuned using ColBERT-XM methodology on Dutch Translated to Afrikaans Queries and Dutch Documents from mMARCO/v2.

Essentially, it's ColBERT-XM but fine-tuned on Afrikaans-Dutch mMARCOv2 in contrast to MSMARCO of the original ColBERT-XM model.

This model was fine-tuned for the "Improving Low-Resource Retrieval Effectiveness using Zero-Shot Linguistic Similarity Transfer" ECIR2025 paper. The source code for the paper can be found here

Downloads last month
7
Safetensors
Model size
853M params
Tensor type
F32
·
Inference Examples
Inference API (serverless) does not yet support colbert-ai models for this pipeline type.

Model tree for andreaschari/colbert-xm-lt-afdt

Base model

facebook/xmod-base
Finetuned
(3)
this model

Datasets used to train andreaschari/colbert-xm-lt-afdt