XMOD-base fine-tuned using ColBERT-XM methodology on Dutch Translated to Afrikaans Queries and Dutch Documents from mMARCO/v2.

Essentially, it's ColBERT-XM but fine-tuned on Afrikaans-Dutch mMARCOv2 in contrast to MSMARCO of the original ColBERT-XM model.

This model was fine-tuned for the "Improving Low-Resource Retrieval Effectiveness using Zero-Shot Linguistic Similarity Transfer" ECIR2025 paper. The source code for the paper can be found here

Downloads last month: 7

Safetensors

Model size

853M params

Tensor type

F32

Inference Examples

Sentence Similarity

Inference API (serverless) does not yet support colbert-ai models for this pipeline type.

Model tree for andreaschari/colbert-xm-lt-afdt

Base model

facebook/xmod-base

Finetuned

antoinelouis/colbert-xm

Finetuned

(3)

this model

andreaschari
/

colbert-xm-lt-afdt

Model tree for andreaschari/colbert-xm-lt-afdt

Datasets used to train andreaschari/colbert-xm-lt-afdt