Hello!
I have a custom dataset that I have formatted according to recbole's Atomic Files guide (https://www.recbole.io/atomic_files.html) and created the custom_dataset.inter file
I place it in /dataset/custom_dataset/ , and then run:
python run_recbole.py
--model=DirectAU --dataset=custom_dataset
--learning_rate=1e-3 --weight_decay=1e-6
--gamma=1 --encoder=MF --train_batch_size=1024
However, while the true #users, # items ~ (2 million, 2 million), the terminal output says #Users are 360587 (and sets the batch size to that) . [ Pic Attached ]
Why this discrepancy?
I also tried with explicitly passing a config file as well.
See below


Hello!
I have a custom dataset that I have formatted according to recbole's Atomic Files guide (https://www.recbole.io/atomic_files.html) and created the custom_dataset.inter file
I place it in /dataset/custom_dataset/ , and then run:
python run_recbole.py
--model=DirectAU --dataset=custom_dataset
--learning_rate=1e-3 --weight_decay=1e-6
--gamma=1 --encoder=MF --train_batch_size=1024
However, while the true #users, # items ~ (2 million, 2 million), the terminal output says #Users are 360587 (and sets the batch size to that) . [ Pic Attached ]
Why this discrepancy?
I also tried with explicitly passing a config file as well.

See below