
Some questions in your evaluation code and data #2

Open
liuhaifeng0212 opened this issue Sep 9, 2019 · 7 comments

Comments

@liuhaifeng0212

In your paper: "To evaluate the results more efficiently, we randomly sample 999 items which have no interaction with the target user and rank the validation and test items with respect to these 999 items."
But I can't find the validation dataset. Also, MovieLens has only 943 users and 1682 items. I counted the interactions in your MovieLens training data and found that user 466 has 684 items and user 405 has 732 items in the training set. So how could you evaluate with 1000 items (999 negatives plus the test item) in the test set?

@ghost

ghost commented Sep 9, 2019

Thank you for your interest. In the paper, I think I wrote that for MovieLens we perform the ranking over all items; sampling 999 negatives is only for the KKBOX dataset. Also, the README file contains the link to the test file for KKBOX.

@ghost

ghost commented Sep 9, 2019

I read the paper again and there is indeed some confusion; I'm sorry about that. The situation is that MovieLens has a smaller number of items, so ranking over all items is affordable. The sampling strategy is only for KKBOX.
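For reference, here is a minimal sketch (not code from this repository) of the two evaluation protocols described above: full ranking over all unobserved items for MovieLens, and ranking the test item against 999 sampled negatives for KKBOX. The `score(user, items)` callable is a hypothetical stand-in for the trained model's predictor.

```python
import numpy as np

def hr_ndcg_at_k(rank, k=10):
    """Hit ratio and NDCG for one test item, given its 0-based rank."""
    if rank < k:
        return 1.0, 1.0 / np.log2(rank + 2)
    return 0.0, 0.0

def evaluate_full(score, user, test_item, n_items, train_items, k=10):
    """MovieLens-style: rank the test item against every item the user has not interacted with."""
    candidates = np.setdiff1d(np.arange(n_items), np.fromiter(train_items, dtype=int))
    scores = score(user, candidates)
    target = scores[np.where(candidates == test_item)[0][0]]
    rank = int(np.sum(scores > target))  # number of candidates scored above the test item
    return hr_ndcg_at_k(rank, k)

def evaluate_sampled(score, user, test_item, n_items, train_items, n_neg=999, k=10, seed=0):
    """KKBOX-style: rank the test item against 999 sampled unobserved negatives."""
    rng = np.random.default_rng(seed)
    negatives = []
    while len(negatives) < n_neg:
        j = int(rng.integers(n_items))
        if j != test_item and j not in train_items:
            negatives.append(j)
    candidates = np.array(negatives + [test_item])
    scores = score(user, candidates)
    rank = int(np.sum(scores > scores[-1]))
    return hr_ndcg_at_k(rank, k)
```

Note that the full-ranking variant also answers the question above: the candidate set is "all items the user has not interacted with", so its size varies per user rather than being fixed at 1000.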

@liuhaifeng0212
Author

When I run your RCF.py code with the default settings, it prints "the total loss in 1 th iteration is: nan, the attentions are nan, nan, nan, nan". No matter how I change the parameter settings, I always get the same error. Do you have any suggestions?

@ghost

ghost commented Sep 27, 2019

Sorry for the late reply. Some friends of mine are also doing work based on this code, and according to them they did not encounter the NaN problem. If you always get NaN, I suspect something is wrong with the activation or the softmax function.
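As a minimal sketch (assumed, not the repository's code) of the two usual fixes for a loss that becomes NaN in the first iteration: subtract the row-wise maximum from the attention logits before the softmax, and clip the probability before taking its log in the pairwise log loss. Function and variable names here are illustrative only.

```python
import tensorflow as tf

def stable_softmax(logits, axis=-1):
    # Subtracting the max keeps exp() from overflowing to inf,
    # which would otherwise propagate NaN through the attention weights.
    logits = logits - tf.reduce_max(logits, axis=axis, keepdims=True)
    return tf.nn.softmax(logits, axis=axis)

def clipped_pairwise_log_loss(pos_scores, neg_scores, eps=1e-8):
    # BPR-style log loss; clipping keeps log() away from log(0) = -inf.
    prob = tf.sigmoid(pos_scores - neg_scores)
    return -tf.reduce_mean(tf.math.log(tf.clip_by_value(prob, eps, 1.0)))
```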

@zxm97

zxm97 commented Oct 18, 2019


I'm sorry, but I couldn't figure out how to run the RCF model (on MovieLens) without masking. I don't understand the meaning of mode == 'add' and mode == 'mul'. Could you please tell me how to get a satisfying performance?

@ghost

ghost commented Oct 19, 2019 via email

@zxm97

zxm97 commented Oct 20, 2019

> Hi, the mask is necessary because of the batch setting, so that we can feed the data in one feed_dict with a fixed length. For example, one I_u^t contains items {1, 2, 3, 4} while another contains {5, 6}; to feed them in one batch we add a mask, and the latter becomes {5, 6, mask, mask}. The mode denotes how we treat the masked positions: "add" means they are treated as -inf (before the softmax), while "mul" means they are treated as zeros.
>
> Best,
> Xin Xin
Thanks for your reply. I downloaded the code again and made a few changes so that it works in Python 3, and I added tf.clip_by_value() to every log-loss computation to prevent inf/nan. But I can't get satisfactory results on MovieLens: the attentions are about 0.31, 0.27, 0.25, 0.17, far from the reported 0.1397, 0.3191, 0.2552, 0.2859, and the performance is even worse than MF. I don't know what the problem is.
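For illustration, a minimal sketch (assumed, not the repository's exact implementation) of the two mask modes described in the quoted reply, for padded batches such as {1, 2, 3, 4} vs. {5, 6, pad, pad}. Here `mask` is 1.0 for real items and 0.0 for padding; the renormalization in the 'mul' branch is one possible interpretation of "treated as zeros".

```python
import tensorflow as tf

def masked_attention(logits, mask, mode='add'):
    if mode == 'add':
        # Padded positions get a large negative logit, so the softmax
        # drives their weight to (near) zero: the "-inf before softmax" treatment.
        logits = logits + (1.0 - mask) * (-1e9)
        return tf.nn.softmax(logits, axis=-1)
    elif mode == 'mul':
        # Softmax first, then zero out padded positions and renormalize.
        weights = tf.nn.softmax(logits, axis=-1) * mask
        return weights / (tf.reduce_sum(weights, axis=-1, keepdims=True) + 1e-8)
    raise ValueError("mode must be 'add' or 'mul'")
```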
