Hi, thanks for sharing your code! I read your paper and am wondering about the matrix multiplication order for the backward loss correction approach.
The paper says T^{-1} loss
In loss.robust, for backward, we have:
return -K.sum(K.dot(y_true, P_inv) * K.log(y_pred), axis=-1)
It looks to me like the order of matrix multiplication for P_inv and y_true should be switched. My guess is that I'm misunderstanding something, but would really appreciate if you could clarify.
Thanks!
Hi, thanks for sharing your code! I read your paper and am wondering about the matrix multiplication order for the backward loss correction approach.
The paper says T^{-1} loss
In loss.robust, for backward, we have:
return -K.sum(K.dot(y_true, P_inv) * K.log(y_pred), axis=-1)It looks to me like the order of matrix multiplication for P_inv and y_true should be switched. My guess is that I'm misunderstanding something, but would really appreciate if you could clarify.
Thanks!