Please consider adding a sampled softmax loss, in addition to "cross_entropy_sequence_loss".
For tasks with large target vocabularies, the speedup can be significant (with, perhaps, a minor per-step accuracy loss).
Even on the "nmt_large" config with a batch size of 128 and a vocabulary size of 32,000, I am seeing roughly a 1.22x speedup.
It is, however, a little tricky to add. I have a first draft here: https://github.com/okuchaiev/seq2seq/tree/sampled_softmax_first_try
It is not ready to be merged yet.
Let me know if this is of interest - I plan to polish my implementation and would appreciate thoughts on the right way to add it here.
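For reference, here is a minimal sketch (not the linked draft) of what a sampled-softmax sequence loss could look like in TF 1.x, assuming the output-projection weights and biases are accessible; the function name `sampled_softmax_sequence_loss` and its arguments are mine, chosen to mirror the time-major shapes and length masking used by `cross_entropy_sequence_loss`:

```python
import tensorflow as tf

def sampled_softmax_sequence_loss(decoder_outputs, output_weights, output_biases,
                                  targets, sequence_length, num_sampled=512):
  """Hypothetical sampled-softmax analogue of cross_entropy_sequence_loss.

  decoder_outputs: [T, B, dim] pre-projection decoder states (time-major).
  output_weights:  [vocab_size, dim] output projection matrix.
  output_biases:   [vocab_size] output projection bias.
  targets:         [T, B] int target token ids.
  sequence_length: [B] lengths used to mask padding positions.
  """
  vocab_size, dim = output_weights.get_shape().as_list()

  # Flatten time and batch so each target token is one sampled-softmax example.
  flat_outputs = tf.reshape(decoder_outputs, [-1, dim])
  flat_targets = tf.reshape(targets, [-1, 1])

  # Only num_sampled negative classes are scored per step, instead of the
  # full vocabulary, which is where the speedup comes from.
  losses = tf.nn.sampled_softmax_loss(
      weights=output_weights,
      biases=output_biases,
      labels=flat_targets,
      inputs=flat_outputs,
      num_sampled=num_sampled,
      num_classes=vocab_size)

  # Restore the [T, B] shape and mask positions past each sequence's length,
  # mirroring the masking in cross_entropy_sequence_loss.
  losses = tf.reshape(losses, tf.shape(targets))
  mask = tf.sequence_mask(
      tf.to_int32(sequence_length), tf.shape(targets)[0], dtype=tf.float32)
  return losses * tf.transpose(mask, [1, 0])
```

At evaluation and inference time one would still fall back to the full softmax, since sampled softmax is only an approximation of the training objective.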