Releases: FluxML/NNlibCUDA.jl

v0.2.7

02 Feb 17:52
5f797ae

NNlibCUDA v0.2.7

Diff since v0.2.6

Merged pull requests:

v0.2.6

07 Jan 18:17
2e29186

NNlibCUDA v0.2.6

Diff since v0.2.5

Merged pull requests:

v0.2.5

06 Jan 01:55
797f567

NNlibCUDA v0.2.5

Diff since v0.2.4

Merged pull requests:

v0.2.4

23 Jul 04:42
b789f43

NNlibCUDA v0.2.4

Diff since v0.2.3

Closed issues:

  • NNlibCUDA Heisenbug in conv! with nonzero beta (#37)

Merged pull requests:

  • get rid of duplicate code in upsampling code by using dispatch (#49) (@maxfreu)
  • save allocs during algorithm search (#53) (@maxfreu)
  • print/convert batchedadjtrans over cuarray (#54) (@chengchingwen)
  • Move ctc_loss from Flux to NNlibCUDA (#55) (@mcabbott)

v0.2.3

21 May 01:31
da73f07

NNlibCUDA v0.2.3

Diff since v0.2.2

Closed issues:

  • Unconstrained element type on activations causing errors with e.g. Complex, ForwardDiff.Dual (#47)

Merged pull requests:

  • Restrict element type of activation overrides to CUDNN datatypes (#48) (@DomCRose)

v0.2.2

07 Mar 16:14
e16e235

NNlibCUDA v0.2.2

Diff since v0.2.1

Closed issues:

  • Slow ∇softmax! compared with generic version. (#30)

Merged pull requests:

v0.2.1

07 Feb 23:25
30c3e6e

NNlibCUDA v0.2.1

Diff since v0.2.0

Closed issues:

  • No frule for some activations (#42)

Merged pull requests:

v0.2.0

23 Jan 16:51
1b75f6b

NNlibCUDA v0.2.0

Diff since v0.1.11

Merged pull requests:

v0.1.11

06 Dec 19:16
fb6fe8e

NNlibCUDA v0.1.11

Diff since v0.1.10

Closed issues:

  • gpu scatter with CartesianIndex not supported (#29)

Merged pull requests:

  • Support gpu scatter/gather with CartesianIndex (#33) (@yuehhua)

v0.1.10

15 Nov 20:25
96a3346

NNlibCUDA v0.1.10

Diff since v0.1.9

Merged pull requests:

  • Add CUDA kernels for grid sampling (#31) (@pxl-th)