src/backends/gpu/kernels/assemble_matrix.hpp · bd1e56a562a4c64d40f9e67ffedd64a177d25d71 · arbor-sim / arbor

Add required thread synchronization to matrix kernel. (#280) · bd1e56a5

Sam Yates authored 8 years ago

There is a potential data race in the `assemble_matrix_interleaved` kernel, where threads in a different warp can overwrite the `buffer_v` and `buffer_i` values before they are used to update the `d` and `rhs` vectors.

This race has been exercised in the asynchronous event delivery branch.

* Add `__syncthreads()` to assemble matrix interleaved kernel after `d` and `rhs` update.

bd1e56a5

assemble_matrix.hpp 3.46 KiB