Commit 8087cb86, authored by Thorsten Hater, committed by GitHub

:octopus: Optimise PPACK (#2067)

- Pointers in `PPACK_IFACE_BLOCK` are now declared `__restrict__`.
  - This is expected to produce better binary code in the _scalar_ case.
  - On GPU, this is expected to make better use of the data caches, as per the CUDA docs.
- `ved_di` is no longer used, so it is removed and the ABI version is bumped.
- GPU code never used the indexing structs, so they are removed from `PPACK_IFACE_BLOCK`.
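
For illustration only, a minimal sketch of what `__restrict__` promises the compiler in a scalar loop. The function and parameter names (`advance_state`, `rate`, `state`) are hypothetical and not taken from the actual sources; `__restrict__` is a compiler extension (GCC, Clang, NVCC), not standard C++.

```cpp
#include <cstddef>

// Hypothetical kernel, not the real PPACK interface.
// Marking both pointers __restrict__ asserts that the arrays do not overlap,
// so the compiler may keep values in registers and vectorise the loop
// without inserting runtime aliasing checks.
inline void advance_state(std::size_t n,
                          const double* __restrict__ rate,
                          double* __restrict__ state,
                          double dt) {
    for (std::size_t i = 0; i < n; ++i) {
        state[i] += dt * rate[i];
    }
}
```

Passing overlapping arrays to such a function is undefined behaviour, so the qualifier is only safe where the caller guarantees distinct buffers.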
parent 861aa677