Optimize vectorized compound_indexed_add (#673)
* Optimize "none" index_constraint specialization of compound_indexed_add, so that it only reads/writes each distinct memory index once per vector. Related to issue #637.
Please register or sign in to comment