An experiment with Gnocchi â (the database) â Part 3
In Part 2, I summarized the iterations applied to the kernel (the piece of CUDA code that executes on GPU) to remove the bottlenecks encountered during profiling, like using shared memory to avoid non-coalesced memory access. In this part, I will talk about the final version of the kernel and using the GPU in other […]
The post An experiment with Gnocchi â (the database) â Part 3 appeared first on SUSE Communities.