We'll step through the process of migrating code from native Python to Numba, and then to a CuPy Raw Kernel (CUDA C++). Basic workflow, best practices, lessons learned, and coding samples will be provided. NVIDIA Nsight Systems profilers will be used to demonstrate how minor optimizations can provide substantial performance benefits to custom developed code. The techniques discussed in this session can be used in any domain.