As a project for our computational fabrication course, I implemented a paper with a peer. We did it in Python because I didn't want him to suffer with lower-level stuff, but now it runs sooo slow.
And I'm just itching, dying to OpenCL/CUDA the shit out of this code, because it's so parallelizable it's a sin not to do it.
But it's really out of scope for the course, not worth the effort, and I have tons of other stuff to do now.
I miss being paid to make these sorts of algorithms efficient 😭

