Nice work! I've been simultaneously working on something similar though a totally different approach. I'm using `snowflake-arctic-embed-s` (which I've benchmarked to be the most accurate small embedding model for my use case - source code embedding) and a pure Go approach, but unsurprisingly it is somewhat slower than your approach. Did you attempt a pure Go version at first before adding in the Go assembly?
Nice work! I've been simultaneously working on something similar though a totally different approach. I'm using `snowflake-arctic-embed-s` (which I've benchmarked to be the most accurate small embedding model for my use case - source code embedding) and a pure Go approach, but unsurprisingly it is somewhat slower than your approach. Did you attempt a pure Go version at first before adding in the Go assembly?