79794067

Date: 2025-10-19 02:37:55
Score: 0.5
Natty:
Report link

Looks like you might be encountering the same problem discovered in llama.cl -- hidden memory allocation, at least in the BLAS library we used, LLA. To get fast results with gemm you'll need to allocate the memory ahead of time. and use the destructive version (gemm! for example; note the allocation of the target array before the matrix multiplication)

Reasons:
  • Has code block (-0.5):
  • Single line (0.5):
  • Low reputation (0.5):
Posted by: CL-USER