kernels/fmha/smem_tile.h (4 lines): - line 138: // TODO: Why not merge it with the read offset? - line 223: // Move the read offset to next buffer. TODO: Remove this member function!!! - line 238: // Move the read offset to next N buffer (circular-buffer). TODO: Remove this member function!!! - line 257: // Move the write offset to next buffer. TODO: Remove that member function! kernels/fmha/gmem_tile.h (4 lines): - line 488: // TODO: Not sure if it's right for MMAS_M > 1 - line 499: // TODO: Not sure if it's right for MMAS_M > 1 - line 508: // TODO: Not sure if it's right for MMAS_M > 1 - line 519: // TODO: Not sure if it's right for MMAS_M > 1 kernels/fmha_fprop_kernel_1xN.h (1 line): - line 499: // TODO: this should change after we rearrange the warps (e.g. cutlass branch) kernels/fmha/gemm.h (1 line): - line 129: // TODO (TD 2022-04-09): Shouldn't this be NUM_REGS instead of NUM_ELTS? src/lib.rs (1 line): - line 107: // TODO: Handle head sizes that are not a multiple of 8 via some padding.