
#Mamba

4 posts


Technical question about Mamba Selective Scan kernel and FP16/FP32 precision

Reddit r/learnmachinelearning·u/Dry-Trouble4373·about 1 month ago
#kernel #fp16 #mamba #fp32 #precision #article

I'm trying to evaluate the model's accuracy when all internal operations are strictly limited to **FP16**. However, I noticed that the `selective_scan` CUDA kernel seems to use **FP32 accumulators** by default.…
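
To make the precision question concrete, here is a minimal sketch of why the accumulator type matters in a scan. It assumes a scalar linear recurrence `h_t = a_t * h_{t-1} + bx_t` as a stand-in for the selective scan; the function `scan`, the sequence length, and the random inputs are all hypothetical, and the code does not call or reproduce the actual `selective_scan` CUDA kernel. Inputs are FP16 in every run, and only the dtype of the running state changes.

```python
import numpy as np

def scan(a, bx, state_dtype):
    """Sequential recurrence h_t = a_t * h_{t-1} + bx_t, state kept in state_dtype."""
    h = state_dtype(0.0)
    out = np.empty(len(a), dtype=np.float64)
    for t in range(len(a)):
        # Cast the FP16 inputs to the state dtype, then round the result back to it,
        # so every intermediate value lives in state_dtype.
        h = state_dtype(state_dtype(a[t]) * h + state_dtype(bx[t]))
        out[t] = float(h)
    return out

rng = np.random.default_rng(0)
T = 4096  # hypothetical sequence length
a = rng.uniform(0.9, 0.999, T).astype(np.float16)   # decay factors, stored in FP16
bx = rng.normal(0.0, 1.0, T).astype(np.float16)     # input contributions, stored in FP16

ref = scan(a, bx, np.float64)  # high-precision reference on the same FP16 inputs
err16 = np.max(np.abs(scan(a, bx, np.float16) - ref))
err32 = np.max(np.abs(scan(a, bx, np.float32) - ref))

print(f"max abs error, FP16 state: {err16:.3e}")
print(f"max abs error, FP32 state: {err32:.3e}")
```

Because the state is carried across every time step, rounding it to FP16 at each step compounds in a way that rounding only the inputs does not, which is one plausible reason a kernel would keep FP32 accumulators even when its inputs and outputs are FP16.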
