
Mamba came out of the same research group, Hazy Research, led by Chris Ré. This new "Jamba" model, which combines Mamba and dot-product attention layers, has ~8x more parameters than the largest open Striped Hyena and appears to work much better.

