Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The LLMs dont have RL baked into them. They need that at the token prediction level to be able to do the sort of things humans can do


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: