Hacker News

That's part of it, but I think another part is just the way LLMs are tuned. They're capable of more conversational tones, but human feedback in post-training biases them toward a Quora / Stack Overflow / Reddit Q&A writing style, because that's what gets the best ratings during the RLHF process.



