Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's necessary for devs right now, no matter how good they are, and it's those devs' code the models are trained on


Even worse, the training set probably includes a lot of code that needed review but didn't get it...


If we know the outcome of that code, such as whether it caused bugs or data corruption or a crappy UX or tech debt -- which is potentially available in subsequent PR commit messages -- it's still valuable training data.

Probably even more valuable than code that just worked, because evidently we have enough of that and AI code still has issues.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: