
It shouldn't be a problem if you only train on legally acquired data. You will know the author's name and can contact them if you so wish.


There aren't any laws that require "acquiring" something in a way that "knows the author's name".


I don't think any of the major players could do that for all their data, and they are acquiring it legally.


What? How do you know the data you're buying isn't AI-generated by the sellers?

If they are scamming and you contact them, of course they will lie.

So how does this work?



