I find the "trained on open data" line of reasoning really weird.
RE: https://mastodon.social/@firefoxwebdevs/115849251057488746
The datasets behind these models come from web scrapes and contain many sentences clearly taken from news websites that don't appear to have licensed their IP for that purpose. I don't see a difference between that and AI companies scanning in copyrighted books or art.
(Note: I'm not making a judgment either way on this, but if you are branding yourself on AI ethics, I think you need to be consistent in how you talk about it.)