⚑🚨 NEW - NVIDIA allegedly contacted Anna's Archive directly for access to ~500 terabytes of "pirated" books and papers for pre-training their LLMs Anna's warned them the collections were illegal and copyrighted. NVIDIA's data strategy team pushed anyway; executives gave the green light within days, per internal docs cited in the lawsuit.

Replies (11)

Default avatar
G Force G 1 week ago
I don't think it is necessarily about LLMs, its about the concept of IP. You can't really have both without removing the "Large" in Large Language Model.
Default avatar
G Force G 1 week ago
Its sort of like nuclear bombs. There is no practical use for them unless you are a nation state. There is no practical way to produce them unless you have a money printer and a bunch of statist scientists and engineers who want to get paid.
Yeah I've sort of had my doubts about whether AI (LLMs in particular) is ultimately a "democratizing" technology or not, precisely because for these things to be of any use, they have to ingest massive amounts of data and energy.
↑