Jared Stein รีทวีตแล้ว

When Anthropic stopped training on books that were literally pirated, they managed to hit on the one way of buying books that means no money goes to authors: buying used books.
They spent tens of millions of dollars buying used books from wholesalers, in batches of tens of thousands at a time. These were shipped to Illinois, scanned, and pulped.
They called this Project Panama, using a codename because they didn’t want people to know they were doing it. (It ultimately came out through court documents.)
Before alighting on this plan, they were discussing licensing from book publishers, which would have meant money going to authors. But then they came up with the used books plan, and stopped all licensing discussions. Anthropic uses their huge war chest to get all the books in the world (that’s their aim) - authors get nothing.
IMO there are serious questions over whether this should be legal. Yes, they are buying the books. But you can’t just do anything you like with a book once you’ve bought it. You can’t scan it and sell it as an ebook, for instance. There are limits on what you can do with books you’ve bought, where what you would be doing would compete with the book’s rights holders.
As Judge Chhabria said in Meta v Kadrey, LLMs will likely compete with the books they are trained on by flooding the market. And as Dario Amodei himself said in 2021, big AI companies centralizing profits by training on books without the authors getting paid is a real concern.
Whatever your view of its legality, it’s pretty clear that it sucks for authors, letting Anthropic make money at their expense. Authors should get paid when their books are used to train AI, and should have the chance to say no to that training. Anthropic’s used books strategy gives them neither.

English













