Pirated Training Data
Training corpora alleged to include unlawfully copied or distributed works.
Copyright | Current
Plain-English explanation
When training datasets include books or images taken from shadow libraries or torrents, defendants may find it harder to argue fair use and to rebut claims of willful infringement.
Legal meaning
Knowing use of infringing source material can affect statutory damages, findings of willfulness, and the equitable character of the fair use analysis, even if the training itself might otherwise be transformative.
AI-specific relevance
Plaintiffs in several AI cases allege that models were trained on works from shadow libraries such as LibGen, or on content scraped without a license.
Aidicia is an educational legal research portfolio. It does not provide legal advice, create a lawyer-client relationship, or replace advice from a licensed attorney.