Training Data
The corpus of examples used to optimize a machine learning model's parameters.
Copyright · Current
Plain-English explanation
Training data is the raw material fed into an AI system—text, images, code, or other signals—from which the model learns patterns.
Legal meaning
Training data matters legally because assembling it often involves copying or accessing third-party works. How those copies were acquired, and whether and how they were retained, is central to infringement analysis.
AI-specific relevance
Whether training data was licensed, scraped, or sourced from pirated libraries is a recurring factual and legal battleground.
Aidicia is an educational legal research portfolio. It does not provide legal advice, create a lawyer-client relationship, or replace advice from a licensed attorney.