NEWS

Memorisation of AI training data infringes copyright (GEMA vs. OpenAI)

Regional Court of Munich I, judgment of 11 Nov 2025, 42 O 14139/24

 Contents

Background

GEMA, acting as the collecting society for works including the songs “Atemlos”, “Männer” and “Über den Wolken”, brought an action against OpenAI. The lyrics had been included in the training data for GPT-4 and GPT-4o without any licence. Users could obtain largely complete lyrics by entering straightforward prompts such as “What is the text of [song title]?”. While it was undisputed that the lyrics formed part of the training set, OpenAI argued that the model does not store the texts themselves but merely statistical patterns, so that the outputs should be considered new creations rather than reproductions.

Decision

The court largely upheld GEMA’s claims. OpenAI was ordered to cease and desist from using the works in question in its models, to provide information on the use and scope, and to pay damages in principle. The court assumed at least negligent conduct on the part of OpenAI.

Reasoning: Memorisation, TDM exception & outputs

The judges consider that particularly frequent sequences from the training data can be memorised by the model: they are weighted so strongly that the exact token sequence effectively remains embedded in the model and can be reproduced. This internal, technically retrievable storage is treated as a copyright-relevant reproduction, comparable to fragmented file formats that can be reconstructed using appropriate tools.

The text and data mining (TDM) exception applies only in part:

  • Converting works into a machine-readable format and analysing them during training can be covered by the exception.
  • However, the permanent memorisation of protected content within the model, remaining available for later reproduction, is no longer justified by TDM. At this stage, the use interferes with the rightsholder’s economic exploitation rights.


The court also finds infringements in the outputs:
Near-verbatim lyrics generated by the model are stored in users’ working memory and chat histories, which constitutes reproductions of the works. OpenAI, as the operator and developer of the model, is considered responsible rather than the individual users. In addition, operating the service amounts to making the works available to the public, since an indefinite number of users can access the memorised content – a “new public”, even if the lyrics were previously available online from lawful sources.

Remedies & outlook

OpenAI is liable for damages (to be quantified separately) and must provide information and cease further infringing uses. The court rejected arguments that an injunction would be disproportionate: OpenAI can retrain models on licensed datasets or design models that do not rely on the contested content. A grace period was denied, as the company had already been warned in November 2024.

To the point

The judgment sends a clear signal:
Using copyrighted works as training data for generative AI without robust licensing and technical safeguards against memorisation creates substantial copyright risk – especially where models can reproduce protected content almost verbatim in response to simple prompts.

 

Case No.: 42 O 14139/24

Source: Justiz Bayern

 

MORE NEWS
Trademark Law

BGH “Mehmet Efendi”: No exhaustion from placing goods on the market in Turkey – association agreement does not extend the EEA

The BGH confirms: Placing EU trade mark goods on the market in Turkey does not trigger exhaustion within the EEA. The EEC–Turkey association framework does not extend the territorial scope of exhaustion; parallel imports into the EEA can be prohibited without the proprietor’s consent.
Trademark Law

BGH “LA BIOSTHETIQUE”: German courts have jurisdiction for targeted online advertising – supplier disclosure may be disproportionate

The BGH aligns international jurisdiction for online trademark infringement with the target market: what matters is where the addressed consumers/traders are located—not the server location or the advertiser’s seat. It also held that disclosure of suppliers/prior owners may exceptionally be disproportionate where the infringement lies solely in the presentation of exhausted goods.
Trade Secrets

CJEU: Infringing “possession” covers stock held abroad—and also indirect possession

The CJEU clarifies that trade mark owners may prohibit “possession” under Art. 10(3)(b) Directive 2015/2436 even where goods are stocked in another Member State—if intended for offering/placing on the market in the protection state. “Possession” also includes indirect control (supervisory/managerial authority).
Trademark Law

General Court: “Eco” may still shape the overall impression despite being descriptive

The General Court clarifies that descriptive elements can still matter in the comparison of signs—especially when placed at the beginning and drawing attention due to their length/position.
AI / Personality Rights

LG Hamburg: AI-generated X post remains attributable to the account operator

The Regional Court of Hamburg held that a continuing defamatory false statement on X remains unlawful under the law of statements even if the post was generated by AI. The account operator can be held responsible for the published content.
AI

Cologne Higher Regional Court: Meta may provisionally use public Facebook and Instagram data for AI training

The Cologne Higher Regional Court rejected an interim injunction against Meta’s announced use of publicly shared Facebook and Instagram data for AI training. In its summary assessment, the court considered the processing likely lawful, in particular on the basis of Article 6(1)(f) GDPR.

Karin Simon
Lawyer
Certified IP Lawyer

Susanne Graeser
Lawyer
Certified IP Lawyer

Uhlandstr. 2
80336 Munich
Germany

Karin Simon
Rechtsanwältin
Fachanwältin für gewerblichen Rechtsschutz

Susanne Graeser
Rechtsanwältin
Fachanwältin für gewerblichen Rechtsschutz

Uhlandstr. 2
D-80336 München