James Grimmelmann on Nostr:
We say that a model has “memorized” a piece of training data when:
(1) it is possible to reconstruct from the model
(2) a near-exact copy of
(3) a substantial portion of
(4) that specific piece of training data.
We think that this is the most useful definition for legal conversations, and we explain how it relates to other terms in common use, such as “learning,” “extraction,” and “regurgitation.”
Published at 2024-07-18 23:26:34

Event JSON
{
"id": "b1df83f0f63f85c0c5d7b80871bccb7cd64ebe1481bc19cd59a839f257c19e79",
"pubkey": "de6ebfbd07446d84070e4167857889594bb4f78b9670cb800fae5e4c5c20ed66",
"created_at": 1721345194,
"kind": 1,
"tags": [
[
"e",
"c997218a62af1a5f423ffd330eeabe477a1591cbd0463892cb38e61dac9605f4",
"wss://relay.mostr.pub",
"reply"
],
[
"proxy",
"https://mastodon.lawprofs.org/users/jtlg/statuses/112810078691859883",
"activitypub"
]
],
"content": "We say that a model has “memorized” a piece of training data when:\n(1) it is possible to reconstruct from the model \n(2) a near-exact copy of \n(3) a substantial portion of \n(4) that specific piece of training data. \nWe think that this is the most useful definition for legal conversations, and we explain how it relates to other terms in common use, such as “learning,” “extraction,” and “regurgitation.”",
"sig": "df6e8ad2feff1200634ed38c912211cf16f1665ca13e5a840b97db2284892c190e401f1c9b9ff372a8b1310ef95f4521f4d60162807507329401a780f87fbf1b"
}
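The "id" field in the event JSON above is not arbitrary: under the Nostr protocol's NIP-01, an event id is the SHA-256 hash of a canonical UTF-8 JSON serialization of the array [0, pubkey, created_at, kind, tags, content], with no extra whitespace. A minimal sketch in Python (the function name and the sample event are illustrative; reproducing the exact id above would require matching NIP-01's character-escaping rules byte for byte):

```python
import hashlib
import json

def nostr_event_id(event: dict) -> str:
    """Sketch of NIP-01 id computation: SHA-256 of the compact JSON
    serialization of [0, pubkey, created_at, kind, tags, content],
    with non-ASCII characters left unescaped (UTF-8)."""
    payload = [
        0,
        event["pubkey"],
        event["created_at"],
        event["kind"],
        event["tags"],
        event["content"],
    ]
    serialized = json.dumps(payload, separators=(",", ":"), ensure_ascii=False)
    return hashlib.sha256(serialized.encode("utf-8")).hexdigest()

# Illustrative event (same pubkey/timestamp/kind as above, simplified
# tags and content, so the resulting id will differ from the real one):
event = {
    "pubkey": "de6ebfbd07446d84070e4167857889594bb4f78b9670cb800fae5e4c5c20ed66",
    "created_at": 1721345194,
    "kind": 1,
    "tags": [],
    "content": "hello nostr",
}
print(nostr_event_id(event))  # a 64-character lowercase hex digest
```

Clients and relays recompute this hash to check that an event's claimed id matches its contents before verifying the Schnorr signature in "sig" against "pubkey".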