David Bremner on Nostr: I have a possibly tricky #licensing question for all of you people who are not my ...
I have a possibly tricky #licensing question for all of you people who are not my #lawyer. I have collected a corpus of #email [1] to help tune #notmuch
There are about 200k messages collected from two public mailing lists and the Enron corpus. In a fit of public spirit I thought I should upload it as a dataset to
https://zenodo.org. But I have no idea what license, if any to put on the data set. Anyone with similar experience? Somehow the Enron corpus ended up CC licensed, but I don't know if any lawyers were harmed in the making of that decision.
[1]:
https://notmuchmail.org/corpus/Published at
2023-07-18 23:32:14Event JSON
{
"id": "1a07c9a112e75c80e0b9659c5be77ee82aeb366718702594225730f541e0a4d7",
"pubkey": "0b5847bcce25abbc42ff9143ed49234cb30eab9ee0cc397c8d2af334a47f4dbb",
"created_at": 1689723134,
"kind": 1,
"tags": [
[
"t",
"notmuch"
],
[
"t",
"email"
],
[
"t",
"lawyer"
],
[
"t",
"licensing"
],
[
"mostr",
"https://mathstodon.xyz/users/bremner/statuses/110737695333351306"
]
],
"content": "I have a possibly tricky #licensing question for all of you people who are not my #lawyer. I have collected a corpus of #email [1] to help tune #notmuch \n\nThere are about 200k messages collected from two public mailing lists and the Enron corpus. In a fit of public spirit I thought I should upload it as a dataset to https://zenodo.org. But I have no idea what license, if any to put on the data set. Anyone with similar experience? Somehow the Enron corpus ended up CC licensed, but I don't know if any lawyers were harmed in the making of that decision.\n\n[1]: https://notmuchmail.org/corpus/",
"sig": "5913d9f52f55b4b8cb4a822e7c232bc1ca77fb13261d9c9cb8e906d1d10fcff7b5be69cb89a5a2654adc407446d60c19fabc91f772d2d63fdc7939fb62db63ff"
}