Welcome diverse approaches to training data within a unified Open Source AI Definition

I agree with this.

In the OSAID checklist, we specify the legal conditions for datasets as “OSD-compliant license,” and I believe many participants here interpret this as implicitly including CC4 and CC0. This is a reasonable interpretation since many datasets are licensed under Creative Commons. However, OSI has not yet made a definitive assessment of Creative Commons licenses, and CC0 was reviewed in 2012, but no decision was made to declare it compliant with the OSD. I recall that there were concerns about its weakened defenses against patent claims.

Until now, OSI has only evaluated software licenses, but moving forward with OSAID, we will likely need to learn much from Creative Commons.

So, personally…, Section 2.b.1 of CC BY 4.0 states the following, but I am curious to hear opinions from organizations that actually operate under this license, such as whether there are cases where “to the extent possible” does not apply or whether this clause works effectively in most jurisdictions.

Moral rights, such as the right of integrity, are not licensed under this Public License, nor are publicity, privacy, and/or other similar personality rights; however, to the extent possible, the Licensor waives and/or agrees not to assert any such rights held by the Licensor to the limited extent necessary to allow You to exercise the Licensed Rights, but not otherwise.