Proposal to handle Data Openness in the Open Source AI definition [RFC]

I like the clarity of the quadrant, I think it’s an effective tool to explain the legal issues of distributing datasets used to create an AI system.

If I understand you correctly, you’re proposing to create two designations:

  1. Open Source AI with Open Data (OSAID D+)
  2. Open Source AI without Open Data (OSAID D-)

You’re also requesting that the designation of OSAID D- be reserved for developers who justify their legitimate reasons not to distribute the dataset.

Did I understand you correctly?