While I’d support the first two change sugestions, and I totally agree that training data is to trained model weights as software source code is to binary executable, I can’t see how a “system trained on unshareable non-public training data” could match an “Open Source AI” definition.
Such system would not provide the freedom to study the system and would limit the freedom to modify the model in a huge way.
Open Source grants the freedom to study and modify or just the freedom to fine tune?