if it is truly possible to recreate a model with performance equivalent to Meta’s model using that large amount of synthetic data
Ah but that’s the thing. Current OSAID proposal only requires that a ‘skilled user’ (skilled how?) be able to create a ‘substantially equivalent’ system.
Equivalent how? (Performance on automatic, proven-to-be-gameable metrics, likely.) And how much wiggle room does that hedge ‘substantially’ provide?
It just seems designed to open up all sorts of degrees of freedom that will mostly be beneficial to the big players seeking to evade scrutiny, while taking the oxygen out of the room for the smaller players who have already demonstrated meaningful openness is in fact possible.