It’s worth highlighting Ai2’s work on building a fully Open Source AI. It seems to adhere to the OSAID:
What makes Tülu 3 405B different for users, though, is how Ai2 has made the model available.
There is a lot of noise in the AI market about open source. DeepSeek says its model is open source, and Meta says the same of Llama 3.1, which Tülu 3 405B also outperforms.
With both DeepSeek and Llama, the models are freely available for use, and some of the code, but not all of it, is available.
For example, DeepSeek has released the model code and pre-trained weights for DeepSeek-R1, but not the training data. Ai2 is taking a different approach in an attempt to be more open.
“We don’t leverage any closed datasets,” Hajishirzi said. “As with our first Tülu 3 release in November 2024, we are releasing all of the infrastructure code.”
She added that Ai2’s fully open approach, which includes data, training code and models, ensures users can easily customize their pipeline for everything from data selection through evaluation.