Meta/Facebook's new open-source AI model, LLaMA-13B, outperforms OpenAI's GPT-3 in most benchmarks and is more than 10 times smaller.
Abstract
Meta/Facebook has released a new collection of AI models, including the LLaMA-13B model, which has surprised the AI community by outperforming OpenAI's GPT-3 in most benchmarks. Despite being more than 10 times smaller than GPT-3, LLaMA-13B performs almost the same work. This development could lead to the creation of powerful AI models that can run locally on personal devices. Meta's model was trained using only publicly available datasets, making it more reproducible and open-source than other successful models.
Opinions
Size matters in AI, and a model with fewer parameters that can perform the same work is a significant step forward.
The fact that a model 10 times smaller can do almost the same work as GPT-3 is a game-changer for AI development.
Using only publicly available datasets for training makes Meta's work more reproducible and open-source than other successful models.
This development is an unexpected move from Meta, as it is known for being a closed-source company.
The AI community is excited to see how this development will impact the AI race.
The author believes that this is a treat for the AI community and will lead to the development of powerful AI models that can run locally on personal devices.
The author encourages readers to follow them on Twitter or Medium for more information about AI and creativity.
Size Matters And Facebook Just Unleashed A Beast
Meta/Facebook’s Open-Source AI Model Outperforms OpenAI’s GPT-3
Meta/Facebook released a collection of new AI models this week.
One of them, LLaMA-13B, surprised the AI community when it found that it outperformed OpenAI’s highly acclaimed GPT-3 in most benchmarks (the tech-savvy can read the original paper here).
But that’s not all! Meta’s LLaMA does not only outperform OpenAI’s flagship model, but it’s also more than 10× smaller!
Oh, and it’s basically open-source. That, too, is an absolute first.
Does Size Really Matter?
Yes, in AI size matters a lot. That’s why everybody keeps talking about parameters. Now, a parameter is basically like a variable that a machine-learning model can use to learn and make predictions. The more parameters the model has, the better it can do its job, but it also needs more space and energy to use them.
So, if a model can do a great job with fewer parameters, that’s obviously a great step forward because it saves a lot of time and energy:
GPT-3, the foundational model behind OpenAI’s flagship ChatGPT, has 175 billion parameters.
Meta released a whole collection of models last week, ranging from 7 to 65 billion parameters, with the most astonishing one being LLaMA-13B, the 13 billion parameters model outperforming the 10x bigger GPT-3.
As mentioned earlier, this means that a model 10 times smaller does almost the same work! Which could now lead to the development of powerful AI models running locally on our laptops or smartphones.
Sample from LLaMA’s performance after fine-tuning (Link to original paper: https://t.co/q51f2oPZlE)
An Unexpected Move
Understandably, this was one proud tweet when the social media and Big Data pioneer announced that its newly released AI text generation model would outperform its major competitors:
Furthermore, and unlike other successful models such as Chinchilla, PaLM, or GPT-3, Meta used only publicly available datasets to train their model. This means that their work is actually reproducible and ironically more open-source than anything OpenAI has released — a rather unexpected move from Meta, indeed.
It will be exciting to watch how this plays out.
No doubt, we’re now definitely in for a treat with this crazy fast AI race!