Free AI web copilot to create summaries, insights and extended knowledge, download it at here

Abstract

f-falcon-40b-82b5106603b4">Falcon</a>, Vicuna, MPT, and <a href="https://readmedium.com/google-bard-is-no-match-for-chatgpt-yet-573a840751e2">PaLM</a>-Bison, and even almost wins against the old <a href="https://readmedium.com/how-to-create-your-own-custom-ai-chatbot-with-a-text-editor-28-lines-of-javascript-29563510a740">ChatGPT-0301</a> model:</p><figure id="357f"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*ZhYPZs64gcbqLBAQs4YH0A.png"><figcaption><a href="https://scontent-ham3-1.xx.fbcdn.net/v/t39.2365-6/10000000_662098952474184_2584067087619170692_n.pdf?_nc_cat=105&ccb=1-7&_nc_sid=3c67a6&_nc_ohc=qhK-ahCbkBMAX-t5zlE&_nc_ht=scontent-ham3-1.xx&oh=00_AfACUnmGnjbUktpeAgZklacI4ffUC5wPIYkdbGmJuXmFUg&oe=64BE66FF">LLAMA-2 research paper</a></figcaption></figure><p id="1fa4"><i>(For further information about the training process and the evaluation of the models, I recommend you read the <a href="https://scontent-ham3-1.xx.fbcdn.net/v/t39.2365-6/10000000_662098952474184_2584067087619170692_n.pdf?_nc_cat=105&ccb=1-7&_nc_sid=3c67a6&_nc_ohc=qhK-ahCbkBMAX-t5zlE&_nc_ht=scontent-ham3-1.xx&oh=00_AfACUnmGnjbUktpeAgZklacI4ffUC5wPIYkdbGmJuXmFUg&oe=64BE66FF">official release paper here</a>)</i></p><h1 id="c0b8">How Does LLAMA-2 Compare To ChatGPT?</h1><p id="aa29">Despite the really impressive results, there is still a lot of room for growth for open-source models like LLAMA-2 to reach the quality of closed models like <a href="https://readmedium.com/next-week-openai-will-change-software-forever-31fac826c70c">OpenAI</a>’s GPT-3.5 or GPT-4.</p><p id="94cc">The official LLAMA-2 research paper is very transparent about this:</p><figure id="25a5"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*ofwpMwAwAhFulQm86nDjww.png"><figcaption><a href="https://scontent-ham3-1.xx.fbcdn.net/v/t39.2365-6/10000000_662098952474184_2584067087619170692_n.pdf?_nc_cat=105&ccb=1-7&_nc_sid=3c67a6&_nc_ohc=qhK-ahCbkBMAX-t5zlE&_nc_ht=scontent-ham3-1.xx&oh=00_AfACUnmGnjbUktpeAgZklacI4ffUC5wPIYkdbGmJuXmFUg&oe=64BE66FF">LLAMA-2 research paper</a></figcaption></figure><p id="2c9f">However, we are only at the beginning of this new AI paradigm, and given recent <a href="https://readmedium.com/wait-did-they-just-leak-the-secret-behind-gpt-4-845d3db751cd">discoveries about the nature of models like GPT-4 </a>or <a href="https://readmedium.com/what-openai-doesnt-want-you-to-know-the-deleted-sam-altman-article-96192b7cdfc7">the leaked agenda of OpenAI</a>, we may be surprised by a future where open-source models can catch up.</p><div id="ffac" class="link-block"> <a href="https://generativeai.pub/wait-did-they-just-leak-the-secret-behind-gpt-4-845d3db751cd"> <div> <div> <h2>Wait, Did They Just Leak The Secret Behind GPT-4?</h2> <div><h3>OpenAI’s GPT-4 may owe its capabilities to an old technique from the early 1990s known as “Mixture of Experts”</h3></div> <div><p>generativeai.pub</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*SePHbA3GeJiEgX51)"></div> </div> </div> </a> </div><h1 id="d506">How To Use LLAMA-2?</h1><p id="3c30">If you want to try LLAMA-2, you can do so with the generous demo apps at <b>Hugging Face</b>. Here are links to the 7B, 13B, and 70B chatbot versions:</p><h2 id="03e8">LLAMA-2 7B-CHAT</h2><p id="a051"><b>is optimized for ChatGPT-style conversations. Link: <a href="https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat"></a></b><a href="https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat">https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat</a></p><figure id="2fc6"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*Zv9CXdWNKAP7i0JskiteAA.png">

Options

<figcaption><a href="https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat">https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat</a></figcaption></figure><h2 id="5a9e">LLAMA-2 13B-CHAT</h2><p id="8dc2"><b>is another chatbot optimized model but with almost twice as much parameters as the previous 7B version. Link:</b> <a href="https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat">https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat</a></p><figure id="19bf"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*IZFFZP9UstG2rT632uFpCw.png"><figcaption><a href="https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat">https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat</a></figcaption></figure><h2 id="609a">LLAMA-2 70B-CHAT</h2><p id="9fb9"><b>comes with 70 billion parameters. Also chatbot-optimized. Link:</b> <a href="https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI">https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI</a></p><figure id="b4ad"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*NZ5uyDqp7c2Pmuf84OJZ6A.png"><figcaption><a href="https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI">https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI</a></figcaption></figure><p id="738c"><i>(If you want to run LLAMA-2 privately, you can<a href="https://tristwolff.medium.com/how-to-setup-use-the-hundreds-of-ai-models-from-hugging-face-14b8240b6f4"> follow this guide on how to run your own private instance of one of the hundreds of models provided by Huggingface</a>)</i></p><h1 id="9563">Further Reading & Links</h1><ul><li><b>Official LLAMA-2 Github repository: <a href="https://github.com/facebookresearch/llama">https://github.com/facebookresearch/llama</a></b></li><li><b>LLAMA-2 Research paper (<a href="https://scontent-ham3-1.xx.fbcdn.net/v/t39.2365-6/10000000_662098952474184_2584067087619170692_n.pdf?_nc_cat=105&ccb=1-7&_nc_sid=3c67a6&_nc_ohc=qhK-ahCbkBMAX-t5zlE&_nc_ht=scontent-ham3-1.xx&oh=00_AfACUnmGnjbUktpeAgZklacI4ffUC5wPIYkdbGmJuXmFUg&oe=64BE66FF">LINK</a>)</b></li></ul><p id="bd57">➡️ Follow me to stay up to date on “AI &<a href="https://readmedium.com/artificial-intelligence-the-misconception-of-creativity-1679d207f3e0">Creativity</a>”. If you want to support my work, become a Medium member using <a href="https://medium.com/@tristwolff/membership">my referral link and get full access to all my articles</a> (140+ and growing) and those of thousands of other writers. 🙏</p><div id="cba2" class="link-block"> <a href="https://medium.com/@tristwolff/membership"> <div> <div> <h2>Join Medium with my referral link - Tristan Wolff</h2> <div><h3>As a Medium member, a portion of your membership fee goes to writers you read, and you get full access to every story…</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*yhpYNoCLnuaujmxa)"></div> </div> </div> </a> </div><p id="1662">➡️ If you like my content, why not leave a “clap” at the end of this article, so more people can see it?</p><figure id="a30a"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*uQ_OBX60jlZrBdZ4.png"><figcaption></figcaption></figure><p id="3fa5"><b>This story is published on <a href="https://generativeai.pub/">Generative AI</a>. Connect with us on <a href="https://www.linkedin.com/company/generative-ai-publication">LinkedIn</a> to get the latest AI stories and insights right in your feed. Let’s shape the future of AI together!</b></p><figure id="5730"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*u9lrmc63b7QC9a9k.png"><figcaption></figcaption></figure></article></body>

Artificial Intelligence Rockstars

Facebook/Meta Storms The AI Charts With LLAMA-2

Facebook’s New Open-Source Language Model Available Now

And they keep coming: we’ve seen the release of some truly amazing open-source language models here and here, but now it’s Facebook/Meta in the spotlight again.

This time with a rockstar release, they catapulted their new large language model (LLM) straight to pole position on the Huggingface Open LLM charts.

Ladies and gentlemen say hello to LLAMA-2, a collection of pre-trained and fine-tuned generative text models coming with a full commercial license!

Here is everything you need to know about LLAMA-2 and how you can start using it today!

What Is LLAMA-2?

LLAMA-2 is not a single language model but rather a suite of several pre-trained and fine-tuned large language models (LLMs) in different sizes (ranging from 7 to 70 billion parameters).

Some of the fine-tuned versions, referred to as LLAMA-2-Chat, are specifically enhanced for chatbot-style interfaces and apps like ChatGPT.

And, LLAMA-2 models rock. They are already dominating the so-called Open LLM Leaderboard at Huggingface, which means that they perform better than any other open-source large language model currently out there:

https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?utm_source=substack&utm_medium=email

Interestingly, LLAMA-2 models not only demonstrate superior performance compared with other open-source models across a range of benchmarks: but researchers also found promising results in terms of safety and utility, suggesting that LLAMA-2 models (coming with full commercial licenses) might serve as an alternative to proprietary models in many cases.

Take a look at this comparison of how different models can be forced to produce hate speech, unqualified advice, or criminal activities:

But wait, there’s more.

In another creative experiment, the researchers conducted “games” in which they let GPT-4 decide which open source model provides the best answers and thus wins.

Watch how LLAMA-2 beats some of the best models so far, such as Falcon, Vicuna, MPT, and PaLM-Bison, and even almost wins against the old ChatGPT-0301 model:

(For further information about the training process and the evaluation of the models, I recommend you read the official release paper here)

How Does LLAMA-2 Compare To ChatGPT?

Despite the really impressive results, there is still a lot of room for growth for open-source models like LLAMA-2 to reach the quality of closed models like OpenAI’s GPT-3.5 or GPT-4.

The official LLAMA-2 research paper is very transparent about this:

However, we are only at the beginning of this new AI paradigm, and given recent discoveries about the nature of models like GPT-4 or the leaked agenda of OpenAI, we may be surprised by a future where open-source models can catch up.

Wait, Did They Just Leak The Secret Behind GPT-4?

OpenAI’s GPT-4 may owe its capabilities to an old technique from the early 1990s known as “Mixture of Experts”

generativeai.pub

How To Use LLAMA-2?

If you want to try LLAMA-2, you can do so with the generous demo apps at Hugging Face. Here are links to the 7B, 13B, and 70B chatbot versions:

LLAMA-2 7B-CHAT

is optimized for ChatGPT-style conversations. Link: https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat

LLAMA-2 13B-CHAT

is another chatbot optimized model but with almost twice as much parameters as the previous 7B version. Link: https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat

LLAMA-2 70B-CHAT

comes with 70 billion parameters. Also chatbot-optimized. Link: https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI

(If you want to run LLAMA-2 privately, you can follow this guide on how to run your own private instance of one of the hundreds of models provided by Huggingface)