avatarTristan Wolff

Free AI web copilot to create summaries, insights and extended knowledge, download it at here

3970

Abstract

f-falcon-40b-82b5106603b4">Falcon</a>, Vicuna, MPT, and <a href="https://readmedium.com/google-bard-is-no-match-for-chatgpt-yet-573a840751e2">PaLM</a>-Bison, and even almost wins against the old <a href="https://readmedium.com/how-to-create-your-own-custom-ai-chatbot-with-a-text-editor-28-lines-of-javascript-29563510a740">ChatGPT-0301</a> model:</p><figure id="357f"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*ZhYPZs64gcbqLBAQs4YH0A.png"><figcaption><a href="https://scontent-ham3-1.xx.fbcdn.net/v/t39.2365-6/10000000_662098952474184_2584067087619170692_n.pdf?_nc_cat=105&amp;ccb=1-7&amp;_nc_sid=3c67a6&amp;_nc_ohc=qhK-ahCbkBMAX-t5zlE&amp;_nc_ht=scontent-ham3-1.xx&amp;oh=00_AfACUnmGnjbUktpeAgZklacI4ffUC5wPIYkdbGmJuXmFUg&amp;oe=64BE66FF">LLAMA-2 research paper</a></figcaption></figure><p id="1fa4"><i>(For further information about the training process and the evaluation of the models, I recommend you read the <a href="https://scontent-ham3-1.xx.fbcdn.net/v/t39.2365-6/10000000_662098952474184_2584067087619170692_n.pdf?_nc_cat=105&amp;ccb=1-7&amp;_nc_sid=3c67a6&amp;_nc_ohc=qhK-ahCbkBMAX-t5zlE&amp;_nc_ht=scontent-ham3-1.xx&amp;oh=00_AfACUnmGnjbUktpeAgZklacI4ffUC5wPIYkdbGmJuXmFUg&amp;oe=64BE66FF">official release paper here</a>)</i></p><h1 id="c0b8">How Does LLAMA-2 Compare To ChatGPT?</h1><p id="aa29">Despite the really impressive results, there is still a lot of room for growth for open-source models like LLAMA-2 to reach the quality of closed models like <a href="https://readmedium.com/next-week-openai-will-change-software-forever-31fac826c70c">OpenAI</a>’s GPT-3.5 or GPT-4.</p><p id="94cc">The official LLAMA-2 research paper is very transparent about this:</p><figure id="25a5"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*ofwpMwAwAhFulQm86nDjww.png"><figcaption><a href="https://scontent-ham3-1.xx.fbcdn.net/v/t39.2365-6/10000000_662098952474184_2584067087619170692_n.pdf?_nc_cat=105&amp;ccb=1-7&amp;_nc_sid=3c67a6&amp;_nc_ohc=qhK-ahCbkBMAX-t5zlE&amp;_nc_ht=scontent-ham3-1.xx&amp;oh=00_AfACUnmGnjbUktpeAgZklacI4ffUC5wPIYkdbGmJuXmFUg&amp;oe=64BE66FF">LLAMA-2 research paper</a></figcaption></figure><p id="2c9f">However, we are only at the beginning of this new AI paradigm, and given recent <a href="https://readmedium.com/wait-did-they-just-leak-the-secret-behind-gpt-4-845d3db751cd">discoveries about the nature of models like GPT-4 </a>or <a href="https://readmedium.com/what-openai-doesnt-want-you-to-know-the-deleted-sam-altman-article-96192b7cdfc7">the leaked agenda of OpenAI</a>, we may be surprised by a future where open-source models can catch up.</p><div id="ffac" class="link-block"> <a href="https://generativeai.pub/wait-did-they-just-leak-the-secret-behind-gpt-4-845d3db751cd"> <div> <div> <h2>Wait, Did They Just Leak The Secret Behind GPT-4?</h2> <div><h3>OpenAI’s GPT-4 may owe its capabilities to an old technique from the early 1990s known as “Mixture of Experts”</h3></div> <div><p>generativeai.pub</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*SePHbA3GeJiEgX51)"></div> </div> </div> </a> </div><h1 id="d506">How To Use LLAMA-2?</h1><p id="3c30">If you want to try LLAMA-2, you can do so with the generous demo apps at <b>Hugging Face</b>. Here are links to the 7B, 13B, and 70B chatbot versions:</p><h2 id="03e8">LLAMA-2 7B-CHAT</h2><p id="a051"><b>is optimized for ChatGPT-style conversations. Link: <a href="https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat"></a></b><a href="https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat">https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat</a></p><figure id="2fc6"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*Zv9CXdWNKAP7i0JskiteAA.png">

Options

<figcaption><a href="https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat">https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat</a></figcaption></figure><h2 id="5a9e">LLAMA-2 13B-CHAT</h2><p id="8dc2"><b>is another chatbot optimized model but with almost twice as much parameters as the previous 7B version. Link:</b> <a href="https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat">https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat</a></p><figure id="19bf"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*IZFFZP9UstG2rT632uFpCw.png"><figcaption><a href="https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat">https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat</a></figcaption></figure><h2 id="609a">LLAMA-2 70B-CHAT</h2><p id="9fb9"><b>comes with 70 billion parameters. Also chatbot-optimized. Link:</b> <a href="https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI">https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI</a></p><figure id="b4ad"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*NZ5uyDqp7c2Pmuf84OJZ6A.png"><figcaption><a href="https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI">https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI</a></figcaption></figure><p id="738c"><i>(If you want to run LLAMA-2 privately, you can<a href="https://tristwolff.medium.com/how-to-setup-use-the-hundreds-of-ai-models-from-hugging-face-14b8240b6f4"> follow this guide on how to run your own private instance of one of the hundreds of models provided by Huggingface</a>)</i></p><h1 id="9563">Further Reading &amp; Links</h1><ul><li><b>Official LLAMA-2 Github repository: <a href="https://github.com/facebookresearch/llama">https://github.com/facebookresearch/llama</a></b></li><li><b>LLAMA-2 Research paper (<a href="https://scontent-ham3-1.xx.fbcdn.net/v/t39.2365-6/10000000_662098952474184_2584067087619170692_n.pdf?_nc_cat=105&amp;ccb=1-7&amp;_nc_sid=3c67a6&amp;_nc_ohc=qhK-ahCbkBMAX-t5zlE&amp;_nc_ht=scontent-ham3-1.xx&amp;oh=00_AfACUnmGnjbUktpeAgZklacI4ffUC5wPIYkdbGmJuXmFUg&amp;oe=64BE66FF">LINK</a>)</b></li></ul><p id="bd57">➡️ Follow me to stay up to date on “AI &amp;<a href="https://readmedium.com/artificial-intelligence-the-misconception-of-creativity-1679d207f3e0">Creativity</a>”. If you want to support my work, become a Medium member using <a href="https://medium.com/@tristwolff/membership">my referral link and get full access to all my articles</a> (140+ and growing) and those of thousands of other writers. 🙏</p><div id="cba2" class="link-block"> <a href="https://medium.com/@tristwolff/membership"> <div> <div> <h2>Join Medium with my referral link - Tristan Wolff</h2> <div><h3>As a Medium member, a portion of your membership fee goes to writers you read, and you get full access to every story…</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*yhpYNoCLnuaujmxa)"></div> </div> </div> </a> </div><p id="1662">➡️ If you like my content, why not leave a “clap” at the end of this article, so more people can see it?</p><figure id="a30a"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*uQ_OBX60jlZrBdZ4.png"><figcaption></figcaption></figure><p id="3fa5"><b>This story is published on <a href="https://generativeai.pub/">Generative AI</a>. Connect with us on <a href="https://www.linkedin.com/company/generative-ai-publication">LinkedIn</a> to get the latest AI stories and insights right in your feed. Let’s shape the future of AI together!</b></p><figure id="5730"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*u9lrmc63b7QC9a9k.png"><figcaption></figcaption></figure></article></body>

Artificial Intelligence Rockstars

Facebook/Meta Storms The AI Charts With LLAMA-2

Facebook’s New Open-Source Language Model Available Now

Image by author & Midjourney

And they keep coming: we’ve seen the release of some truly amazing open-source language models here and here, but now it’s Facebook/Meta in the spotlight again.

This time with a rockstar release, they catapulted their new large language model (LLM) straight to pole position on the Huggingface Open LLM charts.

Ladies and gentlemen say hello to LLAMA-2, a collection of pre-trained and fine-tuned generative text models coming with a full commercial license!

Here is everything you need to know about LLAMA-2 and how you can start using it today!

What Is LLAMA-2?

LLAMA-2 is not a single language model but rather a suite of several pre-trained and fine-tuned large language models (LLMs) in different sizes (ranging from 7 to 70 billion parameters).

Some of the fine-tuned versions, referred to as LLAMA-2-Chat, are specifically enhanced for chatbot-style interfaces and apps like ChatGPT.

And, LLAMA-2 models rock. They are already dominating the so-called Open LLM Leaderboard at Huggingface, which means that they perform better than any other open-source large language model currently out there:

https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?utm_source=substack&utm_medium=email

Interestingly, LLAMA-2 models not only demonstrate superior performance compared with other open-source models across a range of benchmarks: but researchers also found promising results in terms of safety and utility, suggesting that LLAMA-2 models (coming with full commercial licenses) might serve as an alternative to proprietary models in many cases.

Take a look at this comparison of how different models can be forced to produce hate speech, unqualified advice, or criminal activities:

LLAMA-2 research paper

But wait, there’s more.

In another creative experiment, the researchers conducted “games” in which they let GPT-4 decide which open source model provides the best answers and thus wins.

Watch how LLAMA-2 beats some of the best models so far, such as Falcon, Vicuna, MPT, and PaLM-Bison, and even almost wins against the old ChatGPT-0301 model:

LLAMA-2 research paper

(For further information about the training process and the evaluation of the models, I recommend you read the official release paper here)

How Does LLAMA-2 Compare To ChatGPT?

Despite the really impressive results, there is still a lot of room for growth for open-source models like LLAMA-2 to reach the quality of closed models like OpenAI’s GPT-3.5 or GPT-4.

The official LLAMA-2 research paper is very transparent about this:

LLAMA-2 research paper

However, we are only at the beginning of this new AI paradigm, and given recent discoveries about the nature of models like GPT-4 or the leaked agenda of OpenAI, we may be surprised by a future where open-source models can catch up.

How To Use LLAMA-2?

If you want to try LLAMA-2, you can do so with the generous demo apps at Hugging Face. Here are links to the 7B, 13B, and 70B chatbot versions:

LLAMA-2 7B-CHAT

is optimized for ChatGPT-style conversations. Link: https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat

https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat

LLAMA-2 13B-CHAT

is another chatbot optimized model but with almost twice as much parameters as the previous 7B version. Link: https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat

https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat

LLAMA-2 70B-CHAT

comes with 70 billion parameters. Also chatbot-optimized. Link: https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI

https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI

(If you want to run LLAMA-2 privately, you can follow this guide on how to run your own private instance of one of the hundreds of models provided by Huggingface)

Further Reading & Links

➡️ Follow me to stay up to date on “AI &Creativity”. If you want to support my work, become a Medium member using my referral link and get full access to all my articles (140+ and growing) and those of thousands of other writers. 🙏

➡️ If you like my content, why not leave a “clap” at the end of this article, so more people can see it?

This story is published on Generative AI. Connect with us on LinkedIn to get the latest AI stories and insights right in your feed. Let’s shape the future of AI together!

Artificial Intelligence
Technology
Facebook
ChatGPT
Machine Learning
Recommended from ReadMedium