avatarThe Pareto Investor

Free AI web copilot to create summaries, insights and extended knowledge, download it at here

2386

Abstract

ulations arose about “Miqu” being a quantized version of a Mistral model, possibly an internal leak or a rogue move by an employee or customer.</p><p id="2e39"><b>However, I suspect that Mistral orchestrated everything, given the team’s notably peculiar communication style.</b></p><p id="4372">Also, to read:</p><div id="3511" class="link-block"> <a href="https://readmedium.com/chatgpt-has-just-been-dethroned-by-french-geniuses-bcee41843775"> <div> <div> <h2>ChatGPT has Just Been Dethroned by French Geniuses!</h2> <div><h3>These Three Individuals, a Former Researcher at DeepMind and Two Others from Meta, Completely Transformed the AI Game!</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*6IIgPUJ-0UPUL4xe3Y5IKg.png)"></div> </div> </div> </a> </div><h1 id="d33d">Mistral CEO’s Clarification</h1><p id="5b65">Arthur Mensch, co-founder and CEO of Mistral, addressed the leak on X.</p> <figure id="ad97"> <div> <div> <img class="ratio" src="http://placehold.it/16x9"> <iframe class="" src="https://cdn.embedly.com/widgets/media.html?type=text%2Fhtml&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;schema=twitter&amp;url=https%3A//twitter.com/arthurmensch/status/1752737462663684344%3Fs%3D20&amp;image=" allowfullscreen="" frameborder="0" height="281" width="500"> </div> </div> </figure></iframe></div></div></figure><p id="7aa7">He confirmed that an over-enthusiastic customer of Mistral leaked a quantized version of an old model, which was initially retrained from Meta’s Llama 2.</p><figure id="a8f3"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*lMhxCoI1OZgOGmR6CV7RFw.png"><figcaption>LMSys Leaderboard. (Screenshot from Dec 22, 2023) Mixtral 8x7B Instruct v0.1 achieves an Arena Elo rating of 1121 outperforming Claude-2.1 (1117), all versions of GPT-3.5-Turbo (1117 best), Gemini Pro (1111), and Llama-2–70b-chat (1077). Mixtral is currently the best open-weights model by a large margin.</figcaption></figure><p id="c869">Mensch’s response suggests that Mistral is actively developing a model comp

Options

arable to GPT-4, hinting at exciting advancements to come.</p><h1 id="75ed">Quantization?</h1><p id="2d1c">Quantization, the process mentioned in this context, is a technique in machine learning (ML) that simplifies AI model architectures for use on less powerful hardware.</p><p id="6298">This approach could democratize access to advanced AI technologies, previously limited to those with high-end computing resources.</p><h1 id="54f4">Open-Source AI</h1><p id="d57b">If Mistral or another open-source initiative releases a model rivaling GPT-4, it could shift the competitive landscape significantly.</p><p id="5fb2"><b>This situation potentially represents a pivotal moment for open-source generative AI.</b></p><figure id="40bc"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*IYi1hQBVdE6XRPIA"><figcaption>Mixtral 8x7b is exceptional — It’s like a light belt fighting in heavyweight!</figcaption></figure><p id="bb43">Such a development could challenge the dominance of proprietary models like GPT-4, especially as more businesses consider integrating open-source solutions into their applications.</p><h1 id="1386">Competitive Pressure</h1><p id="b325">OpenAI, the organization behind GPT-4, might face substantial competition if an open-source model of similar capability becomes widely available.</p><p id="a84f">While OpenAI currently leads with its advanced versions like GPT-4 Turbo and GPT-4V (vision), the fast-paced advancements in open-source AI could redefine this dynamic market.</p><p id="e49b"><b>The crucial question is whether OpenAI’s head start, and unique features will be enough to maintain its leadership in the LLM space?</b></p><h1 id="a386">Unpredictable AI Future</h1><p id="31cc"><b>So, the “Miqu” leak is true, it’s fascinating to see how open-source models are stepping up, challenging the big proprietary players.</b></p><p id="0f62">What’s really interesting here is how this event not only shows the rapid progress in AI tech but also suggests a future where both open-source and proprietary models coexist.</p><p id="590d">It’s a really exciting time in the field!</p><p id="39ed">Sincerely,</p><p id="9a43">The Pareto Investor</p><p id="e2e4"><i>My Book Free to Read: <a href="https://paretoinvestor.substack.com/p/the-little-principle-that-beats-the-market">The Little Principle That Beats the Market</a></i></p></article></body>

Mistral CEO Confirms ‘Leak’ of New Open-Source AI Model Nearing GPT-4 Performance

The new open-source large language model (LLM), rumored to approach the performance of GPT-4, a benchmark in the field.

How to Get Rich with Investing (without Getting Lucky)

Arthur Mensch — Mistral CEO

The buzz began when a user, “Miqu Dev,” uploaded files on HuggingFace, a prominent open-source AI platform.

These files allegedly represent a new LLM, “miqu-1–70b,” closely related to Mistral’s technology, a leading open-source AI firm from Paris.

An Unexpected Leak

This development took an intriguing turn with an anonymous 4chan post, possibly by “Miqu Dev,” leading to widespread online discussion.

The AI community, including on X, LinkedIn, and other platforms, began analyzing the potential of this new model.

Speculations arose about “Miqu” being a quantized version of a Mistral model, possibly an internal leak or a rogue move by an employee or customer.

However, I suspect that Mistral orchestrated everything, given the team’s notably peculiar communication style.

Also, to read:

Mistral CEO’s Clarification

Arthur Mensch, co-founder and CEO of Mistral, addressed the leak on X.

He confirmed that an over-enthusiastic customer of Mistral leaked a quantized version of an old model, which was initially retrained from Meta’s Llama 2.

LMSys Leaderboard. (Screenshot from Dec 22, 2023) Mixtral 8x7B Instruct v0.1 achieves an Arena Elo rating of 1121 outperforming Claude-2.1 (1117), all versions of GPT-3.5-Turbo (1117 best), Gemini Pro (1111), and Llama-2–70b-chat (1077). Mixtral is currently the best open-weights model by a large margin.

Mensch’s response suggests that Mistral is actively developing a model comparable to GPT-4, hinting at exciting advancements to come.

Quantization?

Quantization, the process mentioned in this context, is a technique in machine learning (ML) that simplifies AI model architectures for use on less powerful hardware.

This approach could democratize access to advanced AI technologies, previously limited to those with high-end computing resources.

Open-Source AI

If Mistral or another open-source initiative releases a model rivaling GPT-4, it could shift the competitive landscape significantly.

This situation potentially represents a pivotal moment for open-source generative AI.

Mixtral 8x7b is exceptional — It’s like a light belt fighting in heavyweight!

Such a development could challenge the dominance of proprietary models like GPT-4, especially as more businesses consider integrating open-source solutions into their applications.

Competitive Pressure

OpenAI, the organization behind GPT-4, might face substantial competition if an open-source model of similar capability becomes widely available.

While OpenAI currently leads with its advanced versions like GPT-4 Turbo and GPT-4V (vision), the fast-paced advancements in open-source AI could redefine this dynamic market.

The crucial question is whether OpenAI’s head start, and unique features will be enough to maintain its leadership in the LLM space?

Unpredictable AI Future

So, the “Miqu” leak is true, it’s fascinating to see how open-source models are stepping up, challenging the big proprietary players.

What’s really interesting here is how this event not only shows the rapid progress in AI tech but also suggests a future where both open-source and proprietary models coexist.

It’s a really exciting time in the field!

Sincerely,

The Pareto Investor

My Book Free to Read: The Little Principle That Beats the Market

Artificial Intelligence
Technology
Business
Machine Learning
Programming
Recommended from ReadMedium