GPT-4 Lost This Battle 449 to 28

Free AI web copilot to create summaries, insights and extended knowledge, download it at here

9223

Abstract

e results of relevant internal and external testing and optimization of the model.</li><li><b>Member states</b> [Annex VIII, Section C, page 24]. Member States in which the foundation model is or has been placed on the market, put into service, or made available in the Union.</li><li><b>Downstream documentation</b> [Annex VIII, 60g, page 29 as well as Article 28b, paragraph 2e, page 40]. Also, foundation models should have information obligations and prepare all necessary technical documentation for potential downstream providers to be able to comply with their obligations under this Regulation.</li><li><b>Machine-generated content</b> [Annex VIII, 60g, page 29]. Generative foundation models should ensure transparency about the fact the content is generated by an AI system, not by humans.</li><li><b>Pre-market compliance</b> [Article 28b, paragraph 1, page 39]. A provider of a foundation model shall, prior to making it available on the market or putting it into service, ensure that it is compliant with the requirements set out in this Article, regardless of whether it is provided as a standalone model or embedded in an AI system or a product, or provided under free and open source licenses, as a service, as well as other distribution channels.</li><li><b>Data governance</b> [Article 28b, paragraph 2b, page 39]. Process and incorporate only datasets that are subject to appropriate data governance measures for foundation models, in particular, measures to examine the suitability of the data sources and possible biases and appropriate mitigation.</li><li><b>Energy</b> [Article 28b, paragraph 2d, page 40]. Design and develop the foundation model, making use of applicable standards to reduce energy use, resource use and waste, as well as to increase energy efficiency and the overall efficiency of the system. This shall be without prejudice to relevant existing Union and national law, and this obligation shall not apply before the standards referred to in Article 40 are published. They shall be designed with capabilities enabling the measurement and logging of the consumption of energy and resources and, where technically feasible, another environmental impact the deployment and use of the systems may have over their entire lifecycle.</li><li><b>Quality management</b> [Article 28b, paragraph 2f, page 40]. Establish a quality management system to ensure and document compliance with this Article, with the possibility to experiment in fulfilling this requirement.</li><li><b>Upkeep</b> [Article 28b, paragraph 3, page 40]. Providers of foundation models shall, for a period ending ten years after their foundation models have been placed on the market or put into service, keep the technical documentation referred to in paragraph 1(c) at the disposal of the national competent authorities.</li><li><b>Law-abiding generated content</b> [Article 28b, paragraph 4b, page 40]. Train, and, where applicable, design and develop the foundation model in such a way as to ensure adequate safeguards against the generation of content in breach of Union law in line with the generally acknowledged state of the art and without prejudice to fundamental rights, including the freedom of expression.</li><li><b>Training on copyrighted data</b> [Article 28b, paragraph 4c, page 40]. Without prejudice to national or Union legislation on copyright, document and make publicly available a sufficiently detailed summary of the use of training data protected under copyright law.</li><li><b>Adherence to general principles</b> [Article 4a, paragraph 1, page 142–3]. All operators falling under this Regulation shall do their best to develop and use AI systems or foundation models in accordance with the following general principles establishing a high-level framework that promotes a coherent humancentric European approach to ethical and trustworthy Artificial Intelligence, which is fully in line with the Charter as well as the values on which the Union is founded: a) ‘human agency and oversight’ means that AI systems shall be developed and used as a tool that serves people, respects human dignity and personal autonomy, and that is functioning in a way that can be appropriately controlled and overseen by humans. b) ‘technical robustness and safety means that AI systems shall be developed and used in a way to minimize unintended and unexpected harm as well as be robust in case of unintended problems and be resilient against attempts to alter the use or performance of the AI system so as to allow unlawful use by malicious third parties. c) ‘ Privacy and data governance’ means that AI systems shall be developed and used in compliance with existing privacy and data protection rules while processing data that meets high standards in terms of quality and integrity. d) ‘transparency’ means that AI systems shall be developed and used in a way that allows appropriate traceability and explainability while making humans aware that they communicate or interact with an AI system as well as duly informing users of the capabilities and limitations of that AI system and affected persons about their rights. e) ‘diversity, non-discrimination and fairness’ means that AI systems shall be developed and used in a way that includes diverse actors and promotes equal access, gender equality, and cultural diversity while avoiding discriminatory impacts and unfair biases that are prohibited by Union or national law. f) ‘social and environmental well-being’ means that AI systems shall be developed and used in a sustainable and environmentally friendly manner as well as in a way to benefit all human beings while monitoring and assessing the long-term impacts on the individual, society, and democracy. For foundation models, the general principles are translated into and complied with by providers by means of the requirements set out in Articles 28 to 28b.</li><li><b>The system is designed so users know it's an AI</b> [Article 52(1) <a href="https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=celex%3A52021PC0206">Paragraph</a> 1 — not in the Compromise text, but invoked in 28(b), paragraph 4a, page 40]. Providers shall ensure that AI systems intended to interact with natural persons are designed and developed in such a way that natural persons are informed that they are interacting with an AI system unless this is obvious from the circumstances and the context of use. This obligation shall not apply to AI systems authorized by law to detect, prevent, investigate, and prosecute criminal offenses unless those systems are available for the public to report a criminal offense.</li><li><b>Appropriate levels</b> [Article 28b, paragraph 2c, page 39]. design and develop the foundation model in order to achieve throughout its lifecycle appropriate levels of performance, predictability, interpretability, corrigibility, safety, and cybersecurity assessed through appropriate methods such as model evaluation with the involvement of independent experts, documented analysis, and extensive testing during conceptualization, design, and development</li></ol><h2 id="1806">12 of the 22 requirements were further assessed and graded</h2><figure id="3a5f"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*FE64jUakTXFgcXoet3rNwA.png"><figcaption>Source: <a href="https://github.com/stanford-crfm/TransparencyIndex/blob/main/rubrics.md">Stanford.edu/git</a></figcaption></figure><h1 id="0333">How is it looking like, Chief?</h1><p id="8952">I will not lie; the results are not surprising, but certain themes are !! Out of the 12 requirements that relate to the data and technical aspects of the implementation, 4 core categories develop which are important for the EU regulation.</p><ul><li>Category 1: Data (includes sources, governance, and specific attention to the Copyrights)</li><li>Category 2: Compute (includes declaration about compute needed to replicate/train and Energy consumption during training as well as the inference)</li><li>Category 3: The Model Liabilities (includes Capabilities and limitations, risk mitigation plan and strategy, Evaluations, and Testing)</li><li>Category 4: Deployment (includes clarity on the requirements for the disclosure of the machine-generated content, disclosure to the EU member states when a model is being deployed and made available to EU, and requirements about the provision of technical compliance for the EU AI act.</li></ul><figure id="2646"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*mV1IRTlcFVyL3_wx.png"><figcaption>Source: <a href="https://crfm.stanford.edu/2023/06/15/eu-ai-act.html">Stanford.edu</a></figcaption></figure><p id="e3bd">The figure above shows the final scores. One could easily lump models with 24+ scores out of 48 (a passing grade — OpenAI, Google PaLM, Bloom, and the elueutherAI.</p><p id="0aad">Based on the scores above, I am worried about Category 1 and 3 of the 12/22 requirements. Most of the models have scored abysmally low based on Category 1. Given that most of the data was collected from internet scraping, a sizable chunk of it can be assumed to be generated from copyrighted material (explicit or implicit). It could be considered as even private data (if not

Options

copyrighted). Does the Category 1 language define the “fair use of internet data”? No. Is the regulation clear? “yes, as mud!”.</p><p id="8a0f">Another big issue is category 3, which is the risk mitigation attempt or plan. This includes an evaluation of the model. In its truly responsible manner, Meta has scored almost zero on their releases of the models. I sincerely think that they have not given 1% of the thought as much Google and OpenAI have given to the safety of their models. This is clearly visible from the scores.</p><p id="2264">For both category 1 and 3 an essential spice in the recipe is currently missing. A standard way to evaluate these models for both performance and safety. There is a lot of new literature covering this topic and I am hopeful that we can come up with a rubric to test these models in a more standard way.</p><p id="2223">Energy utilization and downstream technical documentation are the least of my worries and we should not sweat about it in IMHO. GPT-NeoX by ElutherAi and Bloom both get a crown in terms of being open and transparent about their data and disclosures.</p><h1 id="1d4b">No, we will not disclose vs regulation.</h1><p id="b29a">Both Google, in their PaLM paper and OpenAI/Microsoft in their release of GPT-4, clearly stated that they have no intent to disclose further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar. Although these requirements are really not hampering the safety of the model, these show the pressure on these companies to keep their developments secret.</p><p id="e0d3">The inherent push to be secret due to the competition is more harmful than the specific actions not to disclose technical details or secrete sauce recipes. If these companies are allowed to develop a system like patents using which they could keep their financial interests intact for X years, it will help to make these technologies more transparent without putting undue pressure on the source of innovation.</p><p id="ce90">Self-reporting has never worked. We need regulation, innovation, and the people to play together, which means we must have a strict but fair approach to regulation. I am glad to see the stricter side of regulation. I am sure this is just the pendulum swinging on the side of over-regulation after a long period of under-regulation. Transparency is the key to the trust of the end-user. Let us tread responsibly.</p><p id="a64e">If you have read it until this point — Thank you! You are a hero (and a Nerd ❤)! I try to keep my readers up to date with “interesting happenings in the AI world,” so please 🔔 <b><i>clap </i></b>| <b><i>follow | <a href="https://ithinkbot.com/subscribe">Subscribe</a> </i>🔔</b></p><p id="fe09">Become a member using the referral: <a href="https://ithinkbot.com/membership">https://ithinkbot.com/membership</a></p><p id="3954">Find me on Linkedin <a href="https://www.linkedin.com/in/mandarkarhade/">https://www.linkedin.com/in/mandarkarhade/</a></p><div id="065a" class="link-block"> <a href="https://pub.towardsai.net/how-do-8-smaller-models-in-gpt4-work-7335ccdfcf05"> <div> <div> <h2>How Do 8 Smaller Models in GPT4 Work?</h2> <div><h3>The secret “Model of Experts” is out; let's understand why GPT4 is so good!</h3></div> <div><p>pub.towardsai.net</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*JTMlYovgVjEgq8SL1GICyg.gif)"></div> </div> </div> </a> </div><div id="f073" class="link-block"> <a href="https://pub.towardsai.net/gpt-4-8-models-in-one-the-secret-is-out-e3d16fd1eee0"> <div> <div> <h2>GPT-4: 8 Models in One ; The Secret is Out</h2> <div><h3>GPT4 kept the model secret to avoid competition, now the secret is out!</h3></div> <div><p>pub.towardsai.net</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*vuZbGVQVpcWeEgTB-mr7Dg.png)"></div> </div> </div> </a> </div><div id="d232" class="link-block"> <a href="https://pub.towardsai.net/meet-mpt-30b-a-fully-opensouce-llm-that-outperforms-gpt-3-22f7b1e00e3e"> <div> <div> <h2>Meet MPT-30B: A Fully OpenSouce LLM that Outperforms GPT-3</h2> <div><h3>Releasing two fine-tuned variants, MPT-30B-Instruct and MPT-30B-Chat, that are built on top of MPT-30B</h3></div> <div><p>pub.towardsai.net</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*E0tjlFP_vR4uDy3t.jpg)"></div> </div> </div> </a> </div><div id="9408" class="link-block"> <a href="https://pub.towardsai.net/forget-lamp-stack-llm-stack-is-here-e628ae85aa3b"> <div> <div> <h2>Forget LAMP Stack: LLM stack is here!</h2> <div><h3>Huggingface has positioned itself as the new standard stack in the NLP/LLM ecosystem. Now the companies are asking for…</h3></div> <div><p>pub.towardsai.net</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*MU28BwjHagaXXfJL.png)"></div> </div> </div> </a> </div><div id="e2d8" class="link-block"> <a href="https://pub.towardsai.net/meet-gorilla-a-fully-opensource-llm-tuned-for-api-calls-7447c6cbc78"> <div> <div> <h2>Meet Gorilla: A Fully OpenSource LLM Tuned For API Calls</h2> <div><h3>Fewer Hallucinations and better than GPT-4 in writing API calls</h3></div> <div><p>pub.towardsai.net</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*KLhQcz3L-OlYvj9mk7gA2g.gif)"></div> </div> </div> </a> </div><div id="005e" class="link-block"> <a href="https://pub.towardsai.net/fine-tune-gpt-mode-using-lit-parrot-by-lightening-ai-164ca552f61f"> <div> <div> <h2>Fine Tune GPT mode using Lit-Parrot by Lightening-AI</h2> <div><h3>BYOD Bring Your Data! and Let’s Train on Your GPU</h3></div> <div><p>pub.towardsai.net</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*0ANJAOVMBkWrRAbi)"></div> </div> </div> </a> </div><div id="3c67" class="link-block"> <a href="https://pub.towardsai.net/wizardlm-fully-open-source-automated-instruction-data-generator-efd7d4efb77e"> <div> <div> <h2>WizardLM: Fully Open-source Automated Instruction Data Generator</h2> <div><h3>Automate tedious steps of instruction-based training data generation</h3></div> <div><p>pub.towardsai.net</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*KyHC78agrkUTeTv3)"></div> </div> </div> </a> </div><div id="19cf" class="link-block"> <a href="https://pub.towardsai.net/falcon-40b-a-fully-opensourced-foundation-llm-945dd9824157"> <div> <div> <h2>Falcon-40B: A Fully OpenSourced Foundation LLM</h2> <div><h3>Each Contributor hereby grants Grants to You a perpetual, worldwide, non-exclusive, irrevocable copyright license to…</h3></div> <div><p>pub.towardsai.net</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*0NqUA09dX97Gd1b6)"></div> </div> </div> </a> </div><div id="f7d4" class="link-block"> <a href="https://pub.towardsai.net/h2oai-releases-fully-opensourced-gpt-9bdcc2fc1f6d"> <div> <div> <h2>H2Oai releases Fully OpenSourced GPT</h2> <div><h3>h2oGPT-20B, h2oGPT-12B v1, and h2oGPT-12B v2 models have been released with Apache 2.0 license (Completely free for…</h3></div> <div><p>pub.towardsai.net</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*xBM-9qH2YVs_25tM2um7Yw.gif)"></div> </div> </div> </a> </div><figure id="4bb9"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*qneuOdUFNZ84xLfLtGaHLw.gif"><figcaption></figcaption></figure></article></body>

GPT-4 Lost This Battle 449 to 28

After GDPR, Europe’s push for Safe and Transparent AI will change the LLM landscape significantly.

Enter European Parliament

Obligations for General purpose AI (ChatGPT and other LLMs)

Do LLMs Comply with EU regulations?

The 22 requirements can be found here :

12 of the 22 requirements were further assessed and graded

How is it looking like, Chief?

No, we will not disclose vs regulation.

How Do 8 Smaller Models in GPT4 Work?

The secret “Model of Experts” is out; let's understand why GPT4 is so good!

GPT-4: 8 Models in One ; The Secret is Out

GPT4 kept the model secret to avoid competition, now the secret is out!

Meet MPT-30B: A Fully OpenSouce LLM that Outperforms GPT-3

Releasing two fine-tuned variants, MPT-30B-Instruct and MPT-30B-Chat, that are built on top of MPT-30B

Forget LAMP Stack: LLM stack is here!

Huggingface has positioned itself as the new standard stack in the NLP/LLM ecosystem. Now the companies are asking for…

Meet Gorilla: A Fully OpenSource LLM Tuned For API Calls

Fewer Hallucinations and better than GPT-4 in writing API calls

Fine Tune GPT mode using Lit-Parrot by Lightening-AI

BYOD Bring Your Data! and Let’s Train on Your GPU

WizardLM: Fully Open-source Automated Instruction Data Generator

Automate tedious steps of instruction-based training data generation

Falcon-40B: A Fully OpenSourced Foundation LLM

Each Contributor hereby grants Grants to You a perpetual, worldwide, non-exclusive, irrevocable copyright license to…

H2Oai releases Fully OpenSourced GPT

h2oGPT-20B, h2oGPT-12B v1, and h2oGPT-12B v2 models have been released with Apache 2.0 license (Completely free for…