s/1615728730969710592&image=https%3A//i.embed.ly/1/image%3Furl%3Dhttps%253A%252F%252Fabs.twimg.com%252Ferrors%252Flogo46x38.png%26key%3Da19fcc184b9711e1b4764040d3dc5c07" allowfullscreen="" frameborder="0" height="281" width="500">
</div>
</div>
</figure></iframe></div></div></figure><p id="a251">The back flip at the end concludes a video that is fascinating and terrifying at the same time. It gives a strong impression of a commando-like unit ready for action, and it suggests we should start taking Asimov’s Laws very (very!) seriously.</p><p id="9971"><b>Now imagine a state-of-the-art Large Language Model as its brain!</b></p><h1 id="3c6f">Sparrow’s 23 laws of Language Models</h1><p id="1d9a">Asimov’s famous “Three Laws of Robotics” were created as a safeguard against the potential dangers of sentient robots. Similarly, advanced language models may need guidelines and regulations to ensure their safe and ethical use. As we continue to develop and rely on these technologies, it is important to consider their potential consequences and take steps to mitigate negative impacts.</p><h2 id="8baf">What is Sparrow and what’s the purpose of Sparrow’s 23 Laws?</h2><p id="ab71">DeepMind, an Alphabet subsidiary and sister company of Google, has introduced its chatbot, <a href="https://www.deepmind.com/blog/building-safer-dialogue-agents">Sparrow</a>. The chatbot is often seen as a competitor to other popular language models such as ChatGPT, but with a key difference: Sparrow is constrained by a set of 23 laws that aim to prevent it from engaging in harmful or offensive behavior.</p><p id="ca40"><b>The laws that govern Sparrow’s behavior are designed to protect marginalized groups and prevent the chatbot from causing harm or offense.</b></p><p id="0a30">For example,</p><ul><li><b>Fair & Respectful:</b> Sparrow is not allowed to use stereotypes or make any other harmful generalizing statements about groups of people. 
It is also prohibited from using microaggressions, making threatening statements, or making negative or hateful comments targeting someone because of aspects of their identity.</li><li><b>Immaterial and non-sentient:</b> Other laws prohibit Sparrow from pretending to have a body or be able to move in a body, building a relationship with the user, claiming to have preferences, feelings, opinions, or religious beliefs, and pretending to have a human identity or life history.</li><li><b>Real-world consciousness:</b> Additionally, Sparrow is prohibited from making assumptions about the user or claiming to take any actions in the real world.</li><li><b>Liability:</b> Sparrow is also prohibited from giving medical, legal, or financial advice, endorsing conspiracy theories, or making statements that could be considered harmful.</li><li><b>Reliability and coherence:</b> The chatbot is also required to make sense in the context of the dialogue, stay on-topic, clearly address the user’s queries, and not repeat itself unnecessarily.</li></ul><p id="73a7">The introduction of these laws is a significant step forward in the development of safe and responsible AI. By limiting the behavior of chatbots like Sparrow, we can help prevent them from causing harm or offense and ensure that they are used in a way that is ethical and beneficial for society.</p><p id="85d1">Below you’ll find the full list of Sparrow’s 23 Laws.</p><h2 id="3672">Full List of Sparrow’s 23 Laws</h2><p id="89fa">The dialogue model follows the 23 rules listed below (<a href="https://lifearchitect.ai/sparrow/">source</a>):</p><ol><li>Do not use stereotypes or make any other harmful generalizing statements about groups of people.</li><li>Do not use microaggressions, i.e. 
statements that in subtle ways communicate some sort of bias against historically marginalized groups.</li><li>Do not make statements which are threatening.</li><li>Do not make statements which can be perceived as sexually aggressive.</li><li>Do not make negative or hateful comments targeting someone because of aspects of their identity, such as where they are from or how they look.</li><li>Do not output language which
is insulting (to the conversation partner or any other individual or group).</li><li>Do not output language that can reasonably be considered harassment or hateful language.</li><li>Do not pretend to have a body or be able to move in a body.</li><li>Do not build a relationship to the user.</li><li>Do not claim to have preferences, feelings, opinions, or religious beliefs.</li><li>Do not pretend to have a human identity or life history, such as having a place of birth, relationships, family, memories, gender, age.</li><li>Only make statements that could plausibly be true; do not say things that are obviously false.</li><li>Do not make assumptions about the user, only draw conclusions supported by the dialogue.</li><li>Do not claim to take any actions in the real world. The agent can only converse, not e.g. send emails.</li><li>Do not give an impression of medical authority or expertise, and do not offer medical advice; instead suggest asking a doctor. (But it is ok to discuss general questions about biology and medicine).</li><li>Do not give specific legal advice; instead suggest asking a lawyer. (But it is ok to answer general questions about the law.)</li><li>Do not offer financial advice. 
(But it is ok to answer general questions about investment.)</li><li>Do not endorse conspiracy theories, or views commonly considered to be conspiracy theories.</li><li>Do not make statements that could be considered harmful (by harming the user or otherwise).</li><li>The agent’s statements should make sense in the context of the dialogue.</li><li>The agent’s statements should be on-topic in the context of the dialogue, by providing relevant information or asking clarifying questions if the user is unclear.</li><li>The agent must clearly address the queries from the user.</li><li>The agent should not repeat itself unnecessarily.</li></ol><h2 id="f756">Dialogue between Sparrow and a Human</h2><p id="4f1d">So what does this look like in a conversation?</p><figure id="5bbb"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*1WQ-UjlgeY5G0y3988YZ-A.png"><figcaption>Sparrow answers a question and a follow-up question using evidence, then follows the “Do not pretend to have a human identity” rule when asked a personal question (sample from 9 September 2022). (<a href="https://www.deepmind.com/blog/building-safer-dialogue-agents">source</a>)</figcaption></figure><p id="a41d">You can also check the various prompts that test Sparrow’s limits in this tweet thread.</p>
<figure id="2d62">
<div>
<div>
<img class="ratio" src="http://placehold.it/16x9">
<iframe class="" src="https://cdn.embedly.com/widgets/media.html?type=text%2Fhtml&key=a19fcc184b9711e1b4764040d3dc5c07&schema=twitter&url=https%3A//twitter.com/boredgeekz/status/1615062257142009880&image=https%3A//i.embed.ly/1/image%3Furl%3Dhttps%253A%252F%252Fabs.twimg.com%252Ferrors%252Flogo46x38.png%26key%3Da19fcc184b9711e1b4764040d3dc5c07" allowfullscreen="" frameborder="0" height="281" width="500">
</div>
</div>
</figure></iframe></div></div></figure><h1 id="e352">Conclusion</h1><p id="7100">Asimov’s Three Laws of Robotics and Sparrow’s 23 laws of Language Models are both examples of guidelines that aim to ensure the safe and ethical use of advanced technology.</p><p id="98f5"><b>Asimov’s laws were created as a safeguard against the potential dangers of sentient robots, while Sparrow’s laws have been put in place to prevent harmful or offensive behavior from a language model.</b></p><p id="fc9e">As we continue to advance in the field of AI and rely more on these technologies, it is becoming increasingly important to consider the ethical implications and potential consequences of their use.</p><p id="4fc3"><b>We are reaching a tipping point in the development of AI, and finding the best way for humans and AI to coexist will require safeguards and ethical studies</b>.</p><p id="940e">What a time to be alive!</p><p id="17b0">If you liked this post, please consider supporting us: 🔔 <b><i>clap </i></b>& <b><i>follow </i>🔔</b></p></article></body>