LLM Basics — What Every Developer Should Know — Part II

Hey! Welcome back.

I hope you had a chance to review Part I of the series and now understand two essential aspects of LLM architecture — Tokens & Embeddings.

Next, let's look at Parameters.

In the context of machine learning and LLMs, parameters are the elements of a model that are learned from training data. They are the backbone of LLMs, allowing them to learn from data and perform complex tasks.

Think of parameters as the internal settings of a neural network, adjusted through training so that the model predicts and generates more accurate outputs.

Parameters are primarily of two types:

1. Weights: the strengths of the connections between neurons across the different layers of an LLM. A weight determines how much influence one neuron's output has on another neuron's activation.

2. Biases: constant values added to a neuron's weighted sum of inputs before the activation function is applied. A bias shifts the point at which the neuron activates, which gives the model more flexibility to fit the data.
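To make weights and biases concrete, here is a minimal sketch of a single artificial neuron. The function name `neuron_output`, the sigmoid activation, and all numeric values are assumptions chosen for illustration. Real LLM layers apply the same idea to large matrices.

```python
import math


def neuron_output(inputs, weights, bias):
    """Weighted sum of the inputs plus a bias, passed through a sigmoid."""
    if len(inputs) != len(weights):
        raise ValueError("inputs and weights must have the same length")
    # Each weight scales how much its input contributes to the sum.
    z = sum(x * w for x, w in zip(inputs, weights)) + bias
    # The sigmoid squashes the result into the range (0, 1).
    return 1.0 / (1.0 + math.exp(-z))


inputs = [1.0, 2.0]
weights = [0.5, -0.25]  # hypothetical learned values

# With no bias the weighted sum is 0.0, so the sigmoid returns 0.5.
print(neuron_output(inputs, weights, bias=0.0))
# A positive bias shifts the same inputs toward a stronger activation.
print(neuron_output(inputs, weights, bias=2.0))
```

Training a network amounts to nudging values like `weights` and `bias` until the outputs match the training data more closely.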

Another common distinction is between model parameters and prompt parameters. Model parameters play a role at multiple levels in an LLM:

Training: Parameters are adjusted during training to better capture relationships and language structures.
Context and Meaning: Parameters enable LLMs to capture context. For example, in a transformer-based model, parameters help the model work out which words in a sentence are relevant to each other.
Text Generation: Parameters determine how a model constructs sentences and maintains coherence.
Model Size: This is the most common context in which parameters come up. The number of parameters is generally used as the measure of a model's size. A larger model is not necessarily a better one, although that notion persists.
Fine-Tuning: A model's parameters (weights and biases) can be further adjusted to specialize it for a specific task.
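The link between parameters and model size can be shown with a small counting sketch. It assumes a plain fully connected network, which is far simpler than a transformer. The helper `count_dense_parameters` and the layer sizes are hypothetical.

```python
def count_dense_parameters(layer_sizes):
    """Count the weights and biases in a fully connected network.

    layer_sizes lists the number of neurons per layer, input layer first.
    """
    # Every neuron in one layer connects to every neuron in the next layer.
    weights = sum(
        n_in * n_out for n_in, n_out in zip(layer_sizes, layer_sizes[1:])
    )
    # Every neuron after the input layer has one bias.
    biases = sum(layer_sizes[1:])
    return weights, biases


layer_sizes = [4, 8, 2]  # hypothetical tiny network
weights, biases = count_dense_parameters(layer_sizes)
print(f"weights={weights}, biases={biases}, total={weights + biases}")
# prints: weights=48, biases=10, total=58
```

When a model is described as having billions of parameters, the figure is a count of this kind taken over all of its layers.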

Prompt Parameters, on the other hand, refer to the variables or elements that are part of the input given to the model.

Input Text: The prompt's actual text guides the model on the expected response or output.
Control Tokens or Instructions: These might include special tokens or structured instructions that guide the model’s response style, length, or content.
Contextual Information: Any background or contextual data included in the prompt helps the model understand the context it needs to generate a response.
Parameters for Specificity: In some advanced uses, prompt parameters can include instructions on specificity or creativity, guiding the model on how closely it should stick to known information versus generating novel content.
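The prompt elements listed above can be sketched as a small data structure that assembles them into one input string. The class `PromptParts`, its field names, and the section labels are assumptions made for illustration. They do not correspond to any particular model's required format.

```python
from dataclasses import dataclass


@dataclass
class PromptParts:
    """Hypothetical container for the pieces that make up a prompt."""

    instructions: str      # control instructions: style, length, content
    context: str           # background the model needs
    input_text: str        # the actual text or question
    style_hint: str = ""   # optional guidance on specificity or creativity

    def render(self) -> str:
        sections = [
            ("Instructions", self.instructions),
            ("Context", self.context),
            ("Style", self.style_hint),
            ("Input", self.input_text),
        ]
        # Skip empty sections so the prompt contains only what was provided.
        return "\n\n".join(
            f"{label}:\n{text}" for label, text in sections if text
        )


parts = PromptParts(
    instructions="Summarize the text in two sentences.",
    context="The reader has no background in ornithology.",
    input_text="The Atlantic puffin is a seabird species in the auk family.",
)
print(parts.render())
```

Keeping the pieces separate makes it easier to vary one of them, such as the instructions, while holding the rest constant.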

Developer: Prompt Parameters? Are we talking about Prompt Engineering here?

Yes, this takes us to the next major topic — Prompting.

Prompting is the method of providing a model with a specific input to elicit a desired output or response.

A prompt can be a simple question or a large set of instructions with examples and can take many shapes:

Direct Instruction
Question-Based
Conversational
Creative Prompting
Zero-Shot or Few-Shot Learning

Prompt Engineering refers to the design and optimization of these prompt patterns.

Let's look at an example of each style of prompting.

Direct Instruction

User: Analyze the following text for emotional tone and key themes, then summarize your findings: The Atlantic puffin (Fratercula arctica) is a seabird species in the auk family and is the only puffin native to the Atlantic Ocean. It has a black crown and back, pale grey cheek patches and white underparts and its broad, boldly marked red and black beak and orange legs contrast with its otherwise sombre plumage. The Atlantic puffin spends the autumn and winter at sea, mainly in the North Atlantic, and returns to land at the start of the breeding season in late spring.

LLM: The provided text describes the Atlantic puffin (Fratercula arctica), focusing primarily on its physical characteristics and migration patterns. The emotional tone of the text is neutral and informative, devoid of personal opinion or emotional language.

Key themes in the text include:

Species Identification
Physical Description
Habitat and Behavior

Question-Based

User: Can you explain how blockchain technology works and its potential implications for the financial industry?

This prompting style requires the model to provide relevant information on blockchain technology and delve into its specific applications in the financial sector.

Conversational

User: I’ve been thinking about switching careers but am unsure about the risks involved. What should I consider before deciding, and how can I prepare for a career transition?

This prompt engages the model in a more nuanced conversation. It seeks guidance on a complex personal decision, inviting the model to provide thoughtful considerations and practical advice for a career change.
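A conversation is usually sent to a model as an ordered list of turns rather than a single string. The sketch below assumes a role/content message layout, which many chat interfaces use in some form. The helper `add_turn` and the assistant reply are placeholders, not real model output.

```python
def add_turn(history, role, content):
    """Return a new history list with one more message appended."""
    if role not in {"user", "assistant"}:
        raise ValueError(f"unknown role: {role}")
    return history + [{"role": role, "content": content}]


history = []
history = add_turn(
    history, "user",
    "I've been thinking about switching careers but am unsure about the risks.",
)
# Placeholder reply, written by hand for this example.
history = add_turn(
    history, "assistant",
    "What field are you in now, and which one are you considering?",
)
history = add_turn(
    history, "user",
    "How can I prepare for a career transition?",
)

for message in history:
    print(f"{message['role']}: {message['content']}")
```

Because the earlier turns are sent along with the latest question, the model can answer the final message in light of what was said before.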

Creative Prompting

User: Create a story set in a dystopian future where humans coexist with advanced AI. The narrative should focus on a human protagonist who discovers a hidden truth about the AI that changes their perception of reality.

This style of prompting invites the model to be creative and craft a detailed, imaginative story based on the elements provided.

Zero-Shot Learning

User: Explain the principle behind solar panels and how they contribute to sustainable energy. Also, discuss the challenges in their widespread adoption.

The model is prompted to provide detailed information on a topic (solar panels) without prior examples, relying on its pre-trained knowledge to explain the technology and discuss its benefits and challenges.

Few-Shot Prompting

[Figure: Few-Shot Prompting example]

This style focuses on providing a few examples followed by a new question. The expectation is that the model will use the examples and respond in the same format and with the same depth of explanation.
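As a rough sketch of how a few-shot prompt can be put together, the helper below joins example question/answer pairs and ends with the new question left open. The function name `build_few_shot_prompt`, the `Q:`/`A:` labels, and the sample questions are assumptions for illustration.

```python
def build_few_shot_prompt(examples, new_question):
    """Join example Q/A pairs, then leave the new question's answer open."""
    blocks = [f"Q: {question}\nA: {answer}" for question, answer in examples]
    # The trailing "A:" invites the model to continue in the same format.
    blocks.append(f"Q: {new_question}\nA:")
    return "\n\n".join(blocks)


examples = [
    ("What is the capital of France?", "Paris"),
    ("What is the capital of Japan?", "Tokyo"),
]
prompt = build_few_shot_prompt(examples, "What is the capital of Canada?")
print(prompt)
```

Passing an empty `examples` list yields only the new question, which is the zero-shot case described earlier.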

Developer: Okay, so far we have covered Tokenization, Embedding, Parameters and Prompting. Is there anything else?

Yes, the last piece of the puzzle is Fine-Tuning, which refers to the process of taking a pre-trained model and training it further on a smaller, task-specific dataset to specialize its capabilities.

Below are the key aspects of fine-tuning:

Pre-Trained Models: Fine-tuning starts with a model already trained on a large, general dataset. This pre-training equips the model with a broad understanding of language, context, and general knowledge.
Specialized Dataset: Fine-tuning involves training the model further on a dataset specific to the desired task or domain. For example, if you want the model to perform well in medical diagnostics, you would fine-tune it on medical texts.
Adjusting Parameters: During fine-tuning, the model’s parameters (weights and biases) are slightly adjusted to fit the specifics of the new data. This adjustment is typically smaller and more focused than the initial broad-scale training.
Reduced Training Time: Since the model is already pre-trained, fine-tuning is usually quicker and requires less data than training a model from scratch.
Task-Specific Performance: The goal of fine-tuning is to enhance the model’s performance on a specific task, making it more accurate, relevant, and effective in that particular area.
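The aspects above can be illustrated with a toy sketch. It uses a one-weight, one-bias linear model trained by gradient descent, which is an assumption made purely for illustration: real LLM fine-tuning involves far larger models and dedicated training libraries. All datasets, learning rates, and step counts here are hypothetical.

```python
def train(w, b, data, learning_rate, steps):
    """Gradient descent on mean squared error for the model y = w * x + b."""
    n = len(data)
    for _ in range(steps):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


def mse(w, b, data):
    """Mean squared error of the model on a dataset."""
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)


# "Pre-training": a larger, general dataset that follows y = 2x + 1.
general_data = [(k / 10, 2 * (k / 10) + 1) for k in range(-20, 21)]
w, b = train(0.0, 0.0, general_data, learning_rate=0.1, steps=500)

# "Fine-tuning": a small, specialized dataset that follows y = 2.2x + 1.3.
task_data = [(0.0, 1.3), (1.0, 3.5), (2.0, 5.7)]
loss_before = mse(w, b, task_data)
# Start from the pre-trained values; use fewer steps and a smaller rate.
w_ft, b_ft = train(w, b, task_data, learning_rate=0.05, steps=50)
loss_after = mse(w_ft, b_ft, task_data)

print(f"pre-trained: w={w:.3f}, b={b:.3f}, task loss={loss_before:.4f}")
print(f"fine-tuned:  w={w_ft:.3f}, b={b_ft:.3f}, task loss={loss_after:.4f}")
```

The fine-tuning run starts from the pre-trained values instead of from zero, needs only three data points and a fraction of the steps, and still lowers the loss on the specialized task.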

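A related example of specialization is GPTs by OpenAI (https://openai.com/blog/introducing-gpts), which are custom versions of ChatGPT that can be created for a specific purpose. Note that GPTs are customized through instructions, added knowledge, and actions rather than by retraining the model's weights, so they are closer to packaged prompting than to fine-tuning in the strict sense.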

Developer: This is a great start. Is there anything else?

That's pretty much it. To succeed in today's tech landscape, it's vital for developers to understand LLMs and the nuances of prompting and fine-tuning.

Remember, they are not just tools for today but a foundation for a future where the interaction between humans and machines becomes more seamless, intelligent, and beneficial for all.

Thanks for reading!