What Is Behind the Scenes of a Chatbot NLU 🤖?
A few posts ago we introduced the basic components of a chatbot system. Today we'll dive a bit deeper into the nuts and bolts.
To review, the sequence of actions behind the scenes looks like the following:
How does the NLU component work?
Once a user message is received, the NLU engine runs a sequence of steps to understand what's going on:
- Tokenization
- Featurization
- Entity tagging
- Intent classification
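To make the four steps concrete, here is a toy, self-contained sketch in Python. Every component below (the whitespace tokenizer, the bag-of-words featurizer, the lexicon-based entity tagger, and the keyword-overlap "classifier") is a deliberately simplified stand-in for the real components discussed next, not an actual NLU engine:

```python
# A toy NLU pipeline illustrating the four steps above.
# All components are simplified stand-ins, not production NLU.

def tokenize(text):
    """Step 1: split the message into tokens."""
    return text.lower().replace("?", "").split()

def featurize(tokens, vocab):
    """Step 2: turn tokens into a bag-of-words count vector."""
    return [tokens.count(word) for word in vocab]

def tag_entities(tokens, entity_lexicon):
    """Step 3: tag tokens that match a known entity lexicon."""
    return {tok: entity_lexicon[tok] for tok in tokens if tok in entity_lexicon}

def classify_intent(features, intent_keywords, vocab):
    """Step 4: score each intent by keyword overlap
    (a stand-in for a trained classifier)."""
    scores = {
        intent: sum(features[vocab.index(w)] for w in words if w in vocab)
        for intent, words in intent_keywords.items()
    }
    return max(scores, key=scores.get)

vocab = ["balance", "checking", "transfer", "money", "account"]
entity_lexicon = {"checking": "account_type"}
intent_keywords = {"check_balance": ["balance"], "transfer_funds": ["transfer", "money"]}

tokens = tokenize("What is the balance of my checking account?")
features = featurize(tokens, vocab)
entities = tag_entities(tokens, entity_lexicon)
intent = classify_intent(features, intent_keywords, vocab)
print(intent, entities)  # → check_balance {'checking': 'account_type'}
```

In a real system each of these functions is replaced by a trained component, as described below.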
Letâs review part of the dialogue from last time:
What are the options for each of these steps?
In some commercial systems, each of these components is highly customizable. For instance:

- For tokenization, you may want to use open-source NLP libraries like spaCy, which supports tokenization for many languages.
- For featurization, you can choose among various embeddings, including pre-trained models like BERT or your own domain-fine-tuned representations.
- For entity tagging, spaCy offers strong NER capabilities.
- For intent classification, you can plug in sklearn classifiers or transformers; Rasa ships a default joint classifier and tagger model called DIET (Dual Intent and Entity Transformer).
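For concreteness, in Rasa this kind of pipeline is declared in `config.yml`. The component names below come from Rasa's documented pipeline components, but treat this as an illustrative sketch and check the docs for your Rasa version before copying it:

```yaml
pipeline:
  - name: SpacyNLP              # loads a spaCy language model
    model: en_core_web_md
  - name: SpacyTokenizer        # tokenization
  - name: SpacyFeaturizer       # dense features from spaCy embeddings
  - name: SpacyEntityExtractor  # spaCy's pre-trained NER
  - name: CountVectorsFeaturizer
  - name: DIETClassifier        # joint intent classifier + entity tagger
    epochs: 100
```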
How does the dialogue manager work?
Dialogue management systems typically can be composed using a combination of the following:
- Rules
- Models
The dialogue manager dictates the bot's actions. The bot's designers usually have some expectation of the "happy paths", as well as the common ways things can go wrong. They can simply use IF-THEN logic to encode these rules into the "action plan", as in the following example:
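A rule-based policy of this kind can be sketched as a lookup from NLU output to the next action. The intent, entity, and action names here are made up for illustration:

```python
# A toy rule-based dialogue policy: IF the NLU output matches a condition,
# THEN return a fixed next action. All names are hypothetical.

RULES = {
    ("greet", None): "utter_greet",
    ("check_balance", "account_type"): "action_fetch_balance",
    ("check_balance", None): "utter_ask_account_type",
}

def rule_based_policy(intent, entities):
    """Pick the next bot action from hand-written rules."""
    entity_type = next(iter(entities.values()), None)
    return RULES.get((intent, entity_type), "action_default_fallback")

print(rule_based_policy("check_balance", {"checking": "account_type"}))
# → action_fetch_balance
print(rule_based_policy("check_balance", {}))
# → utter_ask_account_type
```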
Another reason rules are necessary is to constrain the bot for the sake of a bounded customer experience. Customers don't want surprises. Sometimes a customer's inquiry has absolutely no ambiguity, e.g., "What is the current balance of my checking account ending in 0325?" In that case, there's no need to take the risk of predicting the bot's next action with a probabilistic model.
Otherwise, a model-based approach can be used. Rasa offers a transformer-based sequence prediction model called the Transformer Embedding Dialogue (TED) policy. Using our example dialogue, we can see that each of the bot's responses carries its own action label. With rules and context, the dialogue manager can learn to predict the correct action at each response point. The arrows below illustrate where the dialogue manager needs to predict the next action.
In practice, most bots mix rule-based and ML-based dialogue management.
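One common way to combine the two is to try the rules first and fall back to a learned policy only when no rule matches. This is a minimal sketch of that idea, with a hard-coded stand-in for a trained model and made-up intent/action names:

```python
# Hybrid dialogue management: rules handle the unambiguous cases;
# a (stand-in) learned policy handles everything else.

RULES = {"check_balance": "action_fetch_balance"}  # unambiguous, no surprises

def model_predict(dialogue_history):
    """Stand-in for a learned policy such as Rasa's TED:
    scores candidate actions and returns the best one."""
    actions = ["utter_clarify", "action_search_faq", "utter_handoff_to_human"]
    scores = [0.2, 0.7, 0.1]  # would come from the model in a real system
    return actions[scores.index(max(scores))]

def next_action(intent, dialogue_history):
    if intent in RULES:                      # rule: bounded behavior
        return RULES[intent]
    return model_predict(dialogue_history)   # model: open-ended cases

print(next_action("check_balance", []))      # → action_fetch_balance
print(next_action("ask_mortgage_rates", []))  # → action_search_faq
```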
This reminds me of the challenge in robotics of leveraging natural language prompts and an NLU system to decide how to execute a task. The following is an example from Google Research.
The blue bar shows how useful the language model estimates a skill to be for the task at hand; the red bar shows how likely the system is to successfully execute that skill; and the green bar shows the combined score used to finally select a skill to execute.
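The skill selection just described can be sketched as multiplying the two scores and picking the argmax. The skills and numbers below are invented purely to illustrate the combination step:

```python
# Combining the two scores above: the language model's usefulness estimate
# (blue bar) times the execution-success estimate (red bar) yields the
# combined score (green bar). Skills and values are made up for illustration.

skills = ["find a sponge", "go to the table", "pick up the apple"]
usefulness = {"find a sponge": 0.6, "go to the table": 0.3, "pick up the apple": 0.1}
success = {"find a sponge": 0.9, "go to the table": 0.8, "pick up the apple": 0.2}

combined = {s: usefulness[s] * success[s] for s in skills}
chosen = max(combined, key=combined.get)
print(chosen)  # → find a sponge
```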
In other words, the following is how the robot might be thinking about this problem:
As you can see, where NLP and robotics cross paths is exactly "dialogue management" (if not more). That's where actions are determined. A chatbot simply responds with natural language or takes a digital action, whereas a robot might take a physical action to fulfill the user's request.
Lastly, the Google Research video is really short and worth a view:
https://www.youtube.com/watch?v=E2R1D8RzOlM&t=109s
Happy practicing!
Thanks for reading my newsletter. You can follow me on LinkedIn!
Sources of images and quotes: Stanford MLSys Seminars, "ML Frameworks for Chatbots feat. Chris Kedzie" (Stanford MLSys Seminar Episode 34); Rasa, https://rasa.com/