Prefect — Orchestrate Your Machine Learning Workflow

Free AI web copilot to create summaries, insights and extended knowledge, download it at here

5274

Abstract

w. It is declared with a @task decorator to any type of function. Moreover, task functions can have inputs or outputs, parameter that specify if the task has to retry and more.</li></ul><figure id="e07d"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*QZ8m94hvthpO5n2KiFxBFg.png"><figcaption></figcaption></figure><ul><li>Flows are containers (as the <a href="https://orion-docs.prefect.io/concepts/flows/">doc</a> explains) because it wraps all the tasks and dependencies between them. It is used with the @flow decorator to a python function in which you can organize the tasks as you want and prefect will create links between them. It is possible to specify some parameters like the task_runner which is one that defines how the flow has to run (Concurrently or Sequentially for example). Prefect flows object automatically logs a lot of information about flow runs. You can have flows in another flow, that we call subflows.</li></ul><figure id="4b62"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*SGHLjWXPN6z99p2_XJm3kg.png"><figcaption></figcaption></figure>Having a flow to run can be done either manually by calling the script of the flow or by deploying it. Deployment is an important concept in Prefect. You can deploy a flow either in the Prefect Cloud (at a cost) or either locally (remotely on a VM also) within a Prefect Orion Server that we will now introduce.Prefect has an UI called Core or Orion (depending on the version) that makes possible the visualization of the flows with edges between tasks and the launch of flows directly with the API (from anywhere as long as an agent is available and the server up).<figure id="8c04"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*xlB741Yx0IY77FISuyCWvg.png"><figcaption></figcaption></figure>Once the UI is ready, you can access it at <a href="http://127.0.0.1:4200">http://127.0.0.1:4200</a> and find any flow, deployments, and runs. It is possible to filter by tags, date and more. If you want to know better about a flow run, you can have more information by selecting the one that you want. This is an example of the Radar View of a flow run :<figure id="85cc"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*GMELRFMMc0ANPcxI2oO3tg.png"><figcaption></figcaption></figure>It shows how the tasks were performed and the dependencies between each other.Let’s talk a little bit about Deployment and Logging.<h2 id="3d62">Deployments in Prefect</h2>Deployments are packaging the flows into Prefect Orion Server allowing it to be run with the API (not just with a python script) and add schedule on it (to run alone).Each deployment is backed by a flow, however, you can have multiple deployments for a unique flow and specify different parameters for example.Here is an example :<figure id="db4d"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*7UsoJ4oUvzDsvAdLXwF0pA.png"><figcaption></figcaption></figure>You can then run the following command to create the deployment :<code>prefect deployment create file.py</code>That will create the schedules of the runs in the Prefect Orion Server but it will not run the flow, you need Work Queues and Agents to achieve this. Work queues are associated with specific runs from deployments (depending on filter criteria) and sends agents to launch these runs when the time comes.To know more about work queues and agent, check <a href="https://orion-docs.prefect.io/concepts/work-queues/">this link</a>.<h2 id="b46d">Logging in Prefect</h2>It is possible to add any log you want, in addition to those integrated by Prefect, inside your task and flows by calling get_run_logger :<figure id="4eca"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*RjdG7ysDawjxr2JWBYVXTA.png"><figcaption></figcaption></figure><h1 id="f741">Conclusion</h1>Workflow Orchestration is a must have process when working with pipelines or a product/service involving various steps, all linked between each other. The idea behind tools like Prefect is to fight against negative engineering by having an eye on all the development pipeline and be able to fix quickly.Naturally, orchestration is about having a logic on the elements that work together and those frameworks improve the way it is done. What I mean is that it is possible to create a large pipeline and tell any task in it some condition to not ruin the entire process. This is clearly more difficult without workflow orchestration tools, especially when many people are working on the project.Thanks for reading this article, I hope you enjoy it and discover what is Workflow Orchestration as well as Prefect which is a complete tool.<h2 id="959b">Resources :</h2><div id="52c9" class="link-block"> <a href="https://orion-docs.prefect.io/"> <div>

Options

         <div>
            <h2>Home</h2>
            <div><h3>Prefect is Air Traffic Control for your dataflows. It's the coordination plane that provides you with everything from…</h3></div>
            <div><p>orion-docs.prefect.io</p></div>
          </div>
          <div>
            <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*YXSrBMW_plEsPQhM)"></div>
          </div>
        </div>
      </a>
    </div><div id="3bcb" class="link-block">
      <a href="https://neptune.ai/blog/best-workflow-and-pipeline-orchestration-tools">
        <div>
          <div>
            <h2>Best Workflow and Pipeline Orchestration Tools: Machine Learning Guide - neptune.ai</h2>
            <div><h3>Machine learning is rampaging through the IT world, and driving a lot of high-end tech. It created a revolution of…</h3></div>
            <div><p>neptune.ai</p></div>
          </div>
          <div>
            <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*NPDkJahhWDHzNy2C)"></div>
          </div>
        </div>
      </a>
    </div><div id="40bd" class="link-block">
      <a href="https://subscription.packtpub.com/book/data/9781800562882/2/ch02lvl1sec07/concepts-and-workflow-of-mlops">
        <div>
          <div>
            <h2>Concepts and workflow of MLOps | Engineering MLOps</h2>
            <div><h3>In this section, we will learn about a generic MLOps workflow; it is the result of many design cycle iterations as…</h3></div>
            <div><p>subscription.packtpub.com</p></div>
          </div>
          <div>
            <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*uwq776NrmC4FI5q5)"></div>
          </div>
        </div>
      </a>
    </div><div id="1d51" class="link-block">
      <a href="https://towardsdatascience.com/workflow-orchestration-vs-data-orchestration-are-those-different-a661c46d2e88">
        <div>
          <div>
            <h2>Workflow Orchestration vs. Data Orchestration — Are Those Different?</h2>
            <div><h3>Let’s disambiguate the terms to understand workflow orchestration better — with a real-life analogy!</h3></div>
            <div><p>towardsdatascience.com</p></div>
          </div>
          <div>
            <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*Zrs38KATpASYL062hrQ0bw.jpeg)"></div>
          </div>
        </div>
      </a>
    </div><div id="2f29" class="link-block">
      <a href="https://readmedium.com/positive-and-negative-data-engineering-a02cb497583d">
        <div>
          <div>
            <h2>Positive and Negative Engineering</h2>
            <div><h3>Don’t Panic.</h3></div>
            <div><p>medium.com</p></div>
          </div>
          <div>
            <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*9gizj1DiblN9DWzg)"></div>
          </div>
        </div>
      </a>
    </div><div id="d5f2" class="link-block">
      <a href="https://towardsdatascience.com/airflow-prefect-and-dagster-an-inside-look-6074781c9b77">
        <div>
          <div>
            <h2>Airflow, Prefect, and Dagster: An Inside Look</h2>
            <div><h3>One of the great things about the Modern Data Stack is the interoperability with all the different components that make…</h3></div>
            <div><p>towardsdatascience.com</p></div>
          </div>
          <div>
            <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*JUnRi0Yl2YQqZCiV)"></div>
          </div>
        </div>
      </a>
    </div><div id="f7be" class="link-block">
      <a href="https://future.com/negative-engineering-and-the-art-of-failing-successfully/#:~:text=Negative%20engineering%20is%20the%20time,success%20of%20their%20primary%20objectives">
        <div>
          <div>
            <h2>What Is Negative Engineering?</h2>
            <div><h3>It was the second game of a double-header, and the Washington Nationals had a problem. Not on the field, of course: The…</h3></div>
            <div><p>future.com</p></div>
          </div>
          <div>
            <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*7-vkX8CQqDHxYz63)"></div>
          </div>
        </div>
      </a>
    </div><div id="279f" class="link-block">
      <a href="https://readmedium.com/mlearning-ai-submission-suggestions-b51e2b130bfb">
        <div>
          <div>
            <h2>Mlearning.ai Submission Suggestions</h2>
            <div><h3>How to become a writer on Mlearning.ai</h3></div>
            <div><p>medium.com</p></div>
          </div>
          <div>
            <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*6xCb1sNpjadaSBuVLPTFQQ.png)"></div>
          </div>
        </div>
      </a>
    </div></article></body>

Prefect — Orchestrate Your Machine Learning Workflow

Some example type of Machine Learning Workflow Orchestration Tools

Presentation of Prefect

Deployments in Prefect

Logging in Prefect

Conclusion

Resources :

Home

Prefect is Air Traffic Control for your dataflows. It's the coordination plane that provides you with everything from…

Best Workflow and Pipeline Orchestration Tools: Machine Learning Guide - neptune.ai

Machine learning is rampaging through the IT world, and driving a lot of high-end tech. It created a revolution of…

Concepts and workflow of MLOps | Engineering MLOps

In this section, we will learn about a generic MLOps workflow; it is the result of many design cycle iterations as…

Workflow Orchestration vs. Data Orchestration — Are Those Different?

Let’s disambiguate the terms to understand workflow orchestration better — with a real-life analogy!

Positive and Negative Engineering

Don’t Panic.

Airflow, Prefect, and Dagster: An Inside Look

One of the great things about the Modern Data Stack is the interoperability with all the different components that make…

What Is Negative Engineering?

It was the second game of a double-header, and the Washington Nationals had a problem. Not on the field, of course: The…

Mlearning.ai Submission Suggestions

How to become a writer on Mlearning.ai