Free AI web copilot to create summaries, insights and extended knowledge, download it at here

Abstract

e using a fraction (0.38%) of parameters compared to <b>StyleVideoGAN.</b></p><div id="d3e1"><pre><span class="language-xml">@article</span><span class="hljs-template-variable">{oorloff2022encodeinstyle, title={Encode-in-Style: Latent-based Video Encoding using StyleGAN2}</span><span class="language-xml">, author=</span><span class="hljs-template-variable">{Trevine Oorloff and Yaser Yaoob}</span><span class="language-xml">, year=</span><span class="hljs-template-variable">{2022}</span><span class="language-xml">, eprint=</span><span class="hljs-template-variable">{2203.14512}</span><span class="language-xml">, archivePrefix=</span><span class="hljs-template-variable">{arXiv}</span><span class="language-xml">, primaryClass=</span><span class="hljs-template-variable">{cs.CV}</span><span class="language-xml">, }</span></pre></div><figure id="5cd3"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*-3b2urrvEo5dCftajylptw.png"><figcaption><a href="https://arxiv.org/pdf/2203.14512.pdf">https://arxiv.org/pdf/2203.14512.pdf</a></figcaption></figure><h2 id="bbb1">Project Page:</h2><p id="a34a"><a href="https://arxiv.org/pdf/2203.14512.pdf">https://arxiv.org/pdf/2203.14512.pdf</a></p><h2 id="0378">Keywords: video resynthesis, video encoding, StyleGAN, image inversion, latent space editing</h2><p id="c461">I invite you to explore the concept of “AI creativity” by reading and learning from the many articles found on 🔵 <a href="https://mlearning.substack.com/"><b>MLearning.ai</b></a><b> </b>🟠</p><div id="6c88" class="link-block"> <a href="https://evartology.medium.com/membership"> <div> <div> <h2>Join Medium with my referral link - Eva Rtology</h2> <div><h3>As a Medium member, a portion of your membership fee goes to writers you read, and you get full access to every story…</h3></div> <div><p>evartology.medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*D7gMBKIUN14KNXKA)"></div> </div> </div> </a> </div><p id="2825">I am <a href="https://readmedium.com/how-to-become-a-curator-3c0c75f74637">an Art Curator,</a> founder at <a href="https://evartology.com/">EvArtology</a>. I advise companies and instit

Options

utions in the <a href="https://readmedium.com/machine-learning-will-free-creatives-79f005145e4">creative industries</a> on using AI tools in their daily work. Human collaboration with ML models can be very creative and bring huge benefits. <a href="https://readmedium.com/is-ai-art-really-art-a363073d62d0">The new era begins now.</a></p><blockquote id="0c2c"><p><i>Data Scientists must think like an artist when finding a solution when creating a piece of code. <a href="https://medium.com/mlearning-ai/tagged/art">Artists</a> enjoy working on interesting problems, even if there is no obvious answer.</i></p></blockquote><p id="7bb7">All our writers (<a href="https://www.getrevue.co/profile/mlearning_ai/members">members</a>) receive the opportunity to be promoted on our social media, which increases the popularity of articles published on MLearning.ai</p><ol><li><a href="https://www.linkedin.com/company/mlearning-ai/">Linkedin</a> (6.9K+ ML-professionals)</li><li><a href="https://twitter.com/Mlearning_ai">Twitter</a> (4.7K+ followers)</li><li><a href="https://www.instagram.com/mlearning.ai/">Instagram</a> (2.2K + followers )</li><li><a href="https://readmedium.com/take-vr-tour-of-these-ml-stories-a7550340a6a2">Sketchfab</a> * — individual v<a href="https://readmedium.com/zahra-ahmads-vroom-1510367d679d">Roo</a>ML!</li><li><a href="https://www.facebook.com/Art.Machine.Learning">Facebook</a></li><li><a href="https://www.youtube.com/watch?v=-AXMoEiGdaI">Youtube</a></li><li><a href="https://podcasts.apple.com/pl/podcast/learning-better-and-faster/id1580007913">Apple Podcasts</a></li><li><a href="https://mlearning.substack.com/"><b>Substack</b></a></li></ol><p id="dcdf">🔵 <a href="https://readmedium.com/mlearning-ai-submission-suggestions-b51e2b130bfb">Submission Suggestions</a></p><div id="74dc" class="link-block"> <a href="https://readmedium.com/mlearning-ai-submission-suggestions-b51e2b130bfb"> <div> <div> <h2>Mlearning.ai Submission Suggestions</h2> <div><h3>How to become a writer on Mlearning.ai</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*ib0DX0UzRoFcNuZILb7rNA.jpeg)"></div> </div> </div> </a> </div></article></body>

Machine Learning Art

Re-Creating Your Own Face

With SOTA video algorithm

I will tell you about a technology that can generate near-human level videos of any person. Now, this technology could be used in advertising or movies, and it might seem like magic at first, but I think you’ll find out that it’s not far from the truth.

Improving the original StyleGAN2 image inversion and multi-stage non-linear latent-space editing, the authors propose an end-to-end facial video encoding approach that facilitates data-efficient high-quality video re-synthesis by optimizing low dimensional edits of a single Identity. The approach builds on StyleGAN2 image inversion and multi-stage non-linear latent space editing to generate videos that are nearly comparable to input videos. It captures face identity, head pose, and complex facial motions at acceptable levels and thereby bypasses training and person modeling, often hampering many re-synthesis approaches. This pipeline can also be used for puppeteering (i.e., motion transfer).

Project Page (scroll down)

The authors are the initial: 🔵 in automating the editing of latent spaces in contrast to the prevailing work on latent-space editing that illustrates plausible semantic visual results (e.g. smiles, hair color, gaze) 🔵 to propose an extremely compact latent-based facial video encoding scheme that captures extremely fine, rich, and complex facial deformations.

Conclusion

The authors extend the StyleGAN2’s photo-realism and disentanglement of its StyleSpace spatiotemporally to propose a novel end-to-end pipeline for latent-based facial video encoding. It enables high-fidelity (10242) video re-synthesis and reenactment using a single W+ latent and 35 parameters per frame. Furthermore, their algorithm achieves SOTA performance for video re-synthesis at 10242 while using a fraction (0.38%) of parameters compared to StyleVideoGAN.

@article{oorloff2022encodeinstyle,
      title={Encode-in-Style: Latent-based Video Encoding using StyleGAN2},
      author={Trevine Oorloff and Yaser Yaoob},
      year={2022},
      eprint={2203.14512},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
}

Project Page:

https://arxiv.org/pdf/2203.14512.pdf

Keywords: video resynthesis, video encoding, StyleGAN, image inversion, latent space editing

I invite you to explore the concept of “AI creativity” by reading and learning from the many articles found on 🔵 MLearning.ai 🟠

Join Medium with my referral link - Eva Rtology

As a Medium member, a portion of your membership fee goes to writers you read, and you get full access to every story…

evartology.medium.com

I am an Art Curator, founder at EvArtology. I advise companies and institutions in the creative industries on using AI tools in their daily work. Human collaboration with ML models can be very creative and bring huge benefits. The new era begins now.

Data Scientists must think like an artist when finding a solution when creating a piece of code. Artists enjoy working on interesting problems, even if there is no obvious answer.

All our writers (members) receive the opportunity to be promoted on our social media, which increases the popularity of articles published on MLearning.ai

Linkedin (6.9K+ ML-professionals)
Twitter (4.7K+ followers)
Instagram (2.2K + followers )
Sketchfab * — individual vRooML!
Facebook
Youtube
Apple Podcasts
Substack

🔵 Submission Suggestions

Mlearning.ai Submission Suggestions

How to become a writer on Mlearning.ai

medium.com