avatarDariusz Gross #DATAsculptor

Free AI web copilot to create summaries, insights and extended knowledge, download it at here

4337

Abstract

touching body and scene elements and better distinguish surfaces.</figcaption></figure><p id="e4eb">Their journey was not without challenges. The system had to overcome numerous obstacles, such as variations in lighting, different camera angles, and changes in the person’s appearance. But the researchers were undeterred. They refined their algorithms, improved their models, and gradually, their system became more robust and versatile.</p><div id="2352" class="link-block"> <a href="https://readmedium.com/turn-text-to-3d-ai-art-a6911bf5c43d"> <div> <div> <h2>Turn TEXT to 3D AI art</h2> <div><h3>Open-World Scene: The Future of 3D DIGITAL ART</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*DWeplOKqBoWs05Uar3NPNQ.png)"></div> </div> </div> </a> </div><h1 id="f768">A Breakthrough in Computer Vision: The Power of Video-to-3D Avatar Transformation</h1><p id="9535">The culmination of their journey was a breakthrough in the field of computer vision and graphics. They had successfully transformed a simple video into a lifelike 3D avatar. But more than that, they had opened up a world of possibilities. Their system could be used in virtual reality, gaming, film production, and many other applications.</p> <figure id="90e3"> <div> <div> <img class="ratio" src="http://placehold.it/16x9"> <iframe class="" src="https://cdn.embedly.com/widgets/media.html?src=https%3A%2F%2Fwww.youtube.com%2Fembed%2FEGi47YeIeGQ%3Ffeature%3Doembed&amp;display_name=YouTube&amp;url=https%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3DEGi47YeIeGQ&amp;image=https%3A%2F%2Fi.ytimg.com%2Fvi%2FEGi47YeIeGQ%2Fhqdefault.jpg&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;type=text%2Fhtml&amp;schema=youtube" allowfullscreen="" frameborder="0" height="480" width="854"> </div> </div> </figure></iframe></div></div></figure><h2 id="80e3">The Beginning of a New Era: Lifelike 3D Avatars and the Endless Possibilities of Technology</h2><p id="787b">In the end, the researchers from ETH Zurich had not just created a system. They had told a story, a story of innovation, perseverance, and the endless possibilities of technology. And as they looked at the <a href="https://open.substack.com/pub/mlearning/p/state-of-the-3d-art-june-2023-tools-design-ai?r=z7zu8&amp;utm_campaign=post&amp;utm_medium=web">3D</a> avatar, a digital reflection of the real world, they knew that their journey was just the beginning.</p><p id="f917"><b>In short, their contributions are:</b></p><ol><li>a method to reconstruct detailed <a href="https://open.substack.com/pub/mlearning/p/state-of-the-3d-art-june-2023-tools-design-ai?r=z7zu8&amp;utm_campaign=post&amp;utm_medium=web">3D</a> avatars from in-the-wild monocular videos using self-supervised scene decomposition;</li><li>a way to get robust and detailed 3D reconstructions of the human even in difficult poses and environments without using external segmentation methods; and</li><li>a new semi-synthetic testing dataset that allows for the first time to compare monocular human reconstruction methods on realistic scenes. The file has a lot of notes about the <a href="https://open.substack.com/pub/mlearning/p/state-of-the-3d-art-june-2023-tools-design-ai?r=z7zu8&amp;utm_campaign=post&amp;utm_medium=web">3D</a> surface.</li></ol><p id="3cb0">Researchers will be able to use the <a href="#cfec">code and data</a> for their work.</p><blockquote id="f84e"><p>Over the course of the previous year, we have made significant progress in transitioning from <a href="https://readmedium.com/text-to-3d-generation-cb8ce2c7f0f7">2D to 3D</a>; this fresh method is the next step, and there is still more to come.</p></blockquote><div id="acdd" class="link-block"> <a href="https://open.substack.com/pub/mlearning/p/state-of-the-3d-art-june-2023-tools-design-ai?r=z7zu8&amp;utm_campaign=post&amp;utm_medium=web"> <div> <div> <h2>State of the 3D Art, June 2023</h2> <div><h3>Delve into the revolutionary 3D design wi

Options

th June 2023 top picks of 3D AI tools, these avant-garde solutions are…</h3></div> <div><p>open.substack.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*I_IoEyehDdfYocGT)"></div> </div> </div> </a> </div><h2 id="224e">AI is everywhere, But the question is, how much do you love it?</h2><p id="3a00">I invite you to explore the concept of <a href="https://mlearning.substack.com/p/can-ai-generate-3d-models?r=z7zu8&amp;s=w&amp;utm_campaign=post&amp;utm_medium=web">Machine Learning </a>Art by reading and learning from the many articles found on 🔵 <a href="https://mlearning.substack.com">MLearning.ai</a> 🟠</p><div id="7bf9" class="link-block"> <a href="https://datasculptor.medium.com/membership"> <div> <div> <h2>Join Medium with my referral link — Dariusz Gross #DATAsculptor</h2> <div><h3>AI is everywhere 🟠 But the question is, how much do you love it? Join the Medium Membership to enjoy every story! Your…</h3></div> <div><p>datasculptor.medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*voSKDnEy0vUhaW9N)"></div> </div> </div> </a> </div><p id="0501"><i>Check out my <a href="https://www.instagram.com/datasculptor/">instagram</a> with new material every week</i></p><ul><li><i>If you enjoyed this, <a href="/@DATAsculptor">follow me on Medium</a> for more</i></li><li><i>Want to collaborate? Let’s connect on <a href="https://www.linkedin.com/in/dariusz-gross/">LinkedIn</a></i></li><li><a href="https://linktr.ee/datasculptor"><i>https://linktr.ee/datasculptor</i></a></li><li><i>3D Machine Learning generated model on <a href="https://sketchfab.com/degross">sketchfab</a></i></li></ul><h2 id="504a">Keywords: computer vision, Artificial Intelligence, Machine Learning, AI art, art, wombo dream, digital art, Dalle 2, Imagen, wombo ai, Parti, 3D point cloud, diffusion models, generative art, wombo art, photographic quality, img by AI system, AI art generator, text to art generator, 3D, midjourney, dalle2, stablediffusion, Dalle 3, Vid2Avatar</h2><figure id="e57f"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*KVr8l72WtbN3OdvYeUAKIw.jpeg"><figcaption><a href="https://files.ait.ethz.ch/projects/vid2avatar/main.pdf">https://files.ait.ethz.ch/projects/vid2avatar/main.pdf</a></figcaption></figure><h2 id="cfec">Project Page:</h2><p id="a4c0"><a href="https://files.ait.ethz.ch/projects/vid2avatar/main.pdf">https://files.ait.ethz.ch/projects/vid2avatar/main.pdf</a></p><h1 id="a051">3D Avatar Reconstruction from Videos in the Wild</h1><h2 id="55db">CODE:</h2><p id="5d79"><a href="https://github.com/MoyGcc/vid2avatar">https://github.com/MoyGcc/vid2avatar</a></p><div id="5cb6"><pre>@inproceedings{guo2023vid2avatar, title={Vid2Avatar: <span class="hljs-number">3</span>D Avatar Reconstruction from Videos in the Wild via Self-supervised <span class="hljs-keyword">Scene </span>Decomposition}, author={Guo, Chen <span class="hljs-keyword">and </span><span class="hljs-keyword">Jiang, </span>Tianjian <span class="hljs-keyword">and </span>Chen, Xu <span class="hljs-keyword">and </span>Song, <span class="hljs-keyword">Jie </span><span class="hljs-keyword">and </span>Hilliges, Otmar},
<span class="hljs-keyword">booktitle </span>= {Computer Vision <span class="hljs-keyword">and </span>Pattern Recognition (CVPR)}, year = {<span class="hljs-number">2023</span>} }</pre></div><div id="13b5" class="link-block"> <a href="https://readmedium.com/text-to-3d-generation-cb8ce2c7f0f7"> <div> <div> <h2>Text-to-3D Generation</h2> <div><h3>Can AI create 3D models? [update June 2023]</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*sP5nOOahbt1XWTx_UuozOw.jpeg)"></div> </div> </div> </a> </div></article></body>

Video-to-3D Avatar. Dynamic Human Reconstruction

How Machines Decode Cinematic Stories

Unleashing New 3D video Tools

Bridging Video and 3D Avatars

Once upon a time, in the bustling world of technology and innovation, a team of researchers from ETH Zurich embarked on a fascinating journey. Their mission was to bridge the gap between the realms of video and 3D avatars, a task that had challenged many before them.

AI and Cinema

From Simple Videos to Dynamic 3D Avatars: A Technological Leap

Their journey began with a simple video, a mere collection of two-dimensional images. The video, however, held a world of potential. It was a window into a three-dimensional reality, a reality that the researchers aimed to capture and translate into the digital world.

The team, armed with their knowledge and expertise, developed a novel approach. They created a system that could take a single video as input and generate a 3D avatar as output. This was no ordinary avatar, though. It was a dynamic, animated representation that could mimic the movements and expressions of the person in the video with remarkable accuracy.

Deep Learning: The Key to Capturing Human Movement in 3D

The researchers’ system was a marvel of modern technology. It utilized deep learning techniques to understand and replicate the intricate details of human movement. It could capture the subtleties of facial expressions, the fluidity of body movements, and even the complex dynamics of clothing.

3D Avatar Method

Instead of using standard 2D segmentation methods or manually annotated masks, the researchers perform scene decomposition and surface reconstruction directly in 3D to reconstruct the shape and appearance of implicit neural avatars from monocular movies in the wild. They implicitly model the people and background in the image using two neural fields trained simultaneously from photographs to composite the entire scene. They propose innovative goals that regularize ray opacity using the dynamically updated human form in canonical space to reduce the ambiguity of touching body and scene elements and better distinguish surfaces.

Their journey was not without challenges. The system had to overcome numerous obstacles, such as variations in lighting, different camera angles, and changes in the person’s appearance. But the researchers were undeterred. They refined their algorithms, improved their models, and gradually, their system became more robust and versatile.

A Breakthrough in Computer Vision: The Power of Video-to-3D Avatar Transformation

The culmination of their journey was a breakthrough in the field of computer vision and graphics. They had successfully transformed a simple video into a lifelike 3D avatar. But more than that, they had opened up a world of possibilities. Their system could be used in virtual reality, gaming, film production, and many other applications.

The Beginning of a New Era: Lifelike 3D Avatars and the Endless Possibilities of Technology

In the end, the researchers from ETH Zurich had not just created a system. They had told a story, a story of innovation, perseverance, and the endless possibilities of technology. And as they looked at the 3D avatar, a digital reflection of the real world, they knew that their journey was just the beginning.

In short, their contributions are:

  1. a method to reconstruct detailed 3D avatars from in-the-wild monocular videos using self-supervised scene decomposition;
  2. a way to get robust and detailed 3D reconstructions of the human even in difficult poses and environments without using external segmentation methods; and
  3. a new semi-synthetic testing dataset that allows for the first time to compare monocular human reconstruction methods on realistic scenes. The file has a lot of notes about the 3D surface.

Researchers will be able to use the code and data for their work.

Over the course of the previous year, we have made significant progress in transitioning from 2D to 3D; this fresh method is the next step, and there is still more to come.

AI is everywhere, But the question is, how much do you love it?

I invite you to explore the concept of Machine Learning Art by reading and learning from the many articles found on 🔵 MLearning.ai 🟠

Check out my instagram with new material every week

Keywords: computer vision, Artificial Intelligence, Machine Learning, AI art, art, wombo dream, digital art, Dalle 2, Imagen, wombo ai, Parti, 3D point cloud, diffusion models, generative art, wombo art, photographic quality, img by AI system, AI art generator, text to art generator, 3D, midjourney, dalle2, stablediffusion, Dalle 3, Vid2Avatar

https://files.ait.ethz.ch/projects/vid2avatar/main.pdf

Project Page:

https://files.ait.ethz.ch/projects/vid2avatar/main.pdf

3D Avatar Reconstruction from Videos in the Wild

CODE:

https://github.com/MoyGcc/vid2avatar

@inproceedings{guo2023vid2avatar,
      title={Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition},
      author={Guo, Chen and Jiang, Tianjian and Chen, Xu and Song, Jie and Hilliges, Otmar},    
      booktitle = {Computer Vision and Pattern Recognition (CVPR)},
      year      = {2023}
    }
Ai Art
3d
Computer Vision
Machine Learning
Virtual Reality
Recommended from ReadMedium