date
type
status
slug
summary
tags
category
icon
password
URL

January 10, 2024 • 4 min read
by Simon Meng • See original
AIGC is getting so competitive nowadays that 2D/3D/video content is no longer enough—4D has entered the scene 😂. Quickly sharing some of the recent 4D generation algorithms I came across—some are almost ready for use. Note: here, 4D refers to 3D models with movement (4D models), and videos that allow switching viewpoints during playback (4D spatial scenes).
  • Animate124: Converts a single static image into a 3D video based on text descriptions, achieving a leap from 2D to 4D. This technology leverages a three-stage optimization and multi-diffusion prior, creating a unique animation experience.
notion image
  • 4D-fy: Combines variational SDS and text-to-image models (T2I) to enhance the 4D generation process. This algorithm enhances visual effects through mixed gradient supervision, showcasing its unique advantages in text-driven four-dimensional creation.
notion image
  • Grounded 4D Content Generation: Combines static 3D assets with monocular video sequences, offering users finer geometric and motion control in 4D scene construction. This method provides new perspectives in the field of 4D content creation.
notion image
  • DreamGaussian4D: Significantly improves content generation speed and enhances motion control and detail presentation through its 4D Gaussian splatting technique. This framework has distinct advantages in both efficiency and expressiveness.
  • Control4D: Enables users to intuitively edit 4D portraits using text instructions. The innovation of this framework lies in its high fidelity and editing consistency, providing new possibilities for four-dimensional editing.
  • Consistent4D: Opens new pathways for generating four-dimensional objects through uncalibrated monocular video. It adds a new dimension to the text-to-3D tasks, providing a strong complement to traditional methods.
notion image
  • EasyVolcap: A PyTorch-based library focusing on accelerating research in neural volumetric video, especially in volumetric video capture, reconstruction, and rendering. It provides a set of tools aimed at simplifying the complex volumetric video processing workflow.
  • SpacetimeGaussians: Introduces a new dynamic scene representation method—spatiotemporal Gaussian splatting. It combines enhanced 3D Gaussian models with feature splatting rendering techniques, achieving high-resolution real-time shading while maintaining compact storage.
  • GPS-Gaussian: Focuses on real-time reconstruction and rendering of 4D Gaussians, providing an efficient solution for novel human viewpoint synthesis. This tool aims for fast and accurate dynamic 3D rendering.
  • Dynamic 3D Gaussians: Breaks the limitations of neural implicit field modeling through its persistent dynamic view synthesis, enabling the reconstruction of dynamic objects and effectively combining models from different scenes.
notion image

 
 
 
At the end of 2023, I want to share two comforting AI tools with you and have a heartfelt chat.Using AI to transcend every doomsday of humanity until the end of the universe!
Loading...
Simon Shengyu Meng
Simon Shengyu Meng
AI artist driven by curiosity, cross-disciplinary researcher, PhD candidate, science communication blogger.
公告
--- About me ---
--- Contact Me ---
Design and Art Creation | AIGC Consultation and Training | Commercial Deployment