Table of contents
Feature image from: Understanding Diffusion Probabilistic Models (DPMs) | by Joseph Rocca - Medium
(
Searched by diffusion model in DDG image
)
Collections
References:
-
moatifbutt/awesome-diffusion-iclr-2025 - GitHub
- Surfaced when searching the paper of IC-Light in DDG
-
Variational Diffusion Models ~ NIPS 2021
(2023-11-04)
-
Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model
(2023-12-26)
- DM for single-image depth estimation
NeRF
-
HyperDiffusion: Generating Implicit Neural Fields with Weight-Space Diffusion
【Diffusion生成NeRF】TUM, Apple提出HyperDiffusion,用Diffusion计算神经场权重,统一框架下生成3D权重或4D动画
(2023-07-16)
-
Use DM to generate a NeRF.
-
Comment: “思路不如 shapE 宽。shapE的encoder不仅把3d assets压缩为MLP, 而且同时支持Nerf和DMTet的表征,在MLP上做diffusion还是conditional的。这篇文章相比起来还不太清楚卖点在哪”
-
RCG
-
Self-conditioned Image Generation via Generating Representations Code | brief
(2023-12-30)
-
The distribution of image is learned by a pre-trained encoder, used as the condition for image generation.
-
Representative Diffusion model: Sampling from the representation distribution
-
Pixel generater: convert samples to pixel
FID (Frechet Inception Distance): 3.31, IS (Inception score): 253.4
-
2D to 3D
-
MVDD: Multi-View Depth Diffusion Models
(2023-12-31)
-
Use DM to generate multi-view depth maps for point cloud generation.
-
20K+ points. The number of valid points may no larger than the resolution of an image, because depth and geometry consistencies needs to be checked like the point cloud fusion performed in MVSNet.
-
Depth map fusion
-
-
Epipolar attention affects the denosing steps.
-
-
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion
(2023-12-31) (可能是 美貌与智慧并重 他们做的,他在VAST?)
-
DM conditioned by a single image for generating multi-view images.
-
Restrict the frozen diffusion model with an epipolar cross-view attention
- Reminds me MVDiffusion
-
Generate 16 multi-view images in 12 seconds
- What is the resolution?
- What is the device?
-
Adjusting feature maps to control image generation
- No 3D geometry. I believe explicit structure is necessary for multi-view consistency especially in views with large-baselines.
-
Text to 3D
-
RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D
(2024-01-05)
- generalizable Normal-Depth diffusion model,
- PBR
Multi-view
-
Cameras as Rays: Pose Estimation via Ray Diffusion ~ ICLR 2024 (Oral)
(2024-03-01)
- Generate ray moments and ray directions by diffusion model.
Control
Illumination Editing
Light Transport
References:
-
Scaling In-the-Wild Training for Diffusion-based Illumination Harmonization and Editing by Imposing Consistent Light Transport - OpenReview
- Surfaced by WeChat subscription: ICLR 惊现 10,10,10,10 满分论文,ControlNet 作者新作,Github 5.8k 颗星 - 机器之心
- Style2Paints Research Lvmin Zhang (Lyumin Zhang)
Training Model From Scratch:
(2024-12-01)
-
IC-Light r1-OpenReview
-
Related:
- Pdf: Lvmin Zhang
- lllyasviel/IC-Light
-
Reasons:
- This paper draw my attention as it involves light transportation.
-
Q&A:
-
How does this method combine with Light Transport?
-
Is the training process similar to NeRF, which integrated differentiable rendering into the “pipeline to fulfill the task”, i.e., volumetric rendering.
-
-
Bonds:
-
“in-the-wild data” reminds me NeRF-in-the-wild, which separates transient and consistant contents using two gates.
-
“linear blending” of lighting effects under each single illumination condition.
-
Weighted sum, which the NN is good at.
-
I remember the word prompts to diffusion model have arithmatic characteristic, demonstrated in the short course of DLAI (Andrew Ng).
-
-
Diffusion-baed illumination editing method
- Lvmin commits himself to help artists r2-Paints.
-
-
Ideas:
-
Inproper training constraints result in a “Structure-guided random image generator”.
-
Complex illumination > Mixture of illumination > Approximated with $k$ diffusion model.
-
-
Questions:
-
Can the Mixture of diffusion models be replaced with Gaussian mixture model?
What are the similarity between the Mixture of diffusion models and Gaussian mixture model?
-
-