3D recent
updated
4K4DGen: Panoramic 4D Generation at 4K Resolution
Paper
• 2406.13527
• Published
• 9
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
Paper
• 2406.13393
• Published
• 5
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
Paper
• 2406.16273
• Published
• 43
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Paper
• 2406.20076
• Published
• 10
GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality
Paper
• 2406.18462
• Published
• 12
SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix
Paper
• 2407.00367
• Published
• 11
RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
Paper
• 2406.18284
• Published
• 19
Magic Insert: Style-Aware Drag-and-Drop
Paper
• 2407.02489
• Published
• 21
CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images
Paper
• 2407.03923
• Published
• 9
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Paper
• 2407.05282
• Published
• 15
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images
Paper
• 2407.06191
• Published
• 13
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Paper
• 2407.06938
• Published
• 25
Vision language models are blind
Paper
• 2407.06581
• Published
• 85
CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation
Paper
• 2407.06188
• Published
• 3
Controlling Space and Time with Diffusion Models
Paper
• 2407.07860
• Published
• 17
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
Paper
• 2407.07895
• Published
• 42
StyleSplat: 3D Object Style Transfer with Gaussian Splatting
Paper
• 2407.09473
• Published
• 13
GRUtopia: Dream General Robots in a City at Scale
Paper
• 2407.10943
• Published
• 25
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians
Paper
• 2407.11793
• Published
• 3
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Paper
• 2407.11398
• Published
• 10
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation
Paper
• 2407.11394
• Published
• 12
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Paper
• 2407.12781
• Published
• 13
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
Paper
• 2407.18901
• Published
• 35
Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture
Paper
• 2407.19593
• Published
• 12
3D Question Answering for City Scene Understanding
Paper
• 2407.17398
• Published
• 22
Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
Paper
• 2407.19548
• Published
• 27
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Paper
• 2407.20179
• Published
• 47
SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views
Paper
• 2408.10195
• Published
• 13