3D recent - a Inishds Collection

Inishds 's Collections

3D recent

updated Aug 20, 2024

4K4DGen: Panoramic 4D Generation at 4K Resolution

Paper • 2406.13527 • Published Jun 19, 2024 • 9
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images

Paper • 2406.13393 • Published Jun 19, 2024 • 5
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals

Paper • 2406.16273 • Published Jun 24, 2024 • 43
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model

Paper • 2406.20076 • Published Jun 28, 2024 • 10
GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality

Paper • 2406.18462 • Published Jun 26, 2024 • 12
SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix

Paper • 2407.00367 • Published Jun 29, 2024 • 11
RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network

Paper • 2406.18284 • Published Jun 26, 2024 • 19
Magic Insert: Style-Aware Drag-and-Drop

Paper • 2407.02489 • Published Jul 2, 2024 • 21
CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images

Paper • 2407.03923 • Published Jul 4, 2024 • 9
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale

Paper • 2407.05282 • Published Jul 7, 2024 • 15
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images

Paper • 2407.06191 • Published Jul 8, 2024 • 13
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models

Paper • 2407.06938 • Published Jul 9, 2024 • 25
Vision language models are blind

Paper • 2407.06581 • Published Jul 9, 2024 • 84
CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation

Paper • 2407.06188 • Published Jul 8, 2024 • 3
Controlling Space and Time with Diffusion Models

Paper • 2407.07860 • Published Jul 10, 2024 • 17
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models

Paper • 2407.07895 • Published Jul 10, 2024 • 42
StyleSplat: 3D Object Style Transfer with Gaussian Splatting

Paper • 2407.09473 • Published Jul 12, 2024 • 13
GRUtopia: Dream General Robots in a City at Scale

Paper • 2407.10943 • Published Jul 15, 2024 • 25
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians

Paper • 2407.11793 • Published Jul 16, 2024 • 3
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion

Paper • 2407.11398 • Published Jul 16, 2024 • 10
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation

Paper • 2407.11394 • Published Jul 16, 2024 • 12
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control

Paper • 2407.12781 • Published Jul 17, 2024 • 13
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents

Paper • 2407.18901 • Published Jul 26, 2024 • 35
Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture

Paper • 2407.19593 • Published Jul 28, 2024 • 12
3D Question Answering for City Scene Understanding

Paper • 2407.17398 • Published Jul 24, 2024 • 22
Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

Paper • 2407.19548 • Published Jul 28, 2024 • 27
Theia: Distilling Diverse Vision Foundation Models for Robot Learning

Paper • 2407.20179 • Published Jul 29, 2024 • 47
SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views

Paper • 2408.10195 • Published Aug 19, 2024 • 13