Computer Vision Archives

PUBLICATIONS

Consistent View Synthesis with Pose-Guided Diffusion Models

Propose a framework based on diffusion models for consistent and realistic long-term novel view synthesis. Diffusion models have achieved impressive performance on many content creation applications, such as image-to-image translation and text-to- image generation.

PUBLICATIONS

A Practical Stereo Depth System for Smart Glasses

We present the design of a productionized end-to-end stereo depth sensing system that does pre-processing, online stereo rectification, and stereo depth estimation with...

PUBLICATIONS

Robust Dynamic Radiance Fields

We introduce RoDynRF, an algorithm for reconstructing dynamic radiance fields from casual videos. Unlike existing approaches, we do not require accurate camera poses as input. Our method optimizes camera poses and two radiance fields, modeling static and dynamic elements. Our approach includes a coarse-to-fine strategy and epipolar geometry to exclude moving pixels, deformation fields, time- dependent appearance models, and regularization losses for improved consistency.

PUBLICATIONS

Research

Computer Vision

Consistent View Synthesis with Pose-Guided Diffusion Models

A Practical Stereo Depth System for Smart Glasses

Robust Dynamic Radiance Fields

OMNI3D: A Large Benchmark and Model for 3D Object Detection in the Wild

AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation

Egocentric Audio-Visual Object Localization

Few-shot Semantic Image Synthesis with Class Affinity Transfer

RelightableHands: Efficient Neural Relighting of Articulated Hand Models

Egocentric Video Task Translation

Multiview Compressive Coding for 3D Reconstruction