KinectFusion: Real-Time Dynamic 3D Surface Reconstruction and Interaction Shahram Izadi1 , Richard A. Newcombe1,2 , David Kim1 , Otmar Hilliges1 , David Molyneaux1 , Steve Hodges1 , Pushmeet Kohli1 , Jamie Shotton1 , Andrew J. Davison2 , Andrew Fitzgibbon1 1 Microsoft Research∗, 2 Imperial College London†

Figure 1: We introduce a new system for acquiring high-quality, geometrically precise 3D models of an entire room rapidly using a single moving Kinect camera. The system generates high-quality models from noisy Kinect data in real-time. An entire room or smaller objects can be reconstructed in seconds (top sequences). We demonstrate a number of compelling new interactive possibilities such as multi-touch on any arbitrarily shaped surface (bottom left sequence); real-time rigid body physics simulated on a dynamic reconstructed model (bottom middle sequence); and rapid segmentation and tracking of objects within the model (bottom right sequence).

1

Introduction

We present KinectFusion, a system that takes live depth data from a moving Kinect camera and in real-time creates high-quality, geometrically accurate, 3D models. Our system allows a user holding a Kinect camera to move quickly within any indoor space, and rapidly scan and create a fused 3D model of the whole room and its contents within seconds. Even small motions, caused for example by camera shake, lead to new viewpoints of the scene and thus refinements of the 3D model, similar to the effect of image superresolution. As the camera is moved closer to objects in the scene more detail can be added to the acquired 3D model. To achieve this, our system continually tracks the 6DOF pose of the camera and rapidly builds a representation of the geometry of arbitrary surfaces. Novel GPU-based implementations for both camera tracking and surface reconstruction allow us to run at interactive real-time rates that have not previously been demonstrated. We define new instantiations of two well known graphics algorithms designed specifically for parallelizable GPGPU hardware.

2

KinectFusion

The main system pipeline first takes the live depth map from Kinect and converts from image coordinates into 3D points and normals in the coordinate space of the camera. Next the tracking phase computes a rigid 6DOF transform that closely aligns the current oriented points with the previous frames, using a novel GPU implementation of the ICP algorithm [Besl and McKay 1992]. This defines a relative rigid transform from the previous camera pose to the current. These transforms are incrementally applied to a global transform that defines the global pose of the camera. Given the ∗ {shahrami,b-davidk,otmarh,a-davmo,shodges,pkohli,jamiesho,awf}

@microsoft.com ajd}@doc.ic.ac.uk

† {rnewcombe,

Copyright is held by the author / owner(s). SIGGRAPH 2011, Vancouver, British Columbia, Canada, August 7 – 11, 2011. ISBN 978-1-4503-0921-9/11/0008

global pose of the camera, points and normals are converted into global coordinates, and a single consistent 3D model is updated. Instead of simply fusing point clouds, we reconstruct surfaces based on a novel GPU-based implementation of volumetric truncated signed distance functions [Curless and Levoy 1996]. Voxels within the volume are updated based on our globally converted measurements. Each voxel stores a running average of its distance to the assumed position of a physical surface. Finally we use GPU accelerated raycasting to render a view of the volume and the 3D surfaces it contains given the current global pose of the camera. The view of the volume equates to a synthetic depth map, which can be used as a less noisy more globally consistent base or reference frame for the next iteration of ICP tracking. This allows us to track by comparing the current live depth map with our less noisy raycasted view of the model, as opposed to using only the live depth maps frame-to-frame. The system can reconstruct a scene within seconds and enables interactive possibilities including: extending multi-touch interactions to any arbitrarily shaped reconstructed surface; advanced features for augmented reality; real-time physics that are simulated live on the dynamic model; and novel methods for segmentation and tracking of scanned objects. We also present extensions to our GPUbased tracking algorithm to distinguish scene motion from camera motion thus dealing with dynamic scenes, in particular ones where users are interacting. See Figure 1 and accompanying video for examples.

References B ESL , P., AND M C K AY, N. 1992. A method for registration of 3D shapes. 239–256. C URLESS , B., AND L EVOY, M. 1996. A volumetric method for building complex models from range images. In ACM Transactions on Graphics (SIGGRAPH).

KinectFusion: real-time dynamic 3D surface reconstruction and ...

SIGGRAPH 2011, Vancouver, British Columbia, Canada, August 7 – 11, 2011. ... refinements of the 3D model, similar to the effect of image super- resolution.

2MB Sizes 1 Downloads 322 Views

Recommend Documents

Road Surface 3D Reconstruction Based on Dense Subpixel ...
and computer vision have been increasingly applied in civil. Rui Fan is with the ... e.g., stereo vision, are more capable of reconstructing the 3D ..... Road Surface 3D Reconstruction Based on Dense Subpixel Disparity Map Estimation .pdf.

Dynamic Surface Matching by Geodesic Mapping for 3D ... - CiteSeerX
point clouds from scanner data are registered using a ran- domized feature matching ..... tion Technology for Convivial Society”. References. [1] N. Ahmed, C.

Schematic Surface Reconstruction - Semantic Scholar
multiple swept surfaces, of which the transport curves lie in horizontal planes. This section will introduce the basic reconstruction framework that initializes a set ...

Schematic Surface Reconstruction - Changchang Wu
This paper introduces a schematic representation for architectural scenes together with robust algorithms for reconstruction from sparse 3D point cloud data. The.

Bone Surface Reconstruction Using Localized ...
the case of long shaped bones such as the tibia, humerus, clavicle, ulna or .... as connected if their distance is less than 10 pixels (Fig. 2d). .... 987–1010, 2006.

A Fast and Simple Surface Reconstruction Algorithm
Jun 17, 2012 - Octree decomposition. Root cell smallest bounding cube of P. Splitting rule split a splittable leaf cell into eight children. Balancing rule split a leaf cell C if it has a neighbor C/ s.t. lC < lC /2. Apply the two rules alternately u

A novel method for 3D reconstruction: Division and ...
object with a satisfactory accuracy, multiple scans, which generally lead to ..... surface B leads to a non-overlapping surface patch. ..... automation, 2009. ICRA'09 ...

Towards Real Time 3D Tracking and Reconstruction on ...
the GPU rather than the CPU, is that the GPU has several computing units, that can run ... algorithm and a reference implementation that works in real time on ..... cloud so the displacement of its projections is similar to the optical flow of the ..

Contour-Based Surface Reconstruction using MPU ...
fits a point-based implicit surface to the contour data, while allowing the user to .... visualization of point sets at interactive frame rates with good visual quality.

Implicit surface reconstruction from point clouds
... van de kwaliteit van een gereconstrueerd model is zowel objectief als subjectief. Objectief kan de kwaliteit van de reconstructie gemeten worden door de afs-.

Complex 3D General Object Reconstruction from Line ...
object found is the pyramid, and the algorithm does not add ... object (cube and pyramid in this example). .... The following scheme can solve this problem.

Soft 3D Reconstruction for View Synthesis - Research at Google
Progress of rendering virtual views of di icult scenes containing foliage, wide baseline occlusions and reflections. View ray and ... ubiquitous digital cameras from cell phones and drones, paired with automatic ... Furthermore, we show that by desig

Skewed mirror symmetry in the 3D reconstruction of ...
Feb 7, 2003 - method of determination. Keywords. Planes of Symmetry. 3D Reconstruction. Mirror Symmetry. Skewed facial-symmetry. Axis of Symmetry. Sketch. Input. ...... Workshop on Geometric Modeling and Computer. Graphics, 2000. [Var00c] Varley P. A

Parameter optimization in 3D reconstruction on a large ...
Feb 20, 2007 - File replication allows a single file to be replicated to multiple storage ... Data are then replicated from that SE to other two SEs and so on.

Absolute 3D reconstruction of thin films topography in ...
stricted to chips made of PDMS and glass, and for other types of chips, one ... Data normalisation along eqn (1) allows correction of background noise and ... cients of the sample. At the interface between two semi- infinite media i and j of index ni

Methodology for 3D reconstruction of objects for ...
return high quality virtual objects. Based on this ... A better solution is the usage of a cloud of web service solution. A review of ... ARC3D (Vergauwen, 2006) is a free web service that provides a standalone software application for uploading.

Soft 3D Reconstruction for View Synthesis - Research
Progress of rendering virtual views of di icult scenes containing foliage, wide baseline ..... by the total amount of votes accumulated along the ray in the event.

Reconstruction of high contrast images for dynamic ...
Nov 6, 2011 - LDR image results of “Merge to HDR Pro” tool in Adobe®. Photoshop CS5 [1]. This tool requires the knowledge of ex- posure settings while the ...

Complete Multi-View Reconstruction of Dynamic Scenes from ...
problems: reconstruction accuracy falls when stereo photo- consistency ... Every frame is reconstructed independently. Nevertheless to obtain very accurate 3D models from stereo a high number of input views are required. While the reconstruction of s

Nonlinear dynamic modeling of surface defects in ...
Aug 6, 2008 - defective bearing rotor systems as the parameters of the system changes. ..... period of T ¼ 1/Ovc where Ovc ¼ ZOvc is the varying compliance frequency, so that: ~UрtЮ ¼ ~Uрt ю TЮ. (18) ... This information is needed to ...

Dynamic Surface Matching by Geodesic Mapping for ...
Surfaces are cap- tured from multi-view video data and represented by se- ... in a free-viewpoint video of real-world subjects in motion .... are deformed over time by tracking photo-consistent sur- ..... high fidelity visualization for 3d video. CVI

Automatic Non-rigid Registration of 3D Dynamic Data ...
istration of 3D dynamic facial data using least-squares con- formal maps, and ..... cial expressions that provides a good representation of facial motion. Isomap ...