skip to content
Deep learning on point clouds for 3D scene understanding Preview this item
ClosePreview this item
Checking...

Deep learning on point clouds for 3D scene understanding

Author: Ruizhongtai Qi; Leonidas J Guibas; Bernd Girod; Silvio Savarese; Stanford University. Department of Electrical Engineering.
Publisher: [Stanford, California] : [Stanford University], 2018. ©2018
Dissertation: Ph. D. Stanford University 2018
Edition/Format:   Thesis/dissertation : Document : Thesis/dissertation : eBook   Computer File : English
Summary:
Point cloud is a commonly used geometric data type with many applications in computer vision, computer graphics and robotics. The availability of inexpensive 3D sensors has made point cloud data widely available and the current interest in self-driving vehicles has highlighted the importance of reliable and efficient point cloud processing. Due to its irregular format, however, current convolutional deep learning  Read more...
Rating:

(not yet rated) 0 with reviews - Be the first.

Find a copy online

Links to this item

Find a copy in the library

&AllPage.SpinnerRetrieving; Finding libraries that hold this item...

Details

Genre/Form: Academic theses
Material Type: Document, Thesis/dissertation, Internet resource
Document Type: Internet Resource, Computer File
All Authors / Contributors: Ruizhongtai Qi; Leonidas J Guibas; Bernd Girod; Silvio Savarese; Stanford University. Department of Electrical Engineering.
OCLC Number: 1050345555
Notes: Submitted to the Department of Electrical Engineering.
Description: 1 online resource
Responsibility: Ruizhongtai Qi.

Abstract:

Point cloud is a commonly used geometric data type with many applications in computer vision, computer graphics and robotics. The availability of inexpensive 3D sensors has made point cloud data widely available and the current interest in self-driving vehicles has highlighted the importance of reliable and efficient point cloud processing. Due to its irregular format, however, current convolutional deep learning methods cannot be directly used with point clouds. Most researchers transform such data to regular 3D voxel grids or collections of images, which renders data unnecessarily voluminous and causes quantization and other issues. In this thesis, we present novel types of neural networks (PointNet and PointNet++) that directly consume point clouds, in ways that respect the permutation invariance of points in the input. Our network provides a unified architecture for applications ranging from object classification and part segmentation to semantic scene parsing, while being efficient and robust against various input perturbations and data corruption. We provide a theoretical analysis of our approach, showing that our network can approximate any set function that is continuous, and explain its robustness. In PointNet++, we further exploit local contexts in point clouds, investigate the challenge of non-uniform sampling density in common 3D scans, and design new layers that learn to adapt to varying sampling densities. The proposed architectures have opened doors to new 3D-centric approaches to scene understanding. We show how we can adapt and apply PointNets to two important perception problems in robotics: 3D object detection and 3D scene flow estimation. In 3D object detection, we propose a new frustum-based detection framework that achieves 3D instance segmentation and 3D amodal box estimation in point clouds. Our model, called Frustum PointNets, benefits from accurate geometry provided by 3D points and is able to canonicalize the learning problem by applying both non-parametric and data-driven geometric transformations on the inputs. Evaluated on large-scale indoor and outdoor datasets, our real-time detector significantly advances state of the art. In scene flow estimation, we propose a new deep network called FlowNet3D that learns to recover 3D motion flow from two frames of point clouds. Compared with previous work that focuses on 2D representations and optimizes for optical flow, our model directly optimizes 3D scene flow and shows great advantages in evaluations on real LiDAR scans. As point clouds are prevalent, our architectures are not restricted to the above two applications or even 3D scene understanding. This thesis concludes with a discussion on other potential application domains and directions for future research.

Reviews

User-contributed reviews
Retrieving GoodReads reviews...
Retrieving DOGObooks reviews...

Tags

Be the first.
Confirm this request

You may have already requested this item. Please select Ok if you would like to proceed with this request anyway.

Linked Data


Primary Entity

<http://www.worldcat.org/oclc/1050345555> # Deep learning on point clouds for 3D scene understanding
    a schema:Book, pto:Web_document, schema:MediaObject, schema:CreativeWork, bgn:Thesis ;
    bgn:inSupportOf "" ;
    library:oclcnum "1050345555" ;
    library:placeOfPublication <http://id.loc.gov/vocabulary/countries/cau> ;
    schema:author <http://experiment.worldcat.org/entity/work/data/5408217640#Person/qi_ruizhongtai> ; # Ruizhongtai Qi
    schema:contributor <http://experiment.worldcat.org/entity/work/data/5408217640#Person/savarese_silvio> ; # Silvio Savarese
    schema:contributor <http://experiment.worldcat.org/entity/work/data/5408217640#Person/guibas_leonidas_j> ; # Leonidas J. Guibas
    schema:contributor <http://experiment.worldcat.org/entity/work/data/5408217640#Person/girod_bernd> ; # Bernd Girod
    schema:contributor <http://experiment.worldcat.org/entity/work/data/5408217640#Organization/stanford_university_department_of_electrical_engineering> ; # Stanford University. Department of Electrical Engineering.
    schema:copyrightYear "2018" ;
    schema:datePublished "2018" ;
    schema:description "Point cloud is a commonly used geometric data type with many applications in computer vision, computer graphics and robotics. The availability of inexpensive 3D sensors has made point cloud data widely available and the current interest in self-driving vehicles has highlighted the importance of reliable and efficient point cloud processing. Due to its irregular format, however, current convolutional deep learning methods cannot be directly used with point clouds. Most researchers transform such data to regular 3D voxel grids or collections of images, which renders data unnecessarily voluminous and causes quantization and other issues. In this thesis, we present novel types of neural networks (PointNet and PointNet++) that directly consume point clouds, in ways that respect the permutation invariance of points in the input. Our network provides a unified architecture for applications ranging from object classification and part segmentation to semantic scene parsing, while being efficient and robust against various input perturbations and data corruption. We provide a theoretical analysis of our approach, showing that our network can approximate any set function that is continuous, and explain its robustness. In PointNet++, we further exploit local contexts in point clouds, investigate the challenge of non-uniform sampling density in common 3D scans, and design new layers that learn to adapt to varying sampling densities. The proposed architectures have opened doors to new 3D-centric approaches to scene understanding. We show how we can adapt and apply PointNets to two important perception problems in robotics: 3D object detection and 3D scene flow estimation. In 3D object detection, we propose a new frustum-based detection framework that achieves 3D instance segmentation and 3D amodal box estimation in point clouds. Our model, called Frustum PointNets, benefits from accurate geometry provided by 3D points and is able to canonicalize the learning problem by applying both non-parametric and data-driven geometric transformations on the inputs. Evaluated on large-scale indoor and outdoor datasets, our real-time detector significantly advances state of the art. In scene flow estimation, we propose a new deep network called FlowNet3D that learns to recover 3D motion flow from two frames of point clouds. Compared with previous work that focuses on 2D representations and optimizes for optical flow, our model directly optimizes 3D scene flow and shows great advantages in evaluations on real LiDAR scans. As point clouds are prevalent, our architectures are not restricted to the above two applications or even 3D scene understanding. This thesis concludes with a discussion on other potential application domains and directions for future research."@en ;
    schema:exampleOfWork <http://worldcat.org/entity/work/id/5408217640> ;
    schema:genre "Academic theses"@en ;
    schema:inLanguage "en" ;
    schema:name "Deep learning on point clouds for 3D scene understanding"@en ;
    schema:productID "1050345555" ;
    schema:url <http://purl.stanford.edu/xm943cz7043> ;
    wdrs:describedby <http://www.worldcat.org/title/-/oclc/1050345555> ;
    .


Related Entities

<http://experiment.worldcat.org/entity/work/data/5408217640#Organization/stanford_university_department_of_electrical_engineering> # Stanford University. Department of Electrical Engineering.
    a schema:Organization ;
    schema:name "Stanford University. Department of Electrical Engineering." ;
    .

<http://experiment.worldcat.org/entity/work/data/5408217640#Person/girod_bernd> # Bernd Girod
    a schema:Person ;
    schema:familyName "Girod" ;
    schema:givenName "Bernd" ;
    schema:name "Bernd Girod" ;
    .

<http://experiment.worldcat.org/entity/work/data/5408217640#Person/guibas_leonidas_j> # Leonidas J. Guibas
    a schema:Person ;
    schema:familyName "Guibas" ;
    schema:givenName "Leonidas J." ;
    schema:name "Leonidas J. Guibas" ;
    .

<http://experiment.worldcat.org/entity/work/data/5408217640#Person/qi_ruizhongtai> # Ruizhongtai Qi
    a schema:Person ;
    schema:familyName "Qi" ;
    schema:givenName "Ruizhongtai" ;
    schema:name "Ruizhongtai Qi" ;
    .

<http://experiment.worldcat.org/entity/work/data/5408217640#Person/savarese_silvio> # Silvio Savarese
    a schema:Person ;
    schema:familyName "Savarese" ;
    schema:givenName "Silvio" ;
    schema:name "Silvio Savarese" ;
    .


Content-negotiable representations

Close Window

Please sign in to WorldCat 

Don't have an account? You can easily create a free account.