Category‐Specific Salient View Selection via Deep Convolutional Neural Networks

Kim, Seong‐heum; Tai, Yu‐Wing; Lee, Joon‐Young; Park, Jaesik; Kweon, In So

dc.contributor.author	Kim, Seong‐heum	en_US
dc.contributor.author	Tai, Yu‐Wing	en_US
dc.contributor.author	Lee, Joon‐Young	en_US
dc.contributor.author	Park, Jaesik	en_US
dc.contributor.author	Kweon, In So	en_US
dc.contributor.editor	Chen, Min and Zhang, Hao (Richard)	en_US
dc.date.accessioned	2018-01-10T07:42:58Z
dc.date.available	2018-01-10T07:42:58Z
dc.date.issued	2017
dc.identifier.issn	1467-8659
dc.identifier.uri	http://dx.doi.org/10.1111/cgf.13082
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf13082
dc.description.abstract	In this paper, we present a new framework to determine up front orientations and detect salient views of 3D models. The salient viewpoint to human preferences is the most informative projection with correct upright orientation. Our method utilizes two Convolutional Neural Network (CNN) architectures to encode category‐specific information learnt from a large number of 3D shapes and 2D images on the web. Using the first CNN model with 3D voxel data, we generate a CNN shape feature to decide natural upright orientation of 3D objects. Once a 3D model is upright‐aligned, the front projection and salient views are scored by category recognition using the second CNN model. The second CNN is trained over popular photo collections from internet users. In order to model comfortable viewing angles of 3D models, a category‐dependent prior is also learnt from the users. Our approach effectively combines category‐specific scores and classical evaluations to produce a data‐driven viewpoint saliency map. The best viewpoints from the method are quantitatively and qualitatively validated with more than 100 objects from 20 categories. Our thumbnail images of 3D models are the most favoured among those from different approaches.In this paper, we present a new framework to determine up front orientations and detect salient views of 3D models. The salient viewpoint to human preferences is the most informative projection with correct upright orientation. Our method utilizes two Convolutional Neural Network (CNN) architectures to encode category‐specific information learnt from a large number of 3D shapes and 2D images on the web. Using the first CNN model with 3D voxel data, we generate a CNN shape feature to decide natural upright orientation of 3D objects. Once a 3D model is upright‐aligned, the front projection and salient views are scored by category recognition using the second CNN model. The second CNN is trained over popular photo collections from internet users. In order to model comfortable viewing angles of 3D models, a category dependent prior is also learnt from the users. Our approach effectively combines category‐specific scores and classical evaluations to produce a data‐driven viewpoint saliency map. The best viewpoints from the method are quantitatively and qualitatively validated with more than 100 objects from 20 categories. Our thumbnail images of 3D models are the most favored among those from different approaches.	en_US
dc.publisher	© 2017 The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.subject	best view selection
dc.subject	upright orientation estimation
dc.subject	deep learning
dc.subject	Categories and Subject Descriptors (according to ACM CCS): I.3.3 [Computer Graphics]: Picture/Image Generation—Display algorithms
dc.subject	Viewing algorithms I.5.1 [Pattern Recognition]: Models—Neural Nets
dc.title	Category‐Specific Salient View Selection via Deep Convolutional Neural Networks	en_US
dc.description.seriesinformation	Computer Graphics Forum
dc.description.sectionheaders	Articles
dc.description.volume	36
dc.description.number	8
dc.identifier.doi	10.1111/cgf.13082
dc.identifier.pages	313-328

Files in this item

Name:: v36i8pp313-328.pdf
Size:: 13.47Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

36-Issue 8
Regular Issue

Show simple item record