• Login
    View Item 
    •   Eurographics DL Home
    • Eurographics Workshops and Symposia
    • EGGH: SIGGRAPH/Eurographics Workshop on Graphics Hardware
    • High-Performance Graphics 2016
    • View Item
    •   Eurographics DL Home
    • Eurographics Workshops and Symposia
    • EGGH: SIGGRAPH/Eurographics Workshop on Graphics Hardware
    • High-Performance Graphics 2016
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Local Shading Coherence Extraction for SIMD-Efficient Path Tracing on CPUs

    Thumbnail
    View/Open
    119-128.pdf (20.62Mb)
    Date
    2016
    Author
    Áfra, Attila T.
    Benthin, Carsten
    Wald, Ingo ORCID
    Munkberg, Jacob
    Pay-Per-View via TIB Hannover:

    Try if this item/paper is available.

    Metadata
    Show full item record
    Abstract
    Accelerating ray traversal on data-parallel hardware architectures has received widespread attention over the last few years, but much less research has focused on efficient shading for ray tracing. This is unfortunate since shading for many applications is the single most time consuming operation. To maximize rendering performance, it is therefore crucial to effectively use the processor's wide vector units not only for the ray traversal step itself, but also during shading. This is non-trivial as incoherent ray distributions cause control flow divergence, making high SIMD utilization difficult to maintain. In this paper, we propose a local shading coherence extraction algorithm for CPU-based path tracing that enables efficient SIMD shading. Each core independently traces and sorts small streams of rays that fit into the on-chip cache hierarchy, allowing to extract coherent ray batches requiring similar shading operations, with a very low overhead. We show that operating on small independent ray streams instead of a large global stream is sufficient to achieve high SIMD utilization in shading (90% on average) for complex scenes, while avoiding unnecessary memory traffic and synchronization. For a set of scenes with many different materials, our approach reduces the shading time with 1.9–-3.4 compared to simple structure-of-arrays (SoA) based packet shading. The total rendering speedup varies between 1.2-3 , which is also determined by the ratio of the traversal and shading times.
    BibTeX
    @inproceedings {10.2312:hpg.20161198,
    booktitle = {Eurographics/ ACM SIGGRAPH Symposium on High Performance Graphics},
    editor = {Ulf Assarsson and Warren Hunt},
    title = {{Local Shading Coherence Extraction for SIMD-Efficient Path Tracing on CPUs}},
    author = {Áfra, Attila T. and Benthin, Carsten and Wald, Ingo and Munkberg, Jacob},
    year = {2016},
    publisher = {The Eurographics Association},
    ISSN = {2079-8679},
    ISBN = {978-3-03868-008-6},
    DOI = {10.2312/hpg.20161198}
    }
    URI
    http://dx.doi.org/10.2312/hpg.20161198
    Collections
    • High-Performance Graphics 2016

    Eurographics Association copyright © 2013 - 2022 
    Send Feedback | Contact - Imprint | Data Privacy Policy | Disable Google Analytics
    Theme by @mire NV
    System hosted at  Graz University of Technology.
    TUGFhA
     

     

    Browse

    All of Eurographics DLCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    BibTeX | TOC

    Create BibTeX Create Table of Contents

    Eurographics Association copyright © 2013 - 2022 
    Send Feedback | Contact - Imprint | Data Privacy Policy | Disable Google Analytics
    Theme by @mire NV
    System hosted at  Graz University of Technology.
    TUGFhA