Heinzle, SimonGuennebaud, GaƫlBotsch, MarioGross, MarkusDavid Luebke and John Owens2013-10-282013-10-282008978-3-905674-09-51727-3471https://doi.org/10.2312/EGGH/EGGH08/021-031We present a hardware architecture and processing unit for point sampled data. Our design is focused on fundamental and computationally expensive operations on point sets including k-nearest neighbors search, moving least squares approximation, and others. Our architecture includes a configurable processing module allowing users to implement custom operators and to run them directly on the chip. A key component of our design is the spatial search unit based on a kd-tree performing both kNN and eN searches. It utilizes stack recursions and features a novel advanced caching mechanism allowing direct reuse of previously computed neighborhoods for spatially coherent queries. In our FPGA prototype, both modules are multi-threaded, exploit full hardware parallelism, and utilize a fixed-function data path and control logic for maximum throughput and minimum chip surface. A detailed analysis demonstrates the performance and versatility of our design.Categories and Subject Descriptors (according to ACM CCS): I.3.1 [Hardware Architecture]: Graphics processorsA Hardware Processing Unit for Point Sets