Li, ShengrenSimons, LancePakaravoor, Jagadeesh BhaskarAbbasinejad, FatemehOwens, John D.Amenta, NinaCarsten Dachsbacher and Jacob Munkberg and Jacopo Pantaleoni2013-10-282013-10-282012978-3-905674-41-52079-8679https://doi.org/10.2312/EGGH/HPG12/039-047We describe the implementation of a simple method for finding k approximate nearest neighbors (ANNs) on the GPU. While the performance of most ANN algorithms depends heavily on the distributions of the data and query points, our approach has a very regular data access pattern. It performs as well as state of the art methods on easy distributions with small values of k, and much more quickly on more difficult problem instances. Irrespective of the distribution and also roughly of the size of the set of input data points, we can find 50 ANNs for 1M queries at a rate of about 1200 queries/ms.Categories and Subject Descriptors (according to ACM CCS): I.3.1 [Computer Graphics]: Hardware Architecture- Parallel processing I.3.1 [Computer Graphics]: Hardware Architecture-Graphics processors F.2.2 [Analysis of Algorithms and Problem Complexity]: Nonnumerical Algorithms and Problems-Sorting and searchingkANN on the GPU with Shifted Sorting