Render accuracy issues with moderate to high temporal samples on OpenCL

Take the attached edge.xml and sequence it with embergenome --sequence=edge.xml --loops 0 --loopframes 0 --interploops 1 --interpframes=159 > sequenced.xml.

Then render with

emberanimate --in=sequenced.xml --frame=0 --suffix=_gpu --opencl --sp
emberanimate --in=sequenced.xml --frame=0 --suffix=_cpu --sp

You'll notice that the OpenCL-rendered image has some glow in the darker areas of the image. As you increase the number of temporal samples, the rest of the image desaturates and the error areas get brighter, as though samples above some threshold are having their position calculated incorrectly, and increasing the number of temporal samples enlarges some range that pushes more samples past this threshold. I don't have a detailed understanding of how the flame algorithm works though, so this is a guess.

Some renderings I did with various --ts settings are shown here: http://imgur.com/a/fKECp

I don't see any notable differences between single- and double-precision renderings. I have no AMD hardware to test on.

OpenCL Info:
Platform 0: NVIDIA Corporation  NVIDIA CUDA  OpenCL 1.2 CUDA 8.0.0
Device 0: NVIDIA Corporation  GeForce GTX 960
CL_DEVICE_OPENCL_C_VERSION: OpenCL C 1.2
CL_DEVICE_LOCAL_MEM_SIZE: 49,152
CL_DEVICE_LOCAL_MEM_TYPE: 1
CL_DEVICE_MAX_COMPUTE_UNITS: 8
CL_DEVICE_MAX_READ_IMAGE_ARGS: 256
CL_DEVICE_MAX_WRITE_IMAGE_ARGS: 16
CL_DEVICE_MAX_MEM_ALLOC_SIZE: 1,073,741,824
CL_DEVICE_ADDRESS_BITS: 64
CL_DEVICE_GLOBAL_MEM_CACHE_TYPE: 2
CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE: 128
CL_DEVICE_GLOBAL_MEM_CACHE_SIZE: 131,072
CL_DEVICE_GLOBAL_MEM_SIZE: 4,294,967,296
CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 65,536
CL_DEVICE_MAX_CONSTANT_ARGS: 9
CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3
CL_DEVICE_MAX_WORK_GROUP_SIZE: 1,024
CL_DEVICE_MAX_WORK_ITEM_SIZES: 1,024, 1,024, 64

Comments (4)