Dataset Open Access

# Subjective human thresholds over computer generated images

Jérôme Buisine; Samuel Delepoulle; Rémi Synave; Christophe Renaud

##### Thesis supervisor(s) Christophe Renaud; Samuel Delepoulle

Realistic image computation mimics the natural process of acquiring pictures by simulating the physical interactions of light between all the objects, lights and cameras lying within a modelled 3D scene. This process is known as global illumination and was formalised by Kajiya with the following rendering equation:
$$L_o(x, \omega_o) = L_e(x, \omega_o) + \int_{\Omega} L_i(x, \omega_i) \cdot f_r(x, \omega_i \rightarrow \omega_o) \cdot \cos \theta_i \, d\omega_i$$

where:

• $$L_o(x, \omega_o)$$ is the luminance travelling from point $$x$$ in direction $$\omega_o$$;
• $$L_e(x, \omega_o)$$ is the luminance emitted at point $$x$$ (it is null if point $$x$$ does not lie on a light source surface);
• the integral represents the set of luminances $$L_i$$ incident at $$x$$ from the hemisphere of directions $$\Omega$$ and reflected in the direction $$\omega_o$$. The reflected luminances are weighted by the material's reflecting properties (the bidirectional reflectance distribution function $$f_r(x, \omega_i \rightarrow \omega_o)$$) and the cosine of the incident angle $$\theta_i$$.

This equation cannot be solved analytically, so Monte Carlo approaches are generally used to estimate the pixel values of the final image.
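As an illustration of the Monte Carlo idea, the following minimal sketch estimates the reflection integral of the rendering equation at a single point. It is not the pbrt implementation: it assumes a Lambertian surface (constant $$f_r$$ equal to albedo divided by $$\pi$$), a constant incident luminance, and uniform sampling of the hemisphere. The function name and its parameters are hypothetical. Under these assumptions the exact value of the integral is the albedo times the incident luminance, so the convergence of the estimate can be checked directly.

```python
import math
import random


def estimate_reflected_luminance(n_samples, albedo=0.5, incident=1.0, seed=0):
    """Monte Carlo estimate of the reflection integral at one point.

    Assumptions (not from the dataset description): Lambertian BRDF,
    constant incident luminance, uniform hemisphere sampling.
    """
    rng = random.Random(seed)
    f_r = albedo / math.pi        # Lambertian BRDF, constant over directions
    pdf = 1.0 / (2.0 * math.pi)   # density of uniform sampling over the hemisphere
    total = 0.0
    for _ in range(n_samples):
        # For directions drawn uniformly over the hemisphere,
        # cos(theta_i) is uniformly distributed in [0, 1).
        cos_theta = rng.random()
        total += incident * f_r * cos_theta / pdf
    return total / n_samples


# Few samples give a noisy estimate; many samples approach albedo * incident.
noisy = estimate_reflected_luminance(20)
converged = estimate_reflected_luminance(10000)
```

The same behaviour, applied per pixel, is what produces visibly noisy images at low sample counts and nearly converged images at high sample counts.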

The proposed dataset is composed of 80 points of view of photorealistic images, each rendered at different numbers of samples (following the Monte Carlo approach). Each image is 800 x 800 pixels in size. The noisiest image is rendered with 20 samples and the reference image (the most converged image obtained) with 10000 samples. The pbrt rendering engine (version 3) was used to generate these images.

These levels of samples, and therefore the levels of noise perceptible in the images, were used to collect average subjective human thresholds. For this purpose, the images of each point of view were divided into 16 areas of 200 x 200 pixels.
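The division into zones can be sketched as follows. The description does not state how the 16 zones are numbered, so the row-major order used here (left to right, then top to bottom) is an assumption, and the function name `zone_boxes` is hypothetical. Each box is given as (left, top, right, bottom) in pixels, a layout that many image libraries accept for cropping.

```python
def zone_boxes(width=800, height=800, zone_size=200):
    """Return the pixel boxes of the square zones covering an image.

    Boxes are (left, top, right, bottom) tuples. The row-major ordering
    is an assumption; the dataset description does not specify it.
    """
    if width % zone_size or height % zone_size:
        raise ValueError("image size must be a multiple of zone_size")
    boxes = []
    for top in range(0, height, zone_size):
        for left in range(0, width, zone_size):
            boxes.append((left, top, left + zone_size, top + zone_size))
    return boxes


boxes = zone_boxes()
# An 800 x 800 image yields a 4 x 4 grid, i.e. 16 zones of 200 x 200 pixels.
```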

The proposed image database is composed of the following files:

• human-thresholds.csv: the set of human subjective thresholds obtained on 40 points of view. Each line is composed of the name of the point of view followed by the thresholds obtained for each of the 16 zones;
• SIN3D_dataset.tar.gz: an archive containing all the images from 20 to 10000 samples in steps of 20 samples for each point of view (i.e. 500 images per point of view). Each folder in the archive corresponds to a point of view.
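A minimal sketch for working with these two files is given below. It enumerates the sample levels described above and reads a thresholds file laid out as one point of view per line. The field delimiter is not stated in the description, so the semicolon default is an assumption to adjust after inspecting the file; the function names and the scene name `scene_a` are hypothetical.

```python
import csv
import io


def sample_levels(first=20, last=10000, step=20):
    """Sample counts available for each point of view (20, 40, ..., 10000)."""
    return list(range(first, last + 1, step))


def read_thresholds(text_stream, delimiter=";", n_zones=16):
    """Map each point-of-view name to its list of per-zone thresholds.

    The delimiter is an assumption; check it against the actual file.
    """
    thresholds = {}
    for row in csv.reader(text_stream, delimiter=delimiter):
        if not row:
            continue  # skip blank lines
        name = row[0].strip()
        values = [float(v) for v in row[1:] if v.strip()]
        if len(values) != n_zones:
            raise ValueError(f"{name}: expected {n_zones} thresholds, got {len(values)}")
        thresholds[name] = values
    return thresholds


# Usage with an in-memory stand-in for the real file (made-up values).
demo = io.StringIO("scene_a;" + ";".join(["100"] * 16) + "\n")
demo_thresholds = read_thresholds(demo)
levels = sample_levels()  # 500 levels per point of view
```

With the real file, the same function could be called on `open("human-thresholds.csv", newline="")` once the delimiter has been confirmed.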

This image database has been used to propose an objective model for noise detection in photorealistic computer-generated images (see the article associated with this image database).

Note: Some of the proposed scenes come from:

Funding: This research was funded by ANR support: project ANR-17-CE38-0009.

Files (35.6 GB)

| Name | MD5 | Size |
|---|---|---|
| human-thresholds.csv | 04d6de6e29b94a2b3a046aba6efd8630 | 4.3 kB |
| SIN3D_dataset.tar | f96942181a19420e818c1435863d3e10 | 35.6 GB |