Implementation of contrast() seems wrong #109
I have created #108 for demonstration purposes. In short: the `mean` computed here
big_vision/big_vision/pp/autoaugment.py, lines 209 to 213 in 01edb81:

```python
# Compute the grayscale histogram, then compute the mean pixel value,
# and create a constant image size of that value. Use that as the
# blending degenerate target of the original image.
hist = tf.histogram_fixed_width(degenerate, [0, 255], nbins=256)
mean = tf.reduce_sum(tf.cast(hist, tf.float32)) / 256.0
```
is supposed to be the mean pixel value, but as written it merely sums the histogram counts (i.e. the total pixel count, height * width) and divides by 256. For the standard `decode_jpeg_and_inception_crop(224)`, I have verified that `mean` is always 224 * 224 / 256 = 196, regardless of image content. I have also created the following calibration grid to double-check the transform's behavior, with RGB values (192, 64, 64) for the reddish squares and (64, 192, 192) for the bluish squares:

[image: calibration grid of reddish and bluish squares]
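A grid like this can be reconstructed along the following lines (a minimal sketch; the 224x224 size, the checkerboard layout, and the 32-pixel squares are my assumptions, chosen so the spurious mean works out to 224 * 224 / 256 = 196 as above):

```python
import numpy as np
import tensorflow as tf

# Hypothetical reconstruction of the calibration grid: a 224x224 checkerboard
# of 32-pixel reddish and bluish squares.
reddish = np.array([192, 64, 64], dtype=np.uint8)
bluish = np.array([64, 192, 192], dtype=np.uint8)
ys, xs = np.meshgrid(np.arange(224), np.arange(224), indexing="ij")
checker = ((ys // 32 + xs // 32) % 2).astype(bool)
tf_color_tile = tf.constant(np.where(checker[..., None], bluish, reddish))
```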
As it is, `contrast(tf_color_tile, 1.9)` returns the following:

with RGB values (188, 0, 0) and (0, 188, 188).
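These values are consistent with the blending step, assuming the usual AutoAugment blend result = clip(mean + factor * (pixel - mean), 0, 255) and the spurious mean of 196 (both are inferences from the snippet above, not verified against the repo):

```python
import numpy as np

factor = 1.9
spurious_mean = 224 * 224 / 256  # = 196.0, regardless of image content
blend = lambda v: np.clip(spurious_mean + factor * (v - spurious_mean), 0, 255)
print(blend(192), blend(64))  # 188.4 and 0.0 -> (188, 0, 0) after uint8 cast
```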

with RGB values (249, 6, 6) and (6, 249, 249), which is more in line with other implementations: the grid's true grayscale mean is ~128, and 128 + 1.9 * (192 - 128) = 249.6 while 128 + 1.9 * (64 - 128) = 6.4. For example, the approximate torchvision equivalent
```python
from torchvision.transforms.v2 import functional as F

F.adjust_contrast(torch_color_tile, contrast_factor=1.9)
```

returns RGB values (250, 6, 6) and (6, 250, 250).
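For reference, a minimal sketch of a corrected mean computation, using `degenerate` from the snippet above (this is only my sketch; the actual change proposed in #108 may differ):

```python
import tensorflow as tf

# Weight each histogram bin by its pixel value and normalize by the total
# pixel count, instead of summing the counts and dividing by 256.
hist = tf.cast(tf.histogram_fixed_width(degenerate, [0, 255], nbins=256), tf.float32)
mean = tf.reduce_sum(hist * tf.range(256, dtype=tf.float32)) / tf.reduce_sum(hist)
# Equivalently: mean = tf.reduce_mean(tf.cast(degenerate, tf.float32))
```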