feat: add darkness filtering capability to dataloader #150

Copilot · 2025-08-12T10:43:36Z

Using print() statements for logging in production code is not recommended. Consider using a proper logging framework like Python's logging module to allow for better control over log levels and output destinations.

Suggested change

print(

logging.info(

Copilot · 2025-08-12T10:43:35Z

The DarknessFilter is always instantiated even when darkness_threshold is 0.0 (disabled). Consider conditionally adding the filter only when darkness_threshold > 0.0 to avoid unnecessary processing overhead.

Suggested change

]

]

if darkness_threshold > 0.0:

operations.append(

DarknessFilter(

darkness_threshold=darkness_threshold

)

)

operations.append(

grain.transforms.Batch(batch_size=per_process_batch_size, drop_remainder=True)

)

-Original file line number
+Diff line change
@@ Expand Up / @@ -59,6 +59,8 @@ class Args: @@
         param_dtype = jnp.float32
         dtype = jnp.bfloat16
         use_flash_attention: bool = True
+        # Additional parameters
+        darkness_threshold: float = 0.0
     args = tyro.cli(Args)
@@ Expand Down Expand Up @@
             num_workers=0,
             prefetch_buffer_size=1,
             seed=args.seed,
+            darkness_threshold=args.darkness_threshold,
         )
         dataloader = iter(dataloader)
         video_batch_BSHWC = next(dataloader)
@@ Expand Down @@

-Original file line number
+Diff line change
@@ Expand Up / @@ -44,6 +44,7 @@ class Args: @@
         )
         warmup_steps: int = 5000
         lr_schedule: str = "wsd"  # supported options: wsd, cos
+        darkness_threshold: float = 0.0
         # Tokenizer
         tokenizer_dim: int = 512
         tokenizer_ffn_dim: int = 2048
@@ Expand Down Expand Up @@
             # The dataloader shards the dataset across all processes
             args.batch_size,
             *image_shape,
+            darkness_threshold=args.darkness_threshold,
             num_workers=8,
             prefetch_buffer_size=1,
             seed=args.seed,
@@ Expand Down @@

-Original file line number
+Diff line change
@@ Expand Up / @@ -46,6 +46,7 @@ class Args: @@
         warmup_steps: int = 5000
         lr_schedule: str = "wsd"  # supported options: wsd, cos
         vq_reset_thresh: int = 50
+        darkness_threshold: float = 0.0
         # LAM
         model_dim: int = 512
         ffn_dim: int = 2048
@@ Expand Down Expand Up / @@ -297,6 +298,7 @@ def loss_fn( @@
             # The dataloader shards the dataset across all processes
             args.batch_size,
             *image_shape,
+            darkness_threshold=args.darkness_threshold,
             num_workers=8,
             prefetch_buffer_size=1,
             seed=args.seed,
@@ Expand Down @@

-Original file line number
+Diff line change
@@ Expand Up / @@ -45,6 +45,7 @@ class Args: @@
         )
         lr_schedule: str = "wsd"  # supported options: wsd, cos
         warmup_steps: int = 10000
+        darkness_threshold: float = 0.0
         # Tokenizer
         model_dim: int = 512
         ffn_dim: int = 2048
@@ Expand Down Expand Up @@
             # The dataloader shards the dataset across all processes
             args.batch_size,
             *image_shape,
+            darkness_threshold=args.darkness_threshold,
             num_workers=8,
             prefetch_buffer_size=1,
             seed=args.seed,
@@ Expand Down @@

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add darkness filtering capability to dataloader #150

Uh oh!

Diff view

Diff view

There are no files selected for viewing

Copilot AI Aug 12, 2025

Uh oh!

Copilot AI Aug 12, 2025

Uh oh!

feat: add darkness filtering capability to dataloader #150

Are you sure you want to change the base?

Uh oh!

feat: add darkness filtering capability to dataloader #150

Uh oh!

Uh oh!

Diff view

Diff view

There are no files selected for viewing

Copilot AI Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!