
Conversation


@zarna1parekh zarna1parekh commented Jul 30, 2025

Summary

Add sanity checks before upload and after download of chunks:

  1. Check the Lucene directory for corrupted data.
  2. Check the number of files, and the size of each file, both locally and on S3.

Requirements

@zarna1parekh zarna1parekh force-pushed the zparekh/debug_cache_nodes branch from 0a046bb to 6ef31f7 Compare August 12, 2025 18:17
@zarna1parekh zarna1parekh force-pushed the zparekh/debug_cache_nodes branch from 6ef31f7 to ac66b41 Compare August 12, 2025 18:49
@zarna1parekh zarna1parekh changed the title Adding log statement + removing eviction on exception for debugging p… Adding additional checks during upload and download of chunk Aug 13, 2025
Comment on lines +201 to +219
    assert prefix != null && !prefix.isEmpty();

    ListObjectsV2Request listRequest = builder().bucket(bucketName).prefix(prefix).build();
    ListObjectsV2Publisher asyncPaginatedListResponse =
        s3AsyncClient.listObjectsV2Paginator(listRequest);

    Map<String, Long> filesListWithSize = new HashMap<>();
    try {
      asyncPaginatedListResponse
          .subscribe(
              listResponse ->
                  listResponse
                      .contents()
                      .forEach(s3Object -> filesListWithSize.put(s3Object.key(), s3Object.size())))
          .get();
    } catch (InterruptedException | ExecutionException e) {
      throw new RuntimeException(e);
    }
    return filesListWithSize;


you could extract a helper method that would be used by both listFiles... methods that takes a prefix and a Consumer. Then this could look like the following.

Also, the block passed to subscribe could be called in multiple threads, so this should use a storage class that is safe wrt concurrent modifications.

Suggested change
-    assert prefix != null && !prefix.isEmpty();
-    ListObjectsV2Request listRequest = builder().bucket(bucketName).prefix(prefix).build();
-    ListObjectsV2Publisher asyncPaginatedListResponse =
-        s3AsyncClient.listObjectsV2Paginator(listRequest);
-    Map<String, Long> filesListWithSize = new HashMap<>();
-    try {
-      asyncPaginatedListResponse
-          .subscribe(
-              listResponse ->
-                  listResponse
-                      .contents()
-                      .forEach(s3Object -> filesListWithSize.put(s3Object.key(), s3Object.size())))
-          .get();
-    } catch (InterruptedException | ExecutionException e) {
-      throw new RuntimeException(e);
-    }
-    return filesListWithSize;
+    Map<String, Long> filesWithSize = new ConcurrentHashMap<>();
+    listFilesAndDo(prefix, s3Object -> filesWithSize.put(s3Object.key(), s3Object.size()));
+    return filesWithSize;
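To make the concurrency point concrete, here is a minimal, self-contained sketch of the Consumer-based helper pattern the suggestion describes. The `listFilesAndDo` helper and the `Entry` record are illustrative stand-ins, not the SDK paginator; the point is that the accumulator handed to the callback is a `ConcurrentHashMap`, since the real subscriber may be invoked from multiple threads.

```java
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Consumer;

public class ListHelperSketch {
  // Stand-in for one listed S3 object: just a key and a size.
  record Entry(String key, long size) {}

  // Hypothetical helper shared by both listFiles... methods: walks every
  // page and hands each entry to the supplied Consumer.
  static void listFilesAndDo(List<List<Entry>> pages, Consumer<Entry> action) {
    // The real SDK may invoke this callback from multiple threads,
    // which is why the caller's accumulator must be concurrency-safe.
    pages.forEach(page -> page.forEach(action));
  }

  static Map<String, Long> listFilesWithSize(List<List<Entry>> pages) {
    Map<String, Long> filesWithSize = new ConcurrentHashMap<>();
    listFilesAndDo(pages, e -> filesWithSize.put(e.key(), e.size()));
    return filesWithSize;
  }

  public static void main(String[] args) {
    List<List<Entry>> pages =
        List.of(
            List.of(new Entry("chunk1/_0.si", 128L)),
            List.of(new Entry("chunk1/_0.cfs", 4096L)));
    Map<String, Long> files = listFilesWithSize(pages);
    System.out.println(files.size());               // 2
    System.out.println(files.get("chunk1/_0.cfs")); // 4096
  }
}
```

With this shape, each `listFiles...` variant only supplies the Consumer; the pagination and subscription plumbing lives in one place.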

            .collect(
                Collectors.toMap(
                    path ->
                        dataDirectory.relativize(path).toString().replace(File.separator, "/"),


Hm. I think the replace isn't necessary since Files.list() only returns files in the current directory. Although, maybe you should use Path#getFileName().toString() here, which would align with calling Paths.get(s3Path).getFileName().toString() on the s3 entries below.
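A quick sketch of why `getFileName()` makes the two sides line up: applied to a local path and to an S3-style key, it yields the same last segment, so the map keys compare directly. The paths below are illustrative, not from the PR.

```java
import java.nio.file.Paths;

public class FileNameSketch {
  public static void main(String[] args) {
    // Last segment of a local file path.
    String local = Paths.get("/tmp/chunk-data/_0.si").getFileName().toString();
    // Last segment of an S3-style key, parsed the same way.
    String s3Key = Paths.get("abc123/_0.si").getFileName().toString();
    System.out.println(local);               // _0.si
    System.out.println(s3Key);               // _0.si
    System.out.println(local.equals(s3Key)); // true
  }
}
```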

    // validate the size of the uploaded files
    for (String fileName : filesToUpload) {
      String s3Path = String.format("%s/%s", chunkInfo.chunkId, fileName);
      long sizeOfFile = Files.size(Path.of(dirPath + "/" + fileName));


maybe use File.separator here instead of "/"?
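One way to sidestep separator handling entirely, sketched below with illustrative values: let `Path.of(dir, name)` join the segments, which uses the platform separator automatically (the hard-coded `"/"` is still correct for the S3 key, which is not a filesystem path).

```java
import java.nio.file.Path;

public class LocalPathSketch {
  public static void main(String[] args) {
    String dirPath = "data";   // illustrative local directory
    String fileName = "_0.si"; // illustrative file name

    // Path.of joins segments with the platform's File.separator,
    // so no manual "/" concatenation is needed for local paths.
    Path local = Path.of(dirPath, fileName);
    System.out.println(local.getFileName()); // _0.si
  }
}
```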

Comment on lines 294 to 295
"Mismatch for file %s in S3 and local directory of size %s for chunk %s",
s3Path, sizeOfFile, chunkInfo.chunkId));


It would be good to include the s3 file size here as well.
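For instance, a message carrying both sizes could look like the sketch below; the variable names mirror the snippet above, but `s3Size` and the values are illustrative.

```java
public class MismatchMessageSketch {
  public static void main(String[] args) {
    String s3Path = "abc123/_0.si"; // illustrative key
    long sizeOfFile = 128L;         // local size
    long s3Size = 120L;             // size reported by S3 (hypothetical variable)
    String chunkId = "abc123";

    // Including both sizes makes the direction of the mismatch obvious.
    String msg =
        String.format(
            "Mismatch for file %s: local size %d, S3 size %d for chunk %s",
            s3Path, sizeOfFile, s3Size, chunkId);
    System.out.println(msg);
  }
}
```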

Comment on lines +225 to +234
    String chunkId = UUID.randomUUID().toString();

    assertThat(blobStore.listFiles(chunkId).size()).isEqualTo(0);

    Path directoryUpload = Files.createTempDirectory("");
    Path foo = Files.createTempFile(directoryUpload, "", "");
    try (FileWriter fileWriter = new FileWriter(foo.toFile())) {
      fileWriter.write("Example test 1");
    }
    Path bar = Files.createTempFile(directoryUpload, "", "");


If you used a non-random chunkId and file names, you could have the assertion use more literals and it would be easier to follow.

Also, could you have one of the files have a different number of characters in it so it would be clear that they are different?
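A sketch of that setup with fixed names and contents of different lengths, so the later size assertions can use literals (the file names and second string here are illustrative, not from the PR):

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

public class DeterministicTestDataSketch {
  public static void main(String[] args) throws IOException {
    Path dir = Files.createTempDirectory("chunk-test");
    // Fixed names instead of random temp-file names.
    Path foo = dir.resolve("foo");
    Path bar = dir.resolve("bar");
    // Contents of different lengths, so the two files are distinguishable by size.
    Files.writeString(foo, "Example test 1");    // 14 bytes
    Files.writeString(bar, "Example test two!"); // 17 bytes
    System.out.println(Files.size(foo)); // 14
    System.out.println(Files.size(bar)); // 17
  }
}
```

Assertions against such a directory can then be literal, e.g. `isEqualTo(14)`, instead of comparing two opaque random files.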



4 participants