[PECOBLR-1121] Arrow patch to circumvent Arrow issues with JDk 16+ by tejassp-db · Pull Request #1243 · databricks/databricks-jdbc

tejassp-db · 2026-03-02T10:33:44Z

The Databricks server returns query results in Apache Arrow format for easy cross-language interoperability. The JDBC driver runs into compatibility issues on JDK 16 and later when processing Arrow results.

The problem arises from the stricter encapsulation of internal APIs in newer Java versions, which breaks the way the driver consumes Arrow-format results through the Apache Arrow library. The JDBC driver is used in partner solutions where control over the runtime environment is not available, so the workaround of setting JVM arguments is not feasible.

This PR patches some of the Arrow code to provide alternative JVM heap-based byte allocators that do not rely on native, MemoryUtil-based direct reads from off-heap memory. The implementation uses the native Arrow code path when feasible and falls back to the patched code otherwise.
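The "native if feasible, else patched" decision could be sketched roughly as below. This is only an illustration: the class `ArrowPathSelector`, its enum, and the choice to probe reflective access to `java.nio.Buffer` are assumptions, not the detection logic the PR actually ships.

```java
import java.lang.reflect.Field;
import java.nio.Buffer;

/** Hypothetical helper for illustration; not a class from the driver. */
final class ArrowPathSelector {

    enum Path { NATIVE, HEAP_PATCH }

    /**
     * Probes whether reflection into java.nio.Buffer is permitted.
     * On JDK 16+ this is expected to fail unless the JVM was started with
     * --add-opens=java.base/java.nio=ALL-UNNAMED.
     */
    static boolean canOpenNioInternals() {
        try {
            Field address = Buffer.class.getDeclaredField("address");
            // Throws InaccessibleObjectException (a RuntimeException) when
            // the java.nio package is not opened to this code.
            address.setAccessible(true);
            return true;
        } catch (ReflectiveOperationException | RuntimeException e) {
            return false;
        }
    }

    /** Prefers the native path and falls back to the heap-based patch. */
    static Path choose() {
        return canOpenNioInternals() ? Path.NATIVE : Path.HEAP_PATCH;
    }
}
```

Calling `ArrowPathSelector.choose()` once at startup and caching the result would avoid repeating the reflective probe for every result set.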

All the code has been tested for read compatibility with all Arrow types, latency benchmarks have been run, and automated tests have been added.

In the course of this change it also became necessary to convert the project into a multi-module Maven project.

Patch Arrow to create a Databricks ArrowBuf which allocates memory on the heap and provides access to it through Java methods. This removes the need to specify "--add-opens=java.base/java.nio=ALL-UNNAMED" as JVM args for JDK 16+.
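The idea of a heap-allocated buffer accessed purely through Java methods can be sketched as follows. `HeapBuf` is a hypothetical, heavily reduced stand-in and does not reproduce the real Databricks ArrowBuf API; it only shows that a `byte[]`-backed `ByteBuffer` needs neither `Unsafe` nor reflective access to `java.nio` internals.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

/** Hypothetical heap-backed buffer; illustrates the idea only. */
final class HeapBuf {
    private final ByteBuffer data;

    HeapBuf(int capacity) {
        // ByteBuffer.allocate returns a heap buffer backed by a byte[].
        // Little-endian order is used because Arrow buffers are little-endian by default.
        this.data = ByteBuffer.allocate(capacity).order(ByteOrder.LITTLE_ENDIAN);
    }

    int capacity() { return data.capacity(); }

    // Absolute accessors: bounds-checked by ByteBuffer, no position side effects.
    void setInt(int index, int value) { data.putInt(index, value); }
    int getInt(int index) { return data.getInt(index); }
    void setLong(int index, long value) { data.putLong(index, value); }
    long getLong(int index) { return data.getLong(index); }
    byte getByte(int index) { return data.get(index); }
}
```

Because the memory is ordinary heap memory, out-of-range access surfaces as an `IndexOutOfBoundsException` rather than undefined behaviour, and the garbage collector reclaims the buffer.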

Added tests to validate Arrow patch code paths. Added Maven profiles to validate the behaviour across JVM versions and with/without "--add-opens=java.base/java.nio=ALL-UNNAMED" JVM arguments. By default, JVM version 11 is assumed. To use other JVM versions, the toolchain needs to be set up to point to the correct Java installations on the local machine in .m2/toolchains.xml.
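When a test runs under several profiles, it can help to record which JVM version and flags are actually in effect. The helper below is a small sketch under that assumption; `JvmInfo` and its method names are hypothetical and not part of the PR.

```java
import java.lang.management.ManagementFactory;
import java.util.List;

/** Hypothetical test helper; reports the JVM configuration in effect. */
final class JvmInfo {
    static final String ADD_OPENS = "--add-opens=java.base/java.nio=ALL-UNNAMED";

    /** Parses a specification version: "1.8" -> 8, "11" -> 11, "17" -> 17. */
    static int featureVersion(String specVersion) {
        String v = specVersion.startsWith("1.") ? specVersion.substring(2) : specVersion;
        int dot = v.indexOf('.');
        return Integer.parseInt(dot < 0 ? v : v.substring(0, dot));
    }

    static int currentFeatureVersion() {
        return featureVersion(System.getProperty("java.specification.version"));
    }

    /**
     * Checks only the single-token "--add-opens=..." form on the command line.
     * The two-token form and JAR manifest Add-Opens entries are not detected.
     */
    static boolean hasAddOpens(List<String> jvmArgs) {
        return jvmArgs.contains(ADD_OPENS);
    }

    static boolean currentJvmHasAddOpens() {
        return hasAddOpens(ManagementFactory.getRuntimeMXBean().getInputArguments());
    }
}
```

A test could then, for example, expect the patched path only when `currentFeatureVersion() >= 16` and `currentJvmHasAddOpens()` is false.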

Use native Arrow if available. Otherwise, fall back to the patched version.


tejassp-db · 2026-03-02T10:44:25Z

The current GitHub Actions checks won't pass because the existing workflows are set up for a single-module Maven project. I have a separate branch that enables these test runs, and I have run them from there.

samikshya-db

Verified that there are no additional changes other than the ones in #1180, #1162, #1161, #1160, #1156, and #1144 (these are already approved).

tejassp-db added 30 commits December 16, 2025 15:58

PECOBLR-1121 Patch Arrow to circumvent JVM args issue.

37d7d15

Patch Arrow to create a Databricks ArrowBuf which allocates memory on the heap and provides access to it through Java methods. This removes the need to specify "--add-opens=java.base/java.nio=ALL-UNNAMED" as JVM args for JDK 16+.

PECOBLR-1121 Use Arrow patch as fallback.

ffd6c1c

Use native Arrow if available. Otherwise, fall back to the patched version.

Merge branch 'PECOBLR-1121/arrow-patch/stack-0' into PECOBLR-1121/arrow-patch/stack-1

2e88cd7

PECOBLR-1121 Simplify patch code.

a78a597

Remove irrelevant reference counting in patch code. Patch code uses heap memory for arrow operations and reference counting is not required.

Merge branch 'PECOBLR-1121/arrow-patch/stack-0' into PECOBLR-1121/arrow-patch/stack-1

4efaf21

PECOBLR-1121 Minor refactor.

1654f74

Merge branch 'PECOBLR-1121/arrow-patch/stack-0' into PECOBLR-1121/arrow-patch/stack-1

c5b7620

PECOBLR-1121 Add unit tests for DatabricksArrowBuf.

9d028d6

Add unit tests for all public API.

PECOBLR-1121 Formatting.

30926b2

PECOBLR-1121 Add toolchain version in pom.xml.

34c7859

PECOBLR-1121 Fix todos and fixmes.

42422f1

Remove redundant todos for accounting.

Merge branch 'PECOBLR-1121/arrow-patch/stack-0' into PECOBLR-1121/arrow-patch/stack-1

ba2f85d

Merge branch 'PECOBLR-1121/arrow-patch/stack-2' into PECOBLR-1121/arrow-patch/stack-3

3e30c75

PECOBLR-1121 Add unit tests for all arrow patched classes.

c06b7e5

PECOBLR-1121 format code

e50fd34

PECOBLR-1121 JMH benchmark for Arrow parsing.

c308183

A JMH benchmark for Arrow parsing of patched and unpatched Arrow Buffers and Buffer allocators.

PECOBLR-1121 Convert to multi module project.

2133083

Convert the code to a multi-module project.
- Cleaner separation of JAR generation for the uber JAR and the normal/thin JAR with some patched Arrow changes.
- Test modules with tests for shaded JARs.

PECOBLR-1121 Tests for dependency shading.

b5829f4

Tests to verify that all dependencies are shaded as expected.
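A shading check of this kind usually boils down to scanning the entries of the built JAR and flagging classes outside an allow-list of package prefixes. The sketch below assumes that approach; `ShadingCheck` and the prefixes passed to it are hypothetical, and the PR's actual tests may work differently.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical shading check; prefixes are supplied by the caller. */
final class ShadingCheck {

    /**
     * Returns the class entries that are not under any allowed prefix,
     * i.e. classes that appear to have escaped relocation.
     */
    static List<String> unshadedClasses(List<String> jarEntries, List<String> allowedPrefixes) {
        List<String> offenders = new ArrayList<>();
        for (String entry : jarEntries) {
            if (!entry.endsWith(".class")) {
                continue; // resources and directories are ignored
            }
            boolean allowed = false;
            for (String prefix : allowedPrefixes) {
                if (entry.startsWith(prefix)) {
                    allowed = true;
                    break;
                }
            }
            if (!allowed) {
                offenders.add(entry);
            }
        }
        return offenders;
    }
}
```

The entry names can be collected from a real archive with `java.util.jar.JarFile`, for example by mapping `JarFile.stream()` to `JarEntry::getName`, and the test passes when the returned list is empty.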

PECOBLR-1121 Add tests for all data types.

e0e0af6

Add tests to handle all data types supported by Arrow.

PECOBLR-1121 Fix derive buffer

36c2d3d

Merge branch 'PECOBLR-1121/arrow-patch/stack-0' into PECOBLR-1121/arrow-patch/stack-1

0383b7b

PECOBLR-1121 Patch DecimalUtility.

dcdc49a

Patch DecimalUtility to not use unsafe methods to set decimal values on DatabricksArrowBuf.
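Writing a decimal without unsafe memory access can be done with plain byte-array operations. The sketch below assumes the usual Arrow decimal layout (fixed-width, little-endian, two's complement with sign extension); `HeapDecimalWriter` is a hypothetical name, and the code is not the patched DecimalUtility itself.

```java
import java.math.BigDecimal;
import java.math.BigInteger;

/** Hypothetical helper; writes decimals into a heap byte[] without Unsafe. */
final class HeapDecimalWriter {

    /** Writes the unscaled value at slot {@code index} as little-endian two's complement. */
    static void write(byte[] dest, int index, BigDecimal value, int byteWidth) {
        byte[] bigEndian = value.unscaledValue().toByteArray();
        if (bigEndian.length > byteWidth) {
            throw new IllegalArgumentException("value does not fit in " + byteWidth + " bytes");
        }
        int start = index * byteWidth;
        // Sign-extend with 0xFF for negative values and 0x00 otherwise.
        byte pad = (byte) (value.signum() < 0 ? 0xFF : 0x00);
        for (int i = 0; i < byteWidth; i++) {
            int src = bigEndian.length - 1 - i; // walk the big-endian bytes backwards
            dest[start + i] = src >= 0 ? bigEndian[src] : pad;
        }
    }

    /** Reads the slot back, reversing the byte order and applying the given scale. */
    static BigDecimal read(byte[] src, int index, int scale, int byteWidth) {
        byte[] bigEndian = new byte[byteWidth];
        int start = index * byteWidth;
        for (int i = 0; i < byteWidth; i++) {
            bigEndian[byteWidth - 1 - i] = src[start + i];
        }
        return new BigDecimal(new BigInteger(bigEndian), scale);
    }
}
```

For a 128-bit decimal the `byteWidth` would be 16, and the scale comes from the column's type rather than from the stored bytes.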

Merge branch 'PECOBLR-1121/arrow-patch/stack-0' into PECOBLR-1121/arrow-patch/stack-1

47e9ec3

Merge branch 'PECOBLR-1121/arrow-patch/stack-1' into PECOBLR-1121/arrow-patch/stack-3

180bfe5

Merge branch 'PECOBLR-1121/arrow-patch/stack-3' into PECOBLR-1121/arrow-patch/stack-4

c1f75ca

Merge branch 'PECOBLR-1121/arrow-patch/stack-4' into PECOBLR-1121/arrow-patch/stack-5

ecd629a

PECOBLR-1121 Add tests for more types.

44328bc

Add tests for Boolean, Null, Fixed size list, UTF-8 view, Binary view, list view, large list view types.

PECOBLR-1121 Don't fail on missing toolchains.

05478d3

Remove default profile of JDK 11. Do not fail on Github actions.

Merge branch 'PECOBLR-1121/arrow-patch/stack-1' into PECOBLR-1121/arrow-patch/stack-3

4890feb

tejassp-db added 22 commits February 4, 2026 16:13

PECOBLR-1121 Use try-with-resources in tests

ebb0b60

PECOBLR-1121 Add tests for empty buffer.

618b912

Merge branch 'PECOBLR-1121/arrow-patch/stack-3' into PECOBLR-1121/arrow-patch/stack-4

24d850f

Merge branch 'PECOBLR-1121/arrow-patch/stack-4' into PECOBLR-1121/arrow-patch/stack-5

7a5f8ca

Merge branch 'PECOBLR-1121/arrow-patch/stack-5' into PECOBLR-1121/arrow-patch/stack-6

8b3b065

PECOBLR-1121 Fix modification notice.

b6ce2b6

PECOBLR-1121 Fix warnings

ef6413c

PECOBLR-1121 Revert thin JAR release to manual trigger.

ff2bbd5

[PECOBLR-1729] Add telemetry for Arrow patch. (#1190)

2a7cc09

Add a boolean field to specify whether the patched Arrow code is being used in the JVM to parse Arrow responses.

Merge branch 'main' into PECOBLR-1121/arrow-patch/stack-0

72cb791

Merge branch 'PECOBLR-1121/arrow-patch/stack-0' into PECOBLR-1121/arrow-patch/stack-1

4e4441f

Merge branch 'PECOBLR-1121/arrow-patch/stack-1' into PECOBLR-1121/arrow-patch/stack-3

fe45d57

Merge branch 'PECOBLR-1121/arrow-patch/stack-3' into PECOBLR-1121/arrow-patch/stack-4

d026b2c

Merge branch 'PECOBLR-1121/arrow-patch/stack-4' into PECOBLR-1121/arrow-patch/stack-5

7318e61

PECOBLR-1121 Bump up version.

cb7ce9c

Merge branch 'PECOBLR-1121/arrow-patch/stack-5' into PECOBLR-1121/arrow-patch/stack-6

6ef5e96

PECOBLR-1121 Reduce test memory consumption.

b227226

PECOBLR-1121 Update Readme for multi-module project

8ea3ff9

PECOBLR-1121 Fix memory test

ebf707c

PECOBLR-1121 Use deepEquals for comparison in tests

d16c18e

PECOBLR-1121 exclude DatabricksArrowBuf from test coverage

01c8478

Merge branch 'main' into PECOBLR-1121/arrow-patch/stack-6

fb0e8e1

tejassp-db requested a review from gopalldb March 2, 2026 10:33

tejassp-db self-assigned this Mar 2, 2026

tejassp-db requested a review from samikshya-db March 2, 2026 10:50

PECOBLR-1121 Add to next change log

6060577

samikshya-db changed the title ~~PECOBLR-1121 Arrow patch to circumvent Arrow issues with JDk 16+~~ [PECOBLR-1121] Arrow patch to circumvent Arrow issues with JDk 16+ Mar 2, 2026

samikshya-db approved these changes Mar 2, 2026


tejassp-db merged commit 5dc1e5a into main Mar 2, 2026
15 of 16 checks passed

tejassp-db merged 161 commits into main from PECOBLR-1121/arrow-patch/stack-6
