QDB-10908 - Add Arrow query API support to Python API #108

vikonix · 2025-11-24T09:03:23Z

No description provided.

solatis

I haven't dug into the C++ part much yet, but I'm noticing there is a lot of "stuff" in this PR that's not actually part of the task. Perhaps the cleanups / improvements are necessary, but I'd prefer to keep this PR focused.

Secondly: the test approach is not great and causes a lot of duplication. I think the best approach is to bite the bullet immediately: integrate this directly into pandas/__init__.py, and make it a (temporary) flag with which the user can specify whether or not to use Arrow.

This will also ensure we verify that we can correctly accept and pull pandas dataframes without copies, in the same way we currently do with numpy.

tests/conftest.py

solatis · 2025-12-10T11:09:45Z

tests/test_arrow_batch_push.py

+import numpy as np
+import pytest
+
+import quasardb
+
+
+def _arrow_reader(timestamps, values):
+    pa = pytest.importorskip("pyarrow")
+
+    ts_array = pa.array(timestamps.astype("datetime64[ns]"), type=pa.timestamp("ns"))
+    value_array = pa.array(values, type=pa.float64())
+    batch = pa.record_batch([ts_array, value_array], names=["$timestamp", "value"])
+    return pa.RecordBatchReader.from_batches(batch.schema, [batch])
+
+
+def _create_arrow_table(connection, entry_name):
+    table_name = entry_name + "_arrow"
+    table = connection.table(table_name)
+
+    column = quasardb.ColumnInfo(quasardb.ColumnType.Double, "value")
+    table.create([column])
+
+    return table
+
+
+@pytest.mark.usefixtures("qdbd_connection")
+def test_batch_push_arrow_with_options(qdbd_connection, entry_name):
+    pa = pytest.importorskip("pyarrow")
+
+    table = _create_arrow_table(qdbd_connection, entry_name)
+
+    timestamps = np.array(
+        [
+            np.datetime64("2024-01-01T00:00:00", "ns"),
+            np.datetime64("2024-01-01T00:00:01", "ns"),
+        ],
+        dtype="datetime64[ns]",
+    )


I don't like this.

I would approach this very differently:

reuse the existing bulk reader type, but make the mechanism by which it is read parametrized

right now there's a lot, a lot of duplication of test logic.

alternatively (and preferably): wire this into pandas. make the push mechanism a (temporary?) optional flag, so that we can differentiate between the two modes. then use that as a parameter for parametrized testing.

that way you automagically hook into the hundreds if not thousands of different tests we have for pandas and numpy

this means it needs to be wired into numpy/__init__.py first, and then in pandas/__init__.py.
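
To illustrate the shape of this, here is a minimal sketch of a single write entry point whose push mechanism is selected by a flag, with one shared test body run once per mode. Every name in it (`write_rows`, `use_arrow`, `_push_numpy`, `_push_arrow`, `check_roundtrip`) is a hypothetical stand-in, not the quasardb API, and the two backends are plain-Python fakes so that the example runs on its own.

```python
from typing import Callable, Dict, List, Tuple

# All names below are hypothetical stand-ins, not the quasardb API.
Row = Tuple[int, float]  # (timestamp in ns since epoch, value)


def _push_numpy(store: List[Row], rows: List[Row]) -> None:
    """Stand-in for the existing numpy-based push path."""
    store.extend(rows)


def _push_arrow(store: List[Row], rows: List[Row]) -> None:
    """Stand-in for the new Arrow-based push path."""
    store.extend(rows)


_PUSH_BACKENDS: Dict[bool, Callable[[List[Row], List[Row]], None]] = {
    False: _push_numpy,
    True: _push_arrow,
}


def write_rows(store: List[Row], rows: List[Row], *, use_arrow: bool = False) -> None:
    """Single entry point; the (temporary) flag selects the push mechanism."""
    _PUSH_BACKENDS[use_arrow](store, rows)


def check_roundtrip(use_arrow: bool) -> List[Row]:
    """Body of one shared test, identical for both push modes."""
    store: List[Row] = []
    rows = [
        (1_704_067_200_000_000_000, 1.5),  # 2024-01-01T00:00:00
        (1_704_067_201_000_000_000, 2.5),  # 2024-01-01T00:00:01
    ]
    write_rows(store, rows, use_arrow=use_arrow)
    assert store == rows
    return store


# Run the same check once per push mode.
results = {mode: check_roundtrip(mode) for mode in (False, True)}
```

In the actual suite the final loop would presumably become a pytest parameter instead, for example a fixture with `params=[False, True]` in the same style as the existing parametrized fixtures in tests/conftest.py, so that every existing pandas and numpy test runs in both modes without duplicating test logic.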

solatis · 2025-12-10T13:13:12Z

setup.py

    keywords="quasardb timeseries database API driver ",
    setup_requires=[],
-    install_requires=["numpy"],
+    install_requires=["numpy", "PyArrow"],


Do we really have it as a hard dependency now? Or can it be optional?
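
If it can be optional, one common pattern is to declare it as a setuptools extra and import it lazily at the point of use. A minimal sketch, assuming a hypothetical extra named `arrow` and a hypothetical helper `optional_import` (neither exists in this PR):

```python
import importlib
from types import ModuleType

# setup.py side (assumption: the extra is called "arrow"):
#     install_requires=["numpy"],
#     extras_require={"arrow": ["pyarrow"]},


def optional_import(module_name: str, extra: str) -> ModuleType:
    """Import an optional dependency, or fail with an install hint.

    Hypothetical helper: `extra` is the name of the extras_require entry
    that provides `module_name`.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        raise ImportError(
            f"{module_name} is required for this feature; "
            f"install it with: pip install quasardb[{extra}]"
        ) from exc
```

The Arrow code paths would then call `optional_import("pyarrow", "arrow")` when first used, so that users who only need the numpy path are not forced to install pyarrow.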

solatis · 2025-12-10T13:13:43Z

tests/conftest.py

 ):

    index = pd.Index(
+        # pd.date_range(start_date, periods=row_count, freq="s"), name="$timestamp"


What's this?

tests/test_numpy.py: 634 warnings
  D:\work\quasar\qdb-api-python\tests\conftest.py:685: FutureWarning: 'S' is deprecated and will be removed in a future version, please use 's' instead.
    pd.date_range(start_date, periods=row_count, freq="S"), name="$timestamp"

solatis · 2025-12-10T13:13:56Z

tests/conftest.py

    return request.param


+# @pytest.fixture(params=["s"], ids=["frequency=s"])


What's this?

vikonix added 30 commits November 20, 2025 18:14

fix warnings

85088f1

revert s 1

e155e3b

variant 2

fdd991f

add Arrow 1

fa6a636

arrow 2

42b6cc8

cosmetic

78b6f2a

off arrow

4797f5e

add depend

6d0a8d9

fix name

df04b1e

fix capsule

c96db3e

cut off "$timestamp"

7d8143e

formatting

0cad78c

maybe fix

d5d5ae2

fix stub

c557e37

?fix 3

a7722d5

? fix 4

6a7ca8f

?fix 5

95d5de5

fix version

e036c6e

fix fix

67b877b

test

0f14b8a

fix reader tests

ab98f8b

add arrow batch

638a9a4

register arrow writer

9c08e4f

validation

dd9426d

fix ownership

7482652

cosmetic

a42339b

batch push arrow test

94d7a7a

validation

0f5769c

more tests

868a78d

validation

966e140

vikonix added 4 commits December 5, 2025 18:12

fix test

aea6bb7

remove Schema

7cdd5a6

check this

cc57cb5

revert

df9b4b6

vikonix requested a review from solatis December 9, 2025 08:25

vikonix marked this pull request as ready for review December 9, 2025 22:37

solatis reviewed Dec 10, 2025

vikonix added 5 commits December 10, 2025 13:47

revert 1

32d9b24

revert 2

7749d35

cosmetic

ab7dff2

revert 3

00e68ad

cosmetic

a8d0e5f

solatis reviewed Dec 10, 2025

vikonix added 7 commits December 11, 2025 18:21

panda tests

b249687

fix tests

2c4fde7

validation

d1e5536

validation 2

9b4912f

validation 3

1ec2bd8

metrics

4ee0770

fix 1

493363c
