Transition value JSON from old to new flat format #539

suvayu · 2025-05-26T14:07:56Z

Migrate old JSON parameter_values into the new schema that is more like a flat table (for time_series, array, and map) and singular pyarrow compatible values for date_time, duration, and time_pattern.

No related issue

Checklist before merging

Documentation (also in Toolbox repo) is up-to-date
Release notes have been updated
Unit tests have been added/updated accordingly
Code has been formatted by black & isort
Unit tests pass

Authors

Since GH doesn't support setting multiple people as author in a PR, documenting it here

@OleMussmann, @suvayu

suvayu

Add some notes/questions as review comments.

spinedb_api/alembic/versions/a973ab537da2_reencode_parameter_values.py

TimePattern was implemented as annotated type for schema generation, however this is not distinguishable at runtime, so add an alternate dataclass implementation.

make columns instead of records from old format parameter_value

spinedb_api/compat/reencode_for_data_transition.py

- update only rows that need changes - batch row updates - convert types to `table` where necessary - add `transition_data` function override for debugging

Remove unnecessary nullable types, and unused union type (ValueTypes)

Do not use pandas as intermediate step, instead transform ourselves from record based to column based - easier for type inspection. TODO: factor out into specific for data transition and generally useful to inserting into spinedb from outside sources when using spinedb_api as a library.

soininen · 2025-09-01T14:12:28Z

I think we need to allow null values in ArrayIndex and the like to support uneven maps, i.e. this should be a valid table:

index 1	index 2	value
A	null	1.1
B	null	1.2
C	a	2.1
C	b	2.2

This raises the question whether we need the index arrays as separate types at all.

I also reintroduced custom conversion for some types into models.py to get the unit tests to pass. I am currently working on Toolbox to make it compatible with this branch and for that I need to_database() and from_database() to work.

suvayu · 2025-09-02T13:43:10Z

Hi,

Strictly speaking, the index column and value column distinction is not needed. But I would like to have them because then downstream code can make useful assumptions. But before I get into that, I think there's a misunderstanding here, mostly because I think I didn't document this anywhere. There is no requirement for all but the last column to be an index column. So this is acceptable:

col1	col2	value
A	null	1.1
B	null	1.2
C	a	2.1
C	b	2.2

It would mean col1 is index type, but col2 and value are just nullable arrays.

The reason this is useful, say, when converting to a dataframe (or something else in a user script), we can treat col1 as index, while excluding col2. Making that assumption won't require inspecting the contents of col2, the type can indicate this is safe.

Of course this is not feasible when working only with the parameter_value, but it is useful when you combine the other columns from the table.

Anyway, I think we should discuss this a bit further. I'll email you.

Cheers,

soininen · 2025-09-03T05:26:33Z

Strictly speaking, the index column and value column distinction is not needed. But I would like to have them because then downstream code can make useful assumptions. But before I get into that, I think there's a misunderstanding here, mostly because I think I didn't document this anywhere. There is no requirement for all but the last column to be an index column. So this is acceptable:

col1	col2	value
A	null	1.1
B	null	1.2
C	a	2.1
C	b	2.2

It would mean col1 is index type, but col2 and value are just nullable arrays.

These are very good points. I agree that the first index column (col1 in the example) should not be nullable. We should indeed keep the index column types.

spinedb_api/value_support.py

- Moved dump_db_value(), from_database_to_dimension_count(), join_value_and_type() and split_value_and_type() to a new module incomplete_values. - Added JSONConverter to importer's convert functions. This replaces the functionality where we tried to convert every parameter value string to parameter value.

The value JSON is not compatible with previous versions.

GAMS version check runs GAMS executable to resolve its version. The executable fails if executed in read-only directory, so we create a temporary directory to avoid that.

This reverts commit 31588a0.

soininen · 2025-09-29T05:55:37Z

spinedb_api/models.py

+
+
+# types
+class TimePeriod(str):


Could this class be replaced by typing.NewType?

TimePeriod = NewType("TimePeriod", str)

Ole Mussmann and others added 11 commits May 21, 2025 14:45

move compatibility scripts to their own folder

c5bbc0e

add parameter_value reencoding

f114d29

add alembic migration script

304b257

spinedb_api/compat/: remove unused methods

3ab46be

spinedb_api/compat/: fix type hints

0872ab0

spinedb_api/compat/: remove PEP 723 metadata

8bda858

spinedb_api/compat/: fix input type

06db77a

add developer's documentation for data transition

0c7dd8e

alembic: blacken reencode_parameter_values.py

a7f2140

alembic: partly handle type column in reencode_parameter_values.py

463767d

models.py: remove bytes, since pa.UnionArray is now supported

3533c2c

suvayu commented May 26, 2025

View reviewed changes

spinedb_api/alembic/versions/a973ab537da2_reencode_parameter_values.py Outdated Show resolved Hide resolved

spinedb_api/alembic/versions/a973ab537da2_reencode_parameter_values.py Outdated Show resolved Hide resolved

suvayu added 5 commits May 29, 2025 15:00

models.py: remove last remaining bits for bytes support

7baf441

models.py: fix TimePattern

c5a41a2

TimePattern was implemented as annotated type for schema generation, however this is not distinguishable at runtime, so add an alternate dataclass implementation.

models.py: add 'any_array' to support nullable mixed types

acff451

data_transition: fix missing import TimePattern

4841a6f

data_transition: alternate implementation

287851d

make columns instead of records from old format parameter_value

OleMussmann reviewed Jun 2, 2025

View reviewed changes

spinedb_api/compat/reencode_for_data_transition.py Outdated Show resolved Hide resolved

Ole Mussmann and others added 12 commits June 2, 2025 21:05

alembic: improve data migration script

8ce8086

- update only rows that need changes - batch row updates - convert types to `table` where necessary - add `transition_data` function override for debugging

models.py: remove last remaining bits for bytes support

7c2e269

models.py: fix 'any_array'

1c6001d

models.py: cleanup type aliases

47d79a5

Remove unnecessary nullable types, and unused union type (ValueTypes)

models.py: use pydantic dataclasses

fa25066

compat: make warning in make_columns more prominent

8d480ac

data-transition: use relativedelta instead of pandas.DateOffset

80749bd

models.py: model duration with relativedelta from dateutil

57b78c4

compat/encode.py: change sentinel implementation

4b8ea09

compat/encode.py: fix column to array conversion for all types

ae1f30f

compat/encode.py: drop all conversion from dataframe

4c8a990

suvayu and others added 3 commits September 1, 2025 12:12

compat/converters.py: add pa.MonthDayNano -> duration/relativedelta

143f367

compat/converters.py: bug fix pa.MonthDayNano -> intermediate dict

01fbd13

Allow null values in index arrays, fix unit tests

6e4a3e9

Remove unused test_data_transition.py

89f3808

soininen mentioned this pull request Sep 2, 2025

Add unit tests #563

Merged

5 tasks

soininen and others added 2 commits September 3, 2025 14:44

Merge branch 'master' into WIP-data-transition

78002e6

value_support.py: backwards compat fix load_db_value signature

1ad08e5

soininen reviewed Sep 9, 2025

View reviewed changes

spinedb_api/value_support.py Outdated Show resolved Hide resolved

soininen mentioned this pull request Sep 11, 2025

Transition to new flat JSON format spine-tools/Spine-Toolbox#3170

Open

5 tasks

soininen changed the title ~~Transition JSON parameter_value from old to new flat format~~ Transition value JSON from old to new flat format Sep 11, 2025

soininen mentioned this pull request Sep 11, 2025

Transition to new flat JSON spine-tools/spine-items#277

Open

5 tasks

soininen added 5 commits September 11, 2025 17:01

Merge branch 'master' into WIP-data-transition

deaa13a

Bump DB server version to 9

482c322

The value JSON is not compatible with previous versions.

Merge branch 'master' into WIP-data-transition

67537c2

Fix GAMS version check when current work directory is read-only

31588a0

GAMS version check runs GAMS executable to resolve its version. The executable fails if executed in read-only directory, so we create a temporary directory to avoid that.

Revert "Fix GAMS version check when current work directory is read-only"

a6085da

This reverts commit 31588a0.

This comment was marked as resolved.

Sign in to view

soininen added 4 commits September 17, 2025 09:07

Fix Alembic migration

d3cb47b

Merge branch 'master' into WIP-data-transition

2bb0da9

Merge branch 'master' into WIP-data-transition

fbeda3e

Fix unit tests for Alembic migrations

24eb6d0

soininen reviewed Sep 29, 2025

View reviewed changes

soininen added 4 commits October 10, 2025 17:40

Merge branch 'master' into WIP-data-transition

faa8a51

Store leaf TimeSeries indices to last column when expanding Maps.

cfeac47

Rework Map.from_arrow() to handle more corner-cases

4159c5d

Add support for converting Map's leafs from/to Arrays

cfd4475

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Transition value JSON from old to new flat format #539

Transition value JSON from old to new flat format #539

Uh oh!

suvayu commented May 26, 2025 •

edited

Loading

Uh oh!

suvayu left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

soininen commented Sep 1, 2025 •

edited

Loading

Uh oh!

suvayu commented Sep 2, 2025 •

edited by soininen

Loading

Uh oh!

soininen commented Sep 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

This comment was marked as resolved.

soininen Sep 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Transition value JSON from old to new flat format #539

Are you sure you want to change the base?

Transition value JSON from old to new flat format #539

Uh oh!

Conversation

suvayu commented May 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist before merging

Authors

Uh oh!

suvayu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

soininen commented Sep 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

suvayu commented Sep 2, 2025 • edited by soininen Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

soininen commented Sep 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

This comment was marked as resolved.

soininen Sep 29, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

suvayu commented May 26, 2025 •

edited

Loading

soininen commented Sep 1, 2025 •

edited

Loading

suvayu commented Sep 2, 2025 •

edited by soininen

Loading

soininen commented Sep 3, 2025 •

edited

Loading