`operations` table: Store the `operation_xdr` as BYTEA by aditya1702 · Pull Request #497 · stellar/wallet-backend

aditya1702 · 2026-02-05T21:26:14Z

What

Updates the operations.operation_xdr storage to use PostgreSQL BYTEA (raw XDR bytes) while preserving the GraphQL API contract by continuing to expose operationXdr as a base64-encoded String.

Changes:

Change operations.operation_xdr from TEXT to BYTEA in the schema/migration and update DB write paths accordingly.
Introduce types.XDRBytea to scan/store raw bytes and encode to base64 when presented as a string.
Update GraphQL schema/resolvers and tests to reflect the new storage/encoding behavior.
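
As a rough illustration of the second item, a byte-backed type can keep storage binary and base64-encode only when a string is requested. This is a minimal sketch rather than the PR's code: `operationXdrField` is a hypothetical stand-in for the GraphQL field resolver, and the sample bytes are arbitrary.

```go
package main

import (
	"encoding/base64"
	"fmt"
)

// XDRBytea holds raw XDR bytes as they would sit in a BYTEA column.
type XDRBytea []byte

// String renders the raw bytes as standard base64 for external consumers.
func (x XDRBytea) String() string {
	return base64.StdEncoding.EncodeToString(x)
}

// operationXdrField is a hypothetical stand-in for the GraphQL field
// resolver: storage stays binary, the API response stays a base64 string.
func operationXdrField(stored XDRBytea) string {
	return stored.String()
}

func main() {
	stored := XDRBytea{0x00, 0x00, 0x00, 0x01}
	fmt.Println(operationXdrField(stored)) // prints AAAAAQ==
}
```

Encoding at the API boundary means the database never pays for the base64 expansion, while API consumers see the same string as before.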

Why

Storage optimization for our DB: storing raw XDR bytes in BYTEA avoids the roughly 33% size overhead of base64 text.

Known limitations

N/A

Issue that this PR addresses

Closes #492


- Update createTestOperation to use proper base64 XDR
- Update generateTestOperations to encode test data as base64
- Update operations_test.go with base64 encoding for all test XDR data
- Fix assertions to compare against the stored XDRBytea values


Copilot

Pull request overview

Updates the operations.operation_xdr storage to use PostgreSQL BYTEA (raw XDR bytes) while preserving the GraphQL API contract by continuing to expose operationXdr as a base64-encoded String.

Changes:

Change operations.operation_xdr from TEXT to BYTEA in the schema/migration and update DB write paths accordingly.
Introduce types.XDRBytea to scan/store raw bytes and encode to base64 when presented as a string.
Update GraphQL schema/resolvers and tests to reflect the new storage/encoding behavior.

Reviewed changes

Copilot reviewed 15 out of 16 changed files in this pull request and generated 2 comments.


File	Description
internal/services/ingest_test.go	Updates ingest test fixtures to use `types.XDRBytea`.
internal/serve/graphql/schema/operation.graphqls	Forces resolver for `operationXdr` so GraphQL continues returning a base64 string.
internal/serve/graphql/resolvers/transaction_resolvers_test.go	Updates expectations for base64-encoded XDR output.
internal/serve/graphql/resolvers/test_utils.go	Stores operation XDR as raw bytes in test DB setup.
internal/serve/graphql/resolvers/queries_resolvers_test.go	Updates query resolver tests to compare base64 encoding of raw bytes.
internal/serve/graphql/resolvers/operation.resolvers.go	Adds `OperationXdr` field resolver to base64-encode `XDRBytea`.
internal/serve/graphql/resolvers/account_resolvers_test.go	Updates account resolver tests for base64-encoded XDR output.
internal/serve/graphql/generated/generated.go	Regenerates gqlgen output to wire the forced resolver.
internal/indexer/types/types.go	Adds `XDRBytea` type and updates `types.Operation.OperationXDR` to use it.
internal/indexer/processors/utils_test.go	Updates ConvertOperation test to compare raw bytes vs base64 string.
internal/indexer/processors/utils.go	Stores raw XDR bytes via `MarshalBinary()` instead of base64 strings.
internal/db/migrations/2025-06-10.3-operations.sql	Changes `operation_xdr` column type to `BYTEA` (but see migration concern).
internal/data/transactions_test.go	Updates test inserts to pass `XDRBytea` for `operation_xdr`.
internal/data/operations_test.go	Updates operation model tests for bytea storage and base64 string comparisons.
internal/data/operations.go	Updates batch insert/copy paths to send `operation_xdr` as `bytea`.
internal/data/accounts_test.go	Updates account model tests inserting operations to use `XDRBytea`.


internal/db/migrations/2025-06-10.3-operations.sql

internal/serve/graphql/schema/operation.graphqls

aristidesstaffieri · 2026-02-25T18:50:41Z

Code Review — Unsafe type assertions on `.Value()` results

Severity: High (latent panic)

In 4 locations, .Value() results are type-asserted with .([]byte) without a nil guard. Both AddressBytea.Value() and HashBytea.Value() return (nil, nil) for empty strings, causing a panic on nil.([]byte).

Locations:

wallet-backend/internal/data/operations.go

Lines 312 to 314 in b84442d

    }
    stellarAddressBytes = append(stellarAddressBytes, addrBytes.([]byte))
}

wallet-backend/internal/data/transactions.go

Lines 208 to 210 in b84442d

}
hashes[i] = hashBytes.([]byte)
toIDs[i] = t.ToID

wallet-backend/internal/data/transactions.go

Lines 229 to 231 in b84442d

    }
    stellarAddressBytes = append(stellarAddressBytes, addrBytes.([]byte))
}

wallet-backend/internal/data/statechanges.go

Lines 221 to 223 in b84442d

}
accountIDBytes[i] = addrBytes.([]byte)

The safe pattern already exists in query_utils.go:77:

if val == nil {
    return nil, nil
}
return val.([]byte), nil

While current production paths filter empty values upstream, BatchInsert is a public API callable from tests and backfill paths.
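
One way to apply that guard at all four call sites is a small shared helper. The sketch below is an assumption about how it could look: `bytesFromValuer` and `emptyAware` are hypothetical names, and `emptyAware` only imitates the described behavior of `AddressBytea`/`HashBytea` returning `(nil, nil)` for empty input.

```go
package main

import (
	"database/sql/driver"
	"fmt"
)

// bytesFromValuer is a hypothetical helper: it calls Value() and converts the
// result to []byte without panicking on nil or on an unexpected type.
func bytesFromValuer(v driver.Valuer) ([]byte, error) {
	val, err := v.Value()
	if err != nil {
		return nil, fmt.Errorf("converting value: %w", err)
	}
	if val == nil {
		return nil, nil // empty input maps to SQL NULL, not a panic
	}
	b, ok := val.([]byte)
	if !ok {
		return nil, fmt.Errorf("expected []byte, got %T", val)
	}
	return b, nil
}

// emptyAware imitates a Valuer that returns (nil, nil) for empty strings.
type emptyAware string

func (e emptyAware) Value() (driver.Value, error) {
	if e == "" {
		return nil, nil
	}
	return []byte(e), nil
}

func main() {
	b, err := bytesFromValuer(emptyAware(""))
	fmt.Println(b == nil, err) // prints true <nil>
}
```

The comma-ok form also turns an unexpected dynamic type into an error instead of a runtime panic.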

aristidesstaffieri · 2026-02-25T18:51:03Z

Code Review — Thresholds and SignerWeights resolvers silently map NULL to 0

Severity: Medium (incorrect data for GraphQL clients)

The Thresholds resolver never checks .Valid on sql.NullInt16 fields:

wallet-backend/internal/serve/graphql/resolvers/statechange.resolvers.go

Lines 275 to 280 in b84442d

// Formats the old/new threshold values as a JSON object for backward compatibility.
func (r *signerThresholdsChangeResolver) Thresholds(ctx context.Context, obj *types.SignerThresholdsStateChangeModel) (string, error) {
    // Format as {"old": "X", "new": "Y"} for backward compatibility with JSONB format
    // Values are stored as ints but returned as quoted strings in JSON (0-255 range)
    return fmt.Sprintf(`{"old": "%d", "new": "%d"}`, obj.ThresholdOld.Int16, obj.ThresholdNew.Int16), nil
}

When a threshold is first set (no prior state), ThresholdOld is NULL (Valid=false, Int16=0). Output: {"old": "0", "new": "5"} — clients cannot distinguish "was 0" from "did not exist."

The SignerWeights resolver has a similar issue — it returns nil only when both fields are NULL, but when one is NULL and the other valid, the NULL one renders as 0:

wallet-backend/internal/serve/graphql/resolvers/statechange.resolvers.go

Lines 238 to 247 in b84442d

// SignerWeights is the resolver for the signerWeights field.
// Formats the old/new signer weight values as a JSON object for backward compatibility.
func (r *signerChangeResolver) SignerWeights(ctx context.Context, obj *types.SignerStateChangeModel) (*string, error) {
    if !obj.SignerWeightOld.Valid && !obj.SignerWeightNew.Valid {
        return nil, nil
    }
    // Format as {"old": X, "new": Y} for backward compatibility with JSONB format
    result := fmt.Sprintf(`{"old": %d, "new": %d}`, obj.SignerWeightOld.Int16, obj.SignerWeightNew.Int16)
    return &result, nil
}

The correct pattern exists in TrustlineLimit (same file), which checks .Valid per-field and uses "null" for absent values:

wallet-backend/internal/serve/graphql/resolvers/statechange.resolvers.go

Lines 349 to 364 in b84442d

func (r *trustlineChangeResolver) Limit(ctx context.Context, obj *types.TrustlineStateChangeModel) (*string, error) {
    if !obj.TrustlineLimitOld.Valid && !obj.TrustlineLimitNew.Valid {
        return nil, nil
    }
    // Format old/new as JSON for backward compatibility with JSONB format
    oldVal := "null"
    newVal := "null"
    if obj.TrustlineLimitOld.Valid {
        oldVal = fmt.Sprintf(`"%s"`, obj.TrustlineLimitOld.String)
    }
    if obj.TrustlineLimitNew.Valid {
        newVal = fmt.Sprintf(`"%s"`, obj.TrustlineLimitNew.String)
    }
    result := fmt.Sprintf(`{"old": %s, "new": %s}`, oldVal, newVal)
    return &result, nil
}
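
Carrying the same per-field `.Valid` check over to the threshold values might look like the sketch below. The helper names (`nullableInt16JSON`, `formatThresholds`, `some`, `none`) are hypothetical. Emitting `null` also changes the payload for clients that currently receive `"0"`, so whether that is acceptable is a decision for the maintainers.

```go
package main

import (
	"database/sql"
	"fmt"
)

// nullableInt16JSON renders a sql.NullInt16 as a quoted JSON string, or as
// the JSON literal null when the column was NULL. Hypothetical helper.
func nullableInt16JSON(v sql.NullInt16) string {
	if !v.Valid {
		return "null"
	}
	return fmt.Sprintf(`"%d"`, v.Int16)
}

// formatThresholds builds the {"old": ..., "new": ...} payload with each
// side checked independently.
func formatThresholds(oldVal, newVal sql.NullInt16) string {
	return fmt.Sprintf(`{"old": %s, "new": %s}`,
		nullableInt16JSON(oldVal), nullableInt16JSON(newVal))
}

// some and none are small constructors used only by this example.
func some(n int16) sql.NullInt16 { return sql.NullInt16{Int16: n, Valid: true} }

var none = sql.NullInt16{}

func main() {
	fmt.Println(formatThresholds(none, some(5))) // prints {"old": null, "new": "5"}
	fmt.Println(formatThresholds(some(0), some(5)))
}
```

With this shape a client can tell a threshold that was previously 0 from one that did not exist.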

aristidesstaffieri · 2026-02-25T18:51:25Z

Code Review — Getter methods return internal maps without cloning (data race risk)

Severity: Medium (latent data race)

GetTransactionsParticipants() and GetOperationsParticipants() return direct references to internal maps while holding only a read lock. Once the caller has the reference and the lock is released, concurrent Push* calls that modify these maps can trigger Go's fatal error: concurrent map read and map write.

wallet-backend/internal/indexer/indexer_buffer.go

Lines 169 to 175 in b84442d

// GetTransactionsParticipants returns a map of transaction ToIDs to its participants.
func (b *IndexerBuffer) GetTransactionsParticipants() map[int64]set.Set[string] {
    b.mu.RLock()
    defer b.mu.RUnlock()
    return b.participantsByToID
}

wallet-backend/internal/indexer/indexer_buffer.go

Lines 349 to 355 in b84442d

// GetOperationsParticipants returns a map of operation IDs to its participants.
func (b *IndexerBuffer) GetOperationsParticipants() map[int64]set.Set[string] {
    b.mu.RLock()
    defer b.mu.RUnlock()
    return b.participantsByOpID
}

The safe pattern is already used by GetUniqueSEP41ContractTokensByID() in the same file, which returns maps.Clone(...):

wallet-backend/internal/indexer/indexer_buffer.go

Lines 590 to 597 in b84442d

// GetUniqueSEP41ContractTokensByID returns a map of unique SEP-41 contract IDs to their types.
// Thread-safe: uses read lock.
func (b *IndexerBuffer) GetUniqueSEP41ContractTokensByID() map[string]types.ContractType {
    b.mu.RLock()
    defer b.mu.RUnlock()
    return maps.Clone(b.uniqueSEP41ContractTokensByID)
}
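
One caveat when applying this to the participants getters: `maps.Clone` is a shallow copy, so when the map values are themselves mutable sets, the inner sets would still be shared with the buffer. The sketch below clones both levels. It uses a plain `map[string]struct{}` as a stand-in for the repository's `set.Set[string]`, and the `buffer`, `push`, and `snapshot` names are hypothetical.

```go
package main

import (
	"fmt"
	"maps"
	"sync"
)

// stringSet is a stand-in for the repository's set.Set[string].
type stringSet map[string]struct{}

type buffer struct {
	mu                 sync.RWMutex
	participantsByToID map[int64]stringSet
}

// push records a participant under the write lock.
func (b *buffer) push(toID int64, participant string) {
	b.mu.Lock()
	defer b.mu.Unlock()
	if b.participantsByToID[toID] == nil {
		b.participantsByToID[toID] = stringSet{}
	}
	b.participantsByToID[toID][participant] = struct{}{}
}

// snapshot returns a copy that is safe to read after the lock is released.
// The outer map and every inner set are copied.
func (b *buffer) snapshot() map[int64]stringSet {
	b.mu.RLock()
	defer b.mu.RUnlock()
	out := make(map[int64]stringSet, len(b.participantsByToID))
	for id, s := range b.participantsByToID {
		out[id] = maps.Clone(s)
	}
	return out
}

func main() {
	b := &buffer{participantsByToID: map[int64]stringSet{}}
	b.push(1, "alice")
	snap := b.snapshot()
	b.push(1, "bob") // later writes do not affect the snapshot
	fmt.Println(len(snap[1]), len(b.participantsByToID[1])) // prints 1 2
}
```

The extra copying costs allocations on every call, so it is a trade-off against returning a reference and documenting that callers must not use it concurrently with writers.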

aristidesstaffieri · 2026-02-25T18:54:53Z

Code Review — XDRBytea.Value() returns aliased slice (latent correctness hazard)

Severity: Low (defensive programming)

XDRBytea.Value() returns []byte(x) which shares the same backing array as the receiver. The companion Scan() method was already fixed in b84442d9 with make + copy to avoid pgx driver buffer reuse — Value() has the analogous risk in the write direction.

wallet-backend/internal/indexer/types/types.go

Lines 190 to 196 in b84442d

// Value implements driver.Valuer - returns raw bytes for BYTEA storage
func (x XDRBytea) Value() (driver.Value, error) {
    if len(x) == 0 {
        return nil, nil
    }
    return []byte(x), nil
}

The old string-based XDRBytea always produced an independent buffer via base64.DecodeString(). Current drivers (pgx, lib/pq) don't mutate the slice they receive, so this is not an active bug — but a defensive make + copy in Value() would match the Scan() fix and guard against future driver changes.

For comparison, the fixed Scan():

wallet-backend/internal/indexer/types/types.go

Lines 175 to 188 in b84442d

// Scan implements sql.Scanner - reads raw bytes from BYTEA column
func (x *XDRBytea) Scan(value any) error {
    if value == nil {
        *x = nil
        return nil
    }
    bytes, ok := value.([]byte)
    if !ok {
        return fmt.Errorf("expected []byte, got %T", value)
    }
    *x = make([]byte, len(bytes))
    copy(*x, bytes)
    return nil
}
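
A `Value()` that mirrors this `make` + `copy` approach could look like the following. It is a minimal sketch of the suggested defensive variant, not the code that was merged.

```go
package main

import (
	"database/sql/driver"
	"fmt"
)

// XDRBytea holds raw XDR bytes.
type XDRBytea []byte

// Value returns a fresh copy so the driver never shares the receiver's
// backing array.
func (x XDRBytea) Value() (driver.Value, error) {
	if len(x) == 0 {
		return nil, nil
	}
	out := make([]byte, len(x))
	copy(out, x)
	return out, nil
}

func main() {
	x := XDRBytea{1, 2, 3}
	v, err := x.Value()
	if err != nil {
		panic(err)
	}
	if b, ok := v.([]byte); ok {
		b[0] = 9 // mutate what the driver received
	}
	fmt.Println(x[0]) // prints 1: the original is untouched
}
```

The copy costs one allocation per write, which is the price of not depending on driver behavior.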

aristidesstaffieri · 2026-02-25T18:58:40Z

Code Review — Cursor values interpolated directly into SQL instead of parameterized

Severity: Low (code consistency)

BatchGetByToID and BatchGetByOperationID use fmt.Sprintf with %d to embed cursor int64 values directly into SQL, while BatchGetByAccountAddress in the same file uses parameterized $N placeholders. No SQL injection risk since these are strongly-typed int64, but the inconsistency is worth noting.

wallet-backend/internal/data/statechanges.go

Lines 521 to 530 in b84442d

    if sortOrder == DESC {
        queryBuilder.WriteString(fmt.Sprintf(`
            AND (to_id, operation_id, state_change_order) < (%d, %d, %d)
        `, cursor.ToID, cursor.OperationID, cursor.StateChangeOrder))
    } else {
        queryBuilder.WriteString(fmt.Sprintf(`
            AND (to_id, operation_id, state_change_order) > (%d, %d, %d)
        `, cursor.ToID, cursor.OperationID, cursor.StateChangeOrder))
    }
}

Compare with the parameterized approach in BatchGetByAccountAddress:

wallet-backend/internal/data/statechanges.go

Lines 69 to 75 in b84442d

    queryBuilder.WriteString(fmt.Sprintf(`
        AND (to_id, operation_id, state_change_order) < ($%d, $%d, $%d)
    `, argIndex, argIndex+1, argIndex+2))
    args = append(args, cursor.ToID, cursor.OperationID, cursor.StateChangeOrder)
    argIndex += 3
} else {
    queryBuilder.WriteString(fmt.Sprintf(`
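
If the three query builders were unified on placeholders, the cursor predicate could be produced by one shared function. The sketch below is an assumption about how that might be factored: `cursorClause` and the local `cursor` struct are hypothetical names, and only the column names are taken from the snippets above.

```go
package main

import "fmt"

// cursor mirrors the three fields used in the snippets above.
type cursor struct {
	ToID, OperationID, StateChangeOrder int64
}

// cursorClause is a hypothetical helper. It returns a keyset-pagination
// predicate that uses $N placeholders starting at argIndex, the values to
// append to the query args, and the next free placeholder index.
func cursorClause(c cursor, desc bool, argIndex int) (string, []any, int) {
	op := ">"
	if desc {
		op = "<"
	}
	clause := fmt.Sprintf(
		" AND (to_id, operation_id, state_change_order) %s ($%d, $%d, $%d)",
		op, argIndex, argIndex+1, argIndex+2,
	)
	args := []any{c.ToID, c.OperationID, c.StateChangeOrder}
	return clause, args, argIndex + 3
}

func main() {
	c := cursor{ToID: 10, OperationID: 20, StateChangeOrder: 3}
	clause, args, next := cursorClause(c, true, 2)
	fmt.Println(clause)
	fmt.Println(args, next)
}
```

Besides consistency, keeping values out of the SQL text gives every cursor position the same statement text, which may help statement caching in the driver.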

aditya1702 added 23 commits February 5, 2026 14:45

Change operation_xdr column from TEXT to BYTEA

1829d3e

Part of the plan to store XDR data as raw bytes for efficiency.

Add XDRBytea type for storing XDR data as BYTEA

c844520

Similar pattern to HashBytea but uses base64 encoding (standard for XDR) and supports variable-length data.

Update Operation struct to use XDRBytea type

b5ac8ad

Changed OperationXDR field from string to XDRBytea for automatic base64/BYTEA conversion.

Update ConvertOperation to use XDRBytea type

c23554e

Cast the base64 XDR string to XDRBytea for proper database storage.

Update operations.go for BYTEA operation_xdr storage

23ba111

- Change UNNEST cast from text[] to bytea[]
- Convert XDRBytea to raw bytes in BatchInsert
- Update BatchCopy to pass raw bytes instead of pgtype.Text

Add forceResolver directive to operationXdr field

a347cc7

Forces GraphQL to use a resolver for operationXdr to convert XDRBytea to base64 string for API consumers.

Run gql-generate and fix test_utils.go type

ddaeeee

- Update test_utils.go to use types.XDRBytea
- Regenerate GraphQL code with OperationXdr resolver

Implement OperationXdr GraphQL resolver

aee79f2

Returns the XDRBytea as a base64-encoded string for API consumers.

Update tests to use XDRBytea type

00eeaa3

- Use parameterized queries with XDRBytea for SQL inserts
- Update assertions to use .String() for comparison
- Fix type casting in test data creation

Update GraphQL resolver tests for XDRBytea type

ec34cef

- Use base64-encoded test XDR data in test_utils.go
- Add testOpXDR helper functions in test files
- Update all assertions to use .String() method

Change XDRBytea underlying type from string to []byte

525a59d

This simplifies the XDR storage flow by storing raw bytes directly instead of encoding to base64 and then decoding. The String() method now handles base64 encoding for external representation.

Use MarshalBinary directly in ConvertOperation

5e221f7

Skip the intermediate base64 encoding step by using MarshalBinary() instead of MarshalBase64(). The raw bytes are now stored directly in XDRBytea.

Simplify BatchInsert and BatchCopy in operations.go

ce12f8a

Remove unnecessary Value() calls since XDRBytea is now []byte. Access raw bytes directly via type conversion.

Update utils_test.go for XDRBytea []byte type

52bdb28

Decode expected base64 XDR string to raw bytes for comparison since XDRBytea now uses []byte underlying type.

Update operations_test.go for XDRBytea []byte type

5bb41f3

Use raw bytes directly instead of base64-encoded strings when creating test data for XDRBytea fields.

Update test_utils.go for XDRBytea []byte type

a5429a0

Use raw bytes directly for test XDR data instead of base64-encoding. The String() method will handle base64 encoding for assertions.

Update ingest_test.go for XDRBytea []byte type

41dfbf7

Use raw bytes directly instead of pre-encoded base64 string.

Fix operations_test.go for XDRBytea []byte type

bd27099

Use parameterized queries instead of raw SQL string literals for BYTEA operation_xdr column. Fix .String() assertion to compare base64 values via opXdr1.String().

Fix accounts_test.go for XDRBytea []byte type

762593e

Use parameterized queries instead of raw SQL string literals for BYTEA operation_xdr column in BatchGetByOperationIDs and BatchGetByStateChangeIDs tests.

Fix transactions_test.go for XDRBytea []byte type

5f26b91

Use parameterized queries instead of raw SQL string literals for BYTEA operation_xdr column in BatchGetByOperationIDs test.

Update generated.go

25ccec9

Fix XDRBytea.Scan buffer reuse bug

b84442d

Copy the byte slice from the database driver instead of referencing it directly. The pgx driver reuses its internal buffer across rows, so without copying, all scanned XDRBytea values end up pointing to the same (overwritten) buffer.
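
The effect described in this commit can be reproduced without a database. The sketch below is a toy simulation, not pgx itself: `readRows` imitates a driver that reuses one buffer for every row, and `scanAlias`/`scanCopy` are hypothetical stand-ins for the buggy and fixed `Scan` behavior.

```go
package main

import "fmt"

// scanAlias keeps a reference to the driver's buffer (the buggy pattern).
func scanAlias(buf []byte) []byte { return buf }

// scanCopy allocates a fresh slice for each value (the fixed pattern).
func scanCopy(buf []byte) []byte {
	out := make([]byte, len(buf))
	copy(out, buf)
	return out
}

// readRows simulates a driver that decodes every row into the same buffer.
func readRows(rows [][]byte, scan func([]byte) []byte) [][]byte {
	buf := make([]byte, 4)
	var out [][]byte
	for _, r := range rows {
		copy(buf, r) // the next row overwrites the previous contents
		out = append(out, scan(buf))
	}
	return out
}

func main() {
	rows := [][]byte{{1, 1, 1, 1}, {2, 2, 2, 2}}
	fmt.Println(readRows(rows, scanAlias)) // prints [[2 2 2 2] [2 2 2 2]]
	fmt.Println(readRows(rows, scanCopy))  // prints [[1 1 1 1] [2 2 2 2]]
}
```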

aditya1702 changed the title ~~Opxdr bytea 2~~ operations table: Store the operation_xdr as BYTEA Feb 5, 2026

aditya1702 requested a review from Copilot February 11, 2026 13:54

Copilot started reviewing on behalf of aditya1702 February 11, 2026 13:54

Copilot AI reviewed Feb 11, 2026

internal/db/migrations/2025-06-10.3-operations.sql (resolved)

internal/serve/graphql/schema/operation.graphqls (resolved)

aditya1702 marked this pull request as ready for review February 20, 2026 19:44

aditya1702 and others added 3 commits February 26, 2026 15:24

statechanges table: Change TokenID to BYTEA (#509)

baa684d

Merge branch 'hash-bytea' into opxdr-bytea-2

4488c29

resolve review comments

276dff6

aditya1702 merged commit 02ca3dd into hash-bytea Feb 26, 2026
7 checks passed

aditya1702 deleted the opxdr-bytea-2 branch February 26, 2026 22:31

aditya1702 merged 26 commits into hash-bytea from opxdr-bytea-2

3 participants
