Added checks on output vector in mxv by GiovaGa · Pull Request #401 · Algebraic-Programming/ALP

GiovaGa · 2025-11-12T08:30:59Z

Resolves #400

anyzelman · 2025-11-24T22:30:05Z

The MR fixes it for reference and reference_omp, but I thought you mentioned all other backends also run into this issue @GiovaGa ?

GiovaGa · 2025-11-25T08:34:39Z

You are definitely right. I have now fixed also for the nonblocking backend.
The other backends call the same internal function of the reference backend, so this should do it for them

anyzelman · 2025-12-23T03:17:00Z

Running CI, running all unit & smoke tests with LPF. Looks ready to merge if these are both OK. Concept release notes:

Prior to this MR, calling mxv or vxm with a dense descriptor while the input/output vector was not dense, would not yield ILLEGAL for all supported backends. This MR fixes that, and also ensures that ILLEGAL is returned when one of the vector masks is non-empty (size larger than 0) and sparse. Furthermore, this MR adds a unit tests to guard against regressions.

Thanks to @GiovaGa for spotting the bug and providing the fixes for the reference, reference_omp, and nonblocking backends!

GiovaGa

Seems good to me.
My only observation is that it may make sense to add an assert in exec_tests to guarantee that indeed at least one vector is not dense, as going quickly through the test, this doesn't seem obvious (and the function exec_tests does not specify such precondition)

GiovaGa · 2026-01-06T15:47:48Z

Right now, the test fails when running with 16 processes (and only in that case)

GiovaGa · 2026-01-28T10:07:08Z

It seems that the test fails only if n is small enough. Using n = 1000 the test runs correctly on my machine. This is probably a bug with the bsp1d backend

…kends

anyzelman · 2026-02-05T11:42:47Z

Example of a failed run:

$ /scratch/workspace/alp-build/install/bin/grbrun -b bsp1d -np 16 tests/unit/illegal_spmv_debug_bsp1d 148
This is functional test tests/unit/illegal_spmv_debug_bsp1d
Info: grb::init (BSP1D) called using 16 user processes.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: grb::init (reference) called.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Info: process mask is all-one, we therefore assume a single user process is present on this node and thus shall use aligned mode for memory allocations that are potentially touched by multiple threads.
Error: unexpected error code in grb::setElement (BSP1D): Uninterpretable error code detected, please notify the developers.. Please submit a bug report.
Error: unexpected error code in grb::setElement (BSP1D): Uninterpretable error code detected, please notify the developers.. Please submit a bug report.
Test batch 5-8: initialisation FAILED
Test batch 5-8: initialisation FAILED
Info: grb::finalize (bsp1d) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Launching test FAILED
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.
Info: grb::finalize (reference) called.

Any larger problem size no longer fails.

anyzelman · 2026-02-05T11:51:58Z

The following similarly fails (P=11, n=102 -- all larger n are OK):

/scratch/workspace/alp-build/install/bin/grbrun -b bsp1d -np 11 tests/unit/illegal_spmv_debug_bsp1d 102

anyzelman · 2026-02-05T13:45:57Z

More minimal one that fails:

void grb_program( const size_t &n, grb::RC &rc ) {
        grb::Vector< bool > out( n ) , out2( 2 * n );
        if( grb::setElement( out, true, 0 ) != grb::SUCCESS ) {
                std::cout << "FAILED\n";
                rc = grb::FAILED;
        } else {
                std::cout << "OK\n";
                rc = grb::SUCCESS;
        }
        std::cout << std::endl;
}

Failure here occurs "already" for P=7 and n=64 (this is the smallest P, n that causes failure). What is particularly interesting is that without declaring out2 is needed while the 2x size is also "mandatory"-- otherwise there would be no failure. This points to a shared buffer corruption or a more general memory corruption problem.

anyzelman added the bug Something isn't working label Nov 24, 2025

anyzelman added this to the v0.8 milestone Nov 24, 2025

anyzelman force-pushed the 400-dense_output_vector_not_checked_mxv branch from 778c24f to e4c0a49 Compare December 23, 2025 02:34

anyzelman previously approved these changes Dec 23, 2025

View reviewed changes

anyzelman dismissed their stale review via b027555 December 23, 2025 06:35

GiovaGa commented Jan 6, 2026

View reviewed changes

GiovaGa and others added 6 commits January 28, 2026 11:27

Added checks on output vector in mxv

69dcbf4

Fixed also for nonblocking backend

500b1e4

Similar missing checks for the dense case in the BSP1D and hybrid bac…

cae2666

…kends

Add unit test that checks illegal is returned

85fec4e

Missing newlines

034cb97

Added assertion to check that at least one vector is not dense

2dc77dd

GiovaGa force-pushed the 400-dense_output_vector_not_checked_mxv branch from 49cf9db to 2dc77dd Compare January 28, 2026 10:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added checks on output vector in mxv#401

Added checks on output vector in mxv#401
GiovaGa wants to merge 6 commits intodevelopfrom
400-dense_output_vector_not_checked_mxv

GiovaGa commented Nov 12, 2025 •

edited

Loading

Uh oh!

anyzelman commented Nov 24, 2025

Uh oh!

GiovaGa commented Nov 25, 2025 •

edited

Loading

Uh oh!

anyzelman commented Dec 23, 2025

Uh oh!

GiovaGa left a comment

Uh oh!

GiovaGa commented Jan 6, 2026

Uh oh!

GiovaGa commented Jan 28, 2026

Uh oh!

anyzelman commented Feb 5, 2026

Uh oh!

anyzelman commented Feb 5, 2026 •

edited

Loading

Uh oh!

anyzelman commented Feb 5, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

GiovaGa commented Nov 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

anyzelman commented Nov 24, 2025

Uh oh!

GiovaGa commented Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

anyzelman commented Dec 23, 2025

Uh oh!

GiovaGa left a comment

Choose a reason for hiding this comment

Uh oh!

GiovaGa commented Jan 6, 2026

Uh oh!

GiovaGa commented Jan 28, 2026

Uh oh!

anyzelman commented Feb 5, 2026

Uh oh!

anyzelman commented Feb 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

anyzelman commented Feb 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

GiovaGa commented Nov 12, 2025 •

edited

Loading

GiovaGa commented Nov 25, 2025 •

edited

Loading

anyzelman commented Feb 5, 2026 •

edited

Loading

anyzelman commented Feb 5, 2026 •

edited

Loading