
Conversation

@hero78119
Collaborator

@hero78119 hero78119 commented Oct 13, 2025

To handle more than one shard and satisfy the offline memory constraint
related to #1061, #1063, #699, #698, #697, #696, #700

change scope

  • separate the mem bus into two maps, one for reads and one for writes
  • add a new public input
  • integrate the local init chip
  • integrate the local final chip
  • shift opcode & mem records to shard ts
  • integrate the mem bus chip
  • refactor mock prover / mock proving
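A minimal sketch of the first item, splitting the mem bus into two separate maps for reads and writes. The `MemRecord` and `MemBus` names and fields are illustrative assumptions, not the actual types in this PR:

```rust
use std::collections::HashMap;

/// Hypothetical memory-bus record: address, shard-local timestamp, value.
#[derive(Clone, Copy, Debug, PartialEq)]
struct MemRecord {
    addr: u32,
    shard_ts: u64,
    value: u32,
}

/// Sketch of the split bus: reads and writes are tracked in two
/// separate maps keyed by address, instead of one combined map.
#[derive(Default)]
struct MemBus {
    reads: HashMap<u32, Vec<MemRecord>>,
    writes: HashMap<u32, Vec<MemRecord>>,
}

impl MemBus {
    fn record_read(&mut self, rec: MemRecord) {
        self.reads.entry(rec.addr).or_default().push(rec);
    }
    fn record_write(&mut self, rec: MemRecord) {
        self.writes.entry(rec.addr).or_default().push(rec);
    }
}
```

Keeping reads and writes in separate maps lets the init/final chips and the mem bus chip consume each side independently.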

benchmarks

bench on a single chunk
with fibonacci on CPU: 5900XT, 32 cores, 64 GB RAM

| Benchmark | Median Time (s) | Median Change (%) |
|------------------------------|-----------------|----------------------------------------|
| fibonacci_max_steps_1048576 | 2.8142 | +1.54% (Change within noise threshold) |
| fibonacci_max_steps_2097152 | 4.9337 | +1.05% (Change within noise threshold) |
| fibonacci_max_steps_4194304 | 9.1971 | -3.21% (Change within noise threshold) |

which shows no measurable performance impact

@hero78119 hero78119 marked this pull request as draft October 13, 2025 09:35
@hero78119 hero78119 changed the title [WIP] continuation of multi-shard + ram bus continuation of multi-shard + ram bus Oct 20, 2025
@hero78119
Collaborator Author

multi-shard and continuation support

This PR adds Shard (meta info collected BEFORE emulation) and ShardContext (meta info collected AFTER emulation). ShardContext is passed into the opcode & table circuits, and each memory access (read/write) is categorized via ShardContext to differentiate whether it falls within or outside the chunk.
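A hedged sketch of that categorization, assuming a shard covers a half-open cycle range; the `Shard`/`ShardContext` fields and the `AccessKind` enum here are illustrative, not the PR's actual definitions:

```rust
/// Meta info known before emulation (illustrative fields).
#[derive(Clone, Copy)]
struct Shard {
    /// Half-open cycle range [start_cycle, end_cycle) covered by this shard.
    start_cycle: u64,
    end_cycle: u64,
}

/// Meta info derived after emulation, passed into opcode & table circuits.
#[derive(Clone, Copy)]
struct ShardContext {
    shard: Shard,
}

#[derive(Debug, PartialEq)]
enum AccessKind {
    /// The previous access to this address also happened in this shard.
    Local,
    /// The previous access happened in an earlier shard (or at init).
    CrossShard,
}

impl ShardContext {
    /// Categorize a memory access by the cycle of its previous access.
    fn categorize(&self, prev_cycle: u64) -> AccessKind {
        if prev_cycle >= self.shard.start_cycle && prev_cycle < self.shard.end_cycle {
            AccessKind::Local
        } else {
            AccessKind::CrossShard
        }
    }
}
```

Local accesses can be balanced inside the shard, while cross-shard accesses are the ones that need the init/final chips and the mem bus.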

@hero78119
Collaborator Author

shard mem record tracking

refer to ceno_emul/src/chunked_vec.rs
To track whether a memory write record will be accessed again in the future, a data structure ChunkedVec was added to the emulator to record future accesses to each record. Since something is usually written on every cycle, the cycle pattern is quite dense, so we implemented a dynamically extended vector that keeps O(1) index access at the cost of slightly wasted space.
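A minimal sketch of such a dynamically extended, chunk-backed vector. It is inspired by the description above, not copied from ceno_emul/src/chunked_vec.rs; the names and the fixed chunk size are assumptions:

```rust
/// Chunk-backed vector with dense index access. Setting an index past
/// the current end allocates whole chunks of defaults, trading a little
/// space at the tail for O(1) reads and writes by index.
struct ChunkedVec<T> {
    chunks: Vec<Vec<T>>,
    chunk_size: usize,
    len: usize,
}

impl<T: Default + Clone> ChunkedVec<T> {
    fn new(chunk_size: usize) -> Self {
        Self { chunks: Vec::new(), chunk_size, len: 0 }
    }

    /// Set index `i`, extending with default values as needed.
    fn set(&mut self, i: usize, value: T) {
        while self.chunks.len() <= i / self.chunk_size {
            self.chunks.push(vec![T::default(); self.chunk_size]);
        }
        self.chunks[i / self.chunk_size][i % self.chunk_size] = value;
        self.len = self.len.max(i + 1);
    }

    fn get(&self, i: usize) -> Option<&T> {
        if i < self.len {
            Some(&self.chunks[i / self.chunk_size][i % self.chunk_size])
        } else {
            None
        }
    }
}
```

Because cycles are dense, almost every slot ends up populated, so the wasted space is bounded by one partially filled chunk.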

kunxian-xia and others added 2 commits November 3, 2025 12:46
## Summary

### Sub tasks
- [x] $N = 2^n$ septic curve points accumulation (in **one layer**)
using [Quark](https://eprint.iacr.org/2020/1275)
cmd: `cargo test --release --lib test_ecc_quark_prover --features "sanity-check" -- --nocapture`
- [x] `Global` chip @kunxian-xia
    - [x] constraints
    - [x] debugging `sum != p[0] + p[1]`
    - [x] enable poseidon2
    - [x] #1093
    - [x] enable non power-of-two #1081

---------

Co-authored-by: Ming <hero78119@gmail.com>
@kunxian-xia kunxian-xia mentioned this pull request Nov 3, 2025
2 tasks
hero78119 and others added 2 commits November 5, 2025 17:36
build on top of #1084

### changes
- [x] (cpu) migrate all tables to gkr-iop
- [x] fixing verifier logic and e2e
- [x] refactor code under gpu module
- [x] code cleanup & benchmark 

### benchmark


With CPU

Performance regressed by 3-4%.

| Benchmark | Median Time (s) | Median Change (%) |
|------------------------------|------------------|------------------------------------|
| fibonacci_max_steps_1048576 | 2.8744 | +4.06% (Performance has regressed) |
| fibonacci_max_steps_2097152 | 5.0004 | +3.31% (Performance has regressed) |
| fibonacci_max_steps_4194304 | 9.3081 | +3.08% (Performance has regressed) |
With GPU

On a 5070 Ti GPU (5900XT CPU), a slight regression on the smaller sizes is expected, while the largest workload actually improves. We think the reason is that we issue fewer CUDA kernel invocations during e2e.

| Benchmark | GPU (New) | Δ Change (%) |
|------------------------------|-----------|------------------------------------|
| fibonacci_max_steps_1048576 | 0.979 s | +3.63% (Performance has regressed) |
| fibonacci_max_steps_2097152 | 1.344 s | +3.92% (Performance has regressed) |
| fibonacci_max_steps_4194304 | 2.222 s | -14.87% (Performance has improved) |

---------

Co-authored-by: kunxian xia <xiakunxian130@gmail.com>
# Summary

A few cleanups to #1084 

- [x] remove `RamBusCircuit`
- [x] rename `global` circuit to `ShardRamCircuit`.

@kunxian-xia kunxian-xia left a comment


LGTM 🎉

@hero78119 hero78119 added this pull request to the merge queue Nov 6, 2025
Merged via the queue into master with commit 1e3c940 Nov 6, 2025
4 checks passed
@hero78119 hero78119 deleted the feat/multi_shard branch November 6, 2025 07:53