# Leios design: Perf & Tracing #596
base: main
## Conversation
**ch1bo** left a comment:
`docs/leios-design/README.md` (outdated)
> As Leios development in all aspects is an ongoing and very much alive process, this chapter could only focus on mid-to-high-level items. Depending on the progress of various prototypes and implementations, this high-level plan will need to be broken down and sequenced into smaller tasks. We will do so keeping in mind the result of those tasks should be reusable both internally, and by the community.
How should this relate to the implementation plan chapter (#592)?
This section was an artifact of conflating the strategy description with design. Now that what this PR targets has been narrowed down, I've removed the section.

Generally, the issue this PR targets addresses "technical design and implementation plan" - so this contribution contains a bit of both. I'd be fine though with moving the contribution to issue #592.

As discussed, let's incorporate the great points of this PR into the respective sections next month - once the dust on the initial scaffold settles (the many in-flight PRs onto #542).
Force-pushed 69d8b0f to 05d3f14.
Force-pushed 05d3f14 to cabe84f.
**ch1bo** left a comment:
Thanks for writing up these points again. They feel a bit disconnected from the existing sections and I'm not convinced that we should make a dedicated chapter about this (with the E2E testing for example still as part of technical design).
Would it make sense to incorporate these paragraphs with existing sections? See individual comments for some ideas.
`docs/leios-design/README.md` (outdated)
> ## Testing during development
>
> Wheras simulations operate on models and are able to falsify hypotheses or assess probability of certain outcomes, evolving prototypes and implementations rely on evidence to that end. A dedicated environment suitable for both performance and conformance testing will be created; primarily as feedbeck for development, but also to provide transparency into the ongoing process.
Suggested change: `feedbeck` → `feedback`.
`docs/leios-design/README.md` (outdated)
> isufficient to maintain that pressure. We will create a declarative, abstract definition of what constitues a workload to be submitted. This enables to pre-generate part or all submissions for the benchmarks. Moreover, it guarantees identical outcomes regardless of how exactly the workload is generated. These workloads will retain their property of being customizable regarding particular aspects they stress in the system, such as the UTxO set, or Plutus script evaluation.
>
> As raw data from a benchmark / confirmance test can be huge, existing analysis tooling will be extended or built, such that extracting key insights from raw data can be automated as much as possible.
Suggested change: `confirmance` → `conformance`.
`docs/leios-design/README.md` (outdated)
> this tends to be versatile and fast to evaluate incrementally. This means, system invariants can be tested as part of CI, or even by consuming live output of a running testnet.
>
> Performance testing requires constant submission pressure over an extended period of time. With Leios being built for high throughput, creating submissions fully dynamically (as is the case with Praos benchmarks) is likely isufficient to maintain that pressure. We will create a declarative, abstract definition of what constitues a workload to be submitted. This enables to pre-generate part or all submissions for the benchmarks. Moreover, it guarantees
Suggested change: `isufficient` → `insufficient`, `constitues` → `constitutes`.
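To make the quoted idea of a declarative, pre-generatable workload concrete, here is one possible shape it could take; a minimal Haskell sketch in which every type and field name is a hypothetical placeholder, not the actual benchmarking tooling:

```haskell
module Workload where

import Data.ByteString (ByteString)

-- How the workload stresses the UTxO set.
data UTxOShape = UTxOShape
  { initialUTxOs :: Int  -- size of the pre-built UTxO set
  , fanout       :: Int  -- outputs created per transaction
  }

-- How the workload stresses Plutus script evaluation.
data ScriptLoad
  = NoScripts
  | PlutusLoad { execUnitsPerTx :: Int }

-- A declarative workload: everything needed to (pre-)generate all
-- submissions deterministically, independent of how they are produced.
data Workload = Workload
  { seed       :: Int        -- fixes generation, guaranteeing identical outcomes
  , txCount    :: Int        -- total number of submissions
  , txRate     :: Double     -- target submission pressure, in tx/s
  , utxoShape  :: UTxOShape
  , scriptLoad :: ScriptLoad
  }

-- Pre-generation step: a real generator would derive signed
-- transactions from 'seed'; the payloads here are placeholders.
pregenerate :: Workload -> [ByteString]
pregenerate w = replicate (txCount w) mempty
```

Pinning generation to a seed is what makes outcomes identical regardless of how, or when, the payloads are produced.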
> # Performance and quality assurance strategy
>
> ## Observability as a first-class citizen
This could fit well as a section of the Overview (introduction) chapter?
> a specific prototype or implementation. This enables deploying adversarial nodes for the purpose of network conformance testing, as well as performance testing at system integration level. This environment also guarantees the observed network behaviour or performance metrics have high confidence, and are reproducible.
>
> Conformance testing can be done on multiple layers. For authoritative end-to-end verification of protocol states, all evidence will need to be processed wrt. the formal specification, keeping track of all states and transitions. A second, complementary
How does this paragraph relate to the section on "Correctness in two dimensions"? Should we incorporate the additional approach of using LTL there?
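To illustrate the LTL approach discussed here, a minimal sketch of propositions over observed trace events; the connectives, the naive finite-trace semantics, and the example events are all illustrative assumptions, not the actual checker:

```haskell
module LTLCheck where

-- A minimal LTL fragment over finite traces of observed events.
data LTL a
  = Atom (a -> Bool)
  | Not (LTL a)
  | And (LTL a) (LTL a)
  | Next (LTL a)
  | Until (LTL a) (LTL a)

-- Derived connectives.
eventually :: LTL a -> LTL a
eventually = Until (Atom (const True))

always :: LTL a -> LTL a
always = Not . eventually . Not

implies :: LTL a -> LTL a -> LTL a
implies p q = Not (p `And` Not q)

-- Naive finite-trace semantics; a real checker would evaluate
-- incrementally over a live event stream instead of a full list.
holds :: LTL a -> [a] -> Bool
holds (Atom p) (x : _) = p x
holds (Atom _) [] = False
holds (Not f) xs = not (holds f xs)
holds (And f g) xs = holds f xs && holds g xs
holds (Next f) (_ : xs) = holds f xs
holds (Next _) [] = False
holds (Until _ _) [] = False
holds (Until f g) xs@(_ : rest) =
  holds g xs || (holds f xs && holds (Until f g) rest)

-- Hypothetical events and an invariant: a requested block body is
-- eventually received, e.g. holds requestedImpliesReceived trace.
data Ev = BodyRequested | BodyReceived | OtherEv
  deriving (Eq, Show)

requestedImpliesReceived :: LTL Ev
requestedImpliesReceived =
  always (Atom (== BodyRequested) `implies` eventually (Atom (== BodyReceived)))
```

An incremental checker would evaluate such formulas step by step over a live stream (e.g. by formula rewriting) rather than over a complete list, which is what makes CI and live-testnet usage cheap.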
`docs/leios-design/README.md` (outdated)
> ## Testing during development
>
> Wheras simulations operate on models and are able to falsify hypotheses or assess probability of certain outcomes, evolving prototypes and implementations rely on evidence to that end. A dedicated environment suitable for both performance and conformance testing will be created; primarily as feedbeck for development, but also to provide transparency into the ongoing process.
Should this and the following paragraph maybe become part of the implementation plan, for example as part of Prototyping and adversarial testing?
> approach we chose is conformance testing using Linear Temporal Logic (LTL). By formulating LTL propositions that need to hold for observed evidence, one can achieve broad conformance and regression testing without embedding it in protocol semantics; this tends to be versatile and fast to evaluate incrementally. This means, system invariants can be tested as part of CI, or even by consuming live output of a running testnet.
>
> Performance testing requires constant submission pressure over an extended period of time. With Leios being built for high throughput, creating submissions fully dynamically (as is the case with Praos benchmarks) is likely
This sounds very related to what I already mentioned in Public testnets and integration on the implementation plan:

> Performance testing measures achieved throughput against business requirements - sustained transaction rate, mempool-to-ledger latency, and behavior under bursty synthetic workloads.

Maybe we should elaborate there? Or maybe even "before", on controlled environments / the Prototyping and adversarial testing?
> This eventual step will stop support for prototypes and instead focus on full implementations of Leios. This will allow for a uniform way to operate, and artificially constrain, Leios by configuration while maintaining its performance properties. Furthermore, this phase will see custom benchmarks that can scale individual aspects of Leios independently (by config or protocol
The implementation plan is already outlining "phases" (which you hopefully agree with .. otherwise we should change them!) .. which of them would be about this kind of benchmarking? Is this "end-to-end benchmarking" corresponding to "end-to-end testing"?
> Note that the PoP checks probably are done at the certificate level, and that the above-described API should not be responsible for this. The current code on BLS12-381 already abstracts over both curves `G1`/`G2`, we should maintain this. The `BLST` package also exposes fast verification over many messages and signatures + public keys by doing a combined pairing check. This might be helpful, though it's currently unclear if we can use this speedup. It might be the case, since we have linear Leios, that this is never needed.
>
> ## Performance & Tracing (P&T)
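To make the "combined pairing check" mentioned above concrete (a sketch assuming the min-sig convention, i.e. signatures in `G1` and public keys in `G2`; the roles swap under the other convention): verifying $n$ triples $(pk_i, m_i, \sigma_i)$ one at a time checks $e(\sigma_i, g_2) = e(H(m_i), pk_i)$ and costs $2n$ pairings, while a batched check samples random blinding scalars $r_i$ and verifies

$$
e\Big(\sum_i r_i \sigma_i,\; g_2\Big) \;=\; \prod_i e\big(r_i H(m_i),\; pk_i\big)
$$

for a total of $n + 1$ pairings. The random $r_i$ are what stop a batch from accepting when individual signatures are invalid but their errors cancel out.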
The content here is still very good and we should consider keeping it! Maybe we will (not need to be in this PR) expand it into doing what is mentioned there? e.g. drafting that specification of relevant traces and their semantics in this section of the "Technical design" chapter.
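As one possible starting point for that specification, the relevant trace messages could be drafted as a typed catalogue whose semantics are documented at the definition site. A minimal Haskell sketch; every constructor and field name below is a hypothetical placeholder, not the node's actual API:

```haskell
module LeiosTraces where

import Data.Word (Word64)

-- A typed catalogue of Leios trace messages, each with its semantics
-- documented where it is defined.
data LeiosTrace
  = TraceEBGenerated { ebSlot :: Word64, ebSizeBytes :: Word64 }
    -- ^ This node produced an endorser block in the given slot.
  | TraceEBReceived { recvEBSlot :: Word64, delayMs :: Word64 }
    -- ^ An endorser block arrived from a peer; 'delayMs' feeds
    --   diffusion-latency metrics.
  | TraceVoteCast { votedEBSlot :: Word64 }
    -- ^ This node voted for an endorser block.
  | TraceCertFormed { certEBSlot :: Word64, voteCount :: Int }
    -- ^ Enough votes were aggregated into a certificate.
  deriving Show
```

Keeping the semantics next to the types would make the specification checkable: the same definitions could back both the tracing output and LTL propositions like those sketched earlier.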
This PR contributes to #542 and to #615.