Skip to content

Adding docs for EOAP_CWL on CDSE. #842

Merged
EmileSonneveld merged 7 commits intopublishfrom
eoap_cwl_docs
Feb 27, 2026
Merged

Adding docs for EOAP_CWL on CDSE. #842
EmileSonneveld merged 7 commits intopublishfrom
eoap_cwl_docs

Conversation

@EmileSonneveld
Copy link
Contributor

@github-actions
Copy link

github-actions bot commented Feb 19, 2026

PR Preview Action v1.8.1

QR code for preview link

🚀 View preview at
https://eu-cdse.github.io/documentation/pr-preview/pr-842/

Built to branch gh-pages at 2026-02-26 14:41 UTC.
Preview will be ready when the GitHub Pages deployment is complete.

@EmileSonneveld
Copy link
Contributor Author

---

With openEO, it is now possible to run CWL ([Common Workflow Language](https://www.commonwl.org/)) in the `run_udf` process. This is executed on Calrissian
On the CDSE backend, this has some extra features, which are described on this page.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"extra" compared to what?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I guess you mean extra in addition to what the generic openeo-geopyspark-driver docs list. But then I would swap both sentences.

Also make it more clear that the openeo-geopyspark-driver docs are generic docs: "more information" is pretty vague.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rephrased. Does it look good like this?

- /EOAP-CWL.html
---

With openEO, it is now possible to run CWL ([Common Workflow Language](https://www.commonwl.org/)) in the `run_udf` process. This is executed on Calrissian
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"This is executed on Calrissian" looks like incomplete sentence. I'm also not sure this implementation detail must be in inttroduction

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rephrased: "Workflows will be executed using calrissian on kubernetes."


## Docker images

Only whitelisted Docker images can be used in the cluster. Contact us through [support](https://helpcenter.dataspace.copernicus.eu/hc/en-gb/requests/new) (You might need to create an account)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe be more explicit about what kind of docker images you refer to here. I guess here it's about docker images referenced from CWL workflow documents?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

clearified


## Memory limits

As of February 2026, the memory limit is set to 15GB.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

that's the total memory limit for all concurrent containers? or is it per container?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That is not very clear from the documentation:

When running calrissian, you must provide a limit the the number of CPU cores (--max-cores) and RAM megabytes (--max-ram) to use concurrently

or

Maximum amount of RAM to use, e.g 1048576, 512Mi or 2G. Follows k8s resource conventions

or

  • Calrissian requires specifying an envelope of RAM and CPU resources to use in the cluster, so these are provided as --max-ram and --max-cores

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changed the limit to be on per step basis, instead of being shared over the whole process.
And clarified this wording.

--preserve-environment=AWS_ENDPOINT_URL_S3 \
--preserve-environment=AWS_ACCESS_KEY_ID \
--preserve-environment=AWS_SECRET_ACCESS_KEY \
example_workflow.cwl example_parameters.json
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It seems a lot of mentioned options are not essential or even risky to use unless you know what you are doing. I would just have the essentials here and suggest some more advanced options seperately

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed 2 unneeded parameters. Annoyingly, --tmpdir-prefix and --no-read-only are needed to be able to run for me.

@EmileSonneveld EmileSonneveld merged commit b74bfe4 into publish Feb 27, 2026
1 check passed
@EmileSonneveld EmileSonneveld deleted the eoap_cwl_docs branch February 27, 2026 10:36
@EmileSonneveld
Copy link
Contributor Author

Already merged. It is starting to get more urgent

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants