UpCloud data platform

This repo contains the code for building an open-source data platform on UpCloud.

The data platform currently includes the following components:

Trino: A distributed SQL engine for interactive queries across large and small datasets. It lets us build a data warehouse on UpCloud without depending on a managed service.
Lakekeeper: A production-ready metadata catalog for Iceberg tables, tightly integrated with Trino and OPA.
Open Policy Agent (OPA): A general-purpose policy engine used here to enforce fine-grained data access control.
Traefik: A reverse proxy and ingress controller that manages SSL termination and routes traffic to the different services of our data platform.
Zitadel: An identity and access management platform that handles user and application authentication, with support for integration into your company’s identity provider.

Prerequisites

Before starting the deployment, make sure you have:

A verified UpCloud account with an API-enabled subaccount for creating resources.
A hosted domain and DNS provider (e.g., Route53, GoDaddy) for assigning a subdomain to the data platform stack.
OpenTofu, kubectl, and the AWS CLI (for the S3-compatible object storage backend) installed.
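
As a quick sanity check before deploying, the tooling prerequisite above can be verified with a short script. This is a minimal sketch and not part of the repository: it assumes the tools are installed under their usual executable names (`tofu` for OpenTofu, `kubectl`, and `aws` for the AWS CLI), and the helper name `find_missing_tools` is hypothetical.

```python
import shutil

# Assumption: these are the usual executable names. Adjust them if your
# installation exposes the tools under different names.
REQUIRED_TOOLS = {
    "OpenTofu": "tofu",
    "kubectl": "kubectl",
    "AWS CLI": "aws",
}


def find_missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the display names of tools whose executable is not on PATH."""
    return [name for name, exe in tools.items() if which(exe) is None]


if __name__ == "__main__":
    missing = find_missing_tools()
    if missing:
        print("Missing prerequisites: " + ", ".join(missing))
    else:
        print("All prerequisite CLI tools were found on PATH.")
```

The script only checks that the executables can be found on PATH. It does not validate versions, UpCloud credentials, or DNS setup, which still need to be checked by hand.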

Getting started

If you want to deploy this stack on UpCloud, start by checking out the tutorial.

Contributing

We welcome contributions from the community! Whether it's bug reports, feature requests, or code contributions, your input is valuable to us. Please read our contributing guidelines for more details on how to contribute to this repository.

Support

If you have any questions or run into issues, feel free to open an issue in this GitHub repo or reach out to niels.claeys@dataminded.com or anyone else at Dataminded.

If you want guidance on how to extend this stack or make it production ready, you can reach out to DataMinded.

datamindedbe/demo-upcloud-data-platform