
flatland-association/flatland-benchmarks

Flatland Benchmarks

Flatland Benchmarks (FAB) is an open-source, web-based platform for running benchmarks to foster Open Research. It pursues the following goals:

| Goal | Description | How does FAB contribute? |
| --- | --- | --- |
| Open access and open data | research results and gained knowledge are accessible and can be used by anyone for applications or as the basis for further research | the web-based platform is open to researchers, developers, data scientists and domain experts, enabling them to access and interpret data, problem formulations and results |
| Open source software and open peer review | allow for the replication and reproducibility of research as well as for the reuse of data and software in other research projects, empowering anyone to contribute to research and accelerating its pace | participants disclose their code, and the leaderboard links to papers, code and discussion |
| Open community | enables collaborative formulation of problems and research questions, bringing in the expertise of various domains and fostering interdisciplinary approaches | platform and problem formulations are made open and the community can contribute |

It supports the following use cases:

  • competitions: submissions comprise all results of a benchmark; a leaderboard shows the best submissions per benchmark
  • long-running benchmarks: same as competitions
  • validation campaigns: test results can come from different experiments and are aggregated into overall benchmark results; the campaign features the best results for each benchmark
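
The aggregation and "best result per benchmark" idea from the use cases above can be sketched as follows. This is a minimal illustration and not FAB's implementation: the `TestResult` record, the plain sum used to combine test scores, and the "higher is better" ordering are all assumptions made for the example.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class TestResult:
    # Hypothetical record; field names are illustrative, not FAB's schema.
    submission: str
    benchmark: str
    test: str
    score: float


def benchmark_scores(results):
    """Aggregate test scores into one score per (benchmark, submission).

    Assumption: aggregation is a plain sum of the test scores.
    """
    totals = defaultdict(float)
    for r in results:
        totals[(r.benchmark, r.submission)] += r.score
    return dict(totals)


def best_per_benchmark(results):
    """Return {benchmark: (submission, score)} for the highest score.

    Assumption: a higher score is better.
    """
    best = {}
    for (benchmark, submission), score in benchmark_scores(results).items():
        if benchmark not in best or score > best[benchmark][1]:
            best[benchmark] = (submission, score)
    return best


results = [
    TestResult("sub-a", "bench-1", "test-1", 0.5),
    TestResult("sub-a", "bench-1", "test-2", 0.25),
    TestResult("sub-b", "bench-1", "test-1", 0.5),
    TestResult("sub-b", "bench-1", "test-2", 0.5),
    TestResult("sub-a", "bench-2", "test-1", 1.0),
]
leaders = best_per_benchmark(results)
print(leaders)
```

In a validation campaign the individual `TestResult` entries could come from different experiments; the aggregation step stays the same.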

Experiments are run in the following loops:

  • closed-loop: the evaluation is carried out without a human in the loop (always for competitions and benchmarks, optionally in a campaign setting)
  • interactive-loop: the evaluation is carried out by a human operator interacting with a UI; the results are uploaded automatically and/or manually
  • offline-loop: the evaluation is done completely offline; the results are uploaded manually at the end of the experiment
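
One way to read the relationship between use cases and loops listed above is sketched below. The `Loop` and `UseCase` enums and the `allowed_loops` helper are hypothetical names introduced for illustration; the rule that interactive and offline loops are only available to validation campaigns is an interpretation of "always for competition and benchmarks, optionally in campaign setting", not a documented constraint.

```python
from enum import Enum


class Loop(Enum):
    CLOSED = "closed-loop"
    INTERACTIVE = "interactive-loop"
    OFFLINE = "offline-loop"


class UseCase(Enum):
    COMPETITION = "competition"
    LONG_RUNNING_BENCHMARK = "long-running benchmark"
    VALIDATION_CAMPAIGN = "validation campaign"


def allowed_loops(use_case):
    """Return the loops a use case may run in (assumed reading of the list)."""
    if use_case is UseCase.VALIDATION_CAMPAIGN:
        # Campaigns may use closed-loop optionally, alongside the other loops.
        return set(Loop)
    # Competitions and long-running benchmarks are always closed-loop.
    return {Loop.CLOSED}


for use_case in UseCase:
    names = sorted(loop.value for loop in allowed_loops(use_case))
    print(f"{use_case.value}: {', '.join(names)}")
```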

FAB Scope and Context

SystemContext.drawio.png

For more details, see ARCHITECTURE.md.

Building Block View

Closed_and_Interactive_Loop_Eval.drawio.png

For more details, see ARCHITECTURE.md.

Data Model

DataModel.drawio.png

For more details, see ARCHITECTURE.md.

Nomenclature

| Technical naming | CAMPAIGN setup | COMPETITION setup | DEFAULT setup |
| --- | --- | --- | --- |
| suite | Validation Campaign | Competition/Challenge | (Benchmark) Suite |
| benchmark | Evaluation Objective | Round | Benchmark |
| test | KPI | Test | Test |
| scenario | Scenario | Scenario | Scenario |
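
The naming scheme above amounts to a lookup from a technical name and a setup to a display label. The sketch below encodes the table as a plain dictionary; `DISPLAY_NAMES` and `display_name` are hypothetical names chosen for this example and are not taken from the FAB code base.

```python
# Display labels per setup, copied from the nomenclature table.
DISPLAY_NAMES = {
    "CAMPAIGN": {
        "suite": "Validation Campaign",
        "benchmark": "Evaluation Objective",
        "test": "KPI",
        "scenario": "Scenario",
    },
    "COMPETITION": {
        "suite": "Competition/Challenge",
        "benchmark": "Round",
        "test": "Test",
        "scenario": "Scenario",
    },
    "DEFAULT": {
        "suite": "(Benchmark) Suite",
        "benchmark": "Benchmark",
        "test": "Test",
        "scenario": "Scenario",
    },
}


def display_name(technical_name, setup="DEFAULT"):
    """Translate a technical name into the label used by a given setup."""
    try:
        return DISPLAY_NAMES[setup][technical_name]
    except KeyError:
        raise ValueError(
            f"unknown setup or technical name: {setup!r}, {technical_name!r}"
        ) from None


print(display_name("benchmark", "CAMPAIGN"))
print(display_name("benchmark", "COMPETITION"))
print(display_name("suite"))
```

Keeping the technical names as keys means the same underlying entity can be rendered with whichever vocabulary suits the audience of a given setup.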

📦 TL;DR

To participate, or even to organize your own benchmarks or competitions, you don't need to install anything: you just need to sign in to an instance of the platform (e.g. this one). If you wish to configure your own instance of the platform, here are the instructions:

cd evaluation
docker compose -f docker-compose.yml --profile full up -d

📑 Documentation

💬 Communication
