SODRAM collects DCAT and non-DCAT metadata for all datasets from open data portals and maps them based on the available geo information.

SODIC-research/SODRAM

🗺️ SODRAM - Specific Open Dataset Regional Analysis Metadata

This Node.js-based research tool analyzes the metadata quality of German municipal Open Data portals. Specifically, it evaluates how reliably datasets can be assigned to a region depending on whether a portal uses standardized metadata models such as DCAT-AP and GeoDCAT-AP (typically via CKAN, with a SPARQL endpoint as best practice) or non-standard formats (e.g. ArcGIS JSON).

🧪 Developed as part of the research poster submission for DCMI 2025 by Florian Hahn, TU Chemnitz (SODIC Research Group)


🔍 Research Question

Does the use of DCAT and CKAN in municipal German Open Data portals improve the assignability of datasets to regional categories compared to non-standardized alternatives?


📦 Features

  • 🔎 Automated metadata harvesting via:

    • ArcGIS REST API
    • CKAN API
    • CKAN SPARQL
    • DCAT RDF
    • DCAT RDF SPARQL
  • 🧠 Regional categorization logic using place/entity keywords

  • 📊 CSV export for per-portal analysis

  • 🎨 Colored console output via chalk

  • ✍️ Easily extendable keyword classification logic
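As an illustration of the CKAN API harvesting path listed above, here is a minimal sketch that pages through a portal's datasets with the CKAN Action API `package_search` endpoint and its `start` and `rows` parameters. It is not taken from the SODRAM code base: the function names, the injectable `fetchImpl` option, and the portal URL `https://ckan.example.org` are assumptions made for this example, and the default relies on the global `fetch` available in Node.js 18 and later.

```javascript
// Hypothetical helper: build a CKAN package_search URL for one result page.
function buildPackageSearchUrl(portalUrl, start, rows) {
  const base = portalUrl.endsWith('/') ? portalUrl : portalUrl + '/';
  const url = new URL('api/3/action/package_search', base);
  url.searchParams.set('start', String(start));
  url.searchParams.set('rows', String(rows));
  return url.toString();
}

// Hypothetical helper: page through all datasets of one CKAN portal.
// `fetchImpl` is injectable so the function can run without network access.
async function harvestCkan(portalUrl, { rows = 100, fetchImpl = fetch } = {}) {
  const datasets = [];
  let total = Infinity;
  while (datasets.length < total) {
    const pageUrl = buildPackageSearchUrl(portalUrl, datasets.length, rows);
    const res = await fetchImpl(pageUrl);
    if (!res.ok) throw new Error(`CKAN request failed with HTTP ${res.status}`);
    const body = await res.json();
    if (!body.success) throw new Error('CKAN reported success=false');
    total = body.result.count;
    // Stop if the portal returns an empty page before `count` is reached.
    if (body.result.results.length === 0) break;
    datasets.push(...body.result.results);
  }
  return datasets;
}
```

Calling `harvestCkan('https://ckan.example.org')` would resolve to an array of CKAN dataset objects. Each object typically carries `title` and `notes` (the description); where the spatial information is stored differs between portals and would need to be checked for each one.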


🏗️ Setup

git clone https://github.com/SODIC-research/SODRAM.git
cd SODRAM
npm install

🚀 Usage

Run the analysis:

npm run start

This will:

  • Fetch metadata from German city portals (e.g. Chemnitz, Dresden, Leipzig)
  • Apply classification logic
  • Export results to /export/*.csv
  • Export summary to /export/summary.json

📁 Output

Each analyzed portal produces a .csv file with the following structure:

Title,Description,Spatial,Assignable
"Population by District","...","Leipzig",true
"Verkehrsdaten 2022","...","",false
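The CSV layout above could be produced by a serializer along the following lines. This is a sketch under assumptions, not the exporter shipped with SODRAM: the function names and the record shape (`title`, `description`, `spatial`, `assignable`) are invented for the example. Text fields are always quoted and embedded double quotes are doubled, while the boolean column is written bare, matching the sample rows.

```javascript
// Hypothetical helper: serialize one value as a CSV field.
function csvField(value) {
  // Booleans are written unquoted, as in the sample output.
  if (typeof value === 'boolean') return String(value);
  const text = value == null ? '' : String(value);
  // Quote every text field and escape embedded quotes by doubling them.
  return `"${text.replace(/"/g, '""')}"`;
}

// Hypothetical helper: turn classified records into CSV text with a header.
function toCsv(records) {
  const header = 'Title,Description,Spatial,Assignable';
  const lines = records.map((r) =>
    [r.title, r.description, r.spatial, r.assignable].map(csvField).join(',')
  );
  return [header, ...lines].join('\n') + '\n';
}
```

Quoting every text field keeps commas and line breaks inside titles or descriptions from shifting the columns.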

🧠 Methodology

  • Every portal is evaluated against the same fixed list of regional keywords, which are matched in the spatial metadata field (dct:spatial) and in the title and description.
  • The code is designed to replicate the methodology described in the poster: "Regional Analysis of Topic-Specific Open Datasets Through Metadata: Evaluating the Analytical Usability of DCAT vs. Non-DCAT Metadata in a Municipal Portal"
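The keyword-based evaluation described above can be sketched as follows. The keyword list, the function names, and the record fields are assumptions made for illustration; the actual list and matching rules are defined in the repository and the poster. The sketch checks the spatial field first, falls back to the title and description, and uses simple case-insensitive substring matching.

```javascript
// Hypothetical keyword list; the real list ships with the repository.
const REGION_KEYWORDS = ['chemnitz', 'dresden', 'leipzig'];

// Return the first matching keyword, or null if no field mentions a region.
function findRegion(dataset, keywords = REGION_KEYWORDS) {
  // Prefer the explicit spatial field, then fall back to free text.
  const candidates = [dataset.spatial, dataset.title, dataset.description];
  for (const text of candidates) {
    if (!text) continue;
    const haystack = text.toLowerCase();
    const hit = keywords.find((kw) => haystack.includes(kw));
    if (hit) return hit;
  }
  return null;
}

// Annotate a dataset with the matched region and an `assignable` flag.
function classify(dataset, keywords) {
  const region = findRegion(dataset, keywords);
  return { ...dataset, region, assignable: region !== null };
}
```

Substring matching is deliberately simple: it also matches inflected forms such as "Leipziger", but it can produce false positives for short keywords, so a stricter tokenizer may be preferable for a real analysis.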

🔖 License

MIT License © 2025 Florian Hahn, TU Chemnitz, SODIC
