Benchmark for Evaluating LLM Performance in Residential Building Energy Modeling (BEMEval framework).

BEMEval/bemeval-res

🏠 BEMEval-Res

Benchmark for Evaluating LLM Performance in Residential Building Energy Modeling

License: MIT Python Dataset BEMEval


📘 Overview

BEMEval-Res is the residential benchmark of the open-source BEMEval framework: a standardized dataset and evaluation suite for assessing how well large language models (LLMs) perform on building energy modeling (BEM) tasks.

The benchmark focuses on translating unstructured building descriptions into structured/machine-readable energy modeling schemas, enabling consistent and reproducible evaluation of AI models in the BEM domain.


📁 Folder Structure

bemeval-res/
├── data/
│   ├── datasets/          # Benchmark building cases
│   │   ├── l100/          # HERS L100 test case
│   │   │   ├── input/     # Building descriptions (text)
│   │   │   └── output/    # HPXML reference outputs
│   │   ├── nzertf/        # NIST Net-Zero Energy Residential Test Facility
│   │   │   ├── input/     # Building specifications (xlsx)
│   │   │   └── output/    # HPXML reference outputs
│   │   └── iunit/         # NREL iUnit (multifamily)
│   │       ├── input/     # Building specifications (xlsx)
│   │       └── output/    # EPC-Schema reference outputs (TOML)
│   └── metadata/          # Schema definitions and references
│       ├── epc-schema/    # EPC-Schema specification
│       │   ├── epc_schema.json
│       │   ├── epc_schema_descriptions.json
│       │   └── references/  # Supporting documentation and diagrams
│       └── hpxml/         # HPXML schema files
│           ├── HPXML.xsd
│           └── HPXML.txt
├── evaluation/            # Evaluation scripts and metrics
│   ├── __init__.py
│   └── evaluate.py        # KVOR metric implementation
├── docs/                  # Documentation
│   └── getting_started.md
└── pyproject.toml         # Project configuration
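
As a rough illustration of how the layout above can be traversed, the following sketch lists the input and reference output files for each benchmark case. It relies only on the folder names shown in the tree; the helper name `list_case_files` is hypothetical and is not part of the repository's `evaluation` package.

```python
from pathlib import Path

# Case folder names as they appear under data/datasets/ in the tree above.
CASES = ("l100", "nzertf", "iunit")


def list_case_files(repo_root, case):
    """Return sorted input and output file paths for one benchmark case.

    Hypothetical helper for illustration; missing folders yield empty lists.
    """
    case_dir = Path(repo_root) / "data" / "datasets" / case
    result = {}
    for part in ("input", "output"):
        folder = case_dir / part
        if folder.is_dir():
            result[part] = sorted(p for p in folder.iterdir() if p.is_file())
        else:
            result[part] = []
    return result


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    for case in CASES:
        files = list_case_files(".", case)
        print(f"{case}: {len(files['input'])} input file(s), "
              f"{len(files['output'])} reference output file(s)")
```

Returning empty lists for missing folders keeps the helper usable on a partial checkout, at the cost of not flagging a mistyped case name.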

🧩 Key Features

  • Multiple Schemas – Includes both industry and research schemas:
    • HPXML – consensus residential schema for home energy modeling
    • EPC-Schema – customized normative schema based on ISO/CEN 13790 energy performance methods
  • Representative Building Cases – curated building descriptions from:
    • HERS L100 test case
    • NIST NZERTF (single-family)
    • NREL iUnit (apartment/multifamily)
  • Evaluation Metrics
    • Key–Value Overlap Rate (KVOR)
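
This README does not define KVOR, so the sketch below shows only one plausible reading of a key-value overlap rate: the fraction of leaf key-value pairs in the reference output that appear with the same path and value in the model's output. The function names and the example field names are hypothetical, and the actual implementation in `evaluation/evaluate.py` may differ (for example in how it normalizes values or handles list ordering).

```python
def flatten(obj, prefix=""):
    """Flatten nested dicts/lists into a {dotted.path: leaf_value} mapping."""
    items = {}
    if isinstance(obj, dict):
        for key, value in obj.items():
            items.update(flatten(value, f"{prefix}{key}."))
    elif isinstance(obj, list):
        for index, value in enumerate(obj):
            items.update(flatten(value, f"{prefix}{index}."))
    else:
        # Leaf value: drop the trailing dot from the accumulated path.
        items[prefix.rstrip(".")] = obj
    return items


def key_value_overlap_rate(predicted, reference):
    """Share of reference key-value pairs reproduced exactly in `predicted`.

    Assumed definition for illustration only; returns 0.0 for an empty reference.
    """
    ref_pairs = flatten(reference)
    if not ref_pairs:
        return 0.0
    pred_pairs = flatten(predicted)
    matched = sum(
        1 for key, value in ref_pairs.items()
        if key in pred_pairs and pred_pairs[key] == value
    )
    return matched / len(ref_pairs)


# Hypothetical field names, not taken from HPXML or EPC-Schema.
reference = {"building": {"floor_area_m2": 120.0, "stories": 2},
             "hvac": {"heating": "heat pump"}}
predicted = {"building": {"floor_area_m2": 120.0, "stories": 1},
             "hvac": {"heating": "heat pump"}}

score = key_value_overlap_rate(predicted, reference)
print(f"KVOR (assumed definition): {score:.3f}")
```

In this toy case two of the three reference pairs match, so the score is 2/3. Exact equality is a deliberately strict choice; a tolerance for numeric fields or case-insensitive string matching would be reasonable variations.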
