-
Notifications
You must be signed in to change notification settings - Fork 330
Add Monitoring for DocSum on Xeon and Gaudi (Docker only) #2316
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
7 commits
Select commit
Hold shift + click to select a range
98d859e
Add monitoring for DocSum on Xeon
joshuayao b5a5679
Add monitoring for DocSum on Gaudi
joshuayao cd5c837
Enhance monitoring for DocSum on Gaudi
joshuayao 4812f66
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] f234f3c
Apply suggestions from code review
joshuayao 87cbafc
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 6e344d4
Fix typo in DocSum README
joshuayao File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
59 changes: 59 additions & 0 deletions
59
DocSum/docker_compose/intel/cpu/xeon/compose.monitoring.yaml
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,59 @@ | ||
| # Copyright (C) 2024 Intel Corporation | ||
| # SPDX-License-Identifier: Apache-2.0 | ||
|
|
||
| services: | ||
| prometheus: | ||
| image: prom/prometheus:v2.52.0 | ||
| container_name: opea_prometheus | ||
| user: root | ||
| volumes: | ||
| - ./prometheus.yaml:/etc/prometheus/prometheus.yaml | ||
| - ./prometheus_data:/prometheus | ||
| command: | ||
| - '--config.file=/etc/prometheus/prometheus.yaml' | ||
| ports: | ||
| - '9090:9090' | ||
| ipc: host | ||
| restart: unless-stopped | ||
|
|
||
| grafana: | ||
| image: grafana/grafana:11.0.0 | ||
| container_name: grafana | ||
| volumes: | ||
| - ./grafana_data:/var/lib/grafana | ||
| - ./grafana/dashboards:/var/lib/grafana/dashboards | ||
| - ./grafana/provisioning:/etc/grafana/provisioning | ||
| user: root | ||
| environment: | ||
| GF_SECURITY_ADMIN_PASSWORD: admin | ||
| GF_RENDERING_CALLBACK_URL: http://grafana:3000/ | ||
| GF_LOG_FILTERS: rendering:debug | ||
| no_proxy: ${no_proxy} | ||
| host_ip: ${host_ip} | ||
| depends_on: | ||
| - prometheus | ||
| ports: | ||
| - '3000:3000' | ||
| ipc: host | ||
| restart: unless-stopped | ||
|
|
||
| node-exporter: | ||
| image: prom/node-exporter | ||
| container_name: node-exporter | ||
| volumes: | ||
| - /proc:/host/proc:ro | ||
| - /sys:/host/sys:ro | ||
| - /:/rootfs:ro | ||
| command: | ||
| - '--path.procfs=/host/proc' | ||
| - '--path.sysfs=/host/sys' | ||
| - --collector.filesystem.ignored-mount-points | ||
joshuayao marked this conversation as resolved.
Show resolved
Hide resolved
|
||
| - "^/(sys|proc|dev|host|etc|rootfs/var/lib/docker/containers|rootfs/var/lib/docker/overlay2|rootfs/run/docker/netns|rootfs/var/lib/docker/aufs)($$|/)" | ||
| environment: | ||
| no_proxy: ${no_proxy} | ||
| ports: | ||
| - 9100:9100 | ||
| ipc: host | ||
| restart: always | ||
| deploy: | ||
| mode: global | ||
11 changes: 11 additions & 0 deletions
11
DocSum/docker_compose/intel/cpu/xeon/grafana/dashboards/download_opea_dashboard.sh
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,11 @@ | ||
| #!/bin/bash | ||
| # Copyright (C) 2025 Intel Corporation | ||
| # SPDX-License-Identifier: Apache-2.0 | ||
| if ls *.json 1> /dev/null 2>&1; then | ||
| rm *.json | ||
| fi | ||
|
|
||
| wget https://raw.githubusercontent.com/opea-project/GenAIEval/refs/heads/main/evals/benchmark/grafana/vllm_grafana.json | ||
| wget https://raw.githubusercontent.com/opea-project/GenAIEval/refs/heads/main/evals/benchmark/grafana/tgi_grafana.json | ||
| wget https://raw.githubusercontent.com/opea-project/GenAIEval/refs/heads/main/evals/benchmark/grafana/docsum_megaservice_grafana.json | ||
| wget https://raw.githubusercontent.com/opea-project/GenAIEval/refs/heads/main/evals/benchmark/grafana/node_grafana.json |
14 changes: 14 additions & 0 deletions
14
DocSum/docker_compose/intel/cpu/xeon/grafana/provisioning/dashboards/local.yaml
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,14 @@ | ||
| # Copyright (C) 2025 Intel Corporation | ||
| # SPDX-License-Identifier: Apache-2.0 | ||
|
|
||
| apiVersion: 1 | ||
|
|
||
| providers: | ||
| - name: 'default' | ||
| orgId: 1 | ||
| folder: '' | ||
| type: file | ||
| disableDeletion: false | ||
| updateIntervalSeconds: 10 #how often Grafana will scan for changed dashboards | ||
| options: | ||
| path: /var/lib/grafana/dashboards |
54 changes: 54 additions & 0 deletions
54
DocSum/docker_compose/intel/cpu/xeon/grafana/provisioning/datasources/datasource.yml
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,54 @@ | ||
| # Copyright (C) 2025 Intel Corporation | ||
| # SPDX-License-Identifier: Apache-2.0 | ||
|
|
||
| # config file version | ||
| apiVersion: 1 | ||
|
|
||
| # list of datasources that should be deleted from the database | ||
| deleteDatasources: | ||
| - name: Prometheus | ||
| orgId: 1 | ||
|
|
||
| # list of datasources to insert/update depending | ||
| # what's available in the database | ||
| datasources: | ||
| # <string, required> name of the datasource. Required | ||
| - name: Prometheus | ||
| # <string, required> datasource type. Required | ||
| type: prometheus | ||
| # <string, required> access mode. direct or proxy. Required | ||
| access: proxy | ||
| # <int> org id. will default to orgId 1 if not specified | ||
| orgId: 1 | ||
| # <string> url | ||
| url: http://$host_ip:9090 | ||
| # <string> database password, if used | ||
| password: | ||
| # <string> database user, if used | ||
| user: | ||
| # <string> database name, if used | ||
| database: | ||
| # <bool> enable/disable basic auth | ||
| basicAuth: false | ||
| # <string> basic auth username, if used | ||
| basicAuthUser: | ||
| # <string> basic auth password, if used | ||
| basicAuthPassword: | ||
| # <bool> enable/disable with credentials headers | ||
| withCredentials: | ||
| # <bool> mark as default datasource. Max one per org | ||
| isDefault: true | ||
| # <map> fields that will be converted to json and stored in json_data | ||
| jsonData: | ||
| httpMethod: GET | ||
| graphiteVersion: "1.1" | ||
| tlsAuth: false | ||
| tlsAuthWithCACert: false | ||
| # <string> json object of data that will be encrypted. | ||
| secureJsonData: | ||
| tlsCACert: "..." | ||
| tlsClientCert: "..." | ||
| tlsClientKey: "..." | ||
| version: 1 | ||
| # <bool> allow users to edit datasources from the UI. | ||
| editable: true |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,27 @@ | ||
| # Copyright (C) 2025 Intel Corporation | ||
| # SPDX-License-Identifier: Apache-2.0 | ||
| # [IP_ADDR]:{PORT_OUTSIDE_CONTAINER} -> {PORT_INSIDE_CONTAINER} / {PROTOCOL} | ||
| global: | ||
| scrape_interval: 5s | ||
| external_labels: | ||
| monitor: "my-monitor" | ||
| scrape_configs: | ||
| - job_name: "prometheus" | ||
| static_configs: | ||
| - targets: ["opea_prometheus:9090"] | ||
| - job_name: "vllm" | ||
| metrics_path: /metrics | ||
| static_configs: | ||
| - targets: ["docsum-xeon-vllm-service:80"] | ||
| - job_name: "tgi" | ||
| metrics_path: /metrics | ||
| static_configs: | ||
| - targets: ["docsum-xeon-tgi-server:80"] | ||
| - job_name: "docsum-backend-server" | ||
| metrics_path: /metrics | ||
| static_configs: | ||
| - targets: ["docsum-xeon-backend-server:8888"] | ||
| - job_name: "prometheus-node-exporter" | ||
| metrics_path: /metrics | ||
| static_configs: | ||
| - targets: ["node-exporter:9100"] |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.