openwindscada

list of open wind turbine data sets

https://github.com/sltzgs/openwindscada

Science Score: 59.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 7 DOI reference(s) in README
  • Academic publication links
    Links to: wiley.com, nature.com, zenodo.org
  • Committers with academic emails
    1 of 2 committers (50.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (4.5%) to scientific vocabulary

Keywords

open-data renewable-energy scada wind-energy wind-power
Last synced: 6 months ago · JSON representation

Repository

list of open wind turbine data sets

Basic Info
  • Host: GitHub
  • Owner: sltzgs
  • License: gpl-3.0
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 13 MB
Statistics
  • Stars: 164
  • Watchers: 3
  • Forks: 30
  • Open Issues: 0
  • Releases: 0
Topics
open-data renewable-energy scada wind-energy wind-power
Created about 4 years ago · Last pushed 9 months ago
Metadata Files
Readme License

README.md

OpenWindSCADA

Repository of openly available wind turbine SCADA datasets with high-level descriptions, reusable data loaders for convenient CSV import, and a platform for documenting insights related to data quality and malfunctions.

For questions and feedback, plese reach out to: simon.leszek@tu-berlin.de

Table of open source wind turbine SCADA data sets:

| |ID| Dataset | .jpynb |Loc |Met-
mast |Trb
# |Var
# |Logs
✓/✗ |Labels
✓/✗ |ΔT |∑T |Ref | Remarks/License | |---|--|----------------------------------------------------------------------------------------------------------------|--------------------------------------------|- |- |- |- |- |- |- |- | - |--------------------------------------------| |:-1:|1 | EDP Open Data | here | ESP (on) |✓ | 5 |~80 | ✓ | ✓1| 10m | 2y | - | :warning: Currently unavailable; T09 removed from dataset | | |2 | Winji Gearbox Challenge | ✗ | ? |? | 5 |~20 | ✓ | ✓2| 10m | 3y | - | register & consent from WinJi | |:star:|3 | Kelmarsh Farm | here | UK (on) |✗ | 6 |~99 | ✓ | ✗ | 10m3 | 5y| - | farm info| |:star:|4 | Penmanshiel Farm | ✗ | UK (on) |✗ |14 |>150 | ✓ | ✗ | 10m3 | 5y| - | farm info | | |5 | Ørsted Anholt Offshore | ✗ | DEN (off) |(✓)4 | 111 | ? | ? | ? | 10m | 2y | - | application/NDA; farm info | | |6 | Ørsted Westermost Rough | ✗ | UK (off) |(✓)4 | 35 | ? | ? | ? | 10m | 2y | - | application/NDA; farm info | | |7a| "CAREtoCompare" Windfarm B | ✗ | GER (off) |? | 9 | 64| ? | ✓ | 10m | 2y | - | normalized for anonymization | | |7b| "CAREtoCompare" Windfarm C | ✗ | GER (off) |? | 22 | 238| ? | ✓ | 10m | 2y | - | normalized for anonymization | | |8 | Fuhrländer Farm | ✗ | ? (on) |✗ | 5 | 312| ✓ | ✗ | 5m | 3y | [2] | Eclipse Public License v2.0 | | |9a | DSforWind Windfarm 1a | ✗ | ? (on) |✓6| 4 | 7 | ✗ | ✗ | 10m | 1y | - | - | | |9b | DSforWind Windfarm1b | ✗ | ? (off) |✓6| 2 | 7 | ✗ | ✗ | 10m | 1y | - | - | | |9c | DSforWind Windfarm 2a | ✗ | ? (on) |✓6| 2 | 7 | ✗ | ✗ | 10m | 1y | - | - | | |9d | DSforWind Windfarm 2b | ✗ | ? (off) |✓6| 2 | 7 | ✗ | ✗ | 10m | 1y | - | - | | |10 | PCWG Data Sets | ✗ | ? (on) |✓ | 3 | 1 | ✗ | ✗ | 10m | 1y | - | - | | |11 | Norrekaer Windfarm | ✗ | DK (on) |✓ | 41 | 3 | ✗ | ✗ | 10m | 1.5y | [3] | farm info | | |11 | Delabole Windfarm | ✗ | UK (on) |✓ | 10 | 1 | ✗ | ✗ | 10m | 1y | [4] | farm info | | |12| Dundalk IoT | ✗ | IRE (on) |✗ | 1 | 20 | ✗ | ✓7| 10m | 14y | - | urban terrain | | |13| Kaggle Wind Turbine | ✗ | TUR (on) |✗ | 1 | 4 | ✗ | ✗ | 10m | 1y| - | - | | |14| Small São Paulo | ✗ | BRZ (on) |✗ | 1 | ~40 | ✗ | ✗ | 1m | 5y| - | small, urban turbine | | |15| Björkö Wind Turbine | ✗ | SWE (on) |✗ | 1 | 68 | ✗ | ✗ | 1s | 1y| - | small; turbine info| | |16| IET-OST Turbine | ✗ | SUI (on) |✗ | 1 | 15 | ✗ | ✗ | 1s | 1.5y| - | small; turbine info| | |17| Pedra do Sal Wind Farm | ✗ | BRZ (on) |✓ | 20 | ~40 | ✗ | ✗ | 10m | 1y | - | farm info| | |18| Beberibe Wind Farm | ✗ | BRZ (on) |✓ | 32 | ~40 | ✗ | ✗ | 10m | 1y | - | farm info| | |19| SMARTEOLE Wind Farm | ✗ | FRA (on) |✓ | 7 | ~40 | ✓ | ✗ | 1m | 4m | [5] | wake steering; farm info| | |20| Loegtved VestasV100 | ✗ | DK (on) |✗ | 1 | 3 | ✗ | ✗ | 10m | 4y | - | contact for more data| |:new::exclamation:|21 | Hill of Towie Wind Farm Open Dataset | ✗ | SCT (on) |✗ | 21 |655 | ✓ | ✗ | 10m | 8.7y | - | CC-BY-4.0; AeroUp/TuneUp upgrade info included, data loader here | |:-1:|98| Engie La Haute Borne | ✗ | FR (on) |✗ | 4 |~80 | ✗ | ✗ | 10m | 8y| - | offline; farm info | |:-1:|99| Levenmouth Turbine | ✗ | UK (near) |✓ | 1 | >500 | ✓ | ✗ | 10m/1s| 3y| - | not for free (~2000 £) |

  • ✗ = no / ✓ = yes
  • :-1: = no longer available
  • :star: = comprehensive dataset
  • :new::exclamation: = latest addition
  • bold = best in class
1 Manual annotations of major failures or component replacements
2 SCADA error log indicator
3 Statistics from wave buoy and ground-based LIDAR data.
4 Higher resolution on request
5 Environmental measures (except wind speed & TI) come from metmast
6 Ground-based LIDAR
7 Gearbox replacement in 2018-2019

Notebooks - Data Loaders and Overview Plots:

The jupyter notebooks in the 'notebooks' folder contain a data loader for SCADA signals, logs, annotations as well as community annotations (see next sections). Table 1 indicates whether the respective dataset has already been added. Furthermore, they produce an overview over each dataset such as shown in the following image:

image

Also, for each turbine, there is an 'Overview Cockpit' with a power curve plot, a wind rose and the data avilability over time. An example is shown here:

image

Lastly, operator annotations are listed, if they are part of the dataset. See e.g. for T01 of the edp data set:

image

To run the notebooks yourself, please add the respective .csv-files to the data folder.

Comunity Annotations:

We want to enable researchers to build upon the findings of others who were previously working with the dataset. For every dataset, we have set up a community-annotation folder, containing simple CSV's to collect data quality or malfunction related observations. They contain the following columns:

  • annot_id: unique annotation identifies (running ascending number)
  • turbine_id: which turbine of the respective dataset is affected?
  • signal: which signal exhibits the respective observation?
  • timestart / timestop: during which time is the observation present?
  • relatedlogmessage (optional): is there a SCADA log message that coincides with the observation?
  • remarks: describe your observation in a few words.

The respective notebooks automatically load, read and display the respective malfuncitons. See e.g. this example from T01 of the edp-dataset:

image

How to contribute:

We welcome contributions to expand the collection of open datasets in this repository as well as community annotations for the datasets. Feel free to create respective PRs :).

Other Resources:

References:

Many of the above listed datasets are described and analysed in [1].

[1]
Effenberger, Nina, and Nicole Ludwig. "A collection and categorization of open‐source wind and wind power datasets." Wind Energy 25.10 (2022): 1659-1683.

[2]
Marti-Puig, P., Blanco-M., A., Cusidó, J. et al. Wind turbine database for intelligent operation and maintenance strategies. Sci Data 11, 255 (2024).

[3]
Hansen, Kurt Schaldemose; Vasiljevic, Nikola; Sørensen, Steen Arne (2022). SCADA data from Norre_m2 wind farm. Technical University of Denmark. Dataset.

[4]
Hansen, Kurt Schaldemose (2021). Scada data from Delabole wind farm. Technical University of Denmark. Dataset.

[5]
Simley, E., Fleming, P., Girard, N., Alloin, L., Godefroy, E., and Duc, T.: Results from a wake-steering experiment at a commercial wind plant: investigating the wind speed dependence of wake-steering performance, Wind Energ. Sci., 6, 1427–1453, 2021.

Owner

  • Name: Simon Letzgus
  • Login: sltzgs
  • Kind: user
  • Location: Berlin
  • Company: Technische Universität Berlin

GitHub Events

Total
  • Issues event: 2
  • Watch event: 41
  • Push event: 11
  • Fork event: 8
Last Year
  • Issues event: 2
  • Watch event: 41
  • Push event: 11
  • Fork event: 8

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 65
  • Total Committers: 2
  • Avg Commits per committer: 32.5
  • Development Distribution Score (DDS): 0.015
Past Year
  • Commits: 44
  • Committers: 2
  • Avg Commits per committer: 22.0
  • Development Distribution Score (DDS): 0.023
Top Committers
Name Email Commits
Simon Letzgus s****s@t****e 64
Simon Leszek (TUB) s****k@S****l 1
Committer Domains (Top 20 + Academic)