Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Introduce/Include MaSTR data #16

Closed
FabianHofmann opened this issue Dec 17, 2019 · 2 comments · Fixed by #165
Closed

Introduce/Include MaSTR data #16

FabianHofmann opened this issue Dec 17, 2019 · 2 comments · Fixed by #165

Comments

@FabianHofmann
Copy link
Contributor

Hello dear friends,
I just finished a new data download of the MaStR database. The data quality is improving slowly.
Now I am facing some problems in the wind dataset. There are around 10000 double entries I would like to find. Did someone already tried MaStR and PPM?

Originally posted by @Ludee in #11 (comment)

@fneum
Copy link
Member

fneum commented Aug 29, 2022

New dataset available: https://zenodo.org/record/6807426#.Ywx33VpByV5

@fneum
Copy link
Member

fneum commented May 25, 2024

Latest version

https://zenodo.org/records/8225106

Citation

Eventually: https://joss.theoj.org/papers/dc0d33e7dc74f7233e15a7b6fe0c7a3e

Which CSV files?

  • biomass
  • combustion
  • nuclear
  • hydro
  • (wind)
  • (solar)

Has to be checked whether wind and solar dataset is too large to be handled efficiently.

Instructions by @FabianHofmann:

start with independent script:
create a routine to retrieve them all (leaving out the renewables for the start)
concat them
potentially convert units
rename and select columns to align with ppm format
clean up everyting

modularization and ppm integration
create a function mastr() which does all the above steps in one, add it to powerplantmatching/data.py,
add a config entry tp package_data/config.py for making ppm aware of the new data input, use a high reliability score
try out, add a test for pure retrieval

integrate into merging routine
add mastr to matching_sources
try out the matching with ppm.powerplants(update=True)
run the https://github.com/PyPSA/powerplantmatching/blob/master/analysis/compare-with-entsoe-stats.py
possibly adjust filter query and possibly add is to fully_included_sources in config.yaml

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants