-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Marktstammdatenregister (MaStR) #165
Conversation
Hi @lkstrp and other devs from One remark on your comment above: from open_mastr import Mastr
db = Mastr()
db.download()
# if you want csv files then also run
db.to_csv() |
Hey @FlorianK13, So far the idea was to basically just use the zenodo download you provide, which is quite time consuming to download.
Does this approach have any advantages over the zenodo download? E.g. runs faster, allows downloading only selected data? The API reference reads like it downloads the same zip in bulk, but allows data selection. Which means it downloads everything and just strips away unselected data? |
When using the python download method, you will get the most recent data (from the day before). On zenodo you will get the data from our last update, which is a few month old. However with zenodo your code is reproducible, as the python download changes every day as the dataset from BNetzA changes every day. To achieve reproducibilty with python, you would need to specify Both approaches take rather long, as you need to download the whole dataset. Afterwards you can specify which data you are interested to parse. So you are right with your last sentence 'Which means it downloads everything and just strips away unselected data.' |
Based on the discussion above, let's take the zenodo releases. If that's updated at least on an annual basis, that's fine. I am also not too worried about the large download size, as it is usually not a frequent action to update it and it's cached locally as well. @FlorianK13, it could be an option for upcoming releases to upload the individual CSV files unzipped into the zenodo repository, which would allow selective downloads (even though you lose the ZIP compression). This could be additional to the ZIP.
Yes, that's also what Global Energy Monitor does. Perhaps they will also integrate open-MaStR, then we wouldn't have to.
I got one on the same day I requested it today. |
@fneum I created OpenEnergyPlatform/open-MaStR#558 to discuss if we can upload single files at zenodo. |
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
Pretty good (if using 50 kW as threshold): Total solar capacity would be 71 GW (for August 2023), but the missing 30 GW are units < 50 kW (rooftop PV). Has much better coverage for wind than GEM. Adds 7 GW of biogas we had been missing. When selectively reading columns, the performance issues also disappear for the cleaning (the matching I did not test yet). |
Ok, @lkstrp! This could go in. I also checked the updated As a future TODO: Going from 1000 kW threshold to 50 kW threshold or even lower would have a large benefit for the total of the solar capacities in Germany closely matching those from https://openenergytracker.org/docs/germany/electricity. Maybe we can add a config that certain (parts of) data sets that are marked as "fully included sources" are not included in the matching process. |
@fneum Okay sounds good to me! Feel free to merge |
Closes #16
Change proposed in this Pull Request
Adds Marktstammdatenregister via open-MaStR.
There are a few issues:
.zip
, so we have to download everything. We could use the API instead, but then the user has to pass a token.powerplant.aggregate_units()
takes too long. Solar and wind are also currently not included.compare-with-entsoe-stats.py
, but below is a first plotDataset
Type of change
Checklist
doc/release_notes.rst
.pre-commit run --all
to lint/format/check my contributiondoc/
.