To create the dataset, you will need access to Google Cloud Storage and Google Earth Engine.
- Install all necessary requirements from the requirements.txt file via
pip install -r requirements.txt
- Set up the Google Cloud SDK and authenticate with your Google account.
- Set up the Google Earth Engine Python API and authenticate with your Google account.
- Enter your Google Service Account credentials and the path to your key file in
main.py
. - Set the yaml file in
main.py
that you want to use to download corresponding data and change the Google cloud storage bucket name in the respective yaml file to yours. - Run
python main.py
to let GEE compute the dataset and upload it into your Google cloud storage bucket.
The yaml files in config
contain only the pre-filtered fires that were used in creating the dataset.
Above, we use preprocessed yaml files that contain the filtered lists of fires, having removed fire events that do not actually contain any fires in the VIIRS active fire product, as well as a low number of fires that had various data format issues.
If you do want to recreate the inital, unfiltered list of fires, or vary some parameters in their creation,
you can paste the content of GEE_get_GlobFire_data.js
into the GEE interface. This will create a task to download the
fires for the year that you set in the script.
Afterwards, you should convert the csv file into a yaml file via
python3 fire_csv_to_yaml.py --csv_path YOUR_INPUT_CSV_PATH --yaml_path YOUR_OUTPUT_YAML_PATH --year YOUR_YEAR --bucket_name YOUR_GOOGLE_CLOUD_BUCKET
This yaml file can then be used in step 5 above. Using the provided yamls in config
would save you some time though.