Rename inconsistently named GEOGLAM files #213

Open
1 task
j08lue opened this issue Dec 12, 2024 · 2 comments

j08lue commented Dec 12, 2024

Three GEOGLAM files in veda-data-store have file names that do not conform to the pattern of the rest (CropMonitor_YYYYMM.tif).

Can we rename them, i.e. copy the files to conformant names and update their STAC entries? This may save some headache for users who discover items via S3 instead of STAC (although STAC is advised).

  • CropMonitor_2023_01_28.tif -> CropMonitor_202301.tif
  • CropMonitor_2023_02_28.tif -> CropMonitor_202302.tif
  • CropMonitor_2023_03_28.tif -> CropMonitor_202303.tif

$ aws s3 ls s3://veda-data-store/geoglam/
2024-03-13 17:34:43    6012504 CropMonitor_202001.tif
2024-03-13 17:34:47    6029419 CropMonitor_202002.tif
2024-03-13 17:34:49    6017018 CropMonitor_202003.tif
2024-03-13 17:34:50    6386366 CropMonitor_202004.tif
2024-03-13 17:34:52    6346561 CropMonitor_202005.tif
2024-03-13 17:34:53    6320955 CropMonitor_202006.tif
2024-03-13 17:34:55    4658995 CropMonitor_202007.tif
2024-03-13 17:34:56    4677472 CropMonitor_202008.tif
2024-03-13 17:34:56    4705830 CropMonitor_202009.tif
2024-03-13 17:34:57    4341809 CropMonitor_202010.tif
2024-03-13 17:34:58    5288105 CropMonitor_202011.tif
2024-03-13 17:34:59    5321207 CropMonitor_202101.tif
2024-03-13 17:35:00    5277308 CropMonitor_202102.tif
2024-03-13 17:35:01    5284661 CropMonitor_202103.tif
2024-03-13 17:35:02    5259615 CropMonitor_202104.tif
2024-03-13 17:35:03    5239591 CropMonitor_202105.tif
2024-03-13 17:35:04    4492889 CropMonitor_202106.tif
2024-03-13 17:35:05    4503863 CropMonitor_202107.tif
2024-03-13 17:35:06    3494043 CropMonitor_202108.tif
2024-03-13 17:35:07    5293999 CropMonitor_202109.tif
2024-03-13 17:35:08    5343006 CropMonitor_202110.tif
2024-03-13 17:35:09    5311489 CropMonitor_202111.tif
2024-03-13 17:35:10    5336762 CropMonitor_202201.tif
2024-03-13 17:35:11    5268139 CropMonitor_202202.tif
2024-03-13 17:35:12    5309246 CropMonitor_202203.tif
2024-03-13 17:35:13    5281256 CropMonitor_202204.tif
2024-03-13 17:35:14    4886345 CropMonitor_202205.tif
2024-03-13 17:35:15    4907825 CropMonitor_202206.tif
2024-03-13 17:35:16    4946370 CropMonitor_202207.tif
2024-03-13 17:35:17    3296784 CropMonitor_202208.tif
2024-03-13 17:35:18    4996502 CropMonitor_202209.tif
2024-03-13 17:35:19    5006583 CropMonitor_202210.tif
2024-03-13 17:35:20    5036055 CropMonitor_202211.tif
2024-03-13 17:35:21    4968798 CropMonitor_202304.tif
2024-03-13 17:35:22    4693854 CropMonitor_202305.tif
2024-03-13 17:35:22    5013383 CropMonitor_202306.tif
2024-03-13 17:35:23    5411525 CropMonitor_202307.tif
2024-03-13 17:35:24    5362108 CropMonitor_202308.tif
2024-03-13 17:35:25    4678356 CropMonitor_202309.tif
2024-03-13 17:35:26    4723276 CropMonitor_202310.tif
2024-03-13 17:35:27    4800011 CropMonitor_202311.tif
2024-03-13 17:35:28    4982767 CropMonitor_2023_01_28.tif
2024-03-13 17:35:29    4973021 CropMonitor_2023_02_28.tif
2024-03-13 17:35:30    5016336 CropMonitor_2023_03_28.tif
2024-03-13 17:35:31    5003250 CropMonitor_202401.tif
2024-03-15 20:21:21    4753596 CropMonitor_202402.tif
2024-05-21 20:53:07    4795250 CropMonitor_202403.tif
2024-08-09 20:08:26    4792191 CropMonitor_202404.tif
2024-08-09 20:08:26    4819082 CropMonitor_202405.tif
2024-08-09 20:08:26    4807514 CropMonitor_202406.tif
2024-08-09 20:08:27    4822375 CropMonitor_202407.tif
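
For illustration, the rename mapping listed above could be derived from the object names rather than typed by hand. This is only a minimal sketch: the function names `conformant_name` and `plan_renames` are hypothetical, and the two regular expressions simply encode the `CropMonitor_YYYYMM.tif` convention and the `YYYY_MM_DD` shape of the three outliers as they appear in the listing.

```python
import re
from typing import Dict, Iterable, Optional

# Convention described in this issue: CropMonitor_YYYYMM.tif
CONFORMANT = re.compile(r"^CropMonitor_\d{6}\.tif$")
# Shape of the three outliers: CropMonitor_YYYY_MM_DD.tif
DATED = re.compile(r"^CropMonitor_(\d{4})_(\d{2})_\d{2}\.tif$")


def conformant_name(name: str) -> Optional[str]:
    """Return the CropMonitor_YYYYMM.tif form of `name`, or None if no rule applies."""
    if CONFORMANT.match(name):
        return name
    match = DATED.match(name)
    if match:
        year, month = match.groups()
        return f"CropMonitor_{year}{month}.tif"
    return None  # unknown pattern: leave the decision to a human


def plan_renames(names: Iterable[str]) -> Dict[str, str]:
    """Map each non-conformant name to its proposed replacement."""
    plan = {}
    for name in names:
        target = conformant_name(name)
        if target is not None and target != name:
            plan[name] = target
    return plan


if __name__ == "__main__":
    listing = [
        "CropMonitor_202304.tif",
        "CropMonitor_2023_01_28.tif",
        "CropMonitor_2023_02_28.tif",
        "CropMonitor_2023_03_28.tif",
    ]
    for old, new in plan_renames(listing).items():
        print(f"{old} -> {new}")
```

Names that match neither pattern return `None` instead of being guessed at, so anything unexpected in the bucket is left for manual review.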

Acceptance criteria

  • All GEOGLAM files are consistently named

j08lue commented Dec 12, 2024

Btw, the staging bucket has a few more:

$ aws s3 ls s3://veda-data-store-staging/geoglam/
2023-06-14 17:04:20       6148 .DS_Store
2023-06-14 17:04:20    6012504 CropMonitor_202001.tif
2023-06-14 17:04:20    6029419 CropMonitor_202002.tif
2023-06-14 17:04:20    6017018 CropMonitor_202003.tif
2023-06-14 17:04:20    6386366 CropMonitor_202004.tif
2023-06-14 17:04:20    6346561 CropMonitor_202005.tif
2023-06-14 17:04:20    6320955 CropMonitor_202006.tif
2023-06-14 17:04:20    4658995 CropMonitor_202007.tif
2023-06-14 17:04:20    4677472 CropMonitor_202008.tif
2023-06-14 17:04:20    4705830 CropMonitor_202009.tif
2023-06-14 17:04:20    4341809 CropMonitor_202010.tif
2023-06-14 17:04:24    5288105 CropMonitor_202011.tif
2023-06-14 17:04:24    5321207 CropMonitor_202101.tif
2023-06-14 17:04:24    5277308 CropMonitor_202102.tif
2023-06-14 17:04:24    5284661 CropMonitor_202103.tif
2023-06-14 17:04:24    5259615 CropMonitor_202104.tif
2023-06-14 17:04:24    5239591 CropMonitor_202105.tif
2023-06-14 17:04:24    4492889 CropMonitor_202106.tif
2023-06-14 17:04:24    4503863 CropMonitor_202107.tif
2023-06-14 17:04:24    3494043 CropMonitor_202108.tif
2023-06-14 17:04:24    5293999 CropMonitor_202109.tif
2023-06-14 17:04:26    5343006 CropMonitor_202110.tif
2023-06-14 17:04:26    5311489 CropMonitor_202111.tif
2023-06-14 17:04:26    5336762 CropMonitor_202201.tif
2023-06-14 17:04:26    5268139 CropMonitor_202202.tif
2023-06-14 17:04:26    5309246 CropMonitor_202203.tif
2023-06-14 17:04:26    5281256 CropMonitor_202204.tif
2023-06-14 17:04:26    4886345 CropMonitor_202205.tif
2023-06-14 17:04:26    4907825 CropMonitor_202206.tif
2023-06-14 17:04:26    4946370 CropMonitor_202207.tif
2023-06-14 17:04:26    3296784 CropMonitor_202208.tif
2023-06-14 17:04:27    4996502 CropMonitor_202209.tif
2023-06-14 17:04:27    5006583 CropMonitor_202210.tif
2023-06-14 17:04:27    5036055 CropMonitor_202211.tif
2023-06-14 17:04:27    4968798 CropMonitor_202304.tif
2023-07-19 14:39:19    4693854 CropMonitor_202305.tif
2023-08-24 13:40:42    5013383 CropMonitor_202306.tif
2023-11-03 13:05:31    5411525 CropMonitor_202307.tif
2023-11-03 13:06:38    5362108 CropMonitor_202308.tif
2023-11-07 20:32:14    4678356 CropMonitor_202309.tif
2023-11-07 20:31:23    4723276 CropMonitor_202310.tif
2024-02-09 03:17:38    4800011 CropMonitor_202311.tif
2023-06-14 17:04:27    4982767 CropMonitor_2023_01_28.tif
2023-06-14 17:04:27    4973021 CropMonitor_2023_02_28.tif
2023-06-14 17:04:27    5016336 CropMonitor_2023_03_28.tif
2024-02-09 03:21:04    5003250 CropMonitor_202401.tif
2024-03-14 18:56:23    4753596 CropMonitor_202402.tif
2024-04-25 18:43:51    4795250 CropMonitor_202403.tif
2024-06-03 23:27:29    4792191 CropMonitor_202404.tif
2024-06-20 14:02:41    4819082 CropMonitor_202405.tif
2024-08-02 13:31:59    4807514 CropMonitor_202406.tif
2024-08-02 13:32:52    4822375 CropMonitor_202407.tif
2024-12-12 21:33:21    4791300 CropMonitor_202408.tif
2024-12-12 21:33:26    4812414 CropMonitor_202409.tif
2024-12-12 21:33:31    4837923 CropMonitor_202410.tif
2024-12-12 21:33:35    4823402 CropMonitor_202411.tif
2024-11-06 21:08:00    4791300 Global_Synthesis_2024_08_28.tif
2024-11-06 21:04:43    3678153 Global_Synthesis_2024_08_28.zip
2024-12-10 20:47:51    4812414 Global_Synthesis_2024_09_28.tif
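
To see exactly which objects exist only in staging, the two `aws s3 ls` outputs could be parsed and compared. A rough sketch, assuming each object line has the `date time size name` layout shown in the listings; the helpers `parse_ls` and `only_in_first` are hypothetical names, not part of any existing tool.

```python
from typing import Dict, List


def parse_ls(text: str) -> Dict[str, int]:
    """Parse `aws s3 ls` output into {object name: size in bytes}."""
    objects = {}
    for line in text.splitlines():
        # Split into at most 4 fields so names containing spaces stay intact.
        parts = line.split(None, 3)
        # Skip blank lines, the command line itself, and anything else
        # that does not look like "date time size name".
        if len(parts) < 4 or not parts[2].isdigit():
            continue
        objects[parts[3]] = int(parts[2])
    return objects


def only_in_first(first: Dict[str, int], second: Dict[str, int]) -> List[str]:
    """Return the sorted names present in `first` but missing from `second`."""
    return sorted(set(first) - set(second))


if __name__ == "__main__":
    # Small excerpts copied from the two listings in this thread.
    production = """\
2024-08-09 20:08:27    4822375 CropMonitor_202407.tif
"""
    staging = """\
2023-06-14 17:04:20       6148 .DS_Store
2024-08-02 13:32:52    4822375 CropMonitor_202407.tif
2024-12-12 21:33:21    4791300 CropMonitor_202408.tif
"""
    for name in only_in_first(parse_ls(staging), parse_ls(production)):
        print(name)
```

Sizes are kept in the parsed result so that files with different names but identical byte counts can be spotted, although equal size alone does not prove two files are identical.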

@anayeaye

I deleted the three Global_Synthesis files from the staging bucket, but I have not yet addressed the corresponding metadata for any that might have been ingested into the staging catalog.

The three YYYY_MM_DD files are already in the production data store and catalog, so the cleanest way to resolve this is probably to:

  1. rename the files
  2. delete production geoglam
  3. re-ingest the correct filenames
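
Step 1 could be prepared as a reviewable list of AWS CLI commands before anything is touched. The sketch below is only an assumption about how one might do it: `copy_commands` is a hypothetical helper, the bucket and prefix defaults are taken from the listing in this issue, and it emits `aws s3 cp` with `--dryrun` so nothing is copied until the flag is dropped. Steps 2 and 3 depend on the catalog ingestion tooling, which this thread does not describe, so they are not covered here.

```python
import shlex
from typing import Dict, List


def copy_commands(
    plan: Dict[str, str],
    bucket: str = "veda-data-store",  # from the listing in this issue
    prefix: str = "geoglam",
    dryrun: bool = True,
) -> List[str]:
    """Build one `aws s3 cp` command per (old name -> new name) pair."""
    commands = []
    for old, new in sorted(plan.items()):
        parts = [
            "aws", "s3", "cp",
            f"s3://{bucket}/{prefix}/{old}",
            f"s3://{bucket}/{prefix}/{new}",
        ]
        if dryrun:
            # --dryrun makes the CLI print what it would do without doing it.
            parts.append("--dryrun")
        commands.append(shlex.join(parts))
    return commands


if __name__ == "__main__":
    plan = {
        "CropMonitor_2023_01_28.tif": "CropMonitor_202301.tif",
        "CropMonitor_2023_02_28.tif": "CropMonitor_202302.tif",
        "CropMonitor_2023_03_28.tif": "CropMonitor_202303.tif",
    }
    for command in copy_commands(plan):
        print(command)
```

Copying rather than moving keeps the old objects in place until the re-ingested catalog entries point at the new names; `aws s3 mv` would be the alternative if the old names should disappear immediately.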
