📖 Update data request documentation #1038
base: master
Conversation
Added details on how to request and load data. TODO: fill in the extra link, confirm data loading instructions.
Once the info in this new readme is OK'd, this branch should be merged before PR 41 -- I want to point to some of the instructions here in this branch, and will be unable to link until the changes are merged.
1. Data formats for the JSON objects are at `emission/core/wrapper` (e.g. `emission/core/wrapper/location.py` and `emission/core/wrapper/cleanedtrip.py`)
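For illustration, here is a minimal sketch of checking a raw entry against those wrapper definitions. The document shape below (`metadata.key`, plus `ts`/`latitude`/`longitude` under `data`) is an assumption inferred from the wrapper file names above; `emission/core/wrapper/location.py` remains the authoritative schema.

```python
import json

# Hypothetical single entry, shaped like the documents described by
# emission/core/wrapper/location.py. The field names here are assumptions;
# consult the wrapper source for the authoritative list.
raw_entry = json.loads("""
{
  "metadata": {"key": "background/location", "write_ts": 1609459200.0},
  "data": {"ts": 1609459200.0, "latitude": 39.74, "longitude": -104.99}
}
""")

# Check that the fields we expect from the wrapper definition are present.
expected_data_fields = {"ts", "latitude", "longitude"}
missing = expected_data_fields - raw_entry["data"].keys()
print(f"key = {raw_entry['metadata']['key']}, missing fields: {missing or 'none'}")
```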
## Data analysis ##
## Data Analysis - Server ##
This is actually the deprecated method now. We sometimes internally use the user-specific dumps to reproduce errors, but external users either get the mongodump or download CSV files from their admin dashboard.
Made a change to specify this is for internal testing only! Let me know if I should be more specific about it being a deprecated method.
Should I add a footnote about working with CSVs? I've only worked with the mongodump format, but could ask around for help writing a section on that process.
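In the meantime, here is a minimal sketch of loading such a CSV export with pandas. The filename and column names are hypothetical placeholders, since the admin dashboard's actual export schema is not described here.

```python
import pandas as pd

# Hypothetical CSV export from the admin dashboard; the filename and
# column names below are placeholders, not the real export schema.
trips = pd.read_csv("trips_export.csv", parse_dates=["start_ts", "end_ts"])

# Basic sanity checks before analysis: row count and date coverage.
print(f"{len(trips)} trips from {trips['start_ts'].min()} to {trips['end_ts'].max()}")
```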
- More information on this approach can be found in the public dashboard [ReadMe](https://github.com/e-mission/em-public-dashboard/blob/main/README.md#large-dataset-workaround).
In general, it is best to follow the instructions of the repository you are working with. There are subtle differences between them, and these instructions are intended as general guidance only.
We should unify these but obviously we should keep this documentation until we do.
- Made the Docker-style analysis the main data analysis method
- Emphasized that the server method was for internal debugging purposes
## Working With Data ##
After requesting data from TSDC, you should receive a "mongodump" file -- a collection of data, archived in `.tar.gz` format. Here are the broad steps to work with this data:
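As a rough sketch of those steps -- extract the archive, restore it into a local MongoDB instance, and query the result -- assuming a standard `mongodump` layout. The `Stage_database`/`Stage_timeseries` names follow e-mission's usual conventions, but verify them against your actual dump.

```python
import subprocess
import tarfile

import pymongo

# 1. Extract the archive. The "dump/" subdirectory layout is an assumption
#    about how the archive was packaged; adjust the path to match yours.
with tarfile.open("mongodump.tar.gz", "r:gz") as archive:
    archive.extractall("restored_dump")

# 2. Restore into a locally running MongoDB instance via the mongorestore CLI.
subprocess.run(["mongorestore", "restored_dump/dump"], check=True)

# 3. Query the restored data. The database and collection names follow
#    e-mission's usual naming but should be checked against the dump itself.
client = pymongo.MongoClient("localhost", 27017)
db = client["Stage_database"]
print("timeseries entries:", db["Stage_timeseries"].count_documents({}))
```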
The TSDC will not provide mongodumps. The TSDC will provide access to the data in CSV files or a Postgres database. The mongodump is currently only available for internal use.
Shankari and I discussed this page of the documentation and found it was out of date -- as such, we're updating it! I'm also adding some documentation on how to work with the data, building off of Abby's work with the dashboard (link)