Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make astra scraper, schemas & docs #9

Closed
2 tasks done
jpahm opened this issue Oct 2, 2023 · 8 comments · Fixed by #31
Closed
2 tasks done

Make astra scraper, schemas & docs #9

jpahm opened this issue Oct 2, 2023 · 8 comments · Fixed by #31
Assignees
Labels
Help Wanted Extra attention is needed L3 A task suitable for someone who is comfortable implementing large-scale features/projects.

Comments

@jpahm
Copy link
Contributor

jpahm commented Oct 2, 2023

We would like to make an Astra scraper to develop a database of room and room usage data for the API.

We'll also need to develop schemas for the room data we're scraping.

  • Determine what we want to scrape from Astra (what data can we reliably provide via scraping?)
  • Create an Astra scraper (contact me if you want to work on this, I'll need to provide you the auth for it)
@jpahm jpahm changed the title Make Astra Scraper Make Astra Scraper & Schemas Oct 3, 2023
@jpahm jpahm added the L3 A task suitable for someone who is comfortable implementing large-scale features/projects. label Oct 24, 2023
@jpahm jpahm changed the title Make Astra Scraper & Schemas Make astra scraper, schemas & docs Oct 24, 2023
@jpahm jpahm added the Help Wanted Extra attention is needed label Feb 3, 2024
@TyHil TyHil self-assigned this Sep 25, 2024
@TyHil
Copy link
Member

TyHil commented Sep 26, 2024

The data format for the Astra API (https://www.aaiscloud.com/UTXDallas/~api/calendar/CalendarWeekGrid) is a list of these:

[
	"fedac217-4536-11ef-b8d2-85d9c567bf4d",
	357226,
	"FN 2.204 Media Upgrade",
	null,
	null,
	"Meeting",
	null,
	"2024-09-26T00:00:00",
	"2024-09-26T23:30:00",
	5,
	0,
	1410,
	2,
	"86212fa5-5b3a-4a64-bc63-cc9c6e22bac2",
	"075080UTDM",
	"FN",
	"2.204",
	" ",
	"FN 2.204",
	"9845f293-fd21-4dd8-95b4-95eb023fd533",
	null,
	null,
	null,
	0,
	0,
	0,
	"fedac210-4536-11ef-b8d2-85d9c567bf4d",
	85175,
	"Scheduled",
	null,
	"228B22",
	false,
	null,
	0,
	null,
	22
]

which follow the format specified in the fields section of the body:

ActivityId,
ActivityPk,
ActivityName,
ParentActivityId,
ParentActivityName,
MeetingType,
Description,
StartDate,
EndDate,
DayOfWeek,
StartMinute,
EndMinute,
ActivityTypeCode,
ResourceId,
CampusName,
BuildingCode,
RoomNumber,
RoomName,
LocationName,
InstitutionId,
SectionId,
SectionPk,
IsExam,
IsCrosslist,
IsAllDay,
IsPrivate,
EventId,
EventPk,
CurrentState,
NotAllowedUsageMask,
UsageColor,
UsageColorIsPrimary,
EventTypeColor,
MaxAttendance,
ActualAttendance,
Capacity

@TyHil TyHil linked a pull request Sep 26, 2024 that will close this issue
@TyHil
Copy link
Member

TyHil commented Sep 26, 2024

Thanks!

@TyHil TyHil mentioned this issue Sep 27, 2024
@TyHil
Copy link
Member

TyHil commented Sep 27, 2024

I'm not sure how I'd go about making a schema for this. Seems that may be in the nebula-api repo. @jpahm can you point me in the right direction?

And are parser and uploader changes in the scope of this issue?

@jpahm
Copy link
Contributor Author

jpahm commented Oct 11, 2024

I'm not sure how I'd go about making a schema for this. Seems that may be in the nebula-api repo. @jpahm can you point me in the right direction?

And are parser and uploader changes in the scope of this issue?

@TyHil Sorry for the slow response on this Tyler!

Indeed this would involve adding schemas to the nebula-api repo, and I do agree that it may be a bit outside of scope for this issue. I'm thinking of maybe making a new issue for it and letting some new members of the team try to tackle it if that sounds good to you?

@TyHil
Copy link
Member

TyHil commented Oct 11, 2024

No worries and sounds great. I ran the scraper again and put the data to base the schema off of here: https://drive.google.com/file/d/1MzcPu7dFYjpIz5AQd8ZjqaLiP4Mfr-rP/view?usp=sharing

@jpahm
Copy link
Contributor Author

jpahm commented Oct 11, 2024

No worries and sounds great. I ran the scraper again and put the data to base the schema off of here: https://drive.google.com/file/d/1MzcPu7dFYjpIz5AQd8ZjqaLiP4Mfr-rP/view?usp=sharing

Awesome, thanks!

@jpahm
Copy link
Contributor Author

jpahm commented Oct 11, 2024

Closing and moving schema creation to #34.

@jpahm jpahm closed this as completed Oct 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Help Wanted Extra attention is needed L3 A task suitable for someone who is comfortable implementing large-scale features/projects.
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants