Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data requests #397

Closed
octycs opened this issue Feb 6, 2023 · 3 comments
Closed

Data requests #397

octycs opened this issue Feb 6, 2023 · 3 comments
Labels
data This issue is related to the data collection and aggregation

Comments

@octycs
Copy link
Contributor

octycs commented Feb 6, 2023

This is a proposed tracking issue for specific data requests. Please comment if you have ideas of how to improve.

Coordinates

Based on calendar events in a reference week from the 16-20 Jan 2023, the following (subset of) rooms have events happening in them but no coordinate known:

Stammgelände

RiWa 1 (HfP, Governance)

(Has neither maps nor coordinates at the moment)

Weihenstephan

(<4 truncated)

Garching (Chemie)

Garching (MW)

Garching (Rest)

@octycs octycs added the data This issue is related to the data collection and aggregation label Feb 6, 2023
@octycs octycs pinned this issue Feb 6, 2023
@CommanderStorm
Copy link
Member

I did some analytics on 2022. This is a more complete view:

General Event Statistics

Events in Rooms we have coordinates for: 62581
Events in Rooms we don't have coordinates for: 292719
=> 21.37% of Events are in Rooms for which we don't have coordinates

Code for this analytic

SELECT COUNT(*)
from calendar c
WHERE extract(year from DATE(c.dtstart)) = 2022 AND
      extract(year from DATE(c.dtend)) = 2022 AND
      NOT EXISTS(
          SELECT * FROM unacurate_coordinates u
          WHERE u.key = c.key
      )

All Rooms

This is the List of rooms with events in 2022, for which we dont have exact coordinates

Code for this analytic

SELECT c.key,Count(*) --,DATE(MIN(c.dtstart)),DATE(MAX(c.dtend))
from calendar c, unacurate_coordinates u
WHERE extract(year from DATE(c.dtstart)) = 2022 AND
      extract(year from DATE(c.dtend)) = 2022 AND
      u.key= c.key
group by c.key
order by Count(*) desc

High-Traffic Rooms

I assume that not all rooms are equally important.
There are 249 work days in a year => lets look at these rooms.

HT Rooms with coordinates: 249
HT Rooms without coordinates: 83
=> we are missing 25% of HT rooms

Code for this analytic

SELECT c.key,Count(*)
from calendar c
WHERE extract(year from DATE(c.dtstart)) = 2022 AND
      extract(year from DATE(c.dtend)) = 2022 AND
      EXISTS(
          SELECT * FROM unacurate_coordinates u
          WHERE u.key = c.key
      )
GROUP BY c.key
HAVING COUNT(*) > 249

@octycs
Copy link
Contributor Author

octycs commented Apr 11, 2023

I was curious why 5612.02.020 has so many events; apparently IRIS adds any usage into the calendar. Anyway it is probably easy to add this coordinate (given 5612.02.018 and 5612.02.019 or you might be close occasionally).

@CommanderStorm
Copy link
Member

Closing as I don't think without #670 this issue makes a how lot of sense. (plus I don't think it has had any effect ^^)

@CommanderStorm CommanderStorm unpinned this issue Mar 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
data This issue is related to the data collection and aggregation
Projects
None yet
Development

No branches or pull requests

2 participants