The solution makes it possible to solve the pandemic mapping problem. Today, due to the large volume of data to analyze, it ends up being difficult to map each infection point manually. The solution allows to break this barrier, being focused on mapping the contamination on a broader scale. In a real scenario, it is important to detect outbreaks and also possible contaminants, also helping to test individuals. The project uses an algorithm written in python that applies filters to an image to highlight areas of contamination based on prior knowledge of a geographical point of contamination. Through this idea, we use machine learning in order to map areas at risk of contamination for consultation. One way to manage this data should be done in conjunction with society, in which, in Brazil, you can use the data from the SUS card to validate the addresses of patients who have discharged with symptoms of COVID-19 and at the end of the day update the map Of region.
Project prototype: http://www.trackingthevirus.us/
Data collected at https://trends.google.com.br/trends/
Here we want to identify whether we can predict any abnormal pattern using Google Trend as a data source to cross-check data on the spread of COVID19. With that, we collected some data using the platform and crossed the main keywords that indicated symptoms related to the disease. To complement this idea, we are also looking for data with the same keywords for 2019, the year the H1N1 epidemic occurred.
This data can be essential to detect some unusual patterns since people use the platform a lot to search for symptoms of some disease. The description of the analysis can be seen below:
- First filter: (../data/google_trends/filter_1/g1.csv)
- keyword: [Fever, Dry Cough, Tiredness, Diarrhoea, Headache];
- Date: 2008-2009 (H1N1);
- Country: World
- Second filter: (../data/google_trends/filter_2/g2.csv)
- Keyword [Fever, Dry Cough, Tiredness, Diarrhoea, Headache];
- Date: 2019-2020 (Sars-CoV-2);
- Contry: World
- Terceiro filtro: (../data/google_trends/filter_3/g3.csv)
- Keyword: [Fever, Dry Cough, Tiredness, Diarrhoea, Headache];
- Date: 2019-2020 (Sars-CoV-2);
- Contry: Italy;
The following data are historical searches for symptoms related to viral infections. They all show a higher peak in the event of an infection outbreak. The data generated is based around the world, but this can also be done using place filters on Google Trends. Initially, we can notice 3 things in the following graphics:
Some peaks are quite relevant, such as the one that started on 19/04/2009. That same month there were cases related to H1N1.
We can note that the search for "Fever" and "Cough" starts on 12/1/2019 and goes up to its maximum on 3/15/2020, which coincides with the cases of Sars-Cov-2.
In this case, we separate the case from Italy, here searches also have the same pattern and start to rise in December.
For prediction, we used the SARIMAX model applied to the data available on Google Trends itself:
Satellite images taken from: https://worldwind.arc.nasa.gov/
Here we use satellite data seen at night to compare more populous cities regarding covid infection, some of our comparisons can be seen below: