Using a random 10% sample of tweets authored from 2019-09-01 through 2020-04-30, we analyze the dynamic behavior of words (1-grams) used on Twitter to describe the ongoing COVID-19 pandemic. Across 24 languages, we find two distinct dynamic regimes: one characterizing the rise and subsequent collapse in collective attention to the initial coronavirus outbreak in late January, and a second that represents March COVID-19-related discourse. Aggregating countries by dominant language use, we find that volatility in the first dynamic regime is associated with future volatility in new cases of COVID-19 roughly three weeks (average 22.49 $\pm$ 3.26 days) later. Our results suggest that surveillance of change in usage of epidemiology-related words on social media may be useful in forecasting later change in disease case numbers, but we emphasize that our current findings are not causal or necessarily predictive.
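The lead-lag association described above can be illustrated with a lagged correlation search. This is only a minimal sketch on synthetic data, not the authors' procedure: the function names `pearson` and `best_lag`, the choice of Pearson correlation, and the two toy series are all assumptions made for illustration.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)

def best_lag(leader, follower, max_lag):
    """Return (lag, r) maximizing the correlation of leader[t] with follower[t + lag]."""
    best = (0, pearson(leader, follower))
    for lag in range(1, max_lag + 1):
        r = pearson(leader[:-lag], follower[lag:])
        if r > best[1]:
            best = (lag, r)
    return best

# Synthetic (hypothetical) series: the second is the first delayed by 3 steps.
word_volatility = [0, 1, 4, 9, 4, 1, 0, 0, 0, 0, 0, 0]
case_volatility = [0, 0, 0] + word_volatility[:-3]

lag, r = best_lag(word_volatility, case_volatility, max_lag=6)
print(lag, round(r, 3))
```

A high correlation at some lag indicates association only; as the abstract stresses, it is neither causal nor necessarily predictive.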
In confronting the global COVID-19 pandemic, we must have coordinated medical, operational, and political responses. In all efforts, data is crucial. Fundamentally, and in the possible absence of a vaccine for 12 to 18 months, we need universal, well-documented testing for both the presence of the disease and confirmed recovery through serological tests for antibodies, and we need to track major socioeconomic indices. But we also need auxiliary data of all kinds, including data related to how populations are talking about the unfolding pandemic through news and stories. To help in part on the social media side, we curate a set of 2000 day-scale time series of 1- and 2-grams across 24 languages on Twitter that are most important for April 2020 with respect to April 2019. We determine importance through our allotaxonometric instrument, rank-turbulence divergence. We make some basic observations about some of the time series, including a comparison to numbers of confirmed deaths due to COVID-19 over time. We broadly observe across all languages a peak for the language-specific word for virus in January 2020 followed by a decline through February and then a surge through March and April. The world's collective attention dropped away while the virus spread out from China. We host the time series on GitLab, updating them on a daily basis while relevant. Our main intent is for other researchers to use these time series to enhance whatever analyses may be of use during the pandemic as well as for retrospective investigations.
Coronavirus disease (COVID-19), which is caused by SARS-CoV-2, has become a pandemic. This disease is highly infectious and potentially fatal, causing a global public health concern. To contain the spread of COVID-19, governments are adopting nationwide interventions, such as lockdown, containment and quarantine, restrictions on travel, cancellation of social events, and extensive testing. To understand the effects of these measures on the control of the epidemic in a data-driven manner, we propose a probabilistic cellular automata (PCA) based modified SEIQR model. The transitions associated with the model are driven by data available on the chronology, symptoms, pathogenesis, and transmissivity of the virus. Arguing that the lattice-based model captures the features of the dynamics along with the existing fluctuations, we perform rigorous computational analyses of the model to take into account the spatial dynamics of social distancing measures imposed on the population. Considering the probabilistic behavioural aspects associated with mitigation strategies, we study the model under factors such as population density and testing efficiency. Using the model, we focus on the variability of epidemic dynamics data for different countries and point out the reasons behind these contrasting observations. To the best of our knowledge, this is the first attempt to model COVID-19 spread using PCA that gives both spatial and temporal variations of the infection spread, with insight into the contributions of different infection parameters.
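To make the idea of a probabilistic cellular automaton over SEIQR states concrete, the following is a minimal sketch of one synchronous lattice update. It is not the model proposed in the paper: the transition rules, the four-cell neighbourhood with periodic boundaries, and every probability value (`p_inf`, `p_ei`, `p_iq`, `p_ir`, `p_qr`) are hypothetical choices made purely for illustration.

```python
import random
from collections import Counter

S, E, I, Q, R = "S", "E", "I", "Q", "R"
NEIGHBOURS = ((1, 0), (-1, 0), (0, 1), (0, -1))  # von Neumann neighbourhood

def step(grid, rng, p_inf=0.3, p_ei=0.2, p_iq=0.1, p_ir=0.05, p_qr=0.1):
    """One synchronous update of a square lattice with periodic boundaries.

    All probabilities are illustrative assumptions, not fitted values.
    """
    n = len(grid)
    new = [row[:] for row in grid]
    for i in range(n):
        for j in range(n):
            state = grid[i][j]
            if state == S:
                # Each infectious neighbour infects independently with p_inf.
                k = sum(grid[(i + di) % n][(j + dj) % n] == I
                        for di, dj in NEIGHBOURS)
                if rng.random() < 1 - (1 - p_inf) ** k:
                    new[i][j] = E
            elif state == E:
                if rng.random() < p_ei:
                    new[i][j] = I
            elif state == I:
                u = rng.random()
                if u < p_iq:            # detected by testing and quarantined
                    new[i][j] = Q
                elif u < p_iq + p_ir:   # recovers without being detected
                    new[i][j] = R
            elif state == Q:
                if rng.random() < p_qr:
                    new[i][j] = R
    return new

def counts(grid):
    """Number of cells in each state."""
    return Counter(cell for row in grid for cell in row)

rng = random.Random(0)
grid = [[S] * 20 for _ in range(20)]
grid[10][10] = I  # a single initial infection
for _ in range(30):
    grid = step(grid, rng)
print(dict(counts(grid)))
```

In this sketch, raising `p_iq` plays the role of higher testing efficiency, and leaving some lattice sites empty would be one way to mimic lower population density.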
As the recent COVID-19 outbreak rapidly expands all over the world, various containment measures have been carried out to fight the pandemic. In Mainland China, the containment measures are of three types, i.e., the Wuhan travel ban, intra-city quarantine and isolation, and inter-city travel restriction. In carrying out the measures, local economy and information acquisition play an important role. In this paper, we investigate how local economy and information acquisition correlate with the execution of containment measures against the COVID-19 pandemic in Mainland China. First, we use a parsimonious model, i.e., the SIR-X model, to estimate the parameters that represent the execution of intra-city quarantine and isolation in major cities of Mainland China. To understand the execution of intra-city quarantine and isolation, we analyze the correlation between these representative parameters and indicators of local economy, mobility, and information acquisition. To this end, we collect data on Gross Domestic Product (GDP), the inflows from Wuhan and outflows, and the COVID-19-related search frequency from a widely used Web mapping service, i.e., Baidu Maps, and Web search engine, i.e., Baidu Search Engine, in Mainland China. Based on the analysis, we confirm a strong correlation between the local economy and the execution of information acquisition in major cities of Mainland China. We further show that, although cities with high GDP per capita attract bigger inflows from Wuhan, people there are more likely to comply with quarantine measures and to reduce travel to other cities. Finally, the correlation analysis using search data shows that well-informed individuals are likely to carry out containment measures.
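As a rough illustration of how containment enters an SIR-X model, the sketch below integrates the commonly cited form of SIR-X with forward Euler. The equations are assumed to match the variant used in the paper, which the abstract does not spell out; the function name `simulate_sirx`, the symbol names (`alpha`, `beta`, `kappa0`, `kappa`), and all parameter values are illustrative assumptions, not estimates for any city.

```python
def simulate_sirx(alpha, beta, kappa0, kappa, i0, days, dt=0.01):
    """Forward-Euler integration of an SIR-X model on population fractions.

    Assumed form (a sketch, not necessarily the paper's exact variant):
      dS/dt = -alpha*S*I - kappa0*S
      dI/dt =  alpha*S*I - beta*I - kappa0*I - kappa*I
      dX/dt = (kappa0 + kappa)*I          # quarantined infecteds
      dR/dt =  beta*I + kappa0*S          # recovered or shielded
    Returns one (day, S, I, X, R) tuple per day, starting at day 0.
    """
    s, i, x, r = 1.0 - i0, i0, 0.0, 0.0
    history = [(0, s, i, x, r)]
    steps_per_day = round(1 / dt)
    for day in range(1, days + 1):
        for _ in range(steps_per_day):
            ds = -alpha * s * i - kappa0 * s
            di = alpha * s * i - beta * i - kappa0 * i - kappa * i
            dx = (kappa0 + kappa) * i
            dr = beta * i + kappa0 * s
            s, i, x, r = s + ds * dt, i + di * dt, x + dx * dt, r + dr * dt
        history.append((day, s, i, x, r))
    return history

# Illustrative (hypothetical) parameters: with and without containment.
contained = simulate_sirx(alpha=0.5, beta=0.125, kappa0=0.05, kappa=0.05,
                          i0=1e-4, days=60)
free = simulate_sirx(alpha=0.5, beta=0.125, kappa0=0.0, kappa=0.0,
                     i0=1e-4, days=60)
print(max(row[2] for row in contained), max(row[2] for row in free))
```

Under these assumptions, `kappa0` and `kappa` are the quantities one would fit per city and then correlate with economic, mobility, and search indicators.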
The declaration of COVID-19 as a pandemic has largely amplified the spread of related information on social media, such as Twitter, Facebook, and WeChat. Unlike previous studies, which focused on how to detect misinformation or fake news related to COVID-19, we investigate how the disease and information co-evolve in the population. We focus on COVID-19 and its information during the period when the disease was widely spread in China, i.e., from January 25th to March 24th, 2020. We first explore how the disease and information co-evolve via a spatial analysis of the two spreading processes. We visualize the geo-location of both disease and information at the province level and find that the disease is more geo-localized than the information. We find a high correlation between the disease and information data, and that people care about the spread only when it reaches their neighborhood. Regarding the content of the information, we find that positive messages are more negatively correlated with the disease than negative and neutral messages. Additionally, we introduce machine learning algorithms, i.e., linear regression and random forest, to further predict the number of infected using different spatial disease-related and information-related characteristics. We find that the spatial disease-related characteristics of nearby cities can help to improve the prediction accuracy. Meanwhile, information-related characteristics can also help to improve the prediction performance, but with a delay, i.e., the improvement comes from using, for instance, the number of messages 10 days ago, for disease prediction. The methodology proposed in this paper may shed light on new clues of emerging infections.
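The delayed usefulness of information features can be illustrated by regressing today's infected count on the message count from some days earlier. This is a minimal single-feature least-squares sketch on synthetic data, not the paper's pipeline (which also uses spatial features and random forests); the helper names `lagged_pairs` and `fit_line` and the toy series are hypothetical.

```python
def lagged_pairs(messages, infected, lag):
    """Pair messages[t - lag] with infected[t] for every valid day t."""
    return messages[:len(messages) - lag], infected[lag:]

def fit_line(x, y):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx

# Synthetic (hypothetical) daily message counts.
messages = [3, 1, 4, 1, 5, 9, 2, 6, 5, 3, 5, 8, 9, 7, 9,
            3, 2, 3, 8, 4, 6, 2, 6, 4, 3, 3, 8, 3, 2, 7]
# Toy ground truth: infections follow messages from 10 days earlier.
infected = [0] * 10 + [2 * m + 5 for m in messages[:20]]

x, y = lagged_pairs(messages, infected, lag=10)
slope, intercept = fit_line(x, y)
print(round(slope, 6), round(intercept, 6))
```

In practice the lag would itself be chosen by validation, for example by comparing held-out prediction error across several candidate lags.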
New York has become one of the worst-affected COVID-19 hotspots and a pandemic epicenter due to the ongoing crisis. This paper identifies the impact of the pandemic and the effectiveness of government policies on human mobility by analyzing multiple datasets available at both macro and micro levels for New York City. Using data sources related to population density, aggregated population mobility, public rail transit use, vehicle use, hotspot and non-hotspot movement patterns, and human activity agglomeration, we analyzed the inter-borough and intra-borough movement for New York City by aggregating the data at the borough level. We also assessed the internodal population movement amongst hotspot and non-hotspot points of interest for the months of March and April 2020. Results indicate a drop of about 80% in people's mobility in the city, beginning in mid-March. The movement to and from Manhattan showed the most disruption for both public transit and road traffic. The city saw its first case on March 1, 2020, but disruptions in mobility can be seen only after the second week of March, when the shelter-in-place orders were put into effect. Owing to people working from home and adhering to stay-at-home orders, Manhattan saw the largest disruption to both inter- and intra-borough movement. But the risk of spread of infection in Manhattan turned out to be high because of higher hotspot-linked movements. The stay-at-home restrictions also led to an increased population density in Brooklyn and Queens, as people were not commuting to Manhattan. Insights obtained from this study would help policymakers better understand human behavior and people's response to the news and governmental policies.