Issue 
Math. Model. Nat. Phenom.
Volume 15, 2020
Coronavirus: Scientific insights and societal aspects



Article Number  35  
Number of page(s)  25  
DOI  https://doi.org/10.1051/mmnp/2020022  
Published online  19 June 2020 
Contact rate epidemic control of COVID19: an equilibrium view
^{1}
LAMA – UMR 8050, Université Gustave Eiffel,
ChampssurMarne, France
^{2}
CEREMADE – UMR 7534, Université Paris Dauphine, PSL Research University, France
^{*} Corresponding author: gabriel.turinici@dauphine.fr
Received:
17
April
2020
Accepted:
29
May
2020
We consider the control of the COVID19 pandemic through a standard SIR compartmental model. This control is induced by the aggregation of individuals’ decisions to limit their social interactions: when the epidemic is ongoing, an individual can diminish his/her contact rate in order to avoid getting infected, but this effort comes at a social cost. If each individual lowers his/her contact rate, the epidemic vanishes faster, but the effort cost may be high. A Mean Field Nash equilibrium at the population level is formed, resulting in a lower effective transmission rate of the virus. We prove theoretically that equilibrium exists and compute it numerically. However, this equilibrium selects a suboptimal solution in comparison to the societal optimum (a centralized decision respected fully by all individuals), meaning that the cost of anarchy is strictly positive. We provide numerical examples and a sensitivity analysis, as well as an extension to a SEIR compartmental model to account for the relatively long latent phase of the COVID19 disease. In all the scenario considered, the divergence between the individual and societal strategies happens both before the peak of the epidemic, due to individuals’ fears, and after, when a significant propagation is still underway.
Mathematics Subject Classification: 92D30 / 92Bxx / 91A16
Key words: COVID19 / SARSCoV2 / epidemic control / SEIR model / Mean Field Games
© The authors. Published by EDP Sciences, 2020
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1 Introduction
As of April 10th 2020, almost half of world population is under strong restrictions (lockdown) enforced by local governments to limit the ongoing SARSCoV2 mediated COVID19 pandemic. These restrictions mainly concern the reduction of social interactions. They are instantiated at different geographical scales (householdings, towns, states, countries...) and already had a significant worldwide economic impact. All individuals affected by epidemic control measures pay a cost in terms of health (reports show an underreporting of severe illnesses such as hearth, ischemic strokes, ...), time, money, social interactions, psychological pressure (domestic violence increased), etc. However, until a sufficient proportion of the population becomes immune (by infection or vaccination), the choice of control measures together with an adequate tracking of the virus spread are key to limit the epidemic as well as the social and economic impacts of the contact rate decay. In such context, each individual needs to balance the individual and collective outcomes of his/her behavior, when choosing his/her level of social interaction with the population. For instance, he/she may be tempted to use the socalled “free lunch" strategy, i.e., take advantage of the low epidemic activity (due to others’ efforts) while not contributing to the effort himself/herself. However, population benefits are at stake because the epidemic dynamics is induced by the aggregation of individual decisions. In practice, an equilibrium is formed between the individuals and we analyze in this work this equilibrium.
From the technical point of view, we study a “Mean Field Game". Introduced by Lasry and Lions in [38–40] and independently by Huang, Caines, and Malhamé [30–33], Mean Field Games focus on the derivation of a Nash equilibrium within a population containing an infinite number of individuals, see [17] for a complete mathematical description. Such asymptotic viewpoint simplifies the game theoretic analysis of the interactions among the population, as typically the impact of each individual over the entire population can be neglected. It is worth pointing out that the optimal strategy in a Mean Field game provides approximate Nash equilibrium for similar games involving a finite number of players (see [17]). Fields of applications of Mean Field games, mentioned in [28], include energy [24], finance [16, 25], crowd modeling [3], and also epidemic dynamics [10, 34, 36, 37, 50]. These latter works focus on the impact of individual vaccination decisions on the dynamics of the epidemic. In this paper, we follow a similar approach but study the impact of individual decisions concerning distancing and isolation, in an epidemic dynamics where no vaccination is (yet) available.
Optimal control of epidemic is often modeled through a combination of isolation and vaccination strategies. Early work of [1, 2] focuses on the optimal vaccination timing of individuals, while combined with cost free instantaneous isolation strategy. More realistic impacts and constraints on the quarantine and isolation strategies have been considered in [43, 58] or [11, 29]. The main objective of such policy is to control the reproduction number of the epidemic (see [45]) together with limiting the social and economic impact of such policy. An other part of the literature (see [13, 22, 23, 26, 48, 49, 57] for example) tries to model and to take into account the individual behavioral response to isolation policy in such context. Such modeling of individual response is of course greatly influenced by cultural habits together with societal, economic or religious need for social interactions. The addition of such individual based feedback effect on quarantine governmental policy is key in the modeling of optimal control for epidemic dynamics. The main outcome of this paper is to provide an easily tractable modeling approach for such a purpose.
We address in this work the question of epidemic control using an individual based modeling approach, and ask the question of how does the cost of a social distancing strategy disseminate among individuals. Each individual chooses optimally his/her contact rate with others, striking a balance between the cost of being infected by the virus, and the cost induced by reducing social interactions. On one hand, the epidemic dynamics is induced by the aggregation of all individual contact rate decisions. On the other hand, the epidemic level influences the contamination probability of each individual and hereby their own contact rate. We prove that a Mean Field Nash equilibrium is reached among the population and we quantify the impact of individual choice between epidemic risk and other unfavorable outcomes. Numerical experiments show that the self isolation equilibrium strategy is characterized by a quick contact rate reduction followed by a slower return to the usual social interaction level. It allows reducing significantly the proportion of infected individuals required in order to achieve herd immunity.
We also compare the induced epidemic dynamics to a situation where a global planner (typically a strongly empowered government) is able to control all the interaction rates among individuals. We observe numerically that the Mean Field Nash equilibrium provides a suboptimal contact rate strategy in comparison to the societal optimum, and measure the corresponding socalled cost of anarchy. We observe in particular that the divergence between the individual and societal strategies happens after the epidemic peak but while significant propagation is still underway. By taking into account the individual responses to different cost structures, our modeling approach can provide insights on the impact of political decisions concerning the costs induced by contact rate reduction.
The paper is organized as follows. We first present in Section 2 the SIR model used to describe the COVID19 2020 epidemic dynamics, together with the control problem faced by each individual. We prove in Section 3 the existence of a Mean Field Nash equilibrium in our setting and present a numerical algorithm for its approximation. Section 4 focuses on the numerical experiments and presents the computing strategy, and the results in terms of cost of anarchy and sensitivity of the optimal contact rate strategy to the main key modeling parameters. It provides in particular a comparison with a global planner setting. In order to take into account for the relatively long latent phase of the COVID19 disease, we extend our study in Section 5 to an SEIR epidemic model, and present some numerical results. Section 6 provides the technical parts of the mathematical analysis associated to our study. Finally, Section 7 summarizes the main outputs of the paper and discusses their limits and potential extensions.
2 Individual based modeling of the epidemic dynamics
2.1 The model
The dynamics of the epidemic is modeled by the standard SIR (Susceptible–Infected–Recovered) compartment model represented in Figure 1. We refer to [6, 14, 15] for additional details on this model, as well as the description of many other mathematical epidemic propagation models (and to [19, 44, 55, 56] for specific coronavirus models). During the epidemic propagation, each individual can be either “Susceptible", “Infected" or “Recovered", and (S_{t}, I_{t}, R_{t}) denotes the proportion of each category at time t ≥ 0. The group of “Recovered" represents the collection of individuals whose behavior does not impact the transmission of the virus anymore. This includes the individuals who potentially did not survive the epidemic, as well as the individuals tested positive and isolated in a perfect quarantine. Besides, we hereby implicitly assume that the immunity acquired by the “Recovered" lasts for ever. As a first step, we choose to restrict our analysis to a simple SIR model rather than considering more complex (and realistic) models, in order to focus our analysis on the role of individual decisions aggregation on the epidemic dynamics. Nevertheless, our study can be extend to more complicated epidemic models. In particular, an extension to the SEIR compartment model is presented in Section 5.
More precisely, the (continuoustime) evolution of the disease is described by the following equations: (2.1)
with a given initial compartmental distribution of individuals at time 0 denoted (S_{0}, I_{0}, R_{0}), which is supposed to be known. We assume that initially S_{0} + I_{0} + R_{0} = 1, and observe that the system (2.1) possesses a conservation law, i.e., for all t ≥ 0, S_{t} + I_{t} + R_{t} = 1.^{1} As the dynamics of the epidemic is not impacted by the size of the class R, we can restrict our focus on the evolution of the proportion (S, I) of both susceptible and infected.
The model described by (2.1) involves two parameters, γ and . The constant parameter γ > 0 is exogenousand identifies the recovery rate, i.e., it represents the inverse of the length (in days) of the contagious period (or time before strict isolation). In our framework, the key parameter is the transmission rate of the disease, denoted , which is considered to be endogenous and timedependent, contrary to more classical SIR models. To focus our study on this transmission rate, we assume that the parameter γ is fixed and known.
The transmission rate depends essentially on two factors: the disease characteristics and the contact rate within the population. We will denote by β_{0} the constant initial transmission rate of the disease, i.e., without any control measures or effort from the population. Although the society cannot modify the disease characteristics, it can produce (possibly strong) incentives to each individual to reduce his/her contact rate with other individuals in the population. This mitigation strategy (lockdown) has been chosen by many countries during the Covid19 epidemic propagation, some countries starting strict measures in January (China) while other countries in February, March and so on; this is implemented in order to slow down the epidemic propagation. However, for each individual, reducing thecontact rate with others comes at a cost, as described in the introduction (health hazards, psychological pressures, loss of social relationships, income uncertainty...) and it may even expose the individual to yet unknown risks. The latter is especially true for long epidemics that require substantial efforts from the individuals. On the other hand, if everybody else lowers his/her own contact rate then the epidemic diffusion will be vanishing and a given individual, having this knowledge, can act as a freerider and effectively increase his/her contact rate without too much risk.
Finally, the global epidemics propagation rate of the society follows from the aggregation of all individuals contact rates, so that an equilibrium is reached between all contact rates β chosen by each individual and the overall epidemic transmission rate in the society.
Figure 1 Graphical illustration of the SIR model. 
2.2 Individual viewpoint: contact rate optimization
We focus in this section on the individual perspective on the epidemic control. Consistent with the literature, we suppose that the population can be partitioned into several collections of identical individuals (here the “Susceptible", “Infected and infectious" and “Removed" classes), so that each individual in a given collection takes the same decisions as other individuals in the same class. This implies that we can consider a “representative" individual, which is an arbitrary individual in a given class. As we are interested in the unfold of the COVID19 pandemic (as opposed to an endemic disease such as measles, mumps, rubella, pertussis, ...) we will consider a finite time horizon T > 0, large enough so that the epidemic is over at time T^{2}.
The probability of infection of the representative individual depends on both his/her own contact rate β with the population, together with the proportion I_{t} of infectious individuals in the population. More precisely, let τ denote the random infection time of the representative individual. His/Her probability of being infected before time t, when using contact rate β, has the following dynamics (see [36, 37]): (2.2)
where I_{t} is the proportion of (contagious) infected at time t, whose dynamics is driven by the population contact rate (as described in (2.1)).
We assume that, while in the Susceptible class, i.e., before the (possible) infection time τ, the representative individual can choose his/her contact rate β_{t} ∈ [β_{min}, β_{0}] (for all t ≤ τ). Recall that β_{0} represents the transmission rate of the disease without any control measures, while β_{min}> 0 is the lowest possible contact rate value that can be attained by the representative individual. The efforts of an individual for decreasing social interactions, in order to lower the transmission rate of the virus from β_{0} to some β ∈ [β_{min}, β_{0}], induce an instantaneous cost, represented by a decreasing function ^{3}.
Following [36], we assume that the global cost (seen from time t = 0) of an individual, while infected at time τ and choosing a dynamic contact rate , is defined by: (2.3)
where τ ∧ T denotes for the minimum between τ and T.
More precisely, when reducing his/her social interactions on the time interval [0, T], the representative individual faces a global cost given by the sum of:
 (i)
a cost for reducing social interactions until being infected, represented by: (2.4)
meaning that, if the individual is not infected during the period [0, T], his/her efforts are costly until T. On the contrary, if the individual is infected before T, his/her effort are only costly before τ.
 (ii)
a cost incurred at the infection time τ, defined by where r_{I} denotes the unitary cost of infection as quantified by the representative individual, together with any other costs that the presence in the “Infected” class implies. Note that r_{I} is taken here as a constant which does not depend on the pandemic evolution neither on the individual control measures that are specific to the “Infected" class.
We do not specify here the nature of these previous costs, both have health components but may also include other types of costs. We refer the reader to the literature on the QALY/DALY scales for further details [5, 51, 59].
Hence, the expected cost of the representative individual, seen from time t = 0, when using individual contact rate β while the population contact rate is , is given by the following: (2.5)
where the distribution of τ is given by (2.2) and thus indirectly driven by .
In order to determine his/her optimal contact rate β^{*} while the population rate driving is , the representative individual faces the following minimization problem: (2.6)
where the set of admissible contact rate strategies is defined by: (2.7)
2.3 Societal viewpoint: induced epidemic dynamics
In the previous subsection, we described the situation from the point of view of a representative individual. We assumed that each individual chooses his/her own contact rate β in order to minimize the associated cost , while represents the population’s contact rate. In our (Mean Field games) framework, one individual taken alone has no influence on the pandemic dynamics when choosing his/her contact rate. However, the global epidemic propagation rate of the society is induced by the aggregation of all individuals contact rates. Besides, the behavior of the representative individual described in the previous section, is the one used by all individuals assumed to be identical.
Hence, following the prescriptions of the Mean Field games framework, an equilibrium is sought between the contact rate β chosen by each individual and the overall epidemic transmission rate observed at the society level. More precisely, we look for an equilibrium among all individuals in the population, in the sense of the following definition.
Definition 2.1 (Mean Field Nash equilibrium).
The contact rate together with the epidemic dynamics (S^{⋆}, I^{⋆}, R^{⋆}) is a Mean Field Nash equilibrium if the following relations are satisfied:
 (i)
Individual rationality: it is optimal for the representative individual to choose the contact rate β = β^{⋆} when the epidemic dynamics is (S^{⋆}, I^{⋆}, R^{⋆});
 (ii)
Population consistency: whenever the population contact rate is , the induced epidemic dynamics is given by (S^{⋆}, I^{⋆}, R^{⋆}).
Identifying such an equilibrium boils down to a fixed point property of the optimal best response function of the representative individual as described in the following Section.
3 Mean Field Nash equilibrium and numerical approximation
3.1 Mean Field Nash equilibrium
As emphasized in the previous section, the contact rate chosen by each individual is impacted by the epidemic dynamics through the proportion of infected , while the number of infected is a direct consequence of the aggregation of all individuals’ behavior. We thus look for an equilibrium in such context, i.e., in the sense of Definition 2.1. In fact, finding a MeanField Nash equilibrium is equivalent to identifying a fixed point of the socalled best response function, which provides an optimal individual contact rate (or a set of optimal strategies if nonuniqueness of the optimal strategy) in response to a population contact rate .
We therefore introduce the best response function , which associates to any societal transmission rate , the optimal individual contact rate β^{⋆}: (3.1)
As mentioned earlier, although we expect to find at least one fixed point to this application, in general is a multivalued mapping. We make the following assumption:
Assumption 3.1.
The cost function c is decreasing, two times differentiable with continuous second derivative (i.e., of class) and the following holds:
Most of the results that follow also hold under much less demanding assumptions, but at the price of increasing technicalities. We first state the theoretical result concerning the existence of a unique best response strategy in for a given .
Lemma 3.2.
Suppose is fixed. Under Assumption 3.1, there exists a unique satisfying
We are now in position to state the main mathematical result of the paper, concerning the existence of a Mean Field Nash equilibrium.
Theorem 3.3.
Under Assumption 3.1, the model (2.1) admits a Mean Field Nash equilibrium together with , in the sense of Definition 2.1, i.e., for any ,
Note that the result of Theorem 3.3 only informs on the existence of an equilibrium and not to its possible uniqueness. The uniqueness of a Mean Field equilibrium has been recognized as a difficult problem from the very beginning of the MFG theory (see counterexamples in remark after Thm. 2.2 in [38]) and very little theoretical guidance exists to support such a claim. Among the methods that can allow to obtain uniqueness one can list the monotony assumptions (see [38, 39, 40]) or the evolution dynamics (see [54]) but none seems to apply directly here. Moreover, with few exceptions, the epidemic Mean Field equilibrium has rarely been proved to possess systematic uniqueness properties. For sake of clarity, the proofs of both Lemma 3.2 and Theorem 3.3 are postponed to Section 6.4.
3.2 Numerical approach
We describe below the methodology used in the numerical experiments in order to approximate the Mean Field Nash equilibrium β^{⋆} . Recall that the cost for the representative individual when choosing a contact rate β while the societal transmission rate is , is given by .
The metric equilibrium flow approach introduced in [54] (to which we refer the reader for rigorous mathematical transcription of the objects below, see also [50] for an application) prescribes the following iterative procedure in order to reach an equilibrium: choose a pseudotime step h > 0 and define iteratively β^{n+1} as a minimizer of the following functional (3.5)
Here, d(⋅, ⋅) is a geodesic distance on the space of all possible individual choices β. Note that when is a Hilbert space, ℭ(⋅, ⋅) is smooth enough, and d(⋅, ⋅) is the distance induced by the canonical norm, the fact that β^{n+1} is a minimizer of the functional in (3.5) implies: (3.6)
where ∇_{1} denotes for the derivative with respect to the first argument.
Relation (3.6) is similar to the JKO scheme used in one variable gradient flows, see [4]. The problem with (3.6) is that it is implicit and thus not compatible, in this form, with numerical computation. In practice, when the vectorial structure on is compatible with the topological structure, one can propose an iterative procedure to find β^{n+1} solution of (3.6): start with β^{n+1,0} = β^{n} and iterate for ℓ ≥ 0 (3.7)
It is standard to see that for h small enough and in presence of (e.g. Lipschitz) regularity of ∇_{1}ℭ with respect to its first argument, the iterations are guaranteed to converge, by a Picard fixed point argument, to the (unique) solution of (3.6). However, for numerical convenience, in practice only L iterations of (3.7) are performed, and for our numerical simulations, L = 1 worked just fine. Therefore, we implement the following proxy for the minimization in (3.5): (3.8)
When is a Hilbert space and forgetting any possible regularity issues, we obtain an Explicit Euler discretization of the equilibrium flow defined in [54]. Thus, we can expect that will be the equilibrium we want to compute.
4 Numerical experiments
For a Python implementation of the numerical approach see the supplementary material.
4.1 Choice of parameters
The following numerical experiments are done using a contact rate reduction cost function defined by: (4.1)
Recall that the parameter β_{min} represents the minimal achievable contact rate by a representative individual, while β_{0} denotes the usual contact rate used before the beginning of the lockdown measures. In other words, β_{0} represents the transmission rate of the disease without any isolation effort of the population. The shape of the cost function encompasses the increasing difficulty to bring the contact rate closer to zero. Conversely, without effort of the individual at time t, meaning that β_{t} = β_{0}, the associated cost c(β_{t}) is equal to zero. In addition, note that this function c satisfies the Assumption 3.1.
The set of parameters used in the experiments are provided in Table 1. The associated reproduction number without isolation measure, commonly defined by in the literature on epidemic models, is equal to 2.0 in our framework, and is thus in the confidence interval of available data [42]. The parameter γ corresponds to the inverse of the virus contagious period see [7]. We assume that the initial proportion of infected I_{0} in the overall population is 1% at time 0, when the contact rate optimization starts. We set the cost r_{I} incurred by an infected individual to r_{I} = 300. We recall that this cost is not necessarily expressed in terms of money, but can also be medical side effects or general morbidity (see [5, 51, 59] for an introduction on QALY/DALY) and is relative to the definition of c(⋅).
Finally, recall that there is considerable uncertainty in the medical literature on the choice of all parameters described above, so the sensibility to the values chosen has been tested in Section 4.5.
Set of parameters for the numerical experiments.
4.2 Mean Field Nash equilibrium
The numerical approximation of the Mean Field Nash equilibrium is obtained using the algorithm described in Section 3.2. The initial guess β^{0} is taken constant β^{0}(t) = β_{0}, ∀t ≥ 0.^{4}
We plot in Figure 2 the convergence towards the minimal cost; the decay of the cost is very fast in terms of the learning time h. The contact rate at equilibrium is represented in the right plot, while the empirical convergence of the algorithm is confirmed by its superposition with the approximate contact rate computed at the previous step. At this numerical Mean Field Nash equilibrium, the cost function ℭ(β^{⋆}, β^{⋆}) is 211.33 and provides for each individual a relative gain of 12% in comparison to the zero effort strategy β_{0}. In addition, only 85% of this cost is explained in terms of the wealth impact (instead of 100% for the strategy β_{0}), with the remaining 15% being related to the cost of social effort; the probability of being infected over the interval [0, T] decreases from 80% in the 0effort benchmark scenario, to 62%, see Figure 3. Note that in this case, the minimum attainable infection probability is 50% (as calculated from Lem. A.1 in [35] or from Eq. (2.1)).
The equilibrium contact rate β^{⋆} is characterized by three major phases, in response to the Mean Field Nash equilibrium epidemic dynamics presented in Figure 3. First, at the beginning of the epidemic, the number I of infected people is relatively low. Individuals make therefore no effort in order to reduce their social interactions and the virus is transmitted at the normal rate β_{0}. This leads to a large augmentation in the proportion of Infected, implying a significant increase of the individual’s probability of contracting the virus. In response to this, individuals begin to reduce significantly their social interactions, implying a strong reduction of the transmission rate of the disease. Finally, after the epidemic peak, all individuals slowly reduce their effort until the number of infected people is relatively close to 0.
Figure 3 provides a comparison between the SIR dynamics of the Mean Field Nash equilibrium (solid lines), and the one generated by the noeffort strategy β_{0} (dashed lines). Even if driven by selfinterest, individual efforts do reduce social interactions populationwise. This can be explained as follows. First, we observe that the proportion R_{T} of recovered at terminal date drops from 80% to 60% when the population is applying the Mean Field Nash equilibrium strategy. This means that the proportion of the population spared by the virus goes from 20% in the case without effort to 40% at equilibrium. Second, one can observe that the infection peak occurring around t = 50 days is twice less critical at equilibrium, but, as a counterpart, the epidemic lasts longer as the number of infected decreases more slowly after the epidemic peak. As the infection peak is less critical, it limits, and may even prevent, the saturation of the healthcare facilities. Although not represented in our model, this necessarily implies a decrease in the mortality rate of the virus.
Figure 2 Convergence of the algorithm to the Mean Field Nash equilibrium of the model (2.1), using the parameters described in Table 1. Left: global cost ℭ in terms of the learning time. Right: Mean Field Nash equilibrium contact rate β^{*} (magenta line), identical to the contact rate at the penultimate step of the algorithm (green dashed line) and compared to β_{0} (red line). 
Figure 3 Evolution (in proportion) of Susceptible, Infected and Recover classes, using the parameters described in Table 1. Solid lines represent the evolution at the Mean Field Nash equilibrium, i.e., with transmission rate β^{*}. The evolution at equilibrium is compared to the epidemic dynamics with constant transmission rate β_{0} (dashed lines). 
4.3 Impulse control equilibrium
We now focus on the situation where the set of admissible strategies is restricted to a subset of , denoted by , of particularpiecewise constant strategies: (4.2)
where we took as described in Table 1. This framework encompasses the realistic situation where the instantaneous contact rate of each individual can not be, in practice, chosen within the whole set and has to be restricted to a unique control period. The representative individual will optimally select the times t_{1} and t_{2}, representing respectively the beginning and the end of his/her lockdown period.
Figure 4 compared the Mean Field Nash equilibria obtained over both sets and . The equilibrium strategy over starts isolation measures at time t_{1} = 20 by decreasing the contact rate from β_{0} = 0.20 to . The duration of the lockdown is 81 days, after which the contact rate is immediately returning to normal, i.e., β_{0}. The cost induced for each individual is around 210.78, which is lower than the one around 211.33 associated to the equilibrium over . Moreover, we observe that the induced SIR dynamics provides a lower epidemic size R_{T} together with a lower proportion of infected at the epidemic peak, hereby reducing the potential mortality rate induced by the virus. This observation enlightens as well how the Mean Field Nash equilibrium β^{⋆} in is not optimal for the society as a whole, as it can be improved by restricting the set of admissible strategies to . This observation leads us to look towards the optimal societal contact rate for the population as a whole.
Figure 4 Comparison between Mean Field Nash equilibria in (plain lines) and (dashed line), using the parameters described in Table 1. 
4.4 Cost of anarchy
In our previous equilibrium analysis, each individual is considered to be too small in order to impact the epidemic dynamics of the society and can hence acts in a selfish manner: each individual minimizes his own cost in response to the transmissionrate of the epidemic. On the other hand, a global planner, e.g., a government with full empowerment, will optimize the global cost of the entire society with respect to the choice of the transmission rate in the society. Namely, the global planner will solve: (4.3)
This leads to a different optimization problem, which is well documented in the literature, see for example [11, 20, 29, 52], and even more largely on the topic of vaccination (see, e.g., [2, 35, 43, 46]).
In our framework, we can compute, through classical optimization procedures (e.g., the Pontryagin principle), the optimal control of the transmission rate from the society point of view. We refer to the literature previously cited for the details on the techniques allowing to find this optimal control and we only present here the numerical results. Note that in such context there are several additional, classical, procedures available to compute the optimal control (see for instance theforwardbackward sweep method in Chap. 4 from [41]). Nevertheless, in our framework, a slight modification of the previously implemented gradient descent detailed in Section 3.2 works just fine.
First, we observe in Figure 5 that the societal optimal transmission rate imposes larger effort at the beginning of the control period, and relieves these constraints much more slowly. The total control duration period for the societal optimum is 151 days in comparison to 96 days for theMean Field Nash equilibrium, although societal control begins later than in the case of the Nash. Secondly, observe that the optimal transmission rate chosen by the global planner accentuates the already encouraging results obtained with the Mean Field Nash equilibrium on the epidemic dynamics: the epidemic size R_{T} represents only 55% of the total population.
The societal optimum allows to reach an individual cost around 200.25, while the Mean Field Nash equilibrium provides a cost valued around 211.33. This phenomenon allows to mathematically quantify the socalled “cost of anarchy", induced when letting each individual decide on his/her own, instead of letting a global planner take decisions for the population as a whole. Of course, the societal optimal strategy is not a Mean Field Nash equilibrium: given this optimal transition rate for the society, each individual is tempted to make less effort in reducing his/her own contact rate which will drive away the global rate towards the Mean Field equilibrium.
We also compare in Figure 6 the societal optimal transmission rate in to the optimal one in , where only one strong effort period is allowed. The societalwide optimum control period in starts at time t = 23 and lasts 111 days. This induced cost is around 2% higher than the one induced by the optimal social diffusion rate. This may allow to quantify the importance of going through a progressive lockout strategy.
Finally, Figure 7 provides a closer look on the four aforementioned strategies, focusing on the time interval [10, 150], in order to highlight the control (lockdown) and controlless (lockout) properties. One can observe that for both Mean Field Nash equilibrium strategies, the control period starts and ends earlier in comparison to the societal optimum situation. Indeed, on one hand, individuals engage in preventive measures by decreasing their interactions earlier than a global planner would recommend, due to fear of the infection spreading. On the other hand, they release their efforts just after the peak of infection, whereas a global planner would recommend maintaining a relatively high level of effort in order to avoid further spreading of the virus. Therefore, while the Nash equilibrium of individuals allows, through premature efforts, to decrease the peak of infection, the socially optimal strategy allows a more rapid decrease in the proportion of infected after the peak, by maintaining intense efforts. Nevertheless, we shall remind that the global planner should also take into account the possible saturation of the healthcare system, leading to a lower epidemic peak induced by the optimal societal transmission rate.
Figure 5 Comparison between two strategies: the Mean Field Nash equilibrium contact rate (solid lines) and the optimal transmission rate for the society (dashed line), using the parameters described in Table 1. 
Figure 6 Comparison between two strategies: the societal optimum contact rate in (solid lines) and the societal optimum contact rate in , using the parameters described in Table 1. 
Figure 7 Comparison between the transmission rate and the dynamic proportion of infected on the time interval [10, 150], for the four aforementioned strategies: the Mean Field Nash equilibrium in (solid magenta lines) and (dashed red lines), the societal optimum in (dotted greenlines) and (dashdot blue lines), using the parameters described in Table 1. 
4.5 Sensitivity to parameters
The parameters used in the previous numerical experiments are provided in Table 1 above. Nevertheless, as the current medical literature still shows considerable uncertainty concerning the numerical values for those parameters, we tested the sensitivity of our findings with respect to the choice of the main parameters of the model. The corresponding figures are provided in Appendix A for better readability.
 (i)
Figure A.1 provides the Nash equilibrium and the epidemic dynamics for three different values for r_{I}, describing the relative effects of the two parts of the costs. As expected, the more costly is the infection for an individual, the more efforts he/she will do in order to limit his social interactions, in order to decrease his/her probability of infection. In particular, reducing the sanitary cost r_{I} from 350 to 250 implies a20% decrease on the level of the epidemic peak.
 (ii)
Figure A.3 provides a similar study for three different values of the reproducing number . For higher , individuals make more effort in order to reduce their social interactions. Bringing from 2.5 to 1.8 reduces the epidemic peak by 40% together with reducing the size R_{T} of the epidemic from 70% to around 60%. This result is perfectly understandable since the higher is, the higher the probability of being infected without effort is. Each individual limits his social interaction in order to decrease the probability of being infected, and thus diminishes the wealth impact of the epidemic.
 (iii)
Finally, Figure A.2 studies the sensibility of our findings with respect to the initial proportion I_{0} of infected at time 0. A higher I_{0} induces an earlier beginning of the control period, together with a stronger efficacy of it. At terminal date, the total proportion of susceptible remains similar. This implies that the long term effects of a late detection of the epidemic can be compensated by a stronger isolation equilibrium policy. Once again, we omit here the possible negative outcomes induced by the saturation of the health care system.
5 The SEIR model and application to COVID19
It should be noted that the COVID19 disease is characterized by a relatively long latency phase (as well as many other complex dynamics). To account for this latent phase, where infected individuals are not yet contagious, we can extend our reasoning to an SEIR model, where the class E represents the individuals infected but not yet infectious. The dynamics of the SEIR model are described by the following system: (5.1)
where still represents the societal transmission rate, and α > 0 is a parameter specific to the SEIR model, representing the rate at which an exposed person becomes infectious. The average incubation period is therefore given by 1∕α.
The computations provided in Section 6 still hold for the SEIR model. Therefore, the application of the numerical scheme described in Section 3.2 is straightforward, and only the numerical results are presented here. The parameters used for the numerical simulations are those provided in [8, 21], and are described in Table 2. Other parameters given in Table 1 remain unchanged.
The numerical results are provided in Figure 8 and have the same features as for the SIR model. More precisely, the proportion of the population spared by the virus goes from less than 20% in the case without effort to 40% at equilibrium. Moreover, the infection peak occurring around t = 50 days is three times less critical at equilibrium. This result would limit, and may even prevent, the saturation of the healthcare facilities, and thus implies a decrease in the mortality rate of the virus. However, as a counterpart, the epidemic lasts longer as the proportion of exposed and infected individuals decreases more slowly after the epidemic peak. Finally, and similarly as for the SIR model, the cost of anarchy exists. In particular, the societal optimal transmission rate requires a greater and longerlasting effort, even if it starts a little later. The optimal transmission rate at the societal level thus improves the rate of recovery from 60% to 55%. However, since the effort begins later, the infection peak is higher than for the Nash equilibrium.
Set of parameters for the SEIR model.
Figure 8 Transmission rate and induced evolution of Susceptible, Exposed, Infected and Recover classes, using the parameters described in Table 2. Solid lines represent the transmission rate and the epidemic evolution at the Mean Field Nash equilibrium, while dashed lines model the societal equilibrium. These two equilibria are compared to the epidemic dynamics with constant transmission rate β_{0} (dotted lines). 
6 Mathematical details
In this section, we detail the computations and proofs when the epidemic is modeled by a SIR. Nevertheless, these computations still hold if we consider instead an SEIR model, as mentioned in Section 5. More precisely, since the individual is considered infected as soon as he/she enters class E, the probability of being infected in the considered time interval [0, T] remains unchanged. In particular, the cost satisfies the same formula (6.8) as for the SIR model, as well as the gradient formula (6.11).
6.1 Probability of infection
We the introduce in order to denote, at time t ≥ 0, the solution of the system (2.1) with transmission rate . For all t ∈ [0, +∞), we denote by the probability that infection occurs before time t for an individual choosing his/her own contact rate β, while the epidemic evolves according the population’s transmission rate . Note that is redundant with the notation P^{β} but emphasizes the dependence in of the distribution function of the random infection time τ.
Lemma 6.1.
The probability of being infected before time t ≥ 0, for an individual choosing a contact rate , and when the proportion of infected is , is equal to:
To compute this probability, we follow [36, 37]. The Markov chain of an individual, who chooses a contact rate with infected individuals, and whose state at time t ∈ [0, T] is denoted by M_{t}, is described in terms of the following passage probabilities: (6.2)
The probability of being infected before time t + Δt can be written as follows: (6.3)
and replacing in (6.3), we obtain: (6.5)
Using the fact that , and letting Δt → 0, the previous equation naturally implies the formula (6.1).
6.2 Computation of the cost
Recall that given a finite time horizon T, the expected cost of an individual is defined by (2.5). In other words, we have: (6.6)
Since the cumulative distribution function of the random variable τ at time t corresponds to the individual’s probability of being infected before time t, which is denoted by , we obtain: (6.7)
Moreover, using equation (6.1), we can also write: (6.8)
6.3 Gradient of the cost
In order to obtain the gradient of this cost, we have to compute the Gateau derivative D_{h} ℭ of ℭ with respect to the first variable β in the direction h. Using equation (6.7), we obtain: (6.9)
Some preliminary computations of the Gateaux derivatives allow to write:
Replacing in (6.9), we therefore obtain the following relation: (6.10)
Equation (6.11) is used for numerical simulation, in particular in the equilibrium flow descent described in Section 3.2.
6.4 Proof of the existence of an equilibrium
First we recall that equation (2.1) has a unique solution for any (see for instance [12, 18, 53]). In particularone can prove that if is a sequence of functions in converging in L^{1} (thus also in L^{2}) to some then the corresponding solution of (2.1) converges (for any given t) to . Moreover for any the solution is a Lipschitz function of time with Lipschitz constant L_{S} ≤ β_{0} + γ.
Before turning to the proof of the existence of an equilibrium, we start with the proof of Lemma 3.2.
Proof.
To this end, we consider that is fixed. Define the value function at time t (compare with formula (6.8)): (6.13)
This corresponds to the optimal cost of an individual starting at time t in the susceptible class. By invoking standard arguments (see [9]) one can show that Π_{t} is the (unique) solution of the following HamiltonJacobiBellman equation: (6.14)
Under Assumption 3.1, the minimization in (6.15) is straightforward to analyze (and in general is related to the Fenchel transform of c(⋅)). Moreover the value y^{*} realizing the minimum is unique and the following mapping, (6.16)
is Lipschitz in both arguments with a constant L_{H} valid for all x ∈ [0, r_{I}] and . Since is Lipschitz (in particular continuous) we obtain that is continuous with respect to time and thus Π_{t} is a classical solution of (6.15). It is a Lipschitz function of time with Lipschitz constant L_{Π} ≤ c(β_{0}) + r_{I}β_{0}.
We now prove Theorem 3.3.
Proof.
The proof consists of applying Schauder Theorem (see for example Thm. 1.C from [60] or Thm. 18.20 from [47]) to the mapping, in order to prove that it has a fixed point. To this end, we define a subset consisting of Lipschitz functions with constant L_{H}(L_{Π} + β_{0} + γ). Obviously is a compact set in L^{2}(0, T). Previous considerations show that for any , the corresponding optimal individual choice is in . Moreover is a compact subset of L^{2}(0, T). For any sequence in converging in L^{2} to some we have thaton the one hand for any t. Moreover the corresponding value functions (which depend on through ) will also converge pointwise. This means that finally the optimal values will convergeto the corresponding limits . By the dominated convergence theorem we obtain the L^{2} convergence of to which ends the proof, by application of the Schauder fixed point theorem.
7 Conclusion and closing remarks
In this work, we consider, within both a SIR and SEIR models, the question of the COVID19 control measures as seen from the point of view of the individuals. We assume that each individual can choose to decrease his/her social interactions in order to slow down the spread of the virus. This means that the transmission rate, usually constant and exogenous in standard SIR models, is endogenous and timedependent. Individuals can choose to lower their contact rate, which will allow them to minimize the likeliness of being infected, but this comes at a cost. The impact on the overall epidemic unfold from a single individual is negligible but the aggregating behavior of all individuals determine the epidemic evolution. This is formalized through a Mean Field approach.
We prove the existence of a Mean Field Nash equilibrium, i.e., a contact rate such that no individual has an interest in choosing a contact rate different from the overall societal contact rate. We then perform numerical simulations in order to find this equilibrium, which seems to be unique for the tested cases. The transmission rate of the disease induced by the equilibrium allows a clear improvement in the evolution of the epidemic, compared to the evolution with the initial transmission rate β_{0}, which corresponds to the transmission rate of the disease without any effort of the population. In particular, despite the selfishness of individuals (who only seek to minimize their own cost), their efforts make possible to reduce the number of people affected by the disease by 25%. In addition, the infection peak is less critical, which limits, and may even prevent, the saturation of the health care system and a corresponding decrease in the mortality rate of the virus.
However, the Nash equilibrium is not the best that can be achieved if compared with a situation where individuals are 100% altruistic and only see the good of the society as a whole. To quantify this difference, we compute the societalwide optimum to compare thetwo strategies. We observe that the divergence between the two strategies arrives both before and after the peak of the epidemic. More precisely, the Nash equilibrium allows, through premature efforts, to decrease the peak of infected, but the centralized epidemic control implies a more rapid decrease in the proportion infected after the peak, by maintaining intense efforts. As a consequence, there is a cost of anarchy, meaning that the Nash equilibrium induces a higher cost than the societal optimum. While stopping early, at a point where the epidemic decreases but is not yet over, may be intuitively explained by the selfish nature of individuals, the fact that they decide collectively to begin efforts early than the societal optimum is a more intriguing feature. The latter fact is consistent with the experimental results documented in the remarkable work [27].
Of course, any model is but an imperfect description of the reality and there are some limitations of our model too. First of all, as there is no consensus in the medical literature on the parameters of the epidemic, this impacts our model too. Indeed, the recent literature onthe COVID19 epidemic is abundant but discordant on the dynamics of the epidemic, in particular on the parameters γ and . Moreover, the appropriate estimation of the cost of effort, denoted by the function c, and its counterpart r_{I} are still object of research, as they are not necessarily monetary or economic costs, but often costs related to health and social interactions. The modeling of these costs needs to be related to the concept of QALY/DALY, see [5, 51, 59].
To account for the long latency phase of the disease induced by the COVID19, we extend our reasoning from a SIR model to a SEIR. This choice can also be discussed since the disease induced by the COVID19 has many other complex dynamics (for e.g. large number of asymptomatic carriers, see [19, 44, 55] for alternatives already used in coronavirus epidemics). Our model could also be extended to take into account other ways to control an epidemic, such as vaccination for example, even if no vaccine is known to date. Moreover we assume that all individuals are rational, identical, and that they have a perfect knowledge of the epidemic dynamics. The model could also be extended to a add some heterogeneity between individuals, since the epidemic does not affect individuals in the same way, and is more costly for those at risk. All these remain for future work.
The study of the cost of anarchy raises the question of incentives: what levers can healthcare authorities use to bring the Nash equilibrium closer to the societal optimal? Indeed, many countries have introduced fines, or even prison sentences, in the event of failure to comply with lockdown. Our model can give details in this direction, and invites a finer assessment of the costs related to the epidemic, but also to the lockdown. A model with centralized control also opens up the question of modeling the interaction (or not) between countries with different levels of epidemic dynamics.
Acknowledgements
The authors thank the MODCOV19 platform for support.
Appendix A Additional numerical results
The figures listed and described in Section 4.5 are grouped together in this appendix. The parameters are those described in Table 1, except when other values are explicitly mentioned.
Figure A.1 Comparison of the Nash equilibrium contact rate and the evolution of Susceptible, Infected and Recover classes with three different values for r_{I}. Solid lines: r_{I} = 300. Dotted line: r_{I} = 250. Dashed lines: r_{I} = 350. 
Figure A.2 Comparison of the Nash equilibrium contact rate and the evolution of Susceptible, Infected and Recover classes with three different values for I_{0}. Solid lines: I_{0} = 0.01. Dotted line: I_{0} = 0.005. Dashed lines: I_{0} = 0.05. 
Figure A.3 Comparison of the Nash equilibrium contact rate and the evolution of Susceptible, Infected and Recover classes with three different values for . Dashed lines: . Solid lines: . Dotted line: . 
References
 A. Abakuks, An optimal isolation policy for an epidemic. J. Appl. Prob. 10 (1973) 247–262. [Google Scholar]
 A. Abakuks, Optimal immunisation policies for epidemics. Adv. Appl. Prob. 6 (1974) 494–511. [Google Scholar]
 Y. Achdou and M. Laurière, Mean field type control with congestion. Appl. Math. Optim. 73 (2016) 393–418. [Google Scholar]
 L. Ambrosio, N. Gigli and G. Savaré, Gradient flows in metric spaces and in the space of probability measures, 2nd edn. Birkhäuser, Basel (2008). [Google Scholar]
 S. Anand and K. Hanson, Disabilityadjusted life years: a critical review. J. Health Eco. 16 (1997) 685–702. [CrossRef] [Google Scholar]
 R.M. Anderson and R.M. May, Infectious Diseases of Humans Dynamics and Control. Oxford University Press, Oxford (1992). [Google Scholar]
 R.M. Anderson, T.D. Hollingsworth and D.J. Nokes, Mathematical models of transmission and control, Vol. 2. Oxford University Press, Oxford (2009). [Google Scholar]
 N. Bacaer, Un modéle mathématique des débuts de l’épidémie de coronavirus en France. MMNP 15 (2020) 29. [EDP Sciences] [Google Scholar]
 M. Bardi and I. CapuzzoDolcetta, Optimal control and viscosity solutions of HamiltonJacobiBellman equations. Systems & Control: Foundations & Applications. Birkhäuser Boston Inc., Boston, MA (1997). [Google Scholar]
 C.T. Bauch and D.J.D. Earn, Vaccination and the theory of games. Proc. Natl. Acad. Sci. USA 101 (2004) 13391–13394. [CrossRef] [MathSciNet] [Google Scholar]
 H. Behncke, Optimal control of deterministic epidemics. Optim. Control Appl. Methods 21 (2000) 269–285. [Google Scholar]
 A. Bressan and F. Rampazzo, Impulsive control systems with commutative vector fields. J. Optim. Theory Appl. 71 (1991) 67–83. [Google Scholar]
 B. Buonomo, A. d’Onofrio and D. Lacitignola. Global stability of an SIR epidemic model with information dependent vaccination. Math. Biosci. 216 (2008) 9–16. [Google Scholar]
 V. Capasso, Mathematical Structures of Epidemic Systems. Lecture Notes in Biomathematics. SpringerVerlag, Berlin (1993). [CrossRef] [Google Scholar]
 V. Capasso and G. Serio, A generalization of the KermackMcKendrick deterministic epidemic model. Math. Biosci. 42 (1978) 43–61. [Google Scholar]
 R. Carmona, J.P. Fouque and L.H. Sun, Mean field games and systemic risk. Commun. Math. Sci. 13 (2015) 911–933. [Google Scholar]
 R. Carmona, F. Delarue, et al. Probabilistic Theory of Mean Field Games with Applications III. Springer, Berlin (2018). [Google Scholar]
 G. Dal Maso and F. Rampazzo, On systems of ordinary differential equations with measures as controls. Differ. Integral Equ. 4 (1991) 739–765. [Google Scholar]
 A. Danchin, T.W.P. Ng and G. Turinici, A new transmission route for the propagation of the SARSCoV2 coronavirus. Preprint medRxiv: 20022939v1 (2020). [Google Scholar]
 R. DjidjouDemasse, Y. Michalakis, M. Choisy, M.T. Sofonea and S. Alizon, Optimal COVID19 epidemic control until vaccine deployment. Preprint medRxiv: 20049189v3 (2020). [Google Scholar]
 J. Dolbeault and G. Turinici, Heterogeneous social interactions and the COVID19 lockdown outcome in a multigroup SEIR model. Preprint arXiv:2005.00049 (2020). [Google Scholar]
 A. d’Onofrio, P. Manfredi and E. Salinelli, Vaccinating behaviour, information, and the dynamics of SIR vaccine preventable diseases. Theor. Popul. Biol. 71 (2007) 301–317. [PubMed] [Google Scholar]
 A. d’Onofrio, P. Manfredi and E. Salinelli. Fatal SIR diseases and rational exemption to vaccination. Math. Med. Biol. 25 (2008) 337–357. [Google Scholar]
 R. Elie, E. Hubert, T. Mastrolia and D. Possamaï, Meanfield moral hazard for optimal energy demand response management. Preprint arXiv:1902.10405 (2019). [Google Scholar]
 R. Élie, T. Ichiba and M. Laurière, Large banking systems with default and recovery: A mean field game model. Preprint arXiv:2001.10206 (2020). [Google Scholar]
 E.P. Fenichel, C. CastilloChavez, M.G. Ceddia, G. Chowell, P.A.G. Parra, G.J. Hickling, G. Holloway, R. Horan, B. Morin, C. Perrings, M. Springbornh, L. Velazqueze and C. Villalobos, Adaptive human behavior in epidemiological models. Proc. Natl. Acad. Sci. 108 (2011) 6306–6311. [CrossRef] [Google Scholar]
 S. Ghader, J. Zhao, M. Lee, W. Zhou, G. Zhao, L. Zhang, Observed mobility behavior data reveal social distancing inertia. Preprint arXiv:2004.14748 (2020). [Google Scholar]
 O. Guéant, J.M. Lasry and P.L. Lions, Mean field games and applications, in ParisPrinceton lectures on mathematical finance 2010. Springer, Berlin (2011) 205–266. [CrossRef] [Google Scholar]
 E. Hansen and T. Day, Optimal control of epidemics with limited resources. J. Math. Biol. 62 (2011) 423–451. [Google Scholar]
 M. Huang, R. Malhamé and P. Caines, Large population stochastic dynamic games: Closed–loop McKean–Vlasov systems and the Nash certainty equivalence principle. Commun. Inf. Syst. 6 (2006) 221–252. [Google Scholar]
 M. Huang, P. Caines and R. Malhamé, An invariance principle in large population stochastic dynamic games. J. Syst. Sci. Complex. 20 (2007) 162–172. [Google Scholar]
 M. Huang, P. Caines and R. Malhamé, Large–population cost–coupled LQG problems with nonuniform agents: Individual–mass behavior and decentralized ε–Nash equilibria. IEEE Trans. Auto. Control 52 (2007) 1560–1571. [Google Scholar]
 M. Huang, P. Caines and R. Malhamé, The Nash certainty equivalence principle and McKean–Vlasov systems: An invariance principle and entry adaptation, in 46th IEEE Conference on Decision and Control (2007) 121–126. [Google Scholar]
 E. Hubert and G. Turinici, NashMFG equilibrium in a SIR model with time dependent newborn vaccination. Ric. Mat. 67 (2018) 227–246. [CrossRef] [Google Scholar]
 L. Laguzet and G. Turinici, Global optimal vaccination in the SIR model: properties of the value function and application to costeffectiveness analysis. Math. Biosci. 263 (2015) 180–197. [Google Scholar]
 L. Laguzet and G. Turinici, Individual Vaccination as Nash Equilibrium in a SIR Model with Application to the 20092010 Influenza A (H1N1) Epidemic in France. Bull. Math. Biol. 77 (2015) 1955–1984. [Google Scholar]
 L. Laguzet, G. Turinici and G. Yahiaoui, Equilibrium in an individualsocietal SIR vaccination model in presence of discounting and finite vaccination capacity, in New Trends in Differential Equations, Control Theory and Optimization: Proceedings of the 8th Congress of Romanian Mathematicians (2016) 201–214. [CrossRef] [Google Scholar]
 J.M. Lasry and P.L. Lions, Jeux à champ moyen. I–Le cas stationnaire. C R Math. 343 (2006) 619–625. [Google Scholar]
 J.M. Lasry and P.L. Lions, Jeux à champ moyen. II–Horizon fini et contrôle optimal. C R Math. 343 (2006) 679–684. [Google Scholar]
 J.M. Lasry and P.L. Lions, Mean field games. Jpn. J. Math. 2 (2007) 229–260. [CrossRef] [MathSciNet] [Google Scholar]
 S. Lenhart and J.T. Workman, Optimal control applied to biological models. CRC Press, Boca Raton (2007). [CrossRef] [Google Scholar]
 Q. Li, X. Guan, P. Wu, X. Wang, L. Zhou, Y. Tong, R. Ren, K.S.M. Leung, E.H.Y. Lau, J.Y. Wong, X. Xing, N. Xiang, Y. Wu, C. Li, Q. Chen, D. Li, T. Liu, J. Zhao, M. Liu, W. Tu, C. Chen, L. Jin, R. Yang, Q. Wang, S. Zhou, R. Wang, H. Liu, Y. Luo, Y. Liu, G. Shao, H. Li, Z. Tao, Y. Yang, Z. Deng, B. Liu, Z. Ma, Y. Zhang, G. Shi, T.T.Y. Lam, J.T. Wu, G.F. Gao, B.J. Cowling, B. Yang, G.M. Leung and Z. Feng, Early Transmission Dynamics in Wuhan, China, of Novel CoronavirusInfected Pneumonia. New Engl. J. Med. 382 (2020) 1199–1207. [Google Scholar]
 R. Morton and K.H. Wickwire, On the optimal control of a deterministic epidemic. Adv. Appl. Prob. 6 (1974) 622–635. [CrossRef] [Google Scholar]
 T. Ng, G. Turinici and A. Danchin, A double epidemic model for the SARS propagation. BMC Infect. Dis. 3 (2003) 19. [Google Scholar]
 A. Perasso, An introduction to the basic reproduction number in mathematical epidemiology. ESAIM: Proc. Surv. 62 (2018) 123–138. [CrossRef] [Google Scholar]
 A.B. Piunovskiy and D. Clancy, An explicit optimal intervention policy for a deterministic epidemic model. Optim. Control Appl. Methods 29 (2008) 413–428. [Google Scholar]
 A.S. Poznyak, Topics of functional analysis, in Advanced Mathematical Tools for Automatic Control Engineers: Deterministic Techniques, edited by A.S. Poznyak. Elsevier, Oxford (2008) 451–498. [CrossRef] [Google Scholar]
 A. Rizzo, M. Frasca, M. Porfiri, Effect of individual behavior on epidemic spreading in activitydriven networks. Phys. Rev. E 90 (2014) 042801. [Google Scholar]
 F.D. Sahneh, F.N. Chowdhury and C.M. Scoglio, On the existence of a threshold for preventive behavioral responses to suppress epidemic spreading. Sci. Rep. 2 (2012) 632. [CrossRef] [PubMed] [Google Scholar]
 F. Salvarani and G. Turinici, Optimal individual strategies for influenza vaccines with imperfect efficacy and durability of protection. Math. Biosci. Eng. 15 (2018) 629. [CrossRef] [PubMed] [Google Scholar]
 F. Sassi, Calculating QALYs, comparing QALY and DALY calculations. Health Policy Plan. 21 (2006) 402–408. [CrossRef] [PubMed] [Google Scholar]
 S.P. Sethi and P.W. Staats, Optimal control of some simple deterministic epidemic models. Oper. Res. Soc. J. 29 (1978) 129–136. [CrossRef] [Google Scholar]
 G. Silva and R. Vinter, Necessary conditions for optimal impulsive control problems. SIAM J. Control Optim. 35 (1997) 1829–1846. [Google Scholar]
 G. Turinici, Metric gradient flows with state dependent functionals: The NashMFG equilibrium flows and their numerical schemes. Nonlinear Anal. 165 (2017) 163–181. [CrossRef] [Google Scholar]
 G. Turinici and A. Danchin, The SARS Case Study: An Alarm Clock? in Encyclopedia of Infectious Diseases. John Wiley & Sons, Ltd, New Jersey (2006) 151–162. [Google Scholar]
 V. Volpert, M. Banerjee, A. d’Onofrio, T. Lipniacki, S. Petrovskii and V.C. Tran, Coronavirus: scientific insights and societal aspects. MMNP 15 (2020) E2. [EDP Sciences] [Google Scholar]
 Z. Wang, C.T. Bauch, S. Bhattacharyya, A. d’Onofrio, P. Manfredi, M. Perc, N. Perra, M. Salathé and D. Zhao, Statistical physics of vaccination. Phys. Rep. 664 (2016) 1–113. [Google Scholar]
 K. Wickwire, Optimal isolation policies for deterministic and stochastic epidemics. Math. Biosci. 26 (1975) 325–346. [Google Scholar]
 R. Zeckhauser and D. Shepard, Where now for saving lives. Law Contemp. Probl. 40 (1976) 5. [Google Scholar]
 E. Zeidler, Applied Functional Analysis: Applications to Mathematical Physics. Applied Mathematical Sciences. Springer, New York (2012). [Google Scholar]
The SIR model can be extended to include births and deaths. Nevertheless, given that the duration of the COVID19 pandemic is assumed to, hopefully, be relatively short with respect to the life expectancy at birth in most of the concerned countries, the demographic dynamics is not relevant and will not be taken into account.
Considering a finite horizon time also allows some technical simplifications of the mathematical analysis, mainly for the proof of the existence of a Nash equilibrium, without sacrificing qualitative conclusions. Nevertheless, the results can be extended to an infinite horizon, with relevant technical adaptations. Moreover, since the numerical simulations can only tackle finite times, this choice has the advantage to use an unified framework for both contributions.
Supplementary Material
Supplementary file supplied by authors. Access here
All Tables
All Figures
Figure 1 Graphical illustration of the SIR model. 

In the text 
Figure 2 Convergence of the algorithm to the Mean Field Nash equilibrium of the model (2.1), using the parameters described in Table 1. Left: global cost ℭ in terms of the learning time. Right: Mean Field Nash equilibrium contact rate β^{*} (magenta line), identical to the contact rate at the penultimate step of the algorithm (green dashed line) and compared to β_{0} (red line). 

In the text 
Figure 3 Evolution (in proportion) of Susceptible, Infected and Recover classes, using the parameters described in Table 1. Solid lines represent the evolution at the Mean Field Nash equilibrium, i.e., with transmission rate β^{*}. The evolution at equilibrium is compared to the epidemic dynamics with constant transmission rate β_{0} (dashed lines). 

In the text 
Figure 4 Comparison between Mean Field Nash equilibria in (plain lines) and (dashed line), using the parameters described in Table 1. 

In the text 
Figure 5 Comparison between two strategies: the Mean Field Nash equilibrium contact rate (solid lines) and the optimal transmission rate for the society (dashed line), using the parameters described in Table 1. 

In the text 
Figure 6 Comparison between two strategies: the societal optimum contact rate in (solid lines) and the societal optimum contact rate in , using the parameters described in Table 1. 

In the text 
Figure 7 Comparison between the transmission rate and the dynamic proportion of infected on the time interval [10, 150], for the four aforementioned strategies: the Mean Field Nash equilibrium in (solid magenta lines) and (dashed red lines), the societal optimum in (dotted greenlines) and (dashdot blue lines), using the parameters described in Table 1. 

In the text 
Figure 8 Transmission rate and induced evolution of Susceptible, Exposed, Infected and Recover classes, using the parameters described in Table 2. Solid lines represent the transmission rate and the epidemic evolution at the Mean Field Nash equilibrium, while dashed lines model the societal equilibrium. These two equilibria are compared to the epidemic dynamics with constant transmission rate β_{0} (dotted lines). 

In the text 
Figure A.1 Comparison of the Nash equilibrium contact rate and the evolution of Susceptible, Infected and Recover classes with three different values for r_{I}. Solid lines: r_{I} = 300. Dotted line: r_{I} = 250. Dashed lines: r_{I} = 350. 

In the text 
Figure A.2 Comparison of the Nash equilibrium contact rate and the evolution of Susceptible, Infected and Recover classes with three different values for I_{0}. Solid lines: I_{0} = 0.01. Dotted line: I_{0} = 0.005. Dashed lines: I_{0} = 0.05. 

In the text 
Figure A.3 Comparison of the Nash equilibrium contact rate and the evolution of Susceptible, Infected and Recover classes with three different values for . Dashed lines: . Solid lines: . Dotted line: . 

In the text 
Current usage metrics show cumulative count of Article Views (fulltext article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 4896 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.