Issue 
Math. Model. Nat. Phenom.
Volume 15, 2020
Coronavirus: Scientific insights and societal aspects



Article Number  36  
Number of page(s)  18  
DOI  https://doi.org/10.1051/mmnp/2020025  
Published online  14 July 2020 
Heterogeneous social interactions and the COVID19 lockdown outcome in a multigroup SEIR model
CEREMADE (CNRS UMR n° 7534), PSL university, Université ParisDauphine, Place de Lattre de Tassigny,
75775
Paris 16,
France.
^{*} Corresponding author: dolbeaul@ceremade.dauphine.fr
Received:
30
April
2020
Accepted:
28
June
2020
We study variants of the SEIR model for interpreting some qualitative features of the statistics of the Covid19 epidemic in France. Standard SEIR models distinguish essentially two regimes: either the disease is controlled and the number of infected people rapidly decreases, or the disease spreads and contaminates a significant fraction of the population until herd immunity is achieved. After lockdown, at first sight it seems that social distancing is not enough to control the outbreak. We discuss here a possible explanation, namely that the lockdown is creating social heterogeneity: even if a large majority of the population complies with the lockdown rules, a small fraction of the population still has to maintain a normal or high level of social interactions, such as health workers, providers of essential services, etc. This results in an apparent high level of epidemic propagation as measured through reestimations of the basic reproduction ratio. However, these measures are limited to averages, while variance inside the population plays an essential role on the peak and the size of the epidemic outbreak and tends to lower these two indicators. We provide theoretical and numerical results to sustain such a view.
Mathematics Subject Classification: 92C60 / 92D30 / 34C60
Key words: Epidemic models / disease control / heterogeneous populations / basic reproduction ratio / equilibrium solutions
© The authors. Published by EDP Sciences, 2020
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1 Introduction
Although widely used in practice, compartmental models of epidemic spread (as for instance the celebrated SIR [26] model) rely on various simplifying assumptions, like a limited number of compartments, the homogeneity of the population inside a compartment, or some well defined parameters. The advantage of such models is that the impact of the variation of a single parameter on various qualitative properties can easily be studied, but the risk lies in an outrageous simplification of the representation of a complex system, with the additional difficulty that some parameters can be difficult to quantify from the available statistical data. Here we numerically study how the separation of the population into two subgroups of individuals with different intensities of social interaction can qualitatively explain some observed features of the current pandemic of Covid19, and provide some theoretical explanations which also apply to more realistic models.
New characteristics of the pandemic of Covid19 are unveiled every day and reveal various interesting features. One of the issues is that curves showing the number of new cases in European countries stabilize very slowly after the beginning of the lockdown. Many questions have been raised on the methods for collecting the data and on their quality, that we do not address, not to mention variations between countries that deserve further investigations. As a simple explanation, it has been suggested that the main reason is that a significant fraction of the population does not respect lockdown. This ignores the nonlinear properties of simple epidemic models. We show here that, if a rather small fraction of the population cannot reduce its social interactions, eventually for very good reasons as, e.g., the health workers or some other key actors of our societies, the epidemic keeps spreading, stabilizes at a much slower rate than what one would expect, and finally affects a significant fraction of the population.
In this paper we shall use the SEIR model rather than the simpler SIR model because of the significant period of incubation in the Covid19 disease. The SEIR model has various properties that can be understood, at least for the order of magnitude of its effects, using simple objects like the well known basic reproduction ratio . However, it also has very nonlinear features which are defying common sense and require rigorous mathematical analysis. In the accompanying numerical examples, we take values for the parameters which are compatible with the data collected during the pandemic of Covid19. These examples are intended to understand qualitative features of the epidemic but cannot and should not be implemented for direct predictive use, as our model is too crude and oversimplified to reflect the Covid19 epidemic in a quantitative manner.
We start by reviewing known results concerning the SEIR model applied to a homogeneous population in Section 2. A model with a large majority of the population under lockdown and a small minority which does not or cannot implement the social distancing is then considered. The factor of reduction of social interactions q quantifies the effect of the lockdown. It is certainly very difficult to measure this parameter in real life applications and structured population models should be considered, in order to reflect the variety of social interactions. Our goal is to understand how, by varying q, one eventually triggers the nonlinearity of the SEIR model and, as a consequence, how this drives the system into a stable equilibrium which is far from the usual target of a lockdown policy, that is, the control of the disease, but is also far from the dynamics that would develop in absence of lockdown.
2 Basic mathematical properties of the SEIR model
2.1 The SEIR model
Let us consider the SEIR (Susceptible, Exposed, Infected, Recovered) model [4] defined by the set of equations
which is a variant of the SIR model [26] of Kermack and McKendrick. Here we neglect birth and death rates, i.e., we consider a model without vital dynamics. The average incubation period is 1∕α, the parameter β is the product of the average number of contacts per person and per unit time by the probability of disease transmission in a contact between a susceptible and an infectious individual, γ is a transition rate so that 1∕γ measures the duration of the infection of an individual and N is the total population size. In the Covid19 pandemicg, the average incubation period is of several days and this is why an SEIR model has to be preferred to a simple SIR model. Many qualitative features are the same in the two models but the compartment E of exposed individuals makes the analysis slightly more delicate. Unreported cases or asymptomatic individuals are not taken into account here: this is an important aspect of the Covid19 epidemic, see for instance [9, 34, 35], with important consequences on the epidemic size, but probably not so much on the qualitative issues that are discussed in this paper. Other factors, like delays for the transmission of the information studied in [7], certainly also play a role in the current outbreak.
The SEIR model is a compartmental model used to understand the mathematical modelling of infectious diseases in a large population, with enough individuals in each compartment so that stochastic effects can be neglected. Spatial effects are also neglected, which is a rather crude approximation. However, such a simplified model allows us to perform an analysis of the sensibility to the parameters which are of interest for a qualitative description of the outbreak of an epidemic disease.
The system (2.1)–(2.4) is homogeneous so that we can simply consider the fractions
of the Susceptible, Exposed, Infected and Recovered individuals among the whole population.
2.2 Conservations and large time asymptotics
Here we perform a simple analysis of the model, which is done in the spirit of [22]. By conservation of the total number of individuals, we have the relation (2.5)
for any time t ≥ 0, which is easily proved by summing the equations in the system, while the evolution is now governed by the system
An important and classical observation is that the domain
is stable under the action of the flow (2.6)–(2.9), and it is straightforward to check that (2.10)
Since s is monotone nonincreasing by (2.6), this means that (2.6)–(2.9) has a global solution for any t ≥ 0 and
By an elementary analysis, we find that any solution (s, e, i, r) of (2.6)–(2.9) with initial data (s_{0}, e_{0}, i_{0}, r_{0}) such that i_{0} + e_{0} > 0 converges as t → +∞ to a stationary solution (s^{⋆}, e^{⋆}, i^{⋆}, r^{⋆}) with (2.11)
according to (2.5) and (2.10). Note that the solution is unique if γ − β > 0 but there are two solutions if γ − β < 0 and γ log s_{0} + β r_{0} = γ log s_{0} − β (s_{0} + e_{0} + i_{0}) + β > 0. In our numerical applications, we shall assume that γ log s_{0} + β r_{0} = γ log s_{0} − β (s_{0} + e_{0} + i_{0}) + β < 0 if γ − β < 0, so that s^{⋆} is uniquely defined.
The epidemic size ζ is defined as the fraction of individuals that are affected by the epidemic, here s_{0} − s^{⋆}. Next, let us consider more specifically the case of an initial datum which is a perturbation of the constant in time DFE solution, or Disease Free Equilibrium (see [45]), corresponding to (2.12)
2.3 Stable equilibrium, epidemic size and phase transition
Let us assume that e_{0} + i_{0} + r_{0} = ε > 0 is small and r_{0} = ϑ ε for some given ϑ ∈ [0, 1). This initial condition, in the limit as ε → 0_{+}, is a perturbation of the DFE solution. For any ε > 0, we know by (2.5) that s_{0} = 1 − ε. The solution of (2.6)–(2.9) converges as t → +∞ to (s^{⋆}, 0, 0, r^{⋆} = 1 −s^{⋆}), the stationary solution (s^{⋆}, 0, 0, r^{⋆}) is stable and this is why we call it the stable equilibrium solution. See Section 7.3 in [36] for a discussion of the stability. In a model with birth and death rates, the solution is usually called the endemic equilibrium, but we shall prefer to call it simply the stable equilibrium solution as we neglect birth and death issues. The following discussion is given here in preparation for the next two sections. We do not claim originality and refer for instance to [22] for a detailed study motivated by very explicit examples. Depending on whether the DFE solution is stable or not, we have three regimes for s^{⋆} = s^{⋆}(ε):
– Control of the epidemic. If γ > β, we find that (2.13)
as ε → 0_{+}. In other words, the stable equilibrium is a perturbation of the DFE solution and the epidemic size ζ is of order ε.
– Epidemic spreading and herd immunity. If γ < β, then we note that s↦γ log s − β s achieves a maximum point on (0, 1) at s = γ∕β and, as a consequence, that (2.14)
even for an arbitrarily small value of ε > 0. Hence we find that s^{⋆}(ε) is not only a perturbation of order ε of s_{0} but also that the value of s^{⋆}(ε) is of order s^{⋆}(0), the unique root in (0, 1) of γ log s + β (1 − s) = 0. Since the constant solution is a stable equilibrium, herd immunity is always granted in the sense that no outbreak can occur. Let .
With , the epidemic spreads in the sense that s^{⋆}(0) is the solution of (2.15)
This means that where the function W is known asthe Lambert function (see for instance [8]) and defined as the inverse of w↦ w e^{w}. Note that r = 1 − s is given for s = s^{⋆}(0) by the equation (2.16)
and that r = 1 −s^{⋆}(ε) solves (2.17)
so that s^{⋆}(ε) also depends on ϑ (but this dependence disappears in the limit as ε → 0_{+}). Altogether, if γ < β, that is, for , the epidemic size ζ is of order 1 as ε → 0_{+}. See Figure 2 for an illustration of the dependence of ζ in .
– In the threshold case β = γ, i.e., , which is typical of a phase transition, we have to solve (2.18)
as ε → 0_{+}. We recall that 0 ≤ r_{0} < ε and observe that the epidemic size ζ is of order as ε → 0_{+}. Scale invariance is reflected by the fact that there is no dependence neither on β nor on γ.
Summarizing, there are two phases and a threshold case corresponding to the phase transition. If γ > β, the epidemic size ζ is close to zero. The disease does not spread in the population and simply vanishes exponentially fast. On the opposite, if γ < β, the Disease Free Equilibrium is unstable, the diseases quickly spreads with an exponential growth, the system converges for large times to a stable equilibrium far away from the DFE solution and the epidemic size ζ is a significant fraction of the total population, irrespective of how small the fraction of initially infected individuals is. Whether or determines the asymptotic stable equilibrium starting from the DFE solution. In the literature, is called the basic reproduction ratio, or the basic reproduction number. So far we simply consider it here as the order parameter of the phase transition, in the usual sense in physics: see for instance ([30], p. 449) or ([44], p. 3). In the next section, we shall explain the role it plays in the initial dynamics of the model and refer to Section 2.6 for more considerations on the epidemiologic interpretation of .
Note that the SIR model has exactly the same stationary states and the same phase transition as the SEIR model, as the SIR model is obtained by replacing (2.7) by β s i = α e, so that, in the SIR model, the equation for i becomes (2.20)
The order parameter is also and whether or determines if the epidemic is spreading or if the disease is controlled.
2.4 Linearization and the basic reproduction ratio
In the case of the SIR model, (2.20) can be rewritten as (2.21)
and it is elementary to observe that when s ~ 1, whether or determines the initial dynamics of the model: the interpretation of is clear fromthe above equation. After this digression on the SIR model, let us come back to the SEIR model. Understanding the role of is a little bit more subtle than in the SIR model. We recall that r(t) = 1 −s(t) −e(t) −i(t) plays no role in the stability analysis of the DFE solution. At any time t, the linearized dynamics of is described by the matrix (2.22)
and we may notice that the largest eigenvalue of , corresponding to the linearization around the DFE solution, is (2.23)
so that λ(α, β, γ) is positive if and only if . Moreover, the eigenspace is compatible with the nonlinear dynamics so that there are perturbations of the DFE solution which are exponentially growing with a rate λ(α, β, γ) if and only if . However, the basic reproduction ratio is not anymore directly connected with the linearized growth mode of i. In fact, we can observe that (2.21) is replaced by (2.24)
if s ~ 1, which is indeed the correct way of estimating the growth of the epidemic. We will come back on the interpretation of in Section 2.6 and note that (e + i) is known in the literature as the population in the infectious compartments.
2.5 Social distancing and the factor of reduction of social interactions
The goal of a lockdown policy is to replace the system (2.6)–(2.9) by
for some factor q > 1 which measures the reduction of social interactions of each individual. Of course, what we obtain is exactly (2.6)–(2.9) with the parameter β replaced by β∕q. The point isthat the basic reproduction ratio becomes (2.29)
and the goalis either to fix q to a value large enough so that the epidemic is controlled, that is , or at least to make small in order to flatten the curve, i.e., to have an epidemic going at slower pace. See Figure 1 for an illustration. Here the exponent (1) in the notation points to the assumption that we consider a population with a single group of susceptible individuals or, in other words, a socially homogeneous population.
In a SIR model with social distancing, with (2.25), (2.28) on the one hand, but (2.26) and (2.27) replaced by (2.30)
on the other hand, it is possible to compute the epidemic peak. This is a classical result, see for instance Section 2.1.2 in [36]. The epidemic peak is defined as the maximum i(t_{p}) of t↦i(t) after noticing that, at t = t_{p}, we have the system of equations (2.31)
which provides us with the value (2.32)
for any . Note that as (s_{0}, r_{0}) → (1, 0), i.e., in the limit of a DFE initial datum. We obtain the same expression in the SEIR model if we replace i by e + i. See [15] as a source of inspiration for such considerations, and also property (2) of Theorem 4.2, in a much more general framework.
Figure 1 The peak of the outbreak in the SEIR model. The time t is counted in days. The vertical axis represents the fraction of the population. The basic reproduction ratio is either (left) or 1.37 (right) corresponding to a reduction of social interactions by a factor q = 1.7 as in [5]. This illustrates the flattening of the curves. 
2.6 The basic reproduction ratio and the method of the next generation matrix
According to [12], the basic reproduction ratio is the expected number of secondary cases produced, in a completely susceptible population, by a typical infected individual during its entire period of infectiousness. In [6], Blackwood and Childs provide us with a comprehensive introduction to the computation of using the method of the next generation matrix in the case of the SEIR model. Such a computation goes back to [12], the standard method is exposed in [45] and we can also refer to [13] for an application to the SEIR model. As for a more general presentation of the method in compartmental models and more formal mathematical treatments, one can refer to [11, 36], and to [23] and p. 17 in [4] for early considerations on endemic and stable equilibria. Alternative definitions of the basic reproduction ratio in compartmental models are also available: see [36] for an overview, [22] for a more historical account and [21, 23] for considerations on models which are more directly linked to our interests (see Sect. 3).
For sake of completeness, let us give a brief summary of the method of the next generation matrix. First of all, one restricts the analysis to the infectious compartments, x = (e, i) in case of (2.25)–(2.28), and consider the linearized evolution equation around the DFE solution, that is, (2.33)
where F and V respectively denote the matrices associated with the rate of new infections and the rates of transfer between compartments, i.e., (2.34)
See [45] for details. According to Lemma 1 in [45], we observe that the matrix F is nonnegative and the matrix V is nonsingular. In this framework, the basic reproduction ratio is defined as the largest eigenvalue of F V^{−1}. It is an elementary computation to check that (2.35)
has two eigenvalues, 0 and . This proves that is the basic reproduction ratio, as defined by the method of the next generation matrix. In this framework, it is known from Theorem 2 in [45] that the DFE solution is stable if and unstable if .
3 A heterogenous model of social distancing
3.1 A simple model with two groups
Let us consider a population divided in two groups indexed by k = 1, 2, in which the Susceptible individuals have a factor of reduction of social interactions q_{k} which differ in the two groups. We shall assume that each of these groups gather a fixed fraction of the population p_{k} with p_{2} = p small and p_{1} = 1 − p. While the group corresponding to k = 1 observes a lockdown and has a factor q_{k} > 1, we are interested in the situation in which the other group has no reduction of social interactions: q_{2} = 1, or eventually has more social interactions than average before lockdown, corresponding to some q_{2} < 1. The typical example is the case of health workers in a period of epidemic disease or supermarket cashiers, who have contacts with a much larger number of people than an average individual. It is of course very difficult to estimate q_{2} and one should take into account the efficiency of barrier procedures. Instead of trying to make rough guesses for the value of q_{2}, we will vary it in order to see what is the impact on the solutions.
With a straightforward notation, let us split the population of Susceptible individuals in two groups
and consider for k = 1, 2 the system (3.1)
There are multiple possible variants and it would make sense, for instance, to distinguish i_{1} and i_{2} in the above equations, with detailed contamination rules. The above system has striking properties. It is for instance straightforward to see that the linearized system around the DFE solution, i.e., the matrix , has a largest eigenvalue given by (3.2)
where the basic reproduction ratio , as defined in the method of the next generation matrix, is given by (3.3)
One can indeed apply the method of Section 2.6 and observe that the only change lies in the matrix F, where the coefficient β∕q has to be replaced by (1 − p) β_{1} + p β_{2}, which establishes (3.3). It is easy to deduce from (3.2) that λ(α, β, γ) is positive if and only if .
Modeling heterogeneous mixing in infectious disease dynamics when the population is subdivided by characteristics other than those that are diseaserelated, such as risk status or age, is not new. This has been considered for instance from the dynamical point of view in [28] or in the case of sexually transmitted diseases and particularly in HIV/AIDS models, with groups that are not all defined by disease related properties. In this perspective, contact matrices have been considered, which involve a detailed analysis of the transmission mechanisms. We can refer to [2, 23–25, 37] for various considerations in this direction and to Section 3 in [21] for a discussion of the homogeneous mixing fallacy in the application to successful vaccination policies. The present paper ignores a number of issues like symmetry in transmission between groups and densitydependent transmission questions in order to focus on simple qualitative questions: any serious study with quantitative goals should of course address these issues with care: see for instance [6] for a warning. For sake of simplicity, we have chosen to consider that the origin of the infected individuals (group 1 or 2) plays no role in the transmission. This has the simple consequence that the basic reproduction ratio is in the end exactly the average of the ratios independently computed for each group, as shown by (3.3). In the regime corresponding to p much smaller than 1 − p and whatever the details are, our model is anyway good enough to show that what matters concerning the basic reproduction ratio is the average of the ratios^{1}. However, when one considers the epidemic peak and the large time asymptotics, the message is not only the average and this is what we explain next.
3.2 Conservations and large time asymptotics
System (3.1) inherits of the properties of the standard SEIR model. The conservation of mass (3.4)
guarantees that all quantities are bounded by 1 as long as they are nonnegative. For a solution of (3.1), let us observe that (3.5)
We shall moreover assume that (3.7)
as the population can be considered, in the initial phase of the outbreak (for t ≤ 0, in our setting), as a single group. This point could be reconsidered and studied as in [5] if one is interested in the dynamics of the epidemics, but it has a no significant impact on the stable equilibrium in the uncontrolled case. An analysis of the trajectories of (3.1) as in Section 2.3 shows that solutions globally exist and that there is a unique stable attractor . We deduce from the conservation of mass (3.8)
that the stable stationary solution is given as the unique solution with r^{⋆} = r > r(0) of (3.9)
Under the condition that , we have that (3.10)
3.3 Stable equilibrium and phase transition
The same discussion as in Section 2.3 can be done. To fix ideas, let us assume for instance, as a simplifying assumption, that s(0) = 1 − ε, e (0) + i(0) + r(0) = ε, and r_{0} = ϑ ε for some given ϑ ∈ [0, 1), so that . The equation for the equilibrium (3.9) can be rewritten as (3.11)
and the Taylor expansion (3.12)
for r > 0, small, shows that there is a solution with r > 0, small, as ε → 0_{+} if and only if , thus giving a solution of order ε, which turns out to be the unique solution. Otherwise, the only positive solution of (3.11) corresponds to some r ≫ ε and we have exactly the same phase transition as in Section 2.3, with playing the role of an order parameter. When the disease spreads, we find that r^{⋆} is of the order of the solution of (3.11) with ε = 0, that is, of the solution r = r of (3.13)
By convexity, we know that (3.14)
If we choose q such that , which means , then it is clear that the solution of (3.15)
is larger than r^{⋆}. In other words, the epidemic size is reduced if we replace (2.25)–(2.28) by (3.1) with q, q_{1} and q_{2} as above.
4 General heterogeneous distribution
In this section we extend the previous framework to the situation of an arbitrary number of different population classes. Each class, or group, contains the individuals that share a given value of the transmission rate β∕q. More precisely we consider a probability space and 𝔖(ω, t), 𝔅(ω) random variables on this space designating respectively the state of a random individual ω ∈ Ω and its β∕q parameter. We denote by the average operator.
For instance the case of Section 3 corresponds to the situation when 𝔅 has only two values, β_{1} = β∕q_{1} and β_{2} = β∕q_{2}, and , with k = 1, 2. In order to keep notation simple, we will suppose in the following that the conditional law of 𝔅 relative to being susceptible (at t = 0) only takes a finite number of values β_{1},..., β_{K}, i.e., is of the form , with , but all results given below extend to the general case. Note that . We adopt the notation ℜ = 𝔅∕γ, R_{k} = β_{k}∕γ for any k = 1,..., K and define (4.1)
With these notations p_{k} = s_{k}(0)∕s(0). The evolution of is governed by the system of equations^{2}: (4.3)
Let us start by a simple observation.
By imposing the condition r(0) = 0, what we have in mind as initial datum is a perturbation of the DFE solution, or a solution with initial values for which the infectious compartments are nonempty at t = 0, i.e., e(0) + i(0) > 0, eventually small (but this is not even mandatory). As we shall see, (4.7) follows by convexity as in Section 3.3.
Note that, by assuming that (e + i)(0) is small, it is clear that the peak can be made small if is small.
5 Numerical results
5.1 Choice of a set of parameters
It is not thepurpose of this paper to discuss the values of the parameters in a SEIR model of the pandemic of Covid19 and we shall simply choose values that can be found in the literature, for a purely illustrative purpose. However, for the interested reader, we list our sources and some entries in the rapidly growing literature on the topic.
Concerning the initial data for the SEIR model, we shall assume that the size of the French population is N = 67.8 × 10^{6}. The following values correspond to the situation for the Covid19 in France on March 15, 2020, according to [5] and data released on a daily basis by SPF at [43]. We shall take as initial data the values
with . Here we provide the formulae used in [5] to infer the numbers, which are based on an asymptotic analysis of the SEIR model during the initial phase of the epidemics (without lockdown). Such values are important for controlled epidemics but play essentially no role if the disease is spreading in the population, which is the case under investigation. Note that it is very likely that the numbers will be revised in the future, to account for non well documented cases at the time this study was done. In any case, for numerical computations, we take the following initial data, as fractions of the total population (5.1)
in all our examples.
Now comes the issue of estimating the parameters β, γ and α of the SEIR model, and the factor of reduction of social interactions q, at least as an average, in a socially homogeneous model and also for the majority of the population in the model with two groups. The methodology is out of the scope of this paper and we will not comment it. We refer to [35] for a recent discussion. However, in order to fix the order of magnitude and give an idea of the uncertainties, let us review some numbers which recently appeared. An estimate based on the statistics of the known cases of Covid19 at the beginning of the outbreak has been proposed in [5], with the following values (5.2)
before lockdown. By fitting the values for the first two weeks of lockdown, N. Bacaër came up with q ≈ 1.7 for an estimate of the factor of reduction of social interactions. These numbers are the ones used for our choice (5.1). We learn from [10] that the parameters in the SEIR model can be estimated by: β =2, α =1∕3.7 ≈ 0.27, γ = 1∕1.5 ≈ 0.67 and at regional level (IledeFrance) and it was suggested that q ≈ 2.94 so that . This is consistent with the estimate 0.67 of [42].
Before going further, let us list some limitations of our model and features which have to be taken into account in more realistic models. In [35], it is argued that Unreported cases (U) should be taken into account in a S(E)IRU model for correctly accounting the Covid19 outbreaks (also see [32]). There is certainly an important point here, although we did not introduce it for sake of simplicity. Note that such models produce a very high (at least onthe basis of the data used for fitting the curves) ranging from 4.45 to 4.49. In [41], the basic reproduction ratio is adjusted to . In [42], the authors find that the basic reproduction ratio, which was before lockdown, has been reduced to 0.52, which corresponds to q ≈ 6.6. This is consistent with the factor q = 7 found in [40]. Of course important factors like risk status or age (see for instance [16, 18]) have been put in evidence, but they are not easy to study from the point of view of transmission rates. In our approach, it is clear that we take oversimplifying assumptions, but this is probably the price to pay to prove results and put them numerically in evidence.
Recent papers have emphasized that the heterogeneity of the transmissions rates is important in the current pandemic and that it lowers herd immunity thresholds in various modeling frameworks. During the revision of this paper, we became aware of [14, 16, 17]. Superspreaders and superspreading events are known to play an essential role in the propagation of Covid19 according to [1, 3, 19, 20, 27, 38, 39]. Incorporating these considerations in compartmental models will anyway require further studies.
So far it remains difficult to choose a set of parameters, although one can hope that a better understanding of the dynamics of the pandemic will emerge out of the various studies that are currently done. What we learn after the end of the lockdown is still to be studied. Now comes a very empirical observation, in the framework of SEIR models, with parameters that are supposed to be constant in time, a crude assumption that is definitely not valid after the end of the lockdown. This empirical observation is the starting point for this paper. Situations with a basic reproduction ratio exhibit an exponential decay of the number of cases while the data of [43] (at the beginning of the lockdown) were emblematic of a situation with . Several phases for a population under lockdown are illustrated in [31] by the computation of the effective reproduction ratio (which is of the order of as only a small proportion of the total population was concerned) in Wuhan, based on the reported daily Covid19 infections: when there was no intervention (before January 23, 2020): ; under lockdown with traffic ban and many confined at home (between January 23February 1st, 2020): ; using centralized confinement (after February 1st, 2020): . Also see [33] for a detailed analysis of the data in Wuhan based on a SIR type model. The curves in France and southern Europe during the first month of lockdown look more like to what happened in Wuhan by the end of January, with an , than to what happened later. This is what we intend to explain in France, at least partially, by social heterogeneity. For the sake of simplicity, we shall retain the values of (5.1), and one of the reasons is that is compatible with the above remarks, but our models of Sections 3 and 4 show that there might be some subtleties when heterogeneities are taken into account.
5.2 SEIR model with a single group
We start by considering the standard SEIR model (2.6)–(2.9) or its variant (2.25)–(2.28), where q is the factorof reduction of social interactions. The basic reproduction ratio in the second system is , which allows us to reduce both problems to (2.6)–(2.9) with various values of the basic reproduction ratio depending on the q factor. Here we choose the initial data according to (5.1) and the set of parameters (5.2). See Figure 1 for the epidemic curves, and Figure 2 for the epidemic size.
Figure 2 The epidemic size ζ as a functionof the basic reproduction ratio in the simplest SEIR model corresponding to (2.6)–(2.9) exhibits a clear phase transition at (thick curve). The other curves represent r(t) taken for t =30, 60, 90, for which equilibrium is not yet achieved. In practice, varying is achieved by acting on the q factor in (2.25)–(2.28). 
5.3 SEIR model with two groups having different factors of reduction of social interactions
One of the most disturbing results in [5] is that fitting the data of the cumulated number of cases by model (2.25)–(2.28) shows that q ≈ 1.7, which is far below and suggest that lockdown is inefficient for controlling the outbreak. However, we have seen that there is another possible model, as it is illustrated by Figure 3, with two groups. From the figure, it is clearly impossible to distinguish between the two scenarios at the early stage of the outbreak.
With two groups arises the question of choosing the initial data. As a crude and simplifying assumption, we make the choice to consider that before lockdown there was a single population and that the two categories of the population had the same exposure to the disease. We can then use (5.1) as initial condition and take s_{1} (0) = (1 − p) s(0) and s_{2} (0) = p s(0). With this choice, we recover the results of (2.25)–(2.28) if q_{1} = q_{2} and s = s_{1} + s_{2}. In order to fix ideas, we also make the arbitrary choice of choosing q_{1} = 2.35 so that the epidemic disease would extinguish by itself after affecting 0.81% of the population if q_{2} = q_{1}. We can illustrate the role of the two parameters p and q_{2} by showing that they completely change the picture and bring us back to a regime with an epidemic size corresponding to some inefficient lockdown, however with lower epidemic peak and size: see Figure 4, and Figure 5 for the epidemic size.
Figure 3 Plot of t↦e(t) + i(t) + r(t) for a solution of (2.25)–(2.28) with q = 1.7 (black dotted line) versus a plot of t↦e(t) + i(t) + r(t) for a solution of (3.1) in logarithmic scale with p = 0.02, q_{1} = 2.35, and q_{2} = 0.117 (brown). In both cases the basic reproduction ratio is 1.37. 
Figure 4 Model with a single group (left) and q = 1.7 corresponding to a basic reproduction ratio of 1.37, and two groups (right) with q_{1} = 2.35, q_{2} = 0.117, and p = 0.02 as in Figure 3, with same basic reproduction ratio. Note that the figure on the left is the same as in Figure 1 (right), on a different scale. The straight dotted line is the level p. In the case with two groups, note that almost all individuals of the second group get infected during the propagation of the disease. 
Figure 5 The epidemic size in a population with one group (dotted curve, same as in Fig. 2) or two groups (plain, thick curve) with and p = 0.02 as a function of the basic reproduction ratio (obtained by varying q in the first case and q_{2} in the second case). The other lines correspond to simulations of r(t) for t = 30, 60, 90 in the model with two groups. With two groups, we recover a phase transition as in Figure 2, corresponding to q_{2} ≈ 0.96. 
6 Discussion
In the SEIR model with a basic reproduction ratio , the stable equilibrium does not depends much on the initial data: it is almost entirely determined by . The goal of lockdown is to decrease by dividing it by a reduction factor for social interactions, q. With , the disease is under control. With , but close to 1, the epidemic disease spreads, the final state depends little on the initial data, but the curve is flattened: qualitatively, this is the scenario that one can observe in the Covid19 outbreak in France under lockdown.
Measuring the q factor is difficult. The aim of this article is to show that the crucial information cannot be reduced to the knowledge of an average factor q: if the population is divided into two groups, witha group for which (the majority) and another group (the minority) that keeps a small q factor, the disease may continue to spread. If the q factor of the majority is larger than but close to , so that is small, the impact of the minority becomes extremely important as it eventually triggers the nonlinearity. However, the equilibrium asymptotic state in a twogroup model is not the same as when considering a single group with an averaged basic reproduction ratio. The dynamics of the outbreak, for instance the height of the epidemic peak, is also changed. A twogroup model is of course extremely simplistic, but shows the importance of understanding the distribution of the q factors in a population.
Our observations are not limited to a population divided into two groups. In our model of heterogeneous social interactions, with a whole distribution of q factors, we have shown that an average of q is not relevant. While the basic reproduction ratio behaves as a plain average across the homogeneous q categories, neither the peak nor the total epidemic size do the same. In particular the presence of heterogeneity is beneficial for both the peak and total epidemic size. Or, put it otherwise, a model with only one group and fitting the observed data in the initial phase of the outbreak will be more pessimistic concerning the epidemic outcomes than a heterogeneous model; this is even more true after lockdown when social distancing measures have been enforced, the lockdown being by its nature a creator of heterogeneity. In terms of public health, this also underlines the importance of targeting prevention measures on individuals with a high level of social interactions.
Acknowledgements
This work has been partially supported by the Project EFI (ANR17CE400030) of the French National Research Agency (ANR). The authors thank Nicolas Bacaër for a stimulating discussion which was the starting point of this research project, Gilles Zérah for pointing them an important reference, and an anonymous referee for pointing them a missing argument and suggesting several improvements. The authors also thank the MODCOV19 platform for encouragements and collective effort.
References
 D. Adam, P. Wu, J. Wong, E. Lau, T. Tsang, S. Cauchemez, G. Leung and B. Cowling, Clustering and superspreading potential of severe acute respiratory syndrome coronavirus 2 (SARSCoV2) infections in Hong Kong. To appear in Res. Square (2020). DOI: 10.21203/rs.3.rs29548/v1. [Google Scholar]
 F.R. Adler, The effects of averaging on the basic reproduction ratio. Math. Biosci. 111 (1992) 89–98. [Google Scholar]
 B.M. Althouse et al., Stochasticity and heterogeneity in the transmission dynamics of SARSCoV2. Available from: https://covid.idmod.org/ (2020). [Google Scholar]
 R.M. Anderson, B. Anderson and R.M. May, Infectious diseases of humans: dynamics and control, Oxford University Press, Oxford (1992). [Google Scholar]
 N. Bacaër, Un modèle mathématique des débuts de l'épidémie de coronavirus en France. MMNP 15 (2020) 29. [EDP Sciences] [Google Scholar]
 J.C. Blackwood and L.M. Childs, An introduction to compartmental modeling for the budding infectious disease modeler. Lett. Biomath. 5 (2018) 195–221. [Google Scholar]
 B. Buonomo, A. d' Onofrio and D. Lacitignola, Global stability of an SIR epidemic model with information dependent vaccination. Math. Biosci. 216 (2008) 9–16. [Google Scholar]
 R.M. Corless, G.H. Gonnet, D.E.G. Hare, D.J. Jeffrey and D.E. Knuth, On the Lambert W function. Adv. Comput. Math. 5 (1996) 329–359. [Google Scholar]
 A. Danchin, T.W.P. Ng and G. Turinici, A new transmission route for the propagation of the SARSCoV2 coronavirus. Preprint medRxiv 20022939v1 (2020). [Google Scholar]
 G. Di Domenico, L. Pullano, C.E. Sabbatini, P.Y. Boëlle and V. Colizza, Expected impact of lockdown in ÎledeFrance and possible exit strategies. Available from: https://www.epicxlab.com (2020). [Google Scholar]
 O. Diekmann and J.A.P. Heesterbeek, Mathematical epidemiology of infectious diseases: model building, analysis and interpretation, vol. 5. John Wiley & Sons, New Jersey (2000). [Google Scholar]
 O. Diekmann, J.A.P. Heesterbeek and J.A.J. Metz, On the definition and the computation of the basic reproduction ratio R_{0} in models for infectious diseases in heterogeneous populations. J. Math. Biol. 28 (1990) 365–382. [CrossRef] [MathSciNet] [PubMed] [Google Scholar]
 O. Diekmann, J.A.P. Heesterbeek and M.G. Roberts, The construction of nextgeneration matrices for compartmental epidemic models. J. R. Soc. Interface 7 (2009) 873–885. [Google Scholar]
 A. Endo, S. Abbott, A.J. Kucharski and S. Funk, Estimating the overdispersion in COVID19 transmission using outbreak sizes outside China. Wellcome Open Res. 5 (2020) 67. [PubMed] [Google Scholar]
 Z. Feng, Final and peak epidemic sizes for SEIR models with quarantine and isolation. Math. Biosci. Eng. 4 (2007) 675. [CrossRef] [PubMed] [Google Scholar]
 A. Gerasimov, G. Lebedev, M. Lebedev and I. Semenycheva, Reaching collective immunity for COVID19: an estimate with a heterogeneous model based on the data for Italy. Preprint medRxiv 20112045v1 (2020). [Google Scholar]
 M.G.M. Gomes, R.M. Corder, J.G. King, K.E. Langwig, C. SoutoMaior, J. Carneiro, G. Goncalves, C. PenhaGoncalves, M.U. Ferreira and R. Aguas, Individual variation in susceptibility or exposure to SARSCoV2 lowers the herd immunity threshold. Preprint medRxiv 20081893v1 (2020). [Google Scholar]
 Q. Griette, P. Magal and O. Seydi, Unreported cases for age dependent COVID19 outbreak in Japan. Preprint medRxiv 20093807v1 (2020). [Google Scholar]
 G. Grossmann, M. Backenkoehler and V. Wolf, Importance of interaction structure and stochasticity for epidemic spreading: A COVID19 case study. Preprint medRxiv 20091736v1 (2020). [Google Scholar]
 L. HébertDufresne, B.M. Althouse, S.V. Scarpino and A. Allard, Beyond R0: Heterogeneity in secondary infections and probabilistic epidemic forecasting. Preprint medRxiv 20021725v1 (2020). [Google Scholar]
 H.W. Hethcote, Modeling heterogeneous mixing in infectious disease dynamics, in Models for infectious human diseases: their structure and relation to data. Cambridge University Press, Cambridge (1996) 215–238. [CrossRef] [Google Scholar]
 H.W. Hethcote, The mathematics of infectious diseases. SIAM Rev. 42 (2000) 599–653. [Google Scholar]
 H.W. Hethcote and J.W. Van Ark Epidemiological models for heterogeneous populations: proportionate mixing, parameter estimation, and immunization programs. Math. Biosci. 84 (1987) 85–118. [Google Scholar]
 J.M. Hyman and E. Stanley, Using mathematical models to understand the aids epidemic. Math. Biosci. 90 (1988) 415–473. [Google Scholar]
 J.A. Jacquez, C.P. Simon, J. Koopman, L. Sattenspiel and T. Perry, Modeling and analyzing HIV transmission: the effect of contact patterns. Math. Biosci. 92 (1988) 119–199. [Google Scholar]
 W.O. Kermack and A.G. McKendrick, A contribution to the mathematical theory of epidemics. Proc. R. Soc. Lond., Ser. A 115 (1927) 700–721. [CrossRef] [Google Scholar]
 A.J. Kucharski, P. Klepac, A. Conlan, S.M. Kissler, M. Tang, H. Fry, J. Gog and J. Edmunds, Effectiveness of isolation, testing, contact tracing and physical distancing on reducing transmission of SARSCoV2 in different settings. Preprint medRxiv 20077024v1 (2020). [Google Scholar]
 M. Lachiany and Y. Louzoun, Effects of distribution of infection rate on epidemic models. Phys. Rev. E 94 (2016). [Google Scholar]
 L. Laguzet and G. Turinici, Individual Vaccination as Nash Equilibrium in a SIR Model with Application to the 2009–2010 Influenza A (H1N1) Epidemic in France. Bull. Math. Biol. 77 (2015) 1955–1984. [Google Scholar]
 L. Landau and E. Lifshitz, Statistical Physics, 3rd edn. Vol. 5. Elsevier Science, Amsterdam (1980). [Google Scholar]
 X. Lin, Analysis of 25, 000 labconfirmed Covid19 cases in Wuhan: Epidemiological characteristics and nonpharmaceutical intervention effects. Department of Biostatistics and Department of Statistics, Harvard University and Broad Institute. Available from: https://cdn1.sph.harvard.edu/wpcontent/uploads/sites/21/2020/03/COVID1903162020Lin.pdf (2020). [Google Scholar]
 Z. Liu, P. Magal, O. Seydi and G. Webb, A covid19 epidemic model with latency period. Infect. Dis. Model. 5 (2020) 323–337. [PubMed] [Google Scholar]
 Z. Liu, P. Magal, O. Seydi and G. Webb, Predicting the cumulative number of cases for the Covid19 epidemic in China from early data. Math. Biosc. Eng. 17 (2020) 3040–3051. [CrossRef] [Google Scholar]
 Z. Liu, P. Magal, O. Seydi and G. Webb, Understanding unreported cases in the Covid19 epidemic outbreak in Wuhan, China, and the importance of major public health interventions. Biology 9 (2020) 50. [Google Scholar]
 P. Magaland G. Webb, Predicting the number of reported and unreported cases for the Covid19 epidemic in South Korea, Italy, France and Germany. Preprint medRxiv 20040154v1 (2020). [Google Scholar]
 M. Martcheva, An introduction to mathematical epidemiology, Vol. 61 of Texts in Applied Mathematics. Springer, New York (2015). [CrossRef] [Google Scholar]
 R.M. May and R.M. Anderson, Spatial heterogeneity and the design of immunization programs. Math. Biosci. 72 (1984) 83–111. [Google Scholar]
 K. Prem, Y. Liu, T.W. Russell, A.J. Kucharski, R.M. Eggo, N. Davies, M. Jit, P. Klepac, S. Flasche, S. Clifford et al., The effect of control strategies to reduce social mixing on outcomes of the COVID19 epidemic in Wuhan, China: a modelling study. Lancet Public Health 5 (2020) e261–e270. [Google Scholar]
 J. Riou and C.L. Althaus, Pattern of early humantohuman transmission of Wuhan 2019 novel coronavirus (2019nCoV), December 2019 to January 2020. Euro Surveill. 25 (2020) 2000058. [Google Scholar]
 L. Roques, E.K. Klein, J. Papaïx, A. Sar and S. Soubeyrand, Impact of lockdown on the epidemic dynamics of COVID19 in France. Front. Med. 7 (2020) 274. [Google Scholar]
 J. Roux, C. Massonnaud and P. Crépey, Covid19: Onemonth impact of the french lockdown on the epidemic burden. Publications de l'équipe REPERES (Recherche en Pharmacoépidémiologie et recours aux soins, UPRES EA7449). Available from: https://www.eareperes.com/wpcontent/uploads/2020/04/ImpactConfinementEHESP20200322v1.pdf (2020). [Google Scholar]
 H. Salje, C. Tran Kiem, N. Lefrancq, N. Courtejoie, P. Bosetti, J. Paireau, A. Andronico, N. Hozé, J. Richet, C.L. Dubost et al. Estimating the burden of SARSCoV2 in France. Science (2020) eabc3517. Report available from: DOI: 10.1126/science.abc3517 (2020). [Google Scholar]
 Santé Publique France, French national public health agency. Available from: https://dashboard.covid19.data.gouv.fr (2020). [Google Scholar]
 H. Stanley, Introduction to Phase Transitions and Critical Phenomena, The International Series of Monographs on Physics. Oxford, Oxford University Press Inc. (1971). [Google Scholar]
 P. van den Driessche and J. Watmough, Reproduction numbers and subthreshold endemic equilibria for compartmental models of disease transmission. Math. Biosci. 180 (2002) 29–48. [Google Scholar]
The mathematically rigorous formulation of the equations involves defining a continuous time Markov chain for any individual and the associated infection and recovering probabilities, see [29] for details.
All Figures
Figure 1 The peak of the outbreak in the SEIR model. The time t is counted in days. The vertical axis represents the fraction of the population. The basic reproduction ratio is either (left) or 1.37 (right) corresponding to a reduction of social interactions by a factor q = 1.7 as in [5]. This illustrates the flattening of the curves. 

In the text 
Figure 2 The epidemic size ζ as a functionof the basic reproduction ratio in the simplest SEIR model corresponding to (2.6)–(2.9) exhibits a clear phase transition at (thick curve). The other curves represent r(t) taken for t =30, 60, 90, for which equilibrium is not yet achieved. In practice, varying is achieved by acting on the q factor in (2.25)–(2.28). 

In the text 
Figure 3 Plot of t↦e(t) + i(t) + r(t) for a solution of (2.25)–(2.28) with q = 1.7 (black dotted line) versus a plot of t↦e(t) + i(t) + r(t) for a solution of (3.1) in logarithmic scale with p = 0.02, q_{1} = 2.35, and q_{2} = 0.117 (brown). In both cases the basic reproduction ratio is 1.37. 

In the text 
Figure 4 Model with a single group (left) and q = 1.7 corresponding to a basic reproduction ratio of 1.37, and two groups (right) with q_{1} = 2.35, q_{2} = 0.117, and p = 0.02 as in Figure 3, with same basic reproduction ratio. Note that the figure on the left is the same as in Figure 1 (right), on a different scale. The straight dotted line is the level p. In the case with two groups, note that almost all individuals of the second group get infected during the propagation of the disease. 

In the text 
Figure 5 The epidemic size in a population with one group (dotted curve, same as in Fig. 2) or two groups (plain, thick curve) with and p = 0.02 as a function of the basic reproduction ratio (obtained by varying q in the first case and q_{2} in the second case). The other lines correspond to simulations of r(t) for t = 30, 60, 90 in the model with two groups. With two groups, we recover a phase transition as in Figure 2, corresponding to q_{2} ≈ 0.96. 

In the text 
Current usage metrics show cumulative count of Article Views (fulltext article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 4896 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.