-
Table 1 lists the top four regions outside of Hubei Province with a relatively large number of reported confirmed cases alongside the corresponding maximum seating capacity for flights from Wuhan. The number of confirmed cases is positively related to passenger volume from Wuhan. Hence, the following model was considered: the number of imported cases $X_{K+d,i}$ from Wuhan to region $i$ by Day $K+d$ has a $\mathrm{Binomial}(10N_K, p_i)$ distribution, $i=1, 2, \ldots, m$, where $N_K$ is the total number of cases in Wuhan by Day $K$, which is to be estimated; $p_i$ is the daily probability of traveling from Wuhan to region $i$, which can be estimated as the ratio of the daily volume of passengers to the catchment population of Wuhan airport; and $d$ is the mean time from infection to detection (see details of the model explanation in Appendix A). The calculated daily number of travelers based on flight capacity is further described in Appendix B.
| Region | Total seats | Cases |
|---|---|---|
| Guangdong | 111,624 | 53 |
| Zhejiang | 46,528 | 43 |
| Beijing | 59,364 | 26 |
| Shanghai | 51,517 | 20 |

Table 1. Number of confirmed cases and seating capacity for 4 regions in China.
Determining the number of imported cases in region $i$, namely $X_{K+d,i}$, plays a crucial role in the modeling procedure. Table 2 shows the number of reported confirmed cases in various provinces, cities, and countries (excluding Hubei Province) within and outside of China on January 23, 2020. The column titled “No. of local cases” gives the number of cases that were not directly imported from Wuhan. Despite the rapid spread of the epidemic, the situation outside Hubei Province is relatively controlled given the adequate medical support allocated to the outbreak. This suggests that the number of reported cases outside Hubei, as of January 23, 2020, is a fairly accurate representation of the actual epidemic situation in the surrounding regions. Note that only cases directly imported from Wuhan were considered. For example, of the 53 confirmed cases reported in Guangdong Province, 8 were local cases, so the number of imported cases, $X_{K+d,i}$, was taken as 45. Moreover, the one patient in Singapore departed from the airport in Guangzhou; this was therefore not a directly imported case, and the corresponding $X_{K+d,i}$ is 0. Furthermore, observations from a few nearby provincial-level administrative divisions (PLADs), including Hunan, Anhui, Henan, Jiangxi, and Tibet, and from other cities within Hubei Province were dropped because the daily probability of travel is difficult to estimate without air transportation data from Wuhan.
| Region | No. of cases | No. of local cases |
|---|---|---|
| Guangdong | 53 | 8 |
| Zhejiang | 43 | |
| Beijing | 26 | |
| Shanghai | 20 | 1 |
| Chongqing | 27 | |
| Sichuan | 15 | |
| Guangxi | 13 | 1 |
| Jiangsu | 9 | |
| Shandong | 9 | 1 |
| Hainan | 8 | |
| Fujian | 5 | |
| Tianjin | 4 | |
| Liaoning | 4 | 1 |
| Heilongjiang | 4 | |
| Jilin | 3 | |
| Shaanxi | 3 | 1 |
| Guizhou | 3 | 1 |
| Ningxia | 2 | |
| Xinjiang | 2 | |
| Gansu | 2 | 1 |
| Yunnan | 1 | |
| Inner Mongolia | 1 | |
| Shanxi | 1 | |
| Qinghai | 0 | |
| Hunan | 24 | 1 |
| Anhui | 15 | |
| Henan | 9 | 1 |
| Jiangxi | 7 | |
| Hebei | 2 | |
| Tibet | 0 | |
| Macau, China | 2 | |
| Hong Kong, China | 2 | |
| Taiwan, China | 1 | |
| Japan | 1 | |
| South Korea | 1 | |
| USA | 1 | |
| Thailand | 3 | |
| Singapore | 1* | |
| Vietnam | 2 | 1 |

\* The patient was a resident of Wuhan city but departed from the airport in Guangzhou.

Table 2. Number of reported confirmed cases within (excluding Hubei) and outside China on 23 January 2020.
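The counting rule for imported cases can be sketched in a few lines of Python. This is only an illustration using four rows of Table 2; the function name `imported_cases`, the `direct` flag, and the treatment of blank "local cases" cells as zero are assumptions made for the example, not part of the original analysis.

```python
# Illustrative rows from Table 2: (reported cases, local cases, directly imported from Wuhan?)
# Blank "local cases" cells are ASSUMED to mean zero local cases.
reported = {
    "Guangdong": (53, 8, True),
    "Zhejiang": (43, 0, True),
    "Shanghai": (20, 1, True),
    "Singapore": (1, 0, False),  # patient departed from Guangzhou, not Wuhan
}


def imported_cases(total, local, direct=True):
    """Count only cases imported directly from Wuhan (hypothetical helper)."""
    return total - local if direct else 0


imported = {region: imported_cases(*row) for region, row in reported.items()}

for region, count in imported.items():
    print(f"{region}: {count}")
```

Guangdong yields 45 and Singapore yields 0, matching the two examples given in the text.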
Using $X_{K+d,i}$ obtained from domestically and internationally reported cases and the corresponding estimated travel probability $p_i$, where $i=1, 2, \ldots, m$, it is possible to infer the magnitude of comparable cases, $N_K$, within Wuhan by Day $K$ through a binomial model. The maximum likelihood estimate (MLE) of $N_K$ is 3,933, and the corresponding 95% CI is 3,454–4,450. Note that $X_{K+d,i}$ was obtained on January 23, 2020; hence, the estimated $N_K$ is the total number of cases (including those in the incubation period) as of January 14, 2020, or the number of cases with symptom onset by January 19, 2020.
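The estimator and interval derived in Appendix A can be sketched as follows. This is a minimal illustration, not the authors' code: the function names are hypothetical, the inputs in the usage lines are made-up numbers rather than the study data, and the exact Poisson limits are found by bisection on the Poisson distribution function, which is equivalent to the chi-square form of the interval but needs only the standard library.

```python
import math


def poisson_cdf(k, mu):
    """P(X <= k) for X ~ Poisson(mu), mu > 0; terms are evaluated in log space."""
    if k < 0:
        return 0.0
    terms = (math.exp(j * math.log(mu) - mu - math.lgamma(j + 1)) for j in range(k + 1))
    return min(1.0, sum(terms))


def bisect(f, lo, hi, iters=100):
    """Root of a continuous f on [lo, hi], assuming f(lo) and f(hi) differ in sign."""
    f_lo = f(lo)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def estimate_wuhan_cases(imported, probs, d, alpha=0.05):
    """Return (MLE, lower, upper) for N_K under the pooled Poisson model.

    imported: imported case counts X_{K+d,i}, one per region
    probs:    daily travel probabilities p_i, one per region
    d:        mean time from infection to detection, in days
    """
    x = sum(imported)
    scale = d * sum(probs)
    # Exact limits for the Poisson mean, then rescaled to N_K.
    mu_lower = 0.0 if x == 0 else bisect(
        lambda mu: 1.0 - poisson_cdf(x - 1, mu) - alpha / 2, 1e-9, float(x))
    search_hi = x + 10.0 * math.sqrt(x + 1.0) + 10.0
    mu_upper = bisect(
        lambda mu: poisson_cdf(x, mu) - alpha / 2, max(float(x), 1e-9), search_hi)
    return x / scale, mu_lower / scale, mu_upper / scale


# Usage with made-up inputs (NOT the study data):
n_hat, lower, upper = estimate_wuhan_cases([45, 43, 26], [0.0010, 0.0008, 0.0006], d=10)
print(round(n_hat), round(lower), round(upper))
```

Pooling the regions before estimating means that only the total count and the total travel probability matter, which is why the function sums both inputs first.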
-
The number of confirmed cases in Wuhan reported by China’s NHC has increased rapidly in recent weeks. However, the 495 cases reported in Wuhan as of January 23, 2020 are still far below our estimate of 3,933. This may be due to insufficient medical resources in Wuhan and Hubei Province given the suddenness of the outbreak. We suggest boosting medical resources through specific measures, such as increasing the number of hospital beds to accommodate all fever patients with pneumonia or a severe respiratory disease in Wuhan, in order to expedite the virus examination process and allow the region to respond more adequately to this public health crisis.
-
Assume Day 1 is the date of infection of the very first case. Let $N_j$ denote the number of cases (including those in the incubation period) in Wuhan by Day $j$, $Y_j$ the number of cases traveling to region $l$ on Day $j$, $X_j$ the number of cases detected in region $l$ by Day $j$, $p$ the pre-defined probability of traveling to region $l$ described in Appendix B, and $d$ the mean time from infection to detection (here we suppress the index $l$ for conciseness). Then $Y_j$ follows the binomial distributions listed in Table 3 below. Note that from Day $d+1$ on, the number of trials in the binomial is no longer $N_j$ but $N_j-(N_{j-d}-Y_{j-d})$ under Assumption 2. Because $Y_{j-d}$ is relatively small compared with $N_{j-d}$, we drop $Y_{j-d}$ here for simplicity. Therefore,
| Date | Distribution | Period of $Y_i$ being detected |
|---|---|---|
| Day 1 | $Y_1 \sim \mathrm{Binomial}(N_1, p)$ | $Y_1$ is expected to be detected on Day $d+1$ |
| Day 2 | $Y_2 \sim \mathrm{Binomial}(N_2, p)$ | $Y_2$ is expected to be detected on Day $d+1$ and Day $d+2$ |
| $\vdots$ | $\vdots$ | $\vdots$ |
| Day $d$ | $Y_d \sim \mathrm{Binomial}(N_d, p)$ | $Y_d$ is expected to be detected between Day $d+1$ and Day $2d$ |
| Day $d+1$ | $Y_{d+1} \sim \mathrm{Binomial}(N_{d+1}-N_1, p)$ | $Y_{d+1}$ is expected to be detected between Day $d+2$ and Day $2d+1$ |
| $\vdots$ | $\vdots$ | $\vdots$ |
| Day $2d-1$ | $Y_{2d-1} \sim \mathrm{Binomial}(N_{2d-1}-N_{d-1}, p)$ | $Y_{2d-1}$ is expected to be detected between Day $2d$ and Day $3d-1$ |
| Day $2d$ | $Y_{2d} \sim \mathrm{Binomial}(N_{2d}-N_d, p)$ | $Y_{2d}$ is expected to be detected between Day $2d+1$ and Day $3d$ |

Table 3. Binomial distributions on Day $i$.
$$ \sum_{j=1}^{K} Y_j \sim \mathrm{Binomial}\left( \sum_{j=K-d+1}^{K} N_j,\; p \right), \quad K > d $$
However, note that $Y_j$ is not directly observed on Day $j$ or on any other single day; it is detected over the period listed in Table 3. For example, if $N_K$ is of interest, then $\sum_{i=1}^{K} Y_i$ needs to be calculated. All of $Y_1, \ldots, Y_K$ are included in $X_{K+d}$, but $\sum_{i=1}^{K} Y_i \le X_{K+d}$ because the observed $X_{K+d}$ also includes parts of $Y_{K+1}, \ldots, Y_{K+d-1}$. A straightforward but rough way to approximate $\sum_{i=1}^{K} Y_i$ is to use $X_{K+d/2}$. The other problem is that, with such a binomial model, what we can estimate is $\sum_{i=K-d+1}^{K} N_i$ rather than a single $N_i$. We suggest using $\sum_{i=K-d+1}^{K} N_i / d$ as an estimate of $N_{K-d/2}$, that is,
$$ X_{K+d/2} \sim \mathrm{Binomial}\left( d \times N_{K-d/2},\; p \right), \quad K > d $$
A binomial distribution can be approximated by a Poisson distribution if the number of trials is large while the probability of success is small. Hence,
$$ X_{K+d} \approx \mathrm{Poisson}\left( d \times p \times N_K \right), \quad K > d/2 $$
Including multiple regions in the model, we have
$$ X_{K+d,i} \approx \mathrm{Poisson}\left( d \times p_i \times N_K \right) \quad \text{for } i = 1, 2, \cdots, m, $$
and therefore,
$$ \sum_{i=1}^{m} X_{K+d,i} \sim \mathrm{Poisson}\left( d \times N_K \sum_{i=1}^{m} p_i \right) $$
where $m=25$ is the total number of regions used in our model. Note that if $p_i=p$, our model is almost identical to the previous model (5). The total number of cases on Day $K$, $N_K$, is estimated by its maximum likelihood estimate (MLE), that is
$$ \hat N_K = \frac{\sum_{i=1}^{m} X_{K+d,i}}{d \times \sum_{i=1}^{m} p_i} $$
and the corresponding $(1-\alpha)$ CI is derived using the relation between the Poisson distribution and the chi-square distribution (6):
$$ \left( \frac{\chi^2_{2\left(\sum_{i=1}^{m} X_{K+d,i}\right),\,\alpha/2}}{2 \times d \sum_{i=1}^{m} p_i},\; \frac{\chi^2_{2\left(\sum_{i=1}^{m} X_{K+d,i}\right)+2,\,1-\alpha/2}}{2 \times d \sum_{i=1}^{m} p_i} \right) $$
-
The daily probability of traveling from Wuhan to region i, pi, can be estimated using the ratio of daily volume of passengers to region i and the catchment population of Wuhan airport. Below are the details for obtaining daily volume of passengers to region i.
There were a total of 7,122 flights from Wuhan to 84 airports in Mainland China in the 30 days from December 22, 2019 to January 20, 2020, of which 6,586 flights were to the top 50 destinations, accounting for 6,586/7,122=92.47% of the total volume (7). Meanwhile, 854,383 seats on flights to the top 50 destinations were reported in IATA data for the 22 days between December 30, 2019 and January 20, 2020 (8). Hence, the average number of seats on a single flight can be estimated as 854,383/(6,586×22/30)=177. Over the Spring Festival/Lunar New Year, Wuhan airport is expected to handle 24,600 flights and 3.52 million passengers in 40 days (9); thus, each flight is expected to carry on average 3,520/24.6=143 passengers (both figures in thousands), which gives an average load factor for a flight departing from Wuhan of 143/177=0.81. Therefore, the total volume of air travel during the Spring Festival/Lunar New Year can be estimated as 854,383×0.81/0.9247/22×40=1.35 million. In addition, based on historical evidence, 15 million passengers are expected to depart Wuhan by rail, road, and air, 66% of whom are estimated to travel within 300 km (10). That would imply that, on average, 135/(1,500×0.34)=26.47% of trips longer than 300 km would be by air (both figures in units of 10,000 passengers). Therefore, the total passenger volume from Wuhan to other regions in Mainland China can be calculated as the number of seats×0.81/0.2647. Note that Hainan Province is a special case because of its geographical location: a majority of passengers from Wuhan to Hainan Province will likely travel by air. As a result, we use the number of seats×0.81 for Hainan Province. For other international regions, we use the estimate of 3,301 passengers per day given by the previous model (5).
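The chain of arithmetic above can be traced with a short Python sketch. The variable and function names are hypothetical, each intermediate value is rounded as in the text, and the 1.35 million air-travel figure is taken as quoted rather than recomputed.

```python
# Figures quoted in the text
flights_all = 7_122            # flights from Wuhan in 30 days
flights_top50 = 6_586          # flights to the top 50 destinations in 30 days
seats_top50_22d = 854_383      # seats to the top 50 destinations in 22 days

top50_share = flights_top50 / flights_all                        # about 0.9247
seats_per_flight = seats_top50_22d / (flights_top50 * 22 / 30)   # about 177
passengers_per_flight = 3_520_000 / 24_600                       # about 143

# Load factor, rounded the way the text rounds it (143/177)
LOAD_FACTOR = round(round(passengers_per_flight) / round(seats_per_flight), 2)

# Share of trips longer than 300 km made by air: 1.35 million air passengers
# (as quoted) out of the 34% of 15 million travelers assumed to go beyond 300 km
AIR_SHARE = round(1.35 / (15 * 0.34), 4)


def total_passengers(seats, mostly_air=False):
    """Scale seat capacity to total passenger volume (hypothetical helper).

    mostly_air=True reproduces the Hainan special case, where nearly all
    travelers are assumed to fly and no scaling by AIR_SHARE is applied.
    """
    by_air = seats * LOAD_FACTOR
    return by_air if mostly_air else by_air / AIR_SHARE


print(f"top-50 share: {top50_share:.4f}")
print(f"load factor: {LOAD_FACTOR}, air share: {AIR_SHARE}")
print(f"Guangdong passengers (from 111,624 seats): {total_passengers(111_624):,.0f}")
```

Dividing the resulting passenger volume by the number of days it covers and by the catchment population of Wuhan airport would then give the daily travel probability for a region; neither divisor is fixed in this sketch.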
Appendix A
Appendix B