- Research article
- Open Access
- Open Peer Review
Prevalence of hemorrhagic fever with renal syndrome in Yiyuan County, China, 2005–2014
BMC Infectious Diseasesvolume 16, Article number: 69 (2016)
Hemorrhagic fever with renal syndrome (HFRS) is highly endemic in mainland China, where human cases account for 90 % of the total global cases. Yiyuan County is one of the most serious affected areas in China. Therefore, there is an urgent need for monitoring and predicting HFRS incidence in Yiyuan to make the control of HFRS more effective.
The study was based on the reported cases of HFRS from the National Notifiable Disease Surveillance System. The demographic and spatial distributions of HFRS in Yiyuan were established. Then we fit autoregressive integrated moving average (ARIMA) models and predict the HFRS epidemic trend.
There were 362 cases reported in Yiyuan during the 10-year study period. The human infections in the fall and winter reflected a seasonal characteristic pattern of Hantaan virus (HTNV) transmission. The best model was ARIMA (2, 1, 1) × (0, 1, 1)12 (AIC value 516.86) with a high validity.
The ARIMA model fits the fluctuations in HFRS frequency and it can be used for future forecasting when applied to HFRS prevention and control.
Hemorrhagic fever with renal syndrome (HFRS), a rodent-borne disease caused by hantaviruses (family Bunyaviridae), is characterized by fever, acute renal dysfunction, and hemorrhage manifestations [1–3]. HFRS was first recognized in northeastern China in 1931 and has been prevalent in many other parts of China since 1955. At present, it is highly endemic in mainland China accounting for 90 % of the total cases reported in the world [4–7]. In response to the spread of HFRS in China, the Chinese Center for Disease Control and Prevention (CDC) established the National Notifiable Disease Surveillance System in 2004, which made the surveillance data for HFRS more accurate and comprehensive. A better understanding of the spatial distribution patterns and social demographic distribution characteristics of HFRS would help to identify areas and populations at high risk. Early warnings are also essential for controlling or reducing the risk of outbreaks , epidemic modeling and forecasting can be essential tools to prevent and control HFRS . In epidemiology, Autoregressive integrated moving average (ARIMA) models have been successfully applied to predict the incidence of infectious diseases, such as HIV , influenza , malaria incidence , and other infectious diseases [13–15].
This study aimed to establish the current situation of endemic HFRS in Yiyuan, and characterize its spatio-temporal distribution and demographic distribution characteristics. Furthermore, we fit ARIMA models and predict the HFRS epidemic trend by using SAS version 9.2 (SAS Institute, Cary, NC,USA). Our study was based on HFRS epidemic data from Yiyuan County, China, where it could provide a basis for HFRS prevention and control.
The study site is located in Yiyuan County (latitude 35°55′ ~ 36°23′ N and longitude 117°54′ ~ 118°31′ E), in the central part of Shandong province. Monthly HFRS cases reported during 2005–2014 were provided by Zibo CDC, and we were permitted to use the data. In China, HFRS is a nationally notifiable disease and hospital physicians must report every case of HFRS to the local health authority within 24 h.
The ethical approval was given by Ethics Review Committee of the Zibo Center for Disease Control and Prevention, and the study was conducted in compliance with the principles of the Declaration of Helsinki. Written informed consents for the use of their clinical samples were obtained from the patients and all analyzed data were anonymized. Besides, consents of participants who were under 16 years have been obtained from their parents/guardians.
Demographic distribution analysis
The demographic distribution characteristics including age, sex and occupation distribution of HFRS cases from 2005 to 2014 in Yiyuan County were analyzed according to surveillance data. All HFRS cases were geo-coded and matched to the town-level layers of polygon and point by administrative code using the software ArcGIS9.3 (ESRI Inc., Redlands, CA, USA). To alleviate variations of incidence in small populations and areas, annualized average incidence of HFRS per 100 000 at each town over the 10 year-period were calculated. Furthermore, annualized average incidences and the proportion of monthly average incidence for each town were mapped in gradient colors and pie charts, respectively. To approximately distinguish the dominant hantaviruses, we divided a year into three periods according to the seasonal distribution of the HFRS cases: March to June (in spring and early summer), July to August (in summer), and September to February (in autumn and winter).
ARIMA models are the most commonly used time series prediction models . We constructed ARIMA models for monthly HFRS incidence in Yiyuan from 2005 to 2014. ARIMA was designed to deal with highly seasonal data . An ARIMA (p, d, q) model comprises three types of parameters [9, 16, 17]: the autoregressive parameters (p), number of differencing passes (d), and moving average parameters (q). The multiplicative seasonal ARIMA (p, d, q) × (P, D, Q)s model is an extension of the ARIMA method to time series in which a pattern repeats seasonally over time [15, 16, 18]. Analogous to the simple ARIMA parameters, the seasonal parameters are: seasonal autoregressive (P), seasonal differencing (D), and seasonal moving average parameters (Q). The length of the seasonal period is represented by s. For example, the incidence of infectious disease varies in the annual cycle, so s = 12 in the present study.
We used the Box-Jenkins strategy to construct models. The ARIMA model procedure consists of three iterative steps [15, 17, 18]: identification, estimation, and diagnostic checking. Prior to fitting the ARIMA model, an appropriate difference of the series is usually performed to make the series stationary. Identification is the process of determining seasonal and non-seasonal orders using the autocorrelation functions (ACF) and partial autocorrelation functions (PACF) of the transformed data. Parameters in the ARIMA model(s) are estimated with the conditional least squares method after the identification step. At the diagnosis stage, the adequacy of the established model for the series is verified by employing white noise tests to check whether the residuals are independent and normally distributed. It is possible that several ARIMA models may be identified, and the selection of an optimum model is necessary. Such selection of models is usually based on the Akaike Information Criterion (AIC) and Schwartz Bayesian Criterion (SBC). Smaller AIC values indicate a better model, and the SBC considers the residual error, which is based on AIC. The lowest SBC value with a P value less than 0.05 was considered to be the best model . In addition, to check the accuracy of each model, root mean square error (RMSE) between the number of observed and fitted HFRS infections from 2005 to 2014 were calculated. A lower RMSE value indicates a better fit of the data. Finally, the fitted ARIMA model was used for short-term forecasting of the monthly HFRS incidence between January and December 2014. All analyses were performed using SAS 9.2 with a significant level of p < 0.05.
Descriptive analysis of HFRS in Yiyuan County
A total of 362 cases were reported in Yiyuan County during the 10-year study period. Of these, 65 % were male and 35 % were female, with the sex ratio (male vs. female) 1.85. Among these patients, 1 % were in children ≤14 years of age, 88 % were in persons 15–64 years of age, and 11 % were in persons ≥65 years of age. Regarding to occupation, 89 % of HFRS patients were farmers, 5 % were workers (mainly forestry workers, builders), and followed by students which accounted for 4 %. Poor housing conditions and high rodent density in rural areas seem to be responsible for most HFRS epidemics. The monthly distribution of HFRS cases was shown in Fig. 1, which indicated that the occurrence of HFRS presented significant seasonality.
Figure 2 showed annualized average incidence and the proportion of monthly average incidence for each town. Annualized average incidence at the town-level ranged from 2.58 to 12.78 per 100 000. Among the total 11 towns in Yiyuan, 7 towns were medium-endemic with an incidence between 5 and 15 per 100 000, an epidemic peak in the fall-winter season was mapped by the red color in the pies.
Time series data for HFRS covering 2005–2013 in Yiyuan were used as the training set and monthly data for 2014 were used as the test set (Fig. 3). The sequence diagrams and the seasonal characteristics of HFRS incidence indicated that the data series had a seasonal cycle every 12 months. On the basis of these characteristics, we eliminated the effect of seasonal trends by taking 1-order trend difference and 1-order seasonal difference. The transformed series showed far less dispersion than original series (Fig. 4). Plausible models, i.e., (the ARIMA (2, 1, 1) × (0, 1, 0)12, ARIMA (2, 1, 0) × (0, 1, 1)12, ARIMA (1, 1, 1) × (1, 1, 1)12 and ARIMA (2, 1, 1) × (0, 1, 1)12), were identified on the basis of autocorrelation functions (ACF) and partial autocorrelation functions (PACF) (Fig. 5), and were used for further analysis.
Parameter estimation and model testing
On the basis of parameter estimation and goodness of fit test statistics (Tables 1 and 2), we confirmed that the best model was ARIMA (2, 1, 1) × (0, 1, 1)12. The goodness-of-fit analysis showed that there was no significant autocorrelation between residuals at different lags.
We used ARIMA model (2, 1, 1) × (0, 1, 1)12 and time series data for 2005–2013 as the training set, and the data from January-December 2014 were used as the test set (Fig. 6, and Table 3). The predicted data for the actual data and the predicted data 95 % confidence limit for 2014 are shown in Table 3. The RMSE value was 3.56. The predicted data and the actual data were not perfectly matched, but the actual data fell within the predicted 95 % confidence interval.
In this study, our results demonstrated that the HFRS incidence had been increasing from 2009 to 2014 in Yiyuan County. The proportion of monthly average incidence for each town showed that HFRS in Yiyuan was mainly caused by HTNV. We applied multiplicative seasonal ARIMA (p, d, q) × (P, D, Q)s models to analyze the surveillance data of HFRS in Yiyuan, China. According to the results above, the ARIMA (2, 1, 1) × (0, 1, 1)12 model is reliable with a high validity, which can be used to predict the next one year’s HFRS incidence in Yiyuan.
Yiyuan, are mountainous and hilly with numerous Apodemus agrarius, and farmers have more chances to be exposed to contaminated urine and feces of infected rodents. Agricultural activities such as sleeping in the fields, irrigating, and working on the farmland during the autumn harvest season might have played a significant role in the occurrence of HFRS . Previous studies reported that the transmission of HTNV through Apodemus agrarius peaked in the winter, while Rattus norvegicus associated SEOV infections mainly occurred in the spring [21–24]. Thus, the human infections in Yiyuan in the fall and winter reflect a seasonal characteristic pattern of HTNV transmission. The forecast results suggest that the HFRS incidence in China will experience a slight growth in the next one year. A rise in the number of HFRS incidence may also result from an increase in the number and size of natural foci , climate change, especially the increase of mean temperature [20, 23]. Therefore, knowledge of HFRS forecasts is necessary to prompt health departments to strengthen surveillance systems and reallocate resources in anticipation of increasing HFRS incidence.
Epidemiological surveillance of communicable diseases is one of the most traditional health-related activities. Time-series analysis of incidence of various infections is extremely useful in developing hypotheses to explain and anticipate the dynamics of the observed phenomena and subsequently in the establishment of a quality control system and reallocation of resources . There are a number of methods applied for time series analysis including ARIMA model [8–13, 16], maximum entropy method (MEM) spectral analysis  and the autoregressive conditional heteroscedastic (ARCH) model . The ARIMA model has its advantages in time-series analyses. The secular trend, seasonal variation, and autocorrelation could all be easily controlled by difference, auto-regression, moving average, and seasonal functions without performing complicated transformations or using extra surrogate variables . Once a satisfactory model has been obtained, it can be used to forecast expected numbers of cases for a given number of future time intervals .
Besides, the application of GIS, together with time-series analyses in the present study, provides ways to quantify explicit HFRS and to further identify environmental factors responsible for the increasing disease risk. Although analyses are still preliminary, the findings can be helpful for generating hypothesis for further investigation. For example, based on the prediction results, the government can invest more health resources during high-risk periods and decrease it during low-risk periods to improve the cost-effectiveness of interventions and scheduling of resources. It can also be used to evaluate the effectiveness of public health interventions under varying assumptions by comparing actual HFRS incidence with expected incidence.
However, limitations should also be considered in this present study. The RMSE value was 3.56, and the actual data did not match the predicted data of the model perfectly. Due to a lack of time series data on the population densities of rodents, and the influencing factors, it is difficult to further uncover the probable causes and shifts of the characters of HFRS. Future researches are warranted to focus on the risk factors of HFRS to modify the ARIMA model such as rodent population densities, human activities, farming patterns, various socio-economic and environmental factors in Yiyuan.
In summary, the results of our study provide useful information on the prevailing epidemiological situation of HFRS in Yiyuan. We further confirmed the consensus that ARIMA model is a useful tool in monitoring and predicting changing trends in HFRS. The ARIMA model could be used to optimize HFRS prevention by providing short-term forecasting on the HFRS incidence. To control and prevent HFRS, a comprehensive preventive strategy including public health education and promotion, rodent control, surveillance, and vaccination should be implemented in Yiyuan.
Autoregressive integrated moving average
Hemorrhagic fever with renal syndrome
Schmaljohn CS, Dalrymple JM. Analysis of Hantaan virus RNA: evidence for a new genus of bunyaviridae. Virology. 1983;131(2):482–91.
Liu YX, Feng D, Zhang Q, Jia N, Zhao ZT, De Vlas SJ, et al. Key differentiating features between scrub typhus and hemorrhagic fever with renal syndrome in northern China. Am J Trop Med Hyg. 2007;76(5):801–5.
Cui F, Wang T, Wang L, Yang S, Zhang L, Cao H, et al. Spatial analysis of hemorrhagic fever with renal syndrome in Zibo City, China, 2009–2012. PLoS ONE. 2013;8(6):e67490.
Song G. Epidemiological progresses of hemorrhagic fever with renal syndrome in China. Chin Med J (Engl). 1999;112(5):472–7.
Yan L, Fang LQ, Huang HG, Zhang LQ, Feng D, Zhao WJ, et al. Landscape elements and Hantaan virus-related hemorrhagic fever with renal syndrome, People’s Republic of China. Emerg Infect Dis. 2007;13(9):1301–6.
Zuo SQ, Fang LQ, Zhan L, Zhang PH, Jiang JF, Wang LP, et al. Geo-spatial hotspots of hemorrhagic fever with renal syndrome and genetic characterization of Seoul variants in Beijing. China PLoS Negl Trop Dis. 2011;5(1), e945.
Simmons JH, Riley LK. Hantaviruses: an overview. Comp Med. 2002;52(2):97–110.
Li Q, Guo NN, Han ZY, Zhang YB, Qi SX, Xu YG, et al. Application of an autoregressive integrated moving average model for predicting the incidence of hemorrhagic fever with renal syndrome. Am J Trop Med Hyg. 2012;87(2):364–70.
Liu Q, Liu X, Jiang B, Yang W. Forecasting incidence of hemorrhagic fever with renal syndrome in China using ARIMA model. BMC Infect Dis. 2011;11:218.
Yu HK, Kim NY, Kim SS, Chu C, Kee MK. Forecasting the number of human immunodeficiency virus infections in the korean population using the autoregressive integrated moving average model. Osong Public Health Res Perspect. 2013;4(6):358–62.
Reichert TA, Simonsen L, Sharma A, Pardo SA, Fedson DS, Miller MA. Influenza and the winter increase in mortality in the United States, 1959–1999. Am J Epidemiol. 2004;160(5):492–502.
Gaudart J, Toure O, Dessay N, Dicko AL, Ranque S, Forest L, et al. Modelling malaria incidence with environmental dependency in a locality of Sudanese savannah area. Mali Malar J. 2009;8:61.
Luz PM, Mendes BV, Codeco CT, Struchiner CJ, Galvani AP. Time series analysis of dengue incidence in Rio de Janeiro. Brazil Am J Trop Med Hyg. 2008;79(6):933–9.
Yi J, Du CT, Wang RH, Liu L. [Applications of multiple seasonal autoregressive integrated moving average (ARIMA) model on predictive incidence of tuberculosis]. Zhonghua Yu Fang Yi Xue Za Zhi. 2007;41(2):118–21.
Zhang X, Zhang T, Young AA, Li X. Applications and comparisons of four time series models in epidemiological surveillance data. PLoS ONE. 2014;9(2), e88075.
Lin H, Lu L, Tian L, Zhou S, Wu H, Bi Y, et al. Spatial and temporal distribution of falciparum malaria in China. Malar J. 2009;8:130.
Sato RC. Disease management with ARIMA model in time series. Einstein (Sao Paulo). 2013;11(1):128–31.
Liu X, Jiang B, Bi P, Yang W, Liu Q. Prevalence of haemorrhagic fever with renal syndrome in mainland China: analysis of National Surveillance Data, 2004–2009. Epidemiol Infect. 2012;140(5):851–7.
Chatfield C. The analysis of time series: theory and practice. London: Chapman and Hall; 1975.
Bi P, Wu X, Zhang F, Parton KA, Tong S. Seasonal rainfall variability, the incidence of hemorrhagic fever with renal syndrome, and prediction of the disease in low-lying areas of China. Am J Epidemiol. 1998;148(3):276–81.
Zhang YZ, Zhang FX, Wang JB, Zhao ZW, Li MH, Chen HX, et al. Hantaviruses in rodents and humans, Inner Mongolia Autonomous Region, China. Emerg Infect Dis. 2009;15(6):885–91.
Kim YS, Ahn C, Han JS, Kim S, Lee JS, Lee PW. Hemorrhagic fever with renal syndrome caused by the Seoul virus. Nephron. 1995;71(4):419–27.
Fang LQ, Wang XJ, Liang S, Li YL, Song SX, Zhang WY, et al. Spatiotemporal trends and climatic factors of hemorrhagic fever with renal syndrome epidemic in Shandong Province, China. PLoS Negl Trop Dis. 2010;4(8):e789.
Chen HX, Qiu FX, Dong BJ, Ji SZ, Li YT, Wang Y, et al. Epidemiological studies on hemorrhagic fever with renal syndrome in China. J Infect Dis. 1986;154(3):394–8.
Li Q, Zhao W, Wei Y, Han X, Han Z, Zhang Y, et al. Analysis of incidence and related factors of hemorrhagic Fever with renal syndrome in hebei province, china. PLoS ONE. 2014;9(7):e101348.
Kuhn L, Davidson LL, Durkin MS. Use of Poisson regression and time series analysis for detecting changes over time in rates of child injury following a prevention program. Am J Epidemiol. 1994;140(10):943–55.
Sumi A, Kamo K, Ohtomo N, Mise K, Kobayashi N. Time series analysis of incidence data of influenza in Japan. J Epidemiol. 2011;21(1):21–9.
Techie Quaicoe M, Twenefour FB, Baah EM, Nortey EN. Modeling variations in the cedi/dollar exchange rate in Ghana: an autoregressive conditional heteroscedastic (ARCH) models. Springerplus. 2015;4:329.
Allard R. Use of time-series analysis in infectious disease surveillance. Bull World Health Organ. 1998;76(4):327–33.
This work was supported by Zibo Center for Disease Control and Prevention
The authors declare that they have no competing interests.
TW extracted the data, conducted the statistical analysis and drafted the manuscript. JL and YPZ conceived of the project concept, helped to interpret the results and modify the manuscript. FC helped to interpret the results. ZSH extracted the data. SYZ and LW conceived of the project concept, assisted with the data interpretation, and helped write the manuscript. All of the authors have read and approved the final manuscript.