We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.
Prevalent HBV point mutations and mutation combinations at BCP/preC region and their association with liver disease progression
BMC Infectious Diseasesvolume 10, Article number: 271 (2010)
Mutations in the basic core promoter (BCP) and its adjacent precore (preC) region in HBV genome are common in chronic hepatitis B patients. However, the patterns of mutation combinations in these two regions during chronic infection are less understood. This study focused on single base mutations in BCP and preC region and the multi-mutation patterns observed in chronic HBV infection patients.
Total 192 blood samples of chronic HBV infection patients were included. Direct PCR sequencing on the target region of HBV genome was successfully conducted in 157 samples. The rest 35 samples were analyzed by clone sequencing. Only the nucleotide substitutions with their frequencies no less than 10% were included in multi-mutation analysis with the exception for the polymorphic sites between genotypes B and C.
Five high frequency mutations (≥10%) were found in BCP and preC region. Thirteen types of multi-mutations in one fragment were observed, among which 3 types were common combinations (≥5%). The top three multi-mutations were A1762T/G1764A (36%), A1762T/G1764A/G1896A (11%) and T1753(A/C)/A1762T/G1764A/G1896A (8%). Patients with multi-mutations in viral genomes (≥3) were more likely to have liver cirrhosis or hepatocellular carcinoma (OR = 3.1, 95% CI: 1.6-6.0, P = 0.001). G1896A mutation seemed to be involved in liver disease progression independent of the patient age (OR = 3.6, 95% CI: 1.5-8.6; P = 0.004). In addition, patients with more viral mutations detected (≥3) were more likely to be HBeAg negative (OR = 2.7, 95% CI: 1.1-6.4; P = 0.027). Moreover, G1776A mutation was shown to contribute to HBeAg negativity in our study (OR = 8.6, 95% CI: 1.2-44.9; P = 0.01).
Patients with advanced liver diseases and with HBeAg negativity more likely have multi-mutations in HBV genomes but with different mutation combination patterns. G1896A mutation appears to be independent of infection history.
The basic core promoter (BCP, nt 1742-1849) and its adjacent precore (preC) region are crucial for replication of HBV. BCP binds various liver factors and preC forms ε structure in pregenomic RNA (pgRNA) as the encapsidation signal [1–3]. Changes in viral replication may influence the progression of liver diseases, particularly in fulminant hepatitis and acute exacerbation of chronic hepatitis [4, 5]. Mounting evidence has emerged to demonstrate that BCP and preC mutants are predisposed to severe and progressive liver diseases after HBV infection, causing an increased risk for hepatocellular carcinoma (HCC) [6–10]. For instance, mutations T1762/A1764 and A1899 have been reported to be independent risk factors for HCC , and T1653 and/or V1753 mutations are believed to promote the process of liver degradation . However, the association of these mutations with severe symptoms is manifested in certain populations but not in others [13, 14].
Studies have been shown that G1896A is involved in HBeAg negativity by introducing a stop codon in the preC region . Although the 1762T/1764A double mutation, commonly occurring in HBeAg-negative patients, was observed in vivo to suppress the production of preC mRNA independent of G1896A, recent in vitro research suggested other single site substitutions rather than these two may be responsible for the reduction of HBeAg expression [5, 16, 17]. Unknown mutations in this core promoter may impede the seroconversion of HBeAg during antiviral treatment .
In the BCP and preC regions, multi-substitutions further complicate mutation research. Triple core promoter mutations C1753T/A1762T/G1764A occurred more commonly in genotype C compared with genotype B . For genotype D, A1757 mutants were prone to accompany with the T1764/G1766 double mutation . In vitro experiments have shown multi-mutations may increase viral replication efficiency in Lamivudine resistant strains . However, the mutation combination in BCP and preC and its clinical significance are less understood in chronic HBV infection patients.
This study focused on substitutions in BCP and preC regions and their combinations in different stages of chronic HBV related liver diseases.
Patients and blood samples
A total of 192 chronic HBV infection patients were enrolled at You'an Hospital (Beijing, China) and Jinxiang County People's Hospital (Shandong Province, China) (Additional file 1, Table S1). A diagnostic workup was performed including physical examination, laboratory and or liver pathology according to the criteria suggested by Chinese Medical Association for Liver Diseases in 2005. Liver function test and serum HBV marker screening were conventionally conducted. No patient had co-infection with hepatitis C virus, hepatitis D virus, or human immunodeficiency virus. Basic patient characters have been summarized in Table 1. Blood samples (5 ml each) were collected, and cells and sera were then separated and stored at -20°C. The study was approved by the Ethics Committees of the institutions, and informed consent was obtained from all patients.
Serological HBV marker detection
Serological markers were detected by electrochemiluminescence immunoassay on a Roche E170 modular immunoassay analyzer following the manufacturer's protocols (Roche Diagnostics, Germany).
HBV DNA quantification
Real time PCR was performed to determine viral DNA titers using an FQ-PCR Kit for HBV (DaAn Gene Co., China) performed in a GeneAmp 5700 Sequence Detection System (PE Applied Biosystems, USA).
Viral DNA extraction
Viral DNA was extracted from 200 μL sera per sample using an AccuPrep Genomic DNA Extraction Kit (Bioneer, Korea) or QIAamp MinElute Virus Spin (Qiagen, Germany) as instructed in the manufacturer's manuals. All DNA samples were stored at -20°C before thawing for PCR.
All PCRs were performed in 20 μL or 50 μL reaction mixtures containing specimen DNA. PCR for amplification of target regions was performed with a hot start at 95°C for 150 s, followed by 35 cycles of denaturation at 94°C for 1 min, annealing at 58°C for 90 s, and elongation at 72°C for 3 min. All reactions were performed on a PTC-200 Peltier Thermal Cycler (MJ Research, USA). The primers employed were CP5: 5'-CTTCGTCTGCGAGGCGAGGG-3' (nt 2381-2400) and CP22: 5'-GAGACCACCGTGAACGCCCA-3' (nt 1611-1630).
PCR and clone sequencing
PCR products were purified with a Montage PCR96 column (Millipore, USA). The final DNA concentration in the sequencing reaction was 10 ng/μL. DNA sequencing was carried out on a Prism 3730 (ABI, USA). Contigs were assembled using SeqMan (DNASTAR, USA) and all the sequences were aligned by ClustalW for further analysis (all sequences analyzed in this paper have been submitted to GenBank) (Additional file 1, Table S3).
The PCR products were first purified with a Takara gel purification kit and were then ligated into pMD18 T vector (Takara Bio, Japan). Vectors were subsequently transfected into DH5α cells and white/blue colony selection was used to detect recombinant vectors. Inserts from positive clones were PCR amplified with primers RV-M/M13-47 according to the manufacturer's instructions to verify the target fragments.
The phylogenetic tree (Additional file 1, Figure S1) was built for all 157 sequences and 858 strains from 8 genotypes retrieved from the NCBI database, using Mega 4  and annotated by TreeDyn . All the fragments of BCP and preC were from whole genomes in the NCBI database with genotypes annotated.
Statistical analysis was performed using SPSS software (version 13.0; SPSS Inc, USA). Logistic regression was used for evaluating the roles of candidate clinical factors and viral mutations in liver disease progression and HBeAg negativity.
High prevalent mutations in BCP and preC regions
A total of 300 viral fragments from nt 1725 to 1900, covering the BCP and preC regions in the HBV genome, were analyzed. As shown in Table 1, direct sequencing resulted 157 fragments from patients of 13 asymptomatic carrier (ASC), 75 chronic hepatitis (CH), 44 liver failure (LF), and 11 hepatocellular carcinoma (HCC). LF, LC, and HCC were defined as advanced liver diseases (ALD). The rest 143 sequences came from clone sequencing in which 95 clones were from 17 samples of LC patients and 48 from 18 samples of HCC patients. Figure 1 illustrates the nucleotide substitution patterns identified by PCR sequencing. Similar mutation profiles were shown in samples collected in Beijing (136) and in a relatively isolated town 600 km south from Beijing (21). Mutations were rarely seen in gene overlapping regions. About 82% (128/157) sequences were genotype C and the rest were type B by clustering with reported sequences in NCBI database (Additional file 1, Figure S1). There were 12 nucleotide substitutions with their mutation rates over 5% (Additional file 1, Table S2). Further comparison of these sites with 233 genotype B and 311 genotype C of HBV sequences from NCBI database demonstrated that five were at the genotype specific positions and were therefore precluded from further analysis (Table 2). Interestingly, in three previously reported common mutations G1764A, A1762T, and G1896A in this region, the first two types were observed with very high prevalence in these samples (70% and 67%, respectively) (Additional file 1, Table S2).
G1896A mutation in liver disease progression
Next we analyzed all substitutions in the ALD group to reveal possible risk factors for liver disease progression. In univariate binary logistic regression analysis, all the top five high occurrence mutations seemed to relate to ALD, including T1753A/C (OR = 3.2, 95% CI: 1.3-7.9, P = 0.013), A1762T (OR = 3.1, 95% CI: 1.5-6.5, P = 0.003), G1764A (OR = 4.8, 95% CI: 2.1-10.9, P < 0.001), T1803A/G (OR = 5.1, 95% CI: 1-25.4, P = 0.058), and G1896A (OR = 2.4, 95% CI: 1.2-5.0, P = 0.015). However, in patients older than 35, age appears to be a significant factor in disease progression (OR = 5.5, 95% CI: 2.6-11.5, P < 0.0001), raising the question if the significance of above mutations in ALD was simply due to the accumulation of mutations during long infection history. Indeed subsequent multivariate binary logistic regression analysis indicated that only the mutation G1896A significantly correlated to the disease progression independent of age (P = 0.007, Table 3). Patients with the G1896A (mean age 40 ± 11) had similar average age with those without this mutation (mean age 38 ± 14) but more had ALD (FET, P = 0.005, Table 1).
G1776A statistically correlated to HBeAg negativity
Although having their viral DNA load detected (> 103 copies/ml), some patients in our study were shown to be HBeAg negative, suggesting their HBeAg negativity was not caused by the disappearance or reduction of viral replication. To identify possible mutations correlating to HBeAg negativity, clinical status and viral substitution patterns in BCP and preC were compared between HBeAg negative and positive groups. As shown in Table 4, HBeAg negative patients contained more mutations in viral genomes and were also prone to develop more severe liver diseases such as liver cirrhosis and carcinoma. Additionally, these patients seemed also to have higher levels of aspartate transaminase (AST) (chi-square test, P = 0.003).
We then did regression analysis for all common mutations (≥5%). Univariate binary logistic regression analysis showed significant correlation of several substitutions with the HBeAg negativity, including G1776A (OR = 8.1, 95% CI: 1.7-39; P = 0.009), A1846T (OR = 3.8, 95% CI: 1.2-11.8; P = 0.02), G1896A (OR = 3.5, 95% CI: 1.4-8.6; P = 0.007), and the number of individual point mutations ≥ 3 (OR = 2.8, 95% CI: 1.1-6.8; P = 0.027). By contribution of G1776A to HBeAg negativity was further verified in multivariate binary logistic regression analysis G1896A, a common mutation known to introduce a stop codon in e antigen, showed tendency in correlation to HBeAg negativity (P = 0.055; Table 5). Moreover, within a recently identified binding site for the transcription factor FXRalpha , G1776A was verified to associate with the HBeAg negativity significantly (P = 0.010). Therefore, nt 1776 appears to be a novel candidate corresponding to the loss of e-antigen.
Common multi-mutations in BCP and preC regions
Next we analyzed the effect of multi-mutations in target region. First we defined multi-mutations detected in a single fragment as the mutation combination or the combination in short. Five point mutations (1753, 1762, 1764, 1846, and 1896) with their rates larger than 10% were included in combination analysis. Nineteen categories containing wild type, single base mutations and observed combinations were resulted as illustrated in Figure 2. The top three combinations in all 192 patients were the double mutation A1762T/G1764A (36%), the triple mutation A1762T/G1764A/G1896A (11%), and the quadruple mutation T1753(A/C)/A1762T/G1764A/G1896A (8%).
While comparing the results of PCR sequencing and clone sequencing, we noticed an interesting phenomenon. All the mutation combinations observed in PCR sequencing were also detected in single strains obtained by clone sequencing, suggesting that point mutations tended to coexist in single genomes rather than to occur in various fragments among viral quasispecies (Figure 2). In addition, these combinations were also seen in NCBI sequences of both genotype B and C (Figure 3).
Multi-mutations in ALD patients
As shown in Figure 2, the distribution of above multi-mutations was different between ALD and other patients (ASC and CH) (FET, P < 0.0001). Patients with the combinations of more than 3 mutations were more likely to have ALD (OR = 3.1, 95% CI: 1.6-6.0, P = 0.001), although only one combination (T1753(A/C)/A1762T/G1764A) was dominant in these patients (FET, P = 0.025). Furthermore, ALD patients (mean age 46 ± 11) were older than others (mean age 33 ± 12; T-Test, P < 0.0001) and the patients with no less than 3 viral mutations (mean age 42 ± 12) were also older than the rest (mean age 36 ± 13) (P = 0.009), suggesting that the age, which usually represent the infection history appears to be a correlating factor in liver disease progression.
Candidate mutations responsible for the loss of e antigen
Previous studies have suggested high mutations in BCP and preC regions are major cause for the loss of HBeAg but without true disappearance or reduction of viral load. Thus far only G1896A, which leads a 2/3 truncated e antigen by a stop codon caused by mutation, was verified to be responsible for HBeAg negativity. In this study, we showed that G1776A is a new candidate mutation correlating to HBeAg negativity. Since nt1776 localizes at the binding site to transcription factor FXRalpha, which promotes the expression of e antigen , one possible mechanism for G1776A mutation is to interrupt normal transcription of preC region. As this is the first time proposing possible role of G1776A, further verification is needed in larger sample size.
Viral mutation accumulation in patients with long history of infection
Evidenced by the appearance of some mutations earlier than others during the infection, the accumulation of mutations in viral genome had been observed in previous studies . Our results suggested these accumulated mutations fell into two categories. Mutations in first type were adaptive to host system after long infection but had little effect in the disease progression. A feature of those mutations, including T1753A/C, G1764A and A1762T, is their significance as risk factors in disease progression only shown by univariate analysis but not in the multivairate model when patient age was considered. The second type, such as G1896A, has high occurrence in ALD patients and is independent from infection history, suggesting mutants carrying this substitution may lead to worsening symptoms once they appear.
The occurrence rates of T1764 and A1762 were very high in our study (70% and 67%, respectively). This is perhaps because the major path of HBV infection is vertical transmission in China and patients in their ages of 30 s are usually already with long infection history . The effects of G1762A and A1764T in disease progression had attracted attentions especially in genotypes B and C [27, 28], however our study suggests that the appearance of these mutations may just be a sign of long infection history and may not be very important in disease prognosis.
Common combinations of prevalent substitutions in BCP and preC
Few studies have been conducted about combination of common nucleotide substitutions, and here we present multi-mutations with high frequencies in patients with various diagnoses. Mutation combination appeared as a common type of variation in HBV genome regardless of genotypes. Despite many possible patterns, only 13 combinations were observed based on 5 common substitutions in our study. These limited multi-mutation patterns indicate only certain combinations may have selective advantage for viral functions which makes it possible to interrogate their effects in disease progression. However, limited samples for each mutation combination reduce the statistic power to dissect their distribution in patients with different diagnoses. Therefore, larger sample size should be considered in further investigation
Based on 192 patients from northern China, analysis of common mutations and their combinations were conducted in BCP and preC regions. G1896A was indicated to be associated with liver disease progression independent of the infection history. G1776A mutation was statistically responsible for HBeAg negativity. Mutation profiles of viral genomes were complicated in ALD patients and HBeAg negative patients. Common mutation combinations were observed by both PCR and clone sequencing, indicating their coexistences in single fragment.
Yuh CH, Chang YL, Ting LP: Transcriptional regulation of precore and pregenomic RNAs of hepatitis B virus. J Virol. 1992, 66 (7): 4073-4084.
Lee WM: Hepatitis B virus infection. N Engl J Med. 1733, 337 (24): 1745.
Seeger C, Mason WS: Hepatitis B virus biology. Microbiol Mol Biol Rev. 2000, 64 (1): 51-68. 10.1128/MMBR.64.1.51-68.2000.
Tsai WL, Lo GH, Hsu PI, Lai KH, Lin CK, Chan HH, Chen WC, Cheng JS, Liu YC, Huang TS, et al: Role of genotype and precore/basal core promoter mutations of hepatitis B virus in patients with chronic hepatitis B with acute exacerbation. Scand J Gastroenterol. 2008, 43 (2): 196-201. 10.1080/00365520701745693.
Jammeh S, Tavner F, Watson R, Thomas HC, Karayiannis P: Effect of basal core promoter and pre-core mutations on hepatitis B virus replication. J Gen Virol. 2008, 89 (Pt 4): 901-909. 10.1099/vir.0.83468-0.
Kao JH, Chen PJ, Lai MY, Chen DS: Basal core promoter mutations of hepatitis B virus increase the risk of hepatocellular carcinoma in hepatitis B carriers. Gastroenterology. 2003, 124 (2): 327-334. 10.1053/gast.2003.50053.
Tong MJ, Blatt LM, Kao JH, Cheng JT, Corey WG: Basal core promoter T1762/A1764 and precore A1896 gene mutations in hepatitis B surface antigen-positive hepatocellular carcinoma: a comparison with chronic carriers. Liver Int. 2007, 27 (10): 1356-1363. 10.1111/j.1478-3231.2007.01585.x.
Yuen MF, Tanaka Y, Shinkai N, Poon RT, But DY, Fong DY, Fung J, Wong DK, Yuen JC, Mizokami M, et al: Risk for hepatocellular carcinoma with respect to hepatitis B virus genotypes B/C, specific mutations of enhancer II/core promoter/precore regions and HBV DNA levels. Gut. 2008, 57 (1): 98-102. 10.1136/gut.2007.119859.
Fang ZL, Sabin CA, Dong BQ, Ge LY, Wei SC, Chen QY, Fang KX, Yang JY, Wang XY, Harrison TJ: HBV A1762T, G1764A mutations are a valuable biomarker for identifying a subset of male HBsAg carriers at extremely high risk of hepatocellular carcinoma: a prospective study. Am J Gastroenterol. 2008, 103 (9): 2254-2262. 10.1111/j.1572-0241.2008.01974.x.
Wang Z, Tanaka Y, Huang Y, Kurbanov F, Chen J, Zeng G, Zhou B, Mizokami M, Hou J: Clinical and virological characteristics of hepatitis B virus subgenotypes Ba, C1, and C2 in China. J Clin Microbiol. 2007, 45 (5): 1491-1496. 10.1128/JCM.02157-06.
Chen CH, Changchien CS, Lee CM, Hung CH, Hu TH, Wang JH, Wang JC, Lu SN: Combined mutations in pre-s/surface and core promoter/precore regions of hepatitis B virus increase the risk of hepatocellular carcinoma: a case-control study. J Infect Dis. 2008, 198 (11): 1634-1642. 10.1086/592990.
Tanaka Y, Mukaide M, Orito E, Yuen MF, Ito K, Kurbanov F, Sugauchi F, Asahina Y, Izumi N, Kato M, et al: Specific mutations in enhancer II/core promoter of hepatitis B virus subgenotypes C1/C2 increase the risk of hepatocellular carcinoma. J Hepatol. 2006, 45 (5): 646-653. 10.1016/j.jhep.2006.06.018.
Abbas Z, Muzaffar R, Siddiqui A, Naqvi SA, Rizvi SA: Genetic variability in the precore and core promoter regions of hepatitis B virus strains in Karachi. BMC Gastroenterol. 2006, 6: 20-10.1186/1471-230X-6-20.
Rezende RE, Fonseca BA, Ramalho LN, Zucoloto S, Pinho JR, Bertolini DA, Martinelli AL: The precore mutation is associated with severity of liver damage in Brazilian patients with chronic hepatitis B. J Clin Virol. 2005, 32 (1): 53-59. 10.1016/j.jcv.2004.08.001.
Carman WF, Jacyna MR, Hadziyannis S, Karayiannis P, McGarvey MJ, Makris A, Thomas HC: Mutation preventing formation of hepatitis B e antigen in patients with chronic hepatitis B infection. Lancet. 1989, 2 (8663): 588-591. 10.1016/S0140-6736(89)90713-7.
Du H, Li T, Zhang HY, He ZP, Dong QM, Duan XZ, Zhuang H: Correlation of hepatitis B virus (HBV) genotypes and mutations in basal core promoter/precore with clinical features of chronic HBV infection. Liver Int. 2007, 27 (2): 240-246. 10.1111/j.1478-3231.2006.01400.x.
Laras A, Koskinas J, Hadziyannis SJ: In vivo suppression of precore mRNA synthesis is associated with mutations in the hepatitis B virus core promoter. Virology. 2002, 295 (1): 86-96. 10.1006/viro.2001.1352.
Chen CH, Lee CM, Lu SN, Changchien CS, Wang JC, Wang JH, Hung CH, Hu TH: Comparison of sequence changes of precore and core promoter regions in HBeAg-positive chronic hepatitis B patients with and without HBeAg clearance in lamivudine therapy. J Hepatol. 2006, 44 (1): 76-82. 10.1016/j.jhep.2005.08.022.
Huang YH, Wu JC, Chang TT, Sheen IJ, Huo TI, Lee PC, Su CW, Lee SD: Association of core promoter/precore mutations and viral load in e antigen-negative chronic hepatitis B patients. J Viral Hepat. 2006, 13 (5): 336-342. 10.1111/j.1365-2893.2005.00688.x.
Sendi H, Mehrab-Mohseni M, Zali MR, Norder H, Magnius LO: T1764G1766 core promoter double mutants are restricted to Hepatitis B virus strains with an A1757 and are common in genotype D. J Gen Virol. 2005, 86 (Pt 9): 2451-2458. 10.1099/vir.0.81023-0.
Tacke F, Gehrke C, Luedde T, Heim A, Manns MP, Trautwein C: Basal core promoter and precore mutations in the hepatitis B virus genome enhance replication efficacy of Lamivudine-resistant mutants. J Virol. 2004, 78 (16): 8524-8535. 10.1128/JVI.78.16.8524-8535.2004.
Guideline on prevention and treatment of chronic hepatitis B in China (2005). Chin Med J (Engl). 2007, 12 (24): 2159-2173.
Kumar S, Nei M, Dudley J, Tamura K: MEGA: a biologist-centric software for evolutionary analysis of DNA and protein sequences. Brief Bioinform. 2008, 9 (4): 299-306. 10.1093/bib/bbn017.
Chevenet F, Brun C, Banuls AL, Jacq B, Christen R: TreeDyn: towards dynamic graphics and annotations for analyses of trees. BMC Bioinformatics. 2006, 7: 439-10.1186/1471-2105-7-439.
Ramiere C, Scholtes C, Diaz O, Icard V, Perrin-Cocon L, Trabaud MA, Lotteau V, Andre P: Transactivation of the hepatitis B virus core promoter by the nuclear receptor FXRalpha. J Virol. 2008, 82 (21): 10832-10840. 10.1128/JVI.00883-08.
Kajiya Y, Hamasaki K, Nakata K, Miyazoe S, Takeda Y, Higashi S, Ohkubo K, Ichikawa T, Nakao K, Kato Y, et al: A long-term follow-up analysis of serial core promoter and precore sequences in Japanese patients chronically infected by hepatitis B virus. Dig Dis Sci. 2001, 46 (3): 059-515. 10.1023/A:1005582812466.
Utsumi T, Yano Y, Truong BX, Tanaka Y, Mizokami M, Seo Y, Kasuga M, Kawabata M, Hayashi Y: Molecular epidemiological study of hepatitis B virus infection in two different ethnic populations from the Solomon Islands. J Med Virol. 2007, 79 (3): 229-235. 10.1002/jmv.20791.
Mohebbi SR, Amini-Bavil-Olyaee S, Zali N, Noorinayer B, Derakhshan F, Chiani M, Rostami Nejad M, Antikchi MH, Sabahi F, Zali MR: Molecular epidemiology of hepatitis B virus in Iran. Clin Microbiol Infect. 2008, 14 (9): 858-866. 10.1111/j.1469-0691.2008.02053.x.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2334/10/271/prepub
This research was supported by Science and Technology Major Projects of "AIDS and viral hepatitis prevention and treatment of major infectious diseases" (2009ZX10004-109) and Knowledge Innovation Program of the Chinese Academy of Sciences (KSCX2-SW-207) to CZ, Capital Scientific Research Foundation for the Development of Medical Sciences (2007-1021) Municipal Commission of Education Project (Beijing, KM200710025024) and Major Projects on Infectious Disease (2008ZX1002-015) to HD.
The authors declare that they have no competing interests.
The original data process and computation, and manuscript preparation were undertaken by DZ. SM, XZ and HZ collected samples and helped with sequencing experiments and data analysis. HD and CZ directed the project and prepared the bulk of the manuscript. All the authors read and approved the final manuscript.
Dake Zhang, Sufang Ma, Xin Zhang contributed equally to this work.