- Research article
- Open Access
- Open Peer Review
Role of MIRU-VNTR and spoligotyping in assessing the genetic diversity of Mycobacterium tuberculosis in Henan Province, China
BMC Infectious Diseasesvolume 18, Article number: 447 (2018)
Tuberculosis remains a serious threat to human health as an infectious disease in China. Henan, a most populated province in China, has a high incidence of tuberculosis (TB). Though the genetic diversity of Mycobacterium tuberculosis (MTB) has been investigated in many regions, there have been only a few studies on the molecular characteristics and drug resistance phenotypes in Henan. This is the first study on the genetic profile of MTB from Henan.
A total of 668 MTB isolates from various areas were genotyped with spoligotyping and 26-locus MIRU-VNTR (classical 24-locus MIRU-VNTR and 2 other loci). The association between TB spoligotype signatures and drug-resistant profiles was analysed.
Our data revealed that MTB isolates circulating in Henan had a high degree of genetic variation. The Beijing family was the most predominant genotype (83.53%,n = 558), and the typical Beijing type(ST1) was the major sublineage (81.73%,n = 546). In total,668 isolates were divided into 567 different types, forming 38 clusters (2–15 isolates per cluster), and 529 unique types by 26-locus MIRU-VNTR analysis. There was no correlation between the Beijing family and gender, age at diagnosis or treatment history, whereas the Beijing family was significantly associated with all four first-line drug resistance and multidrug-resistant phenotypes. For these samples, 15 of 26 MIRU-VNTR loci had high or moderate discriminatory power according to the Hunter-Gaston discriminatory index. A combination of the 10 most polymorphic loci had similar discriminatory power as the 26-locus set.
The Beijing genotype is the most prevalent family. Ten-locus MIRU-VNTR in combination with spoligotyping can efficiently classify the molecular type of MTB in Henan Province.
Tuberculosis (TB), caused by Mycobacterium tuberculosis(MTB), remains a significant public health problem worldwide. It is estimated that approximately one-third of the world’s population has been infected with MTB, and 1.8 million people die of this disease annually. Among the 22 high TB burden countries reported by the World Health Organization, China ranks second in the world with approximately 1.3 million new cases [1, 2].
Recently, molecular epidemiology tools have been used to assess risk factors associated with recent transmissions ,to track infection transmission dynamics, to distinguish relapse or reinfection and to detect suspected outbreaks; therefore, these tools play a critical role in tuberculosis research and control. MTB molecular markers promoted the development of reproducible genotyping methods , including insertion sequence 6110 (IS6110), restriction fragment length polymorphism (RFLP) typing , spacer oligonucleotide typing (spoligotyping) , single nucleotide polymorphism analysis ,mycobacterial interspersed repetitive unit variable number tandem repeats (MIRU-VNTRs) assessment , large sequence polymorphism (LSP) typing , and genome  sequence analysis. IS6110-RFLP has been the gold standard for genotyping MTB since 1993, but this procedure is time consuming, technically demanding and labour intensive. This method also requires about one microgram of high-quality DNA. Moreover, the discriminating efficiency of this method is insufficient for strains harbouring low copy numbers of IS6110.The Beijing genotype strains exhibit highly similar RFLP patterns, and therefore, discrimination among them is difficult. In addition, rapid and inexpensive genotyping methods based on PCR, such as MIRU-VNTR and spoligotyping, have been effectively used to investigate the genetic relationships and epidemiological characteristics of Beijing strains.
Non-coding regions of the MTB genome contain a set of identical 36-bp direct repeats (DRs), which are separated by 35- to 41-bp unique DNA spacer sequences. Spoligotyping, a rapid and highly reproducible method, detects the presence or absence of DR loci . The results can be represented in a simple binary format that enables the construction of large-scale databases .Therefore, it is considered the gold standard for identifying the Beijing family strains, which have lack spacers 1 to 33 and harbour spacers 34–43 in the DR region [11, 13]. Unfortunately, spoligotyping remains less discriminatory, especially in regions with a high prevalence of Beijing isolates .The discriminatory power is improved when spoligotyping is combined with VNTR.
MIRU-VNTR, a new PCR-based typing method, determines the size and repeated number of units in each locus by amplifying mycobacterial interspersed repetitive units. Easy operation, economical cost, reproducible results and high discriminatory power make it practical for routine use , and the digital results from this method can be compared and exchanged easily between different laboratories [16, 17]. Twelve-locus MIRU-VNTR has been widely used in most cases but has lower discrimination for the Beijing family . Nevertheless, the 24- and 15-locus sets effectively improved discrimination compared with the initial 12-locus set .
In China, more than 80% of tuberculosis patients are in rural communities . Henan Province, the most highly populated province in China, has a significantly higher proportion of the population living in rural areas. Therefore, the epidemic situation of tuberculosis in Henan remains severe. The numbers of both TB and drug-resistant TB patients in Henan are larger than those in any other province, and tuberculosis and HIV co-infection make the bad situation worse. Thus, study of a M.tuberculosis transmission model can help determine risk factors and improve contact tracing. Moreover, little was known about the genetic diversity of MTB in this region until now. The study is the first to use26-locus MIRU-VNTR, including the standard 24-locus and two other loci (ETRF and Mtub38) [21, 22]},for assessments in Henan. In this study, we carried out spoligotyping and MIRU-VNTR to classify 668 representative strains from17 cities. The objective of this study was to assess the diversity of MTB circulating in Henan with higher discrimination and to analyse the probable association between drug resistance profiles and genotypes.
M. tuberculosis clinical isolate collection
In total, 668 strains of MTB were collected from smear-positive pulmonary TB patients in the Tuberculosis Control Institution from various regions of Henan Province during 2015. The patients were from the following regions: 70 from Zhengzhou, 70 from NanYang, 65 from Zhoukou, 50 from Zhumadian, 50 from Luoyang,50 from Kaifeng, 50 from Shangqiu, 45 from Xinyang, 40 from Xinxiang, 35 from Pingdingshan, 35 from Anyang, 20 from Puyang, 19 from Luohe, 18 from Xuchang,18 from Jiaozuo,18 from Sanmenxia, 15 from Hebi, and none from Jiyuan. M. tuberculosis H37Rv was used as the control strain.
This project was approved by the Ethics Review Committee of Henan CDC. All the patients with pulmonary TB provided informed consent before participation in this investigation. Ethics were respected throughout the whole study period.
Genomic DNA extraction
All the isolates were cultured in Lowenstein-Jensen (L-J) medium for 3–4 weeks at 37 °C. A loopful of colonies was added to 300 μl of TE buffer at pH 8.0. The suspension of bacterial cells was incubated at 85 °C for 30 min to inactivate pathogens, followed by centrifugation at 13,000 g for 5 min. The supernatant containing DNA was used as a template for PCR.
These collected isolates were subjected to spoligotyping on commercially available membranes according to a previously described standard protocol [23,24,25]. Briefly, the direct repeat (DR) regions were amplified with biotin-labelled Dra and Drb primers , and then, the amplicons were hybridized with a nylon membrane that covalently bound a set of 43 oligonucleotide probes . The hybrid membranes were washed with SSPE buffer containing 0.5%SDS and incubated with streptavidin peroxidase conjugate. The results were visualized with the ECL system. The spoligotyping results in octal format were compared with those in the international spoligotyping database SITVIT2 to assign the Spoligotype (or Shared) International Type(SIT) codes .
MIRU-VNTR typing based on 26 loci, including the standard 24 loci  and 2 other loci, i.e., ETRF and Mtub38 , was performed to determine genetic relationships among isolates in our study. First, each locus was amplified individually as previously described . Then, the PCR products were detected in a 1.5% agarose gel using a 50 bp DNA ladder as the molecular weight standard. The number of tandem repeats was calculated based on the length of the repeat and flank sequences for each locus. The PCR products of H37Rv were loaded to ensure accuracy and the PCR products of sterile water were used to control for reagent contamination. To visualize evolutionary relationships among these clinical isolates, the resulting data were analysed by BioNumerics 6.6 as a characteristic data set. The results of MIRU-VNTR were analysed to construct a dendrogram based on the UPGMA algorithm. The discriminatory power of each locus was evaluated using the Hunter and Gaston Discriminatory Index (HGDI) .The Hunter-Gaston Index (HGDI) was calculated with the equation:
Strain isolation and drug susceptibility test
Epidemiological information, such as age, sex, and clinical treatment history of the patients, was collected with a questionnaire method.
The sputum samples of clinical patients were isolatedand cultured using Lowenstein-Jensen (L-J) media. Then, the drug susceptibility tofirst- and second-line anti-TB drugs was determined using the proportion method with H37Rv as a control . The drug concentrations in L-J media were as follows: isoniazid (INH) 0.2 μg/ml, rifampicin (RIF) 40 μg/ml, ethambutol (EMB) 2 μg/ml, streptomycin (SM) 4 μg/ml, kanamycin (KM) 3 μg/ml, ofloxacin (OFX) 2 μg/ml, capreomycin (CPM) 40 μg/ml, p-aminosalicylic acid(PAS) 1 μg/ml,and prothionamide (Pto) 40 μg/ml. MDR-TB (multidrug resistance-TB) is defined as an isolate that isresistant to both isoniazid and rifampicin. Extensivedrug-resistant tuberculosis (XDR-TB) is defined as an MDR isolate that isalso resistant to any fluoroquinolone and at least one of three injectable second-line drugs, such as KM or CPM. In addition, MDR isolates resistant to only OFX or KM are defined as pre-XDR-TB }.
The Hunter-Gaston discriminatory Index (HGDI) was calculated as previously described to assess allelic diversity at VNTR loci . The data were analysed with SPSS 19.0 software. The statistical associations between genotype and drug susceptibility, age, gender, and treatment history were detected using a Chi-squared test or Fisher’s exact test. A 5% level of significance (p ≤ 0.05) was considered statistically significant.
In this study, 668 isolates were clustered into 35 distinct genotypes by spoligotyping. The octal coded spoligotyping results were assigned spoligotyping shared international type (SIT) numbers and then compared with those in spolDB4  and theupdated SITVIT  databases. In total, 643 isolates grouped into 10 clusters comprising 2 to 546 isolates, while the other 25 isolates showed an orphan spoligotyping pattern. Among all 668 isolates, 8(1.19%) could not be matched to any determined patterns in either database and thus were labelled “unknown”. The Beijing family genotypes strongly predominated in our area, accounting for 558/668 (83.53%) of the isolates, followed by the T1 family with 59/668isolates (8.83%) and the MANU2 family with 20/168isolates (2.99%). The genotypes Beijing, T1 and MANU2 were dominant, accounting for 95.35% of the total. Among the 558 Beijing isolates, 546 showed the typical Beijing family pattern: the absence of the first 34 spacers and the presence of spacers 35 to 43 . The remaining 12 isolates lacked one or more spacers that are present in the typical Beijing pattern and thus were classified as Beijing-like or an atypical Beijing family. The other 110 non-Beijing isolates were subdivided into 28 lineages. The high clustering rate (94.7%) of spoligotyping was in concordance with the lower discrimination for the Beijing family (Table 1).
To evaluate the genetic relations among all 668 isolates, the minimum spanning tree was generated using BioNumerics 6.6 software. Each node represents a distinct genotype, and the size of each node depends on the number of the corresponding cluster . These isolates were divided into two major lineages, the Beijing family and the non-Beijing family. The largest cluster contained 546 typical Beijing family isolates (SIT1) with small clusters of the atypical Beijing family around it. The two larger clusters on the right comprised the T1 and MANU2 families and were surrounded by other non-Beijing families and 8 unknown genotypes. Therefore, the results suggested that the unknown genotyped isolates maybe genetically closer to the non-Beijing family (Fig. 1).
The analysis of 26-locus MIRU-VNTR showed that 139 isolates (23.9%) grouped into 38 clusters, and the remaining 529 isolates shared unique patterns. The largest cluster contained 15 strains, and the other clusters consisted of 2–14 isolates. The clustering rate of 26-locus MIRU-VNTR was 15.1%. There was some discrepancy between the spoligotyping and 26-locus MIRU-VNTR for the Beijing family isolates. The largest cluster of the MIRU-VNTR mainly included Beijing family isolates; however, 33 isolates belonged to the non-Beijing family patterns in this largest clade (Additional file 1: Figure S1). This finding might be due to mixed infection of two different TB isolates in these patients.
To evaluate the allelic diversity of 26-locus MIRU-VNTR among the isolates in this study, we calculated the Hunter-Gaston discriminatory index (HGDI) for each locus. As previously reported, MIRU-VNTR loci were further designated highly (> 0.6), moderately (0.3–0.6), or poorly (< 0.3) discriminatory loci according to the HGDI scores . These loci in our study exhibited significantly different discriminatory powers with various HGDI scores from 0.767 for Qub11b to 0.015 for MIRU24 (Table 2). Among all the strains, four loci were designated highly discriminatory loci. Qub11b had the best discriminatory power(HGDI = 0.767), followed by Mtub21 (HGDI = 0.645), Miru26 (HGDI = 0.616), and Qub26 (HGDI =0.608). Eleven loci (Mtub04, MIRU10, ETRE, MIRU39, ETRA, MIRU40, Qub4156c, Mtub39,Mtub30, ETRF, and Mtub38) showed moderate discriminatory power, and the remaining loci were less discriminative, with an HGDI ranging from 0.015 to 0.179. The discriminatory power (HGDI) of all the loci sets reached 0.998.
We further compared the allelic diversity between the Beijing family and the total isolates. All 26 loci showed lower allelic diversity among the Beijing family isolates, which was in accordance with the closer affinity of this genotype (Table 2).
To evaluate the discriminatory power of different locus sets by MIRU-VNTR techniques, we compared the performance of this 26-locus set with the 24-locus, 15-locus and 12-locus sets coupled with spoligotyping (Table 3). These 26-locus, 24-locus and 15-locus sets obviously improved the performance compared with the initial 12-locus set, especially in combination with spoligotyping. The 12-locus set of MIRU-VNTR alone generated 263 genotypes, and the combination of 12-locus MIRU-VNTR and spoligotyping increased the number of genotypes to 292, resulting in HGDI values of 0.92 and 0.94, respectively. The 15-locus set generated 501 genotypes, and 15 clusters were subdivided in combination with spoligotyping, generating 516 patterns. The discriminatory power of the 15-locus set alone and combined with spoligotyping was 0.996 and 0.997, respectively. While the combination of the 24-locus set and spoligotyping yielded 561 genotypes with an HGDI value of 0.998, the 26-locus set with spoligotyping differentiated all the isolates into 576 different genotypes (HGDI = 0.998). In conclusion, both the 15-locus and 24-locus sets appeared to provide a discrimination power close to that of the 26-locus set.
To reduce the economic cost and lighten the labour load for high-throughput genotyping, we aimed to determine a minimal set of VNTR loci to analyse the isolates in Henan. We evaluated and compared the cumulative HGDI by successively adding each locus (Table 4). The 22-locus set produced the same cumulative HGDI and clustering rate as the 23-locus set (HGDI = 0.9983, clustering rate = 15.4%), and these two values of the 24- and 25-locus sets were also equal (HGDI = 0.9984, clustering rate = 15.2%). We identified the top 10 most discriminatory loci (qub-11b, mtub21, miru26, qub-26, mtub04, miru10, ETRF, ETRE, miru39 and ETRA) with a cumulative HGDI of 0.996. The combination of the top 10 loci showed adiscriminatory power close to that of the 26-locus set. There was no significant improvement in the cumulative HGDI upon adding other loci (Table 4).
Relationship between the Beijing family and drug resistance and sociodemographic characteristics
To assess the correlation between levels of drug resistance and genetic patterns, susceptibility to the four first-line and five second-line anti-TB drugs was determined with the proportional method. A total of 202 isolates (30.3%) were resistant to at least one drug, 50 (7.5%) isolates were classified as MDR-TB, and 30 isolates (4.5%) were susceptible to all four first-line drugs. As shown in Table 5, among the 50 MDR isolates, 47 showed the Beijing family genotype, and 3 showed the non-Beijing family genotype. In total, 8.4% (47/558) of the Beijing family genotype isolates were MDR, and 2.7% (3/110) of the non-Beijing family genotype isolates were MDR. For the Beijing family genotype, the percentage of isolates resistant to four first-line drugs was 5.19% (29/558), while only 0.91%(1/110) of the non-Beijing family isolates were resistant to four first-line drugs. Thus, the results revealed a significantly higher proportion of MDR among the Beijing family than among the non-Beijing family (OR 3.28, 95%CI 1.01–10.37, p = 0.038), and the Beijing family also showed a higher risk for developing drug resistance to all four first-line drugs (OR 5.97, 95%CI 1.05–44.33, p = 0.045).There was no apparent difference in the proportions of other drug-resistant profiles between the Beijing and non-Beijing family genotypes (Table 5).
Then, the sociodemographic characteristics of patients, including gender, age at diagnosis, and clinical history, were compared between the Beijing and non-Beijing family genotypes, and no statistically significant differences were detected (Table 6).
The Beijing family genotype remains the predominant genotype in China ; however, the proportion of patients carrying this genotype varies in different regions . Henan Province has a high incidence of tuberculosis, but little is known about the genetic background of M. tuberculosis in this region. This is the first study to investigate the allelic diversity of MTB isolates in Henan Province using spoligotyping and MIRU-VNTR.
M. tuberculosis is divided into 162 clades according to the international spoligotyping database SpolDB4 , and the Beijing family is regarded as the most important genetic pattern of the East Asian clade . The pattern is predominant in China, and its proportion is greater in northern China than in southern China. The prevalence rate of the Beijing family is higher in Beijing (82–92.6%) [31, 33], Tianjin (91.7%) , Tibet (96.3%) [33, 35], Inner Mongolia (93.3%) , Heilongjiang (89.5%) ,and Gansu(87.5%)  but lower in Guangdong(25%) , Guangxi (55.3%)  and Fujian(54.5–55.1%) [33, 40]. These results showed that 83.5% of the MTB isolates belonged to the Beijing family, indicating that this genotype was the most predominant genotype in our region, which was consistent with the results of previous studies. The higher prevalence of the Beijing family might be associated with customs influenced by climate . The non-Beijing family lineage included the T1, T2, T3, MANU1, MANU2, S, LAM3,S/convergent, LAM3/S and U genotypes, indicating genotypic polymorphism among MTB strains in this area. Among the non-Beijing family isolates, 59 were in the T1 family (53.6%), 20 were in the MANU2 family (18.2%), 10 were in the T3 family (9.1%), and 7 were in the T2 family (6.4%); these genotypes have also been observed in other regions of China, albeit at different proportions [35,36,37,38,39,40]. This study also identified eight new spoligotype isolates. They were divided into different clusters and were derived from patients in different regions, indicating there might not show an epidemiologic relationship. The larger number of small gene clusters could potentially reflect a recent transmission. The higher rate of small genotypic clusters for the Beijing family suggested its predominance in recent transmissions.
The prevalence of the Beijing genotype is apparently higher worldwide, but very little is known about the reasons for its efficient transmission. Previous studies have suggested that this genotype is associated with drug resistance and shows increased virulence in animal models  and enhanced reproductive fitness . Overproduction of polyketide synthase-derived phenolic glycolipid (PGL) by the Beijing family inhibits the release of pro-inflammatory cytokines, thus enhancing the infective success [43, 44]. The Beijing family has a strong association with drug resistance, indicating that this family might be predisposed to acquiring resistance  and thereby showing increased transmission of drug-resistant M.tuberculosis. However, there has been a discrepancy in different research results for the non-Beijing family because this family includes various subtypes. There is a discrepancy in the relationship between the Beijing lineage and TB outbreaks in a variety of geographic locations [23, 45,46,47,48,49,50,51,52].To analyse the relationship between the Beijing family and drug resistance, we compared the proportion of Beijing genotypes in different drug susceptibility profiles, and the results showed that resistance to all four first-line drugs was significantly higher in the Beijing family. The Beijing family isolates had a higher resistance rate to INH, RIF and MDR in Ukraine . Similarly, the Beijing family also had a close association with INH, RIF, SM and MDR resistance in Central Asia . In agreement with previously reported data, our data revealed that the Beijing genotype showed a greater correlation with MDR-TB phenotypes than did other non-Beijing genotypes. The long-term reciprocal co-evolution between host and bacterium might affect the prevalence of the Beijing genotype [55, 56], thus, we estimated the correlation between the Beijing family and epidemiological features, including sex, age and treatment status. Previously, some studies revealed that the Beijing genotype strains are generally associated with young age  and a higher rates of treatment failure and relapse than other strains [58, 59], but in this study, there was no association between the prevalence of Beijing genotypes and gender, age or clinical treatment history of patients. Therefore, it is necessary to further explore the effect of demographic factors on the genetic diversity of M. Tuberculosis with a larger sample size in our area.
Spoligotyping is an efficient genotyping technique that can classify the MTB lineage, but it cannot effectively distinguish Beijing family isolates due to lower discriminatory power . In this study, 668 samples were successfully classified into 35 distinct genotypes, including 10 clusters and 25 unique spoligotypes. Due to the low resolution of spoligotyping, we applied another typing method based on MIRU-VNTR to further phylogenetically analyse the molecular characteristics of these isolates [35, 37]. It is important to choose the appropriate VNTR loci to identify the most prevalent cluster for the Beijing family. The classical 12-locus MIRU-VNTR set is a widely used molecular epidemiological approach to elucidate the phylogenetic diversity of MTB isolates, but it is not effective at distinguishing Beijing isolates. The 15-locus and 24-locus VNTR combinations have sufficient discriminatory power and are suitable for MTB genotyping, especially in areas where the Beijing family is prevalent. However, it is not necessary to utilize all 24 loci for genotyping MTB isolates due to the diversity of the isolate population structure. In addition, the 24-locus set is very time consuming and complicated to operate. The genotyping efficiency varied depending on the disparate loci sets in different surveyed areas. To identify a suitable locus set for classifying MTB in Henan, we first chose the 26-locus set to assess the 668 clinical isolates according to previous studies [21, 22, 60].In total, 567 genotypes, forming 38 clusters, and 529 unique genotypes were obtained by 26-locus VNTR analysis with an HGDI score of 0.9984. For the Beijing family isolates, the clustering rate (16.12%) of the 26-locus set was obviously lower than that of spoligotyping (98.74%), and the cumulative HGDI value (0.998) of the 26-locus set was significantly higher than that of spoligotyping (0.042). Moreover, the clustering rate of VNTR was different between Beijing and non-Beijing families (16.13% vs. 2.7%), suggesting that the Beijing family may have more effective infectivity in Henan. Previous studies showed that the combination of MIRU-VNTR and spoligotyping could enhance the discriminatory power of MTB [19, 61].Correspondingly, our data showed that the combination of spoligotyping and the 26-locus set VNTR finally classified all 668 isolates into 576 different patterns and had a lower clustering rate (13.77%) than that with 26-locus VNTR (Table 4).
Our analysis showed that 11 loci of the 26-locus MIRU-VNTR set were poorly discriminatory. Especially, the VNTR49 and MIRU02 loci did not improve the cumulative HGDI, indicating that these two loci were conserved, resulting in no power to discriminate different MTB isolates. In this study, the two largest clusters contained 15 and 14 strains, and the other clusters were composed of 2 to 9 strains. Among 29 isolates of the two largest clusters, 24 (82.7%) belonged to the Beijing family, three to the T1 family, one to MANU2, and one to the H3 family. Moreover, 10 (34.5%) of these 29 isolates were resistant to one or more drugs. However, it remains uncertain whether the patients infected with these strains had close contact with each other. Since the ability of the different locus sets to classify MTB was diverse in Henan, we needed to determine an optimal set that had a discriminatory power comparable to that of the 26-locus set. In consideration of labour and economic costs, we chose the top 10 loci combination, which had an HGDI value comparable to that of the classical 24-locus set (0.996 vs. 0.997), which was slightly lower than that of the 26-locus set (0.998). These data indicated that the 10-locus combination was useful and cost-efficient. Therefore, we suggest this 10-locus set as a potential first-line MTB genotyping method in Henan Province, especially for a large-scale molecular epidemiological survey.
Our data revealed some inconsistency between the results of spoligotyping and MIRU-VNTR for several MTB isolates. Beijing family isolates comprised a large proportion of the largest cluster by MIRU-VNTR genotyping; however,33 isolates with a non-Beijing spoligotype were found in this largest clade. Furthermore, another 19 strains showed the Beijing genotype spoligotype pattern but could not be distinguished by MIRU-VNTR (Fig. 1).This divergence might be due to mixed infection of two different TB isolates in these samples .
Until now, no genotyping methods based on genetic markers have been able to completely accurately classify the Beijing family because there were always exceptional strains . Previous studies showed that different VNTR loci had varying discriminatory power for the Beijing and non-Beijing family genotypes [21, 63]. In our study, only Qub11b had a higher discriminatory power for the Beijing family among 26 loci. However, six loci (loci qub-11b, mtub-21, miru-26, qub26, mtub-04 and miru-10) exhibited a higher discriminatory power for the non-Beijing family. Four loci (Mtub38, ETR-B, ETR-D and MIRU40) showed remarkable differences in allelic diversity between Beijing and non-Beijing genotypes, with the difference in the HGDI greater than 0.25.
There are several limitations of this study. First, the small sample size is a major limitation. Further in order to ensure greater reliability and representativeness of the findings, we should enlarge the sample for further observation in the future. Furthermore, the initial isolates taken by sputum were not collected when retreated tuberculosis patients were first diagnosed in this study; thus, we were unable to differentiate relapse and reinfection cases.
We report the genotype distribution of M. Tuberculosis strains in Henan Province. Based on our results, the combination of spoligotyping and 26-locus VNTR can effectively analyse the molecular epidemiological features of MTB in this area. The Beijing genotype was the predominant genotype in this area and exhibited a great correlation with multi-drug-resistant phenotypes. The analysis of MIRU-VNTR data can help select the appropriate VNTR loci to genotype MTB in specific regions; our results identified a reduced 10-locus MIRU-VNTR set that could be applied to distinguish most MTB lineages in this region. In future studies, superior discriminatory power methods, such as genome sequencing, will be used to classify the remaining clusters based on MIRU-VNTR to better understand their significance related to ongoing TB transmission. Overall, this study provided an effective method to distinguish MTB isolates, and the implementation of this technology will help enhance TB control programmes and reduce the TB burden in Henan Province.
Hunter and Gaston Discriminatory Index
Insertion sequence 6110
- L-J media:
Large sequence polymorphism
Multidrug resistance- TB
Mycobacterial interspersed repetitive unite variable number tandem repeat
Restriction fragment length polymorphism
Spoligotype International Type
Spacer oligonucleotide typing
World Health Organization
Extensively drug-resistant tuberculosis
WHO global tuberculosis control report 2010. Summary. Cent Eur J Public Health. 2010;18(4):237.
Zhao Y, Xu S, Wang L, Chin DP, Wang S, Jiang G, Xia H, Zhou Y, Li Q, Ou X, et al. National survey of drug-resistant tuberculosis in China. N Engl J Med. 2012;366(23):2161–70.
Smith CM, Trienekens SC, Anderson C, Lalor MK, Brown T, Story A, Fry H, Hayward AC, Maguire H. Twenty years and counting: epidemiology of an outbreak of isoniazid-resistant tuberculosis in England and Wales, 1995 to 2014. Euro Surveill. 2017;22(8):1–11.
Streicher EM, Sampson SL, Dheda K, Dolby T, Simpson JA, Victor TC, Gey van Pittius NC, van Helden PD, Warren RM. Molecular epidemiological interpretation of the epidemic of extensively drug-resistant tuberculosis in South Africa. J Clin Microbiol. 2015;53(11):3650–3.
Mathuria JP, Anupurba S. Usefulness of IS6110-based restriction fragment length polymorphism analysis in fingerprinting of Mycobacterium tuberculosis isolates in North India. Int J Mycobacteriol. 2016;5(Suppl 1):S176–7.
Feyisa SG, Haeili M, Zahednamazi F, Mosavari N, Taheri MM, Hamzehloo G, Zamani S, Feizabadi MM. Molecular characterization of Mycobacterium tuberculosis isolates from Tehran, Iran by restriction fragment length polymorphism analysis and spoligotyping. Rev Soc Bras Med Trop. 2016;49(2):204–10.
Luo T, Comas I, Luo D, Lu B, Wu J, Wei L, Yang C, Liu Q, Gan M, Sun G, et al. Southern east Asian origin and coexpansion of Mycobacterium tuberculosis Beijing family with Han Chinese. Proc Natl Acad Sci U S A. 2015;112(26):8136–41.
Jonsson J, Hoffner S, Berggren I, Bruchfeld J, Ghebremichael S, Pennhag A, Groenheit R. Comparison between RFLP and MIRU-VNTR genotyping of Mycobacterium tuberculosis strains isolated in Stockholm 2009 to 2011. PLoS One. 2014;9(4):e95159.
Rindi L, Medici C, Bimbi N, Buzzigoli A, Lari N, Garzelli C. Genomic variability of Mycobacterium tuberculosis strains of the euro-American lineage based on large sequence deletions and 15-locus MIRU-VNTR polymorphism. PLoS One. 2014;9(9):e107150.
Sekizuka T, Yamashita A, Murase Y, Iwamoto T, Mitarai S, Kato S, Kuroda M. TGS-TB: Total genotyping solution for Mycobacterium tuberculosis using short-read whole-genome sequencing. PLoS One. 2015;10(11):e0142951.
Kamerbeek J, Schouls L, Kolk A, van Agterveld M, van Soolingen D, Kuijper S, Bunschoten A, Molhuizen H, Shaw R, Goyal M, et al. Simultaneous detection and strain differentiation of Mycobacterium tuberculosis for diagnosis and epidemiology. J Clin Microbiol. 1997;35(4):907–14.
Filliol I, Driscoll JR, Van Soolingen D, Kreiswirth BN, Kremer K, Valetudie G, Anh DD, Barlow R, Banerjee D, Bifani PJ, et al. Global distribution of Mycobacterium tuberculosis spoligotypes. Emerg Infect Dis. 2002;8(11):1347–9.
Vluggen C, Soetaert K, Groenen G, Wanlin M, Spitaels M, Arrazola de Onate W, Fauville-Dufaux M, Saegerman C, Mathys V. Molecular epidemiology of Mycobacterium tuberculosis complex in Brussels, 2010-2013. PLoS One. 2017;12(2):e0172554.
Pang Y, Zhou Y, Zhao B, Liu G, Jiang G, Xia H, Song Y, Shang Y, Wang S, Zhao YL. Spoligotyping and drug resistance analysis of Mycobacterium tuberculosis strains from national survey in China. PLoS One. 2012;7(3):e32976.
Maes M, Kremer K, van Soolingen D, Takiff H, de Waard JH. 24-locus MIRU-VNTR genotyping is a useful tool to study the molecular epidemiology of tuberculosis among Warao Amerindians in Venezuela. Tuberculosis. 2008;88(5):490–4.
Mazars E, Lesjean S, Banuls AL, Gilbert M, Vincent V, Gicquel B, Tibayrenc M, Locht C, Supply P. High-resolution minisatellite-based typing as a portable approach to global analysis of Mycobacterium tuberculosis molecular epidemiology. Proc Natl Acad Sci U S A. 2001;98(4):1901–6.
Mears J, Abubakar I, Cohen T, McHugh TD, Sonnenberg P. Effect of study design and setting on tuberculosis clustering estimates using mycobacterial interspersed repetitive units-variable number tandem repeats (MIRU-VNTR): a systematic review. BMJ Open. 2015;5(1):e005636.
Supply P, Lesjean S, Savine E, Kremer K, van Soolingen D, Locht C. Automated high-throughput genotyping for study of global epidemiology of Mycobacterium tuberculosis based on mycobacterial interspersed repetitive units. J Clin Microbiol. 2001;39(10):3563–71.
Iwamoto T, Yoshida S, Suzuki K, Tomita M, Fujiyama R, Tanaka N, Kawakami Y, Ito M. Hypervariable loci that enhance the discriminatory ability of newly proposed 15-loci and 24-loci variable-number tandem repeat typing method on Mycobacterium tuberculosis strains predominated by the Beijing family. FEMS Microbiol Lett. 2007;270(1):67–74.
Liu JJ, Yao HY, Liu EY. Analysis of factors affecting the epidemiology of tuberculosis in China. Int J Tuberc Lung Dis. 2005;9(4):450–4.
Zhang L, Chen J, Shen X, Gui X, Mei J, Deriemer K, Gao Q. Highly polymorphic variable-number tandem repeats loci for differentiating Beijing genotype strains of Mycobacterium tuberculosis in shanghai, China. FEMS Microbiol Lett. 2008;282(1):22–31.
Zhang D, An J, Wang J, Hu C, Wang Z, Zhang R, Wang Y, Pang Y. Molecular typing and drug susceptibility of Mycobacterium tuberculosis isolates from Chongqing municipality, China. Infect Genet Evol. 2013;13:310–6.
Alonso M, Alonso Rodriguez N, Garzelli C, Martinez Lirola M, Herranz M, Samper S, Ruiz Serrano MJ, Bouza E, Garcia de Viedma D. Characterization of Mycobacterium tuberculosis Beijing isolates from the Mediterranean area. BMC Microbiol. 2010;10:151.
van Embden JD, van Gorkom T, Kremer K, Jansen R, van Der Zeijst BA, Schouls LM. Genetic variation and evolutionary origin of the direct repeat locus of Mycobacterium tuberculosis complex bacteria. J Bacteriol. 2000;182(9):2393–401.
Lillebaek T, Andersen AB, Dirksen A, Glynn JR, Kremer K. Mycobacterium tuberculosis Beijing genotype. Emerg Infect Dis. 2003;9(12):1553–7.
Brudey K, Driscoll JR, Rigouts L, Prodinger WM, Gori A, Al-Hajoj SA, Allix C, Aristimuno L, Arora J, Baumanis V, et al. Mycobacterium tuberculosis complex genetic diversity: mining the fourth international spoligotyping database (SpolDB4) for classification, population genetics and epidemiology. BMC Microbiol. 2006;6:23.
Supply P, Allix C, Lesjean S, Cardoso-Oelemann M, Rusch-Gerdes S, Willery E, Savine E, de Haas P, van Deutekom H, Roring S, et al. Proposal for standardization of optimized mycobacterial interspersed repetitive unit-variable-number tandem repeat typing of Mycobacterium tuberculosis. J Clin Microbiol. 2006;44(12):4498–510.
Flores-Trevino S, Morfin-Otero R, Rodriguez-Noriega E, Gonzalez-Diaz E, Perez-Gomez HR, Bocanegra-Garcia V, Vera-Cabrera L, Garza-Gonzalez E. Genetic diversity of Mycobacterium tuberculosis from Guadalajara, Mexico and identification of a rare multidrug resistant Beijing genotype. PLoS One. 2015;10(2):e0118095.
Hunter PR. Reproducibility and indices of discriminatory power of microbial typing methods. J Clin Microbiol. 1990;28(9):1903–5.
Demay C, Liens B, Burguiere T, Hill V, Couvin D, Millet J, Mokrousov I, Sola C, Zozio T, Rastogi N. SITVITWEB--a publicly available international multimarker database for studying Mycobacterium tuberculosis genetic diversity and molecular epidemiology. Infect Genet Evol. 2012;12(4):755–66.
Liu Y, Tian M, Wang X, Wei R, Xing Q, Ma T, Jiang X, Li W, Zhang Z, Xue Y, et al. Genotypic diversity analysis of Mycobacterium tuberculosis strains collected from Beijing in 2009, using spoligotyping and VNTR typing. PLoS One. 2014;9(9):e106787.
Mokrousov I, Narvskaya O, Limeschenko E, Vyazovaya A, Otten T, Vyshnevskiy B. Analysis of the allelic diversity of the mycobacterial interspersed repetitive units in Mycobacterium tuberculosis strains of the Beijing family: practical implications and evolutionary considerations. J Clin Microbiol. 2004;42(6):2438–44.
Dong H, Liu Z, Lv B, Zhang Y, Liu J, Zhao X, Wan K. Spoligotypes of Mycobacterium tuberculosis from different provinces of China. J Clin Microbiol. 2010;48(11):4102–6.
L-Q C, W-Q L, Li L, Dai ZJ, Bai D-P, Zhang L, Shao S-F, Wu Q, Lu W, Sun Z-G, et al. Study on the genotype of Mycobacterium tuberculosis isolates from hospitals in Tianjin. Chinese Journal of Epidemiology. 2007;28(8):4102–6.
Dong H, Shi L, Zhao X, Sang B, Lv B, Liu Z, Wan K. Genetic diversity of Mycobacterium tuberculosis isolates from Tibetans in Tibet, China. PLoS One. 2012;7(3):e33904.
Yu Q, Su Y, Lu B, Ma Y, Zhao X, Yang X, Dong H, Liu Y, Lian L, Wan L, et al. Genetic diversity of Mycobacterium tuberculosis isolates from Inner Mongolia, China. PLoS One. 2013;8(5):e57660.
Wang J, Liu Y, Zhang CL, Ji BY, Zhang LZ, Shao YZ, Jiang SL, Suzuki Y, Nakajima C, Fan CL, et al. Genotypes and characteristics of clustering and drug susceptibility of Mycobacterium tuberculosis isolates collected in Heilongjiang Province, China. J Clin Microbiol. 2011;49(4):1354–62.
Liu J, Tong C, Jiang Y, Zhao X, Zhang Y, Liu H, Lu B, Wan K. First insight into the genotypic diversity of clinical Mycobacterium tuberculosis isolates from Gansu Province, China. PLoS One. 2014;9(6):e99357.
Li WM, Wang SM, Pei XY, Liu ZQ, Zhong Q. DNA fingerprinting of Mycobacterium tuberculosis strains from Beijing, Guangdong and Ningxia. Chinese Journal of Epidemiology. 2003;24(05):e99357.
Liang QF, Pang Y, Chen QY, Lin SF, Lin J, Zhao Y, Wei SZ, Zheng JF, Zheng SH. Genetic profile of tuberculosis among the migrant population in Fujian Province, China. Int J Tuberc Lung Dis. 2013;17(5):655–61.
Parwati I, van Crevel R, van Soolingen D. Possible underlying mechanisms for successful emergence of the Mycobacterium tuberculosis Beijing genotype strains. Lancet Infect Dis. 2010;10(2):103–11.
Cobelens F. Relative reproductive fitness of the W-Beijing genotype. Int J Tuberc Lung Dis. 2012;16(3):287.
Reed MB, Domenech P, Manca C, Su H, Barczak AK, Kreiswirth BN, Kaplan G, Barry CE 3rd. A glycolipid of hypervirulent tuberculosis strains that inhibits the innate immune response. Nature. 2004;431(7004):84–7.
Manca C, Reed MB, Freeman S, Mathema B, Kreiswirth B, Barry CE 3rd, Kaplan G. Differential monocyte activation underlies strain-specific Mycobacterium tuberculosis pathogenesis. Infect Immun. 2004;72(9):5511–4.
Purwar S, Chaudhari S, Katoch VM, Sampath A, Sharma P, Upadhyay P, Chauhan DS. Determination of drug susceptibility patterns and genotypes of Mycobacterium tuberculosis isolates from Kanpur district, North India. Infect Genet Evol. 2011;11(2):469–75.
Kremer K, Glynn JR, Lillebaek T, Niemann S, Kurepina NE, Kreiswirth BN, Bifani PJ, van Soolingen D. Definition of the Beijing/W lineage of Mycobacterium tuberculosis on the basis of genetic markers. J Clin Microbiol. 2004;42(9):4040–9.
Caws M, Thwaites G, Stepniewska K, Nguyen TN, Nguyen TH, Nguyen TP, Mai NT, Phan MD, Tran HL, Tran TH, et al. Beijing genotype of Mycobacterium tuberculosis is significantly associated with human immunodeficiency virus infection and multidrug resistance in cases of tuberculous meningitis. J Clin Microbiol. 2006;44(11):3934–9.
Al Hajoj S, Rastogi N. The emergence of Beijing genotype of Mycobacterium tuberculosis in the Kingdom of Saudi Arabia. Ann Thorac Med. 2010;5(3):149–52.
Brown T, Nikolayevskyy V, Velji P, Drobniewski F. Associations between Mycobacterium tuberculosis strains and phenotypes. Emerg Infect Dis. 2010;16(2):272–80.
Ani A, Bruvik T, Okoh Y, Agaba P, Agbaji O, Idoko J, Dahle UR. Genetic diversity of Mycobacterium tuberculosis complex in Jos, Nigeria. BMC Infect Dis. 2010;10:189.
Lai CC, Tan CK, Lin SH, Liao CH, Huang YT, Chou CH, Hsu HL, Wang CY, Lin HI, Hsueh PR. Clinical and genotypic characteristics of extensively drug-resistant and multidrug-resistant tuberculosis. Eur J Clin Microbiol Infect Dis. 2010;29(5):597–600.
Kam KM, Yip CW, Tse LW, Wong KL, Lam TK, Kremer K, Au BK, van Soolingen D. Utility of mycobacterial interspersed repetitive unit typing for differentiating multidrug-resistant Mycobacterium tuberculosis isolates of the Beijing family. J Clin Microbiol. 2005;43(1):306–13.
Nikolayevskyy V, Gopaul K, Balabanova Y, Brown T, Fedorin I, Drobniewski F. Differentiation of tuberculosis strains in a population with mainly Beijing-family strains. Emerg Infect Dis. 2006;12(9):1406–13.
Cox HS, Kubica T, Doshetov D, Kebede Y, Rusch-Gerdess S, Niemann S. The Beijing genotype and drug resistant tuberculosis in the Aral Sea region of Central Asia. Respir Res. 2005;6:134.
Oota H, Kitano T, Jin F, Yuasa I, Wang L, Ueda S, Saitou N, Stoneking M. Extreme mtDNA homogeneity in continental Asian populations. Am J Phys Anthropol. 2002;118(2):146–53.
Caws M, Thwaites G, Dunstan S, Hawn TR, Lan NT, Thuong NT, Stepniewska K, Huyen MN, Bang ND, Loc TH, et al. The influence of host and bacterial genotype on the development of disseminated disease with Mycobacterium tuberculosis. PLoS Pathog. 2008;4(3):e1000034.
Buu TN, Huyen MN, Lan NT, Quy HT, Hen NV, Zignol M, Borgdorff MW, Cobelens FG, van Soolingen D. The Beijing genotype is associated with young age and multidrug-resistant tuberculosis in rural Vietnam. Int J Tuberc Lung Dis. 2009;13(7):900–6.
Buu TN, van Soolingen D, Huyen MN, Lan NT, Quy HT, Tiemersma EW, Kremer K, Borgdorff MW, Cobelens FG. Increased transmission of Mycobacterium tuberculosis Beijing genotype strains associated with resistance to streptomycin: a population-based study. PLoS One. 2012;7(8):e42323.
Lan NT, Lien HT, Tung le B, Borgdorff MW, Kremer K, van Soolingen D. Mycobacterium tuberculosis Beijing genotype and risk for treatment failure and relapse, Vietnam. Emerg Infect Dis. 2003;9(12):1633–5.
Liu Q, Yang D, Xu W, Wang J, Lv B, Shao Y, Song H, Li G, Dong H, Wan K, et al. Molecular typing of Mycobacterium tuberculosis isolates circulating in Jiangsu province, China. BMC Infect Dis. 2011;11:288.
Murase Y, Mitarai S, Sugawara I, Kato S, Maeda S. Promising loci of variable numbers of tandem repeats for typing Beijing family Mycobacterium tuberculosis. J Med Microbiol. 2008;57(Pt 7):873–80.
Mallard K, McNerney R, Crampin AC, Houben R, Ndlovu R, Munthali L, Warren RM, French N, Glynn JR. Molecular detection of mixed infections of Mycobacterium tuberculosis strains in sputum samples from patients in Karonga District, Malawi. J Clin Microbiol. 2010;48(12):4512–8.
Stavrum R, Mphahlele M, Ovreas K, Muthivhi T, Fourie PB, Weyer K, Grewal HM. High diversity of Mycobacterium tuberculosis genotypes in South Africa and preponderance of mixed infections among ST53 isolates. J Clin Microbiol. 2009;47(6):1848–56.
We are grateful to all the participants involved in this study.
Availability of data and materials
The datasets used and/or analyzed during the current study available from the corresponding author on reasonable request.
Ethics approval and consent to participate
This project was approved by the Ethics Review Committee of Henan Center for Disease Control and Prevention. Written informed consent was obtained from all participants. Ethics has been respected throughout the whole study period.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. Genotyping of 668 M. tuberculosis isolates with 26-locus MIRU-VNTR and spoligotyping. The clustering was based on the analysis performed using BioNumerics 6.6 to compare these two genotyping methods. From left to right: (1) UPGMA dendrogram generated by the 26-locus MIRU-VNTR, (2) spoligotyping patterns, and (3) strain number. (PDF 113 kb)