- Research article
- Open Access
- Open Peer Review
Analysis of HCV quasispecies dynamic under selective pressure of combined therapy
BMC Infectious Diseasesvolume 13, Article number: 61 (2013)
The quasispecies composition of Hepatitis C virus (HCV) could have important implications with regard to viral persistence and response to interferon-based therapy. The complete NS5A was analyzed to evaluate whether the composition of NS5A quasispecies of HCV 1a/1b is related to responsiveness to combined interferon pegylated (PEG-IFN) and ribavirin therapy.
Viral RNA was isolated from serum samples collected before, during and after treatment from virological sustained responder (SVR), non-responder (NR) and the end-of-treatment responder patients (ETR). NS5A region was amplified, cloned and sequenced. Six hundred and ninety full-length NS5A sequences were analyzed.
This study provides evidence that lower nucleotide diversity of the NS5A region pre-therapy is associated with viral clearance. Analysis of samples of NRs and the ETRs time points showed that genetic diversity of populations tend to decrease over time. Post-therapy population of ETRs presented higher genetic distance from baseline probably due to the bottleneck phenomenon observed for those patients in the end of treatment. The viral effective population of those patients also showed a strong decrease after therapy. Otherwise, NRs demonstrated a continuous variation or stability of effective populations and genetic diversity over time that did not seem to be related to therapy. Phylogenetic relationships concerning complete NS5A sequences obtained from patients did not demonstrate clustering associated with specific response patterns. However, distinctive clustering of pre/post-therapy sequences was observed. In addition, the evolution of quasispecies over time was subjected to purifying or relaxed purifying selection. Codons 157 (P03), 182 and 440 (P42), 62 and 404 (P44) were found to be under positive selective pressure but it failed to be related to the therapy.
These results confirm the hypothesis that a relationship exists between NS5A heterogeneity and response to therapy in patients infected with chronic hepatitis C.
Hepatitis C virus (HCV) is the major etiological agent of chronic hepatitis worldwide . Chronic infection can progress to liver cirrhosis with risk for the development of hepatocellular carcinoma [2, 3].
The current treatment for chronic hepatitis C is based on interferon (IFN) or pegylated IFN in combination with ribavirin (RBV), leading to a sustained virological response in approximately 50% of patients infected with genotypes 1a/1b [4, 5]. Several host parameters, disease characteristics and virus-related factors are relevant to the possibility of viral clearance after therapy [6, 7].
A member of the family Flaviviridae, HCV is an enveloped virus with a positive, single-stranded RNA genome approximately 9.5 kb in length, encoding a single polyprotein of approximately 3000 amino acids that is co- and post-translationally cleaved by viral and cellular proteases into structural and non structural proteins .
HCV is classified into seven genotypes, HCV-1 to −7, with each genotype being further subdivided into subtypes such as HCV-1a and 1b [9, 10]. Furthermore, in infected individuals, HCV circulates as a population of several closely related viral variants referred to as “quasispecies” [11, 12]. New variants are continuously generated during viral replication as a result of errors made by the viral RNA-dependent RNA polymerase, which lacks proofreading activity, during rapid replication . The quasispecies nature of HCV could have important implications for viral persistence, pathogenicity and resistance to anti-viral agents [14, 15].
The non-structural 5A (NS5A) protein is implicated in interferon resistance . Enomoto et al. [17, 18] suggested that the genetic heterogeneity of a specific domain of the NS5A region of HCV, termed the IFN sensitivity determining region (ISDR), was related to treatment responses in Japanese patients with HCV genotype 1b infection. This is a controversial issue, but analysis of published information supports the hypothesis that a relationship exists between NS5A heterogeneity and response to therapy [19–22].
NS5A is an RNA binding phosphorylated protein comprising three domains separated by trypsin-sensitive low complexity sequences (LCS I and LCS II) and an N-terminal amphipathic alpha-helix that anchors the protein to intracellular membranes [23–25]. Full-length NS5A protein appears to be located exclusively in the cytoplasm. However, N-terminal deleted forms have been found in the nucleus, suggesting that they can move to the nucleus and act as potent transcriptional activators [26–28].
The NS5A region has generated much interest owing to its association with responses to therapy. In the present study, the evolution of the complete NS5A region of HCV genotypes 1a and 1b was examined in patients undergoing pegylated IFN plus ribavirin therapy. The results demonstrate that higher diversity and complexity of quasispecies are related to lower rates of response to peginterferon and ribavirin therapy. Therefore, quasispecies variability of the NS5A region could help understand the mechanism underlying treatment failure in patients infected with chronic hepatitis C.
The characteristics of the patients are presented in Table 1. The population was predominantly male (NR 75% vs. SVR 67% vs. ETR 100%); 55% of patients were infected with HCV genotype 1a (NR 25% vs. SVR 67% vs. ETR 75%) and 45% of patients were infected with HCV genotype 1b (NR 75% vs. SVR 33% vs. ETR 25%). The mean age of the patients was 43 ± 13.9 years (NR 46.3 ±10.6 vs. SVR 39.7 ± 2.6 vs. 42.3 ± 6.2). No statistically significant differences were observed between groups.
The levels of viremia in baseline samples did not differ significantly among the patient groups (NR 6.52 vs. SVR 6.39 vs. ETR 6.17 log IU/ml). Patients who did not respond to therapy had a significant decrease in viremia after 12 weeks of treatment (4.99 ± 0.94 log IU/ml; p=0.002), with recovering levels in samples collected after therapy was complete (6.30 – 6.86 log IU/ml). The viral load values varied among the samples from end of treatment responders. The mean viremia levels for these patients ranged from 4.99 to 5.90 log IU/ml (Table 1).
Nonsense mutations in the NS5A region are observed in vivo
Eight nonsense mutations were detected in the NS5A region after analysis of the 690 sequences dataset, two of them being common to more than one sequence. Codon 9 nonsense mutation was observed in two clones (before treatment samples from patients P03 and P40) and codon 84 in three clones (two clones in the P146 6m and one clone in the P42 4m sample). Nonsense mutations were also observed in codons 47 (before treatment sample from P47), 233 (P37 5m) and 399 (P44 12wt) (Figure 1). The majority of nonsense mutations were detected in Domain I of the NS5A region (codons 47 and 84), with only one being located in domain III (codon 399), one in the Alpha Helix (codon 9) region and one in the Low Complexity Sequence I Region (codon 233). The numeration of codons was based on the NS5A protein, the first codon being the first amino acid coded for in the NS5A genomic region.
Part of this nonsense mutation could provide a functional NS5A protein. Therefore, the sequences at the N-terminal region of NS5A in which the nonsense mutation occurred were analyzed to locate the next methionine residue. The methionines are highlighted in Figure 1A. The nonsense mutation 399 was not investigated owing to the presence of a methionine after it. This sequence could provide functional proteins despite missing a C-terminal portion (Figure 1B).
Nucleotide diversity of NS5A and sustained response to therapy
Shannon entropy analysis showed that the genetic diversity of the NS5A region at baseline was significantly lower for the SVR group (mean value 0.00706) than for the NR (0.017529) and ETR groups (0.017427) (p=0.0253 and p=0.0265, respectively) (Figure 2A). The genetic distance was also significantly lower for the SVR group (0.0050) than for the NR (0.0133) and ETR (0.0148) groups (p=0.021 and p=0.022, respectively; Figure 2B). These results demonstrated that quasispecies of SVR samples at baseline are less diverse than those of NR and ETR samples.
The mean entropy and genetic distance of during-treatment samples for NR and after the completion of therapy for NR and ETR samples were comparable when the results of each patient were analyzed individually. Generally, values had a propensity to decrease in samples collected during and after the end of treatment, but no profile was detected for patients in the same response group (Table 1 and Figure 2). However, significant differences were observed when each group was compared by the mean genetic distance of two time-points. For NRs, 6m post-therapy variants showed significant lower diversity than baseline quasispecies (baseline: 0.01325 vs 6m: 0.01025; p=0.046). ETRs showed values close to significance for mean genetic distance but differences were significant when Shannon entropy was analyzed between pre and 5m post-therapy (mean genetic distance - baseline: 0.01475 vs 5m: 0.005916; p=0.06; Shannon entropy – baseline: 0.017425 vs 5m: 0.011125; p=0.03) (Figure 2C).
To analyze the effect of therapy on quasispecies population over time, the mean of genetic distances between each time point since treatment outset and the baseline sample were also calculated (Table 1). Differences among groups of response were compared. Results showed that post-therapy populations of ETR patients (0.0316) presented significantly higher diversity from baseline variants than NR patients (0.0222) (p=0.006 (Figure 2D). When during and after-therapy variants of NRs were compared to ETRs post-treatment variants, results were also statistically different (0.0205 and 0.0316, respectively; p=0.000).
In order to evaluate the number of NS5A quasispecies in each sample, all sequences in this study were analyzed using LOCQSPEC 1.0 software. Figure 3 presents LOCQSPEC 1.0 analyses for ETR and NR patients. The percentages of different quasispecies in ETR, NR and SVR samples are presented.
There was no specific profile of quasispecies diversity. However, samples from ETR patients showed a more varied quasispecies composition before treatment and a more homogeneous composition in several of the after-treatment samples. In some cases, the homogenization of quasispecies composition appeared to be related to the appearance of predominant quasispecies (Figure 3A). The NR samples presented with a more heterogeneous quasispecies composition than the ETR patient samples. The P08 was the only patient in whom the same quasispecies were identified in before- and after-treatment samples (Figure 3B).
The SVR samples contained the most homogeneous quasispecies composition when compared with ETR and NR samples collected before treatment (Figure 3C).
NS5A quasispecies experience genetic evolution over time
Phylogenetic trees were reconstructed from 690 full-length NS5A sequences and reference sequences as described in the Methods section. Sequences clustered according to subtype, presenting monophyletic clusters with genotype 1a or 1b references. Viral isolates corresponding to the same patient samples also clustered, showing they were more closely related than other patient isolates (Figure 4). The topology of sequences from each patient was individually analyzed to evaluate the phylogenetic relationship among isolates. In general, the phylogenetic analysis suggested clustering of isolates of samples collected before treatment from ETR patients. The isolates of samples collected at different times after treatment were mixed and grouped into another cluster. (Additional files 1, 2, 3, 4). Before treatment, quasispecies clustering was observed for P146 from NR patients. Sequences from samples collected before treatment from patients P44, P11 and P08 tended to group together with quasispecies identified after 12 weeks of treatment. However, bootstrap values only sustained clusters for patient P11 (bootstrap value of 69%). (Additional files 5, 6, 7, 8).
In some cases, the phylogenetic analysis suggested that isolates identified in samples collected after treatment demonstrated the longest distances from the main nodes (Additional Information).
The Skyride analysis shows the variation of effective populations relative to the genetic diversity of each time-point analyzed. In general, a strong decrease of relative populations was observed after therapy for ETR patients. For patients P42 and P37 this variation occurred during the first 6 months showing a trend to recover then. Alternatively, patients P03 and P47 presented a later decrease of post-therapy populations. Non-responders P08 and P146 showed a continuous variation over time that did not seem to be related to therapy and P11 and P44 showed a stability of effective population during all time (Figure 5).
NS5A quasispecies are under relaxed purifying selective pressure
Phylogenetic trees were reconstructed using DNA alignment of pre and post-therapy NS5A sequences corresponding to each patient in the study. The set of sequences relative to a specific patient was subjected to phylogenetic reconstruction by maximum likelihood. As a result, eight individual phylogenetic trees were generated (Additional files 1, 2, 3, 4, 5, 6, 7, 8). In order to test whether selection pressure varied among different clades, one or two clades of interest were selected for testing. Clades with predominant quasispecies and those in which isolates identified in samples collected after treatment that demonstrated the longest distances from the main nodes were selected, making the assumption that the clade of interest would demonstrate different values of ω (dN/dS) from other branches of the tree. For patient P37, it was not possible to select any specific clade; the detailed results are presented in Table 2. The values of ω ranged among patients from ω = 0.1101 to ω = 0.7177, and the majority of values indicated a purifying selection or relaxed purifying selective pressure. For patients P47 and P11, the foreground ω values were statistically different from the background ones. For these patients, two clades were selected for analysis using phylogenetic topology. For patient P47 these were clade one, where post-therapy quasispecies demonstrated the longest distances, and clade two, where predominant quasispecies were grouped. Accordingly, their ω values were different (clade 1: ωF = 0.2487, 2Δl = 6.4343, and clade 2: ωF = 0.7177, 2Δl = 21.4983). For patient P11, clade one, where post-therapy quasispecies were located, and clade two, where quasispecies from the first sample collected after the end of therapy were grouped, were selected for analysis. The ω values were different for each of these clades (clade 1 ωF = 0.2685, 2Δl = 9.264, and clade 2 ωF = 0.5783, 2Δl = 11.8750). These results indicate a relaxed purifying selection, with the exception of ωF = 0.7177, which approximates to neutral evolution.
Analysis of selection pressure per site also revealed that populations were mainly under purifying selection and no significant difference between response groups was observed regarding the number of sites selected (NRs 106 and ETRs 103; ω < 1). Sites under positive selective pressure were found at codons 157 (P03), 182 and 440 (P42), 62 and 404 (P44). Data not showed.
Quasispecies composition appears to be important in the IFN resistance mechanisms , and genetic variability has been studied mainly in E2 and NS5A regions of the HCV genome [30–35]. Additional investigation of NS5A region variability may be relevant as heterogeneity in this region before treatment has been related to IFN-monotherapy responsiveness. However, studies have presented controversial data and the mechanism underlying how NS5A interferes with responsiveness to treatment has yet to be elucidated.
In the present study, the genetic variability of the complete NS5A region in patients infected with HCV genotype 1 was analyzed. Altogether, 690 sequences of the entire NS5A from 11 patients’ samples were analyzed. Samples showed lower genetic diversity of quasispecies composition at baseline for SVR patients than NR and patients who relapsed after the end of therapy (ETR). Those significant differences among genetic distance of before treatment were not observed in our previous work when we exclusively analyzed baseline samples . However, the group of patients and sequences analyzed were not identical, being only part of sequences of previous work used to these analyses.
Our findings are consistent with previous observations made on patients undergoing anti-HCV therapy that less complexity and lower diversity of HCV quasispecies at the baseline is associated with viral clearance [19, 21, 37–39]. Therefore, high genetic variability of HCV could be related to the low efficiency of anti-HCV treatment . Jain et al. observed that the initial anti-viral effect of interferon is influenced by the quasispecies composition at the time of treatment initiation, and patients who demonstrated high viral diversity were less likely to respond to treatment . In agreement with the genetic distance and entropy results, the number of viral strains at baseline was lower for SVR patients than NR and ETR patients, showing that quasispecies composition was more homogeneous than other groups of responses. A recent study evaluating the clinical and virological parameters that could be associated with or predictive of therapy outcome at baseline demonstrated that a higher number of quasispecies variants in the E1/E2 region was significantly associated with treatment failure. However, it is inferred that independent factors cannot provide a consistent prediction of therapy response .
In order to investigate whether therapy caused a significant change in the population over time, we compared variants heterogeneity of baseline, during therapy and follow-up by analyzing the mean of genetic distance and Shannon entropy among each time-point. No specific profile of quasispecies diversity for all patients from each response type could be identified over time. However, significant differences were observed when baseline and post-therapy variants of NRs and ETRs were compared. In either case post-therapy population were significant less heterogeneous than baseline showing a decrease of diversity over time.
We also calculated the mean of genetic distances between each two time points and could observe that when compared to the baseline, post-therapy populations of ETR patients presented significantly higher diversity from baseline variants than NR patients.
Those findings correlate with data obtained by tracking the number of strains. Analyzing ETR, it was evident that quasispecies composition was more homogeneous in post-therapy samples. This homogeneity was most evident in patients P37 and P03, who presented with homogeneous quasispecies composition from the rebound and sustained it. As with ETR patients, the HCV RNA is undetectable by the end of therapy, suggesting that the selective pressure of treatment acts on quasispecies diversity during it until HCV becomes undetectable in samples. One explanation is that during or after the end of therapy predominant quasispecies rise in frequency, while others decrease progressively in frequency or are eliminated. The persistence of this predominant variant could be favorable for sustaining the infection, causing it to become chronic again.
Farci at al. evaluated the quasispecies composition of the E1/E2 region in samples from patients who were untreated or treated with conventional IFN combined with ribavirin. Analyzing samples of relapse patients, they observed greater viral diversity at baseline, and an emergence at relapse of a new dominant strain, or the emergence of a minor dominant population . They inferred that the emergence of a new viral strain after rebound implies that the majority of baseline variants were sensitive to IFN and that the origin of these new strains is uncertain. The authors concluded that it is probable that very low levels of virus replication continued to occur despite the disappearance of viremia .
A study focusing on breakthrough response pattern, defined by patients who have an initial response followed by reactivation whilst receiving IFN therapy, did not identify quasispecies variants from breakthrough in baseline samples . This study assumed that HCV variants sampled at the time of breakthrough represent drug-resistant quasispecies, and if present at baseline, they must be present very low frequency. According to the authors viral breakthrough could be attributed to the selection of pre-existing drug-resistant variants or the emergence of different quasispecies with reduced sensitivity to IFN. However, the study indicated that selection is mostly responsible for appearance of drug resistant quasispecies at breakthrough . As breakthrough and relapse indicate rebound of infection after undetectable HCV RNA levels, is reasonable that the results presented herein are in agreement when considering the quasispecies composition and diversity at breakthrough and relapse time.
The data collected from non-responder samples revealed a very diverse composition of viral strains over the time course of the investigation. As described previously, the high variability of the quasispecies population in these patient samples might represent a continuous process of adaptation , as these variants were continuously eliminated during and after the end of treatment in most cases. Alternatively, it could offer the virus some advantage in sustaining the infection as a large number of quasispecies may indicate a better opportunity for virus persistence [44, 45].
The analysis of the number of effective populations among time-point analyzed showed different profiles for the types of response. A strong decrease of effective populations was observed after therapy for ETR patients whereas NRs presented a continuous variation or stability of its populations over time. Those data are in agreement to nucleotide diversity results where ETRs demonstrated a decrease of population number and heterogeneity presenting more distinct post-therapy variants. In general, populations of those patients experience a bottleneck phenomenon after therapy. On the other hand, profiles found to NRs assent to the continuous high diverse population and a lower diversity between pre and post-therapy variants probably due to a minor effect of treatment in variants selection. Pawlotsky et al. compared pre- and post-therapy NS5A amino acid sequences from non-responder patients and demonstrated that most variants from after-treatment samples were not detected before treatment, whereas most pre-treatment variants were no longer evident after treatment . The data presented here agree with their findings, with the exception of samples from patient P08. In the P08 samples, a variant was identified at baseline (13% of NS5A amino acid sequences) that was detected after 12 weeks of treatment and two months after therapy was completed. Several studies have showed that variants resistant to therapy may be present at baseline, and it is proposed that the persistence of these variants in patients who fail to respond suggests the existence of virus strains with inherent resistance to IFN [41, 43].
Consistent with previous observations, phylogenetic analysis of virus sequences obtained from patients failed to show any clustering associated with specific response [34, 41, 46]. However, the data suggested that pre- and post-therapy isolates from samples of ETR patients tended to group together in different branches. For NR samples, before-treatment quasispecies variants tended to group with quasispecies found after 12 weeks of treatment, and post-therapy sequences grouped together in another branch. The distinctive clustering of pre/post-therapy sequences had been described previously [22, 42, 47], demonstrating the evolutionary process occurred over time. In some cases, isolates identified in after-treatment samples had the longest distances from the main nodes in phylogenetic analysis. This may suggest that changes can improve the fitness of quasispecies resulting in a persistent infection after treatment selection pressure.
To investigate whether quasispecies were under differential selective pressure over time, we performed analysis with the set of sequences of each NR and ETR patient. In general, the results showed that a purifying selection is driving the evolution of quasispecies over time. In some cases, the purifying selection was relaxed (clades selected for patients P11 and P47). In such cases, two scenarios could be envisaged. Variants undetected in the baseline sample due its low frequency could increase in frequency under relaxed selective pressure and consequently be detected in the after therapy samples for some time. Alternatively, those variants could have arisen by mutation during treatment increasing in frequency and being detected then. Relaxed purifying selection may also be indicative of some codons of the entire protein being under positive selection. However, we failed to identify sites under positive selection for those patients. Codons under positive selective pressure observed in this work could not be related to the therapy outcome and were not present in preferential position in the NS5A protein. A study analyzing PKR binding domain and the V3 domain in the NS5A region identified positively selected sites but also failed to detect a pattern mechanism for the inefficient response to anti-viral treatment . Therefore, further analyses are necessary.
Genetic analyses identified nonsense mutations in the NS5A sequence of some HCV variants. Nonsense mutations in samples from HCV-infected patients have been described [34, 36, 49], but the structural and functional implications of such mutations detected in vivo are unclear. Most of the nonsense mutations detected were located in the N-terminal region of NS5A (Alpha Helix, Domain I or Low Complexity Sequence I regions of the NS5A); only one was located in the C-terminal region (domain III). It is known that N-terminal mutants of NS5A are preferentially located in the nucleus and are reported to function as transcriptional regulators. A recent study using a HCV replicating cell system revealed that during the life cycle a variety of N-terminally truncated NS5A fragments are generated . Tests on several of these truncated NS5A fragments demonstrated that they were preferentially located in the nucleus or equally distributed between the cytoplasm and nucleus. The full length NS5A (1–449) was located in the extra-nuclear compartment as previously described. However, truncated forms impaired HCV replication. In contrast, domain III of the C-terminal NS5A region can be deleted with no or minimal effect on RNA replication [51, 52], but the C-terminal region residing between amino acid residues 2404 and 2435 of domain III is crucial for virus production . To elucidate the effects of nonsense mutations identified in the present study, further analysis with HCV replicating cells is necessary. Truncated NS5A forms observed in previous studies show drawbacks in terms of replication or assembly, but these effects could be overcome by the diversity of quasispecies composition.
This study confirms that heterogeneous diversity of quasispecies pre-therapy could be related to a low response to IFN-based therapy, and that homogeneity of quasispecies composition at baseline with viral clearance.
The follow-up of patients’ samples showed that genetic diversity of populations decreased over time. Post-therapy population of end-of-treatment responders presented higher genetic distance from baseline probably due to the bottleneck phenomenon observed for those patients in the end of treatment. The effective population of those patients also showed a strong decrease after therapy. Otherwise, NRs demonstrated a continuous variation or stability of effective populations and genetic diversity over time that did not seem to be related to therapy. Quasispecies distribution of NS5A was variable and distinctly clustered over time. In addition, the evolution of quasispecies over time was subjected to purifying or relaxed purifying selection showing that the majority of the synonymous mutations are not being fixed. Some codons were found to be under positive selective pressure but it failed to be related to the therapy.
Therefore, the quasispecies composition and evolution over time are factors to be considered in terms of patient outcome after combined therapy for chronic hepatitis C.
Eleven naïve patients infected chronically with HCV RNA genotypes 1a or 1b were enrolled from the Hepatology Department of the São José do Rio Preto School of Medicine. Patients with other concomitant liver diseases [hepatitis B virus (HBV) or other hepatotropic virus infections, alcohol abuse, autoimmune hepatitis and hereditary liver diseases] were excluded. This study was approved by “The Ethics Committee of the School of Medicine of São José do Rio Preto”, and written informed consent was obtained from all patients who consented to their individual data being included in the manuscript. Plasma samples were collected before, during and after the end of treatment. During the 48-week treatment, patients received PEG-IFN-α-2b (according to body weight) subcutaneously once a week and RBV daily, taken orally at a dose of 600–1,200 mg (according to body weight).
The patients were classified into three groups according to their response to therapy: three patients demonstrated sustained virological response (SVR); four patients were non-responders (NR) and four patients were end-of-treatment virological responders (ETR) (Table 1). SVR was defined as absence of HCV RNA in plasma using qualitative PCR six months after the end of therapy. NR was defined as continued presence of HCV RNA in plasma during treatment and six months after the end of treatment. Patients who were HCV RNA-negative at the end of therapy but experienced a relapse were classified as ETR. Breakthrough response pattern defined by patients who have an initial response followed by reactivation whilst receiving IFN therapy was not analyzed in this study based on time point analyzed during treatment.
For all patients, one sample before treatment was analyzed. Additional samples were collected during therapy (12 weeks of therapy) and after the end of therapy (14 days, two and six months) for non-responders, and at the relapse time and then monthly for end-of-treatment responders. Samples analyzed for each patient are detailed in Table 1. Samples from the SVR group and before treatment samples from most patients (P05, P35, P40, P44, P03, P37 and P42) enrolled in this study were analyzed in a previous work (identification of patients: P2, P3, P4, P8, P9, P10 and P11 respectively) and were used in this study for comparative analyzes of the evolutionary dynamic of quasispecies . Baseline sample of patients P08, P11, P146 and P47 analyzed in this work were not previously done. Additionally, all samples collected during and after the end of treatment were exclusively analyzed in this work.
RNA extraction, RT-PCR and NS5A amplification
Total RNA was extracted from 140 μl of plasma using the commercially available QIAamp Viral RNA Kit (Qiagen, Uniscience). RNA was reverse-transcribed into cDNA using a High-Capacity cDNA Archive kit (Applied Biosystems, Foster City, CA, USA) and random primers. The mixture was incubated at 37°C for two hours. For amplification of the entire NS5A region of the HCV genome, a nested polymerase chain reaction (PCR) was performed using the primers described previously .
The viral load was quantified using the Cobas TaqMan HCV Test according to the manufacturer’s instructions.
Cloning and sequencing
PCR products of approximately 1.7 kb were purified and ligated into the pCR-XL-TOPO-vector using the TOPO XL PCR cloning kit (InvitrogenTM Life Technologies, Carlsbad, CA, USA). The ligation products were transformed into competent cells (InvitrogenTM Life Technologies, Carlsbad, CA, USA). Fifteen transformants were randomly chosen for further studies, and plasmid DNA was isolated from a 3.0-ml broth culture using the GeneJET plasmid Miniprep Kit (Fermentas). Recombinant pCR-XL-TOPO-NS5A clones were sequenced using dideoxy terminator automated sequencing (ABI Prism Ready Reaction Mix; Applied Biosystems, Foster City, CA, USA) using an ABI Prism 377 and ABI 3130XL sequencers, according to the manufacturer’s instructions (Applied Biosystems Inc, Foster City, CA, USA.). Eight to ten sequencing reactions were performed for each clone, using flanking primers, M13 Forward and M13 Reverse (Invitrogen TM Life Technologies, Carlsbad, CA, USA), internal forward primers (1aF1 5’CACCAGTGGATAASCTCGGA3’, 1aF2 5’CCCATYAATGCCTACACCAC3’, 1aF3 5’CTGTCYGCTCCATCTCTCA3’ and 1aF4 5’GARTCAGARAACAAAGTGGTG3’ for genotype 1a; 1bF1 5’ATCCTCTCHAR-CCTTACCAT3’, 1bF2 5’ GRACATTCCCCRTCAACGC3’, 1bF3 5’ 5’GTCCTRACAG-AATCCACMGTG3’ and 1bF4 5’CCARTTGTCTGCGCCTTC3’ for genotype 1b) and internal reverse primers (1aR1 5’ACATWGAGCAACACACGAC3’, 1aR2 5’GTTCCC-TTGAGAGATGGAGC3’ and 1aR3 5’TAGGCATTRATGGGGAAGGT3’ for genotype 1a; 1bR1 5’ARCARCAGACGACGTCCTC3’, 1bR2 5’AGCGGGTCGAAAGAGTCCA 3’ and 1bR3 5’ GAACCGTTTTTGACATGTCC3’ for genotype 1b).
Genetic and Evolutionary Analysis
All sequences were analyzed with Phred-Phrap programs [54–56]. These programs analyze the quality of the sequences and align them in complete NS5A contigs. This study generated 585 sequences of full length NS5A from samples collected during and after treatment and for patients P08, P11, P146 and P47 also from the sample collected before the beginning of the treatment. In order to analyze viral evolution along time, 105 sequences from samples collected before the treatment, previously published (Accession numbers: EU309511 - EU309525, EU309586 - EU309599 and EU309600 - EU309614) , were used in the analyses. Consequently, 690 full-length NS5A sequences were analyzed in this study, corresponding to 11 patients. The nucleotide sequence contigs (1344 nucleotides for genotype 1a and 1341 for genotype 1b) were aligned using the Clustal X program (version 1.81)  and amino acid sequences were obtained. Primer sequences were removed from all sequences using the BioEdit program (version 220.127.116.11) . The genetic distance between pairs of sequences were calculated with MEGA version 4 or 5 using the p-distance or Tamura Nei methods . The variability of amino acid residues in each position (i) was measured by calculating the Shannon entropy . The entropy value is a measure of the lack of information at each position of the amino acid sequence. The Shannon Entropy was calculated at the amino acid level as follows: H(i) = −(Σf(b,i)log(base 2)f(b,i)), where f(b,i) is the frequency with which each residue b appears in position i of the protein. When different groups of virological response were compared, the value was normalized by Sn=H/logN, where N is the total number of sequences analyzed in each sample .
For analyzing quasispecies variability, all sequences of complete NS5A generated in this study were analyzed using software LOCQSPEC 1.0  and the contigs that presented the same nucleotide or amino acid sequences were grouped together. Bayesian skyride plots were performed using the BEAST package .To construct the phylogenetic tree of NS5A variants obtained from the patient samples, the PAUP* version 4 program was used . Phylogenetic trees were constructed using the maximum likelihood method with the model of substitution, as determined by hierarchical likelihood ratio test score criteria in Modeltest 3.06 . Base frequency, gamma distribution and transition/transversion ratios were determined (from the data) by Modeltest 3.06. A thousand replicates were used to test the support given by the data to the clusters of the tree topology, and bootstrap values >70 were considered significant . Genetic and phylogenetic analyses were performed using the standard genotype 1a sequence H77 (NC_004102.1) as a reference and genotype 1b sequence HCV-J (D90208.1), obtained from GenBank. The ratio between the relative rate of non-synonymous substitution to the relative rate of synonymous substitution (ω=dN/dS) measures the strength of selection acting on a protein-coding gene. Assuming synonymous mutations are subjected to almost strictly neutral selection, ω<1, ω=1, and ω>1 represent negative selection, neutral evolution, and positive Darwinian selection, respectively . Site per site ω was calculated by Single Likelihood Ancestor Counting (SLAC) method using HyPhy . Maximum likelihood analysis of the sequence evolution was performed using the CODEML program in the PAML 3.15 software package . Initially, phylogenetic trees were reconstructed using the maximum likelihood method and the HKY model of substitution, as determined by hierarchical likelihood ratio test score criteria in Modeltest 3.06 for each dataset of NS5A sequences corresponding to each non-responder and end-of-treatment responder in this study. The ratios of global synonymous changes per site (dS) versus replacement changes per site (dN) for each tree were calculated using two models described by Yang . The one-ratio model assumed an equal ω ratio for all branches in the phylogeny. The two-ratio model assumed two ω ratios: one branch for the background (ω B), one for the foreground branch (ω F = branch of interest) leading to a specific clade of the phylogenetic trees, specified in the Additional Information. This analysis makes different assumptions about the dN/dS ratios for branches of interest relative to the background dN/dS ratio for all other branches. For instance, the “two-ratio” model assumes that the branches of interest have a dN/dS ratio that is different from the background ratio.
The results are presented as mean ± SD, or as percentages. Comparisons among the sustained virological responders, end of treatment responders and non-responders were determined using Tukey’s or Fisher’s statistical test after performing an one-way ANOVA or Chi square. A paired t-test was performed to compare means between two time-points. In all tests, a P value less than 0.05 was considered significant.
All sequences obtained in this study were submitted to the GenBank nucleotide sequence database (http://www.ncbi.nlm.nih.gov/genbank/). Accession numbers: GenBank:HQ823765 - HQ824349 (sequences obtained in this study), GenBank:EU309511 - EU309525, GenBank:EU309586 - EU309599 and GenBank:EU309600 - EU309614 (sequences generated in a previous study) .
GBD: Global burden of disease (GBD) for hepatitis C. J Clin Pharmacol. 2004, 44 (1): 20-29.
Chevaliez S, Pawlotsky JM: Hepatitis C virus: virology, diagnosis and management of antiviral therapy. World J Gastroenterol. 2007, 13 (17): 2461-2466.
Saito I, Miyamura T, Ohbayashi A, Harada H, Katayama T, Kikuchi S, Watanabe Y, Koi S, Onji M, Ohta Y, et al: Hepatitis C virus infection is associated with the development of hepatocellular carcinoma. Proc Natl Acad Sci U S A. 1990, 87 (17): 6547-6549. 10.1073/pnas.87.17.6547.
Fried MW, Shiffman ML, Reddy KR, Smith C, Marinos G, Goncales FL, Haussinger D, Diago M, Carosi G, Dhumeaux D, et al: Peginterferon alfa-2a plus ribavirin for chronic hepatitis C virus infection. N Engl J Med. 2002, 347 (13): 975-982. 10.1056/NEJMoa020047.
Manns MP, McHutchison JG, Gordon SC, Rustgi VK, Shiffman M, Reindollar R, Goodman ZD, Koury K, Ling M, Albrecht JK: Peginterferon alfa-2b plus ribavirin compared with interferon alfa-2b plus ribavirin for initial treatment of chronic hepatitis C: a randomised trial. Lancet. 2001, 358 (9286): 958-965. 10.1016/S0140-6736(01)06102-5.
Pawlotsky JM: Mechanisms of antiviral treatment efficacy and failure in chronic hepatitis C. Antiviral Res. 2003, 59 (1): 1-11. 10.1016/S0166-3542(03)00088-3.
Pawlotsky JM: Current and future concepts in hepatitis C therapy. Semin Liver Dis. 2005, 25 (1): 72-83. 10.1055/s-2005-864783.
Choo QL, Kuo G, Weiner AJ, Overby LR, Bradley DW, Houghton M: Isolation of a cDNA clone derived from a blood-borne non-a, non-B viral hepatitis genome. Science. 1989, 244 (4902): 359-362. 10.1126/science.2523562.
Murphy DG, Willems B, Deschenes M, Hilzenrat N, Mousseau R, Sabbah S: Use of sequence analysis of the NS5B region for routine genotyping of hepatitis C virus with reference to C/E1 and 5' untranslated region sequences. J Clin Microbiol. 2007, 45 (4): 1102-1112. 10.1128/JCM.02366-06.
Simmonds P, Bukh J, Combet C, Deleage G, Enomoto N, Feinstone S, Halfon P, Inchauspe G, Kuiken C, Maertens G, et al: Consensus proposals for a unified system of nomenclature of hepatitis C virus genotypes. Hepatology. 2005, 42 (4): 962-973. 10.1002/hep.20819.
Martell M, Esteban JI, Quer J, Genesca J, Weiner A, Esteban R, Guardia J, Gomez J: Hepatitis C virus (HCV) circulates as a population of different but closely related genomes: quasispecies nature of HCV genome distribution. J Virol. 1992, 66 (5): 3225-3229.
Domingo E, Martin V, Perales C, Grande-Perez A, Garcia-Arriaza J, Arias A: Viruses as quasispecies: biological implications. Curr Top Microbiol Immunol. 2006, 299: 51-82. 10.1007/3-540-26397-7_3.
Pawlotsky JM: Hepatitis C virus genetic variability: pathogenic and clinical implications. Clin Liver Dis. 2003, 7 (1): 45-66. 10.1016/S1089-3261(02)00065-X.
Forns X, Bukh J: The molecular biology of hepatitis C virus. Genotypes and quasispecies. Clin Liver Dis. 1999, 3 (4): 693-716. 10.1016/S1089-3261(05)70234-8. vii
Forns X, Purcell RH, Bukh J: Quasispecies in viral persistence and pathogenesis of hepatitis C virus. Trends Microbiol. 1999, 7 (10): 402-410. 10.1016/S0966-842X(99)01590-5.
Le Guillou-Guillemette H, Vallet S, Gaudy-Graffin C, Payan C, Pivert A, Goudeau A, Lunel-Fabiani F: Genetic diversity of the hepatitis C virus: impact and issues in the antiviral therapy. World J Gastroenterol. 2007, 13 (17): 2416-2426.
Enomoto N, Sakuma I, Asahina Y, Kurosaki M, Murakami T, Yamamoto C, Izumi N, Marumo F, Sato C: Comparison of full-length sequences of interferon-sensitive and resistant hepatitis C virus 1b. Sensitivity to interferon is conferred by amino acid substitutions in the NS5A region. J Clin Invest. 1995, 96 (1): 224-230. 10.1172/JCI118025.
Enomoto N, Sakuma I, Asahina Y, Kurosaki M, Murakami T, Yamamoto C, Ogura Y, Izumi N, Marumo F, Sato C: Mutations in the nonstructural protein 5A gene and response to interferon in patients with chronic hepatitis C virus 1b infection. N Engl J Med. 1996, 334 (2): 77-81. 10.1056/NEJM199601113340203.
Puig-Basagoiti F, Forns X, Furcic I, Ampurdanes S, Gimenez-Barcons M, Franco S, Sanchez-Tapias JM, Saiz JC: Dynamics of hepatitis C virus NS5A quasispecies during interferon and ribavirin therapy in responder and non-responder patients with genotype 1b chronic hepatitis C. J Gen Virol. 2005, 86 (Pt 4): 1067-1075.
Witherell GW, Beineke P: Statistical analysis of combined substitutions in nonstructural 5A region of hepatitis C virus and interferon response. J Med Virol. 2001, 63 (1): 8-16. 10.1002/1096-9071(200101)63:1<8::AID-JMV1001>3.0.CO;2-K.
Jain MK, Yuan HJ, Adams-Huet B, Reeck A, Shelton J, Attar N, Zhang S, Neumann AU, Carney DS, Gale M, et al: Pegylated interferon and ribavirin promote early evolution of nonstructural 5A protein in individuals with hepatitis C who demonstrate a response to treatment. J Infect Dis. 2009, 200 (6): 866-876. 10.1086/605475.
Pawlotsky JM, Germanidis G, Neumann AU, Pellerin M, Frainais PO, Dhumeaux D: Interferon resistance of hepatitis C virus genotype 1b: relationship to nonstructural 5A gene quasispecies mutations. J Virol. 1998, 72 (4): 2795-2805.
Brass V, Bieck E, Montserret R, Wolk B, Hellings JA, Blum HE, Penin F, Moradpour D: An amino-terminal amphipathic alpha-helix mediates membrane association of the hepatitis C virus nonstructural protein 5A. J Biol Chem. 2002, 277 (10): 8130-8139. 10.1074/jbc.M111289200.
Penin F, Brass V, Appel N, Ramboarina S, Montserret R, Ficheux D, Blum HE, Bartenschlager R, Moradpour D: Structure and function of the membrane anchor domain of hepatitis C virus nonstructural protein 5A. J Biol Chem. 2004, 279 (39): 40835-40843. 10.1074/jbc.M404761200.
Tellinghuisen TL, Marcotrigiano J, Gorbalenya AE, Rice CM: The NS5A protein of hepatitis C virus is a zinc metalloprotein. J Biol Chem. 2004, 279 (47): 48576-48587. 10.1074/jbc.M407787200.
Ide Y, Zhang L, Chen M, Inchauspe G, Bahl C, Sasaguri Y, Padmanabhan R: Characterization of the nuclear localization signal and subcellular distribution of hepatitis C virus nonstructural protein NS5A. Gene. 1996, 182 (1–2): 203-211.
Pawlotsky JM, Germanidis G: The non-structural 5A protein of hepatitis C virus. J Viral Hepat. 1999, 6 (5): 343-356. 10.1046/j.1365-2893.1999.00185.x.
Satoh S, Hirota M, Noguchi T, Hijikata M, Handa H, Shimotohno K: Cleavage of hepatitis C virus nonstructural protein 5A by a caspase-like protease(s) in mammalian cells. Virology. 2000, 270 (2): 476-487. 10.1006/viro.2000.0287.
Salmeron J, De Rueda PM, Ruiz-Extremera A, Casado J, Huertas C, Bernal Mdel C, Rodriguez L, Palacios A: Quasispecies as predictive response factors for antiviral treatment in patients with chronic hepatitis C. Dig Dis Sci. 2006, 51 (5): 960-967. 10.1007/s10620-006-9347-2.
Duverlie G, Khorsi H, Castelain S, Jaillon O, Izopet J, Lunel F, Eb F, Penin F, Wychowski C: Sequence analysis of the NS5A protein of european hepatitis C virus 1b isolates and relation to interferon sensitivity. J Gen Virol. 1998, 79 (Pt 6): 1373-1381.
Pascu M, Martus P, Hohne M, Wiedenmann B, Hopf U, Schreier E, Berg T: Sustained virological response in hepatitis C virus type 1b infected patients is predicted by the number of mutations within the NS5A-ISDR: a meta-analysis focused on geographical differences. Gut. 2004, 53 (9): 1345-1351. 10.1136/gut.2003.031336.
Gaudy C, Lambele M, Moreau A, Veillon P, Lunel F, Goudeau A: Mutations within the hepatitis C virus genotype 1b E2-PePHD domain do not correlate with treatment outcome. J Clin Microbiol. 2005, 43 (2): 750-754. 10.1128/JCM.43.2.750-754.2005.
Gerotto M, Dal Pero F, Pontisso P, Noventa F, Gatta A, Alberti A: Two PKR inhibitor HCV proteins correlate with early but not sustained response to interferon. Gastroenterology. 2000, 119 (6): 1649-1655. 10.1053/gast.2000.20230.
Bittar C, Jardim AC, Yamasaki LH, de Queiroz AT, Carareto CM, Pinho JR, de Carvalho-Mello IM, Rahal P: Genetic diversity of NS5A protein from hepatitis C virus genotype 3a and its relationship to therapy response. BMC Infect Dis. 2010, 10: 36-10.1186/1471-2334-10-36.
Nousbaum J, Polyak SJ, Ray SC, Sullivan DG, Larson AM, Carithers RL, Gretch DR: Prospective characterization of full-length hepatitis C virus NS5A quasispecies during induction and combination antiviral therapy. J Virol. 2000, 74 (19): 9028-9038. 10.1128/JVI.74.19.9028-9038.2000.
Jardim AC, Yamasaki LH, de Queiroz AT, Bittar C, Pinho JR, Carareto CM, Rahal P, Mello IM: Quasispecies of hepatitis C virus genotype 1 and treatment outcome with peginterferon and ribavirin. Infect Genet Evol. 2009, 9 (4): 689-698. 10.1016/j.meegid.2008.11.001.
Pawlotsky JM, Pellerin M, Bouvier M, Roudot-Thoraval F, Germanidis G, Bastie A, Darthuy F, Remire J, Soussy CJ, Dhumeaux D: Genetic complexity of the hypervariable region 1 (HVR1) of hepatitis C virus (HCV): influence on the characteristics of the infection and responses to interferon alfa therapy in patients with chronic hepatitis C. J Med Virol. 1998, 54 (4): 256-264. 10.1002/(SICI)1096-9071(199804)54:4<256::AID-JMV4>3.0.CO;2-3.
Toyoda H, Kumada T, Nakano S, Takeda I, Sugiyama K, Osada T, Kiriyama S, Sone Y, Kinoshita M, Hadama T: Quasispecies nature of hepatitis C virus and response to alpha interferon: significance as a predictor of direct response to interferon. J Hepatol. 1997, 26 (1): 6-13. 10.1016/S0168-8278(97)80002-5.
Ueda E, Enomoto N, Sakamoto N, Hamano K, Sato C, Izumi N, Watanabe M: Changes of HCV quasispecies during combination therapy with interferon and ribavirin. Hepatol Res. 2004, 29 (2): 89-96. 10.1016/j.hepres.2004.02.014.
Saludes V, Bracho MA, Valero O, Ardevol M, Planas R, Gonzalez-Candelas F, Ausina V, Martro E: Baseline prediction of combination therapy outcome in hepatitis C virus 1b infected patients by discriminant analysis using viral and host factors. PLoS One. 2010, 5 (11): e14132-10.1371/journal.pone.0014132.
Farci P, Strazzera R, Alter HJ, Farci S, Degioannis D, Coiana A, Peddis G, Usai F, Serra G, Chessa L, et al: Early changes in hepatitis C viral quasispecies during interferon therapy predict the therapeutic outcome. Proc Natl Acad Sci U S A. 2002, 99 (5): 3081-3086. 10.1073/pnas.052712599.
Xu Z, Fan X, Xu Y, Di Bisceglie AM: Comparative analysis of nearly full-length hepatitis C virus quasispecies from patients experiencing viral breakthrough during antiviral therapy: clustered mutations in three functional genes, E2, NS2, and NS5a. J Virol. 2008, 82 (19): 9417-9424. 10.1128/JVI.00896-08.
Figlerowicz M, Jackowiak P, Formanowicz P, Kedziora P, Alejska M, Malinowska N, Blazewicz J: Hepatitis C virus quasispecies in chronically infected children subjected to interferon-ribavirin therapy. Arch Virol. 2010, 155 (12): 1977-1987. 10.1007/s00705-010-0789-7.
Ramirez S, Perez-del-Pulgar S, Carrion JA, Coto-Llerena M, Mensa L, Dragun J, Garcia-Valdecasas JC, Navasa M, Forns X: Hepatitis C virus superinfection of liver grafts: a detailed analysis of early exclusion of non-dominant virus strains. J Gen Virol. 2010, 91 (Pt 5): 1183-1188.
Ray SC, Wang YM, Laeyendecker O, Ticehurst JR, Villano SA, Thomas DL: Acute hepatitis C virus structural gene sequences as predictors of persistent viremia: hypervariable region 1 as a decoy. J Virol. 1999, 73 (4): 2938-2946.
Zekri AR, El-Din HM, Bahnassy AA, Khaled MM, Omar A, Fouad I, El-Hefnewi M, Thakeb F, El-Awady M: Genetic distance and heterogenecity between quasispecies is a critical predictor to IFN response in egyptian patients with HCV genotype-4. Virol J. 2007, 4: 16-10.1186/1743-422X-4-16.
Pellerin M, Lopez-Aguirre Y, Penin F, Dhumeaux D, Pawlotsky JM: Hepatitis C virus quasispecies variability modulates nonstructural protein 5A transcriptional activation, pointing to cellular compartmentalization of virus-host interactions. J Virol. 2004, 78 (9): 4617-4627. 10.1128/JVI.78.9.4617-4627.2004.
Cuevas JM, Torres-Puente M, Jimenez-Hernandez N, Bracho MA, Garcia-Robles I, Wrobel B, Carnicer F, del Olmo J, Ortega E, Moya A, et al: Genetic variability of hepatitis C virus before and after combined therapy of interferon plus ribavirin. PLoS One. 2008, 3 (8): e3058-10.1371/journal.pone.0003058.
Fan W, Zhu W, Wei L, Wang Q, Yin L, Du S, Zhuang H: Nonstructural 5A gene variability of hepatitis C virus (HCV) during a 10-year follow up. J Gastroenterol. 2005, 40 (1): 43-51. 10.1007/s00535-004-1446-2.
Sauter D, Himmelsbach K, Kriegs M, Carvajal Yepes M, Hildt E: Localization determines function: N-terminally truncated NS5A fragments accumulate in the nucleus and impair HCV replication. J Hepatol. 2009, 50 (5): 861-871. 10.1016/j.jhep.2008.11.024.
Appel N, Pietschmann T, Bartenschlager R: Mutational analysis of hepatitis C virus nonstructural protein 5A: potential role of differential phosphorylation in RNA replication and identification of a genetically flexible domain. J Virol. 2005, 79 (5): 3187-3194. 10.1128/JVI.79.5.3187-3194.2005.
Moradpour D, Evans MJ, Gosert R, Yuan Z, Blum HE, Goff SP, Lindenbach BD, Rice CM: Insertion of green fluorescent protein into nonstructural protein 5A allows direct visualization of functional hepatitis C virus replication complexes. J Virol. 2004, 78 (14): 7400-7409. 10.1128/JVI.78.14.7400-7409.2004.
Appel N, Zayas M, Miller S, Krijnse-Locker J, Schaller T, Friebe P, Kallis S, Engel U, Bartenschlager R: Essential role of domain III of nonstructural protein 5A for hepatitis C virus infectious particle assembly. PLoS Pathog. 2008, 4 (3): e1000035-10.1371/journal.ppat.1000035.
Gordon D, Abajian C, Green P: Consed: a graphical tool for sequence finishing. Genome Res. 1998, 8 (3): 195-202.
Ewing B, Green P: Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 1998, 8 (3): 186-194.
Ewing B, Hillier L, Wendl MC, Green P: Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 1998, 8 (3): 175-185.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 25 (24): 4876-4882. 10.1093/nar/25.24.4876.
Hall TA: BioEdit: a user-friendly biological sequence alignment editor and analysis program for windows 95/98/NT. Nucleic Acids Symposium Series. 1999, 41: 95-98.
Tamura K, Dudley J, Nei M, Kumar S: MEGA4: molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24 (8): 1596-1599. 10.1093/molbev/msm092.
Marucci EA, Zafalon GF, Jardim AC, Yamasaki LH, Bittar C, Rahal P, Machado JM: Routine libraries for pattern recognition in quasispecies. Genet Mol Res. 2008, 7 (3): 970-981. 10.4238/vol7-3X-Meeting013.
Drummond AJ, Rambaut A: BEAST: bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007, 7: 214-10.1186/1471-2148-7-214.
Swofford DL: PAUP* phylogenetic analysis using parsimony (*and others methods). Version 4 edn. 2003, Sunderland, Massachusetts: Sinauer Associates
Posada D, Crandall KA: MODELTEST: testing the model of DNA substitution. Bioinformatics. 1998, 14 (9): 817-818. 10.1093/bioinformatics/14.9.817.
McCormack GP, Clewley JP: The application of molecular phylogenetics to the analysis of viral genome diversity and evolution. Rev Med Virol. 2002, 12 (4): 221-238. 10.1002/rmv.355.
Yang Z: Inference of selection from multiple species alignments. Curr Opin Genet Dev. 2002, 12 (6): 688-694. 10.1016/S0959-437X(02)00348-9.
Pond SL, Frost SD, Muse SV: HyPhy: hypothesis testing using phylogenies. Bioinformatics. 2005, 21 (5): 676-679. 10.1093/bioinformatics/bti079.
Yang Z: PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci. 1997, 13 (5): 555-556.
Yang Z: Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Mol Biol Evol. 1998, 15 (5): 568-573. 10.1093/oxfordjournals.molbev.a025957.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2334/13/61/prepub
We are grateful to FAPESP “Fundação de Amparo à Pesquisa do Estado de São Paulo – FAPESP” for grants that supported this work (projects with process numbers 07/52073-0 and grant with process number 06/60012-9).
The authors have declared that no competing interests exist.
ACGJ participated in the design of the study, performed experiments, analyzed the data and drafted the manuscript. RPAM performed experiments. CB, RAS and CMAC analyzed the data and critically revised the manuscript. LHTY analyzed the data. JRRP designed the study and contributed to acquisition and analysis of data. RMF contributed to acquisition and analysis of data. IMVGCM designed the study, analyzed the data and critically revised the manuscript. PR designed and coordinated the study and helped to draft the manuscript. All authors read and approved the final manuscript.