Characterizing the Zika virus genome

The sudden emergence of the Zika virus epidemic in Latin America in 2015-16 has caught the scientific world unawares. A little known disease that was first diagnosed in the Zika forest environment of Uganda in 1947, the disease largely affected populations in Africa until its emergence in French Polynesia a few years ago and then in Brazil and South America last year. The Zika virus is spread mainly by the Aedes aegypti mosquitoes and, like the dengue virus, belongs to the flaviviridae family along with Japanese encephalitis and West Nile virus. As of now there are no known cures or prophylactics for Zika virus infections that cause chills, aches and fevers, but what makes it worse is the association of microcephaly (small brain case) in newborns and other neurological diseases such as the Guillian-Barre syndrome. The Olympic event in Rio this year is another cause for concern on the possibilities of increased spread of the disease around the world.

While governments in Latin America and elsewhere are concentrating on vector control and increasing citizen awareness, research labs around the world are deeply engaged in developing drugs and vaccines against the Zika virus. In this process understanding the character of the Zika virus in terms of its molecular biology is of vital concern, for the effectiveness of drugs and vaccines will largely depend on what we can fundamentally understand of the virus. The availability of nucleotide and protein sequence data has enabled this study using computer-based approaches to characterize the virus in terms of various biological properties of interest like phylogenetic relationships, transition-transversion ratios, amino acid usage, codon usage bias and hydropathy index.

Our studies showed that the Zika virus has close parallels to the dengue virus in several aspects such as transition-transversion ratios, preferential vector for its spread, genome structure and the like, with the genome structure being more convoluted with a large amount of complex base distribution along the sequence; these have been clearly evident in some of the 2D graphical representation plots. We have also reported very noticeable differences between the viral sequences isolated in African countries and the isolates from more recent periods in Asia, the Pacific regions and the Americas. The codon usage and amino acid counts also show remarkable differences in frequencies of some residues. These aspects of the Zika virus genome suggest qualitative changes have taken place between the older African sequences and the more recent variants circulating in the Asia-Pacific-America region. Since the outbreak in Yap island in 2007 and the recent epidemic in Brazil in 2015 have led to serious infections in pregnant women and reported cases of novel transmission through sexual contact, it is possible that some of the changes determined through this research could constitute factors facilitating this pathogenicity. Given the propensity of humans to travel and the consequences of climate change which are likely to aid the spread of the disease, these bioinformatics studies could form the basis of effective surveillance and closer monitoring of the genomes of the Zika virus. Some of the present authors have suggested computer-assisted approaches towards such surveillance and consequent design of drugs and vaccines to combat the growth and spread of Zika and other zoonotic viruses, which may be taken under advisement for the benefit of the populations at large.

pic Characterizing the Zika virus genome. Credit: Ashesh Nandy, Bentham Science Publishers

source: Bentham Science Publishers