Study On Genome Sequence of Novel Coronavirus (SARS-CoV-2) Strains in Different Countries

1Department of Pharmacy, Madhav University, Abu Road, Pindwara, Rajasthan, India. 2Department of Pharmaceutical technology (Biotechnology), National institute of Pharmaceutical Education and Research, Mohali, sector 67, SAS Nagar, Punjab, India. 3School of Pharmacy, Management and Science University,Seksyen 13, 40100 Shah Alam, Selangor, Malaysia. 4Department of Pharmaceutical Sciences, College of Clinical Pharmacy, King Faisal University, Al-Ahsa-31982, Saudi Arabia. 5Department of Pharmaceutics, Vidya Siri College of Pharmacy, Off Sarjapura Road, Bengaluru 560 035, Karnataka, India. *Corresponding Author E-mail: vishaltrivediqa@gmail.com

2019-nCoV are large about 120 to 160nm long, spherical particles with a linear, nonsegmented, capped, and polyadenylated positivesense. 1,2 It is a single-stranded RNA genome that is encapsulated in a helical nucleocapsid. The envelope is made up of intracellular membranes and have a characteristic crown of widely spaced club-shaped spikes that are 12 to 24 nmlong. 3,4 Based ongenetic and serological relationships coronavirus can be classified into three groups. 5,6 The coronavirus genomes are the largest of the known RNA viruses (27 to 31.5 kb) and are polycistronic, generating a nested set of subgenomic RNAs with common 5 and 3 sequences. 7,8 The 5 two-thirds of the genome consists of two large replicase open reading frames (ORFs), ORF1a and ORF1b. The ORF1a polyprotein (pp1a) can be extended with ORF1b-encoded sequences via a 1 ribosomal frame shift at a conserved slippery site, generating the 7,000-amino-acid polyprotein pp1ab, which includes the putative RNA-dependent RNA polymerase (RdRp) and RNA helicase (HEL) activity. 9,10,11 Virus isolation A Pathogen free human airway epithelial (HAE)cells were used for virus isolation. 12 Briefly, bronchoalveo larlavage fluids or throat swabs from the patients were inoculated into the HAE cells through the apical surfaces. HAE cells were maintained in an air-liquid interface incubated at 37°C. The cells were monitored daily for cytopathic effects by light microscopy and the cells upernatants were collected for use in quantitative RT-PCR assays. 13,14,15 After three passages, apical samples were collected for sequencing. 16

Genome Sequences of SAR-CoV-2
The genome of SARS-CoV-2 is a 29,903nucleotide, polyadenylated RNA, and 41% of the residues are G or C. 17 The genomic organization of coronaviruses, having the characteristic gene order [5replicase (rep), spike (S), envelope (E),membrane (M), and nucleocapsid (N)-3] and short untranslated regions at both termini. 18,19 The SARS-CoV-2 rep gene,which comprises approximately two-thirds of the genome, is predicted to encode two polyproteinsORF1a and ORF1b that undergo cotranslational proteolytic processing. 20 Downstream of four open reading frames (ORFs)are used to encode the structural proteins S, E, M, and N, which are common to all known coronaviruses. 21,22 Hemagglutininesterase, are present between ORF1b and S of coronaviruses. These coronaviruses encoded non-structural proteins which are located between S and E, between M and N, or downstream of N. 23,24 The genome of SARS-CoV-2 contains ORFs for five potential non-structural proteins that are more than 50 amino acids long in these intergenic regions. 25 overlapping of ORFs encoding show the proteins of 274 and 154 amino acids are located between S and E. Three non-structural genes, X3, X4, and X5 are located between M and N. 26,27 GenBank database (BLAST and FastA) show that there is no significant sequence similarity between these potential non-structural proteins ofSARS-CoV-2 and any other proteins. 28,29 ORFs encoding proteins are more than 50 amino acids long in the structuralgenes of SARS-CoV-2 (such as N, S, and rep). 30,31 SARS-CoV-2 repgene products have genomic RNA, and the remainingviral proteins are translated from sub genomicm RNAs that form a 3-coterminal nested set,each with a 5-end derived from the genomic 5leader sequence. 32,33 Sub genomicm RNAs are synthesized from a discontinuous transcription process. The SARS-CoV-2 leader sequence was mapped by comparing the sequence of rapid amplification of cDNA ends. 34,35 The products which are synthesized from the N gene mRNA with those synthesized from genomic RNA. 36 A sequence, AAACGAAC (genomic nucleotides 65to 72), was identified where the N gene mRNA and genomic sequences diverged. 37 This sequence was also present upstream of ORF1a and immediately upstream of five other ORFs suggesting that it functions as the conserved core of the transcription-regulating sequences (TRSs). 38,39 The nucleotides required for TRS function must be identified experimentally. 40 Theproduction of sub genomic mRNAs of SAR-CoV-2 proposes that discontinuous transcription has occurs during the synthesis of negative strand. Sub-genomic negative strands containing a complimentary copy of the leader sequence at their 3termini serve as templates for synthesis of sub-genomic mRNAs. 41,42 At the site of 5terminus of the genome, the TRS conserved core sequence appears six times in the remainder of the genome. 43 The positions of TRS in the genome of SARS-CoV-2 predict that subgenomic mRNAs of 8.3, 4.5, 3.4, 2.5, 2.0, and1.7 kb, not including the poly(A) tail. Atleast five subgenomic mRNAs were detected byhybridization of RNA from SARS-CoV-2 infected cells. 44,45 Sizes of the five predominant bands correspond to the sizes of five of the predicted sub genomic mRNAs of SARS-CoV-2. Full-length of genomic RNA was not detected, probably because it is the least prevalent viral RNA in infected cells. 46,47 The predicted 2.0-kb transcript was also not detected, which suggests that the consensus TRS at 27,771 to 27,778 is not used or that it is a low-abundance mRNA. 48 Byanalogy with other coronaviruses, the 8.3-kband 1.7-kb sub-genomic mRNAs are predicted tobe monocistronic, directing translation of S andN, respectively, whereas multiple proteins could be translated from the 4.5-kb,3.4-kb and 2.5-kb mRNAs. It is possible that the 3.4-kb band contained more than one mRNA species that was not resolved in the gel or that the monocistronicmRNA. 49,50,51 CDC sequencing strategy The whole-genome sequences of SAR-CoV-2 were generated by a combination. Sanger, Illumina, and Oxford nanopore sequencing were used for this detection. 52 In this CDC sequencing strategy at First viral RNAs were extracted directly from clinical samples with the QIAamp Viral RNA Mini Kit, and then used to synthesise cDNA with the Super Script III Reverse Transcriptase andN6 random primers, followed by second-strand synthesis with DNA Polymerase I, Large Fragment. 53,54 Viral cDNA libraries were prepared with use of the Nextera XT Library Prep Kit (Illumina), then purified with Agen court AMPure XP beads followed by quantification with an Invitrogen Qubit 2.0Fluorometer. 55,56 The resulting DNA libraries were sequenced on either the MiSeq or iSeq platforms (Illumina) using a300-cycle reagent kit. About 1·2-5 GB of data wereobtained. 57 The raw fastQ files were filtered then subjected tode novo assembly with the CLCBio software. Mapped assemblies  66 On the basis of the genome sequences obtained, a real-time PCR detection assay was developed. PCR primers and probes were designed using Applied Biosystems Primer Express Software on the basis of our sequenced virus genomes. Primers and probes were synthesised by BGI. RT-PCR was done with an Applied Biosystems 7300 Real-Time PCR. 67,68 It is a nuclear-derived method for detection of genetic material from any pathogen. Radioactive isotope markers are used to detect targeted genetic materials. Fluorescent dyes were used to replace the isotopic labelling. While real time RT-PCR is now the most widely used method for detecting coronaviruses in many countries. 69,70,71 (SARS-CoV-2) only contains RNA. In SARS-CoV-2 RNA take control and reprogrammed the cells so that they multiple and make more viruses. SAR-CoV-2 detected in the body using real time RT-PCR. In which a sample is taken from infected person body like nose or throat. 72,73,74 The taken sample is treated with several chemical solutions that remove substances,only the RNA is been extracted from the taken sample. The RNA is reverse transcription process. By using specific enzymes this RNA was converted into DNA. short fragments of DNA are added that are complementary to specific parts of the transcribed viral DNA. These fragments of DNA to target sections of the viral DNA. 75,76,77 The genetic fragments are for building DNA strands during amplification, while the others are for building the DNA and adding marker labels to the strands, which are then used to detect the virus. These mixers then placed in a RT-PCR machine. 78,79 The machine cycles heat and cool the mixture chemical reactions this create new, identical copies of the target sections of viral DNA. This cycle can repeat over and over to continue copying the target sections of viral DNA. Each cycle doubles the amount: two copies become four, four copies become eight, and so on. 80,81 A standard real time RT-PCR setup goes through 35 cycles, which means that by the end of the process, around 35 billion new copies of the sections of viral DNA are created. the marker labels and a fluorescent dye were used to measured. 82,83 Computer's has been used for tracking the amount of fluorescence in the taken sample after each cycle. If the amount of fluorescencegoes over a certain level, it gives the conformation of virus present. 84 This technique is highly sensitive and specific. It can deliver a reliable diagnosis as fast as three hours. by Comparing with other available virus isolation methods, real time RT-PCR is much faster and it has a lower potential for contamination or errors. the entire process can be done within a closed tube. It is the most accurate method available for detection of the SAR-CoV-2. 85,86,87

Effects of Genome sequence COVID-19 on different countries
This study shows the effects on changing in genomic sequences of COVID 19 on different countries. It was stated from Wuhan city, china and spread all over the world. In December 2019, an outbreak of apparently viral pneumonia of unknown ethology emerged in the city of Wuhan, in the Chinese province of Hubei. Chinese scientist done 7,700 genome sequences of COVID-19. These genome sequences help to vital tracking, how the virus mutates time by time as it spread and also used for development of diagnostic tests and vaccines. 88 Data shows that Africa and south America has only 1% of genome sequencing done on COVID-19. Whereas Europe and north America has the highest genome sequencing done. There should be more genomic data from Africa and south America for vaccine development. 89 9 January, 2020, health authorities of Chinese and the World Health Organization (WHO) found and announces the novel coronavirus, known as SAR-CoV-2, these agents responsible for the pneumonia cases. On 11 and 12 January, the health authorities of Chinese shared the full genome sequence of SAR-CoV-2. The genome sequences of SAR-CoV-2 is crucial for the development of specific diagnostic tests and the identification of potential treatment options. Chinese scientists claim there are two main strain of SAR-CoV-2 in humans, indicating that the virus is mutating. Researchers of china say the COVID-19 virus, which has since been renamed SARS-CoV2, has evolved into two major lineages, known as "L" and "S" types. The newer and more aggressive L type strain accounted for about 70 per cent of the analysed cases, the researchers said, while the rest were linked to the older S type version. 90,91 On Friday 24 January, 2020, detection of the corona virus was confirmed in France.
India researchers [National Institute of Virology (NIV), Pune and Gujarat Biotechnology Research Centre (GBRC)] decoding the entire genome sequence of the novel coronavirus, and confirmed that there are three new mutations has occurred. These viruses mutate in order to adapt to and survive in different situations. It was mutated when it will try to control by medicines. This virus is mutating fast. After establishing the genome sequence then analyse the genome in them and after that formulate a strategy to fight the virus.
World Health Organization, confirmed that the diagnosis of COVID-19 should be done by RT-PCR or gene sequencing. 92 The RT-PCR test is a highly sensitive. It is a gene based diagnostic test. It is much more reliable and much better than the conventional test. In united states of America, Researchers have examined the genomes sequences of coronavirus say the similarities between the cases suggest that the virus may have been spreading in the state for weeks. On 20 January, by the analysis of genome sequences, centre of disease control announced that Washington had the United States' first confirmed case of coronavirus. Peoples who live in the same county, but did not known to have any contact with one another. By the study of genomic sequences, it has been concluded that the virus has been spreading through other people in the community. U.S.A. scientist also concluded that these viruses are independently would arrive in the same geographical area and be genetically related unless they were connected. 93 In united states death rate is very much high as compared to another countries. In USA, Human-to-human transmission has been reported too much. On 27 February, the first case of community spread in the US was reported, a person who did not have any travel history to the coronavirus-affected regions. New York has reported the highest number of coronavirus cases and deaths in the US and also accounts for 40% of deaths in the country. The government announced a disaster declaration for the state and the National Guard has been activated. The US passed a bill on 13 March making coronavirus tests free for its residents. The CDC developed the RT-PCR diagnostic panel for testing patient specimens for COVID-19. It was approved for emergency use by the U.S. Food and Drug Administration (FDA) under Emergency Use Authorization (EUA).The CDC later announced that it would use its existing seasonal influenza surveillance system for testing coronavirus cases in five states including Los Angeles, San Francisco, Seattle, Chicago, and New York. 94,95 India has recorded 78,194 confirmed cases, 2,551deaths, 26,400 cured cases, according to the latest update released by the Union Ministry for Health and Family Welfare.
Covid-19 is not easy to diagnose. Real Time-PCR can be used for virus testing followed by gene fingerprinting. There are 15 labs in India were the test has been done so far and Nineteen will be made operational as soon as possible. So far, specimens of suspected novel coronavirus patients are tested at the National Centre for Disease Control (NCDC) in Delhi, Indian Council of Medical Research (ICMR)'s laboratories at Alappuzha, Bengaluru, Hyderabad and Mumbai, besides the National Institute of Virology (NIV), Pune. Under the guidance of NIV these labs are worked. In this test, a nucleic acid amplification-based assay called polymerase chain reaction (PCR) are used and a more sensitive form called reverse-transcriptionpolymerase chain reaction(RTPCR).According to recommendations by the World Health Organization. COVID-19 diagnosis must be confirmed by RT-PCR or gene sequencing for respiratory or blood specimens. It is much more reliable and much better than the conventional test. 96 China take immediate precautions to stop the spread of the virus and provide diagnosis and treatment in-time for reduction the impacts. On 23 January, the Chinese Civil Aviation Administration guided foreign airlines to reduce their scheduled flights to Wuhan, when necessary, to prevent and control the spread of pneumonia caused by coronavirus. Chinese's government take a number of steps to stop the outbreak, which are: -• Pooling trained medical personnel from across the country • Ensuring supplies to the most vulnerable and affected areas • Construction of new temporary coronavirus hospitals on a war-footing • Extending holidays for schools and businesses as well as the Chinese Lunar New Year holiday Researchers has allowed for gene testing, epidemic control and research institutions to use Linear Fold, its RNA structure algorithm that will help to understand the virus and screening compounds in less than half-a-minute compared to approximately an hour earlier. Bangkok (Thailand) is currently at risk from a spread of the virus. Hong Kong (China) is second on the list, followed by Taipei (Taiwan, the Republic of China).
The most 'at-risk' countries or regions worldwide are Sydney, New York, London, Thailand, Japan and Hong Kong. USA, Australia and the UK. In mainland of China, the cities of Beijing, Guangzhou, Shanghai, Chongqing, Guangdong, Zhejiang, Sichuan and Henan are all identified as high-risk by the researchers. A common factor behind the deaths recorded in china was smoking, those who with already damaged lungs are more likely to succumb to pneumonia. According to WHO, Italy has been hit so badly, because of the age of its population. The country has the oldest average age in Europe, second oldest in the world and coronavirus disproportionately affects the elderly, whose immune system may not strong enough to stave off the pneumonia the virus causes. 97,98 First confirmed cases in Brazil came from Italy. On 28 January, the first Brazilian sequence closely resembles to the sequenced in Germany and it was different from the genome sequenced in Wuhan, the epicentre of the epidemic in China. This virus mutates relatively little, once a month on average, so there's no point in sequencing small pieces of the genome. on February 29, the researchers isolated the sequence of whole genome of the coronavirus from the second Brazilian patient diagnosed with COVID-19. Completed only 24 hours after confirmation of the case in Brazil, the study shows that there are differences between the genomes of the viruses isolated from the first and second patients. "The second genomic sequence of Brazilian is more similar to the genome sequenced in the United Kingdom, and it both differ from the Chinese sequences. This show that the COVID-19 epidemic is maturing in Europe in the sense that internal transmission is already occurring in European countries.
The main advantage of real-time epidemic monitoring. It is used to exactly identifying,where the virus came from. This information is useful to guide containment actions and help reduce dissemination of the disease. on February 26, the first case of COVID-19 was diagnosed in Brazil by molecular biology. 99