ID MG023235; SV 1; linear; genomic RNA; STD; VRL; 351 BP. XX AC MG023235; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GII strain NoV/Hu/GII.4_Sydney/VIR606F/BRAZIL/2014 capsid protein DE VP1 gene, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; cb82fc8887d14ce4a11238077b85f8a8. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..351 FT /organism="Norovirus GII" FT /host="Homo sapiens" FT /strain="NoV/Hu/GII.4_Sydney/VIR606F/BRAZIL/2014" FT /isolate="VIR606F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="15-Oct-2014" FT /note="genotype: GII.4_Sydney" FT /db_xref="taxon:122929" FT CDS 64..>351 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K392" FT /protein_id="ATN44781.1" FT /translation="MKMASSDANPSDGSAANLVPEVSNEVMALEPVVGAAIAAPVAGQQ FT NVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMY" XX SQ Sequence 351 BP; 82 A; 92 C; 96 G; 81 T; 0 other; ggatgagatt ctcagatctg agcacgtggg agggcgatcg caatctggct cccagttttg 60 tgaatgaaga tggcgtcgag tgacgccaac ccatctgatg ggtccgcagc caacctcgtc 120 ccagaggtca gcaatgaggt tatggctctg gagcccgttg ttggtgccgc cattgcggca 180 cctgtagcgg gccagcaaaa tgtaattgac ccctggatta gaaataattt tgtacaagcc 240 cctggtggag agtttacagt atcccctaga aacgctccag gtgaaatact atggagcgcg 300 cccttgggcc ctgatctaaa tccctaccta tcccatctgg ccagaatgta c 351 // ID MG023236; SV 1; linear; genomic RNA; STD; VRL; 351 BP. XX AC MG023236; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GII strain NoV/Hu/GII.4_Sydney/VIR615F/BRAZIL/2014 capsid protein DE VP1 gene, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; cb82fc8887d14ce4a11238077b85f8a8. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..351 FT /organism="Norovirus GII" FT /host="Homo sapiens" FT /strain="NoV/Hu/GII.4_Sydney/VIR615F/BRAZIL/2014" FT /isolate="VIR615F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="29-Oct-2014" FT /note="genotype: GII.4_Sydney" FT /db_xref="taxon:122929" FT CDS 64..>351 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K379" FT /protein_id="ATN44782.1" FT /translation="MKMASSDANPSDGSAANLVPEVSNEVMALEPVVGAAIAAPVAGQQ FT NVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMY" XX SQ Sequence 351 BP; 82 A; 92 C; 96 G; 81 T; 0 other; ggatgagatt ctcagatctg agcacgtggg agggcgatcg caatctggct cccagttttg 60 tgaatgaaga tggcgtcgag tgacgccaac ccatctgatg ggtccgcagc caacctcgtc 120 ccagaggtca gcaatgaggt tatggctctg gagcccgttg ttggtgccgc cattgcggca 180 cctgtagcgg gccagcaaaa tgtaattgac ccctggatta gaaataattt tgtacaagcc 240 cctggtggag agtttacagt atcccctaga aacgctccag gtgaaatact atggagcgcg 300 cccttgggcc ctgatctaaa tccctaccta tcccatctgg ccagaatgta c 351 // ID MG023237; SV 1; linear; genomic RNA; STD; VRL; 351 BP. XX AC MG023237; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GII strain NoV/Hu/GII.8/VIR628F/BRAZIL/2014 capsid protein VP1 DE gene, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; e95df9919da88ee4a877cb1e701a44d5. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..351 FT /organism="Norovirus GII" FT /host="Homo sapiens" FT /strain="NoV/Hu/GII.8/VIR628F/BRAZIL/2014" FT /isolate="VIR628F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="26-Nov-2014" FT /note="genotype: GII.8" FT /db_xref="taxon:122929" FT CDS 64..>351 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K3A6" FT /protein_id="ATN44783.1" FT /translation="MKMASNDAAPSNDGAAGLVPEINHEVMAIEPVAGASLAAPVVGQL FT NIIDPWIRNNFVQAPAGEFTVSPRNAPGEFLLDLELGPELNPYLAHLARMY" XX SQ Sequence 351 BP; 91 A; 83 C; 87 G; 90 T; 0 other; ggatgagatt ttcagacctc agcacgtggg agggcgatcg caatctggct cccgagaatg 60 tgaatgaaga tggcgtcgaa tgacgcagct ccatcgaatg atggcgcggc tggcctcgta 120 ccagagatca accatgaggt catggccata gagcctgttg caggagcctc tttagcagcc 180 cctgtcgtag gacaacttaa tataattgat ccctggatta gaaataattt tgtacaagcc 240 cctgctggag aattcactgt ttcgcccaga aatgctccag gtgaattttt attagatcta 300 gagttaggtc cagaattaaa tccctatctt gctcaccttg cacgcatgta c 351 // ID MG023238; SV 1; linear; genomic RNA; STD; VRL; 351 BP. XX AC MG023238; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GII strain NoV/Hu/GII.4_Sydney/VIR630F/BRAZIL/2014 capsid protein DE VP1 gene, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; 4f142535466082fd858ec615c7458da3. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..351 FT /organism="Norovirus GII" FT /host="Homo sapiens" FT /strain="NoV/Hu/GII.4_Sydney/VIR630F/BRAZIL/2014" FT /isolate="VIR630F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="28-Nov-2014" FT /note="genotype: GII.4_Sydney" FT /db_xref="taxon:122929" FT CDS 64..>351 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K3B1" FT /protein_id="ATN44784.1" FT /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQQ FT NVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMY" XX SQ Sequence 351 BP; 83 A; 92 C; 95 G; 81 T; 0 other; ggatgagatt ctcagatctg agcacgtggg agggcgatcg caatctggct cccagttttg 60 tgaatgaaga tggcgtcgag tgacgccaac ccatctgatg ggtccgcagc caacctcgtc 120 ccagaggtca acaatgaggt tatggctctg gagcccgttg ttggtgccgc cattgcggca 180 cctgtagcgg gccagcaaaa tgtaattgac ccctggatta gaaataattt tgtacaagcc 240 cctggtggag agtttacagt atcccctaga aacgctccag gtgaaatact atggagcgcg 300 cccttgggcc ctgatctaaa tccctaccta tcccatctgg ccagaatgta c 351 // ID MG023239; SV 1; linear; genomic RNA; STD; VRL; 351 BP. XX AC MG023239; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GII strain NoV/Hu/GII.6/VIR698F/BRAZIL/2015 capsid protein VP1 DE gene, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; 9da0321b39b6a490034efd600d441e11. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..351 FT /organism="Norovirus GII" FT /host="Homo sapiens" FT /strain="NoV/Hu/GII.6/VIR698F/BRAZIL/2015" FT /isolate="VIR698F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="29-Apr-2015" FT /note="genotype: GII.6" FT /db_xref="taxon:122929" FT CDS 64..>351 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K363" FT /protein_id="ATN44785.1" FT /translation="MKMASNDAAPSNDGAANLVPEANNEVMALEPVVGASIAAPVVGQQ FT NIIDPWIRENFVQAPQGEFTVSPRNSPGEMLLNLELGPELNPYLSHLSRMY" XX SQ Sequence 351 BP; 87 A; 84 C; 92 G; 88 T; 0 other; ggatgagatt ctctgacctc agcacatggg agggcgatcg caatcttgct cccgagagtg 60 tgaatgaaga tggcgtcgaa tgacgccgct ccatcaaatg atggtgctgc caacctcgta 120 ccagaggcca ataatgaggt tatggcactt gaaccggtgg tgggagcttc aatcgcagct 180 cctgttgtcg gtcagcagaa tataattgac ccctggatta gagaaaactt tgttcaagca 240 ccacagggcg agtttactgt ttcaccaagg aactcgcccg gtgagatgct tctaaatctt 300 gaattgggcc cagagcttaa tccctatttg agtcatttgt cccgcatgta c 351 // ID MG023240; SV 1; linear; genomic RNA; STD; VRL; 351 BP. XX AC MG023240; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GII strain NoV/Hu/GII.4_Sydney/VIR724F/BRAZIL/2015 capsid protein DE VP1 gene, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; b281b4d0ef7144690ea2a9db89da1044. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..351 FT /organism="Norovirus GII" FT /host="Homo sapiens" FT /strain="NoV/Hu/GII.4_Sydney/VIR724F/BRAZIL/2015" FT /isolate="VIR724F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="05-Jun-2015" FT /note="genotype: GII.4_Sydney" FT /db_xref="taxon:122929" FT CDS 64..>351 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K384" FT /protein_id="ATN44786.1" FT /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQQ FT NVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMY" XX SQ Sequence 351 BP; 82 A; 93 C; 96 G; 80 T; 0 other; ggatgagatt ctcagatctg agcacgtggg agggcgatcg caatctggct cccagttttg 60 tgaatgaaga tggcgtcgag tgacgccaac ccatctgatg ggtccgcagc caacctcgtc 120 ccagaggtca acaatgaggt tatggctctg gagcccgttg ttggtgccgc cattgcggca 180 cctgtagcgg gccagcaaaa tgtaattgac ccctggatta gaaataattt tgtacaagcc 240 cctggtggag agtttacagt gtcccctaga aacgctccag gtgaaatact atggagcgcg 300 cccttgggcc ctgacctaaa tccctaccta tcccatctgg ccagaatgta c 351 // ID MG023241; SV 1; linear; genomic RNA; STD; VRL; 351 BP. XX AC MG023241; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GII strain NoV/Hu/GII.4_Sydney/VIR726F/BRAZIL/2015 capsid protein DE VP1 gene, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; 4f142535466082fd858ec615c7458da3. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..351 FT /organism="Norovirus GII" FT /host="Homo sapiens" FT /strain="NoV/Hu/GII.4_Sydney/VIR726F/BRAZIL/2015" FT /isolate="VIR726F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="08-Jun-2015" FT /note="genotype: GII.4_Sydney" FT /db_xref="taxon:122929" FT CDS 64..>351 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K3A4" FT /protein_id="ATN44787.1" FT /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQQ FT NVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMY" XX SQ Sequence 351 BP; 83 A; 92 C; 95 G; 81 T; 0 other; ggatgagatt ctcagatctg agcacgtggg agggcgatcg caatctggct cccagttttg 60 tgaatgaaga tggcgtcgag tgacgccaac ccatctgatg ggtccgcagc caacctcgtc 120 ccagaggtca acaatgaggt tatggctctg gagcccgttg ttggtgccgc cattgcggca 180 cctgtagcgg gccagcaaaa tgtaattgac ccctggatta gaaataattt tgtacaagcc 240 cctggtggag agtttacagt atcccctaga aacgctccag gtgaaatact atggagcgcg 300 cccttgggcc ctgatctaaa tccctaccta tcccatctgg ccagaatgta c 351 // ID MG023242; SV 1; linear; genomic RNA; STD; VRL; 351 BP. XX AC MG023242; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GII strain NoV/Hu/GII.4_Sydney/VIR727F/BRAZIL/2015 capsid protein DE VP1 gene, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; aac8abb237629264979f238ef2a0f434. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..351 FT /organism="Norovirus GII" FT /host="Homo sapiens" FT /strain="NoV/Hu/GII.4_Sydney/VIR727F/BRAZIL/2015" FT /isolate="VIR727F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="12-Jun-2015" FT /note="genotype: GII.4_Sydney" FT /db_xref="taxon:122929" FT CDS 64..>351 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K372" FT /protein_id="ATN44788.1" FT /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQQ FT NVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMY" XX SQ Sequence 351 BP; 83 A; 93 C; 95 G; 80 T; 0 other; ggatgagatt ctcagatctg agcacgtggg agggcgatcg caatctggct cccagttttg 60 tgaatgaaga tggcgtcgag tgacgccaac ccatctgatg ggtccgcagc caacctcgtc 120 ccagaggtca acaatgaggt tatggctctg gagcccgttg ttggtgccgc cattgcggca 180 cctgtagcgg gccagcaaaa tgtaattgac ccctggatta gaaataattt tgtacaagcc 240 cctggtggag agtttacagt atcccctaga aacgctccag gtgaaatact atggagcgcg 300 cccttgggcc ctgacctaaa tccctaccta tcccatctgg ccagaatgta c 351 // ID MG023243; SV 1; linear; genomic RNA; STD; VRL; 351 BP. XX AC MG023243; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GII strain NoV/Hu/GII.4_Sydney/VIR731F/BRAZIL/2015 capsid protein DE VP1 gene, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-351 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; efe2c7465c15573e4c929a079ef42d18. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..351 FT /organism="Norovirus GII" FT /host="Homo sapiens" FT /strain="NoV/Hu/GII.4_Sydney/VIR731F/BRAZIL/2015" FT /isolate="VIR731F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="24-Jun-2015" FT /note="genotype: GII.4_Sydney" FT /db_xref="taxon:122929" FT CDS 64..>351 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K390" FT /protein_id="ATN44789.1" FT /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQQ FT NVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMY" XX SQ Sequence 351 BP; 83 A; 92 C; 95 G; 81 T; 0 other; ggatgagatt ctcagatctg agcacgtggg agggcgatcg caatctggct cccagttttg 60 tgaatgaaga tggcgtcgag tgacgccaac ccatctgatg ggtccgcagc caacctcgtc 120 ccagaggtca acaatgaggt tatggctctg gagcccgttg ttggtgccgc cattgcggca 180 cctgtagcgg gccaacaaaa tgtaattgac ccctggatta gaaataattt tgtacaagcc 240 cctggcggag agtttacagt atcccctaga aacgctccag gtgaaatact atggagcgcg 300 cccttgggcc ctgatctgaa tccctaccta tcccatttgg ccagaatgta c 351 // ID MG023244; SV 1; linear; genomic RNA; STD; VRL; 348 BP. XX AC MG023244; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GI strain NoV/Hu/GI.2/VIR725F/BRAZIL/2015 capsid protein VP1 DE gene, partial cds. XX KW . XX OS Norovirus GI OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-348 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-348 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; cbe1625fde9b1d6105424df931b58753. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..348 FT /organism="Norovirus GI" FT /host="Homo sapiens" FT /strain="NoV/Hu/GI.2/VIR725F/BRAZIL/2015" FT /isolate="VIR725F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="05-Jun-2015" FT /note="genotype: GI.2" FT /db_xref="taxon:122928" FT CDS 52..>348 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K369" FT /protein_id="ATN44790.1" FT /translation="MMMASKDAPQSADGASGAGQLVPEVNTADPLPMEPVAGPTTAIAT FT AGQVNMIDPWIVNNFVQSPQGEFTISPNNTPGDILFDLQLGPHLNPFLSHLSQM" XX SQ Sequence 348 BP; 87 A; 86 C; 78 G; 97 T; 0 other; atgaccttgg tttgtggaca ggagatcgca atctcctgcc tgaatttgta aatgatgatg 60 gcgtctaagg acgcccctca aagcgctgat ggcgcaagcg gcgcaggtca actggtgccg 120 gaggttaata cagctgaccc cttacccatg gaacctgtgg ctgggccaac aacagccata 180 gccactgctg ggcaagttaa tatgattgat ccctggattg ttaataattt tgtccagtca 240 cctcaaggtg agttcacaat ctctcctaac aatacccccg gtgatatttt gtttgattta 300 caattaggtc cacatttaaa ccctttcttg tcacatttgt cccaaatg 348 // ID MG023245; SV 1; linear; genomic RNA; STD; VRL; 168 BP. XX AC MG023245; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GII strain NoV/Hu/GII.6/VIR661F/BRAZIL/2015 capsid protein VP1 DE gene, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-168 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-168 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; 83a8bbd8d6b4a53eb22d3f7dfb49675c. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..168 FT /organism="Norovirus GII" FT /host="Homo sapiens" FT /strain="NoV/Hu/GII.6/VIR661F/BRAZIL/2015" FT /isolate="VIR661F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="02-Mar-2015" FT /note="genotype: GII.6" FT /db_xref="taxon:122929" FT CDS <1..>168 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K398" FT /protein_id="ATN44791.1" FT /translation="VPEANIEVMALERVVGASIAAPVVGQQNIIDPGIGENFVQAHQGE FT FTVSPRNLPGE" XX SQ Sequence 168 BP; 43 A; 34 C; 47 G; 44 T; 0 other; gtaccagagg ccaacattga ggttatggca cttgaacggg tggtgggagc ctcaattgca 60 gctcctgtcg tcggtcaaca aaatataatt gaccccggga ttggagaaaa ttttgttcag 120 gctcatcagg gtgagtttac tgtttcacca agaaacttgc ctggtgag 168 // ID MG023246; SV 1; linear; genomic RNA; STD; VRL; 168 BP. XX AC MG023246; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Norovirus GII strain NoV/Hu/GII.6/VIR639F/BRAZIL/2015 capsid protein VP1 DE gene, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-168 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT "Norovirus detection, quantification and genotyping in serum and stool RT samples from children hospitalized for acute gastroenteritis, in Belem, RT Para, Brazil, 2012- 2015"; RL Unpublished. XX RN [2] RP 1-168 RA Reymao T.K.A., Fumian T.M., Justino M.C.A., Hernandez J.M., Bandeira R.S., RA Lucena M.S.S., Teixeira D.M., Abreu E., da Silva L.D., Linhares A.C., RA Gabbay Y.B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Virology, Evandro Chagas Institute, BR316, ANANINDEUA, PARA 67030000, RL Brasil XX DR MD5; 759fb2288e76cdb5cd15d7395736d9eb. DR EuropePMC; PMC6028094; 29965979. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..168 FT /organism="Norovirus GII" FT /host="Homo sapiens" FT /strain="NoV/Hu/GII.6/VIR639F/BRAZIL/2015" FT /isolate="VIR639F" FT /mol_type="genomic RNA" FT /country="Brazil:Para" FT /isolation_source="human stool" FT /collection_date="12-Jan-2015" FT /note="genotype: GII.6" FT /db_xref="taxon:122929" FT CDS <1..>168 FT /codon_start=1 FT /product="capsid protein VP1" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A3G1K388" FT /protein_id="ATN44792.1" FT /translation="VPEANNEVMALEPVVGASIAAPVVGQQNIIDPWIGENFVQASQGE FT FSVSPRNLPGE" XX SQ Sequence 168 BP; 45 A; 34 C; 46 G; 43 T; 0 other; gtaccagagg ccaacaatga ggttatggca cttgaaccgg tggtgggagc ctcaattgca 60 gctcctgtcg tcggtcaaca aaatataatt gacccctgga ttggagaaaa ttttgttcag 120 gcatcacagg gtgagtttag tgtttcacca agaaatctgc ctggtgag 168 // ID MG025518; SV 1; circular; genomic DNA; STD; VRL; 4907 BP. XX AC MG025518; XX DT 16-FEB-2018 (Rel. 135, Created) DT 16-FEB-2018 (Rel. 135, Last updated, Version 1) XX DE Gokushovirus MK-2017 isolate AA_01, complete genome. XX KW . XX OS Gokushovirus MK-2017 OC Viruses; Microviridae; Gokushovirinae; unclassified Gokushovirinae. XX RN [1] RP 1-4907 RA Kluge M., Borges L.G.A., Franco A.C., Giongo A.; RT "Fur seal gut phageome"; RL Unpublished. XX RN [2] RP 1-4907 RA Kluge M., Borges L.G.A., Franco A.C., Giongo A.; RT ; RL Submitted (01-OCT-2017) to the INSDC. RL Institute of Petroleum and Natural Resources - LAGEB, PUCRS, 6681 Ipiranga RL Avenue, Building 96J, Porto Alegre, RS 90619-900, Brazil XX DR MD5; cd0544ba1c352cbd98e4479349e9eb27. XX CC ##Assembly-Data-START## CC Assembly Method :: Metavelvet v. v1.2.01; SPAdes v. 3.5.0; CC MIRA v. 4.0.2; Geneious v. 8.1.9 CC Coverage :: 58.87 CC Sequencing Technology :: Illumina; IonTorrent CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..4907 FT /organism="Gokushovirus MK-2017" FT /isolate="AA_01" FT /mol_type="genomic DNA" FT /country="Brazil" FT /isolation_source="fur seal feces" FT /collection_date="2012/2013" FT /db_xref="taxon:2073143" FT CDS 1..117 FT /codon_start=1 FT /transl_table=11 FT /product="hypothetical protein" FT /db_xref="UniProtKB/TrEMBL:A0A2L0WUZ1" FT /protein_id="AVA31639.1" FT /translation="MSRRSMPKRRDSKVFRRTAVRSKKININPTIYRGGIRL" FT CDS 148..402 FT /codon_start=1 FT /transl_table=11 FT /product="VP5" FT /db_xref="UniProtKB/TrEMBL:A0A2L0WUY7" FT /protein_id="AVA31640.1" FT /translation="MIQNVYSVRDVKTGFGPLMILQNDAVALRSFEVSCRQSDSLMHWC FT AADYSLFCIGSFDDESGQLNPLDVPRHIADASAVKDGEE" FT CDS 402..956 FT /codon_start=1 FT /transl_table=11 FT /product="VP3" FT /db_xref="UniProtKB/TrEMBL:A0A2L0WUY2" FT /protein_id="AVA31641.1" FT /translation="MSKENFDALVTRWNTMYDPHERVFSNVGSPDKTLYQAKVDSNGTL FT DLVENGTESLYDYIQSFKDSCDINLIIQRYASGDVDVLSKRQGAYIDSVGLPTSYAEML FT DTVIAGREVFDSLPVEIKARFDYSFERWMSTMDNWSEFTDLMGVNSDPVSGSGEQPPVA FT DAGHADNNQSPSPEGVIANEH" FT CDS 946..2640 FT /codon_start=1 FT /transl_table=11 FT /product="VP1" FT /db_xref="GOA:A0A2L0WUW9" FT /db_xref="InterPro:IPR003514" FT /db_xref="InterPro:IPR016184" FT /db_xref="InterPro:IPR037002" FT /db_xref="UniProtKB/TrEMBL:A0A2L0WUW9" FT /protein_id="AVA31642.1" FT /translation="MSINANSRFAMNPTSIDISRSRFPINYSHKTTFNVGDLIPFYCQE FT VLPGDTFDVETSRVVRMQSLLTPVMDNLYLDMYYFFVPSRIVWSHWKQFMGENTESAWL FT PSTEYEVPQITAPSDVGWKTGSIADHFGIPTGVKSLSVNALPFRAYALICNEWFRDENL FT SDPLNIPVDDATVAGVNSSNYITDVAKGGMPFKACKYHDYFTSCLPAPQKGPDVLIPVA FT EAGTFDIVSNGKTPYYVRDGKLTQFPDGKSVGLYGEQYSGGAYRHFAFDFGDLETGGRV FT DLGFPTADQLATVPGATTGLVAAGSGASQAATINQLRLAFQIQKFYEKQARGGSRYIET FT LKAMFGVTSPDSRLQRPEYLGGNRIPVNINQVLQQSATEANGTPQGNPVGQSLTTDTHH FT DFKKSFTEHGFVIGMMVARYDHTYQQGLERFWSRKTKFDYYWPVFANIGEQAVLNKEIY FT AQGTDKDNEVFGYQEAWADYRYKPSQVTGEMRSSVENSLDVWHLADDYKQLPSLSDSWI FT REDSTNVNRVLAVSEQKANQLFCDIYVRNLATRPMPLYSVPGLIDHH" FT misc_feature 2723..3459 FT /note="similar to VP2" FT CDS 3956..4906 FT /codon_start=1 FT /transl_table=11 FT /product="VP4" FT /db_xref="UniProtKB/TrEMBL:A0A2L0WUX5" FT /protein_id="AVA31643.1" FT /translation="MRNGNWTDIYTDFISAAAERVCQNWVEIPCGKCLGCRLEYSRQWA FT NRCLLENEYHESSYFVTLTYDQEHVPETWYPDPSTGEALRALTLRKRDFQLFMKRLRKH FT TGQEIRYFAAGEYGTKTFRPHYHAIIFGLKLDDLTIVKKSPLGYPYYTSQTILRAWSVQ FT QQGGSYAPLGQIIVAPVTWETCAYTARYTAKKNGTQGSEYFASMNLEPPFVLMSRRPGI FT GNQWYVEHADLFDHAFINISTETGGKKFRPPKYFYNLLEKDNPKFAECLKDIRKANMDK FT YHKDKLLNTDLDYLDLLEVEERALQNRTKSLERRL" XX SQ Sequence 4907 BP; 1236 A; 895 C; 1133 G; 1643 T; 0 other; atgtctcgta gatctatgcc taaacgtcga gattctaagg tttttcgtcg tacagctgtt 60 cgttccaaaa aaattaatat taatcctact atttatcgcg gcggtatccg tctgtaaaaa 120 gaatttttga aagtgagttg ttttcctatg attcaaaatg tttattctgt tcgtgatgta 180 aagactggtt ttggtcctct gatgattctt cagaacgatg ccgttgctct tcgttctttt 240 gaagtctctt gtcgtcaatc tgattctctg atgcattggt gtgctgctga ttattctttg 300 ttctgcattg gttcttttga tgatgaaagc ggacagttaa atccgttgga tgttcctcgt 360 catattgccg atgcttctgc tgtgaaagat ggtgaggagt aatgtcaaag gaaaattttg 420 acgctttagt tactcgatgg aatactatgt atgaccctca tgagagagtc ttttcgaatg 480 taggttctcc tgataaaact ctttatcagg ctaaggttga tagtaatggt actcttgatt 540 tggtagaaaa tggtactgag tctttgtatg attacattca gtcttttaaa gattcctgtg 600 atattaactt gattattcag cgttatgcaa gtggtgatgt tgacgttctt tctaagcgcc 660 aaggtgctta cattgattct gttggtttgc ctacttctta tgctgaaatg ctggatactg 720 ttattgctgg tcgtgaagtt tttgatagtt tgcctgttga gataaaggct agatttgatt 780 atagctttga acgttggatg tctactatgg acaattggag tgagttcact gatctcatgg 840 gcgtgaactc tgatccagtt tctggatcag gtgagcagcc ccccgtggcc gatgcaggcc 900 acgccgataa caaccagagc cctagtccgg aaggagtgat tgctaatgag cattaatgca 960 aacagtcgtt ttgctatgaa tcctacttct attgatattt ctcgaagcag gttccctatt 1020 aattattcac ataaaaccac ctttaatgta ggtgatttga ttccctttta ttgtcaggaa 1080 gttttgccag gtgatacatt tgatgttgaa acgtctcgag ttgttcgtat gcagtctctt 1140 ttgactcctg ttatggataa cttgtatctg gatatgtatt acttctttgt tccttctcgt 1200 attgtttggt cgcattggaa gcagtttatg ggtgaaaata ctgagtctgc gtggttgcct 1260 agtactgagt acgaagtacc tcaaattact gctccttctg atgttggttg gaagactggt 1320 agtattgctg atcactttgg aattcctact ggtgttaaat ctttgtctgt aaatgcgctt 1380 cccttccgtg cttatgcatt gatttgtaat gagtggttta gagatgaaaa tttatctgac 1440 cctctgaata ttcctgttga tgatgctact gtcgctggag ttaattctag caattacatt 1500 actgatgttg ctaaaggcgg catgcctttt aaagcttgta agtatcatga ttatttcaca 1560 agttgcttgc cagcccctca aaaaggtcct gatgttttaa ttcctgttgc ggaagctggt 1620 acctttgata ttgtttctaa tggaaagact ccttattatg ttcgtgatgg taaactgact 1680 cagttccctg acggtaaatc tgttggatta tatggtgaac aatacagtgg tggtgcttat 1740 cgtcactttg cttttgattt tggtgatttg gaaactggtg gtcgtgttga tttaggattc 1800 cccacagccg atcaacttgc tactgtgcca ggcgctacta caggtcttgt tgctgctggt 1860 tctggtgcat cccaagccgc tacgattaat cagcttcgtc ttgctttcca gattcagaag 1920 ttctatgaaa agcaagctcg tggtggttct cgttatattg aaactctgaa agctatgttt 1980 ggcgttactt cgccggatag tcgtttgcag cgtcctgaat atcttggtgg taacaggatt 2040 cctgttaata ttaatcaggt tttacagcaa tctgctacgg aagctaatgg tactccgcaa 2100 ggtaaccccg ttggccagtc tcttacaact gatactcatc atgatttcaa gaaatccttt 2160 acggagcatg gttttgttat tggtatgatg gttgctagat atgatcatac ttatcaacag 2220 ggtcttgaac gtttttggtc tcgcaagact aaatttgact attattggcc tgtttttgcg 2280 aatattggtg agcaagcagt cttgaataaa gagatttatg cacaaggaac agataaagat 2340 aacgaagttt ttggttatca ggaagcttgg gctgattatc ggtataaacc ttctcaggtt 2400 actggtgaga tgaggtctag tgtcgagaat tctcttgatg tttggcattt ggctgatgac 2460 tataagcagt taccttctct ttctgattct tggattagag aagattccac taatgttaat 2520 cgtgttcttg ctgttagtga gcagaaagct aatcagctat tttgtgatat ttatgtgcgt 2580 aatttggcta ctcgcccgat gccactttat agtgtacccg gtctaattga ccatcattaa 2640 caagaatctt gttagttcac aagaaacttg taatttttac tagagggcct ttgggccctc 2700 ttttttttta ggagtgattt tgatgttaac atctgctaat ggtttgagtc ctgttgcacc 2760 tgctgcagca acttctgctt ctgttgctgg taattatatg gcacgtaacg gtgctggtac 2820 tgctgttact agcgcaatga aagagaataa tgctactggt gttcttgatc aggtggctaa 2880 taatgccacc ggcaataatg agtggtctgc ttctcaggca gacagagcta atgcttggca 2940 agctgctatg tggcaagcgc aagccgattt taatgcggca gaagccgcta agaatcgtga 3000 ttggcaagag tatatgagtt ctactgcgca tcagcgccag atggctgatc ttaaagccgc 3060 tggccttaat cctgttcttg ccgctatgaa tggtaatggt gcttccgttg gtagcggtgc 3120 taccgcttct gtaggttctg ctggttctgc tcataaaggt gatacggata ctagcgctaa 3180 tcaggcgctt gttggtatcc ttgcttctat gcttaacgct caaacaaccc tcgagagcca 3240 gcgtataaac gcgcagaaca acctcgcggt cgcagacaag tataatgcca cgtctgagct 3300 cgtagcccgt ttaacgggcg aatatggcct aaagagcgca ggtattcatg ctggcgctac 3360 tagatacgct gctgataaga gttatgcggg cactttaggg tccgcttcta tccattctgc 3420 ggcatctcga tatggtgcag atcaatctgc tgctgcttct agatatcaat cagataagag 3480 ctatgctagc gctaagtata gctctaataa gcattatcag ggtacgaagt atagttctga 3540 tactgcttac aattcctctt tcgataaccg taattcgtgg tccggtttag cggacaaagt 3600 cgtgcgatac gccaccggag gaggcggaaa gcatcgatga cgaataaggt cggccccatt 3660 acctacttga tgtaatgggg ccgactgaca ccatggcagt atcgattacg tattgacagc 3720 gtggcagtta tgtgctaagt tctaatagaa aggagttgtt tctggctatg gcagatcagt 3780 tttatgtctt aatgatgtta actgtaatgt gtactactac tttatgtctt tataagattt 3840 tgaaggagtg gttcaagtga gttgtaaaca tccgctgaaa gctttccaga ttggagtacg 3900 tgataacggt aaagccgaat tgaggattcg tccgtaccgt gttgatcatt tggaaatgcg 3960 aaatggtaac tggactgata tttatactga ctttatctct gctgcggcag agcgcgtttg 4020 tcagaattgg gttgaaatcc cttgtggtaa atgtcttgga tgtcgattgg aatattcgcg 4080 ccaatgggcg aataggtgtt tactcgaaaa tgaatatcat gagagtagtt attttgttac 4140 tcttacttac gatcaggagc atgtcccaga gacttggtac ccagaccctt ctactggtga 4200 agcgttgaga gctttaacct taagaaaacg tgattttcag ttatttatga agcgtcttcg 4260 caaacatact ggacaggaaa ttagatattt tgctgctggt gaatatggta ctaaaacatt 4320 tcgtcctcat tatcatgcta taatttttgg tttaaaattg gatgatttga ctattgtaaa 4380 aaagtctcct ttaggttatc cttattatac aagccagacg attcttcgtg cttggtcagt 4440 ccaacagcaa ggggggagct atgctcccct cggacagatt atagttgctc ccgttacatg 4500 ggaaacctgt gcttatactg ctagatatac tgcaaagaaa aatggaacac aaggttctga 4560 gtatttcgct tctatgaatt tagaacctcc atttgttttg atgtctcgtc gtcctggcat 4620 tggtaatcaa tggtacgttg agcatgctga tttgtttgat catgctttta ttaacatttc 4680 cacggaaact ggtggaaaaa agtttcgtcc tccaaagtac ttttataatt tattggaaaa 4740 agataatcca aaatttgcag agtgcttaaa agatatacgc aaagctaata tggacaaata 4800 tcacaaggac aaattgctta atactgattt agactattta gatcttttgg aggtagagga 4860 gcgagctttg caaaatagaa ctaagtcgtt agaaaggagg ttgtaat 4907 // ID MG025597; SV 1; linear; genomic RNA; STD; VRL; 231 BP. XX AC MG025597; XX DT 20-NOV-2017 (Rel. 134, Created) DT 20-NOV-2017 (Rel. 134, Last updated, Version 1) XX DE Hepatovirus A isolate HAV_Mars_short_2315977 polyprotein gene, partial cds. XX KW . XX OS Hepatovirus A OC Viruses; Riboviria; Picornavirales; Picornaviridae; Hepatovirus. XX RN [1] RP 1-231 RA Colson P., Menard A.; RT "Foretold hepatitis A outbreak among MSM in Southeastern France"; RL Unpublished. XX RN [2] RP 1-231 RA Colson P., Menard A.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL URMITE UM63 CNRS 7278 IRD 198 INSERM U1095, IHU Mediterranee Infection, RL Aix-Marseille Univ., 27 boulevard Jean Moulin, Marseille 13385, France XX DR MD5; 6a4207e46069b051615d6724d4e1e0ff. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..231 FT /organism="Hepatovirus A" FT /host="Homo sapiens" FT /isolate="HAV_Mars_short_2315977" FT /mol_type="genomic RNA" FT /country="France" FT /collection_date="06-Mar-2017" FT /note="genotype: I" FT /db_xref="taxon:12092" FT CDS <1..>231 FT /codon_start=3 FT /product="polyprotein" FT /db_xref="UniProtKB/TrEMBL:A0A2D3BHQ2" FT /protein_id="ATT59246.1" FT /translation="HAMDVTTQVGDDSGGFSTTVSTEQNVPDPQVGITTMRDLKGKANR FT GKMDVSGVQAPVGAITTIEDPVLAKKVPETF" XX SQ Sequence 231 BP; 76 A; 41 C; 59 G; 55 T; 0 other; atcatgctat ggatgttacc acacaggttg gagatgattc aggaggtttt tcaacaacag 60 tttctacaga gcagaatgtt cctgatcccc aggtaggcat aacaactatg agggacttaa 120 aagggaaagc caatagggga aagatggatg tttcaggagt gcaagcacct gtgggagcta 180 tcacaacaat tgaggatcca gttttagcaa agaaagtacc tgagacattt c 231 // ID MG025598; SV 1; linear; genomic RNA; STD; VRL; 194 BP. XX AC MG025598; XX DT 20-NOV-2017 (Rel. 134, Created) DT 20-NOV-2017 (Rel. 134, Last updated, Version 1) XX DE Hepatovirus A isolate HAV_Mars_short_2489814 polyprotein gene, partial cds. XX KW . XX OS Hepatovirus A OC Viruses; Riboviria; Picornavirales; Picornaviridae; Hepatovirus. XX RN [1] RP 1-194 RA Colson P., Menard A.; RT "Foretold hepatitis A outbreak among MSM in Southeastern France"; RL Unpublished. XX RN [2] RP 1-194 RA Colson P., Menard A.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL URMITE UM63 CNRS 7278 IRD 198 INSERM U1095, IHU Mediterranee Infection, RL Aix-Marseille Univ., 27 boulevard Jean Moulin, Marseille 13385, France XX DR MD5; f55460f6f520cec4eff4f5b7ab3c5031. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..194 FT /organism="Hepatovirus A" FT /host="Homo sapiens" FT /isolate="HAV_Mars_short_2489814" FT /mol_type="genomic RNA" FT /country="France" FT /collection_date="28-Apr-2017" FT /note="genotype: I" FT /db_xref="taxon:12092" FT CDS <1..>194 FT /codon_start=3 FT /product="polyprotein" FT /db_xref="UniProtKB/TrEMBL:A0A2D3BHR1" FT /protein_id="ATT59247.1" FT /translation="HAMDVTTQVGDDSGGFSTTVSTEQNVPDPQVGITTMRDLKGKANK FT GKMDVSGVQAPVGAITTIE" XX SQ Sequence 194 BP; 63 A; 34 C; 51 G; 46 T; 0 other; atcatgctat ggatgttacc acacaggttg gagatgattc agggggtttt tcaacaacag 60 tttctacaga acagaatgtt cctgatcccc aagttggcat aacaactatg agggacctaa 120 aagggaaagc caataagggg aagatggatg tttcaggagt gcaagcacct gtgggagcta 180 ttacaacaat tgag 194 // ID MG025599; SV 1; linear; genomic RNA; STD; VRL; 231 BP. XX AC MG025599; XX DT 20-NOV-2017 (Rel. 134, Created) DT 20-NOV-2017 (Rel. 134, Last updated, Version 1) XX DE Hepatovirus A isolate HAV_Mars_short_2427851 polyprotein gene, partial cds. XX KW . XX OS Hepatovirus A OC Viruses; Riboviria; Picornavirales; Picornaviridae; Hepatovirus. XX RN [1] RP 1-231 RA Colson P., Menard A.; RT "Foretold hepatitis A outbreak among MSM in Southeastern France"; RL Unpublished. XX RN [2] RP 1-231 RA Colson P., Menard A.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL URMITE UM63 CNRS 7278 IRD 198 INSERM U1095, IHU Mediterranee Infection, RL Aix-Marseille Univ., 27 boulevard Jean Moulin, Marseille 13385, France XX DR MD5; ab57744c10ccc32b7ef64813383e5e86. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..231 FT /organism="Hepatovirus A" FT /host="Homo sapiens" FT /isolate="HAV_Mars_short_2427851" FT /mol_type="genomic RNA" FT /country="France" FT /collection_date="09-Apr-2017" FT /note="genotype: I" FT /db_xref="taxon:12092" FT CDS <1..>231 FT /codon_start=3 FT /product="polyprotein" FT /db_xref="UniProtKB/TrEMBL:A0A2D3BHQ4" FT /protein_id="ATT59248.1" FT /translation="HAMDVTTQVGDDSGGFSTTVSTEQNVPDPQVGITTMRDLKGKANR FT GKMDVSGVQAPVGAITTIEDPVLAKKVPETF" XX SQ Sequence 231 BP; 73 A; 44 C; 58 G; 56 T; 0 other; atcatgctat ggatgttacc acacaggttg gtgatgattc agggggtttc tcaacaacag 60 tttctacaga gcagaatgtt cctgatcccc aagttggcat tacaaccatg agagacttaa 120 aagggaaagc caatagggga aagatggatg tttcaggagt ccaagcacct gtgggagcta 180 tcacaacaat tgaggatcca gttttagcga agaaagtacc tgagacattt c 231 // ID MG025600; SV 1; linear; genomic RNA; STD; VRL; 231 BP. XX AC MG025600; XX DT 20-NOV-2017 (Rel. 134, Created) DT 20-NOV-2017 (Rel. 134, Last updated, Version 1) XX DE Hepatovirus A isolate HAV_Mars_short_2656489 polyprotein gene, partial cds. XX KW . XX OS Hepatovirus A OC Viruses; Riboviria; Picornavirales; Picornaviridae; Hepatovirus. XX RN [1] RP 1-231 RA Colson P., Menard A.; RT "Foretold hepatitis A outbreak among MSM in Southeastern France"; RL Unpublished. XX RN [2] RP 1-231 RA Colson P., Menard A.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL URMITE UM63 CNRS 7278 IRD 198 INSERM U1095, IHU Mediterranee Infection, RL Aix-Marseille Univ., 27 boulevard Jean Moulin, Marseille 13385, France XX DR MD5; 2cbaaae0e4b60ce9d685720a21f8e3ca. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..231 FT /organism="Hepatovirus A" FT /host="Homo sapiens" FT /isolate="HAV_Mars_short_2656489" FT /mol_type="genomic RNA" FT /country="France" FT /collection_date="20-Jun-2017" FT /note="genotype: I" FT /db_xref="taxon:12092" FT CDS <1..>231 FT /codon_start=3 FT /product="polyprotein" FT /db_xref="UniProtKB/TrEMBL:A0A2D3BHQ5" FT /protein_id="ATT59249.1" FT /translation="HAMDVTTQVGDDSGGFSTTVSTEQNVPDPQVGITTMRDLKGKANR FT GKMDVSGVQAPVGAITTIEDPVLAKKVPETF" XX SQ Sequence 231 BP; 75 A; 41 C; 60 G; 55 T; 0 other; atcatgctat ggatgttacc acacaggttg gagatgattc aggaggtttt tcaacaacag 60 tttctacaga gcagaatgtt cctgatcccc aggtaggcat aacaactatg agggacttaa 120 aagggaaagc caatagggga aagatggatg tttcaggagt gcaagcacct gtgggagcta 180 tcacaacaat tgaggatcca gttttagcaa agaaagtgcc tgagacattt c 231 // ID MG025601; SV 1; linear; genomic RNA; STD; VRL; 231 BP. XX AC MG025601; XX DT 20-NOV-2017 (Rel. 134, Created) DT 20-NOV-2017 (Rel. 134, Last updated, Version 1) XX DE Hepatovirus A isolate HAV_Mars_short_2670995 polyprotein gene, partial cds. XX KW . XX OS Hepatovirus A OC Viruses; Riboviria; Picornavirales; Picornaviridae; Hepatovirus. XX RN [1] RP 1-231 RA Colson P., Menard A.; RT "Foretold hepatitis A outbreak among MSM in Southeastern France"; RL Unpublished. XX RN [2] RP 1-231 RA Colson P., Menard A.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL URMITE UM63 CNRS 7278 IRD 198 INSERM U1095, IHU Mediterranee Infection, RL Aix-Marseille Univ., 27 boulevard Jean Moulin, Marseille 13385, France XX DR MD5; ab57744c10ccc32b7ef64813383e5e86. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..231 FT /organism="Hepatovirus A" FT /host="Homo sapiens" FT /isolate="HAV_Mars_short_2670995" FT /mol_type="genomic RNA" FT /country="France" FT /collection_date="26-Jun-2017" FT /note="genotype: I" FT /db_xref="taxon:12092" FT CDS <1..>231 FT /codon_start=3 FT /product="polyprotein" FT /db_xref="UniProtKB/TrEMBL:A0A2D3BHQ6" FT /protein_id="ATT59250.1" FT /translation="HAMDVTTQVGDDSGGFSTTVSTEQNVPDPQVGITTMRDLKGKANR FT GKMDVSGVQAPVGAITTIEDPVLAKKVPETF" XX SQ Sequence 231 BP; 73 A; 44 C; 58 G; 56 T; 0 other; atcatgctat ggatgttacc acacaggttg gtgatgattc agggggtttc tcaacaacag 60 tttctacaga gcagaatgtt cctgatcccc aagttggcat tacaaccatg agagacttaa 120 aagggaaagc caatagggga aagatggatg tttcaggagt ccaagcacct gtgggagcta 180 tcacaacaat tgaggatcca gttttagcga agaaagtacc tgagacattt c 231 // ID MG025602; SV 1; linear; genomic RNA; STD; VRL; 231 BP. XX AC MG025602; XX DT 20-NOV-2017 (Rel. 134, Created) DT 20-NOV-2017 (Rel. 134, Last updated, Version 1) XX DE Hepatovirus A isolate HAV_Mars_short_2671596 polyprotein gene, partial cds. XX KW . XX OS Hepatovirus A OC Viruses; Riboviria; Picornavirales; Picornaviridae; Hepatovirus. XX RN [1] RP 1-231 RA Colson P., Menard A.; RT "Foretold hepatitis A outbreak among MSM in Southeastern France"; RL Unpublished. XX RN [2] RP 1-231 RA Colson P., Menard A.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL URMITE UM63 CNRS 7278 IRD 198 INSERM U1095, IHU Mediterranee Infection, RL Aix-Marseille Univ., 27 boulevard Jean Moulin, Marseille 13385, France XX DR MD5; ab57744c10ccc32b7ef64813383e5e86. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..231 FT /organism="Hepatovirus A" FT /host="Homo sapiens" FT /isolate="HAV_Mars_short_2671596" FT /mol_type="genomic RNA" FT /country="France" FT /collection_date="26-Jun-2017" FT /note="genotype: I" FT /db_xref="taxon:12092" FT CDS <1..>231 FT /codon_start=3 FT /product="polyprotein" FT /db_xref="UniProtKB/TrEMBL:A0A2D3BHQ9" FT /protein_id="ATT59251.1" FT /translation="HAMDVTTQVGDDSGGFSTTVSTEQNVPDPQVGITTMRDLKGKANR FT GKMDVSGVQAPVGAITTIEDPVLAKKVPETF" XX SQ Sequence 231 BP; 73 A; 44 C; 58 G; 56 T; 0 other; atcatgctat ggatgttacc acacaggttg gtgatgattc agggggtttc tcaacaacag 60 tttctacaga gcagaatgtt cctgatcccc aagttggcat tacaaccatg agagacttaa 120 aagggaaagc caatagggga aagatggatg tttcaggagt ccaagcacct gtgggagcta 180 tcacaacaat tgaggatcca gttttagcga agaaagtacc tgagacattt c 231 // ID MG025603; SV 1; linear; genomic RNA; STD; VRL; 231 BP. XX AC MG025603; XX DT 20-NOV-2017 (Rel. 134, Created) DT 20-NOV-2017 (Rel. 134, Last updated, Version 1) XX DE Hepatovirus A isolate HAV_Mars_short_2674276 polyprotein gene, partial cds. XX KW . XX OS Hepatovirus A OC Viruses; Riboviria; Picornavirales; Picornaviridae; Hepatovirus. XX RN [1] RP 1-231 RA Colson P., Menard A.; RT "Foretold hepatitis A outbreak among MSM in Southeastern France"; RL Unpublished. XX RN [2] RP 1-231 RA Colson P., Menard A.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL URMITE UM63 CNRS 7278 IRD 198 INSERM U1095, IHU Mediterranee Infection, RL Aix-Marseille Univ., 27 boulevard Jean Moulin, Marseille 13385, France XX DR MD5; 2cbaaae0e4b60ce9d685720a21f8e3ca. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..231 FT /organism="Hepatovirus A" FT /host="Homo sapiens" FT /isolate="HAV_Mars_short_2674276" FT /mol_type="genomic RNA" FT /country="France" FT /collection_date="26-Jun-2017" FT /note="genotype: I" FT /db_xref="taxon:12092" FT CDS <1..>231 FT /codon_start=3 FT /product="polyprotein" FT /db_xref="UniProtKB/TrEMBL:A0A2D3BHR2" FT /protein_id="ATT59252.1" FT /translation="HAMDVTTQVGDDSGGFSTTVSTEQNVPDPQVGITTMRDLKGKANR FT GKMDVSGVQAPVGAITTIEDPVLAKKVPETF" XX SQ Sequence 231 BP; 75 A; 41 C; 60 G; 55 T; 0 other; atcatgctat ggatgttacc acacaggttg gagatgattc aggaggtttt tcaacaacag 60 tttctacaga gcagaatgtt cctgatcccc aggtaggcat aacaactatg agggacttaa 120 aagggaaagc caatagggga aagatggatg tttcaggagt gcaagcacct gtgggagcta 180 tcacaacaat tgaggatcca gttttagcaa agaaagtgcc tgagacattt c 231 // ID MG025604; SV 1; linear; genomic RNA; STD; VRL; 231 BP. XX AC MG025604; XX DT 20-NOV-2017 (Rel. 134, Created) DT 20-NOV-2017 (Rel. 134, Last updated, Version 1) XX DE Hepatovirus A isolate HAV_Mars_short_2663835 polyprotein gene, partial cds. XX KW . XX OS Hepatovirus A OC Viruses; Riboviria; Picornavirales; Picornaviridae; Hepatovirus. XX RN [1] RP 1-231 RA Colson P., Menard A.; RT "Foretold hepatitis A outbreak among MSM in Southeastern France"; RL Unpublished. XX RN [2] RP 1-231 RA Colson P., Menard A.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL URMITE UM63 CNRS 7278 IRD 198 INSERM U1095, IHU Mediterranee Infection, RL Aix-Marseille Univ., 27 boulevard Jean Moulin, Marseille 13385, France XX DR MD5; 2cbaaae0e4b60ce9d685720a21f8e3ca. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..231 FT /organism="Hepatovirus A" FT /host="Homo sapiens" FT /isolate="HAV_Mars_short_2663835" FT /mol_type="genomic RNA" FT /country="France" FT /collection_date="23-Jun-2017" FT /note="genotype: I" FT /db_xref="taxon:12092" FT CDS <1..>231 FT /codon_start=3 FT /product="polyprotein" FT /db_xref="UniProtKB/TrEMBL:A0A2D3BHQ8" FT /protein_id="ATT59253.1" FT /translation="HAMDVTTQVGDDSGGFSTTVSTEQNVPDPQVGITTMRDLKGKANR FT GKMDVSGVQAPVGAITTIEDPVLAKKVPETF" XX SQ Sequence 231 BP; 75 A; 41 C; 60 G; 55 T; 0 other; atcatgctat ggatgttacc acacaggttg gagatgattc aggaggtttt tcaacaacag 60 tttctacaga gcagaatgtt cctgatcccc aggtaggcat aacaactatg agggacttaa 120 aagggaaagc caatagggga aagatggatg tttcaggagt gcaagcacct gtgggagcta 180 tcacaacaat tgaggatcca gttttagcaa agaaagtgcc tgagacattt c 231 // ID MG025802; SV 1; linear; genomic RNA; STD; VRL; 8914 BP. XX AC MG025802; XX DT 24-OCT-2017 (Rel. 134, Created) DT 24-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Tomato spotted wilt orthotospovirus isolate TSWV-QLD2 segment L, complete DE sequence. XX KW . XX OS Tomato spotted wilt tospovirus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Tospoviridae; Orthotospovirus. XX RN [1] RP 1-8914 RA Moyle R.L., Schenk P.M.; RT "Complete nucleotide sequence of the Australian Tomato spotted wilt virus RT isolate TSWV-QLD2"; RL Unpublished. XX RN [2] RP 1-8914 RA Moyle R.L., Schenk P.M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL School of Agriculture and Food Science, University of Queensland, St Lucia, RL Brisbane, Qld 4072, Australia XX DR MD5; 3b2aa277d83ba9ed26f0ce1199733860. DR EuropePMC; PMC5701467; 29167242. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. 8.1.7 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..8914 FT /organism="Tomato spotted wilt tospovirus" FT /segment="L" FT /host="Capsicum annuum cv. warlock" FT /isolate="TSWV-QLD2" FT /mol_type="genomic RNA" FT /country="Australia" FT /collected_by="Leanne Forsyth" FT /collection_date="20-Jul-2015" FT /db_xref="taxon:1933298" FT CDS 34..8673 FT /codon_start=1 FT /product="RNA-dependent RNA polymerase" FT /db_xref="GOA:A0A291RBS1" FT /db_xref="InterPro:IPR007099" FT /db_xref="InterPro:IPR007322" FT /db_xref="InterPro:IPR015841" FT /db_xref="UniProtKB/TrEMBL:A0A291RBS1" FT /protein_id="ATL64764.1" FT /translation="MNIQKIQKLIENGTTLLLSIEDCVGSNHDLALDLHKRNSDEIPED FT VIINNNAKNYETMRELIVKITADGEGLNKGTATVDVKKLSEMVSLFEQKYLETELARHD FT IFGELISRHLRIKPKQRSEVEIEHALREYLDELNKKSCINKLSDDEFERINKEYVATNA FT TPDNYVIYKESKNSELCLIIYDWKISVDARTETKTMEKYYKNIWKSFKDIKVNGKPFLE FT DHPVFVSIVILKPIAGMPITVTSSRVLGKFEDSPSALHGERIKHARNAKLLNISYVGQI FT VGTTPTVVRNYYANTQKIKSEVRGILGDDFGSKDVFFSHWTSKYKERNTTEIAYSEDIE FT RIIDSLVTDEIPREEIIHFLFGNFCFHIETMNDQHIADKFKGYQNSCINLKIEPKTDLA FT DLKDHLIQKQQIWDSLYGKHLEKIMLRIREKKKKEKEIPDITTAFNQNAAEYEEKYPNC FT FTNDLSETKTNFSMTWSPSFEKVELSSEVDYNNAIINKFRESFKSSSRVVYNSPYSSIN FT NQTNKARDITNLVRLCLTELSCNTTKMEKQELEDEIDINTGSIKVERTKKSKEWNKLGS FT CLTRNKNEFCMKETGRENKTXYFKGLAVMNIGMXSKKRILKKEETKERISKGLEYDTSE FT RQADPNDDYSSIDMSSLTHMKKLIRHDNEDSLSWCEKIKDSLFVLHNGDIREEGKITSV FT YNNYAKNPECLYIQDSVLKSELETCKKINKLCNDLAIYHYSEDMMQFSKGLMVADRYMT FT KESFKILTTANTSMMLLAFKGDGMNTGGSGVPYIALHIVDEDMSDQFNICYTKEIYSYF FT RSGSNYIYIMRPQRLNQVRLLSLFKTPSKVPVCFAQFSKKANEMEKWLKNKDIEKVSVF FT SMTMTVKQILINIVFSSVMIGTVTKLSRMGIFDFMRYAGFLPLSDYSNIKEYIRDKFDP FT DITNVADIYFVNGIKKLLFRMEDLNLSTNAKPVVVDHENDIIGGITDLNIKCPITGSTL FT LTLEDLYNNVYLAIYMMPKSLHNHVHNLTSLLNVPAEWELKFRKELGFNILEDIYPKKA FT MFDDKDLFSINGALNVKALSDYYLGNIENVGLMRSEIENKEDFLSPCYKISTLKSSKKC FT SQSNIISTDEIIECLQDAKIQDIENWKGNNLAIIKGLMRTYNEEKSRLMEFFEDNCVNS FT LYLVEKLKEIINSGSITVGKSVTSKFIRNNHPLTVETYLKTKLYYRNNVTVLKSKKVSE FT ELYDLVKQFHNMMEIDLDSVMNLGKGTEGKKHTFLQMLEFVMSKAKNVTGSVDFLVSVF FT EKMQRTKTDREIYLMSMKVKMMLYFIEHTFKHVAQSDPSEAISISGDNKIRALSTLSLD FT TITSYNDILNKNSKKSRLAFLSADQSKWSASDLTYKYVLAIILNPVLTTGEASLMIECI FT LMYVKLKKVCIPTDIFLNLRKAQETFGQNETAIGLLTKGLTTNTYPVSMNWLQGNLNYL FT SSVYHSCAMKAYHKTLECYKDCDFQTRWIVHSDDNATSLIASGEVDKMLTDFSSLSLPE FT MLFRSIEAHFKSFCITLNPKKSYASSSEVEFISERIVNGAIIPLYCRHLANCCTESSHI FT SYFDDLMSLSIHVTMLLRKGCPNEVIPFAYGAVQVQALSIYSMLPGEVNDSIRIFNKLG FT VSLKSNEIPTNMGGWLTSPIEPLSILGPSSNDQIIYYNVIRDFLNKKSLDEVKDSVSSS FT SYLQMRFRELKEKHEKGTLEEKDKKMIFLINLFEKASVSEDSDVLTIGMKFQTMLTQII FT KLPNFINENALNKMSSYKDFSKLYPNLKKNEDLYKSTKNLKIDEDAILEEDELYEKIAS FT SLEMESVHDIMIKNPETILIAPLNDRDFLLSQLFMYTSPSKRNQLSNQSTEKLALDRVL FT RSKAKTFVDISSTVKMTYEENMEKKILEMLKFDLDSYCSFKTCVNLVIKDVNFSMLIPI FT LDSAYPCESRKRDNYNFRWFQTEKWIPVVEGSPGLVVMHAVYGSNYIENLGLKNIPLTD FT DSINVLTSTFGTGLIMEDVKSLVKGKDSFETEAFSNSNECQRLVKACNYMIAAQNRLLA FT INTCFTRKSFPFYSKFNLGRGFISNTLALLSTIYSKEESYHFVSTASYKLDKTIRTVIS FT AQQDMNLEKILDTAVYISDKLQSLFPTITREDIVLILQNVCLDSKPIWQSLEDKMKKIN FT NSTASGFTVSNVILSHNSELNTIQKQIVWMWNMGLCSHRTLDFVIRYIRRSDVRYVKTE FT EQDESGNYVSGTMYKIGIMTRSCYVQLIASDQDVAVSLRTPFEILNEREYLFDTYKESI FT EKLLQKFMFDKVNIIKSKQPQIVFLEPGDACIRMTTDNKMIVKVNATPRQIRLENVKLV FT VKIKYENVNSDVWDIIESQKSLVLRLPEVGECFSDMYKTADSENETIKTIKYRLMTSLT FT FIEAFGNLSQQIKEIVDDDIRETMDEFLMNIRDTCLEGLEICKSVEEYDSYLDGNGFND FT TVELFENLLRTHDNFENEYSPLFSEIVDKAKRYTRDLEGFKEILLMLKYSLINDASGFK FT SYRATGMHAVELTAKKHIEIGEFNLLGMIQLIKACETCHNNDSILNLASLRNVLSRTYA FT TFGRRIRLDHDLDLQNNLMEKSYDFKTLVLPEIKLSELSREILKGNGFVISGENLKIDR FT SDEEFEGLASFNVLRLDEEEMYEGLIKEMKIKRKKKGFLFPANTLLLSELIKFLIGGIK FT GTSFDIETLLRNSFRPDIFSTDRLGRLSSSVPALKVYATVYMEYKNVNCPLNEIADSLE FT GYLKLTKSKSKEHFLSGRVKKALIQLRDEQSRTKKLEVYKDIANFLSRHPLCLSEKTLY FT GRYTYSDINDYIMQTREIILSKISELDEVVETDEDDFLLSYLRGEEDAFDEDEFGEEED FT TD" XX SQ Sequence 8914 BP; 3344 A; 1288 C; 1725 G; 2553 T; 4 other; agagcaatca ggtaacaacg attttaagca aacatgaaca tccagaaaat acaaaaatta 60 atagaaaatg gaactacttt actgttgtct attgaggatt gtgtaggttc taaccatgat 120 ctagctttgg atttacataa gagaaatagt gatgagatcc cagaagatgt gattataaat 180 aataatgcaa aaaattatga gacaatgaga gagttaattg tcaaaatcac tgctgatggt 240 gaaggactaa acaaagggac ggcaactgtg gatgtcaaaa agctaagtga gatggtctct 300 ctgtttgagc aaaaatatct agaaacagag ttagcaaggc atgacatttt tggagagctg 360 atctccaggc acctgagaat aaagcccaaa caaagaagtg aagtggagat agagcatgca 420 ctaagagaat atctggatga actcaacaaa aaatcttgca ttaataagct ctctgatgat 480 gagtttgaga gaataaataa agaatatgta gcaactaatg ctacccctga taactatgtg 540 atatataaag aatcaaaaaa cagtgagctt tgtttaatca tttatgattg gaaaatatct 600 gttgatgcta ggactgaaac caaaacgatg gagaaatact acaaaaatat ctggaaatct 660 ttcaaagata taaaagtgaa tggaaagcca ttcttggaag atcatcctgt ttttgtttct 720 atagttatat tgaaacctat tgctgggatg ccaatcacag ttactagcag cagggttttg 780 gggaaatttg aagattctcc atcagcattg cacggagaga gaataaaaca tgccagaaat 840 gccaaattgc taaatatttc ttatgttggg caaatagttg gaaccacacc cacagtggtg 900 agaaactatt atgcaaacac tcaaaagatc aaatctgagg tcagaggaat cttaggtgat 960 gattttggat ctaaagatgt attttttagt cactggacca gcaagtacaa agaaagaaat 1020 actactgaaa tagcctattc tgaagatatt gaaagaataa ttgattcact tgtgacagat 1080 gaaatcccta gagaggaaat aatacatttt ttrtttggaa atttctgttt ccacatagaa 1140 acaatgaatg accagcatat tgctgacaaa tttaaagggt accaaaactc ttgtatcaat 1200 ttaaaaatag agccaaaaac tgatttggct gatttgaaag accacttaat ccaaaagcag 1260 caaatatggg attctttgta tgggaaacac cttgagaaga ttatgcttag aattagagag 1320 aaaaagaaaa aagaaaaaga aatacctgac ataaccacag cttttaacca gaatgctgcg 1380 gaatatgaag aaaagtaccc taactgtttt acaaatgatc tttctgaaac taaaactaac 1440 ttctccatga cttggtcccc aagttttgaa aaagttgaat tgagctcaga ggtagactac 1500 aacaatgcaa ttataaacaa gtttcgggaa agcttcaaaa gttcctcaag ggttgtttat 1560 aatagcccgt atagtagcat aaataaccaa acaaacaaag caagagatat aacaaacttg 1620 gttagactgt gcttaacaga gctaagctgt aatacaacaa aaatggaaaa gcaggaactt 1680 gaagatgaaa tagatataaa cactggaagt attaaagttg agagaacaaa aaaatctaaa 1740 gaatggaata agctgggttc atgtttaacc aggaacaaaa acgaattttg catgaaagag 1800 acaggcaggg agaacaaaac trcctatttc aaaggcttag cagtaatgaa tataggaatg 1860 rgttctaaga aaagaattct aaaaaaagaa gaaacaaaag aaaggatttc taaaggcctg 1920 gaatatgaca cctctgaaag gcaagctgat ccaaatgatg attattcaag tatagacatg 1980 tcttctctga ctcacatgaa gaaactaata aggcatgaca atgaggacag cttgagctgg 2040 tgtgaaaaaa ttaaggattc tttgtttgtc cttcataatg gcgatataag agaggaaggc 2100 aagatcacat ctgtttacaa taattatgct aaaaatcctg aatgcttgta cattcaagat 2160 tcagtactga agtctgaatt agagacttgc aaaaagataa acaaattatg caatgaccta 2220 gccatttacc attactctga ggacatgatg caattctcca aaggtttaat ggtggctgac 2280 aggtatatga ctaaagaaag tttcaagata ttaactacgg caaatacaag catgatgcta 2340 ttggcattca aaggggatgg aatgaacact ggtggatcgg gagtacctta catagcattg 2400 catatagtgg atgaagacat gtcagatcaa tttaacatat gttatactaa agaaatttat 2460 agctatttcc gaagtggtag taattacatt tatataatga ggccacagag actcaaccag 2520 gtgaggctgc ttagcctttt caaaacgcct agtaaagttc ctgtgtgttt tgcacaattt 2580 tcaaagaaag ctaacgagat ggaaaaatgg ctaaaaaaca aagatataga aaaggtaagt 2640 gttttttcta tgacaatgac tgtaaarcag atattaataa atattgtgtt ttcatctgtc 2700 atgataggaa ctgtgacaaa gctaagcagg atgggaattt ttgatttcat gagatatgca 2760 gggtttttgc cgctgtctga ttattctaat ataaaggaat acattagaga caaatttgat 2820 cctgatataa ctaatgtggc agatatctat tttgttaatg gaatcaaaaa actactgttt 2880 agaatggaag atctcaattt aagcacaaat gccaagcctg ttgttgtaga ccatgaaaat 2940 gatattatag gagggataac agacctgaat ataaaatgcc caataacagg atcaactttg 3000 ctgacacttg aggatttgta caataatgtt tatttggcta tttacatgat gcctaaatca 3060 ttgcacaatc atgttcacaa tctaacaagc ttgttaaatg tccctgctga gtgggaacta 3120 aagttcagaa aagagttagg tttcaacata ttagaagaca tataccccaa gaaagcaatg 3180 tttgatgaca aagacctatt ctctataaat ggagctttga atgtgaaagc attatctgat 3240 tattatctag gaaatataga aaatgttggt ttgatgagat cagaaataga aaataaagaa 3300 gatttcctaa gcccttgtta taaaatatct actttaaaat cttcaaaaaa atgctcacaa 3360 tcaaacatta taagtaccga tgaaataata gagtgtcttc aggatgcaaa gatccaagat 3420 atagaaaatt ggaaaggaaa taacttggcc attataaaag ggcttatgag aacctacaat 3480 gaggagaaga gtcgattgat ggaattcttt gaggataatt gtgtcaattc attatatctt 3540 gtagaaaagc ttaaagagat aattaatagt ggatcaataa ctgtagggaa atctgtaaca 3600 tctaaattca taagaaataa ccatccttta acagtagaaa catatctcaa aacaaaacta 3660 tattatagga ataatgtgac agttttaaag tctaaaaaag tgtcagagga gctttatgac 3720 cttgtgaaac agtttcataa catgatggaa atagacctag attctgtcat gaaccttggg 3780 aaaggtacag aagggaaaaa acacacattc ttgcagatgc ttgaatttgt catgtccaag 3840 gctaaaaatg tcaccgggtc tgtagatttc ctagtttctg tttttgagaa aatgcagaga 3900 accaaaacag acagagaaat atacttgatg agcatgaaag tgaaaatgat gctttatttt 3960 atagagcaca cattcaaaca tgtagcacaa agtgatccat cagaagccat ctctataagt 4020 ggagacaata aaataagagc actttctaca ttatctttgg acacaatcac gtcttacaat 4080 gatattttaa ataaaaattc aaaaaagtca agattggctt tcttatctgc tgatcagtcg 4140 aaatggtcgg catcagatct cacctataaa tatgttttag ctatcatatt aaatccagtt 4200 ttaactactg gtgaggctag tttgatgata gaatgcattt taatgtatgt taaattgaag 4260 aaggtttgta taccaacaga tatttttttg aatctaagaa aagctcaaga aacttttggg 4320 caaaatgaaa ctgccatagg acttttgact aaaggtttga cgacaaacac ataccctgtt 4380 agcatgaact ggttgcaagg caatttaaat tatctgtctt ctgtttatca ctcttgtgca 4440 atgaaagctt accacaagac tctagaatgt tacaaagact gtgatttcca aactagatgg 4500 attgtgcact ctgatgacaa tgcgacatca ttaatagcca gtggagaggt cgataaaatg 4560 ctaacagact tttcaagctt atctctgcct gaaatgttgt ttagaagcat tgaagctcat 4620 ttcaaaagct tttgcataac tttgaaccca aaaaagagtt atgcttcttc atcagaagta 4680 gagttcatat ctgaaagaat tgtgaatgga gcaattattc ctctctattg caggcattta 4740 gcaaactgtt gcacagaatc ttcacatata agttattttg atgatctaat gtcactcagt 4800 atacatgtta caatgcttct gagaaaaggc tgtcctaatg aagttatacc ttttgcttat 4860 ggggctgtgc aggtgcaagc attgagcatc tattcaatgc ttcctggtga agtgaatgat 4920 agcatcagaa tttttaacaa gcttggggta agtttaaagt caaatgagat tcctacaaac 4980 atggggggct ggttgacttc tcctatagag ccgttgtcta tattaggtcc atcatcaaat 5040 gatcaaatca tctattacaa tgtgataaga gattttttga acaagaaaag tttagatgaa 5100 gttaaagata gtgtctcttc ttctagctat ctacagatga gattcagaga gttaaaggaa 5160 aagcatgaaa aaggaactct agaagaaaag gataaaaaga tgatatttct tatcaatcta 5220 ttcgagaaag catcagtgtc tgaagattca gacgttctaa caatcgggat gaaatttcaa 5280 actatgttaa ctcagattat aaaattacct aattttataa atgagaatgc tttaaacaag 5340 atgtcaagtt ataaagattt ttcaaaactt taccccaatt tgaaaaaaaa tgaagattta 5400 tataaaagta ctaagaactt aaagatagac gaggatgcta ttttagagga agatgaatta 5460 tatgaaaaga ttgcatctag cttagaaatg gaatctgttc atgacataat gataaaaaat 5520 cctgaaacta ttttgatagc accattgaat gatagagatt ttttacttag tcagctgttc 5580 atgtacacaa gcccttctaa gaggaaccag ttatcgaacc aatccacaga gaaacttgct 5640 ttagatagag tgctaaggtc aaaagctaaa acatttgtag acatttcatc cactgtaaag 5700 atgacttatg aagaaaacat ggaaaagaaa atcttagaaa tgctaaaatt tgatttagat 5760 tcatattgtt catttaaaac atgtgtaaat ctagtgatca aggatgttaa ttttagcatg 5820 ctaattccaa tattggattc tgcataccct tgtgaatcta ggaaaagaga taactacaat 5880 ttcagatggt tccagactga gaaatggata cctgttgttg aaggctctcc gggactagta 5940 gtgatgcatg ctgtatatgg atcaaattat atagaaaatt taggtttaaa aaacatccct 6000 ctaacagacg atagcatcaa tgttttaaca agcacgtttg gaacaggttt gatcatggaa 6060 gatgtaaaat ccctagttaa aggcaaagac agctttgaaa cagaggcttt cagcaattct 6120 aatgaatgtc aaagattggt gaaagcatgc aattatatga tagcagctca aaacaggctt 6180 ttagcaatta acacatgctt tactaggaaa agcttcccct tctattctaa gttcaatcta 6240 gggagagggt ttatctcaaa cacattagct ctcctatcca ccatctacag taaagaagaa 6300 tcctatcatt ttgtttctac agctagttat aaattagaca aaactatcag aactgtgata 6360 agtgctcagc aagatatgaa cttagagaaa atactggaca ctgctgtata catatcagat 6420 aaattgcagt cacttttccc aacaattaca agagaggata tagttctgat attgcaaaat 6480 gtatgccttg acagcaaacc tatatggcag agtctagaag acaaaatgaa gaagatcaac 6540 aattcgacag caagtggttt cacagtatca aatgtgattc tatcacataa cagtgaattg 6600 aacacaatcc agaaacaaat tgtttggatg tggaacatgg gtttgtgttc tcatagaaca 6660 ttagattttg ttatcagata tattagaaga agtgatgtaa gatatgtgaa aactgaagaa 6720 caagacgaat caggaaatta tgtctctgga actatgtaca aaatcgggat catgacaaga 6780 agctgctatg ttcaattgat agcatctgat caagatgtag cagtttcttt gagaacacca 6840 tttgagatat tgaatgaaag agaatatctt tttgacacat acaaagaaag tatagagaaa 6900 ttgctgcaga aatttatgtt tgataaagtg aacataataa aatcaaaaca accacagatt 6960 gttttcttag aaccaggaga tgcctgcatc agaatgacca cagacaacaa aatgattgta 7020 aaggttaatg ccacaccaag acaaataaga ctagagaatg taaaattagt tgtaaagata 7080 aaatatgaaa acgtgaactc tgatgtgtgg gatattatag aaagccagaa atctctagtc 7140 ttgaggctcc ctgaagtagg agaatgtttc tctgatatgt acaaaactgc agattctgaa 7200 aatgaaacaa taaaaaccat aaaatacagg cttatgacct ctttaacttt catagaagcc 7260 tttggaaact tatcacagca gatcaaagag attgtagatg atgatatcag agaaacgatg 7320 gatgagtttt taatgaacat ccgggatacc tgcttagaag gtttggaaat ttgcaaaagt 7380 gtggaagaat atgatagcta tcttgatgga aatggtttta atgacacagt agaactattc 7440 gaaaacttgc taagaacaca tgacaacttt gaaaatgagt atagtccttt gttttcagag 7500 attgtcgaca aagcaaaacg gtatactaga gatttagaag gtttcaaaga aatactgctc 7560 atgcttaaat attctctgat aaatgatgca tcaggattta aaagctatag agccacagga 7620 atgcatgctg ttgaactaac ggcaaaaaag cacatagaga taggggaatt taacttattg 7680 ggaatgatcc agttgattaa agcttgtgaa acatgccaca acaatgattc tatattaaac 7740 ttagcaagtt taaggaatgt tcttagcagg acatatgcca catttggaag gagaataaga 7800 ttggatcatg atctggactt gcaaaacaac ttaatggaaa aaagttatga tttcaaaaca 7860 ctggttttac cagaaattaa attatcagaa ctatctaggg aaatactgaa aggaaatggg 7920 tttgttatat ctggagagaa tctaaaaata gataggtctg atgaagaatt tgagggtctt 7980 gccagtttta atgtgttgag gctagacgag gaagaaatgt atgaaggttt gatcaaagaa 8040 atgaaaatta aaaggaaaaa gaaagggttt ttattcccag cgaatacact tctactaagt 8100 gagttgataa agttcttgat tggtggaata aagggaacca gttttgatat agaaacattg 8160 ttgcggaaca gttttagacc agacatattt tcgactgaca gattgggcag attaagttcc 8220 agtgtacctg cactcaaagt ttatgcaact gtttatatgg aatataagaa tgttaattgt 8280 cccttaaatg aaatagctga cagcttagaa ggttatctaa aactgacaaa aagcaagtct 8340 aaagagcatt tcttgtctgg aagagttaag aaggccttga tacaattaag agatgaacaa 8400 tcgcgaacta aaaaactgga ggtctataaa gatattgcaa atttcctttc taggcaccca 8460 ctatgtttat ctgaaaaaac attgtatgga aggtacacct attctgacat caatgattat 8520 atcatgcaga caagagagat tattttgagt aaaataagtg agctagatga ggttgttgaa 8580 acagatgaag acgatttctt gcttagttat ctaagagggg aggaagatgc ctttgatgaa 8640 gatgagtttg gcgaagaaga agacacagat taaattgata gtaacgccta gctatacatg 8700 aataatagat tagatagaac ttaaaataca aatttattgt tactttggaa ctagattaga 8760 tctactcagc caaaaaatga tttggtgaac cgaatctata ttgtatataa atgtagagtc 8820 ctggtatagt ttcactggag ggaattctta tataatttat ttgtaaagtc tggctgtgga 8880 gagattatat gttttagttg tacctgattg ctct 8914 // ID MG025803; SV 1; linear; genomic RNA; STD; VRL; 4851 BP. XX AC MG025803; XX DT 24-OCT-2017 (Rel. 134, Created) DT 24-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Tomato spotted wilt orthotospovirus isolate TSWV-QLD2 segment M, complete DE sequence. XX KW . XX OS Tomato spotted wilt tospovirus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Tospoviridae; Orthotospovirus. XX RN [1] RP 1-4851 RA Moyle R.L., Schenk P.M.; RT "Complete nucleotide sequence of the Australian Tomato spotted wilt virus RT isolate TSWV-QLD2"; RL Unpublished. XX RN [2] RP 1-4851 RA Moyle R.L., Schenk P.M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL School of Agriculture and Food Science, University of Queensland, St Lucia, RL Brisbane, Qld 4072, Australia XX DR MD5; 12f7fb1d01aef57c45043f7e7631a9d6. DR EuropePMC; PMC5701467; 29167242. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. 8.1.7 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..4851 FT /organism="Tomato spotted wilt tospovirus" FT /segment="M" FT /host="Capsicum annuum cv. warlock" FT /isolate="TSWV-QLD2" FT /mol_type="genomic RNA" FT /country="Australia" FT /collected_by="Leanne Forsyth" FT /collection_date="20-Jul-2015" FT /db_xref="taxon:1933298" FT CDS 101..1009 FT /codon_start=1 FT /product="NSm" FT /note="nonstructural protein" FT /db_xref="InterPro:IPR000603" FT /db_xref="InterPro:IPR006889" FT /db_xref="UniProtKB/TrEMBL:A0A291RBP6" FT /protein_id="ATL64765.1" FT /translation="MLTLFGNKRPSKSAGKDEGPLVSLAKHNGNVEVSKPWSSSDEKLA FT LTKAMDASKGKILLNTEGTSSFGTYESDSITESEGYDLSARMIVDTNHHISNWKNDLFV FT GNGKQNANKVIKICPTWDSRKQYMMISRIVIWVCPTIPNPTGKLVAALVDPNMPSEKQV FT ILKGQGTITDPICFVFYLNWSIPKMNNTPENCCQLHLMCSQEYKKGVSFGSVMYSWTKE FT FCDSPRADKDKSCMVIPLNRAIRARSQAFIEACKLIIPKGNSEKQIKKQLKELSSNLER FT SVEEEEEGISDNVAQLSFDEI" FT CDS complement(1360..4767) FT /codon_start=1 FT /product="Gn-Gc" FT /note="glycoprotein precursor" FT /db_xref="GOA:A0A291RBP8" FT /db_xref="InterPro:IPR005167" FT /db_xref="InterPro:IPR014414" FT /db_xref="UniProtKB/TrEMBL:A0A291RBP8" FT /protein_id="ATL64766.1" FT /translation="MRILKLLELVVKVSLFTIALSSVLLAFLIFKATDAKVEIIRGDHP FT EIYDDSAENEVPTAASIQREAILETLTNLMLESQTSGTRQIREEKSTIPISAEPTTQKT FT ISVLDLPNNCLNASSLKCEIKGISTYNVYYQVENNGVMYSCVSDSAEGLEKCDNSLNLP FT KRFSKVPVIPITKLDNKRHFSVGTKFFISESLTQDNYPITYNSYPTNGTVSLQTVKLSG FT DCKITKSNFANPYTVSITSPEKIMGYLIKKPGENVENKVVSFSGSASITFTEEMLDGEH FT NLLCGDKSAKIPKTNKRVRDCIIKYSKSIYKQTACINFSWIRLILIALLIYFPIRWIVN FT KTTKPLFLWYDLMGLITYPILLLINCLWKYFPFKCSNCGNLCIVTHECTKVCICNKSKA FT SKEHSSECPILSKEADHDYNKHKWTSMEWFHLIVNTKLSLSLLKFVTEILIGLIILSQM FT PMSMAQTTQCLSGCFYVPGCPFLVTSKFEKCPGKDQCYCNVKEDKIIESIFGTNIVIEG FT PNDCIENQNCIARPSIDNLIKCRLGCEYLDLFQNKPLYNGFSDYTGSSLGLTSVGLYEA FT KRLRNGIIDSYNRTDKISGMIAGDSLDRNETSIPENILPRQSLIFDSVVDGKYRYMIEQ FT SLLGGGGTVFMLNDKTSETAKKFVIYIKSVGIHYEVSEKYTTAPIQSTHTDFYSTCTGN FT CDTCRKNQXLTGFQDFCITPTSYWGCEEAWCFAINEGATCGFCRNIYDMDKSYRIYSVL FT KSTIVADVCISGILGGQCSRITEEVPYENTLFQADIQADLHNDGITIGELIAHGPDSHI FT YSGNIANLNDPVKMFGHPQLTHDGVPIFTKKTLEGDDMSWDCAAIGKKSVTIKTCGYDT FT YRFRSGLEQISDIPVSFKDFSSFFLERSFSLGXLKIVVDLPSDLFKVAPKKPSITSTSL FT NCNGCLLCSQGLSCILEFFSDLTFSTAISIDACSLSTYQLAVKKGSNKYNITMFCSANP FT DKKKMTLYPEGNPDISVEVLVNNVIVKEPENIIDQNDEYAHEEQQYNSDSSAWGFWDYI FT KSPFNFIASYFGSFFDTIRVVLLIAFIFLVIYFCSILTSICKGYVKHESYKSRSKIEDD FT DEPEIKAPMLMKDTMTRRRPPMDFSHLV" XX SQ Sequence 4851 BP; 1546 A; 871 C; 838 G; 1594 T; 2 other; agagcaatca gtgcgtcaga aatataccta ttatacactt tgctaagaat caatcaatta 60 cattacacaa gctcctctac cttaggctgt tgaactcaaa atgttgactc ttttcggtaa 120 caagaggcct tctaagtctg ccggaaagga tgaaggtcct ttagtttcac ttgctaaaca 180 taatggcaat gttgaagtct caaaaccatg gtcttcttct gatgaaaagc ttgctttaac 240 caaagccatg gatgcatcaa aaggaaagat actgttgaac actgagggaa catcttcctt 300 cggaacctat gaatctgatt ctatcacaga atcagagggt tatgatcttt ctgctagaat 360 gatagtagat acaaaccatc atatctcaaa ctggaaaaat gatctttttg ttggcaacgg 420 aaagcaaaat gctaataagg ttatcaagat ctgtccaact tgggatagca ggaaacaata 480 catgatgatt tctaggattg tgatatgggt ctgccctact ataccaaacc ctacagggaa 540 acttgtggct gctttagttg atcccaacat gccatctgaa aagcaagtca tcctgaaggg 600 tcaagggaca ataactgatc ctatctgctt tgttttttat ctgaactggt ctatcccgaa 660 gatgaacaac actccagaaa actgttgtca gctgcatttg atgtgcagcc aagaatacaa 720 gaaaggggtt tcttttggca gtgtcatgta ttcttggaca aaagagtttt gtgattcacc 780 cagagctgat aaagacaaaa gttgtatggt tatacctcta aacagagcca ttagagctag 840 gtctcaagca ttcattgaag cctgcaagct gataattccc aaaggaaaca gtgagaagca 900 gattaaaaaa cagcttaaag aattgagctc aaatcttgag agatcagttg aagaagaaga 960 ggaagggatt tctgacaatg ttgctcaatt atcctttgat gaaatatagt tctttaaata 1020 tcacttattt aagcttaaat ttctttctat tttgcatttt gaatccaaaa aaccaaaaca 1080 gaacaaaaaa taaaaaacaa aaaagaaaac aaacaaaaaa tcaaaccaaa aacaaaacaa 1140 aataaggctg aaaagccaaa ctttggtccg aagactcttt tttcgttttt tgttttgttt 1200 tttgtttttt gctttttgtt tatttttatt tttgtttgtt gttttttgct tatttcatat 1260 ttgcttttat tagttaataa ttgattctaa agatttttac atatatttaa tcctgctaat 1320 atagaagatt gaatcaaatt ttacctatga caagcatctt cagacaaggt gagagaaatc 1380 cataggtggc cttcgtcttg tcattgtatc tttcattaac atgggggctt tgatctcagg 1440 ttcatcatca tcctctatct tggatctaga tttataagat tcatgcttta catatccttt 1500 acaaatggat gtcagaatag aacagaaata aatcacaagg aaaatgaatg caataagcag 1560 taccactctg atagtatcaa aaaatgaacc aaagtaactt gcaatgaaat tgaatggact 1620 cttaatataa tcccagaagc cccatgctga agaatcagaa ttatattgct gttcttcatg 1680 agcatactca tcattttgat ctattatatt ctctggttct tttacaataa cattattaac 1740 caaaacttcc acagatatat ccggattgcc ttctggatat agtgtcattt tcttcttgtc 1800 tggattggct gaacaaaaca ttgttatatt gtatttatta gatccttttt taacagccag 1860 ctgataagta gataaagagc aagcgtctat agaaattgca gtagaaaatg tcaaatctga 1920 gaaaaattct aaaatgcaag ataaaccttg actgcataga agacagccgt tgcaatttaa 1980 gctcgtcgaa gttatggaag gttttttagg agcaacttta aaaagatcag atggaagatc 2040 gactacaatt ttcagttycc ctaaactaaa agatctttcc aggaaaaaac tagagaaatc 2100 tttgaaacta acaggaatat ctgatatttg ctctaaaccg gatctaaacc tgtatgtgtc 2160 atatccacac gttttaatag tgactgattt ttttcctatt gctgcacaat cccaagacat 2220 gtcatcccct tctagagttt tcttagtaaa aataggcact ccatcatgag tcaattgtgg 2280 atgaccaaac attttcacag gatcattcaa gtttgcaata ttcccagaat aaatatggct 2340 gtcaggtcca tgagctatta gttcacctat agtgatacca tcattatgca aatctgcttg 2400 tatatcagct tgaaacaatg tattttcata aggaacttct tcagtaatcc ttgagcattg 2460 acctcccaaa ataccagaaa tacaaacatc tgctactata gttgatttaa gcactgagta 2520 aattctgtat gatttgtcca tatcataaat atttcgacag aatccgcatg tggcaccctc 2580 attaattgca aaacaccaag cttcttcaca tccccaataa gaagttggtg ttatacaaaa 2640 gtcttggaaa cctgtcaaar cttgattttt cctgcaagta tcgcagtttc ctgtacaagt 2700 ggaataaaaa tctgtgtggg tgctttggat gggagctgtt gtgtattttt ctgacacttc 2760 ataatgaatt cccacacttt tgatataaat aacaaatttt ttggctgttt ctgaggtctt 2820 gtcatttagc atgaatacag ttcctcctcc tcccaaaaga gattgttcta tcatatatct 2880 atatttccca tctacaacag aatcaaagat taacgattgc ctgggcagaa tattctctgg 2940 tatgcttgtt tcatttctgt ctaaagagtc tcctgcaatc attccggaaa ttttgtctgt 3000 acgattatag gaatctatta taccatttct caatctctta gcctcataaa gacctactga 3060 tgttaatcct aaagagcttc ctgtgtaatc cgaaaaccca ttgtacaaag gtttgttttg 3120 aaataaatct aggtattcac aacctaatct gcattttata agattatcaa tagatggacg 3180 tgcaatgcaa ttctggttct ctatgcaatc attaggacct tctataacaa tattagtgcc 3240 aaagatactt tctatgatct tgtcttcttt tacattgcag taacattgat cttttccagg 3300 gcatttttca aatttgctcg taaccaaaaa tggacagcct ggaacataaa agcatccact 3360 caaacattgg gttgtttgag ccatagacat gggcatctga gacaaaatga ttaaacctat 3420 taaaatttcg gtcacaaatt ttagcaaact caagctcagc ttagtgttca ctattagatg 3480 gaaccattcc atgctcgtcc acttatgttt gttgtagtcg tgatctgctt ctttggacaa 3540 tatgggacat tccgaagaat gctcttttga agctttgctt ttgttgcaaa tgcacacttt 3600 agtacactca tgtgtgacta tgcacaaatt gccgcagtta gaacatttga atgggaaata 3660 tttccataaa caatttatga gcaataagat agggtatgtg atcaagccca taagatcata 3720 ccagagaaag agaggtttag tcgtcttgtt cactatccat cggataggga aatagatcaa 3780 caaagctatt aatatcaatc ttatccaaga aaaattgatg caggctgttt gcttataaat 3840 actctttgaa tatttgatta tgcaatctct gactctcttg tttgtttttg gtattttagc 3900 tgatttgtca ccgcacaaga gattgtgttc accatccaac atttcctcag tgaaagtgat 3960 acttgctgat ccagaaaaag atacaacctt gttttctaca ttttcaccag gtttttttat 4020 caaataaccc atgatcttct cagggctagt gatgctcaca gtatagggat ttgcaaagtt 4080 tgatttagtt attttgcagt caccagataa ctttacagtt tgtaatgata ctgttccatt 4140 agtggggtat gagttgtaag ttataggata attatcttgc gtcaggcttt ctgaaatgaa 4200 gaattttgtt cctactgaaa aatgtctttt gttgtcaagt tttgtaatag gaataactgg 4260 gactttggag aatctctttg gcaaatttaa agaattatca catttttcta aaccttctgc 4320 tgaatcagaa acacaggaat acatgacacc attgttttca acttgataat aaacattata 4380 agtagatatc ccctttatct cacattttaa tgaagaagca ttcaagcaat tgttgggaag 4440 atctaaaaca gaaattgttt tttgcgttgt tggctcagca gaaataggga tggttgattt 4500 ttcttctcgt atttgacggg ttccagaagt ctgagattct agcatcagat tggttaaagt 4560 ctccaagata gcttcgcgtt gaatcgatgc agcagtgggt acctcattct cagcagaatc 4620 atcataaatc tcaggatgat ctccacggat tatttctact ttagcatctg tggctttgaa 4680 gatcaagaat gccaacaaaa cagaactcag ggcaattgtg aaaagactca cttttaccac 4740 tagttctagt agtttcagaa ttctcatctt agatgtctac ccagattaca atggttgtgt 4800 gattaatttc aagatgtctg gattaaggtt tttgtttgca ctgattgctc t 4851 // ID MG025804; SV 1; linear; genomic RNA; STD; VRL; 2987 BP. XX AC MG025804; XX DT 24-OCT-2017 (Rel. 134, Created) DT 24-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Tomato spotted wilt orthotospovirus isolate TSWV-QLD2 segment S, complete DE sequence. XX KW . XX OS Tomato spotted wilt tospovirus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Tospoviridae; Orthotospovirus. XX RN [1] RP 1-2987 RA Moyle R.L., Schenk P.M.; RT "Complete nucleotide sequence of the Australian Tomato spotted wilt virus RT isolate TSWV-QLD2"; RL Unpublished. XX RN [2] RP 1-2987 RA Moyle R.L., Schenk P.M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL School of Agriculture and Food Science, University of Queensland, St Lucia, RL Brisbane, Qld 4072, Australia XX DR MD5; 28079aab54a50df28289c88a4c67d474. DR EuropePMC; PMC5701467; 29167242. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. 8.1.7 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..2987 FT /organism="Tomato spotted wilt tospovirus" FT /segment="S" FT /host="Capsicum annuum cv. warlock" FT /isolate="TSWV-QLD2" FT /mol_type="genomic RNA" FT /country="Australia" FT /collected_by="Leanne Forsyth" FT /collection_date="20-Jul-2015" FT /db_xref="taxon:1933298" FT CDS 89..1492 FT /codon_start=1 FT /product="NSs" FT /note="silencing suppressor" FT /db_xref="InterPro:IPR004915" FT /db_xref="UniProtKB/TrEMBL:A0A291RBQ2" FT /protein_id="ATL64767.1" FT /translation="MSSSVYESIIQTRASVWGSTASGKAVVDSYWIHELGTGSQLVQTQ FT LYSDSRSKSSFGYTAKVGNLPCEEEEILSQHVYIPIFDDIDFSINIDDSVLALSVCSNT FT VNTNGVKHQGHLKVLSPAQLHSIGSTMNRSDITDRFQLQEKDIIPNDKYIEAANKGSLS FT CVKEHTYKIEMCYNQALGKVNVLSPNRNVHEWLYSFKPSFNQVESNNRTVNSLAVKSLL FT MSAENNIMPNSQAFVKASTDSHFKLSLWLRVPKVLKQVSIQKLFKVARDETNKTFYLSI FT ACIPNHNNVETALNISVICKHQLPIRKCKAPFELSMMFSDLKEPYNIVHDPSYPQRIVH FT ALLETHTSFAQVLCNNLQEDVIIYTLNNYELTPGKLDLGERTLNYSEDIYKRKYFLSKT FT LECLPSNTQTMSYLDSIQIPSWKIDFTRGEIKISPQPISVAKSLLKLDLSKIKKKESKI FT SETYASGSK" FT CDS complement(2060..2836) FT /codon_start=1 FT /product="Nc" FT /note="nucleocapsid" FT /db_xref="GOA:A0A120MF37" FT /db_xref="InterPro:IPR002517" FT /db_xref="UniProtKB/TrEMBL:A0A120MF37" FT /protein_id="ATL64768.1" FT /translation="MSKVKLTKENIVALLTQGKDLEFEEDQNLIAFNFKTFCLENLDQI FT KKMSVISCLTFLKNRQSIMKVIKQSDFTFGKITIKKTSDRIGAKDMTFRRLDSLIRVRL FT VEETGNSENLNTIKSKIASHPLIQAYGLPLDDAKSVRLAIMLGGSLPLIASVDSFEMIS FT VVLAIYQDAKYKDLGIDPKKYDTKEALGKVCTVLKSKAFEMNEDQVKKGKEYAAILSSS FT NPNAKGSVAMEHYSETLNKFYEMFGVKKQAKLTELA" XX SQ Sequence 2987 BP; 950 A; 546 C; 465 G; 1023 T; 3 other; agagcaattg tgtcataatt ttattcttaa tcaaacctca cttagaaaat cacaatactg 60 taataagaac acagtaccaa taaccataat gtcttcaagt gtttatgagt cgatcattca 120 gacaagagct tcagtctggg gatcaactgc atctggtaaa gctgttgtag attcttactg 180 gattcatgaa cttggcactg gttctcaact agttcaaacc caactgtatt ctgattcaag 240 aagcaaaagt agctttggct atactgcaaa ggtagggaat cttccctgtg aagaagaaga 300 gattctttct cagcatgtgt atatccctat ttttgatgat attgatttta gcatcaatat 360 tgatgactct gttctggcac tatctgtttg ctcaaataca gttaatacta acggagtgaa 420 acatcaaggt catttgaaag ttttgtctcc tgctcagctc cactctattg gatctaccat 480 gaacagatct gatattacag accgattcca gcttcaggaa aaagacataa ttcccaatga 540 taaatatatt gaagctgcaa acaaaggctc tttgtcttgt gtcaaagagc atacttataa 600 gatcgaaatg tgctataatc aagctttggg caaagtgaat gttttatccc ctaacaggaa 660 tgtccatgaa tggttgtaca gtttcaagcc aagtttcaat caagttgaaa gcaacaacag 720 aactgtaaat tctcttgcag tgaaatctct actcatgtca gcagaaaaca acatcatgcc 780 yaactctcag gcttttgtca aagcttccac tgattctcat ttcaaactga gcctctggct 840 aagggttcca aaggttttga agcaggtctc cattcaaaaa ttgttcaagg ttgcaagaga 900 tgaaacaaat aaaacatttt atttatctat tgcttgcatt ccaaaccata acaatgttga 960 gacagcttta aacatttctg ttatttgcaa gcatcagctc ccaattcgta aatgtaaagc 1020 tccttttgaa ttatcaatga tgttttctga tttaaaggag ccttacaaca ttgttcatga 1080 tccttcatat cctcagagga ttgttcatgc tctgcttgaa actcacacat cttttgcaca 1140 agttctttgc aacaacttac aagaagatgt gatcatctac actttgaaca actatgagct 1200 aactcctgga aagttagatt taggtgaaag aaccttaaat tacagtgaag atatctacaa 1260 aaggaaatat ttcctttcaa aaacacttga atgtcttcca tctaacacac aaactatgtc 1320 ttacttggac agcatccaaa tcccttcctg gaagatagac tttaccaggg gagaaattaa 1380 gatttctcca caacctattt cagttgcaaa atctttgtta aagcttgatt taagcaagat 1440 caaaaagaaa gaatctaaga tttcggaaac atatgcttca ggatcaaaat aatcttgctg 1500 cgtccggttt ttctaattat gttatgttta ttttctttct tyacttataa ttatttctct 1560 gttttgtcat ttcttttaag ttcctcctgc ttaatagaaa ccataaaaca aaaataataa 1620 aaaataaaat aaaaataaaa atcaaaaaat gaaacaaaaa ccaaaaaatg aaacaaaaat 1680 aaaatgaaat aaaaacaaca aaaaaattaa aacaaaaaac caaaaaagat cccgaaaggg 1740 acaattttgg ccaaatttgg gttttgtttt gttttgtttt ttgttttttg ttttatttta 1800 ttttatttat ttattttatt ttatttttta ttttttattt ttatttttta ttttttattt 1860 ttatttttat ttttgytttt attttatgtt ttttgttgtt tttgttattt tgtttattta 1920 tcaagcacaa cacacagaaa gcaaacttta attaaacaca cttattttaa atttagcaca 1980 ctaagcaagc acaagcaata aagttaaaga aagctttata tatttgtaga cttttccata 2040 atttaactta cagctgcttt taagcaagtt ctgtgagttt tgcctgtttt ttaaccccga 2100 acatttcata gaacttgtta agagtttcgc tgtaatgttc catagcaaca cttcctttag 2160 cattaggatt gctggagctg agtatagcag catactcttt ccctttcttc acctgatctt 2220 cattcatttc aaatgctttg cttttcagca cagtgcaaac ttttcccaag gcttccttgg 2280 tgtcatactt ctttgggtca atcccgaggt ctttgtattt tgcatcctga tatatagcca 2340 agacaacact gatcatctca aagctatcaa ctgaagcaat aagaggtaag ctacctccca 2400 gcattatggc aagcctcaca gactttgcat catcaagagg taatccatag gcttgaatca 2460 aagggtggga agcaatctta gatttgatag tgttgagatt ctcagaattc ccagtttcct 2520 ctacaagcct gaccctgatc aagctatcaa gccttctgaa ggtcatgtct ttggctccaa 2580 tcctgtctga agttttcttt atggtaattt taccaaaagt aaaatcactt tgtttaataa 2640 ccttcattat actctgacga ttcttcagga atgtcagaca tgaaataaca ctcatcttct 2700 tgatctggtc taggttttcc agacaaaaag tcttgaagtt gaatgctatc agattctgat 2760 cttcctcaaa ctcaaggtct ttgccttgtg tcaacaaagc aacaatgttt tccttagtga 2820 gcttaacctt agacatgatg attgtagaag ctgttatatg ctttgaccgt atgtaattca 2880 aggtgcgaaa gtgcaactct gtattccgca gtcgtttctt agggttttaa tgtgatgatt 2940 tgcaagactg agtgttaagg tttgaataaa atcgacacaa ttgctct 2987 // ID MG025806; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025806; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/27 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/27" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSV4" FT /db_xref="UniProtKB/TrEMBL:A0A384XSV4" FT /protein_id="AUN27711.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XTE0" FT /protein_id="AUN27712.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025807; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025807; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/28 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/28" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XTY7" FT /db_xref="UniProtKB/TrEMBL:A0A384XTY7" FT /protein_id="AUN27713.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XSU9" FT /protein_id="AUN27714.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025808; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025808; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/29 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/29" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSV1" FT /db_xref="UniProtKB/TrEMBL:A0A384XSV1" FT /protein_id="AUN27715.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XSU8" FT /protein_id="AUN27716.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025809; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025809; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/30 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/30" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSV2" FT /db_xref="UniProtKB/TrEMBL:A0A384XSV2" FT /protein_id="AUN27717.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XVH5" FT /protein_id="AUN27718.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025810; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025810; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/31 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/31" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSV6" FT /db_xref="UniProtKB/TrEMBL:A0A384XSV6" FT /protein_id="AUN27719.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XXM0" FT /protein_id="AUN27720.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025811; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025811; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/32 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/32" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSW1" FT /db_xref="UniProtKB/TrEMBL:A0A384XSW1" FT /protein_id="AUN27721.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XTE8" FT /protein_id="AUN27722.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025812; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025812; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/33 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/33" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XTZ2" FT /db_xref="UniProtKB/TrEMBL:A0A384XTZ2" FT /protein_id="AUN27723.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XSV9" FT /protein_id="AUN27724.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025813; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025813; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/34 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/34" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSW3" FT /db_xref="UniProtKB/TrEMBL:A0A384XSW3" FT /protein_id="AUN27725.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XSV8" FT /protein_id="AUN27726.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025814; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025814; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/35 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/35" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSW0" FT /db_xref="UniProtKB/TrEMBL:A0A384XSW0" FT /protein_id="AUN27727.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XVH8" FT /protein_id="AUN27728.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025815; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025815; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/36 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/36" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSW6" FT /db_xref="UniProtKB/TrEMBL:A0A384XSW6" FT /protein_id="AUN27729.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XXM3" FT /protein_id="AUN27730.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025816; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025816; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/37 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/37" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSX2" FT /db_xref="UniProtKB/TrEMBL:A0A384XSX2" FT /protein_id="AUN27731.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XTF1" FT /protein_id="AUN27732.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025817; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025817; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/38 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/38" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XTZ8" FT /db_xref="UniProtKB/TrEMBL:A0A384XTZ8" FT /protein_id="AUN27733.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XSW7" FT /protein_id="AUN27734.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025818; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025818; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/39 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/39" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSX1" FT /db_xref="UniProtKB/TrEMBL:A0A384XSX1" FT /protein_id="AUN27735.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XSW9" FT /protein_id="AUN27736.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025819; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025819; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/40 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/40" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSX0" FT /db_xref="UniProtKB/TrEMBL:A0A384XSX0" FT /protein_id="AUN27737.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XVI0" FT /protein_id="AUN27738.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025820; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025820; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/41 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/41" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSX6" FT /db_xref="UniProtKB/TrEMBL:A0A384XSX6" FT /protein_id="AUN27739.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XXM6" FT /protein_id="AUN27740.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025821; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025821; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/42 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/42" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSX9" FT /db_xref="UniProtKB/TrEMBL:A0A384XSX9" FT /protein_id="AUN27741.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XTF6" FT /protein_id="AUN27742.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025822; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025822; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/43 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/43" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XU03" FT /db_xref="UniProtKB/TrEMBL:A0A384XU03" FT /protein_id="AUN27743.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XSX5" FT /protein_id="AUN27744.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025823; SV 1; linear; viral cRNA; STD; VRL; 695 BP. XX AC MG025823; XX DT 03-OCT-2018 (Rel. 138, Created) DT 03-OCT-2018 (Rel. 138, Last updated, Version 1) XX DE Rabies lyssavirus isolate IND/NCDC/44 transmembrane glycoprotein G gene, DE partial cds; G-L intergenic spacer, complete sequence; and L protein gene, DE partial cds. XX KW . XX OS Rabies lyssavirus OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Rhabdoviridae; Lyssavirus. XX RN [1] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT "Characterization of G-L region of Rabies virus"; RL Unpublished. XX RN [2] RP 1-695 RA Jaiswal R., Singh G., Gupta N., Chhabra M.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Zoonosis Division, National Centre for Disease Control, 22- Sham Nath Marg, RL Delhi, Delhi 110054, India XX DR MD5; 13bbeab882ec6127e93435e900ef733a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..695 FT /organism="Rabies lyssavirus" FT /host="spotted deer" FT /isolate="IND/NCDC/44" FT /mol_type="viral cRNA" FT /country="India" FT /collection_date="2016" FT /db_xref="taxon:11292" FT mRNA <1..208 FT /product="transmembrane glycoprotein G" FT CDS <1..144 FT /codon_start=1 FT /product="transmembrane glycoprotein G" FT /db_xref="GOA:A0A384XSX8" FT /db_xref="UniProtKB/TrEMBL:A0A384XSX8" FT /protein_id="AUN27745.1" FT /translation="TCCGRVHRPKSTQHSLGGTGRKVSVTSQSGKVISSWESYKSGGET FT RL" FT misc_feature 209..633 FT /note="G-L intergenic spacer" FT mRNA 634..>695 FT /product="L protein" FT CDS 664..>695 FT /codon_start=1 FT /product="L protein" FT /db_xref="UniProtKB/TrEMBL:A0A384XSX4" FT /protein_id="AUN27746.1" FT /translation="MLDPGEVYDDP" XX SQ Sequence 695 BP; 190 A; 165 C; 179 G; 161 T; 0 other; acatgttgcg gaagagtcca tcgacccaag tctacacaac acagtctcgg gggaacgggg 60 aggaaggtgt cggtcacttc ccaaagcggg aaggtcatat cttcatggga gtcatataag 120 agtgggggtg agaccagact gtaaagaccg gtcatccttt tcaagcttta agtcctgaag 180 atcaccttcc cttggggtta gggggaaatc tctgggttca atagccctcc ctaaactccg 240 tgtaacaggg tagattccag agtcacgaga cctccattaa tcatctcagt tgatcagaca 300 tggtcgtgca ggttcttaaa atataagaaa tcttctggca gtttcagtga ccaacggtgc 360 ttccatcctc caggggtcga taccaaaggt tgcggacagg tcgagggtta cctcagatga 420 ctccgtgctc gggcacggac agaggtcata gtgggtcccc tgacagcaga ctcaacatga 480 gtcaacggag cagggcgatc tgcctctcat gagggacata agcaatagct cacaatcatc 540 tcgcatctca gtcaagtgtg cataattata aagggctggg tcatctaagc tttttagtcg 600 agaaaaaaac tgttggccaa aaatgcaact ggcaacactt ctcatcctca gacctagatc 660 aagatgcttg atccgggaga ggtctatgat gaccc 695 // ID MG025947; SV 1; linear; genomic RNA; STD; VRL; 3309 BP. XX AC MG025947; XX DT 20-NOV-2017 (Rel. 134, Created) DT 20-NOV-2017 (Rel. 134, Last updated, Version 1) XX DE Cucumber mosaic virus isolate Rs segment RNA 1, complete sequence. XX KW . XX OS Cucumber mosaic virus (cucumber mosaic cucumovirus) OC Viruses; Riboviria; Bromoviridae; Cucumovirus. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-3309 RX DOI; .1073/pnas.1714916114. RX PUBMED; 29087346. RA Andika I.B., Wei S., Cao C., Salaipeth L., Kondo H., Sun L.; RT "Phytopathogenic fungus hosts a plant virus: A naturally occurring RT cross-kingdom viral infection"; RL Proc. Natl. Acad. Sci. U.S.A. 114(46):12267-12272(2017). XX RN [2] RP 1-3309 RA Andika I.B., Sun L.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL College of Plant Protection, Northwest A&F University, Taicheng Road 3#, RL Xiaan, Shaanxi 712100, China XX DR MD5; cef765df72d23783ad5b943384debc85. DR EuropePMC; PMC5699089; 29087346. XX CC ##Assembly-Data-START## CC Assembly Method :: Trinity v. 2.4.0 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..3309 FT /organism="Cucumber mosaic virus" FT /segment="RNA 1" FT /host="Rhizoctonia solani" FT /isolate="Rs" FT /mol_type="genomic RNA" FT /country="China" FT /collection_date="Nov-2016" FT /db_xref="taxon:12305" FT gene 82..3063 FT /gene="1a" FT CDS 82..3063 FT /codon_start=1 FT /gene="1a" FT /product="replicase" FT /db_xref="GOA:A0A2D3HXC9" FT /db_xref="InterPro:IPR002588" FT /db_xref="InterPro:IPR021002" FT /db_xref="InterPro:IPR022184" FT /db_xref="InterPro:IPR027351" FT /db_xref="InterPro:IPR027417" FT /db_xref="UniProtKB/TrEMBL:A0A2D3HXC9" FT /protein_id="ATU79659.1" FT /translation="MATSSFNINELVASHGDKGLLATALVDKTAHEQLEEQLQHQRRGR FT KVYIRNVLGVKDSEVIRNRYGGKYDLHLTQQEFAPHGLAGALRLCETLDCLDSFPSSGL FT RQDLVLDFGGSWVTHYLRGHNVHCCSPCLGIRDKMRHAERLMNMRKIILNDPQQFDGRQ FT PDFCTQPAADCKVQAHFAISIHGGYDMGFRGLCEAMNAHGTTILKGTMMFDGAMMFDDQ FT GVIPELNCQWRKIRSAFSETEDVTSLSGKLNSTVFSRVRKFKTMVAFDFINESTMSYVH FT DWENIKSFLTDQTYSYRGMTYGIERCVIHAGIMTYKIIGVPGMCPPELIRHCIWFPSIK FT DYVGLKIPASQDLVEWKTVRILMSTLRETEEIAMRCYNDKKAWMEQFKVILGVLSAKSS FT TIVINGMSMQSGERIDINDYHYIGFAILLHTKMKYEQLGKMYDMWNASSISKWFAALTR FT PLRVFFSSVVHALFPTLRPREEKEFLIKLSTFVTFNEECSFDGGEEWDVISSAAYVATQ FT AVTDGKILAAQKAEKLAEKLAQPVSEVSDSPEASSQTPDDTAEVCGKEREVSELDSLSA FT QTRSPITRVAERATAMLEYAAYEKQLHDTTVSNLKRIWNMAGGDDKRNSLEGNLKFVFD FT TYFTVDPMVNIHFSTGRWMRPVPEGVVYSVGYNERGLGPKSDGELYIVNSECVICNSES FT LFTVTRSLQAPTGTISQVDGVAGCGKTTAIKSIFEPSTDMIVTANKKSAQDVRMALFKS FT SDSKEACTFVRTADSVLLNECPTVSRVLVDEVVLLHFGQLCAVMSKLKAVRAICFGDSE FT QIAFSSRDASFDMRFSKIIPDETSDADTTFRSPQDVVPLVRLMATKTLPKGTHSKYTKW FT VSQSKVKRSVTSRAIVSVTLVDLDPSRFYITMTQADKASLISRAKEMNLPKTFWNERIK FT TVHESQGISEDHVTLVRLKSTKCDLFKQFSYCLVALTRHKVTFRYEYCGVLNGDLIAEC FT VARA" XX SQ Sequence 3309 BP; 840 A; 731 C; 803 G; 935 T; 0 other; agcgtacggt tcaatccctg cctcccctgt aaaattaccc tttgaaaacc tctctttctt 60 aatcttttct ttgcaattcc tatggcgacg tcctcgttca acatcaatga actggtagcc 120 tcccacggcg ataaaggact actcgcgacc gccctcgttg ataagacagc tcatgagcag 180 ctcgaggagc aattacagca tcaacgtaga ggccgtaagg tctacatccg gaatgtattg 240 ggtgtaaagg attccgaggt catccggaat cggtatggag ggaagtacga cctccatctt 300 acccagcagg agtttgctcc ccacggccta gctggtgccc tccgcttgtg tgaaactctc 360 gattgtctag actctttccc ttcatcaggt ctgcggcagg acctcgtctt agacttcgga 420 ggaagttggg tcacacatta cctccgcgga cataacgtac actgctgttc cccttgtttg 480 ggtatccgcg ataaaatgcg ccacgcggaa cgtttaatga acatgcgcaa gatcatcttg 540 aacgatccac aacagttcga tggtcgacag ccggatttct gcactcaacc ggctgcagat 600 tgcaaagtac aagcccactt tgctatatct attcatggag gctatgatat gggcttcaga 660 ggattatgtg aagcgatgaa cgctcacgga accactattt tgaagggaac gatgatgttc 720 gatggtgcga tgatgtttga cgaccaaggt gtaattcccg aacttaactg tcagtggagg 780 aagattagga gtgctttctc cgaaactgaa gacgtcacat cgttgtctgg taagcttaat 840 tccacagtat tctcccgcgt gcggaaattc aagactatgg tagcttttga tttcattaac 900 gagtctacca tgtcttatgt tcatgactgg gagaatataa aatcttttct tacagaccag 960 acttactcgt accgagggat gacttacggt attgagcgct gtgttattca tgccggcatt 1020 atgacgtaca agattatcgg cgtacctggg atgtgtccac ccgaactcat tcgacattgt 1080 atttggttcc cctccattaa agactatgtt ggtctgaaga ttcccgcgtc gcaggatttg 1140 gttgagtgga agacagtgcg gattttaatg tcaacattac gtgagactga agagattgcc 1200 atgaggtgtt ataatgataa gaaagcgtgg atggaacaat ttaaggttat cttaggtgtt 1260 ctatctgcga aatcatctac cattgttatc aatggtatgt ctatgcaatc tggcgagcga 1320 atagatatta atgattatca ttacatcggt ttcgccattc ttcttcacac aaaaatgaag 1380 tatgaacaac ttggaaaaat gtatgacatg tggaatgctt cgagcatctc gaagtggttc 1440 gcagcgttga ctcgtccgct gcgtgtgttt ttctccagtg ttgttcacgc gctgttcccg 1500 actttgagac cccgcgagga aaaagaattc ctgattaagc tctccacctt cgtaactttc 1560 aatgaagagt gctcatttga cggtggagag gaatgggacg tgatatcatc tgctgcatac 1620 gttgctacgc aggctgttac tgatgggaaa attttggctg cgcagaaggc cgagaagctt 1680 gctgagaagc ttgcacaacc cgtaagtgag gtatcagaca gcccagaggc gtcatctcaa 1740 acgcctgatg atactgctga agtttgtgga aaggagcgag aggtttcgga actcgactcc 1800 ttgtcagctc aaacacgttc ccccatcact agagttgctg aaagggctac tgctatgtta 1860 gagtatgccg cttatgagaa acagttacac gacaccacgg tgtctaactt aaaacgtatt 1920 tggaacatgg cgggtggtga tgacaaaaga aactccctcg aaggtaattt gaagttcgtt 1980 ttcgatacgt attttaccgt tgatcctatg gtgaacattc atttctccac gggtcggtgg 2040 atgcgtcctg tgcccgaggg tgttgtctat tctgttggtt ataatgaacg cggtttaggt 2100 ccgaagtctg atggagagct atacattgtc aatagtgaat gcgtgatttg caacagtgag 2160 tctttattta ctgtcacgcg ttctcttcaa gctccaaccg gaaccattag tcaggtcgac 2220 ggagttgctg gttgtgggaa aaccacggca attaaatcca tttttgaacc gtccactgac 2280 atgatcgtta ccgcgaacaa gaagtccgcc caagatgtac gtatggcact tttcaaatcg 2340 tcggattcca aagaagcttg cactttcgtc cgaacagccg attctgtcct acttaatgaa 2400 tgtccgactg ttagtagagt tttggttgat gaggttgtgt tgttgcactt tggtcaattg 2460 tgtgccgtca tgtccaagtt gaaggccgtg cgagctatat gttttgggga ttcggagcag 2520 attgcttttt cctcgcgaga tgcctcattt gacatgcgtt tctctaagat cattcctgat 2580 gaaactagtg atgcggacac cacattccgt agcccacaag atgtcgtgcc gcttgtgcgt 2640 ttaatggcta cgaagaccct tccgaaagga acccattcaa aatacacgaa atgggtttct 2700 caatctaaag tgaaaagatc tgttacgtct cgcgccatcg ttagtgtgac actggttgac 2760 ctggatccct ccaggtttta tataacgatg acccaagctg ataaggcttc actgatttca 2820 agggcgaaag agatgaattt accaaagacc ttttggaacg aaaggattaa aaccgtgcat 2880 gagtctcaag gtatctctga agatcacgtt actttggtaa gattaaagag cacaaagtgt 2940 gacttgttta aacagttttc ttattgtctc gtcgcattga ctagacacaa agtcacattc 3000 cgctacgagt actgcggtgt attaaacggt gatttaatcg ccgaatgtgt tgctcgtgct 3060 tagcggcgtc tctccttcgg gcgggacctg agttggcggt aatctgcaaa ccgtctgaag 3120 tcactaaaca catcgtgtgg tgaacgggtt gtccatccag ctaacggcta aaatggtcag 3180 tcgtggagaa atccacgcca gtaaacttac aagtttctga ggcacctttg aaaccatctc 3240 ctaggtttct tcggaaggac ttcggtccgt gtacttctag cacaatgtgc tagtttcagg 3300 gtacgggtg 3309 // ID MG025948; SV 1; linear; genomic RNA; STD; VRL; 3053 BP. XX AC MG025948; XX DT 20-NOV-2017 (Rel. 134, Created) DT 20-NOV-2017 (Rel. 134, Last updated, Version 1) XX DE Cucumber mosaic virus isolate Rs segment RNA 2, complete sequence. XX KW . XX OS Cucumber mosaic virus (cucumber mosaic cucumovirus) OC Viruses; Riboviria; Bromoviridae; Cucumovirus. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-3053 RX DOI; .1073/pnas.1714916114. RX PUBMED; 29087346. RA Andika I.B., Wei S., Cao C., Salaipeth L., Kondo H., Sun L.; RT "Phytopathogenic fungus hosts a plant virus: A naturally occurring RT cross-kingdom viral infection"; RL Proc. Natl. Acad. Sci. U.S.A. 114(46):12267-12272(2017). XX RN [2] RP 1-3053 RA Andika I.B., Sun L.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL College of Plant Protection, Northwest A&F University, Taicheng Road 3#, RL Xiaan, Shaanxi 712100, China XX DR MD5; c9cf6ca249d18b4d4678aed9620cda0a. DR EuropePMC; PMC5699089; 29087346. XX CC ##Assembly-Data-START## CC Assembly Method :: Trinity v. 2.4.0 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..3053 FT /organism="Cucumber mosaic virus" FT /segment="RNA 2" FT /host="Rhizoctonia solani" FT /isolate="Rs" FT /mol_type="genomic RNA" FT /country="China" FT /collection_date="Nov-2016" FT /db_xref="taxon:12305" FT gene 73..2646 FT /gene="2a" FT CDS 73..2646 FT /codon_start=1 FT /gene="2a" FT /product="RNA-dependent RNA polymerase" FT /db_xref="GOA:A0A2D3HXD2" FT /db_xref="InterPro:IPR001788" FT /db_xref="InterPro:IPR007094" FT /db_xref="UniProtKB/TrEMBL:A0A2D3HXD2" FT /protein_id="ATU79660.1" FT /translation="MAFSAPAFSLANLLNGSYGVDTPEDVERLRFEQREEAAAACRNYR FT PLPAVDVSESVTEDAHSLQTPDGAPAEAVSDEFVTYGAEDYLEKSDTELLVAFETMVKP FT MRIGQLWCPAFNKCSFISSIAMARALLLAPRTSHRTMKCFEDLVAAIYTKSDFYYSEEC FT EADDVQMDISSRDVPGYSFEPWSRTSGFEPPPICEACDMIMYQCPCFDFNALKKSCAER FT TFADDYVIEGLDGVVDNATLLSNLGPFLVPVKCQYEKCPTPTIAIPPNLNRATDRVDIN FT LVQSICDSTLPTHSNYDDSFHQVFVESADYSIDLDHVRLRQSDLIAKIPDSGHMIPVLN FT TGSGHKRVGTTKEVLTAIKKRNADVPELGDSVNLSRLSKAVAERFFISYINGNSLASSN FT FVNVVSNFHDYMEKWKSSGLSYDDLPDLHAENLQFYDHMIKSDVKPVVSDTLNIDRPVP FT ATITYHKKSITSQFSPLFTALFERFQRCLRERIILPVGKISSLEMAGFDVKNKYCLEID FT LSKFDKSQGEFHLLIQEHILNGLGCPAPITKWWCDFHRFSYIRDRRAGVGMPISFQRRT FT GDAFTYFGNTIVTMAEFAWCYDTDQFEKLLFSGDDSLGFSQLPPVGDPSKFTTLYNMEA FT KVMEPSVPYICSKFLLSDEFGNTFSVPDPLREVQRLGTKKIPYSDNDEFLFAHFMSFVD FT RLKFLDRMSQSCIDQLSIFFELKYKKSGEEAALMLGAFKKYTANFQSYKELYYSDRRQC FT ELINSFCSTEFRVERVNSNKQRKKHGIERRRDDKRRTPTGSYGGGEEAETKVSQAESTG FT TRSQKSQRESAFKSQTVPLPTVLSSGWSGTDRVIPPCERGGVTRA" FT gene 2405..2737 FT /gene="2b" FT CDS 2405..2737 FT /codon_start=1 FT /gene="2b" FT /product="2b protein" FT /db_xref="InterPro:IPR004946" FT /db_xref="UniProtKB/TrEMBL:A0A2D3HXC2" FT /protein_id="ATU79661.1" FT /translation="MELNVGAMTNVELQLARMVEVKKQRRRSHKQNRRERGHKSPSERA FT RSNLRLFRFLPFYQVDGPELTGSYRHVNVAELPEPEASRLELSAEDHDFDDTDWFAGNE FT WAEGVF" XX SQ Sequence 3053 BP; 777 A; 683 C; 705 G; 888 T; 0 other; agcgtacggt tcaatccctg cctcccctgt aaaactccct agactttcaa acttctttct 60 agtatctttt ctatggcttt ttccgccccc gcattctcac tagccaatct tttgaatggt 120 agttacggtg tcgacactcc cgaggatgtg gaacgcttgc gatttgagca acgcgaagag 180 gctgccgcgg cctgccgtaa ttacaggccc ttacccgctg tggatgtcag cgagagtgtc 240 acagaggacg cgcattccct ccaaactcct gacggagctc ccgctgaagc ggtgtctgat 300 gagtttgtaa cttatggtgc tgaagattac cttgaaaaat ctgatactga gctccttgtc 360 gcttttgaga cgatggtcaa acccatgcgt atcggacaat tatggtgccc tgcgtttaat 420 aaatgttctt ttatttccag cattgctatg gccagagctt tgctgttggc acctagaaca 480 tcccaccgaa ccatgaagtg ttttgaagac ctggtcgcgg ctatttacac taaatctgat 540 ttctattaca gtgaagagtg tgaagccgac gacgttcaga tggatatctc gtctcgcgat 600 gtacccggtt attctttcga accgtggtcc cgaacgtctg gattcgaacc gccgcccatt 660 tgtgaagcgt gcgacatgat catgtaccag tgcccgtgtt ttgatttcaa tgctttaaag 720 aaatcgtgcg ctgagaggac tttcgctgat gattatgtta ttgaaggttt agatggtgtt 780 gttgataatg cgactttgtt gtcgaacttg ggtccatttt tggtacccgt gaagtgtcaa 840 tatgaaaaat gtccaacacc gaccatcgcg attcctccga acttaaatcg tgctactgat 900 cgtgttgata tcaatttggt tcaatccatt tgtgactcga ctctgcccac tcatagtaac 960 tatgacgact cttttcatca agtgttcgtc gaaagtgcag attactccat agatctggat 1020 catgttagac ttcgacagtc tgatcttatt gcaaaaattc cagactcagg gcatatgata 1080 ccggttctga acaccgggag cggtcacaag agagtaggta caacgaagga ggttcttaca 1140 gcaattaaga aacgtaatgc tgacgttcca gagctaggtg attccgttaa tctgtccaga 1200 ttgagtaaag ctgtggctga gagattcttt atttcataca tcaatggtaa ctctctagca 1260 tccagcaact ttgtcaatgt cgttagtaat ttccacgatt acatggagaa gtggaaatcc 1320 tcaggtcttt cttatgatga tcttccagat cttcatgctg aaaatctgca gttttatgat 1380 cacatgataa aatctgatgt gaaacctgtg gtgagcgaca cgcttaatat cgacagaccg 1440 gttccagcta ctataacgta tcataagaag agtataacct cccagttctc accgttattc 1500 acagcgctat tcgagcgctt ccagagatgc cttcgagaac gtattattct tcctgttggt 1560 aagatttcat ctcttgagat ggcaggattt gatgtcaaaa acaagtactg cctcgagatt 1620 gatttgtcta agtttgataa gtctcaaggt gaatttcact tactaattca ggaacatatt 1680 ttgaatggtc taggatgtcc agctccgata accaaatggt ggtgcgattt ccaccgattc 1740 tcttacatca gagaccgtag agctggcgtt ggtatgccta ttagtttcca gagacgaact 1800 ggtgatgcat tcacttattt tggcaatacc attgtcacca tggctgagtt tgcctggtgt 1860 tatgacaccg accaattcga aaagctttta ttctcaggcg atgactctct aggattttca 1920 cagcttcccc ctgttggtga tccgagtaaa ttcacgactc tttacaacat ggaagctaag 1980 gtgatggaac cgtcagtacc atatatttgt tcgaagttct tactctctga cgagttcggt 2040 aacacatttt ccgttccaga tccattgcgc gaggttcagc ggttaggaac aaagaaaatt 2100 ccctattccg acaatgatga attcttgttt gctcacttca tgagctttgt tgatcgattg 2160 aagtttttgg accgaatgtc tcagtcgtgt atcgatcaac tttcgatttt ctttgaattg 2220 aaatacaaga agtctgggga agaggctgct ttaatgttag gcgcctttaa gaaatacacc 2280 gctaatttcc agtcctacaa agaactctat tattcagatc gtcgtcagtg cgaattgatc 2340 aattcgtttt gtagtacaga gttcagggtt gagcgtgtaa attccaataa acagcgaaag 2400 aaacatggaa ttgaacgtag gcgcgatgac aaacgtcgaa ctccaactgg ctcgtatggt 2460 ggaggtgaag aagcagagac gaaggtctca caagcagaat cgacgggaac gaggtcacaa 2520 aagtcccagc gagagagcgc gttcaaatct cagactgttc cgcttcctac cgttctatca 2580 agtggatggt ccggaactga cagggtcata ccgccatgtg aacgtggcgg agttacccga 2640 gcctgaggcc tctcgtttag agttatcggc ggaagaccat gattttgacg ataccgattg 2700 gttcgccggt aacgaatggg cggaaggtgt tttctgaacc actccttcct ctccctccgg 2760 tttctgtggc gggagctgag ttggcagtat tgctataaac tgtctgaagt cactaaacac 2820 attgtggtga acgggttgtc catccagctt acggctaaaa tggtcagtcg tagagaaatc 2880 tacgccagca aacttacaag tttctgaggc acctttgaaa ccatctccta ggtttcttcg 2940 gaaggacttc ggtccgtgta cttctagcac aatgtgctag tttcagggta cgggtgcccc 3000 cccactttcg tgggggcctc caaaaggaga ccagcttacg gctaaaatgg tca 3053 // ID MG025949; SV 1; linear; genomic RNA; STD; VRL; 2214 BP. XX AC MG025949; XX DT 20-NOV-2017 (Rel. 134, Created) DT 20-NOV-2017 (Rel. 134, Last updated, Version 1) XX DE Cucumber mosaic virus isolate Rs segment RNA 3, complete sequence. XX KW . XX OS Cucumber mosaic virus (cucumber mosaic cucumovirus) OC Viruses; Riboviria; Bromoviridae; Cucumovirus. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-2214 RX DOI; .1073/pnas.1714916114. RX PUBMED; 29087346. RA Andika I.B., Wei S., Cao C., Salaipeth L., Kondo H., Sun L.; RT "Phytopathogenic fungus hosts a plant virus: A naturally occurring RT cross-kingdom viral infection"; RL Proc. Natl. Acad. Sci. U.S.A. 114(46):12267-12272(2017). XX RN [2] RP 1-2214 RA Andika I.B., Sun L.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL College of Plant Protection, Northwest A&F University, Taicheng Road 3#, RL Xiaan, Shaanxi 712100, China XX DR MD5; c7eb2fb09b59cae5cffdd76a5eca2b6e. DR EuropePMC; PMC5699089; 29087346. XX CC ##Assembly-Data-START## CC Assembly Method :: Trinity v. 2.4.0 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..2214 FT /organism="Cucumber mosaic virus" FT /segment="RNA 3" FT /host="Rhizoctonia solani" FT /isolate="Rs" FT /mol_type="genomic RNA" FT /country="China" FT /collection_date="Nov-2016" FT /db_xref="taxon:12305" FT gene 98..937 FT /gene="3a" FT CDS 98..937 FT /codon_start=1 FT /gene="3a" FT /product="movement protein" FT /db_xref="InterPro:IPR000603" FT /db_xref="UniProtKB/TrEMBL:Q787R2" FT /protein_id="ATU79662.1" FT /translation="MAFQGTSRTLTQQSSAATSDDLQKILFSPEAIKKMATECDLGRHH FT WMRADNAISVRPLVPEVTHGRIASFFKSGYDVGELCSKGYMSVPQVLCAVTRTVSTDAE FT GSLRIYLADLGDKELSPIDGQCVSLHNHDLPALVSFQPTYDCPMETVGNRKRCFAVVIE FT RHGYIGYTGTTASVCSNWQARFSSKNNNYTHIAAGKTLVLPFNRLAEQTKPSAVARLLK FT SQLNNIESSQYLLTNAKINQNARSESEELNVESPPAAIGSSSASRSEAFRPQVVNGL" FT gene 1237..1893 FT /gene="CP" FT CDS 1237..1893 FT /codon_start=1 FT /gene="CP" FT /product="capsid protein" FT /db_xref="GOA:A0A2D3HXC7" FT /db_xref="InterPro:IPR000247" FT /db_xref="InterPro:IPR023800" FT /db_xref="InterPro:IPR037137" FT /db_xref="UniProtKB/TrEMBL:A0A2D3HXC7" FT /protein_id="ATU79663.1" FT /translation="MDKSESTSAGRNRRRRPRRGSRSASSSSDANFRVLSQQLSRLNKT FT LAAGRPTINHPTFVGSERCKPGYTFTSITLKPPKIDRGSYYGKRLLLPDSVMEYDKKLV FT SRIQIRVNPLPKFDSTVWVTVRKVPASSDLSVAAISAMFADGASPVLVYQYAASGVQAN FT NKLLYDLSAMRADIGDMRKYAVLVYSKDDALETDELVLHVDVEHQRIPTSGVLPV" XX SQ Sequence 2214 BP; 521 A; 518 C; 527 G; 648 T; 0 other; gaaatcttac cactgtgtgt gtgtgtgtgt gtgtcgagtc gtgttgtccg cacatatatt 60 ttgttttctt tgtacagtgt gttagatttc ccgaggcatg gctttccaag gtaccagtag 120 gactttaact caacagtcct cagcggctac gtctgacgat cttcaaaaga tattatttag 180 ccctgaagcc attaagaaaa tggctactga gtgtgaccta ggccggcatc attggatgcg 240 cgctgataat gctatttcag tccggcccct cgttcccgaa gtaacccacg gtcgtattgc 300 ttccttcttt aagtctggat atgatgttgg tgaattgtgc tcaaaaggat acatgagcgt 360 ccctcaagta ttatgtgctg ttactcgaac agtttccact gatgctgaag ggtctttgag 420 aatttactta gctgatctag gcgacaagga gttatctccc atagacgggc aatgcgtttc 480 gttacataac catgatcttc ccgctttggt gtctttccaa ccgacgtatg attgtcccat 540 ggagacagtt gggaatcgta agcggtgttt tgctgttgtt atcgaaagac atggttacat 600 tgggtatacc ggtaccacag ctagcgtgtg tagtaattgg caagcaaggt tttcatctaa 660 gaataacaac tacactcata tcgcagctgg gaagactcta gtactgcctt tcaacagatt 720 agctgagcaa acaaaaccgt cagctgtcgc tcgcctgttg aagtcgcaat tgaacaacat 780 tgaatcttcg caatatttgt taacgaacgc gaagatcaat cagaatgcgc gcagtgagtc 840 cgaggaatta aatgttgaga gccctcccgc cgcaatcggg agttcttccg cgtcccgctc 900 cgaagccttc agaccgcagg tggttaacgg tctttagcac tttggtgcgt attagtatat 960 aagtatttgt gagtctgtac ataatactat atctatagtg tcctgtgtga gttgatacag 1020 tagacatctg tgacgcgatg ccgtgttgag aagggaacac atctggtttt agtaagccta 1080 catcatagtt ttgaggttca attcctctta ctccctgttg agccccttac tttctcatgg 1140 atgcttctcc gcgagattgc gttattgtct actgactata tagagagtgt gtgtgtgctg 1200 tgttttctct tttgtgtcgt agaattgagt cgagtcatgg acaaatctga atcaaccagt 1260 gctggtcgta accgtcgacg tcgtccgcgt cgtggttccc gctccgcttc ctcctcttcg 1320 gatgctaact ttagagtctt gtcgcagcag ctttcgcgac ttaataagac gttagcagct 1380 ggtcgtccaa ctattaacca cccaaccttt gtagggagtg aacgctgtaa acctgggtac 1440 acgttcacat ctattaccct aaagccacca aaaatagacc gtgggtctta ttatggtaaa 1500 aggttgttat tacctgattc agtcatggaa tatgataaga agcttgtttc gcgcattcaa 1560 attcgagtta atcccttgcc gaaattcgat tctaccgtgt gggtgacagt ccgtaaagtt 1620 cctgcctcct cggacttatc cgttgccgcc atctctgcta tgtttgcgga cggagcctca 1680 ccggtactgg tttatcagta tgccgcatct ggagtccaag ctaacaacaa attgttgtat 1740 gatctttcgg cgatgcgcgc tgatataggc gacatgagaa agtacgccgt cctcgtgtat 1800 tcaaaagacg atgcgctcga gacggacgag ctagtacttc atgttgacgt cgagcaccaa 1860 cgcattccca catctggagt gctcccagtt tgattccgtg ttcccagaat cctccctccg 1920 atctttgtgg cgggagctga gttggcagtt ctgctataaa ctgcctgaag tcactaaacg 1980 ttttacggtg aacgggttgt ccatccagct tacggctaaa atggtcagtc gtggagaaat 2040 ccacgccagc agatttacaa atctctgagg cgcctttgaa accatctcct aggtttcttc 2100 ggaaggactt cggtccgtgt acctctagca caacgtgcta gtttcagggt acgggtgccc 2160 ccccactttc gtgggggcct ccaaaaggag accagcttac ggctaaaatg gtca 2214 // ID MG025952; SV 1; linear; genomic DNA; STD; VRL; 5126 BP. XX AC MG025952; XX DT 22-FEB-2018 (Rel. 135, Created) DT 22-FEB-2018 (Rel. 135, Last updated, Version 1) XX DE Canine bocavirus 2, complete genome. XX KW . XX OS Canine bocavirus 2 OC Viruses; Parvoviridae; Parvovirinae; Bocaparvovirus; OC unclassified Bocaparvovirus. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-5126 RX DOI; .1177/0300985818755253. RX PUBMED; 29421972. RA Piewbang C., Jo W.K., Puff C., Ludlow M., van der Vries E., Banlunara W., RA Rungsipipat A., Kruppa J., Jung K., Techangamsuwan S., Baumgartner W., RA Osterhaus A.D.M.E.; RT "Canine Bocavirus Type 2 Infection Associated With Intestinal Lesions"; RL Vet. Pathol. 55(3):434-441(2018). XX RN [2] RP 1-5126 RA Piewbang C., Jo W.K., Puff C., Ludlow M., van der Vries E., Banlunara W., RA Rungsipipat A., Kruppa J., Jung K., Techangamsuwan S., Baumgartner W., RA Osterhaus A.D.M.E.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Veterinary Pathology, Chulalongkorn University, 39 Henrydunant, Pathumwan, RL Bangkok 10330, Thailand XX DR MD5; d0242e3f73559e38dbd299f8e0cb73c9. XX CC ##Assembly-Data-START## CC Assembly Method :: BioEdit Sequence Alignment Editor v. 7.2.5 CC Sequencing Technology :: Illumina; Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..5126 FT /organism="Canine bocavirus 2" FT /host="Canis lupus familiaris" FT /strain="TH-2016" FT /mol_type="genomic DNA" FT /country="Thailand" FT /isolation_source="lung" FT /collection_date="2016" FT /db_xref="taxon:2093270" FT CDS 226..2607 FT /codon_start=1 FT /product="nonstructural protein 1" FT /note="NS1" FT /db_xref="GOA:A0A2L1IPZ5" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR027417" FT /db_xref="UniProtKB/TrEMBL:A0A2L1IPZ5" FT /protein_id="AVD96862.1" FT /translation="MALALAGVDDIIHFARPAYTYVLKFPYAEWRRDEARLQSALGYPH FT HDLLKDSTPFLTMPGQDSPAEQASFLESKGPQYGYALLLARTAHAAAYSIFSQKQGKYP FT PAASIYVQCELGIKYLHVHVVMGGDGLNRYNAKATCSNLAYKWLDNIQSQLEINVKTGH FT NTDLDMCNSLIGCVYQAKRECFDMRTEICTILQYKCRNGEMYACRVDPIEFICNYLLCK FT NLKFFSMVDPDRATPFVSHFACSGKTYAATYVNGKWVLPQVRKQWLNYLRDSVCQKADP FT VFSGDMFENLPKVPRATWSADVSSNKSKITKKETLMIDCIDRCEKNHLLTYEDLVNECS FT DLVIMLGSQTGGTKLIETLLQMVHIKICQKYTALSYVLSRYLSIELLPENKATQLLIFQ FT GYNPWQVGHWLCCVLHKTAGKQNTVCFFGPASTGKTNFAKAIVNAVKLYGCVNHQNKNF FT VFNDCASKLVNWWEECLMHNDWVEQAKCLLGGTEFRIDRKHKDSQLLPQTPVVISTNHD FT VYTVVGGNTTTMVHAKPLRERIVQFNFMKQLSSTFGEIDPMDAVALLQACSSRFDASLD FT SFYAQWQLQCTPNDFPLASFCDGHSQDFVLHEVGFCDTCGGYAPLETTDRSQPLPARPA FT SSGKSLLSACMLPYMFHFPCAVLDSVLLLVSGVKRRLDFDPDPAPSTSTAPPAKRHSKV FT RRPVFHDDWCSQPVDRLDRIRYEKFVESVVGASDESPSEPESESTGLTPSEWGEMLGVV FT CKSLEEEPIVLHCFEDIASLSETEDDSDGGLQSTPRQNKD" FT CDS 2173..2607 FT /codon_start=1 FT /product="ORF4" FT /note="hypothetical protein" FT /db_xref="UniProtKB/TrEMBL:A0A2L1IQ03" FT /protein_id="AVD96863.1" FT /translation="MFHFPCAVLDSVLLLVSGVKRRLDFDPDPAPSTSTAPPAKRHSKV FT RRPVFHDDWCSQPVDRLDRIRYEKFVESVVGASDESPSEPESESTGLTPSEWGEMLGVV FT CKSLEEEPIVLHCFEDIASLSETEDDSDGGLQSTPRQNKD" FT CDS 2372..2959 FT /codon_start=1 FT /product="nucleoprotein 1" FT /note="NP1" FT /db_xref="GOA:A0A2L1IQ14" FT /db_xref="InterPro:IPR021075" FT /db_xref="UniProtKB/TrEMBL:A0A2L1IQ14" FT /protein_id="AVD96859.1" FT /translation="MKSSSRASSARQTSHHRSRSRSPRDSRLQSGERCSESSASRWRKS FT RSSYTASKTSPLSRRPKTTPMEVFNQHRAKTKTDISMCGFYWHSTRLARSGTDWIFNSG FT KPLFQSKCSNNLVSWDVVREILFEFKKTIDQKYRNMLWHFGRGGYCNKCEYWDNVYLEH FT LANVDSSNDVVMQEISDAEMLEAAMEIDGASE" FT CDS 2943..5060 FT /codon_start=1 FT /product="viral capsid protein 1" FT /note="VP1" FT /db_xref="GOA:A0A2L1IQ17" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR013607" FT /db_xref="InterPro:IPR016184" FT /db_xref="InterPro:IPR036952" FT /db_xref="UniProtKB/TrEMBL:A0A2L1IQ17" FT /protein_id="AVD96860.1" FT /translation="MAPANRKPGGWVVPGYKYLGPFNPADNGEPVNSADEAARSHDLAY FT QSYLDAGVNPYFSYNKADSDFIESLAHDSSFGGWLGRSAFGLKKLLAPHLADTKGNPDA FT PSTSRAGSSVSKSDRAQKRKLYFARSNKQAKQQKMSAPEAPTEDVAEPGPSGSDPRAGG FT NGGGGGMGGGGGHGVGVSTGGWKAGTVFGNDFVITTNTRQWYAPIFNGHEYKRLHPDVD FT RNWVGISTPWGYFNFNEYSSHFSPQDWQRLTNEYKRWRPKAMRVKVYNLQIKQVVNLGS FT DTLYNNDLTAGVHIFCDGSHQFPYSQHPWDTGTMPELPHRIWRISQYGYFQLQSDLTDG FT GNSSSSPDVQNQEKQLLKSAPLFMLETASHQVLRTGEESSFSFSFDCGWAINDKAYAIP FT QADFNPLIPTRRYFPTRNSTTGAGNLMFYHRYNPYNKPSNWMPGPSLGYLGSTQTSQNP FT HKARGPITVVTQPPGTTAQGANRDEQSTTHVPSETTMQFSGYDVNPVNCASSRLDAHSL FT AYDSGPESANQNIITVRGIDLDMARWSSVMVQDGTNNELGTSTPRTHFTELKNVWMYPN FT QAWDTTPISRDTPIWVKIPKTDRHTMHDTSDGTLPMAHPPGTIFVRVAKVPIPGESDSY FT LNLYVTGQITCEILWETERFQTKNWRPEIKNDPSTFSDPLLYTFNDTGVYNTPETFIEG FT MPTKRGINRVL" FT CDS 3357..5060 FT /codon_start=1 FT /product="viral capsid protein 2" FT /note="VP2" FT /db_xref="GOA:A0A2L1IPZ7" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR016184" FT /db_xref="InterPro:IPR036952" FT /db_xref="UniProtKB/TrEMBL:A0A2L1IPZ7" FT /protein_id="AVD96861.1" FT /translation="MSAPEAPTEDVAEPGPSGSDPRAGGNGGGGGMGGGGGHGVGVSTG FT GWKAGTVFGNDFVITTNTRQWYAPIFNGHEYKRLHPDVDRNWVGISTPWGYFNFNEYSS FT HFSPQDWQRLTNEYKRWRPKAMRVKVYNLQIKQVVNLGSDTLYNNDLTAGVHIFCDGSH FT QFPYSQHPWDTGTMPELPHRIWRISQYGYFQLQSDLTDGGNSSSSPDVQNQEKQLLKSA FT PLFMLETASHQVLRTGEESSFSFSFDCGWAINDKAYAIPQADFNPLIPTRRYFPTRNST FT TGAGNLMFYHRYNPYNKPSNWMPGPSLGYLGSTQTSQNPHKARGPITVVTQPPGTTAQG FT ANRDEQSTTHVPSETTMQFSGYDVNPVNCASSRLDAHSLAYDSGPESANQNIITVRGID FT LDMARWSSVMVQDGTNNELGTSTPRTHFTELKNVWMYPNQAWDTTPISRDTPIWVKIPK FT TDRHTMHDTSDGTLPMAHPPGTIFVRVAKVPIPGESDSYLNLYVTGQITCEILWETERF FT QTKNWRPEIKNDPSTFSDPLLYTFNDTGVYNTPETFIEGMPTKRGINRVL" XX SQ Sequence 5126 BP; 1303 A; 1262 C; 1284 G; 1277 T; 0 other; agattatatc aattgcgcta cctagtggcc tgttaatgta taacgtgctg tgtcagctgt 60 ttgtgattaa ttggtgtttt atggacttat catcataatg actaccaacc gatgaatata 120 gagagaaatt actgactata tatataagtg tgcttcctgc ttcgtgtcat tctgcttccg 180 gctttcgtcg cggtgacatc tcgctgctct atctgggggt gagtgatggc tcttgctctg 240 gccggggtcg acgatatcat tcactttgct cgtcctgcct atacctatgt tcttaaattt 300 ccttacgctg agtggcggcg ggatgaggct cgtcttcaga gcgctctggg gtacccgcat 360 catgatttac tcaaggactc gactccgttc cttactatgc cggggcaaga ttctccggcc 420 gagcaggctt cctttttgga gtctaagggt cctcagtacg gatatgcctt attattggca 480 cgcacggctc acgctgctgc atattctata ttctcgcaga agcagggtaa atatcctcct 540 gctgctagca tatatgttca gtgcgagctg ggcattaagt accttcacgt acacgtcgtg 600 atgggcggtg acggcttgaa ccggtacaat gccaaggcca cttgctcaaa cctggcctac 660 aagtggctgg acaatattca gtctcagctc gagattaacg tcaagaccgg tcacaataca 720 gaccttgaca tgtgcaattc tctcatcggc tgcgtctacc aggccaagag agagtgcttt 780 gacatgagga ccgagatttg taccatcctg cagtacaagt gccggaacgg cgagatgtac 840 gcctgccggg tcgatcctat agagtttatc tgtaactacc tattgtgcaa aaacttgaaa 900 ttcttttcta tggtcgaccc tgacagagca actccgtttg tctctcactt tgcctgttct 960 ggtaaaacgt acgcggctac gtacgtcaat gggaagtggg tcttgcctca ggttaggaag 1020 cagtggctaa attatcttcg agactctgtc tgtcagaagg ccgatcccgt cttttccggc 1080 gacatgtttg aaaatctacc taaggtacct cgcgcgacct ggtcggcaga cgtttcctct 1140 aataagtcta aaatcactaa aaaggaaact ctgatgattg actgtatcga tcgctgcgaa 1200 aagaatcact tgcttaccta tgaagatttg gtcaatgagt gttctgatct tgtaatcatg 1260 ctcggctcac agacgggcgg aactaaactg attgagacct tgcttcagat ggttcacatt 1320 aagatttgtc agaaatatac ggccttgtct tatgtcttgt cgcggtactt gtcgatcgag 1380 ctactgcctg aaaataaagc tacacagctc ttgatctttc agggatacaa tccctggcag 1440 gtcggccact ggctgtgctg cgtgctgcac aagacggccg gtaaacagaa tacagtgtgc 1500 tttttcggtc cggccagcac gggcaagacc aactttgcca aggctatagt gaatgccgtt 1560 aagctgtacg gatgtgtgaa ccatcagaat aagaattttg tgtttaacga ctgcgcgtcc 1620 aagctggtca attggtggga agagtgcctc atgcacaatg attgggtaga gcaggccaag 1680 tgtctgctgg gaggaacgga atttagaatc gaccgtaagc ataaagactc tcagctgctg 1740 ccgcagactc ctgttgtgat cagtaccaat cacgacgtgt acaccgtggt cggtgggaac 1800 accactacta tggttcacgc taagccgctt cgggagagga tcgttcagtt taatttcatg 1860 aaacaactgt cttccacatt tggggagatt gatcctatgg atgcggtggc tctgttgcaa 1920 gcctgctctt ctcgattcga tgcgtcgctc gactcgtttt acgctcagtg gcagcttcag 1980 tgcactccta acgattttcc tctcgcttcg ttctgtgacg gccattcgca ggatttcgtc 2040 cttcacgagg tgggcttctg cgacacgtgc ggtggctacg ctcctctgga gactacggac 2100 cgcagtcagc cgctgccggc tcgacctgct tcgtccggta agtctttact ttctgcctgt 2160 atgctgcctt acatgttcca ttttccgtgt gctgtacttg actctgtgct tttgttagtg 2220 tcaggtgtga agcgtcgcct ggactttgac ccggatcctg ctccttccac gtcgacggct 2280 cctccggcga agcgccactc caaggtgagg cgtcccgtgt tccacgacga ctggtgtagt 2340 cagccggtag atcgcctaga ccgcatccgc tatgaaaagt tcgtcgagag cgtcgtcggc 2400 gcgtcagacg agtcaccatc ggagccggag tcggagtcca cgggactcac gccttcagag 2460 tggggagaga tgctcggagt cgtctgcaag tcgctggagg aagagccgat cgtcttacac 2520 tgcttcgaag acatcgcctc tctctcggag accgaagacg actccgatgg aggtcttcaa 2580 tcaacaccgc gccaaaacaa agactgacat ttcaatgtgt ggcttttact ggcacagtac 2640 tcgcctcgcg cggtcgggta cagattggat ctttaacagt ggaaagcctc tgtttcaatc 2700 taaatgttct aataatcttg tatcttggga tgtggttcgt gagattttgt ttgaatttaa 2760 aaaaactata gatcagaaat atagaaatat gctgtggcac tttggtcggg gtgggtactg 2820 taataaatgt gagtactggg ataacgtgta ccttgaacac ttggctaatg tagactcctc 2880 taatgatgtt gttatgcagg agataagtga cgctgagatg ctggaggctg ccatggagat 2940 tgatggcgcc agcgaataga aagcccggtg gttgggtcgt gcctggctat aaatatttgg 3000 gtccctttaa ccctgctgac aacggggaac ctgtaaattc tgctgacgag gccgctcggt 3060 ctcatgatct cgcctatcag tcctatctcg atgctggtgt aaacccgtac tttagctaca 3120 ataaagctga ttctgatttt attgagtcct tggctcacga ctcttcattc ggcggctggc 3180 tggggcgctc ggcctttggc ctcaaaaaat tgcttgcgcc gcatctcgcg gatacaaagg 3240 gcaatcctga cgctccgtcc acctcgcggg cgggttcctc cgtatccaag tcagacagag 3300 ctcaaaagag aaaactctat tttgccagat caaacaaaca agccaaacaa caaaagatgt 3360 cagctccaga agctccgacc gaagatgtgg cagaaccggg tccatctggc tccgatccgc 3420 gggcaggagg aaatggaggt ggtggaggca tgggaggagg tggaggacat ggagtgggag 3480 tgagcaccgg agggtggaag gcggggaccg tatttggaaa tgactttgtc atcaccacca 3540 acaccagaca gtggtacgct cccatcttta acggccatga atacaaacgt ttacacccag 3600 acgttgatag aaactgggtg ggaatcagca ctccatgggg atactttaac tttaacgagt 3660 acagttcgca tttttcacca caagactggc agcgtctcac caacgagtat aaacggtgga 3720 gaccaaaagc catgagagtt aaagtataca accttcaaat aaaacaggta gtcaacctag 3780 ggtctgacac tttatacaat aatgacctga cggccggagt tcacatcttt tgtgacggga 3840 gccatcagtt tccgtactct cagcatccgt gggacacagg gaccatgccc gaactgcctc 3900 atcgcatctg gaggatatcg cagtacgggt actttcaact acaatctgac ctgacggacg 3960 gaggaaattc ttcgtctagt ccagacgtcc agaaccaaga aaaacagcta ctaaagagtg 4020 cgccgctttt tatgctggaa actgcatctc atcaagtatt gagaacgggg gaggaatcca 4080 gcttttcatt ctcgtttgac tgtgggtggg ctattaacga caaggcgtac gccattccac 4140 aggcagactt taaccctctg attccaacca gacgatactt tcctacacga aacagtacca 4200 cgggagcggg gaaccttatg ttttatcata gatataatcc atacaacaaa ccgagcaact 4260 ggatgccggg accgagctta ggttatctag ggtcaacaca aacatcacaa aatccacaca 4320 aagcacgtgg tccgatcact gtcgtcacgc agccgcccgg cacgacggca cagggcgcca 4380 atagggacga acaatcgact acacacgtcc cgtcagaaac gaccatgcaa ttttcagggt 4440 acgacgtgaa tcctgtcaac tgcgccagca gcaggctaga cgcgcactcg ctcgcgtacg 4500 attcggggcc agaaagtgcc aatcaaaaca taataacagt tagaggcata gacttagata 4560 tggccaggtg gtcgtctgtc atggtgcagg acggaacaaa taacgaactt ggaacatcta 4620 cacctagaac acactttaca gaacttaaaa acgtgtggat gtacccaaat caggcgtggg 4680 acaccactcc gatatccaga gacactccta tctgggtcaa aataccaaaa acagacaggc 4740 acaccatgca cgacacctcg gatgggacgc tgccaatggc acatccgccg ggaaccatat 4800 ttgtcagggt cgcaaaggtg cccattccgg gggagtcaga ctcttaccta aacctatacg 4860 tcacagggca aatcacttgt gagatactat gggaaacaga aaggttccaa accaaaaatt 4920 ggagaccgga aatcaaaaac gatccttcca cgtttagcga ccctctactg tatacattta 4980 acgacaccgg ggtctacaat actccagaaa cattcattga gggcatgccc actaaacggg 5040 gaataaacag ggtactgtaa ctttaagaaa cataaagcca taaaacggaa acttttgcgc 5100 atttgttatt tctttaaaag gaccat 5126 // ID MG026486; SV 1; linear; genomic RNA; STD; VRL; 7108 BP. XX AC MG026486; XX DT 01-SEP-2018 (Rel. 137, Created) DT 23-SEP-2018 (Rel. 138, Last updated, Version 2) XX DE Human parechovirus 1 isolate ETH_P5_2016 polyprotein gene, complete cds. XX KW . XX OS Human parechovirus 1 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Parechovirus. XX RN [1] RC Publication Status: Online-Only RP 1-7108 RX DOI; .1371/journal.pone.0202054. RX PUBMED; 30114205. RA Altan E., Aiemjoy K., Phan T.G., Deng X., Aragie S., Tadesse Z., RA Callahan K.E., Keenan J., Delwart E.; RT "Enteric virome of Ethiopian children participating in a clean water RT intervention trial"; RL PLoS One 13(8):e0202054-e0202054(2018). XX RN [2] RP 1-7108 RA Altan E., Delwart E.; RT ; RL Submitted (28-SEP-2017) to the INSDC. RL Molecular Virology, Blood Systems Research Institute, 270 Masonic Ave, San RL Francisco, CA 94118, USA XX DR MD5; a38c253aa0658046822d871301a950d2. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R10 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..7108 FT /organism="Human parechovirus 1" FT /host="Homo sapiens" FT /isolate="ETH_P5_2016" FT /mol_type="genomic RNA" FT /country="Ethiopia" FT /isolation_source="feces" FT /collection_date="Apr-2016" FT /db_xref="taxon:12063" FT CDS 546..7085 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:A0A346M129" FT /db_xref="InterPro:IPR000199" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR009407" FT /db_xref="InterPro:IPR009419" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:A0A346M129" FT /protein_id="AXQ03885.1" FT /translation="METIKSIADMATGVISSVDSSVNAVNEKVENVGNEIGGNLLTKVA FT DDASNVLGPNCYATTAEPENKDVVQATTTVNTTNLTQHPSAPTMPFTPDFSNVDTFHSM FT AYDITTGEKNPSKLVRLETHEWTPNWARGHQITHVELPKVFWDKNSKPAYGQSRYFAAV FT RCGFHFQVQVNVNQGTAGSALVVYEPKPVVTYDSKLEFGAFTNLPHVLMNLAETTQADL FT CIPYVADTNYVKTDSSDLGQLKVYVWTPLSIPSGSATQVDVTILGSLLQLDFQNPRVFG FT QDVGIYDNAPTRKQNLKKILTMSTKYKWTRQKIDIAEGPGSMNMANVLSTTAAQSIALV FT GERAFYDPRTAGSKSRFDDLVKISQLFSVMSDSTTPSANHGIDAKGYFKWSSTTAPQSV FT VHRNIVYLKLFPNLNVFVNSYSYFRGSIVLRLSVYASTFNRGRLRMGFFPNATEDSTSE FT IDNAIYTICDLGSDNSFEITIPYSFSTWMRKTDGHPIGLFQIEVLNRLTYNSSSPNEVY FT CIVQGKMGQDARFFCPTGSVVTFQNSWGSQMDLTDPLCLEESAEECKQTISPNELGLTS FT AQDDGPLGDNKPNYFLNFKSMNVDIFTVSHTKVDNLFGRAWFYQGHTFTDEGQWRVSLE FT FPKQGHGSLSLLFAYFTGELNIHVLFLAEKGFLRVAHTYDTSENRVNFLSSNGVITIPA FT GEQMTLSAPYYSNKPLRTVRDSNSLGYLMCKPFLTGTTTGKIEVYLSLRCPNFFFPLPA FT PKVTTGRALRGDLANFSDQSPYDRQPQNQVMKLAYLDRGFYKHYGIVVGDSIYQLDSDD FT IFKTALTGKARFTKTKLTPDWIIEEECELDYFRVKYLESSVNSEHIFSVDNNCETIAKD FT IFGTHTLSQHQAIGLVGTILLTAGLMSTIKTPVTATTIKEFFNHAVDGDEQGLSLLVQK FT CTTFFSSAATEILDNDLVKFIVKILVRILCYMVLYCHKPNILTTACLSTLLIMDVTSSS FT VLSPSCKALMQCLMDGDVKKLAEVVAESMSNTDDDEIKEQICDTVKYTKTILSNQGPFK FT GFNEVSTAFRHIDWWIHTLLKIKDMVLSVFKPSLESKAIQWLERNKEHVCGVLDYASDI FT IVESKDQSKVKTQDFYQRYSDCLAKFKPIMAICFRSCHNSISNTVYRLFQELARIPNRI FT STNNDLIRIEPIGIWIQGEPGQGKSFLTHTLSRQLQKSCKLNGVFTNPTASEFMDGYDN FT QDIHLIDDLGQTRKEKDIEMLCNCISSVPFIVPMAHLEEKGKFYTSKLVIATTNKSDFS FT STVLQDSGALKRRFPYIMHIRAAKAYSKAGKLNVSQAMATMSTGECWEVSKNGRDWETL FT KLKDLVDKITSDYNERVKNYNAWKQQLENQTLDDLDDAVSYIKHNFPDAIPYIDEYLNI FT EMSTLIEQMEAFIEPKPSVFKCFANRIGSKISKASREVVDWFSDKMKSMLSFVERNKAW FT LTVVSAVTSAISILLLVTKIFKKEDSKDERAYNPTLPVTKPKGAFPVSQREFKNEAPYD FT GQLEHIISQMAYITGSTTGHLTHCAGYQHDEIILHGHSIKYLEQEEDLTLHYKNKVFPI FT EQPSVTQVTLGGKPMDLAILKCKLPFRFKKNSKYYTNKIGTESMLIWMTEQGIITKEVQ FT RVHHSGGIKTREGTESTKTISYTVKSCKGMCGGLLISKVEGNFKILGMHIAGNGEMGVA FT IPFNFLKNDMSDQGIVTEVTPIQPMYINTKTQIHKSPVYGAVEVKMGPAVLSKSDPRLE FT EPVECLIKKSATKYRVNKFQVNNELWQGVKACVKSKFREIFGINGIVDMKTAILGTSHV FT SSXXXXXXXXXXXXXXXXXXXXXXXLEPFSVSPMLEKLVQDKFHNLLKGNQITTIFNTC FT LKDELRKLDKIAAGKTRCIEACEVDYCIVYRMIMMEIYDKIYQTPCYYSGLAVGINPYK FT DWHFMINALNDYNYEMDYTQYDGSLSSMLLWEAVEVLAYCHDSPDLVMQLHKPVIDSDH FT VVFNERWLIHGGMPSGSPCTTVLNSLCNLMMCIYTTNLISPGIDCLPIVYGDDVILSLD FT KEIDPEKLQSIMADSFGAEVTGSRKDEPPSLKPKMEVEFLKRKPGYFPESTFIVGKLDT FT ENMIQHLMWMKNFSTFKQQLQSYLMELCLHGKDTYLHYIKILEPYLKEWNITVDDYDVV FT IAKLMPMVFD" XX SQ Sequence 7108 BP; 2244 A; 1349 C; 1467 G; 1981 T; 67 other; ctggttgaag gcaacttgca ataagattag tgggaacaag acgcttaaag catggtgcaa 60 aataactttt ctaactcaca ttctatgtgg ggtggcagat ggcgtgccat aattctatca 120 gtgagatacc acgcttgtgg accttatgct cacacagcca tcctctagta agtttgtgag 180 acgtctggtg acgtgtggga acttattgga aacaacattt tgctgtaaag catccaattg 240 ccagcggaac aacacctggt aacaggtgcc tctggggcca aaagccaagg tttaacagac 300 ccttttggat tggttctaaa cctgagatgt tgtggaagat acttagtacc taccaatctg 360 gtagtagtgc aaacactagt tgtaaggccc acgaaggatg cccagaaggt acccgtaggt 420 aacaagtgac actatggatc tgatctgggg ctaggtgcct ctatcttggt gacctggtta 480 aaaaacgtct agtgggccaa acccaggggg gatccctggt ttccctttat tttatcaatg 540 ccactatgga gacaattaaa agtattgcag atatggcgac cggagtgatc agctcagttg 600 attcatctgt caatgcagtc aatgagaagg tggagaatgt gggcaatgaa atcggaggca 660 atctactaac caaagttgca gatgatgcat ctaatgtgct tggaccaaat tgttatgcta 720 caacagctga gccagagaat aaagatgtag tacaagcaac cacaactgtt aatacaacaa 780 atttaacaca acatccttct gcacctacaa tgcctttcac tcctgatttc tccaatgtgg 840 acacgttcca ctcaatggca tatgatatca ccactggaga gaaaaacccc agcaaattgg 900 ttagattgga aacacatgaa tggacaccaa actgggctag aggacatcaa attactcatg 960 tggaattacc aaaagtcttc tgggataaga acagtaagcc agcttacggt caatcaagat 1020 actttgcagc agtacggtgt ggtttccatt ttcaggtaca agtgaatgtt aatcaaggta 1080 cagctggtag tgcactggtg gtatatgaac ctaaacctgt tgtgacatat gactcaaagt 1140 tagaatttgg agcatttact aatcttccgc atgtgctaat gaacttggca gagaccacac 1200 aggctgattt atgtatcccc tatgtagctg acacaaacta tgttaaaaca gattcgtcag 1260 acttagggca actaaaagtc tatgtttgga cacccttgtc cataccttca ggttctgcta 1320 cacaagttga tgtgaccata ttgggtagcc tattgcaatt ggacttccaa aatcctaggg 1380 tatttggtca agacgttggt atttatgaca atgcaccaac acggaagcaa aatcttaaga 1440 aaatactcac catgagcact aaatacaagt ggactagaca aaagattgac atagctgagg 1500 gaccaggttc catgaatatg gcgaatgtat tgagtaccac tgcagcgcaa tcaattgcac 1560 tagttggaga aagagcattt tatgacccaa gaacagcagg aagcaagagt agatttgatg 1620 atttagtaaa aatatctcaa cttttttctg taatgagtga ctccacaacc ccttcagcca 1680 atcacggtat agatgcaaaa ggttatttca agtggtcatc tacaactgct ccacaatctg 1740 tggtacatag aaatattgtt tacttaaaat tgtttcccaa tttgaatgta tttgtcaaca 1800 gctattcata ttttagaggc tcaatagtgc tgaggttgag tgtctatgct agcactttca 1860 atagaggccg tttacggatg ggtttctttc caaatgccac tgaagacagc acttcagaaa 1920 tagataatgc catatacaca atttgtgatt tgggcagtga caatagtttt gaaatcacca 1980 tcccatactc attttccacc tggatgcgga aaacagatgg ccaccctatt ggactgtttc 2040 agatagaggt gcttaatagg ttaacttaca acagctccag tcctaatgaa gtttactgta 2100 ttgtgcaagg taaaatgggg caggatgcca ggttcttttg tccaactggt tctgtagtga 2160 ctttccaaaa ttcatggggt tcacaaatgg acttaactga tccactatgt ttagaagaat 2220 ctgcagagga atgcaaacag accatatcgc caaatgaact gggattaaca tcagctcagg 2280 acgatgggcc tttgggtgac aacaagccaa attatttcct aaacttcaag tctatgaatg 2340 tagacatttt cactgtttcc cataccaagg tagacaacct atttggaaga gcttggtttt 2400 accagggaca cactttcacc gatgaaggac agtggagagt tagtttagag ttcccaaaac 2460 aaggtcatgg ttcactttcc ctgctattcg cctattttac aggtgaacta aatatacatg 2520 ttttgttcct ggctgaaaag ggatttctta gagtggctca cacttatgac acatcagaaa 2580 acagagtaaa cttcttgtca tccaatggtg ttatcacaat cccagcagga gaacaaatga 2640 cattgtctgc accctactat tcaaataagc cccttagaac agttagagat agcaatagtc 2700 ttgggtacct aatgtgtaaa ccatttctta caggaacaac aacaggaaaa atagaggtct 2760 atcttagtct gaggtgtcca aatttctttt tccctctccc cgcacctaaa gttacaactg 2820 gtcgtgcctt acggggtgat ttggcaaact tctcagatca gagtccatat gatcgacaac 2880 cacagaatca agtgatgaaa ttagcctatt tggacagggg tttctacaag cactatggca 2940 ttgtggtggg ggacagtatt tatcaattgg actcagatga cattttcaag acagctttaa 3000 caggaaaagc taggttcact aagacaaagt taactccaga ttggattatt gaggaagaat 3060 gcgaattgga ttatttcagg gtgaaatacc ttgaatcctc tgtcaactca gagcatatct 3120 tctcagtgga caataactgt gaaactattg ccaaggacat ctttggcacc catacactta 3180 gtcaacacca ggctataggg ttagtaggta caatcctctt aaccgctggc ctgatgtcaa 3240 ctatcaaaac cccggtgact gctaccacaa ttaaagaatt tttcaatcat gcagttgatg 3300 gtgatgaaca aggtttgtct ttgcttgtgc aaaaatgtac cactttcttc tcttcagctg 3360 caacagagat cctagacaac gacttggtta aatttatagt caaaatattg gttagaatcc 3420 tatgttacat ggtcttgtat tgtcataaac caaatatcct gactacagct tgtttgtcca 3480 ctcttttgat catggacgtg acttcttcat cagtcttgtc accttcctgc aaagctttga 3540 tgcagtgctt gatggatggc gatgtgaaaa agcttgctga agtcgtagct gagtcaatgt 3600 ccaacacaga tgatgatgag attaaggagc aaatttgtga cacagtaaaa tacactaaga 3660 caatcctatc aaatcaggga ccattcaaag gttttaatga ggtttccact gcatttaggc 3720 atatagattg gtggatccac actttgctta aaattaagga tatggtgttg agtgtgttca 3780 aacccagtct agaaagtaaa gccatacaat ggttggaaag aaacaaggaa catgtgtgtg 3840 gggttctgga ttatgcttct gatatcattg ttgagtcaaa agatcagtca aaggtcaaaa 3900 cccaagattt ttatcaaaga tattcagatt gtctagctaa atttaagcca atcatggcca 3960 tttgcttcag gagttgtcat aatagtatta gcaacacagt atatagactt ttccaagaat 4020 tggctagaat tcccaatagg atcagcacta ataatgactt aatcagaatt gaacctattg 4080 gcatttggat ccagggcgaa ccggggcaag gcaaatcttt cctaactcat actttatcaa 4140 gacagttaca aaaatcatgt aaactcaatg gagttttcac caacccaact gccagtgagt 4200 tcatggatgg ttatgataac caggacatcc atctaataga tgacttgggc caaacaagga 4260 aggaaaaaga cattgaaatg ctatgcaact gtatttcatc tgttcccttt atagtaccaa 4320 tggcacacct tgaggaaaaa ggaaagttct acactagtaa gttagttatt gccaccacca 4380 acaaatcaga tttctctagt acagtccttc aagattctgg agcactgaag aggagattcc 4440 cctacattat gcacattcgg gcagcaaagg cctatagtaa agctggaaag ctcaatgtga 4500 gccaagctat ggctacaatg tcaactggag aatgttggga agtgtcaaag aatggtagag 4560 attgggaaac attaaaatta aaagatctgg ttgacaaaat cacttctgat tacaatgaga 4620 gggtcaaaaa ttacaatgct tggaaacagc aattagagaa tcaaaccctt gatgatttag 4680 atgatgcagt ttcatatatt aagcacaatt tcccggatgc cataccatac attgatgagt 4740 acctcaatat tgaaatgtca accttaattg agcaaatgga agcattcatt gagccaaaac 4800 ccagtgtgtt taagtgtttt gctaacagaa ttggatcaaa aatttctaaa gcttctagag 4860 aagttgtgga ttggttctca gataagatga agtccatgct cagctttgtt gaaagaaaca 4920 aagcgtggct tacagttgtt tctgcagtca ccagtgctat tagtatacta ctattagtga 4980 caaaaatctt caagaaagaa gattcaaagg atgaaagagc atacaaccca accctcccag 5040 ttactaagcc taagggagct ttcccagtct ctcaacggga gttcaagaat gaagcacctt 5100 atgatggaca actggagcac attatttctc agatggctta cattactggt tcaaccactg 5160 gccacttgac acattgtgca ggttaccaac atgatgaaat tatccttcat ggtcactcca 5220 ttaagtactt ggagcaggag gaagatttga cattacatta taaaaacaaa gtcttcccaa 5280 tagaacagcc ttctgtgact caagtcactt tgggtggtaa acctatggat ttagctatcc 5340 ttaaatgcaa attgccattt aggtttaaaa agaattccaa gtactatacc aataaaattg 5400 gaacagagag catgttgatt tggatgactg aacaaggtat aataacaaag gaagtccaga 5460 gagttcacca ctccggtggc attaaaacta gagaaggaac tgagagcaca aaaaccatta 5520 gttacacagt aaaatcttgc aaaggtatgt gtgggggttt gctcatttct aaagtagaag 5580 gaaatttcaa aatccttgga atgcacatag ctggtaatgg ggaaatgggt gttgccatcc 5640 catttaactt tcttaaaaat gatatgtctg accaaggcat tgtgacagaa gtgacaccca 5700 tacaacccat gtacatcaac actaagaccc aaattcacaa gagcccagtg tatggtgctg 5760 tagaagtcaa aatgggccct gcagttttga gtaagtcaga tccaaggctt gaagagccgg 5820 tagaatgctt gattaagaaa tcagctacaa aatatagggt taataaattc caggtcaaca 5880 atgaactgtg gcagggcgtt aaggcctgtg tcaagtccaa atttagagag atttttggga 5940 tcaatggtat tgttgacatg aaaacagcta ttttgggaac atcccatgta agctccannn 6000 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6060 nnnntcttga acctttctct gtctcaccta tgcttgagaa acttgtgcaa gacaagtttc 6120 ataacttact taagggcaac caaattacta caattttcaa cacatgtctt aaagatgagc 6180 ttaggaaatt agataaaatt gcagctggta agactagatg cattgaagca tgtgaagttg 6240 attactgtat tgtttacagg atgatcatga tggagattta tgacaaaatt taccagaccc 6300 cttgttatta ctctggtctt gcagttggaa ttaaccccta taaagattgg cacttcatga 6360 ttaatgcatt aaatgattac aattatgaaa tggactacac ccagtatgat ggttccctta 6420 gttcgatgtt attgtgggaa gcagtggagg tcttggctta ctgtcacgat tcacctgatc 6480 ttgtcatgca attacataaa ccagtaattg actcagacca tgtggtcttc aacgagagat 6540 ggttaataca tggcggcatg ccatcggggt caccatgcac tactgtgttg aattcattgt 6600 gcaatctaat gatgtgcatt tacactacca atttaatcag cccaggaatt gattgcttgc 6660 caattgttta tggagatgat gttatcctgt cacttgataa agaaatagac ccagagaaac 6720 tgcaaagtat catggcagat tcatttggtg ctgaagtgac tggttcgcgc aaggatgagc 6780 ctccttcatt aaaacccaaa atggaggtgg aatttctgaa gcgtaagccc ggttacttcc 6840 cagagtctac atttatagta ggaaaattgg acactgaaaa catgatacaa cacttaatgt 6900 ggatgaaaaa tttcagcaca ttcaagcagc aacttcaatc ctacttaatg gagttatgcc 6960 tccatggaaa agacacttat ttgcactaca tcaaaatttt ggaaccatac ttgaaggagt 7020 ggaatatcac agtggatgat tatgatgttg tcattgctaa gttgatgccc atggtgtttg 7080 attaaaatta atgttttggt ttttcttt 7108 // ID MG026487; SV 1; linear; genomic RNA; STD; VRL; 7054 BP. XX AC MG026487; XX DT 01-SEP-2018 (Rel. 137, Created) DT 23-SEP-2018 (Rel. 138, Last updated, Version 2) XX DE Human parechovirus 1 isolate ETH_P6_2016 polyprotein gene, complete cds. XX KW . XX OS Human parechovirus 1 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Parechovirus. XX RN [1] RC Publication Status: Online-Only RP 1-7054 RX DOI; .1371/journal.pone.0202054. RX PUBMED; 30114205. RA Altan E., Aiemjoy K., Phan T.G., Deng X., Aragie S., Tadesse Z., RA Callahan K.E., Keenan J., Delwart E.; RT "Enteric virome of Ethiopian children participating in a clean water RT intervention trial"; RL PLoS One 13(8):e0202054-e0202054(2018). XX RN [2] RP 1-7054 RA Altan E., Delwart E.; RT ; RL Submitted (28-SEP-2017) to the INSDC. RL Molecular Virology, Blood Systems Research Institute, 270 Masonic Ave, San RL Francisco, CA 94118, USA XX DR MD5; 99b63ba84d407b4aec7495abd030f518. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R10 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..7054 FT /organism="Human parechovirus 1" FT /host="Homo sapiens" FT /isolate="ETH_P6_2016" FT /mol_type="genomic RNA" FT /country="Ethiopia" FT /isolation_source="feces" FT /collection_date="Apr-2016" FT /db_xref="taxon:12063" FT CDS 508..7047 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:A0A346M130" FT /db_xref="InterPro:IPR000199" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR009407" FT /db_xref="InterPro:IPR009419" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:A0A346M130" FT /protein_id="AXQ03886.1" FT /translation="METIKNIADMATGVVSSVDSTINAVNERVENIGNEIGGNLLTKVA FT DDASNVLGPNCYATTAEPENKDVVQATTTVNTTNLTQHPSAPTMPFTPDFSNVDTFHSM FT AYDITTGEKNPSKLVRLETHEWTPTWARGHQITHVELPKVFWDKNSKPAYGQSRYFAAV FT RCGFHFQVQVNVNQGTAGSALVVYEPKPVVTYDSKLEFGAFTNLPHVLMNLAETTQADL FT CIPYVADTNYVKTDSSDLGQLKVYVWTPLSIPTGSATQVDVTILGSLLQLDFQNPRVFG FT QDVGIYDNAPTRKQNLKKILTMSTKYKWTRQKIDIAEGPGSMNMANVLSTTAAQSIALV FT GERAFYDPRTAGSKSRFDDLVKISQLFSVMSDSTTPSANHGIDAKGYFKWSSTTAPQSV FT VHRNIVYLKLFPNLNVFVNSYSYFGGSIVLRLSVYASTFNRGRLRMGFFPNATEDSTSE FT IDNAIYTICDLGSDNSFEITIPYSFSTWMRKTDGHPIGLFQIEVLNRLTYNSSSPNEVY FT CIVQGKMGQDARFFCPTGSVVTFQNSWGSQMDLTDPLCLEESAEECKQTISPNELGLTS FT AQDDGPLGDNKPNYFLNFKSMNVDIFTVSHTKVDNLFGRAWFYKEHTFTNEGQWRVNLE FT FPKQGHGSLSLLFAYFTGELNIHVLFLAEKGFLRVAHTYDTSENRVNFLSSNGVITIPA FT GEQMTLSAPYYSNKPLRTVRDSNSLGYLMCRPFLTGTTTGKIEVYLSLRCPNFFFPLPA FT PKVTTGRALRGDMANFSDESPYNQQPQNQVMKLAYLDRGFYKHYGIIVGDYVYQLDSDD FT IFKTALTGKAKFTKTQLTKDWIIEEECELDYFRVKYLESSVNSEHIFSVDTNCETIAKD FT IFGTHTLSQHQAIGLIGTILLTAGLMSTIKTPVNATTIKEFFNHAVEGNEQGLSLLVQK FT CTTFFSSAATEILDNDLVKFIVKILVRILCYMVLYCHKPNILTTACLSTLLIMDVTSSS FT VLSPSCKALMQCLMDGDVKKLAEVVAESMSNTDDDEIKEQICDTVRYTKTILSNQGPFK FT GFNEVSTAFRHIDWWVHTLIKIKDMVLSVFKPSLESKAIQWLERNKEHVCGILDYASDI FT IVESKDQSKVKTQDFYQRYSDCLAKFKPIMAICFRSCHNSISNTVYRLFQELARIPNRI FT STNNDLIRIEPIGIWIQGEPGQGKSFLTHTLSRQLQKSCKLNGVFTNPTASEFMDGYDN FT QDIHLIDDLGQTRKEKDIEMLCNCISSVPFIVPMAHLEEKGKFYTSKLVIATTNKSDFS FT STVLQDSGALKRRFPYIMHIRAAKAYSKAGKLNVSQAMATMSTGECWEVSKNGRDWETL FT KLKELVDKITSDFNERVKNYNAWKHQLENQTLDDLDDAVSYIKHNFPDAIPYIDEYLNI FT EMSTLIEQMEAFIEPKPSVFKCFANKIGSKISKASKEVVDWFSDKIKSMLSFVERNKAW FT LTVVSAVTSAISILLLVTKIFKKEDSKDERAYNPTLPVAKPKGAFPVSLREFKNEAPYD FT GQLEHIISQMAYITGSTTGHLTHCAGYQHDEIILHGHSIKYLEQEEELTLHYKNKVFPI FT DNPSVTQVTLGGKPMDLAILKCKLPFRFKKNSKYYTNKIGTESMLIWMTEQGIITKEVQ FT RVHHSGGIKTREGTESTKTISYTVKSCKGMCGGLLISKVEGNFKILGMHIAGNGEMGVA FT IPFNFLKNDMSDQGIVTEVTPIQPMYINTKTQIHKSPVYGAVEVKMGPAVLSKSDPRLE FT EPVDCLIKKSAAKYRVNKFQVNNGLWQGVKACVKSKFREIFGINGIVDMKTAILGTSHV FT NSMDLSTSAGYSFVKSGYKKKDLICLEPFSVSPMLEKLVQEKFHNLLKGNQITTIFNTC FT LKDELRKLDKIAAGKTRCIEACEVDYCIVYRMIMMELYDKIYQTPCYYSGLAVGINPYK FT DWHFMINALNDFNYEMDYSQYDGSLSSALLWEAVEVLAYCHDSPDLVMQLHKPVIDSDH FT VVFNERWLIHGGMPSGSPCTTVLNSLCNLMMCIYTTNLISPGIDCLPIVYGDDVILSLD FT KEIDPEKLQSIMADSFGAEVTGSRKDEPPSLKPRLEVEFLKRKPGYFPESTFIVGKLDT FT ENMIQHLMWMKNFSTFKQQLQSYLMELCLHGKDTYLHYIKILDPYLKEWNITVDDYDVV FT IAKLMPMVFD" XX SQ Sequence 7054 BP; 2269 A; 1348 C; 1444 G; 1993 T; 0 other; agacgcttaa agcatggtgt aaatcaactt ttctaactca cattttatgt ggggtggcag 60 atggcgtgcc ataactctat tagtgagata ccacgcttgt ggaccttatg ctcacacagc 120 catcctctag taagtttgtg agacgtctgg tgacgtgtgg gaacttattg gaaacaacat 180 tttgctgcaa agcatcctat cgccagcgga ataacatctg gtaacagatg cctctggggc 240 caaaagccaa ggtttaacag accctttagg attggttcaa aacctggaat gttgtggaag 300 atacttagta cctgctgatc tggtagtagt gcaaacacta gttgtaaggc ccacgaagga 360 tgcccagaag gtacccgtag gtaacaagtg acactatgga tctgatctgg ggccaggtac 420 ctctatcttg gtgacctggt taaaaaacgt ctagtgggcc aaacccgggg gggatccccg 480 gtttcctttt attttatcaa tgccacaatg gagacaatta agaacattgc agatatggcg 540 actggtgtag tcagttcagt tgattcaact atcaatgcag ttaatgagag agtggagaac 600 ataggcaatg aaattggggg taacttacta actaaagttg cagatgacgc atctaatgtg 660 cttggaccaa attgttatgc cactacagct gaacctgaga acaaagatgt agtacaagca 720 accacaactg tcaacacaac taatttgaca caacatccct cagcaccaac aatgccattc 780 actcctgatt tctctaatgt tgacacattt cactcaatgg catatgatat taccactgga 840 gagaaaaacc ctagcaaatt agttagattg gagactcatg agtggacacc aacttgggct 900 agaggacatc aaataaccca tgtggaatta ccaaaagtct tttgggacaa gaatagtaag 960 ccagcctatg gtcagtcaag gtattttgcg gctgtgcggt gtggtttcca ttttcaggta 1020 caagtaaatg ttaaccaagg gactgctggt agtgcattgg tagtatacga acctaaacct 1080 gttgtgacat atgactcaaa gctggaattt ggagcgttta ctaatttgcc acatgtgtta 1140 atgaatttgg ctgaaaccac acaggctgat ttatgtatcc cctatgttgc tgacacaaac 1200 tatgttaaga cagattcgtc agacttaggg caattaaaag tctatgtttg gacacctttg 1260 tccataccta caggctctgc tacacaagtt gatgtgacca tattaggtag tttattgcaa 1320 ttggatttcc aaaatcctag ggtatttggt caagatgttg gcatttatga caatgcacca 1380 acacggaagc aaaatcttaa aaagatactc acaatgagca ccaaatacaa gtggactagg 1440 caaaaaattg acatagcaga aggaccaggt tctatgaata tggcaaatgt tttgagcact 1500 actgctgcac aatcaattgc tttggttgga gaaagagcgt tctatgaccc aagaacagca 1560 ggcagtaaga gtagatttga tgatctggta aagatatccc agcttttctc tgtgatgagt 1620 gactcaacaa ctccctcagc caaccatggt atagatgcaa aaggctattt taagtggtca 1680 tctacaactg caccacaatc agtggtgcat agaaatattg tttacttaaa attgtttccc 1740 aatctaaatg tttttgtcaa cagttattca tattttggag gatcaatagt gcttaggtta 1800 agtgtttacg ctagcacttt caatagaggc cgtctgcgga tgggcttctt tccaaatgct 1860 accgaagaca gtacttcaga aatagacaat gctatataca caatttgtga tctgggtagt 1920 gacaacagtt ttgaaatcac tatcccatat tcattttcca cttggatgcg gaaaacagat 1980 ggccacccga ttggactatt ccaaattgaa gtgctcaata ggttaaccta caatagctcc 2040 agccccaatg aagtttattg tattgtgcaa ggtaaaatgg ggcaagatgc caggttcttt 2100 tgtccaaccg gttctgttgt gacttttcaa aactcatggg gttcacaaat ggatctgact 2160 gatccgttgt gtctggaaga atctgcagag gaatgtaaac aaaccatatc accaaatgaa 2220 ttaggattaa catcagccca ggatgatgga ccattgggtg acaacaaacc aaattatttc 2280 ctaaatttca agtccatgaa cgtagacatc ttcactgttt cccacactaa agtggataat 2340 ttatttggaa gagcttggtt ttataaggaa cacactttca ccaatgaagg acagtggaga 2400 gttaacctgg agtttccaaa acaaggtcat ggttcgcttt ctttgctatt tgcttatttt 2460 acaggtgagt taaacatcca tgttctgttc ctagctgaaa agggattcct tagagtagcc 2520 cacacctatg acacatcaga aaatagggtt aatttcctat catctaatgg tgttattacg 2580 atcccagcag gggaacaaat gacactatct gcaccatatt actcaaataa accccttaga 2640 acagttagag acagcaatag ccttgggtat ctgatgtgcc gaccattcct cactggaact 2700 acaactggga aaatagaggt atatcttagt ttgagatgcc caaatttctt ctttcccctc 2760 cccgcaccta aggttaccac tggtcgtgct ttgcggggtg acatggcaaa tttctcagat 2820 gagagcccat acaaccagca gccacagaat caggttatga agttagctta cctagatagg 2880 ggtttctaca agcattatgg catcatagtt ggagactatg tttaccaact agattcagat 2940 gatattttca aaactgcact aactggtaaa gctaagttca ccaagacaca gctgaccaag 3000 gattggatta ttgaagaaga gtgtgagctt gattatttta gagtcaaata tcttgagtca 3060 tcagtaaact ctgagcacat attttcagta gatacaaatt gtgagacaat tgcaaaagat 3120 atttttggca cccacaccct cagtcaacac caagctattg gtctgattgg cacaattctt 3180 ttaactgccg gtctgatgtc aactataaaa actccagtaa atgcaaccac aatcaaagag 3240 ttctttaatc acgcagtgga aggcaatgaa caggggttgt cattacttgt gcagaagtgt 3300 accaccttct tttcctcagc agccacagaa attctggaca acgatttagt caaattcatt 3360 gtaaaaatac ttgtcagaat cctttgctat atggttctat actgccacaa gccaaacata 3420 ctaactactg cctgtctgtc tacactattg ataatggatg taacatcctc atcagttttg 3480 tctccatctt gcaaagctct gatgcagtgc ttgatggatg gtgatgttaa aaaacttgct 3540 gaggttgtag ctgaatcaat gtcaaacact gatgatgatg agattaagga gcaaatttgt 3600 gacacagtaa gatacactaa aacgatctta tcaaaccagg gaccatttaa aggattcaat 3660 gaggtttcta ctgcattcag gcacattgat tggtgggttc atactttgat taagatcaaa 3720 gatatggtat tgagtgtttt caaacctagt ctagaaagta aagccataca gtggctggaa 3780 agaaataaag aacatgtttg cggcattctt gattatgcct ctgacattat tgtggagtca 3840 aaagatcagt caaaggttaa aactcaagat ttttatcaaa gatattcaga ttgcttagcc 3900 aaatttaaac caattatggc catttgcttc aggagttgtc ataacagcat tagtaataca 3960 gtgtaccggc tcttccaaga attggccagg attcccaata gaatcagcac taataatgat 4020 ttgatcagaa ttgaacctat tggcatttgg atccagggtg agccagggca aggcaagtct 4080 ttcctaaccc atactttatc aagacaacta caaaagtcat gtaagctcaa tggagttttc 4140 actaacccaa ctgctagtga gttcatggat ggttacgaca accaagacat ccacctaatt 4200 gacgacttgg gccaaacaag gaaggaaaag gatattgaaa tgttatgcaa ctgtatttca 4260 tctgttccct tcattgtacc aatggcacat cttgaggaga aaggaaagtt ctatactagt 4320 aagttagtaa ttgccaccac caataaatca gatttctcta gtacagtcct acaagattct 4380 ggggcattaa agaggagatt cccttacatt atgcacattc gggcagcaaa agcttacagt 4440 aaggctggaa aacttaatgt gagtcaagct atggctacaa tgtcaactgg agaatgttgg 4500 gaagtgtcaa agaatggtag agattgggaa acattaaaat taaaagaact ggttgacaag 4560 attacatctg attttaatga gagggttaaa aactacaatg cttggaaaca tcaattagag 4620 aatcaaaccc tcgatgattt agatgatgca gtttcttata ttaagcataa cttcccagat 4680 gctataccat acattgatga atatctcaac attgaaatgt caactttaat agaacaaatg 4740 gaggcattca ttgaacccaa acccagtgta tttaagtgtt ttgctaataa aattggttca 4800 aaaatctcta aagcttctaa agaagttgtg gactggttct cagataagat aaaatccatg 4860 ctcagctttg ttgaacgaaa taaagcgtgg cttacagttg tttcagcagt taccagtgct 4920 attagtatac tactattggt gacaaaaatt ttcaagaagg aagattctaa ggatgaaaga 4980 gcatacaatc caactctccc tgtggccaaa ccaaaaggtg cattccctgt gtctctaaga 5040 gaattcaaaa atgaagcacc ttatgatggt caactagagc acataatatc ccaaatggca 5100 tacataactg gttccaccac agggcatctt acacattgtg caggatatca gcacgatgag 5160 attattttgc atggacactc aatcaagtac cttgaacagg aagaagaact aaccctacac 5220 tacaagaaca aagtgtttcc aattgataat ccatctgtaa ctcaagtcac actgggtggt 5280 aagcccatgg acttggctat tcttaagtgc aagctaccat ttagattcaa gaagaattcc 5340 aaatattata ctaataaaat tggcacagag agcatgttga tttggatgac tgaacaaggt 5400 attatcacta aggaagttca aagagtccac cattccgggg gaatcaaaac ccgggaggga 5460 actgaaagca caaagacaat tagttacact gtaaagtcct gtaaaggtat gtgtggaggc 5520 ctactcatct caaaagtgga aggaaatttt aaaatacttg ggatgcatat agctggcaat 5580 ggggaaatgg gtgttgccat cccatttaac ttcctcaaga atgacatgtc tgatcaaggc 5640 atcgtaacgg aagttacacc catacaaccc atgtacatca acactaaaac ccaaatccac 5700 aagagtccag tatatggtgc tgtagaggtt aagatgggac cagcagtcct gagcaaatca 5760 gatccgaggc tcgaagaacc agttgattgc ttaattaaga aatcagctgc taaatataga 5820 gtcaacaaat ttcaggttaa caatggacta tggcagggcg tcaaagcttg tgtcaagtct 5880 aagtttagag aaatttttgg catcaatggc attgttgaca tgaagacagc aattttggga 5940 acatctcacg tgaactctat ggacctaagt acatcagctg gatatagttt tgttaaatca 6000 ggttacaaaa agaaagatct catttgcctt gaacctttct cagtttcacc catgcttgag 6060 aaacttgtac aggaaaaatt tcataacctg ctgaagggca atcaaatcac cacaatcttt 6120 aacacatgtc tcaaagatga actcaggaaa ttggacaaga ttgcagctgg taagactaga 6180 tgcattgaag cctgtgaggt tgactactgt attgtctaca gaatgattat gatggaactt 6240 tatgacaaaa tttaccagac cccatgttat tactctggtc ttgcagttgg gattaatccc 6300 tacaaggatt ggcatttcat gattaatgca cttaatgatt tcaactatga aatggattat 6360 tctcaatatg atggctcact tagctcagca ctgctatggg aggctgtgga agtgttagct 6420 tattgtcatg attcaccaga cctagtcatg cagttgcaca aaccagtaat tgattcagat 6480 catgtagttt tcaatgagag gtggttgata catggtggta tgccatcagg ttctccatgt 6540 accactgtgt taaattcatt atgtaatttg atgatgtgca tctacactac caatctaatt 6600 agtccaggaa ttgattgttt accaattgtt tatggtgacg atgtcatttt gtcacttgat 6660 aaagaaatag acccagagaa actgcaaagt atcatggcag attcatttgg tgctgaagtg 6720 acaggttctc gcaaggatga gcctccatca ttaaaaccta gattggaggt tgaattccta 6780 aagcgcaagc ctggttactt cccagaatcc acatttatag tagggaaatt ggatactgaa 6840 aatatgatac aacatttaat gtggatgaaa aatttcagca cattcaagca gcaacttcaa 6900 tcctacttga tggagttatg cctccatgga aaagacactt atctacacta catcaaaatt 6960 ttggatcctt atcttaaaga gtggaatatc actgtagatg attatgatgt tgttattgct 7020 aagttgatgc ccatggtgtt tgattaagac taat 7054 // ID MG026488; SV 1; linear; genomic RNA; STD; VRL; 6957 BP. XX AC MG026488; XX DT 01-SEP-2018 (Rel. 137, Created) DT 23-SEP-2018 (Rel. 138, Last updated, Version 2) XX DE Human parechovirus 5 isolate ETH_P9_2016 polyprotein gene, partial cds. XX KW . XX OS Human parechovirus 5 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Parechovirus. XX RN [1] RC Publication Status: Online-Only RP 1-6957 RX DOI; .1371/journal.pone.0202054. RX PUBMED; 30114205. RA Altan E., Aiemjoy K., Phan T.G., Deng X., Aragie S., Tadesse Z., RA Callahan K.E., Keenan J., Delwart E.; RT "Enteric virome of Ethiopian children participating in a clean water RT intervention trial"; RL PLoS One 13(8):e0202054-e0202054(2018). XX RN [2] RP 1-6957 RA Altan E., Delwart E.; RT ; RL Submitted (28-SEP-2017) to the INSDC. RL Molecular Virology, Blood Systems Research Institute, 270 Masonic Ave, San RL Francisco, CA 94118, USA XX DR MD5; 3655bf5e693b960797b71508bb560da8. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R10 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6957 FT /organism="Human parechovirus 5" FT /host="Homo sapiens" FT /isolate="ETH_P9_2016" FT /mol_type="genomic RNA" FT /country="Ethiopia" FT /isolation_source="feces" FT /collection_date="Apr-2016" FT /db_xref="taxon:376148" FT CDS 538..>6957 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:A0A346M131" FT /db_xref="InterPro:IPR000199" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR009407" FT /db_xref="InterPro:IPR009419" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:A0A346M131" FT /protein_id="AXQ03887.1" FT /translation="METIKSIADMATGFTNTIDSTVNAVTEGVSKIGNDSGGEILTKVA FT DDASNLLGPNCIASTSKPENKDVVQATTTVNTTNLTQHPSAPTMPFTPDFSNVDNFHSM FT AYDITTGDKNPSKLIRLDTTTWQHTWPRQHLINDVQLPKAFWDKNSKPAYGQSRYFAAV FT RCGFHFQVQINVNQGTAGCALVVYEPKPIVTHGSHLEFGSFTNLPHVLMNLAETTQADL FT CIPYVSDTNYVKTDSSDLGRLRVYVWTPLTIPSSATNDVDVTVLGSLLQLDFQNPRTYD FT TDVDIYDNSPPNRKTKYSHTRMAKKILTMSTKYKWTRNKIDIAEGPGSMNMANVLSTTG FT AQSIALVGERAFYDPRTAGSKSRFGDLINIAQLFSVMSDTTTPSTSSGIDDFGYFDWSA FT TYVPQQVIHRNVVKLSQFSNLKPFVNAYTYFRGSLVLRMSVYASTFNRGRLRMGFFPNF FT TTNTTSEMDNAIYTICDIGSDNSFEITIPYTFSTWMRKTDGRPLGLFQVEVLNRLTYNS FT SCPNKVHCIVQGRLGNDARFYCPTGSLVEFQNSWGSQMDLSDPLCLEDDESEDCKQTIS FT PDELGLTSAQDDGPLGVEKPNYFLNFRAINVDIFTVSHTKVDNIFGRAWLAHEHTFADD FT GTWRVNLDFPTQGHGTLTRLFTYYSGELNVHVLYLSDNGFLRVTHAYDHNDNRSNFLSS FT NGVITVPAGEQMTLSVPFYSSKPLRTIRDSGALGRLICKPLLTGTHSGKIEVYLSLRCP FT NLFFPSPAPKQKASRSITSNSFEDESPYGQQETRKMKLAYLDRGFYKHYGIIVDDYVYQ FT LDSDDIFKTALTGKAKFTKTKLTTDWIVEEECELDYFRIKYLESSVNSEHIFSVGSNCE FT TIAKDIFGTHTLSQHQAIGLLGTILLTAGLMSTIKTPVNATTIKEFFNHAVDGDEQGLS FT LLVQKCTTFLSXXXXXXXXXXXXXXXXXXXXXXXXXXXLYCHKPNILTTACLSTLLIMD FT ITSSSVLSPSCKALMQCLMDGDVKKLAEVVAESMSNTDDEEIKEQICDTVKYTKSILSN FT QGPFKGFNEVSTAFRHIDWWIHTLLKIKDMVLSVFKPSMESKAIQWLERNKEHVCAILD FT YASDIIVESKDQTRMKSQEFYQKYTDCLTKFKPIMAICFRSCHNSISNTVYRLFQELAR FT IPSRISTQNDLIRVEPIGVWIQGEPGQGKSFLTHTLSRQLQKSCKLNGVYTNPTASEFM FT DGYDNQDIHLIDDLGQTRKEKDIEMLCNCISSVPFIVPMAHLEEKGKFYTSKLVIATTN FT KSDFSSTVLQDSGALKRRFPYIMHIRAAKAYSKSGKLNVSQAMSTMATGECWEVSKNGR FT DWETLKLQDLVNKITEDYIERQKNYNCWKQQLENQTLDDLDDAVSYIKHNFPDAIPYID FT EYLNIEMSTLIEQMEAFIEPRPSVFKCFATKVANQTRKAAKEVVEWFSSKIKSMLSFVE FT RNKAWLTVVSAVTSAISILLLVTKIFKKEDSKDERAYNPTLPVAKPKGTFPVSQREFKN FT EAPYDGQLEHIVSQMAYITGSTTGHITHCAGYQHDEIILHGHSIKYLEQEDELTLHYKN FT KIFPVENPSVTQVTLGGKPMDLAILKCKLPFRFKKNSKYYTNKIGTESMLIWMTEQGII FT TKEVQRVHHSGGIKTREGTESTKTISYTVKSCKGMCGGLLISKVEGNFKILGMHIAGNG FT EMGVAIPFNFLKNDISDQGIVTEVTPIQPMYVNTKSQIHKSPVYGAVEVKMGPAVLSKS FT DPRLEDPVECLIKKSAAKYRVNKFQVNNELWQGVKACVKSKFREIFGVNGVVDMKTAIL FT GTSHVNSMDLSTSAGYSFVKSGYKKKDLICLEPFFVSPILEKLVQDKFHALLKGNQIST FT IFNTCLKDELRKLDKISAGKTRCIEACEVDYCIVYRMIMMEIYDKIYQTPCYYSGLAVG FT INPYKDWHFMINALNDYNYEMDYSQYDGSLSSMLLWEAVEVLAYCHDSPDLVMQLHKPV FT IDSDHVVFNERWLIHGGMPSGSPCTTVLNSLCNLMMCIYTTNLISPGIDCLPIVYGDDV FT ILSLDKEIDPEKLQGIMADSFGAEVTGSRKDEPPSLKPRMEVEFLKRKPGYFPESTFIV FT GKLDTENMIQHLMWMKNFSTFKQQLQSYLM" XX SQ Sequence 6957 BP; 2232 A; 1300 C; 1397 G; 1948 T; 80 other; gatatcatct tgcaataaga agagtggggt taagacgctt aaagcataga gacaattttc 60 ttttctaacc cacatttatg tggggtggca gatggcgtgc catgactcta ttagtgagat 120 accacgcttg tggaccttat gctcacacag ccatcctcta gtaagtttgt gagatgtctg 180 atgacgtgtg ggaacttatt ggaagcaaca ttttgctgta aagcatccta ttgccagcgg 240 aacaacacct ggtaacaggt gcctctgggg ccaaaagcca aggtttaaca gaccctttag 300 gattggttca aacctgaaat gctgtggaaa atatttagta cctgccaatt tggtagtaat 360 gcaaacacta gttgtaaggc ccacgaagga tgcccagaag gtacccgtag gtaacaagtg 420 acactatgga tctgatctgg ggccaaatac ctctatcttg gtgatttggt taaaaaacgt 480 ctagtgggcc aaacccaggg gggatccctg gtttcctttt attttataat cgctattatg 540 gagacaatca agagcattgc agacatggcc actggtttca ccaataccat tgattcaact 600 gttaatgctg taacagaagg tgtctctaaa attggcaatg attcaggtgg tgagatccta 660 acaaaggtag ctgatgatgc ttccaacctc cttggtccaa attgtatagc atcaacgtca 720 aaaccagaaa acaaggatgt tgtgcaagca actacaactg tcaatactac caatcttaca 780 cagcacccat ctgcgccaac tatgccattc acaccagact tttcaaatgt tgacaatttc 840 cactctatgg cttatgacat tactacaggt gacaaaaacc ctagcaaact cataaggctg 900 gacaccacca catggcaaca cacttggcct aggcaacacc tcattaatga tgtacaatta 960 ccaaaagctt tctgggacaa aaacagtaaa ccagcttacg gacaatcaag gtattttgct 1020 gctgttaggt gcggttttca ttttcaagtt caaattaatg tgaaccaagg aactgctgga 1080 tgtgctctag tagtctatga acctaagccc attgttacac atggtagtca tcttgaattc 1140 ggttcattta ctaacttacc acatgtttta atgaacctag cagaaaccac tcaggcagac 1200 ttatgtatcc cctatgtatc agatacaaac tatgtcaaga cagattcatc tgatttaggg 1260 cgtttgagag tctatgtctg gacaccatta acaattccct ctagtgccac aaatgatgtg 1320 gatgtgactg tacttgggag tctgttgcaa ctggatttcc agaacccacg cacctatgat 1380 actgatgtag atatttatga caacagtcca ccaaatagga aaactaaata cagtcatact 1440 agaatggcta agaaaatctt gacaatgtca acaaaatata agtggactag gaacaaaatt 1500 gacatcgctg agggtcctgg atccatgaat atggctaatg tgttgagtac aacaggtgca 1560 cagtcaatag ctttagttgg tgaaagggct ttttatgacc ctcgcactgc aggtagcaaa 1620 tctagatttg gagacctaat taacatagcc caattgtttt ctgtgatgtc agacaccaca 1680 acaccatcca cttctagtgg gattgatgat tttggatact ttgattggtc agctacctat 1740 gtaccacaac aggtcattca ccgcaatgtg gtgaagttga gtcaattttc aaatttgaaa 1800 ccattcgtaa acgcatacac ctattttagg ggttcccttg tgctcagaat gtcagtatat 1860 gctagcactt tcaatagggg acggctgcgc atgggtttct ttccaaattt tactacaaac 1920 acaacttcag agatggataa tgctatatat actatttgtg atattggatc agataatagt 1980 tttgaaatta ctataccata taccttttcc acttggatga gaaaaactga tggtagacct 2040 cttggtcttt tccaggttga agttctgaat agattgacat acaacagttc atgtccaaat 2100 aaagtacatt gcattgtgca ggggagattg ggaaatgatg ccagatttta ttgcccaaca 2160 ggatcattag ttgagtttca aaattcatgg ggatcacaaa tggatctgag tgacccattg 2220 tgcctggaag atgatgagtc tgaagattgt aagcaaacca tctcaccaga tgaattggga 2280 ctgacttctg cacaggatga tgggcctttg ggtgtagaaa aaccaaatta ttttttgaac 2340 ttcagggcca tcaatgttga catattcact gttagtcata cgaaagtgga caacatcttt 2400 ggaagggcct ggttagcaca cgaacacact tttgctgatg atggtacatg gagagtgaac 2460 ttagattttc caactcaagg ccatggtaca ctaaccagac ttttcactta ttactcaggt 2520 gagctaaatg tacatgttct atatctcagt gacaatggat ttctcagagt gacacatgca 2580 tatgaccaca atgataatag atccaatttt ctatcatcaa atggtgttat cacagtacct 2640 gcgggagaac aaatgacctt gtcagtgcca ttttattcat caaaaccact taggacaatc 2700 agggactctg gtgcattggg cagattaata tgcaaaccat tgttgactgg aacccattca 2760 gggaaaattg aggtttactt aagcttacgc tgcccaaatt tgttcttccc ttccccggca 2820 cctaaacaga aagcttctag atcaattacc agcaattctt ttgaggatga aagtccatat 2880 gggcagcaag aaactagaaa gatgaagtta gcatatttgg atagaggatt ctataagcac 2940 tatggtataa ttgttgatga ttatgtttac caactggact cagatgacat ttttaaaaca 3000 gctttaactg gaaaggccaa gttcaccaag acaaagttaa ctacagattg gattgtagaa 3060 gaagaatgtg agttagacta tttcagaatc aaataccttg agtcttcagt aaattcggaa 3120 catatatttt cagtaggttc taattgtgag actattgcta aagacatctt tggtactcat 3180 actcttagtc aacaccaggc tattggacta ttaggtacaa ttcttttgac tgctggtttg 3240 atgtcaacca ttaaaacacc tgtaaatgcc acaaccatta aggagttttt caatcatgct 3300 gtggatgggg atgagcaggg tttatcttta cttgtacaaa aatgcactac cttcctctcn 3360 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3420 nnnnnnnnnn nnnnnnnnnt attatattgc cacaaaccaa atattttaac aactgcttgt 3480 ctatccacac tgttaataat ggatatcact tcttcatctg tactgtcgcc ttcttgtaaa 3540 gcacttatgc agtgtttaat ggacggtgat gttaagaagt tggcagaagt tgtagctgaa 3600 tccatgtcta atacggatga tgaagaaatc aaagaacaaa tttgtgacac agtaaaatat 3660 actaagagca ttctatcaaa ccagggacct ttcaagggct ttaatgaagt atcaacagca 3720 ttcaggcaca ttgattggtg gatacacaca ctactcaaaa ttaaggacat ggtgctcagt 3780 gtttttaaac caagcatgga aagcaaagcc atccaatggt tggagagaaa caaagaacat 3840 gtgtgtgcta ttctagatta tgcctcagac attattgtgg agtcaaaaga ccagacaaga 3900 atgaagtcac aagagtttta ccagaaatac acggattgct tgactaaatt caaaccaatt 3960 atggccattt gttttaggag ctgtcataac agtattagta acacagttta tagacttttt 4020 caagaactgg ccaggatacc atcaaggatt agtacccaaa atgacttaat tagagttgag 4080 ccaattggtg tgtggataca aggtgaacca gggcaaggga agtctttctt gacacacact 4140 ctgtccaggc aattacaaaa atcctgcaaa ttaaatggtg tttacactaa cccaaccgct 4200 agtgaattta tggatggtta tgacaaccag gacatccatc tcattgacga tttgggtcag 4260 actaggaaag agaaagatat tgaaatgttg tgcaactgca tatcatcagt accctttata 4320 gtccctatgg cacaccttga ggaaaaaggt aaattctaca ctagtaaatt agtgattgca 4380 actactaaca aatcagactt ttcaagtact gttttacaag actcaggtgc tttgaagaga 4440 agattcccat atatcatgca catcagggct gctaaagctt acagtaagtc gggtaaactc 4500 aatgttagcc aagctatgtc tacaatggct acaggtgaat gctgggaagt gtccaaaaat 4560 ggtagagatt gggaaactct aaaattacaa gatttggtca ataaaattac tgaagattat 4620 atagagagac agaagaacta caattgctgg aagcaacagc ttgaaaatca gactctagat 4680 gacctggatg atgcagtctc ttacattaag cacaatttcc cagatgcgat cccttacatt 4740 gatgaatatc ttaatattga gatgtccact ttaattgaac aaatggaagc ctttattgag 4800 cctagaccca gtgtttttaa atgttttgca actaaagttg caaatcaaac aagaaaggca 4860 gccaaagaag ttgtggaatg gttcagtagt aagatcaaat caatgttgag tttcgtggaa 4920 agaaataagg cttggttaac agtagtttct gctgtcacta gtgcaataag cattttactt 4980 ttggtgacta agatattcaa aaaagaagat tcaaaggatg agagagccta taacccaaca 5040 ctccctgttg caaaaccaaa aggtaccttc ccagtgtcac aaagggaatt caaaaatgag 5100 gcaccctatg atggccaatt ggagcacata gtgtcccaaa tggcatacat aactggatcc 5160 accacaggac acatcactca ttgtgctggt tatcaacatg atgagattat actgcatgga 5220 cattcaatta agtaccttga gcaggaggat gaattgactc tacactataa aaacaagatt 5280 ttcccagttg agaatccatc tgtgacacaa gtcactttgg gtggcaaacc tatggacttg 5340 gccatcctca aatgtaaatt gccatttagg ttcaagaaaa actctaaata ttacaccaac 5400 aagattggaa cagaaagtat gctaatttgg atgactgaac aaggtataat cacaaaggaa 5460 gttcagagag ttcaccattc aggtggtatt aaaaccagag aggggaccga gagcacaaag 5520 actatcagtt acacagtgaa atcttgcaaa ggcatgtgtg gtggtttact catttctaaa 5580 gtagaaggaa atttcaaaat tcttgggatg cacatagcag gcaatgggga aatgggtgta 5640 gccatcccat tcaacttcct caaaaatgac atttctgatc aaggcattgt gacagaagtg 5700 acacctatac aacccatgta cgttaacact aagtctcaaa tccacaagag cccagtctac 5760 ggtgcagtgg aagtcaaaat ggggccagca gttttgagta aatcagatcc cagacttgag 5820 gatccagttg aatgcctaat taagaaatca gctgcaaaat atagagttaa caaattccaa 5880 gttaacaatg aactgtggca aggtgtcaaa gcctgtgtca aatccaaatt cagagagatc 5940 tttggagtca atggtgttgt agacatgaaa acagccatat tgggtacgtc tcatgtgaac 6000 tctatggatt tgagtacatc cgctggatat agctttgtga aatcaggtta taagaagaaa 6060 gatttaattt gcctagagcc attctttgtt tcacctatac ttgaaaaact cgtacaagat 6120 aaattccatg cattacttaa gggtaaccaa atttctacaa ttttcaacac ttgtctaaag 6180 gatgaactaa gaaaattgga taaaatttca gccggtaaga ctagatgcat tgaggcctgt 6240 gaagttgact attgtatagt ctatagaatg attatgatgg aaatttatga caagatttat 6300 cagacaccat gttattattc aggtcttgct gttgggatta acccatataa agattggcac 6360 ttcatgatta atgcactaaa tgactataat tatgagatgg actattctca gtatgatggc 6420 tctcttagtt caatgctatt gtgggaggca gtagaggtct tggcttattg tcatgattca 6480 cctgatcttg tcatgcaatt acacaaacca gtaattgact cagaccatgt ggtcttcaat 6540 gagagatggc taattcacgg tggtatgcca tcagggtcac catgtactac tgtgttaaat 6600 tcattgtgca atctgatgat gtgcatttac actaccaatt tgatcagccc tggaattgac 6660 tgtctaccaa ttgtttatgg tgatgacgta atcttgtcac ttgataaaga aatagatcca 6720 gagaaactac aaggtatcat ggcagattca tttggagctg aagtaactgg ctctcgcaag 6780 gatgagcctc catcattaaa acccaggatg gaggtggaat tcctaaagcg taaacctggt 6840 tacttcccag agtccacatt tatagtagga aaattggaca ctgaaaatat gatacaacat 6900 ttaatgtgga tgaaaaattt cagcacattc aagcagcaac ttcaatccta cttaatg 6957 // ID MG026489; SV 1; linear; genomic RNA; STD; VRL; 7124 BP. XX AC MG026489; XX DT 01-SEP-2018 (Rel. 137, Created) DT 23-SEP-2018 (Rel. 138, Last updated, Version 2) XX DE Human parechovirus 1 isolate ETH_P16_2016 polyprotein gene, complete cds. XX KW . XX OS Human parechovirus 1 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Parechovirus. XX RN [1] RC Publication Status: Online-Only RP 1-7124 RX DOI; .1371/journal.pone.0202054. RX PUBMED; 30114205. RA Altan E., Aiemjoy K., Phan T.G., Deng X., Aragie S., Tadesse Z., RA Callahan K.E., Keenan J., Delwart E.; RT "Enteric virome of Ethiopian children participating in a clean water RT intervention trial"; RL PLoS One 13(8):e0202054-e0202054(2018). XX RN [2] RP 1-7124 RA Altan E., Delwart E.; RT ; RL Submitted (28-SEP-2017) to the INSDC. RL Molecular Virology, Blood Systems Research Institute, 270 Masonic Ave, San RL Francisco, CA 94118, USA XX DR MD5; 00d7ab39a54f7ce11345ea0ca8821adf. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R10 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..7124 FT /organism="Human parechovirus 1" FT /host="Homo sapiens" FT /isolate="ETH_P16_2016" FT /mol_type="genomic RNA" FT /country="Ethiopia" FT /isolation_source="feces" FT /collection_date="Apr-2016" FT /db_xref="taxon:12063" FT CDS 549..7091 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:A0A346M132" FT /db_xref="InterPro:IPR000199" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR009407" FT /db_xref="InterPro:IPR009419" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:A0A346M132" FT /protein_id="AXQ03888.1" FT /translation="METIKNIADMATGVVSSVDSTINAVNEKVENVGNEIGGNLLTKVA FT DDASNVLGPNCFATTAEPENKNVVQATTTVNTTNLTQHPSAPTMPFSPDFSNVDNFHSM FT AYDITTGDKNPSKLVRLETHEWTPSWARGHQITHVELPKVFWNNQDKPAYGQSRYFAAV FT RCGFHFQVQVNVNQGTAGSALVVYEPKPVVTYDSKLEFGAFTNLPHVLMNLAETTQADL FT CIPYVADTNYVKTDSSDLGQLKVYVWTPLTIPTGSANQVDVTILGSLLQLDFQNPRVFG FT QDVNIYDNAPNSKKKNWKKIMTMSTKYKWTRTKIDIAEGPGSMNMANVLSTTGAQSVAL FT VGERAFYDPRTAGSKSRFDDLVKISQLFSVMADSTTPSENHGADAKGYFKWSATTAPQS FT IVHRNIVYLKLFPNLNVFVNSYSYFRGSLVLRLSVYASTFNRGRLRMGFFPNATADSTS FT TLDNAIYTICDIGSDNSFEITIPYSFSTWMRKTNDHPIGLFQIEVLNRLTYNSSSPSEV FT YCIVQGKMGQDARFFCPTGSVVTFQNSWGSQMDLTDPLCIEDDAENCKQTMSPNELGLT FT SAQDDGPLGQEKPNYFLNFRSMNVDIFTVSHTKVDNLFGRAWFFEEHTFTNEGQWRVPL FT KFPKQGHGSLSLLFAYFTGELNIHVLFLSERGFLRVAHTYDTSGDRVNFLSSNGVITIP FT AGEQMTLSAPYYSNKPLRTVRDDNSLGYLMCKPFLTGTTTGKIEVYLSLRCPNFFFPLP FT APKVTTSRAIRGDMANFSDQSPYGQQSQGQVMKLAYLDRGFYKHYGIIVGDSVYQLDSD FT DIFKTALTGKARFTKTKLTPDWVVEEECELDYFRVKYLESSVNSEHIFSVDHNCETIAK FT DIFGTHTLSQHQAIGLIGTILLTAGLMSTIKTPVNATTIKEFFNHAVEGDEQGLSLLVQ FT KCTTFFSSAATEILDNDLVKFIVKILVRILCYMVLYCHKPNILTTACLSTLLIMDVTSS FT TVLSPSCKALMQCLMDGDVKKLAEVVAESMSNTDDEEIKEQICDTVKYTKSILSNQGPF FT KGFNEVSTAFRHIDWWIHTLLKIKDMVLSVFKPSMESKAIQWLERNKEHVCAILDYASD FT IIVESKDQTRMKSQEFYQKYTDCLAKFKPIMAICFRSCHNSISNTVYRLFQELARIPSR FT ISTQNDLIRVEPIGVWIQGEPGQGKSFLTHTLSRQLQKSCKLNGVYTNPTASEFMDGYD FT NQDIHLIDDLGQTRKEKDIEMLCNCISSVPFIVPMAHLEEKGKFYTSKLVIATTNKSDF FT SSTVLQDSGALKRRFPYIMHIRAAKAYSKSGKLNVSQAMSTMATGECWEVSKNGRDWET FT LKLQDLVKKITEDYEERQKNYNCWKQQLENQTLDDLDDAVSYIKHNFPDAIPYIDEYLN FT IEMSTLIEQMEAFIEPRPSVFKCFATKVANQTRKAAKEVVEWFSNKIKSMLSFVERNKA FT WLTVVSAVTSAISILLLVTKIFKKEDSKDERAYNPTLPVAKPKGTFPVTQREFKNEAPY FT DGQLEHIISQMAYITGSTTGHLTHCAGYQHDEVILHGHSIKYLEQEEELTLHYKNKVFP FT IEQPSVTQVTLGGKPMDLAILKCKLPFRFKKNSKYYTNKIGTESMLIWMTEQGIITKEV FT HRVHHSGGIRTREGTESTKTISYTVKSCKGMCGGLLISKVEGNFKILGMHIAGNGEMGV FT AIPFNFLKNDMSDQGIVTEVTPVQPIYINTKSQIHKSPVYGAVEVKMGPAVLNKSDPRL FT EEPVECLIKKSAAKYRVNKFQVNNELWQGVKACVKSKFREIFGVNGVVDMKTAILGTSH FT VNSMDLSTSAGYSFVKSGXXXXXXXXXXXFSVSPMLEKLVQDKFHALLKGNQISTIFNT FT CLKDELRKLDKISAGKTRCIEACEVDYCIVYRMIMMEIYDKIYQTPCYYSGLAVGINPY FT KDWHFMINALNEYNYEMDYSQYDGSLSSMLLWEAVEVLAYCHDSPDLVMQLHKPVIDSD FT HVVFNERWLIHGGMPSGSPCTTVLNSLCNLMMCIYTTNLISPGIDCLPIVYGDDVILSL FT DKEIDPEKLQGIMADSFGAEVTGSRKDEPPSLKPRMEVEFLKRKPGYFPESTFIVGKLD FT TENMIQHLMWMKNFSTFKQQLQSYLMELCLHGKDTYLHYIKILDPYLKEWNITVDDYDV FT VIAKLMPMVFD" XX SQ Sequence 7124 BP; 2271 A; 1351 C; 1477 G; 1992 T; 33 other; gatacactat tcgaaggcat ctagcaataa gaagagtgga tttagggcgc ttaaagcata 60 gtgcaaataa tcatttctag cctgtgtttt acacagggtg gcagatggcg tgccataact 120 ctactagtga gataccacgc ttgtggacct tatgctcaca cagccatcct ctagtaagtt 180 tgtaaagcgt ctgatgacgt gtgggaactt attggaaaca atagtttgct gcaaagcatc 240 ctactgccag cggaataaca cctggtaaca ggtgcctctg gggccaaaag ccaaggttta 300 acagaccctt taggattggt tcaaacctga actattgtgg aagatattta gtacctgctg 360 atttggtagt tgtgcaaaca ctagttgtaa ggcccacgaa ggatgcccag aaggtacccg 420 taggtaacaa gtgacactat ggatctgatc tggggccaac tatctctatc ttgatgagtt 480 ggttaaaaaa cgtctagtgg gccaaacctg ggggggaccc cagtttcctt ttattttata 540 atgccactat ggagacaatc aagaacattg cagatatggc tactggagtt gttagttcag 600 ttgactcaac tattaatgct gtcaatgaaa aggtggaaaa cgttggtaat gaaattggag 660 ggaatttgtt aactaaagtt gcagacgatg cttcaaatgt gctcggacct aattgttttg 720 ccaccactgc tgaaccagag aacaaaaatg ttgtccaggc tactaccaca gttaatacta 780 caaatttgac acaacatcca tcagccccga ccatgccatt ctcaccagac ttttccaatg 840 ttgacaattt tcactcaatg gcatacgaca tcacaactgg agacaaaaac cccagtaagt 900 tggttaggct tgaaactcat gaatggaccc catcttgggc taggggacat cagataactc 960 atgttgagtt gcccaaagtt ttctggaata accaagataa accagcttat ggacaatctc 1020 gctactttgc tgcagtgaga tgcggctttc attttcaagt tcaagtaaat gttaaccaag 1080 gaaccgctgg tagtgcttta gtggtttatg aaccaaaacc agtagttacc tatgattcaa 1140 aattggaatt tggtgcattt acaaatttgc ctcatgtcct tatgaattta gcagaaacta 1200 cacaggctga cttgtgcatc ccctatgttg ctgacacaaa ctatgtaaag acagattcgt 1260 cagacctagg gcaactaaaa gtttatgttt ggacaccact tacaattcct actggttctg 1320 ctaatcaggt ggatgtgaca atcttaggta gcctacttca attagatttc caaaacccta 1380 gggtttttgg tcaagatgtt aacatatatg ataatgcacc taacagcaaa aagaaaaatt 1440 ggaagaaaat catgacaatg agcactaaat ataagtggac tagaactaag atcgacattg 1500 cagaagggcc aggatctatg aatatggcta atgtactcag caccactggc gcccaatcag 1560 tggccttagt gggcgagaga gcattctatg accctagaac tgctggtagt aagtcaaggt 1620 ttgatgatct tgtgaaaata tcacaactgt tttcagtcat ggctgattca accacaccat 1680 ctgaaaacca tggcgctgac gctaaaggat acttcaaatg gtcagccaca actgccccac 1740 aaagcatagt tcataggaat atagtgtacc tgaaactttt cccaaattta aatgtctttg 1800 tcaatagcta ctcttatttc agaggctcac ttgtccttag gttgagtgtg tatgctagca 1860 cctttaatag aggacgattg aggatggggt tcttccctaa tgccacagca gactccactt 1920 ccacactgga caatgcaata tacacaattt gtgatattgg gagtgataat agttttgaaa 1980 tcactattcc atactctttc tccacttgga tgaggaaaac aaatgatcac ccaattggac 2040 tattccagat tgaagtctta aacaggctca catataacag ctctagcccc tcagaagttt 2100 actgtatagt gcaaggcaaa atggggcaag atgccagatt cttctgccca actggttcag 2160 ttgtaacttt tcagaattca tggggatcac agatggatct aactgatcca ctctgcattg 2220 aagatgatgc tgaaaattgt aagcagacaa tgtcacctaa tgaattagga ctcacatcag 2280 ctcaagatga tggtccttta ggtcaggaga aacccaacta ttttcttaac tttaggtcta 2340 tgaatgtaga tatttttact gtttcacaca caaaagtaga caacttgttt ggaagggctt 2400 ggttctttga ggagcacaca tttactaatg aggggcagtg gagagtacca ttgaaattcc 2460 ctaaacaagg acatggatcc ttatcacttc tgtttgctta tttcactggt gagctaaaca 2520 ttcatgttct gttccttagt gagagagggt ttcttagggt tgcccacacc tatgatacca 2580 gtggagacag agtcaatttt ctatcgtcaa atggtgtaat aaccatacca gctggagaac 2640 aaatgacgct ctcagctcca tattattcaa acaagccact gaggacagtc agggacgata 2700 atagccttgg gtacctgatg tgtaaacctt tcctgactgg gaccaccact ggtaagattg 2760 aagtttatct tagtttgagg tgtccaaatt ttttctttcc tttacctgca cctaaagtca 2820 cgaccagccg cgccatacgg ggcgatatgg caaatttttc agaccagagt ccttacggcc 2880 aacaatcaca aggtcaagtg atgaagctgg cttacttgga taggggtttc tacaagcatt 2940 atggaattat tgttggggat agtgtgtatc aattagactc agatgatata ttcaaaacag 3000 ccctaacagg aaaagctagg ttcactaaga caaaactgac cccagactgg gtcgtagagg 3060 aagaatgtga attagattat ttcagggtga aataccttga gtcatcagtt aactcagagc 3120 atatcttctc agtagaccac aactgtgaaa ccattgctaa ggatattttt ggcacccaca 3180 ctcttagcca acatcaggcc attggtttaa taggcacaat ccttttgact gctggtttaa 3240 tgtcaactat taaaacacca gtcaatgcca caaccatcaa agagttcttc aatcacgcag 3300 ttgaaggtga tgaacaggga ttatctttac ttgtacagaa atgtaccact ttcttttctt 3360 ctgctgctac agaaatttta gataatgacc tggttaaatt tatagtgaag atacttgtta 3420 gaattctttg ttacatggtg ttatattgcc acaaaccaaa catcttaacc actgcttgct 3480 tatccacatt gctaataatg gatgttacat cttcaacagt gttgtcaccc tcttgtaagg 3540 cgctcatgca atgcctaatg gacggtgatg tcaagaaatt ggctgaagtt gtggcagagt 3600 ccatgtccaa tactgatgat gaagaaatca aagaacaaat ttgtgataca gtaaaataca 3660 caaagagtat tctgtcaaac cagggaccct ttaagggttt caatgaagta tcaactgcat 3720 tcaggcatat tgattggtgg atacataccc tattgaaaat aaaagacatg gttctgagtg 3780 tatttaaacc cagtatggag agtaaagcaa ttcagtggtt agagagaaac aaagagcatg 3840 tatgtgccat attagactat gcttctgata ttattgtaga atcaaaggac caaacaagaa 3900 tgaaatcaca agaattttat caaaagtata cagattgtct agccaaattc aaaccaatta 3960 tggccatttg cttcagaagt tgtcacaaca gtattagtaa tactgtctac aggcttttcc 4020 aggaattggc aagaatacca tcaaggatta gcacccagaa tgatttaatc agggttgaac 4080 caattggagt ttggatccaa ggtgaaccag gtcaaggtaa atctttccta acacacacac 4140 tgtcacgaca attacaaaaa tcttgtaaat tgaatggtgt ttatacaaac ccaacagcca 4200 gtgaatttat ggatggttat gacaatcagg atatacatct tatagacgac ttgggtcaaa 4260 caagaaaaga aaaagacatt gaaatgttat gtaattgcat ctcctcagtt ccttttattg 4320 ttccaatggc acatttggaa gagaaaggta aattttacac cagtaaattg gtaatagcta 4380 ctaccaataa gtctgacttt tccagtacag tactacaaga ttcaggtgct ctgaagagaa 4440 ggttcccata cattatgcat atcagggccg ctaaagccta cagcaaatca ggtaagctta 4500 atgttagtca agccatgtca acgatggcca caggtgagtg ttgggaagtg tctaagaatg 4560 gtagggattg ggaaactctc aaattgcagg atctagtcaa gaaaattact gaagattatg 4620 aagagagaca aaagaattat aattgctgga aacaacagct tgagaaccag accttagatg 4680 acctagatga tgctgtgtca tacatcaagc ataatttccc agatgcaatt ccctatattg 4740 atgagtatct taacattgag atgtccaccc tgattgaaca aatggaagct tttattgaac 4800 caagacctag tgtctttaaa tgttttgcca ccaaagttgc gaaccagaca aggaaagcag 4860 ccaaggaagt tgtggaatgg tttagcaaca agatcaaatc aatgttgagc tttgtggaaa 4920 gaaataaggc ttggttaaca gtagtctcgg ctgttactag tgcgatcagc atattacttc 4980 tggttaccaa gatcttcaag aaagaagact ctaaggatga gagggcctat aaccccacac 5040 tccctgtggc aaaaccaaaa ggcacatttc cagtgacaca aagggaattt aagaatgaag 5100 caccttatga tggacaatta gaacacatta tatcccagat ggcatatatt actggctcaa 5160 ctactggcca tttgacacat tgtgccggtt atcagcatga tgaggttata ctccatggcc 5220 actccataaa gtatttggaa caggaagagg aattaacact acattacaaa aacaaggttt 5280 tcccaattga acaaccctct gtgacacagg tcacacttgg agggaagcca atggatttag 5340 caatactcaa atgtaagtta ccatttagat ttaagaagaa ctctaaatat tacaccaaca 5400 agattggaac ggaaagtatg ctaatttgga tgactgaaca gggtataatc acaaaggaag 5460 tccatagagt tcaccactct ggtggcatca gaaccagaga ggggactgag agcacaaaga 5520 ccattagtta cacagtgaaa tcttgtaaag gcatgtgtgg tggtttactc atttctaaag 5580 tagaaggaaa tttcaaaatt cttgggatgc acatagctgg caatggggag atgggtgtag 5640 ccatcccgtt taacttcctc aaaaatgaca tgtctgatca aggtattgtg acggaagtga 5700 cacctgtaca acccatatac attaacacta agtctcaaat ccacaagagc ccagtctacg 5760 gtgcagtgga agtcaaaatg gggccagcag ttttgaataa atcagatccc aggcttgagg 5820 aaccagttga atgcctaatt aagaaatcag ctgcaaagta tagagttaac aaattccaag 5880 tcaacaatga actatggcaa ggtgtcaagg cctgtgtcaa atccaaattc agagaaattt 5940 ttggagtcaa tggtgttgta gacatgaaaa cagccatttt gggtacgtca catgtgaact 6000 ctatggattt gagtacatct gctggatata gctttgtgaa atcaggtnnn nnnnnnnnnn 6060 nnnnnnnnnn nnnnnnnnnn ttctctgttt cacctatgct tgaaaaactc gtacaagata 6120 aattccatgc attacttaag ggtaaccaaa tttctacaat tttcaacact tgtctaaaag 6180 atgaactaag aaaattggat aaaatttcag ctggtaagac taggtgcatt gaggcctgtg 6240 aagttgacta ttgcatagtc tatagaatga ttatgatgga aatttatgac aagatttatc 6300 agacaccatg ctattattca ggtctcgctg ttgggattaa cccatacaaa gattggcact 6360 tcatgatcaa tgcactaaat gagtataatt atgagatgga ctattctcag tatgatggct 6420 cccttagctc aatgctattg tgggaggcag tagaggtctt ggcttactgt catgattcac 6480 ctgatcttgt catgcaatta cacaaaccag taattgactc agaccatgtg gttttcaatg 6540 agagatggtt aattcacggc ggtatgccgt cagggtcacc atgtactact gtgttgaatt 6600 cattgtgcaa tctgatgatg tgcatttaca ctaccaattt gatcagccca ggaattgact 6660 gtttaccaat tgtttatggt gatgatgtaa tcttgtcact tgataaagaa atagatccag 6720 agaaactgca aggtatcatg gcagattcat ttggagctga agtgactggt tctcgcaagg 6780 atgagcctcc atcattaaaa cccaggatgg aggtggaatt cctaaagcgt aaacctggtt 6840 acttcccaga gtctacattt atagtaggga aattggacac tgaaaatatg atacaacact 6900 taatgtggat gaagaatttc agcacattta agcagcaact tcaatcctac ttaatggagt 6960 tatgcctcca tggaaaagac acttatttac actacatcaa aattttagat ccttatttga 7020 aagagtggaa tatcacagtg gatgattatg atgtggtcat agctaagttg atgcccatgg 7080 tgtttgattg aaataatgtt ttggtttctt tttgttatgg acta 7124 // ID MG026490; SV 1; linear; genomic RNA; STD; VRL; 7102 BP. XX AC MG026490; XX DT 01-SEP-2018 (Rel. 137, Created) DT 23-SEP-2018 (Rel. 138, Last updated, Version 2) XX DE Human parechovirus 1 isolate ETH_P28_2016 polyprotein gene, complete cds. XX KW . XX OS Human parechovirus 1 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Parechovirus. XX RN [1] RC Publication Status: Online-Only RP 1-7102 RX DOI; .1371/journal.pone.0202054. RX PUBMED; 30114205. RA Altan E., Aiemjoy K., Phan T.G., Deng X., Aragie S., Tadesse Z., RA Callahan K.E., Keenan J., Delwart E.; RT "Enteric virome of Ethiopian children participating in a clean water RT intervention trial"; RL PLoS One 13(8):e0202054-e0202054(2018). XX RN [2] RP 1-7102 RA Altan E., Delwart E.; RT ; RL Submitted (28-SEP-2017) to the INSDC. RL Molecular Virology, Blood Systems Research Institute, 270 Masonic Ave, San RL Francisco, CA 94118, USA XX DR MD5; 606326ea661ce5f9076434c26ab2c8cf. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R10 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..7102 FT /organism="Human parechovirus 1" FT /host="Homo sapiens" FT /isolate="ETH_P28_2016" FT /mol_type="genomic RNA" FT /country="Ethiopia" FT /isolation_source="feces" FT /collection_date="Apr-2016" FT /db_xref="taxon:12063" FT CDS 538..7077 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:A0A346M133" FT /db_xref="InterPro:IPR000199" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR009407" FT /db_xref="InterPro:IPR009419" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:A0A346M133" FT /protein_id="AXQ03889.1" FT /translation="METIKSIADMATGVISSVDSSVNAVNEKVENVGNEIGGNLLTKVA FT DDASNVLGPNCYATTAEPENKDVVQATTTVNTTNLTQHPSAPTMPFTPDFSNVDTFHSM FT AYDITTGEKNPSKLVRLETHEWTPNWARGHQITHVELPKVFWDKNSKPAYGQSRYFAAV FT RCGFHFQVQVNVNQGTAGSALVVYEPKPVVTYDSKLEFGAFTNLPHVLMNLAETTQADL FT CIPYVADTNYVKTDSSDLGQLKVYVWTPLSIPSGSATQVDVTILGSLLQLDFQNPRVFG FT QDVGIYDNAPTRKQNLKKILTMSTKYKWTRQKIDIAEGPGSMNMANVLSTTAAQSIALV FT GERAFYDPRTAGSKSRFDDLVKISQLFSVMSDSTTPSANHGIDAKGYFKWSSTTAPQSV FT VHRNIVYLKLFPNLNVFVNSYSYFRGSIVLRLSVYASTFNRGRLRMGFFPNATEDSTSE FT IDNAIYTICDLGSDNSFEITIPYSFSTWMRKTDGHPIGLFQIEVLNRLTYNSSSPNEVY FT CIVQGKMGQDARFFCPTGSVVTFQNSWGSQMDLTDPLCLEESAEECKQTISPNELGLTS FT AQDDGPLGDNKPNYFLNFKSMNVDIFTVSHTKVDNLFGRAWFYQGHTFTDEGQWRVSLE FT FPKQGHGSLSLLFAYFTGELNIHVLFLAEKGFLRVAHTYDTSENRVNFLSSNGVITIPA FT GEQMTLSAPYYSNKPLRTVRDSNSLGYLMCKPFLTGTTTGKIEVYLSLRCPNFFFPLPA FT PKVTTGRALRGDLANFSDQSPYDRQPQNQVMKLAYLDRGFYKHYGIVVGDSIYQLDSDD FT IFKTALTGKARFTKTKLTPDWIIEEECELDYFRVKYLESSVNSEHIFSVDNNCETIAKD FT IFGTHTLSQHQAIGLVGTILLTAGLMSTIKTPVTATTIKEFFNHAVDGDEQGLSLLVQK FT CTTFFSSAATEILDNDLVKFIVKILVRILCYMVLYCHKPNILTTACLSTLLIMDVTSSS FT VLSPSCKALMQCLMDGDVKKLAEVVAESMSNTDDDEIKEQICDTVKYTKTILSNQGPFK FT GFNEVSTAFRHIDWWIHTLLKIKDMVLSVFKPSLESKAIQWLERNKEHVCGVLDYASDI FT IVESKDQSKVKTQDFYQRYSDCLAKFKPIMAICFRSCHNSISNTVYRLFQELARIPNRI FT STNNDLIRIEPIGIWIQGEPGQGKSFLTHTLSRQLQKSCKLNGVFTNPTASEFMDGYDN FT QDIHLIDDLGQTRKEKDIEMLCNCISSVPFIVPMAHLEEKGKFYTSKLVIATTNKSDFS FT STVLQDSGALKRRFPYIMHIRAAKAYSKAGKLNVSQAMATMSTGECWEVSKNGRDWETL FT KLKDLVDKITSDYNERVKNYNAWKQQLENQTLDDLDDAVSYIKHNFPDAIPYIDEYLNI FT EMSTLIEQMEAFIEPKPSVFKCFANRIGSKISKASREVVDWFSDKMKSMLSFVERNKAW FT LTVVSAVTSANSILLLVTKIFKKEDSKDERAYNPTLPVTKPKGAFPVSQREFKNEAPYD FT GQLEHIISQMAYITGSTTGHLTHCAGYQHDEIILHGHSIKYLEQEEDLTLHYKNKVFPI FT EQPSVTQVTLGGKPMDLAILKCKLPFRFKKNSKYYTNKIGTESMLIWMTEQGIITKEVQ FT RVHHSGGIKTREGTESTKTISYTVKSCKGMCGGLLISKVEGNFKILGMHIAGNGEMGVA FT IPFNFLKNDMSDQGIVTEVTPIQPMYINTKTQIHKSPVYGAVEVKMGPAVLSKSDPRLE FT EPVECLIKESATKYRVNKFQVNNELWQGVKACVKSKFREIFGINGIVDMKTAILGTSHV FT SSMDLSTSAGYSFVKSGYKKKDLICLEPFSVSPMLEKLVQDKFHNLLKGNQITTIFNTC FT LKDELRKLDKIAAGKTRCIEACEVDYCIVYRMIMMEIYDKIYQTPCYYSGLAVGINPYK FT DWHFMINALNDYNYEMDYTQYDGSLSSMLLWEAVEVLAYCHDSPDLVMQLHKPVIDSDH FT VVFNERWLIHGGMPSGSPCTTVLNSLCNLMMCIYTTNLISPGIDCLPIVYGDDVILSLD FT KEIDPEKLQSIMADSFGAEVTGSRKDEPPSLKPKMEVEFLKRKPGYFPESTFIVGKLDT FT ENMIQHLMWMKNFSTFKQQLQSYLMELCLHGKDTYLHYIKILEPYLKEWNITVDDYDVV FT IAKLMPMVFD" XX SQ Sequence 7102 BP; 2265 A; 1357 C; 1483 G; 1997 T; 0 other; aggctacttg caataagatt agtgggaaca agacgcttaa agcatggtgc aaaataactt 60 ttctaactca cattctatgt ggggtggcag atggcgtgcc ataattctat cagtgagata 120 ccacgcttgt ggaccttatg ctcacacagc catcctctag taagtttgtg agacgtctgg 180 tgacgtgtgg gaacttattg gaaacaacat tttgctgtaa agcatccaat tgccagcgga 240 acaacacctg gtaacaggtg cctctggggc caaaagccaa ggtttaacag acccttttgg 300 attggttcta aacctgagat gttgtggaag atacttagta cctaccaatc tggtagtagt 360 gcaaacacta gttgtaaggc ccacgaagga tgcccagaag gtacccgtag gtaacaagtg 420 acactatgga tctgatctgg ggctaggtgc ctctatcttg gtgacctggt taaaaaacgt 480 ctagtgggcc aaacccaggg gggatccctg gtttcccttt attttatcaa tgccactatg 540 gagacaatta aaagtattgc agatatggcg accggagtga tcagctcagt tgattcatct 600 gtcaatgcag tcaatgagaa ggtggagaat gtgggcaatg aaatcggagg caatctacta 660 accaaagttg cagatgatgc atctaatgtg cttggaccaa attgttatgc tacaacagct 720 gagccagaga ataaagatgt agtacaagca accacaactg ttaatacaac aaatttaaca 780 caacatcctt ctgcacctac aatgcctttc actcctgatt tctccaatgt ggacacgttc 840 cactcaatgg catatgatat caccactgga gagaaaaacc ccagcaaatt ggttagattg 900 gaaacacatg aatggacacc aaactgggct agaggacatc aaattactca tgtggaatta 960 ccaaaagtct tctgggataa gaacagtaag ccagcttacg gtcaatcaag atactttgca 1020 gcagtacggt gtggtttcca ttttcaggta caagtgaatg ttaatcaagg tacagctggt 1080 agtgcactgg tggtatatga acctaaacct gttgtgacat atgactcaaa gttagaattt 1140 ggagcattta ctaatcttcc gcatgtgcta atgaacttgg cagagaccac acaggctgat 1200 ttatgtatcc cctatgtagc tgacacaaac tatgttaaaa cagattcgtc agacttaggg 1260 caactaaaag tctatgtttg gacacccttg tccatacctt caggttctgc tacacaagtt 1320 gatgtgacca tattgggtag cctattgcaa ttggacttcc aaaatcctag ggtatttggt 1380 caagacgttg gtatttatga caatgcacca acacggaagc aaaatcttaa gaaaatactc 1440 accatgagca ctaaatacaa gtggactaga caaaagattg acatagctga gggaccaggt 1500 tccatgaata tggcgaatgt attgagtacc actgcagcgc aatcaattgc actagttgga 1560 gaaagagcat tttatgaccc aagaacagca ggaagcaaga gtagatttga tgatttagta 1620 aaaatatctc aacttttttc tgtaatgagt gactccacaa ccccttcagc caatcacggt 1680 atagatgcaa aaggttattt caagtggtca tctacaactg ctccacaatc tgtggtacat 1740 agaaatattg tttacttaaa attgtttccc aatttgaatg tatttgtcaa cagctattca 1800 tattttagag gctcaatagt gctgaggttg agtgtctatg ctagcacttt caatagaggc 1860 cgtttacgga tgggtttctt tccaaatgcc actgaagaca gcacttcaga aatagataat 1920 gccatataca caatttgtga tttgggcagt gacaatagtt ttgaaatcac catcccatac 1980 tcattttcca cctggatgcg gaaaacagat ggccacccta ttggactgtt tcagatagag 2040 gtgcttaata ggttaactta caacagctcc agtcctaatg aagtttactg tattgtgcaa 2100 ggtaaaatgg ggcaggatgc caggttcttt tgtccaactg gttctgtagt gactttccaa 2160 aattcatggg gttcacaaat ggacttaact gatccactat gtttagaaga atctgcagag 2220 gaatgcaaac agaccatatc gccaaatgaa ctgggattaa catcagctca ggacgatggg 2280 cctttgggtg acaacaagcc aaattatttc ctaaacttca agtctatgaa tgtagacatt 2340 ttcactgttt cccataccaa ggtagacaac ctatttggaa gagcttggtt ttaccaggga 2400 cacactttca ccgatgaagg acagtggaga gttagtttag agttcccaaa acaaggtcat 2460 ggttcacttt ccctgctatt cgcctatttt acaggtgaac taaatataca tgttttgttc 2520 ctggctgaaa agggatttct tagagtggct cacacttatg acacatcaga aaacagagta 2580 aacttcttgt catccaatgg tgttatcaca atcccagcag gagaacaaat gacattgtct 2640 gcaccctact attcaaataa gccccttaga acagttagag atagcaatag tcttgggtac 2700 ctaatgtgta aaccatttct tacaggaaca acaacaggaa aaatagaggt ctatcttagt 2760 ctgaggtgtc caaatttctt tttccctctc cccgcaccta aagttacaac tggtcgtgcc 2820 ttacggggtg atttggcaaa cttctcagat cagagtccat atgatcgaca accacagaat 2880 caagtgatga aattagccta tttggacagg ggtttctaca agcactatgg cattgtggtg 2940 ggggacagta tttatcaatt ggactcagat gacattttca agacagcttt aacaggaaaa 3000 gctaggttca ctaagacaaa gttaactcca gattggatta ttgaggaaga atgcgaattg 3060 gattatttca gggtgaaata ccttgaatcc tctgtcaact cagagcatat cttctcagtg 3120 gacaataact gtgaaactat tgccaaggac atctttggca cccatacact tagtcaacac 3180 caggctatag ggttagtagg tacaatcctc ttaaccgctg gcctgatgtc aactatcaaa 3240 accccggtga ctgctaccac aattaaagaa tttttcaatc atgcagttga tggtgatgaa 3300 caaggtttgt ctttgcttgt gcaaaaatgt accactttct tctcttcagc tgcaacagag 3360 atcctagaca acgacttggt taaatttata gtcaaaatat tggttagaat cctatgttac 3420 atggtcttgt attgtcataa accaaatatc ctgactacag cttgtttgtc cactcttttg 3480 atcatggacg tgacttcttc atcagtcttg tcaccttcct gcaaagcttt gatgcagtgc 3540 ttgatggatg gcgatgtgaa aaagcttgct gaagtcgtag ctgagtcaat gtccaacaca 3600 gatgatgatg agattaagga gcaaatttgt gacacagtaa aatacactaa gacaatccta 3660 tcaaatcagg gaccattcaa aggttttaat gaggtttcca ctgcatttag gcatatagat 3720 tggtggatcc acactttgct taaaattaag gatatggtgt tgagtgtgtt caaacccagt 3780 ctagaaagta aagccataca atggttggaa agaaacaagg aacatgtgtg tggggttctg 3840 gattatgctt ctgatatcat tgttgagtca aaagatcagt caaaggtcaa aacccaagat 3900 ttttatcaaa gatattcaga ttgtctagct aaatttaagc caatcatggc catttgcttc 3960 aggagttgtc ataatagtat tagcaacaca gtatatagac ttttccaaga attggctaga 4020 attcccaata ggatcagcac taataatgac ttaatcagaa ttgaacctat tggcatttgg 4080 atccagggcg aaccggggca aggcaaatct ttcctaactc atactttatc aagacagtta 4140 caaaaatcat gtaaactcaa tggagttttc accaacccaa ctgccagtga gttcatggat 4200 ggttatgata accaggacat ccatctaata gatgacttgg gccaaacaag gaaggaaaaa 4260 gacattgaaa tgctatgcaa ctgtatttca tctgttccct ttatagtacc aatggcacac 4320 cttgaggaaa aaggaaagtt ctacactagt aagttagtta ttgccaccac caacaaatca 4380 gatttctcta gtacagtcct tcaagattct ggagcactga agaggagatt cccctacatt 4440 atgcacattc gggcagcaaa ggcctatagt aaagctggaa agctcaatgt gagccaagct 4500 atggctacaa tgtcaactgg agaatgttgg gaagtgtcaa agaatggtag agattgggaa 4560 acattaaaat taaaagatct ggttgacaaa atcacttctg attacaatga gagggtcaaa 4620 aattacaatg cttggaaaca gcaattagag aatcaaaccc ttgatgattt agatgatgca 4680 gtttcatata ttaagcacaa tttcccggat gccataccat acattgatga gtacctcaat 4740 attgaaatgt caaccttaat tgagcaaatg gaagcattca ttgagccaaa acccagtgtg 4800 tttaagtgtt ttgctaacag aattggatca aaaatttcta aagcttctag agaagttgtg 4860 gattggttct cagataagat gaagtccatg ctcagctttg ttgaaagaaa caaagcgtgg 4920 cttacagttg tttctgcagt caccagtgct aatagtatac tactattagt gacaaaaatc 4980 ttcaagaaag aagattcaaa ggatgaaaga gcatacaacc caaccctccc agttactaag 5040 cctaagggag ctttcccagt ctctcaacgg gagttcaaga atgaagcacc ttatgatgga 5100 caactggagc acattatttc tcagatggct tacattactg gttcaaccac tggccacttg 5160 acacattgtg caggttacca acatgatgaa attatccttc atggtcactc cattaagtac 5220 ttggagcagg aggaagattt gacattacat tataaaaaca aagtcttccc aatagaacag 5280 ccttctgtga ctcaagtcac tttgggtggt aaacctatgg atttagctat ccttaaatgc 5340 aaattgccat ttaggtttaa aaagaattcc aagtactata ccaataaaat tggaacagag 5400 agcatgttga tttggatgac tgaacaaggt ataataacaa aggaagtcca gagagttcac 5460 cactccggtg gcattaaaac tagagaagga actgagagca caaaaaccat tagttacaca 5520 gtaaaatctt gcaaaggtat gtgtgggggt ttgctcattt ctaaagtaga aggaaatttc 5580 aaaatccttg gaatgcacat agctggtaat ggggaaatgg gtgttgccat cccatttaac 5640 tttcttaaaa atgatatgtc tgaccaaggc attgtgacag aagtgacacc catacaaccc 5700 atgtacatca acactaagac ccaaattcac aagagcccag tgtatggtgc tgtagaagtc 5760 aaaatgggcc ctgcagtttt gagtaagtca gatccaaggc ttgaagagcc ggtagaatgc 5820 ttgattaagg aatcagctac aaaatatagg gttaataaat tccaggtcaa caatgaactg 5880 tggcagggcg ttaaggcctg tgtcaagtcc aaatttagag agatttttgg gatcaatggt 5940 attgttgaca tgaaaacagc tattttggga acatcccatg taagctccat ggatctgagt 6000 acatcagcag gatacagttt tgtcaagtca gggtacaaaa agaaagatct gatttgtctt 6060 gaacctttct ctgtctcacc tatgcttgag aaacttgtgc aagacaagtt tcataactta 6120 cttaagggca accaaattac tacaattttc aacacatgtc ttaaagatga gcttaggaaa 6180 ttagataaaa ttgcagctgg taagactaga tgcattgaag catgtgaagt tgattactgt 6240 attgtttaca ggatgatcat gatggagatt tatgacaaaa tttaccagac cccttgttat 6300 tactctggtc ttgcagttgg aattaacccc tataaagatt ggcacttcat gattaatgca 6360 ttaaatgatt acaattatga aatggactac acccagtatg atggttccct tagttcgatg 6420 ttattgtggg aagcagtgga ggtcttggct tactgtcacg attcacctga tcttgtcatg 6480 caattacata aaccagtaat tgactcagac catgtggtct tcaacgagag atggttaata 6540 catggcggca tgccatcggg gtcaccatgc actactgtgt tgaattcatt gtgcaatcta 6600 atgatgtgca tttacactac caatttaatc agcccaggaa ttgattgctt gccaattgtt 6660 tatggagatg atgttatcct gtcacttgat aaagaaatag acccagagaa actgcaaagt 6720 atcatggcag attcatttgg tgctgaagtg actggttcgc gcaaggatga gcctccttca 6780 ttaaaaccca aaatggaggt ggaatttctg aagcgtaagc ccggttactt cccagagtct 6840 acatttatag taggaaaatt ggacactgaa aacatgatac aacacttaat gtggatgaaa 6900 aatttcagca cattcaagca gcaacttcaa tcctacttaa tggagttatg cctccatgga 6960 aaagacactt atttgcacta catcaaaatt ttggaaccat acttgaagga gtggaatatc 7020 acagtggatg attatgatgt tgtcattgct aagttgatgc ccatggtgtt tgattaaaat 7080 taatgttttg gtttttcttt gt 7102 // ID MG026491; SV 1; linear; genomic RNA; STD; VRL; 6871 BP. XX AC MG026491; XX DT 01-SEP-2018 (Rel. 137, Created) DT 23-SEP-2018 (Rel. 138, Last updated, Version 2) XX DE Human parechovirus 1 isolate ETH_P21_2016 polyprotein gene, partial cds. XX KW . XX OS Human parechovirus 1 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Parechovirus. XX RN [1] RC Publication Status: Online-Only RP 1-6871 RX DOI; .1371/journal.pone.0202054. RX PUBMED; 30114205. RA Altan E., Aiemjoy K., Phan T.G., Deng X., Aragie S., Tadesse Z., RA Callahan K.E., Keenan J., Delwart E.; RT "Enteric virome of Ethiopian children participating in a clean water RT intervention trial"; RL PLoS One 13(8):e0202054-e0202054(2018). XX RN [2] RP 1-6871 RA Altan E., Delwart E.; RT ; RL Submitted (28-SEP-2017) to the INSDC. RL Molecular Virology, Blood Systems Research Institute, 270 Masonic Ave, San RL Francisco, CA 94118, USA XX DR MD5; c7957cc0beabe6ab8ae3d67fe4e976ba. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R10 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6871 FT /organism="Human parechovirus 1" FT /host="Homo sapiens" FT /isolate="ETH_P21_2016" FT /mol_type="genomic RNA" FT /country="Ethiopia" FT /isolation_source="feces" FT /collection_date="Apr-2016" FT /db_xref="taxon:12063" FT CDS 466..>6871 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:A0A346M134" FT /db_xref="InterPro:IPR000199" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR009419" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:A0A346M134" FT /protein_id="AXQ03890.1" FT /translation="METIKSIADMATGVVSSVDSSVNAVNEKVENVGNEIGGNLLTKVA FT DDASNVLGPNCYATTAEPENKDVVQATTTVNTTNLTQHPSAPTMPFTPDFSNVDTFHSM FT AYDITTGEKNPSKLVRLETHEWTPTWARGHQITHVELPKVFWDKNSKPAYGQSRYFAAV FT RCGFHFQVQVNVNQGTAGSALVVYEPKPVVTYDSKLEFGAFTNLPHVLMNLAETTQADL FT CIPYVADTNYVKTDSSDLGQLKVYVWTPLSIPSGSATQVDVTILGSLLQLDFQNPRVFG FT QDVGIYDNAPTRKQNLKKILTMSTKYKWTRQKIDIAEGPGSMNMANVLSTTAAQSIALV FT GERAFYDPRTAGSKSRFDDLVKISQLFSVMSDSTTPSANHGIDAKGYFKWSSTTAPQSV FT VHRNIVYLKLFPNLNVFVNSYSYFRGSIVLRLSVYASTFNRGRLRMGFFPNATEDXXXE FT IDNAIYTICDLGSDNSFEITIPYSFSTWMRKTDGHPIGLFQIEVLNRLTYNSSSPNEVY FT CIVQGKMGQDARFFCPTGSVVTFQNSWGSQMDLTDPLRLEESAEECKQTISPNELGLTS FT AQDDGPLGDNKPNYFLNFKSMNVDIFTVSHTKVDNLFGRAWFYQEHTFTNEGQWRVSLE FT FPKQGHGSLSLLFAYFTGELNIHVLFLAEKGFLRVAHTYDTSENRVNFLSSNGVITIPA FT GEQMTLSAPYYSNKPLRTVRDSNSLGYLMCKPFLTGTTTGKIEVYLSLRCPNFFFPLPA FT PKVTTGRALRGDLANFSDQSPYDRQPQNQVMKLAYLDRGFYKHYGIVVGDSIYQLDSDD FT IFKTALTGKARFTKTKLTPDWIIEEECELDYFRVKYLESSVNSEHIFSVDNNCETIAKD FT IFGTHTLSQHQAIGLVGTILLTAGLMSTIKTPVTATTIKEFFNHAVDGDEQGLSLLVQK FT CTTFFSSAATEILDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTKTILSNQGPFK FT GFNEVSTAFRHIDWWIHTLLRIKDMVLSVFKPSLESKAIQWLERNKEHVCGVLDYASDI FT IVESKDQSKVKTQDFYQRYSDCLAKFKPIMAICFRSCHNSISNTVYRLFQELARIPNRI FT STNNDLIRIEPIGIWIQGEPGQGKSFLTHTLSRQLQKSCKLNGVFTNPTASEFMDGYDN FT QDIHLIDDLGQTRKEKDIEMLCNCISSVPFIVPMAHLEEKGKFYTSKLVIATTNKSDFS FT STVLQDSGALKRRFPYIMHIRAAKAYSKAGKLNVSQAMAIMSTGECWEVSKNGRDWETL FT KLKDLVDKITSDYNERVKNYNAWKQQLENQTLDDLDDAVSYIKHNFPDAIPYIDEYLNI FT EMSTLIEQMEAFIDPIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVTKPKGAFPVSQREFKNEAPYD FT GQLEHIISQMAYITGSTTGHLTHCAGYQHDEIILHGHSIKYLEQEEDLTLHYKNKVFPI FT EQPSVTQVTLGGKPMDLAILKCKLPFRFKKNSKYYTNKIGTESMLIWMTEQGIITKEVQ FT RXXXXXXXXXXXXXXXXXXXXYTVKSCKGMCGGLLISKVEGNFKILGMHIAGNGEMGVA FT IPFDFLKNDMSDQGIVTEVTPIQPMYINTKTQIHKSPVYGAVEVKMGPAVLSKSDPRLE FT EPVECLIKKSATKYRVNKFQVNNELWQGVKACVKSKFREIFGINGIVDMKTAILGTSHV FT NSMDLSTSAGYSFVKSGYKKKDLICLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXYDGSLSSMLLWEAVEVLAYCHDSPDLVMQLHKPVIDSDH FT VVFNERWLIHGGMPSGSPCTTVLNSLCNLMMCIYTTNLISPGIDCLPIVYGDDVILSLD FT KEIDPEKLQGIMADSFGAEVTGSRKDEPPSLKPRMEVEFLKRKPGYFPESTFIVGKLDT FT ENMIQHLMWMKNFSTFKQQLQSYLM" FT gap 3303..3573 FT /estimated_length=271 FT gap 4720..4956 FT /estimated_length=237 FT gap 5992..6324 FT /estimated_length=333 XX SQ Sequence 6871 BP; 1889 A; 1153 C; 1269 G; 1654 T; 906 other; ttctatgtgg ggtggcagat ggcgtgccat aattctatta gtgagatacc acgcttgtgg 60 accttatgct cacacagcca tcctctagta agtttgtgag acgtctggtg acgtgtggga 120 acttattgga aacaatatct tgctgtaaag cattcaactg ccagcggaac aacacctggt 180 aacaggtgcc tctggggcca aaagccaagg tttaacagac ccttttggat tggttctaaa 240 cctgagatgt tgtggaagat acttagtacc tactgatctg gtagtagtgc aaacactagt 300 tgtaaggccc acgaaggata cccaganggt acccgtaggt aacaagtgac actatggatc 360 tgatctgggg ctaggtgcct ctatcttggt gacctggtta aaaaacgtct agtgggccaa 420 acccaggggg gatccctggt ttccctttat tttatcgatg ccactatgga gacaattaaa 480 agtattgcag atatggcgac cggagtggtc agctcagttg attcatctgt caatgcagtc 540 aatgagaagg tggagaatgt gggcaatgaa atcggaggca atctactaac caaagttgca 600 gatgatgcat ctaatgtgct tggaccaaat tgttatgcta caacagctga gccagagaat 660 aaagatgtag tacaagcaac cacaactgtt aacacaacta atttaacaca acatccttcg 720 gcacctacaa tgcctttcac tcctgatttc tccaatgtgg acacattcca ctcaatggca 780 tatgatatca ccactggaga gaagaacccc agtaaattgg ttagattgga aacacatgaa 840 tggacaccaa cctgggctag aggacatcaa attactcatg tggaattacc aaaagtcttc 900 tgggataaga acagtaagcc agcttacggt caatcaagat actttgcagc agtacggtgt 960 ggtttccatt ttcaggtaca agtgaatgtt aatcaaggta cagctggtag tgcactggtg 1020 gtatatgaac ctaaacctgt tgtgacatat gactcaaagt tagaatttgg agcatttacc 1080 aatcttccgc atgtgctaat gaacttggca gagaccacac aggctgattt atgtatcccc 1140 tatgttgctg acacaaacta tgttaaaaca gattcgtcag acttagggca actaaaagtc 1200 tatgtttgga cacccttgtc cataccatca ggttctgcta cacaagttga tgtaaccata 1260 ttgggtagcc tattgcaatt ggacttccaa aatcctaggg tatttggtca agacgttggt 1320 atttatgaca atgcaccaac acggaagcaa aatcttaaga aaatactcac catgagcact 1380 aaatacaagt ggactagaca aaaaattgac attgctgagg gaccaggttc tatgaatatg 1440 gcgaacgtgt tgagtaccac tgcagcgcaa tcaattgcac tagttggaga aagagcattt 1500 tatgacccaa gaacagcagg aagcaagagt agatttgatg atttagtaaa aatatctcaa 1560 cttttttctg taatgagtga ctccacaacc ccttcagcca atcacggtat agatgcaaaa 1620 ggttatttca agtggtcatc tacaactgct ccacaatctg tggtacatag aaatattgtt 1680 tacttaaaat tgttccccaa tttgaatgta tttgtcaaca gctattcata ttttagaggc 1740 tcaatagtgc tgaggttgag tgtctatgct agcactttca atagaggccg tttacggatg 1800 ggtttctttc caaatgccac tgaagacagn nnnnnngaga tagataatgc catatacaca 1860 atttgtgatt tgggcagtga caatagtttt gaaatcacca tcccatactc attttccacc 1920 tggatgcgga aaacagatgg tcaccctatt ggactgtttc agatagaggt gcttaatagg 1980 ttaacttaca acagctccag tcctaatgaa gtttactgta ttgtgcaagg taaaatgggg 2040 caggatgcca ggttcttttg tccaactggt tctgtagtga ctttccagaa ttcatggggt 2100 tcacaaatgg acttgactga tccactacgt ttagaagaat ctgcagagga atgcaaacag 2160 accatatcgc caaatgagct gggattaaca tcagctcagg acgatgggcc tttgggtgac 2220 aacaagccaa attatttcct aaacttcaag tctatgaatg tagacatttt cactgtttcc 2280 cataccaagg tagacaacct atttggaaga gcttggtttt accaggaaca cactttcacc 2340 aatgaaggac agtggagagt tagtttagag ttcccaaaac aaggtcatgg ttcactttcc 2400 ctgctattcg cctattttac aggtgaacta aatatacatg ttttgttcct ggctgaaaag 2460 ggatttctta gagtggctca cacttatgac acatcagaaa acagagtaaa cttcttgtca 2520 tccaatggtg ttatcacaat cccagcagga gaacaaatga cattgtctgc accctactat 2580 tcaaataagc cccttagaac agttagagat agcaatagtc ttgggtacct aatgtgtaaa 2640 ccattcctta caggaacaac aacaggaaaa atagaggtct atcttagtct gaggtgtcca 2700 aatttctttt ttcctcttcc cgcacctaaa gttacaactg gtcgtgcctt acgaggtgat 2760 ttggcaaact tctcagatca gagcccatat gatcgacaac cacagaatca agtgatgaaa 2820 ttagcctatt tggacagggg tttctacaag cactatggca ttgtggtggg ggacagtatt 2880 tatcaattgg attcagatga cattttcaag acagctttaa caggaaaagc taggttcact 2940 aagacaaagt taactccaga ctggattatt gaggaagaat gcgaattgga ttatttcaga 3000 gtgaaatacc ttgaatcctc tgtcaactca gaacatatct tctcggtgga caacaactgt 3060 gaaactattg ccaaggacat ctttggcacc catacactta gccaacacca ggctataggg 3120 ttagtaggta caatcctctt aaccgctggc ttgatgtcaa ctatcaaaac cccggtgact 3180 gctaccacaa ttaaagaatt tttcaatcat gcagttgatg gtgatgaaca aggtttgtct 3240 ttacttgtgc aaaaatgtac tactttcttc tcttcagctg caacagagat cctagacaac 3300 gannnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3360 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3420 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3480 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3540 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnactaaga caatcctatc aaatcaggga 3600 ccattcaaag gttttaatga ggtttccact gcatttaggc atatagattg gtggatccat 3660 actttgctta gaattaagga tatggtgttg agtgtgttca aacccagttt agaaagtaaa 3720 gccatacaat ggttggaaag aaacaaggaa catgtgtgtg gggtactgga ttatgcttct 3780 gatatcattg ttgagtcaaa agatcagtca aaggtcaaaa cccaagattt ttaccaaaga 3840 tattcagatt gtctagctaa atttaagcca atcatggcca tttgcttcag gagttgtcat 3900 aatagtatta gcaacacagt atataggctt ttccaagaat tggctagaat tcccaatagg 3960 atcagcacta ataatgactt aatcagaatt gaacctattg gtatttggat ccagggcgaa 4020 ccggggcaag gcaaatcttt cctaactcat actttatcaa gacagttaca aaaatcatgt 4080 aaactcaatg gagttttcac caacccaact gccagtgagt tcatggatgg ttatgataac 4140 caggacatcc atctaataga tgacttgggc caaacaagga aggaaaaaga tattgaaatg 4200 ctatgcaact gtatttcatc tgttcccttt atagtaccaa tggcacacct tgaggaaaaa 4260 ggaaagttct acactagtaa gttagtgatt gccaccacca acaaatcaga tttctctagt 4320 acagtccttc aagattccgg agcactgaag aggagattcc cctacattat gcacattcgg 4380 gcagcaaagg cctatagcaa agctggaaag ctcaatgtga gccaagctat ggctataatg 4440 tcaactggag aatgttggga agtgtcaaag aatggtagag attgggaaac attaaaatta 4500 aaagatctgg ttgacaaaat cacttctgat tacaatgaga gggtcaaaaa ttacaatgct 4560 tggaaacagc aattagagaa tcaaaccctt gatgatttag atgatgcagt atcttatatt 4620 aagcacaatt tcccggatgc tataccatac attgatgagt atctcaatat tgaaatgtca 4680 accttaattg agcaaatgga agcattcatt gatcctatcn nnnnnnnnnn nnnnnnnnnn 4740 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4800 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4860 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4920 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnccag ttactaagcc taagggagct 4980 ttcccagtct ctcaacggga gttcaagaat gaagcacctt atgatggaca actggagcac 5040 attatttccc agatggctta tattactggt tcaaccactg gccacttgac acattgtgca 5100 ggttaccaac atgatgaaat tatccttcat ggtcactcca ttaaatactt ggagcaggag 5160 gaagatttga cattacatta taaaaacaaa gtcttcccaa tagaacagcc ttctgtgact 5220 caagtcacgt tgggtggtaa gcctatggat ttagctattc ttaaatgcaa attgccattt 5280 aggtttaaaa agaattctaa gtactatacc aataaaattg gaacagagag catgttgata 5340 tggatgactg aacaaggtat aataacaaag gaagtccaga gagnnnnnnn nnnnnnnnnn 5400 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn gttacacagt gaaatcttgc 5460 aaaggtatgt gtgggggttt gctcatttct aaagtagaag gaaatttcaa aatccttgga 5520 atgcacatag ctggtaatgg ggaaatgggt gttgccatcc cattcgactt tcttaaaaat 5580 gatatgtctg accaaggcat tgtgacagaa gtgacaccca tacaacccat gtacatcaac 5640 actaagaccc aaattcacaa gagcccagtg tatggtgctg tagaagtcaa aatgggccct 5700 gcggttttga gtaagtcaga tccaaggctt gaagagccgg tagaatgctt gattaagaaa 5760 tcagctacaa aatatagggt taataaattc caggtcaata atgaactgtg gcagggcgtt 5820 aaggcctgtg tcaagtcgaa atttagagag atttttggga tcaatggtat tgttgacatg 5880 aagacagcta ttttgggaac atcccatgta aactccatgg atctgagtac atcagcagga 5940 tacagttttg tcaagtcagg atacaaaaag aaagatctga tttgtcttga annnnnnnnn 6000 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6060 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6120 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6180 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6240 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6300 nnnnnnnnnn nnnnnnnnnn nnnntatgat ggttccctta gttcgatgtt attgtgggaa 6360 gcagtggagg tcttggctta ctgtcacgat tcacctgatc ttgtcatgca attacataaa 6420 ccagtaattg actcagacca tgtggtcttc aacgagagat ggttaataca tggcggtatg 6480 ccatcggggt caccatgcac tactgtgttg aattcattat gcaatctaat gatgtgcatt 6540 tacactacca atttaatcag tccaggaatt gattgcttgc caattgttta tggagatgat 6600 gtcatcctgt ctcttgataa agaaatagac ccagagaaac tgcaaggtat catggcagat 6660 tcatttggtg ctgaagtgac tggttcgcgc aaggatgagc ctccttcatt aaaacccaga 6720 atggaggtgg aatttctgaa gcgtaagccc ggttacttcc cagagtctac atttatagta 6780 ggaaaattgg acactgaaaa catgatacaa catttaatgt ggatgaaaaa tttcagcaca 6840 ttcaagcagc aacttcaatc ctacttaatg g 6871 // ID MG026492; SV 1; linear; genomic RNA; STD; VRL; 2851 BP. XX AC MG026492; XX DT 01-SEP-2018 (Rel. 137, Created) DT 23-SEP-2018 (Rel. 138, Last updated, Version 2) XX DE Human parechovirus 8 isolate ETH_P25_2016 polyprotein gene, partial cds. XX KW . XX OS Human parechovirus 8 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Parechovirus. XX RN [1] RC Publication Status: Online-Only RP 1-2851 RX DOI; .1371/journal.pone.0202054. RX PUBMED; 30114205. RA Altan E., Aiemjoy K., Phan T.G., Deng X., Aragie S., Tadesse Z., RA Callahan K.E., Keenan J., Delwart E.; RT "Enteric virome of Ethiopian children participating in a clean water RT intervention trial"; RL PLoS One 13(8):e0202054-e0202054(2018). XX RN [2] RP 1-2851 RA Altan E., Delwart E.; RT ; RL Submitted (28-SEP-2017) to the INSDC. RL Molecular Virology, Blood Systems Research Institute, 270 Masonic Ave, San RL Francisco, CA 94118, USA XX DR MD5; 3e086d323315100b088e54446f6dadbd. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R10 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..2851 FT /organism="Human parechovirus 8" FT /host="Homo sapiens" FT /isolate="ETH_P25_2016" FT /mol_type="genomic RNA" FT /country="Ethiopia" FT /isolation_source="feces" FT /collection_date="Apr-2016" FT /db_xref="taxon:602074" FT CDS 551..>2851 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:A0A346M135" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:A0A346M135" FT /protein_id="AXQ03891.1" FT /translation="MEAIKGIADMATAAVNGLDSTLNQSNETPINSNATGGDIITRVAD FT DASNVLGPNCYATTSEPENKNVVQATTTVNTTNLTQHPSAPTIPFAPDFSNVDSFHSMA FT YDVTTGDKNPSKLVRLKTTEWMQTWERSHQIDYVELPKAFWASEKMPAFGQSKYFAAVR FT CGFHFQVQINVNQGTAGSALVVYKPKPVVDNDGKLEFGSYTNLPHVLMNLAETTQADLC FT IPYVSDTNYVMTDSSDLGLLEVYVWTPLSIPHGSATQVDVTILGSLLQLDFQNPRVYGQ FT NVNIYDNAPTPGRMRKYLTMSTKYKWTRSKIDIAEGPGSMNLANVLSTTGAQSLALVGE FT RAFYDPRTAGSKSRFDDLVKIAQMFSVMGDNSQPSTSEGIDARGYFKWAASKAPQQVIH FT RNLIYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXIDIGTDNSFEITIPYSFSTWMRKTKGHTIGLFQVEVLNRLTYNSSSPSEVYCI FT VQGRLGSDAKFYCPTGSVTTFQNSWGSQMDLSDPLCIEDKVEECKQTLSPNELGLTSAQ FT DDGPLGNEKPNYFLNFKAINVDIFTVSHTKVDNIFGRAWYSMAHEFRNEGLWRTKLTFP FT KQGHGALSQFFAYYTGELNIHVLFLCEKGFLRVAHTYXXXXXXXXXXXXXXXXXIPAGE FT QMSLSAPFYSHRPLRTIRSEHALGYLLCQPMLTGTSSGKIEVYLSLRCPNLFFPIPAPK FT PANASWSLNPFSDE" FT gap 1763..1942 FT /estimated_length=180 XX SQ Sequence 2851 BP; 802 A; 538 C; 562 G; 720 T; 229 other; ggaccccatt attcgagggc atctagcaat aagaactgga gattaaggac gctcaaagca 60 tagtgtaagt attcttttct aacctgtgtt ttacacaggg tggcagatgg cgtgccataa 120 ccctattagt gagataccac acttgtggac cttatgctca cacagccatc ctctagtaag 180 tttgtaaggc gtctgatgac gtgtgggaac ttgttggaaa taatggtttg ctgtaaagca 240 tcctactgcc agtgggtaaa cacctggtaa caggtgcctc tggggccgaa agccaaggtt 300 taacagaccc tttaggattg gttcaaacct gaactattat ggaagacact tagtacctgc 360 tgatttggta gttgtgcaaa cactagttgt aaggcccacg aaggatgccc agaaggtacc 420 cttaggtaac aagtgacact atggatctga tctggggcca accacctcta tctcggtggg 480 ttggttaaaa aacgtctagt gggccaaacc caggggggac cctggtttcc ttttatttat 540 cataaacaat atggaggcta ttaagggcat tgcagatatg gcgacagcag ctgtcaatgg 600 acttgactct accctaaacc aatctaatga aacgccaatt aattccaatg ctactggggg 660 tgacataata actagagtgg cagatgatgc ctcaaatgtg ctgggaccta attgttatgc 720 tacaacatca gagccagaga acaaaaatgt tgtgcaggct acaactacag ttaacaccac 780 caatctcaca caacatccct ctgcacccac aatacccttt gcccctgatt tctcaaatgt 840 agacagtttt cattcaatgg catatgatgt gacaactggt gacaagaacc cgagtaaatt 900 agttaggttg aagaccactg agtggatgca aacatgggag agatcacacc aaatagacta 960 tgtggagcta ccaaaagcat tttgggcaag tgaaaagatg ccagctttcg gtcagtctaa 1020 gtattttgca gcagtaagat gtggatttca ttttcaagtg cagataaatg tgaatcaagg 1080 aactgctggt agtgctttgg ttgtttacaa gcctaaacca gtggttgata atgacggtaa 1140 attggaattt ggatcatata ccaacttgcc acatgttctg atgaacttag cagagaccac 1200 acaagctgac ttatgtatcc cctatgtttc agatacaaac tacgttatga cagattcatc 1260 tgacttaggg ctactagagg tgtatgtttg gaccccacta tcaatccccc atggctctgc 1320 aacccaagta gatgtgacaa tactgggtag tctattacag ctagactttc aaaatcctag 1380 agtttatggg caaaatgtaa acatctatga taatgctccc acaccaggta gaatgaggaa 1440 atatctcaca atgagtacta agtataaatg gactagatcc aagattgata ttgcagaagg 1500 acctggttct atgaatttgg caaatgttct cagtacaaca ggtgctcagt ctcttgcttt 1560 ggttggtgaa agagccttct acgatcccag gactgcaggc agcaaatcaa gatttgacga 1620 cttagttaaa attgcacaaa tgttttcagt tatgggtgat aatagtcaac cttcaacatc 1680 agaaggcatt gatgcgagag gttacttcaa atgggcagct agcaaagcac cgcaacaagt 1740 catacaccga aatctgatat acnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1800 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1860 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1920 nnnnnnnnnn nnnnnnnnnn nnatagacat tggtacagac aacagctttg aaataacaat 1980 accttattca ttctcaacat ggatgagaaa gaccaaaggc cataccattg gcttatttca 2040 agttgaagtc ttgaacagat tgacatacaa tagttcaagc cctagtgagg tatattgtat 2100 agtgcaaggg agattgggca gtgacgcaaa attttactgt cccactggtt cagtaacaac 2160 ctttcaaaat tcatggggtt cccaaatgga tttgagcgat cctttgtgta tagaggacaa 2220 agttgaagaa tgtaaacaga ctttatctcc caatgagtta ggcttaactt cagcccagga 2280 tgatggacca cttgggaatg aaaaacccaa ctatttccta aacttcaagg caataaatgt 2340 tgatatcttt actgtgagtc acaccaaagt agataacatc tttggcagag cttggtattc 2400 tatggcacat gagttcagaa atgagggcct atggagaacc aaactcacct ttccaaaaca 2460 gggtcatggt gctctttcac aattctttgc ttactacaca ggagaattaa atattcatgt 2520 gttgtttttg tgtgaaaaag gttttcttag agtggcccac acctatgann nnnnnnnnnn 2580 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnata ccagctggtg aacaaatgtc 2640 tctctctgct ccattttact cacatagacc actaagaaca attcgcagtg agcatgcgtt 2700 gggttatttg ttatgccaac ccatgctcac tggtacttca agtggcaaaa ttgaggttta 2760 tttaagcttg aggtgcccca atttattttt cccaattcca gcaccaaaac ctgctaatgc 2820 ttcttggtca ctaaaccctt ttagtgacga g 2851 // ID MG026493; SV 1; linear; genomic RNA; STD; VRL; 6926 BP. XX AC MG026493; XX DT 01-SEP-2018 (Rel. 137, Created) DT 23-SEP-2018 (Rel. 138, Last updated, Version 2) XX DE Salivirus sp. isolate ETH_P2_2016 polyprotein gene, partial cds. XX KW . XX OS Salivirus sp. OC Viruses; Riboviria; Picornavirales; Picornaviridae; Salivirus. XX RN [1] RC Publication Status: Online-Only RP 1-6926 RX DOI; .1371/journal.pone.0202054. RX PUBMED; 30114205. RA Altan E., Aiemjoy K., Phan T.G., Deng X., Aragie S., Tadesse Z., RA Callahan K.E., Keenan J., Delwart E.; RT "Enteric virome of Ethiopian children participating in a clean water RT intervention trial"; RL PLoS One 13(8):e0202054-e0202054(2018). XX RN [2] RP 1-6926 RA Altan E., Delwart E.; RT ; RL Submitted (28-SEP-2017) to the INSDC. RL Molecular Virology, Blood Systems Research Institute, 270 Masonic Ave, San RL Francisco, CA 94118, USA XX DR MD5; f4d162b774120a7290af57ecab4dc85a. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R10 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6926 FT /organism="Salivirus sp." FT /host="Homo sapiens" FT /isolate="ETH_P2_2016" FT /mol_type="genomic RNA" FT /country="Ethiopia" FT /isolation_source="feces" FT /collection_date="Apr-2016" FT /db_xref="taxon:2039694" FT CDS <1..6906 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:A0A346M136" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:A0A346M136" FT /protein_id="AXQ03892.1" FT /translation="RYPGHKMYPSVHTLFPDVSPLKIPQSVPAFAHLVQRQGLRRQGNS FT TTNIYGNGNDVTTDVGANGMSLPIAVGDMPTASCSEAPLGSNKGGSSTSPKSTSNGNVV FT RGSRYSKWWEPAAARALDRALDHAVDATDAVAGAASKGIKAGATKLSNKLSGSQTTALL FT ALPGNIAGGAPSATVNANNTSISSQALLPSVDPYPPTPAVSLPNPDAPTQVGPAADRQW FT LVDTLSWSETIPPLTVFSGPKALTPGAYPPTIEPNTGVYPLPAALCLSHPESVFSTAYN FT AHAYFNCGFDVTVVVNASQFHGGSLIVLAMAEGLGDITPADSSTWFNFPHTIINLANSN FT AATLKLPYIGVTPNTSTEGLHNYWTILFAPLTPLAVPTGSPTTVKVSLFVSPIDSAFYG FT LRFPVPFPTPQHWKTRAVPGAGTYGSVVAGQEIPLVGYAPAAPPRDYLPGRVHNWLEYA FT ARHSWERNLTWTSADEVGDQLVSYPIQPEALANTQTNTAYVLSLFSQWRGSLQISLIFT FT GPAQCYGRLLLAYTPPSANPPTTIDEANNGTYDVWDVNGDSTYTFTIPFCSQAYWKTVD FT IGSSSGLVSNNGYFTAFVMNPLVTPGPSPPSVTVAAFLHVADDFDVRLPQCPALGFQSG FT ADGAEVQPAPTSDLSDGNPTTDPAPRDNFDYPHHPVDPSTDLAFYFSQYRWFGLNDSLT FT PLGVTGGLFYHVSLNPINFQQSSLLSVLGAFTYVYANLSLNINISAPSQPCTFYVFYAP FT PGASVPSVQTLAELSFFTHTATPLSLTSPTNITVSIPYSSPQSVLCTSFGGFGLQNGGD FT AGNLHSNTWGTLIFYVDLPQSDSVSVSTYISFRDFEAYVPRQTPGVGPVPTSTSIVRVA FT RPTPKPRTTRRQGGTLADLILSPESRCFIVAHTTAPYYSILLVNPDEEYAISMFPHGDE FT SILRYSSRDGTRLAPTAPAFFLCAAASVDTTLPYSVSQSHLWLTDLTGIPLPVSXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVA FT KVAGDANIATSSQAIASSINSLSTSIDGATTFMQNFFSGLAPRNPTSPLQHLFAKLIKW FT VTKIIGSLIIICNNPTPSALIGVSLMLCGDLAEDITEFFSNLGNPLAAVFYRCARALGL FT SPTPQSAAQAAGGRQGVRDYNDIMNALRNTDWFFEKIMTHIKNLLEWLGVLVKDDPRSK FT LNSQHEKILELYTDSVTASSTPPSELSADAIRSNLDLAKQLLTLSHAANSVTHIQLCTR FT AITNYSTALSAISLVGTPGTRPEPLVVYLYGPPGTGKSLLASLLASTLAQALSGDPNNY FT YSPSSPDCKFYDGYSGQPVHYIDDIGQDPDGADWADFVNIVSSAPFIVPMADVNDKGRF FT YTSRVIIVTSNFPGPNPRSARCVAALERRLHIRMNVTARNGVAFSAAAALQPXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYLFTRAVNENRPVSFLSKIWSWRAPIFA FT ASSFLSLIAATLTIVRCLRDLRSTQGAYNGTPVPKPRKKDLPKQPVYSGPVRRQGFDPA FT VMKIMGNVDSFVTLSGSKPIWTMSCLWIGGRNLIAPSHAFVSDEYEITHIRVGSRTLDV FT SRVTRVDDGELSLLSVPDGPEHKSLIRYIRSASPKSGILASKFSDTPVFVSFWNGKPHS FT TPLPGVVDEKDSFTYRCSSFQGLCGSPMIATDPGGLGILGIHVAGVAGYNGFSARLTPE FT RVQTFLSHLATPQSVLHFHPPMGPPAHVSRRSRLHPSPAFGAFPITKEPAALSRKDPRL FT PEGTDLDAITLAKHDKGDIATPWPCMEEAADWYFSQLPDNLPVLSQEDAIRGLDHMDAI FT DLSQSPGYPWTTQGRSRRSLFDEDGNPLPELQEAIDSVWDGGSYIYQSFLKDELRPTAK FT ARAGKTRIVEAAPIQAIVVGRRLLGSLINHLQGNPLQHGSAVGCNPDIHWTQIFHSLTS FT FSNVWSIDYSCFDATIPSVLLSAIASRIAARSDQPGRVLDYLSYTTTSYHVYDSLWYTM FT IGGNPSGCVGTSILNTIANNIAVISAMMYCNKFDPRDPPVLYCYGDDLIWGSNQDFHPR FT ELQAFYQKFTNFVVTPADKASDFPDSSSIFDITFLKRYFVPDDVHPHLIHPVMDEQTLT FT NSIMWLRGGEFEEVLRSLETLAFHSGPKNYSAWCEKIKAKIRENGCDATFTPYSVLQRG FT WVSLCMTGPYPLTG" FT gap 2953..3137 FT /estimated_length=185 FT gap 4363..4651 FT /estimated_length=289 XX SQ Sequence 6926 BP; 1174 A; 2344 C; 1302 G; 1632 T; 474 other; cgctacccag gccacaaaat gtatccctct gtccacactc tattccctga tgtttcacct 60 ctcaagatcc cccaatctgt ccctgccttt gcccaccttg tccagcgaca agggctgcgg 120 cggcaaggca attccaccac caacatctac ggcaatggca acgacgtcac cactgacgtc 180 ggtgccaacg ggatgtctct tcccatcgcc gtgggtgaca tgcctaccgc ctcctgctct 240 gaagctcccc ttggttccaa caaaggtggc tcttccactt ccccaaaatc cacatccaac 300 ggcaacgtcg tccgcggatc ccgctactcc aagtggtggg aacccgcggc cgcacgcgcc 360 ttggaccgtg ctcttgacca tgctgttgac gcaactgacg cagttgctgg cgccgcctcc 420 aagggcatca aggctggtgc caccaagctt tccaacaagc tttctggctc tcaaaccaca 480 gcccttcttg ctcttccagg caacatcgcc ggtggtgccc cctctgcgac agtcaatgcc 540 aacaacacct ccatctcttc ccaagccctt ctaccctctg ttgaccctta tcctcctacc 600 cctgctgtct cactccccaa ccccgacgct cccactcaag ttggccctgc tgccgaccga 660 cagtggctcg tcgacaccct ttcttggtct gagacaatcc cacctctcac tgtattctct 720 ggacccaagg ctctcacccc tggtgcctac ccccccacca tcgaacccaa cactggtgtc 780 tatcccttac cagctgcact ctgtctttcc catcctgaat ctgtcttctc cactgcctac 840 aatgcccacg cctacttcaa ctgtggcttc gacgtcacag tcgtcgtgaa cgcctcccag 900 ttccacggcg gctcgctgat cgtcttagcc atggccgagg gtctgggcga catcacccca 960 gctgattctt ccacttggtt caacttcccc cacacaatca tcaatttggc taattccaac 1020 gctgccaccc tcaaacttcc ctacattgga gtcaccccca acacctccac tgaaggactc 1080 cacaactatt ggaccattct ctttgcccct ctgactcctc ttgcagtccc gactggctca 1140 cccaccactg tcaaagtctc tctctttgtc tcccccattg actcagcttt ctacggcctc 1200 agattccctg tccctttccc gacaccccag cactggaaaa cacgtgctgt tcctggtgct 1260 ggcacctatg gctcggtcgt ggccggccag gaaatccccc tggtcggtta tgcccctgct 1320 gcccctcccc gcgactacct ccccgggcgc gtacacaatt ggctcgagta cgctgcccgc 1380 cactcctggg agaggaactt gacctggacc tccgccgacg aagtcggcga ccagcttgtc 1440 tcctacccca tccaaccaga ggctctcgca aacacccaaa ccaatacagc ttatgttctg 1500 tctctctttt cccagtggcg tggctctctg cagatctccc tcatcttcac tggtcctgct 1560 cagtgctacg gccgccttct tcttgcttac acccccccct ccgccaatcc tcccactacc 1620 atcgacgagg ccaacaacgg cacatacgat gtctgggatg tgaatggcga ttccacctac 1680 accttcacca tacccttctg ctcacaggcc tactggaaga ctgtcgatat cggctcgtcg 1740 tctggtctgg tctcgaacaa tgggtacttc actgcctttg tcatgaaccc tcttgtcact 1800 cctggcccct ctcctccttc tgtcactgtt gctgctttcc ttcatgttgc ggacgacttc 1860 gacgttcgcc tcccccagtg ccccgccctt ggcttccaat caggagccga tggtgcagaa 1920 gtccaacctg cccctaccag tgacctctct gatggcaacc ccaccactga ccctgcccct 1980 cgtgacaact ttgattaccc ccatcaccct gttgatcctt ccactgacct agctttctac 2040 ttttcccaat accgctggtt tggcctcaat gattctctca ccccattggg cgtcaccggt 2100 ggcctctttt accatgtttc cctcaacccc atcaacttcc agcaaagctc cctcctcagt 2160 gtcctaggtg cattcaccta tgtgtatgcc aacctttccc tcaacatcaa catctctgct 2220 ccttcccagc cctgcacatt ttatgtcttc tacgcccctc cgggcgcgtc tgtcccctct 2280 gtgcaaactc ttgctgagct ttctttcttt actcacactg ccacccctct cagcttgacc 2340 tcaccaacta acatcactgt ctccatccct tactcctccc cccagtctgt cctctgcacg 2400 tctttcgggg gcttcggcct ccaaaacggt ggagatgcag gcaacctcca ctccaacaca 2460 tggggtactc tcatcttcta tgttgatctc ccccaatctg acagtgtctc tgtttccacc 2520 tacatttcct tccgcgactt tgaagcatac gtccctcgcc aaacccctgg tgttggccct 2580 gtgcccacga gcacttccat cgttcgtgtc gcccgcccca cccccaaacc ccgcacgact 2640 cgccgtcaag gcggcactct tgcggacctc atcctctccc ccgagagtcg gtgcttcatt 2700 gttgcccaca ccactgcccc atactactct atccttttgg tcaacccaga cgaggagtat 2760 gccatcagca tgttcccaca tggcgacgag tcaatcctcc gttactcgtc gcgtgatggc 2820 actcgcctcg ctcccactgc cccagccttc ttcctctgcg ctgctgcatc tgttgacaca 2880 accctcccct actctgtctc ccaatcacac ctctggctta ctgatctgac tggcatcccc 2940 ctccccgtat ccnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3000 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3060 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3120 nnnnnnnnnn nnnnnnntgt agccaaggtt gctggcgatg ccaacattgc cacttcctcc 3180 caagccattg cttcttctat caactccctc tccacttcca ttgatggtgc taccactttc 3240 atgcaaaact tcttttctgg tcttgccccg aggaatccca cctcccctct ccagcacctc 3300 tttgccaagc tcatcaagtg ggtgaccaaa atcattggtt cactcatcat catctgcaat 3360 aaccccaccc cttcagcact gattggtgtc tccctcatgc tttgcggtga cctcgctgag 3420 gacatcacag agttcttctc aaaccttgga aaccccctcg ctgctgtctt ctatcgctgt 3480 gctagagctc ttggtctctc ccccactcca cagtctgctg cccaggccgc cggtggccgc 3540 cagggcgttc gtgactacaa tgacatcatg aacgctctgc ggaacaccga ctggttcttc 3600 gagaagatta tgacccacat caagaatctt ctcgagtggc ttggggtcct cgtcaaagac 3660 gaccccaggt ccaagctcaa ctcgcagcat gagaagatcc tggaactcta cactgactct 3720 gttactgctt cttcaacccc cccctctgag ctttctgcgg acgccatccg gtctaacctg 3780 gatttggcta agcaacttct cactctctcg cacgctgcca actctgtcac ccacattcag 3840 ctgtgcacgc gtgctatcac caactattct actgcccttt ccgccatttc ccttgttggc 3900 acgcctggaa cgcggccaga gccactggtc gtgtacctgt acgggccccc tgggactggc 3960 aaatccctcc ttgcttctct ccttgcttcc acccttgctc aggccctctc tggtgacccc 4020 aacaactact actcaccctc ctcccctgac tgcaagttct acgatggtta ctctggccag 4080 cccgtccact acatcgacga catcggacaa gaccccgatg gcgccgactg ggctgacttt 4140 gtgaacatcg tttcctctgc ccccttcatt gttcccatgg ctgacgttaa tgacaaggga 4200 cgtttctaca cctctcgtgt catcatcgtc acttccaact ttcccggccc caatccccgc 4260 tccgcgcgtt gtgtagctgc gctggaacgt cgcctgcaca ttcgcatgaa tgtgacggca 4320 cgcaacggtg tggccttctc ggcggcggcc gctctccagc ccnnnnnnnn nnnnnnnnnn 4380 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4440 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4500 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4560 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4620 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngatacctct tcacccgcgc ggtcaacgag 4680 aaccgccctg tctcttttct ttccaagatc tggtcgtggc gagcccccat cttcgccgct 4740 tcctctttcc tttctcttat cgctgcaact ctcaccattg tccgctgtct tcgtgacctg 4800 cggtctaccc agggtgcata caatgggact cctgttccga aaccacggaa aaaggatctc 4860 cctaaacaac ccgtgtactc tggaccagtc cgccggcagg gtttcgaccc tgctgtcatg 4920 aagatcatgg gcaatgtgga ctcctttgtc actctttcgg gttctaaacc catctggact 4980 atgtcctgcc tctggattgg cggtcgcaat ctgattgctc cttcccacgc attcgtctcc 5040 gatgagtacg agatcaccca catccgcgtt ggatcgcgca cgcttgacgt gtcgcgtgtt 5100 acgcgggtgg atgacggtga gttatctcta ctctcggtgc cggatgggcc ggagcataag 5160 agcctgatcc gctacatccg ctctgcctct cctaaatctg gcattctagc ctctaaattc 5220 tctgacaccc ctgtctttgt ctctttctgg aatggcaagc cccactctac ccccctccct 5280 ggggtcgtgg acgagaaaga ctcgttcacg taccgctgct cttctttcca aggtctgtgc 5340 ggttcgccga tgattgccac tgatcctggc ggcctgggta tcctcggtat ccacgtcgct 5400 ggggtcgctg gctacaacgg cttctccgca cgcctcaccc ctgagcgtgt ccaaactttc 5460 ctttcccacc tggctacacc tcaatctgtc ctccacttcc acccacctat gggtccgcct 5520 gcgcacgtct ctcgtcgcag tcggctccac ccctcccccg cttttggtgc cttccccatc 5580 actaaggagc cagcagccct ctctaggaag gaccctcgcc tccccgaggg caccgacctg 5640 gacgccatca ccctcgccaa gcacgacaag ggcgacatcg cgacgccctg gccttgcatg 5700 gaagaggcgg ctgactggta cttttcccag ctccctgaca acctcccagt cctctcccag 5760 gaagatgcca ttcgtggtct cgaccacatg gatgccatcg acctttccca atctcctggc 5820 tacccttgga caacacaggg tcggtcccgc cggtctctgt ttgacgagga tggcaaccct 5880 ctccctgagc tccaggaagc catcgattcc gtctgggacg gtggttccta catctaccag 5940 tctttcctca aggatgagtt gcgccccacg gcgaaagcca gagctggaaa aacccggatt 6000 gtggaggcgg ctccgataca agcaattgtg gtcggacgtc gccttcttgg ctctctcatc 6060 aaccacctcc agggtaaccc tctccagcac ggcagcgccg ttggatgcaa ccccgacatc 6120 cactggactc aaatctttca ctctctcact tctttctcta atgtctggtc tattgattac 6180 tcttgctttg atgccactat cccttctgtc cttctctctg caattgcctc ccgcattgct 6240 gcccgctctg accaacctgg tcgtgttctg gactatctct cttacactac tacttcctac 6300 catgtctatg actccctgtg gtacaccatg atcggtggta atccctctgg gtgtgttggg 6360 acctccatcc tcaacacaat tgcaaataac atcgcggtca tctccgcgat gatgtattgc 6420 aacaaatttg acccacggga tcctccggtc ttgtactgct acggggacga cttgatatgg 6480 ggctccaatc aagactttca ccctcgtgaa ctccaggctt tctatcagaa attcaccaac 6540 tttgttgtca cacctgctga caaagcttct gactttcctg actcttcttc catctttgac 6600 atcactttcc tcaaacgcta ctttgtccct gatgatgtcc acccccacct catccaccct 6660 gtgatggatg agcaaaccct caccaactca atcatgtggt tgcgtggcgg ggagtttgag 6720 gaggtgttgc ggtcactcga gactctggcc tttcactccg gaccgaagaa ctactcggcc 6780 tggtgtgaga aaatcaaggc taagattcga gagaacggct gcgacgccac cttcactccc 6840 tactccgtcc tccaacgtgg ttgggtgtcc ctctgcatga ctggacccta ccctctcacc 6900 gggtagcccc ctttgaaacc cctcta 6926 // ID MG026494; SV 1; linear; genomic RNA; STD; VRL; 6866 BP. XX AC MG026494; XX DT 01-SEP-2018 (Rel. 137, Created) DT 23-SEP-2018 (Rel. 138, Last updated, Version 2) XX DE Salivirus sp. isolate ETH_P6_2016 polyprotein gene, partial cds. XX KW . XX OS Salivirus sp. OC Viruses; Riboviria; Picornavirales; Picornaviridae; Salivirus. XX RN [1] RC Publication Status: Online-Only RP 1-6866 RX DOI; .1371/journal.pone.0202054. RX PUBMED; 30114205. RA Altan E., Aiemjoy K., Phan T.G., Deng X., Aragie S., Tadesse Z., RA Callahan K.E., Keenan J., Delwart E.; RT "Enteric virome of Ethiopian children participating in a clean water RT intervention trial"; RL PLoS One 13(8):e0202054-e0202054(2018). XX RN [2] RP 1-6866 RA Altan E., Delwart E.; RT ; RL Submitted (28-SEP-2017) to the INSDC. RL Molecular Virology, Blood Systems Research Institute, 270 Masonic Ave, San RL Francisco, CA 94118, USA XX DR MD5; e4668d434eefe3ba07e1d52eb51b9917. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R10 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6866 FT /organism="Salivirus sp." FT /host="Homo sapiens" FT /isolate="ETH_P6_2016" FT /mol_type="genomic RNA" FT /country="Ethiopia" FT /isolation_source="feces" FT /collection_date="Apr-2016" FT /db_xref="taxon:2039694" FT CDS <1..6864 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:A0A346M137" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:A0A346M137" FT /protein_id="AXQ03893.1" FT /translation="FPDVSPLKIPQSVPAFAHLVQRQGLRRQGNSITNIYGNGNDVTTD FT VGANGMSLPIAVGDMPTASSSEAPLGSNKGGSSTSPKSTSNGNVVRGSRYSKWWEPAAA FT RALDRALDHAVDATDAVAGAASKGIKAGATKLSNKLSGSQTTALLALPGNIAGGAPSAT FT VNANNTSISSQALLPSVNPYPSTPAVSLPNPDAPTQVGPAADRQWLVDTLSWSETIAPL FT TVFSGPKALTPGVYPPTIEPNTGVYPLPAALCVSHPESVFSTAYNAHAYFNCGFDVTVV FT VNASQFHGGSLIVLAMAEGLGDITPADSSTWFNFPHAIINLANSNSATLKLPYIGVTPN FT TSTEGLHNYWTILFAPLTPLAVPTGSPTAVKVSLFVSPIDSAFYGLRFPVPFPTPQHWK FT TRAVPGAGTYGSVVAGQEIPLVGYAPAAPPRXYLPGRVHNWLEYAARHSWERNVNWTSA FT EEVGDQLVSYPIQPEALANTQTNTAFVLSLFSQWRGSLQISLIFTGPAQCYGRLLLAYT FT PPSANPPTTIDEASNGTYDVWDVNGDSTYTFTIPFCSQAYWKTVDIGTSSGLVSNNGYF FT TVFVMNPLVTPGPSPPSATVAAFLHVADDFDVRLPQCPALGFQSGADGAEVQPAPTSDL FT SDGNPTTDPAPRDNFDYPHHPVDPSTDLAFYFSQYRWFGLNDSLTPLDATGGLFYHISL FT NPINFQQSSLLSVLGAFTYVYANLSFNINVSAPSQPCTFYVFYAPPGASVPTVQSLAEL FT SFFTHTATPLNLASPTNITVSIPYSSPQSVLCTSFGGFGLQNGGDAGNLHSNTWGTLIL FT YVDLPQSDSVSVSAYISFRDFEAYVPRQTPGVGPVPTSNSIVRVARPTPKPHMTRRQGG FT TLADLILSPESRCFIVAHTTAPYYSILLVNPDEEYAIGMSSHGDESILHYSSRAGTRLA FT PTAPAFFLCAAASVDTILPYSISQSHLWLSDLTGIPLRAVPPLTLFLSAGAALCAGAQT FT LIAVAQGGATPDTPPIPNRALLRRQGLGDLPDAAKGLSVALENVAKVAGDANIATSSQA FT IASSINSLSNSIDGATTFMQNFFSGLAPKNPTSPLQHLFAKLIKWVTKIIGSLIIICNN FT PTPSALIGVSLMLCGDLAEDITEFFSNLGNPLAAVFYRCARALGLSPTPQSAAQAAGGR FT QGVRDYNDIMSALRNTDWFFEKIMTHVKNLLEWLGVLVKDDPRTKLNSQHDKILELYTD FT SVTASSTPPSELSADAIRSNLDLAKQLLTLSHAASSVTHIQLCTRAITNYSTALSAISL FT AGTPGTRPEPLVVYLYGPPGTGKSLLASLLASTLAQALSGDPNNYYSPSSPDCKFYDGY FT SGQPVHYIDDIGQDPDGADWADFVNIVSSAPFIVPMADVNDKGRFYTSRVIIVTSNFPG FT PNPRSARCVAALERRLHIRMNVTARNGVAFSAAAALQPSDPPSATRYCKFSNPLTQFSM FT FNLAVDYKSVVLPNTPLTCFDDLVDFILDSLRGRASVNSLLSGMVRTDVTRQGGNAGAP FT APSAAPLPSVVPSVPSQDPFTRAVNENRPVSFLSKIWSWRAPIFAASSFLSLIAATLTI FT VRCLRDLRSTQGAYTGTPVPKPRKKDLPKQPVYSGPVRRQGFDPAVMKIMGNVDSFVTL FT SGTKPIWTMSCLWIGGRNLIAPSHAFVSDEYEITHIRVGSRTLDVSRVTRVDDGELSLL FT SVPDGPEHKSLIRYIRSASPKSGILASKFSDTPVFVSFWNGKPHSTPLPGVVDEKDSFT FT YRCSSFQGLCGSPMIATDPGGLGILGIHVAGVAGYNGFSARLTPERIQAFLSHLATPQS FT VLHFHPPMGPPAHVSRRSRLHPSPAFGAFPVTKEPAALSRKDPRLPEGTDLDAITLAKH FT DKGDIATPWPCMEEAADWYFSQLPDDLPVLSQEDAIRGLDHMDAIDLSQSPGYPWTTQG FT RSRRSLFDEDGNPLPELQEAIDSVWDGGSYIYQSFLKDELRPTAKARAGKTRIVEAAPI FT QAIVVGRRLLGSLINHLQGNPLQHGSAVGCNPDIHWTQIFHSLTPFSNVWSIDYSCFDA FT TIPSVLLSAIASRIAARSDQPGRVLDYLSYTTTSYHVYDSLWYTMIGGNPSGCVGTSIL FT NTIANNIAVISAMMYCNKFDPRDPPVLYCYGDDLIWGSNQDFHPRELQAFYQKFTNFVV FT TPADKASDFPDSSSIFDITFLKRYFVPDDIHPHLVHPVMDEQTLTNSIMWLRGGEFEEV FT LRSLETLAFHSGPKNYSAWCEKIKAKIRENGCDATFTPYSVLQRGWVSTCMTGPYPLTG FT " XX SQ Sequence 6866 BP; 1226 A; 2502 C; 1405 G; 1731 T; 2 other; ttccctgatg tttcgcccct caagattccc caatctgttc ccgcctttgc tcaccttgtc 60 cagcgacagg ggctgcggcg acaaggcaat tccatcacca acatctacgg caatggtaac 120 gacgtcacca ctgacgtcgg cgccaatggg atgtctctcc ccatcgccgt aggtgacatg 180 cctaccgcct cttcctctga agctcctctt ggttccaaca aaggtggctc ctccacttct 240 ccaaaatcca cgtccaacgg caacgtcgtc cgcggatccc gctactccaa gtggtgggaa 300 cccgcggctg cacgcgccct ggaccgtgct cttgaccatg ctgttgatgc aactgacgca 360 gttgctggcg ccgcctccaa gggcatcaag gctggtgcca ccaagctttc caacaagctt 420 tctggctctc aaaccacagc tcttcttgct cttcccggca atatcgccgg tggtgccccc 480 tctgcaacag tcaatgccaa caacacttcc atctcttccc aagctcttct accttctgtc 540 aacccttacc cctccactcc tgctgtctcg cttcccaatc ccgacgcccc cactcaagtt 600 ggccccgccg ctgaccgcca gtggctcgtc gacaccctct cttggtctga gacaattgca 660 cctcttactg tcttctctgg acccaaggcc ctcacccctg gtgtctatcc ccctactatc 720 gaacctaaca ctggtgtcta ccccctacca gctgcactct gtgtttccca ccctgaatct 780 gtcttttcca ctgcctacaa tgcccacgcc tacttcaatt gtggcttcga tgtcacagtc 840 gtcgtgaatg cttcccagtt tcacggcggc tcgctgattg tcttggccat ggctgaaggt 900 ctaggcgata tcactccagc tgactcctcc acttggttca acttccccca cgctattatt 960 aatctggcta actctaattc tgctaccctc aagcttcctt acattggagt cactcccaac 1020 acctccactg aaggactcca caactattgg accattctct ttgcccctct gactcctctt 1080 gctgttccga ctggctcacc caccgctgtc aaagtctctc tctttgtctc ccctattgac 1140 tcagctttct atggcctcag attccctgtc cccttcccga caccccagca ctggaaaaca 1200 cgtgctgtcc ctggtgctgg cacctacggc tcggtcgtgg ccggccagga aatccccctg 1260 gttggttacg cccccgccgc ccctccccgt rwttacctcc ctgggcgcgt acacaattgg 1320 ctcgagtacg ccgcccgcca ctcctgggag aggaatgtaa actggacttc cgccgaggaa 1380 gtcggtgacc agcttgtttc ctatcccatc caaccagagg ctcttgcaaa cacccaaacc 1440 aacacagcct ttgttctctc cctcttctcc cagtggcgtg gctctttgca gatctccctc 1500 atcttcactg gtcctgctca gtgctacggc cgccttcttc tcgcctacac ccctccctcc 1560 gccaatcctc ccactaccat cgatgaggcc agcaatggca catacgatgt ttgggatgtg 1620 aacggcgact ctacctacac cttcaccata cccttctgct cgcaggccta ctggaagact 1680 gtcgacattg gcacgtcgtc tggtctggtc tcgaacaatg ggtacttcac cgtctttgtc 1740 atgaaccctc tcgtcactcc tggcccctct cctccttctg ccactgtcgc tgctttcctt 1800 catgttgcgg acgacttcga cgttcgcctc ccccagtgcc ccgcccttgg cttccaatca 1860 ggagctgatg gtgcagaagt ccaacctgcc cccaccagtg acctctctga tggtaacccc 1920 accactgacc cggcccctcg tgacaacttc gactaccccc accaccctgt tgatccttcc 1980 actgatctgg ctttctactt ttcccagtac cgctggtttg gcctcaatga ttccctcacc 2040 ccattggacg ccaccggtgg gctgttttac cacatctctc ttaaccccat caacttccag 2100 caaagctccc tcctcagtgt tctgggtgca ttcacctatg tgtatgccaa cctctctttc 2160 aacatcaatg tctctgcccc ttctcagccc tgcacattct atgtcttcta cgcccctcca 2220 ggcgcatctg tccctactgt gcagtccctt gcagaactct ctttcttcac tcacactgcc 2280 actcccctca acctggcttc accaactaac atcactgtct ctatccctta ctcctccccc 2340 cagtctgtcc tctgcacgtc ctttggaggc ttcggtctcc aaaacggcgg agatgcaggc 2400 aacctccact ccaacacatg gggtactctc atcctttatg ttgatctccc ccaatctgac 2460 agtgtctctg tttctgctta catttccttc cgtgactttg aagcatacgt ccctcgccaa 2520 acccctggcg ttggccccgt gcccacgagc aactccattg ttcgtgtcgc ccgacccacc 2580 cccaaacccc acatgacccg ccgccaaggc ggcacccttg cagacctcat cctctcccct 2640 gagagtcggt gcttcattgt tgcccacacc actgccccct actactccat ccttctggtc 2700 aaccctgacg aggagtatgc catcggcatg tcctcacatg gtgatgagtc aatcctccac 2760 tactcgtcac gtgctggcac tcgcctcgcc cccaccgccc cagccttctt tctctgcgct 2820 gctgcatctg ttgacacaat ccttccctac tctatctctc aatcacacct ctggctttct 2880 gatctaactg gcatccccct ccgcgcagtc ccccctctca ctctcttcct ctccgcggga 2940 gctgccctgt gcgccggcgc gcaaacactg atagccgtcg cgcagggtgg tgccaccccg 3000 gacacccctc ccatccccaa ccgcgccctc ctccgtcgcc agggcctcgg tgacctcccg 3060 gatgcggcga aaggtctttc cgtcgcactc gagaatgtag ccaaggtcgc tggtgatgcc 3120 aatattgcca catcctccca agccattgct tcttctatca actctctctc aaactccatt 3180 gatggtgcta ccactttcat gcaaaatttc ttctctggtc tcgccccgaa gaatcctacc 3240 tcccccctcc agcacctctt tgccaaactc atcaaatggg tgaccaaaat catcggttca 3300 ctcatcatca tctgtaacaa ccccactcct tcagcattga ttggtgtctc cctcatgctc 3360 tgcggtgacc tcgccgagga catcacagag ttcttctcaa accttggaaa ccccctcgct 3420 gctgtcttct accgctgtgc tagagctctt ggcctctccc ccaccccaca gtctgctgcc 3480 caggccgccg gtggccgtca gggcgttcgt gattacaacg acatcatgag cgctctacgg 3540 aacaccgact ggttcttcga gaagatcatg actcacgtca aaaatcttct cgagtggctt 3600 ggggtcctcg tcaaagacga ccccaggacc aaactcaact cacagcatga caagatcctg 3660 gaactctaca ccgattctgt tactgcttct tcaacccccc cctctgagct ttctgcggac 3720 gccatccgat ccaacctgga tttggctaag caactcctca ccctctcaca cgctgccagc 3780 tctgtcactc acatccagct gtgcacgcgt gctatcacca actactccac tgccctctcc 3840 gccatctccc tcgctggcac gcctgggacg cggccagagc cactggtcgt atacctgtac 3900 gggcctcctg ggactggcaa gtcccttctt gcttctcttc tcgcttccac ccttgctcag 3960 gccctctctg gtgaccccaa caactattac tcaccttcct cccctgactg caagttctac 4020 gatggttact ctggccagcc cgtccactac atcgacgaca tcgggcaaga ccctgatggc 4080 gccgattggg ctgactttgt gaacatcgtt tcctctgctc ctttcattgt tcccatggct 4140 gacgttaatg acaagggacg tttctacacc tctcgtgtca tcatcgtcac ttccaacttt 4200 cccggcccca atccccgctc cgcgcgttgt gtagctgcgc tggaacgtcg cctgcacatt 4260 cgcatgaatg tgacggcacg caacggtgtg gccttctcgg cggcggccgc tctccagccc 4320 tccgaccccc cctccgcaac acgctattgc aaattctcca accctcttac ccagttctcc 4380 atgttcaatc tggctgttga ctacaaatca gttgtcctcc ccaacactcc cctcacctgc 4440 tttgatgatc tggttgactt tattctggac tctctccgcg gccgggcctc ggtgaactcg 4500 ctcctctctg gcatggtgcg cactgacgtc acacgccagg gcgggaatgc cggtgccccc 4560 gctccctctg cagctcctct cccttctgta gtcccatctg tcccctccca ggaccccttt 4620 actcgcgcgg tcaacgagaa ccgccctgtc tctttcctct ctaagatctg gtcgtggcga 4680 gctcctattt tcgccgcttc ctctttcctt tctctcattg ctgcaactct caccattgtc 4740 cgctgtcttc gtgacttgcg gtctacccag ggtgcatata ccgggactcc tgttccgaaa 4800 ccacggaaaa aggaccttcc caaacaacct gtgtactctg ggccagtccg ccggcagggt 4860 ttcgaccccg ccgtcatgaa gatcatgggc aatgtggact cttttgtcac cctctcgggc 4920 actaagccca tttggaccat gtcctgcctc tggattggcg gtcgcaatct gattgctcct 4980 tcccacgcat tcgtctccga cgagtacgag atcacccaca tccgcgttgg atcgcgcacg 5040 cttgacgtgt cgcgtgttac gcgggtggat gacggtgagt tatctctact ctcggtgccg 5100 gatgggccgg agcataagag tctgatccgc tatatccgct ctgcctctcc taaatctggt 5160 attctggcct ccaaattctc tgacactcct gtctttgtct ctttctggaa tggcaagccc 5220 cactccaccc ctctccctgg ggtcgtggac gagaaagact cgttcacgta ccgctgctct 5280 tctttccagg gcttgtgcgg ttcgccgatg attgccactg atcctggcgg tttgggtatc 5340 ctcggtatcc acgtcgccgg ggtggctggc tacaacggct tctccgcacg cctcaccccc 5400 gagcgcatcc aagctttcct ttctcacctg gcaacacccc aatctgtcct ccacttccac 5460 ccacccatgg gcccgcctgc gcacgtctct cgtcgcagtc gactccatcc ctcccctgct 5520 tttggtgcct tccccgtcac caaagagcca gcagccctct ccaggaagga ccctcgcctc 5580 cctgagggca ccgacctgga tgccatcacc ctcgccaagc acgacaaggg cgacatcgcg 5640 acgccctggc cttgcatgga agaggcggct gactggtact tctcccagct ccctgatgac 5700 ctcccagtcc tctcccagga agatgccatt cgtggcctcg accacatgga tgccattgac 5760 ctctcccaat cccctggcta cccttggaca acacagggcc ggtcccgccg gtctctgttt 5820 gacgaggatg gcaaccctct ccctgagctc caggaagcca tcgactccgt gtgggacggt 5880 ggctcctaca tctaccaatc tttcctcaag gatgagttgc gccccacggc gaaagccaga 5940 gctggaaaaa cccggattgt ggaggcggct ccgatacaag caattgtggt cggccgtcgc 6000 cttctcggtt ctctcatcaa ccacctccag ggtaaccccc tccagcatgg cagcgccgtt 6060 ggatgcaacc ccgacatcca ctggactcaa atctttcact ctctcacccc tttctctaat 6120 gtctggtcta tcgactactc ttgctttgat gccactatcc cttctgtcct tctctctgca 6180 atcgcatctc gcattgctgc ccgctctgac caacctggtc gcgttctgga ctacctctct 6240 tacactacta cttcctacca tgtctatgac tccctgtggt acaccatgat cggtggtaat 6300 ccctctgggt gtgttgggac ctccatcctc aacacgattg caaacaacat tgcggtcatc 6360 tccgcgatga tgtattgcaa caaatttgac ccacgggatc ctccggtctt gtactgctac 6420 ggggacgact tgatatgggg ctccaatcaa gactttcacc ctcgtgaact ccaggccttc 6480 tatcagaaat tcactaactt tgttgtcacc cctgccgaca aggcttctga ctttcctgac 6540 tcttcttcca tctttgacat caccttcctt aaacgctact ttgtccctga tgatatccac 6600 ccccacctcg tccatcctgt gatggatgag caaaccctca ccaactcaat catgtggttg 6660 cgcggcgggg agtttgagga ggtgttgcgg tcactcgaga ctctggcctt ccactccgga 6720 ccgaagaact attcggcttg gtgtgagaag atcaaggcta agattcgaga gaacggctgc 6780 gacgctacct tcactcccta ctccgtcctc caacgtggtt gggtttccac ctgtatgact 6840 ggaccctacc ccctcaccgg gtagcc 6866 // ID MG026495; SV 1; linear; genomic RNA; STD; VRL; 7603 BP. XX AC MG026495; XX DT 01-SEP-2018 (Rel. 137, Created) DT 23-SEP-2018 (Rel. 138, Last updated, Version 2) XX DE Salivirus sp. isolate ETH_P14_2016 polyprotein gene, complete cds. XX KW . XX OS Salivirus sp. OC Viruses; Riboviria; Picornavirales; Picornaviridae; Salivirus. XX RN [1] RC Publication Status: Online-Only RP 1-7603 RX DOI; .1371/journal.pone.0202054. RX PUBMED; 30114205. RA Altan E., Aiemjoy K., Phan T.G., Deng X., Aragie S., Tadesse Z., RA Callahan K.E., Keenan J., Delwart E.; RT "Enteric virome of Ethiopian children participating in a clean water RT intervention trial"; RL PLoS One 13(8):e0202054-e0202054(2018). XX RN [2] RP 1-7603 RA Altan E., Delwart E.; RT ; RL Submitted (28-SEP-2017) to the INSDC. RL Molecular Virology, Blood Systems Research Institute, 270 Masonic Ave, San RL Francisco, CA 94118, USA XX DR MD5; 8b1b41952363fec5740e10d0975d62aa. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R10 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..7603 FT /organism="Salivirus sp." FT /host="Homo sapiens" FT /isolate="ETH_P14_2016" FT /mol_type="genomic RNA" FT /country="Ethiopia" FT /isolation_source="feces" FT /collection_date="Apr-2016" FT /db_xref="taxon:2039694" FT CDS 485..7594 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:A0A346M138" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:A0A346M138" FT /protein_id="AXQ03894.1" FT /translation="MEGSNGFSSSLAGLSSSRSSLRLLTHFLSLSPPNRXARRHSGWYP FT XXXXXPVHIYLNEQFDNLCLAALRYPGHKMYPSVYTLFPDVSPLKIPQSVPAFAHLVQR FT QGLRRQGNSITNIYGNGNDVTTDVGANGMSLPIAVGDMPTASSSEAPLGSNKGGSSTSP FT KSTSNGNVVRGSRYSKWWEPAAARALDRALDHAVDATDAVAGAASKGIKAGATKLSNKF FT SGSQTTALLALPGNIAGGAPSATVNANNTSISSQALLPSVNPYPSTPAVSLPNPDAPTQ FT VGPAADRQWLVDTLSWSETIAPLTVFSGPKALTPGVYPPTIEPNTGVYPLPAALCVSHP FT ESVFSTAYNAHAYFNCGFDVTVVVNASQFHGGSLIVLAMAEGLGDVTPADSSTWFNFPH FT AIINLANSNSATLKLPYIGVTPNTSTEGLHNYWTILFAPLTPLAVPTGSPTTVKVSLFV FT SPIDSAFYGLRFPVPFPTPQHWKTRAVPGAGTYGSVVAGQEIPLVGYAPAAPPRDYLPG FT RVHNWLEYAARHSWERNVSWTSAEEVGAQLVSYPIQPEALANTQTNTAFVLSLFSQWRG FT SLQISLIFTGPAQCYGRLLLAYTPPSANPPTTIDEANNGTYDVWDVNGDSTYTFTIPFC FT SQAYWKTVDIGTSSGLVSNNGYFTVFVMNPLVTPGPSPPSATVAAFLHVADDFDVRLPQ FT CPALGFQSGADGAEVQPAPTSDLSDGNPTTDPAPRDNFDYPHHPVDPSTDLAFYFSQYR FT WFGLNDSLTPLDATGGLFYHISLNPINFQQSSLLSVLGAFTYVYANLSFNINVSAPSQP FT CTFYVFYAPPGASVPTVQSLAELTFFTHTATPLNLASPTNITVSIPYSSPQSVLCTSFG FT GFGLQNGGDAGNLHSNTWGTLILYVDLPQSDSVSVSAYISFRDFEAYVPRQTPGVGPVP FT TSNSIVRVARPTPKPHKARRQGGTLADLILSPVSRCFIVAHTTAPYYSILLVNPDEEYA FT IGMSSHGDESILHYSSRVGTRLAPTAPAFFLCAAASVDTILPYSVSQSHLWLSDLTGIP FT LRAVPPLTLFLSAGAALCAGAQTLIAVAQGGATPDTPPIPNRALLRRQGLGDLPDAAKG FT LSVALENVAKVAGDANIATSSQAIASSINSLSNSIDGATTFMQNFFSGLAPKNPTSPLQ FT HLFAKLIKWVTKIIGSLIIICNNPTPSALIGVSLMLCGDLAEDITEFFSNLGNPLAAVF FT YRCARALGLSPTPQSAAQAAGGRQGVRDYNDIMSALRNTDWFFEKIMTHVKNLLEWLGV FT LVKDDPRTKLNSQHDKILELYTDSVTASSTPPSELSADAIRSNLDLAKQLLTLSHAASS FT VTHIQLCTRAITNYSTALSAISLVGTPGTRPEPLVVYLYGPPGTGKSLLASLLASTLAQ FT ALSGDPNNYYSPSSPDCKFYDGYSGQPVHYIDDIGQDPDGADWADFVNIVSSAPFIVPM FT ADVNDKGRFYTSRVIIVTSNFPGPNPRSARCVAALERRLHIRLNVTARNGVTFSAAAAL FT QPSDPPSATRYCKFSNPLTQFSMFNLAVDYKSVVLPNTPLTCFDDLVDFVLGSLRDRAS FT VNSLLSGMVRTDVTRQGGNADAPAPSAAPLPSVVPSVPSQDPFTRAVNENRPVSFLSKI FT WSWRAPIFAASSFLSLIAATLTIVRCLRDLRSTQGAYTGTPVPKPRKKDLPKQPVYSGP FT VRRQGFDPAVMKIMGNVDSFVTLSGTKPIWTMSCLWIGGRNLIAPSHAFVSDEYEITHI FT RVGSRTLDVSRVTRVDDGELSLLSVPDGPEHKSLIRYIRSASPKSGILASKFSDTPVFV FT SFWNGKPHATPLPGVVDEKDSFTYRCSSFQGLCGSPMIATDPGGLGILGIHVAGVAGYN FT GFSARLTPERIQAFLSHLATPQSVLHFHPPMGPPAHVSRRSRLHPSPAFGAFPITKEPA FT ALSRKDPRLPEGTDLDAITLAKHDKGDIATPWPCMEEAADWYFSQLPDDLPVLSQEDAI FT RGLDHMDAIDLSQSPGYPWTTQGRSRRSLFDEDGNPLPELQEAIDSVWDGGSYIYQSFL FT KDELRPTAKARAGKTRIVEAAPIQAIVVGRRLLGSLINHLQGNPLQHGSAVGCNPDIHW FT TQIFHSLTPFSNVWSIDYSCFDATIPSVLLSAIASRIAARSDQPGRVLDYLSYTTTSYH FT VYDSLWYTMIGGNPSGCVGTSILNTIANNIAVISAMMYCNKFDPRDPPVLYCYGDDLIW FT GSNQDFHPRELQAFYQKFTNFVVTPADKASDFPDSSSIFDITFLKRYFVPDDIHPHLVH FT PVMDEQTLTNSIMWLRGGEFEEVLRSLETLAFHSGPKNYSAWCEKIKAKIRENGCDATF FT TPYSVLQRGWVSTCMTGPYPLTG" XX SQ Sequence 7603 BP; 1340 A; 2739 C; 1584 G; 1923 T; 17 other; gtgctttggc aggtaagcat cctgatcccc cgcggaagct gctcacgtgg caactgtggg 60 gacccagaca ggttatcaaa ggcacccggt ctttccgcct tcaggagtac cctcactagt 120 gaattctagt ggggctctgc ttggtgccaa cctcccccaa atgcgcgctg cgggagtgct 180 cttccccaac ccatcctagt accctctcat gtgtgtgctt ggtcagcaca tctgagacga 240 cgttccgctg tcccagacca gtccagtaat ggacgggcca gtgtgcgtag tcgtcttccg 300 gcttgtccgg cgcatgtttg gtgaaccggt ggggtaaggt tggtgtgccc aatgcccgta 360 ctttggtgat acctcaagac cacccaggaa tgccagggag gtaccccgcc tctcggcggg 420 atctgaccct gggctaattg tctacggtgg ttcttcttgc ttccactctt ttcttctgtt 480 cacgatggag ggctctaacg gattctcgag ttcgttggct ggcctttctt catcgcgctc 540 ttcacttcgc ctcctcactc attttctctc cctctccccc cccaatcgcs acgcccgccg 600 tcactcggga tggtatccnn nnnnnnnnnn nnnncccgtc cacatctacc tcaatgaaca 660 atttgacaac ctctgcctgg cggctttgcg ctacccaggc cacaaaatgt atccctctgt 720 ctacactcta ttccctgatg tctcgcccct caagattcct caatctgttc ccgcctttgc 780 tcaccttgtc cagcgacagg ggctgcggcg acaaggcaat tccatcacca acatttacgg 840 caatggcaac gacgtcacca ctgacgtcgg cgccaatggg atgtctctcc ccattgccgt 900 aggtgacatg cctaccgcct cttcctctga agctcctctt ggttctaaca aaggtggctc 960 ttccacttct ccaaaatcca cgtccaacgg caacgtcgtc cgcggatccc gctactccaa 1020 gtggtgggaa cccgcggctg cacgcgccct ggaccgtgct cttgaccatg ctgttgatgc 1080 aactgatgca gttgctggcg ccgcctccaa gggcatcaag gctggtgcca ccaagctttc 1140 caacaagttt tctggctctc aaaccacagc tcttcttgct cttcccggca acatcgccgg 1200 tggtgccccc tctgcaacag tcaatgccaa caacacttcc atctcctccc aagctcttct 1260 accctctgtc aacccttacc cctctactcc tgctgtctcg cttcccaatc ccgacgcccc 1320 cactcaagtt ggccccgccg ccgaccgcca gtggctcgtc gacaccctct cttggtctga 1380 gacaattgcc cctcttaccg ttttctctgg acccaaggcc ctcacccctg gtgtatatcc 1440 ccccactatc gaacccaaca ctggtgtcta ccccctacca gctgcactct gtgtttccca 1500 ccctgaatct gtcttttcca ctgcctacaa tgcccacgcc tacttcaatt gtggtttcga 1560 tgtcacagtc gtcgtgaatg cttcccagtt tcacggcggc tcgttgattg tcttggccat 1620 ggctgaaggt ctaggcgatg tcactccagc tgactcctcc acttggttca acttccccca 1680 cgctattatt aatctggcta actctaattc tgctaccctc aagcttcctt acattggagt 1740 cactcccaac acctccactg aaggactcca caattattgg accattcttt ttgcccctct 1800 gactcctctt gctgttccga ctggctcacc caccactgtc aaagtctccc tctttgtctc 1860 ccctattgac tcagctttct atggcctcag attccctgtc cccttcccga cgccccagca 1920 ctggaaaaca cgtgctgtcc ctggtgctgg cacctacggc tcggtcgtgg ccggccagga 1980 aatccccctg gttggttacg cccccgccgc ccctccccgt gattacctcc ctgggcgcgt 2040 acacaattgg ctcgagtacg ccgcccgcca ctcctgggag aggaatgtaa gctggacttc 2100 cgccgaggaa gtcggtgccc agcttgtttc ctaccccatc caaccagagg ctcttgcaaa 2160 cacccaaacc aacacagcct ttgttctctc cctcttctct cagtggcgtg gctctttgca 2220 gatctccctc attttcactg gtcctgctca gtgctacggc cgccttcttc ttgcctacac 2280 ccctccctcc gccaatcctc ccaccaccat cgatgaggcc aacaatggca catacgatgt 2340 ttgggatgtg aacggcgact ctacctacac cttcaccata cccttctgct cgcaggccta 2400 ctggaagact gtcgacattg gcacgtcgtc tggtctggtc tcgaacaatg ggtacttcac 2460 cgtctttgtc atgaaccctc tcgtcactcc tggcccctct cctccttctg ccactgtcgc 2520 tgctttcctt catgttgcgg acgacttcga cgttcgcctc ccccagtgcc ccgcccttgg 2580 cttccaatca ggagctgatg gtgcagaagt ccaacctgcc cccaccagtg acctctctga 2640 tggtaacccc accactgacc cggcccctcg tgacaacttc gactaccccc accaccctgt 2700 tgatccttcc actgatctgg ctttctactt ttcccagtac cgctggtttg gcctcaatga 2760 ttctctcacc ccattggacg ccaccggtgg gttgttttac cacatctccc ttaaccccat 2820 caacttccag caaagctccc tcctcagtgt tctaggtgca ttcacctatg tgtatgccaa 2880 cctctccttc aacatcaatg tctctgcccc ttctcagccc tgcacattct atgtcttcta 2940 cgcccctccg ggcgcatctg tccccactgt gcagtccctt gcagaactca ctttcttcac 3000 tcacactgcc actcccctca acctggcttc accaactaac atcactgtct ctatccctta 3060 ctcctctccc cagtctgtcc tctgcacgtc ctttggtggc ttcggtctcc aaaacggcgg 3120 agatgcaggc aacctccatt ctaacacatg gggtactctc atcctttatg ttgatctccc 3180 ccaatctgac agtgtctctg tttctgctta catttccttc cgtgactttg aagcatacgt 3240 cccccgccaa acccctggcg ttggccccgt gcccacgagc aactccattg ttcgtgtcgc 3300 ccgacccacc cccaaacccc acaaggcccg ccgccaaggc ggcacccttg cggacctcat 3360 cctctcccct gtgagtcggt gcttcattgt tgcccacacc actgccccct actactccat 3420 ccttctggtc aaccctgacg aggagtatgc catcggcatg tcctcacatg gtgatgagtc 3480 aatcctccac tactcgtcac gtgttggcac tcgcctcgcc cccaccgccc cagccttctt 3540 tctctgcgct gctgcatctg ttgacacaat ccttccctac tctgtctctc aatcacacct 3600 ctggctttct gatctgactg gcatccccct ccgcgcagtc ccccctctca ctctctttct 3660 ctccgcggga gctgccctgt gcgccggcgc gcaaacactg atagccgtcg cgcagggtgg 3720 tgccaccccg gacacccctc ccatccccaa ccgcgccctc ctccgtcgcc agggcctcgg 3780 cgacctcccg gatgcggcga aaggtctctc cgtcgcactc gagaatgtag ccaaggtcgc 3840 tggtgatgcc aacattgcca catcctctca agccattgct tcttctatca actctctctc 3900 aaactccatt gatggtgcta ccactttcat gcaaaatttc ttctctggtc tcgccccgaa 3960 gaatcctacc tcccccctcc agcacctctt tgccaaactc atcaaatggg tgaccaaaat 4020 catcggctca ctcattatca tctgtaacaa ccccactcct tcagcattga ttggtgtctc 4080 cctcatgctc tgcggtgacc tcgccgagga catcacagag ttcttctcaa accttggaaa 4140 ccccctcgct gctgtcttct accgctgtgc tagagctctt ggcctctccc ccaccccaca 4200 gtctgctgct caggccgccg gtggccgtca gggcgttcgt gattacaacg acatcatgag 4260 cgctctgcgg aacaccgact ggttcttcga gaagatcatg actcacgtca aaaatcttct 4320 cgagtggctt ggggtcctcg tcaaagacga ccccaggacc aaactcaact cacagcatga 4380 caagatcctg gaactctaca ccgattctgt tactgcttct tcaacccccc cctctgagct 4440 ttctgcggat gccatccggt ccaacctgga cttggctaag caactcctca ccctctcaca 4500 cgctgccagc tccgtcaccc acatccaact gtgcacgcgt gctatcacca actactccac 4560 tgccctctcc gccatctccc tcgttggcac gcctgggacg cggccagagc cactggtcgt 4620 atacctgtac gggcctcctg ggactggcaa gtccctcctt gcttcccttc tcgcttccac 4680 ccttgctcag gctctctctg gtgaccccaa caactattac tcaccttcct cccctgactg 4740 caagttctac gatggttact ctggccagcc cgtccactac atcgacgaca tcgggcaaga 4800 ccctgatggc gccgattggg ctgactttgt aaacatcgtt tcctctgctc ctttcattgt 4860 tcccatggcc gacgttaatg acaagggacg tttctacacc tctcgtgtca tcatcgtcac 4920 ttccaacttt cccggcccca atccccgctc cgcgcgttgt gtggctgcgc tggaacgtcg 4980 cctgcacatt cgcttgaatg tgacggcacg caacggtgtg accttctcgg cggcggctgc 5040 tctccagccc tccgaccccc cctccgcaac acgctattgc aaattctcca accctctcac 5100 ccagttctcc atgttcaatc tggctgttga ctacaaatca gttgtcctcc ccaacacccc 5160 cctcacctgc tttgatgatc tggttgactt tgttctgggc tctctccgcg accgggcctc 5220 ggtgaactcg cttctctctg gcatggtgcg cactgacgtc acacgccagg gcgggaatgc 5280 cgacgccccc gctccctctg cagctcctct cccttctgta gtcccatctg tcccctccca 5340 ggaccccttc actcgcgcgg ttaatgagaa ccgccctgtc tctttcctct ctaagatctg 5400 gtcgtggcga gcccctattt tcgccgcttc ctctttcctt tctctcattg ctgcaactct 5460 taccattgtc cgctgtcttc gtgacttgcg gtctacccag ggtgcataca ccgggactcc 5520 tgttccgaaa ccacggaaaa aggaccttcc caaacaacct gtgtactctg ggccagtccg 5580 ccggcagggt ttcgaccccg ccgtcatgaa gatcatgggc aatgtggact cttttgtcac 5640 tctctcgggc actaagccca tttggaccat gtcctgcctc tggattggcg gtcgcaatct 5700 gattgctcct tcccacgcat tcgtctccga cgagtacgag atcacccaca tccgcgttgg 5760 atcgcgcacg cttgacgtgt cgcgtgttac gcgggtggat gacggtgagt tatctctact 5820 ctcggtgccg gatgggccgg agcataagag tctgatccgc tacatccgct ccgcctctcc 5880 taaatctggt attctggcct ccaaattctc tgacacccct gtctttgtct ctttctggaa 5940 tggcaagccc cacgccaccc ctctccctgg ggtcgtggac gagaaggact cgttcacgta 6000 ccgctgctcc tcttttcagg gcctgtgcgg ttcgccgatg attgccactg atcctggcgg 6060 cttgggtatc ctcggtatcc acgtcgccgg ggtggctggc tacaacggct tctccgcacg 6120 cctcaccccc gagcgcattc aagctttcct ttctcacctg gccacacccc aatctgtcct 6180 ccacttccac ccacccatgg gcccgcctgc gcacgtctct cgtcgcagtc gactccatcc 6240 ctcccctgcc tttggtgcct ttcccatcac caaagaacca gcagccctct ccaggaagga 6300 ccctcgtctc cccgagggca ccgacctgga tgccatcacc ctcgccaagc acgacaaggg 6360 cgacatcgcg acgccctggc cttgcatgga agaggcggct gactggtact tctcccagct 6420 ccctgatgac ctcccagtcc tctcccagga agatgccatt cgtggtctcg accacatgga 6480 tgccattgac ctctcccaat cccctggcta cccttggaca acacagggcc ggtcccgccg 6540 gtctctgttt gacgaggatg gcaaccctct ccctgagctc caggaagcca tcgactctgt 6600 gtgggacggt ggctcctaca tctaccaatc tttcctcaag gatgagttgc gccccacggc 6660 gaaagccaga gctggaaaaa cccggattgt ggaggcggct ccgatacaag caattgtggt 6720 cggccgtcgc cttctcggtt ctctcatcaa ccacctccag ggtaaccccc tccagcatgg 6780 cagcgccgtt ggatgcaacc ccgacatcca ctggactcaa atctttcact ctctcacccc 6840 tttctctaat gtctggtcta ttgattactc ttgctttgac gccactatcc cttctgtcct 6900 tctctctgca attgcttctc gcattgctgc ccgctctgac caacctggtc gtgttctgga 6960 ctatctctct tacactacta cttcctacca tgtctatgac tccctgtggt acaccatgat 7020 cggtggcaat ccctctgggt gtgttgggac ctccatcctc aacacgattg caaacaacat 7080 tgcggtcatc tccgcgatga tgtattgcaa caaatttgac ccgcgggatc ctccggtctt 7140 gtactgctac ggggacgact tgatatgggg ctccaatcaa gactttcacc ctcgtgaact 7200 ccaggccttc tatcagaaat tcactaactt tgttgtcacc cctgctgaca aggcttctga 7260 ctttcctgac tcttcttcca tctttgacat cactttcctc aaacgctact ttgtccctga 7320 tgatatccac ccccacctcg tccatcccgt gatggatgag caaaccctca ccaactcaat 7380 catgtggttg cgcggcgggg agtttgagga ggtgttgcgg tcactcgaga ctctggcctt 7440 tcactccgga ccgaagaact attcggcttg gtgtgaaaaa atcaaggcta agattcgaga 7500 gaatggctgc gacgccacct tcactcccta ctccgtcctc caacgtggtt gggtttccac 7560 ctgcatgact ggaccctacc ccctcactgg gtagcccccc ccc 7603 // ID MG026496; SV 1; linear; genomic RNA; STD; VRL; 7828 BP. XX AC MG026496; XX DT 01-SEP-2018 (Rel. 137, Created) DT 23-SEP-2018 (Rel. 138, Last updated, Version 2) XX DE Salivirus sp. isolate ETH_P28_2016 polyprotein gene, complete cds. XX KW . XX OS Salivirus sp. OC Viruses; Riboviria; Picornavirales; Picornaviridae; Salivirus. XX RN [1] RC Publication Status: Online-Only RP 1-7828 RX DOI; .1371/journal.pone.0202054. RX PUBMED; 30114205. RA Altan E., Aiemjoy K., Phan T.G., Deng X., Aragie S., Tadesse Z., RA Callahan K.E., Keenan J., Delwart E.; RT "Enteric virome of Ethiopian children participating in a clean water RT intervention trial"; RL PLoS One 13(8):e0202054-e0202054(2018). XX RN [2] RP 1-7828 RA Altan E., Delwart E.; RT ; RL Submitted (28-SEP-2017) to the INSDC. RL Molecular Virology, Blood Systems Research Institute, 270 Masonic Ave, San RL Francisco, CA 94118, USA XX DR MD5; 066b76a649419ac682800f7d3e84b32f. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R10 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..7828 FT /organism="Salivirus sp." FT /host="Homo sapiens" FT /isolate="ETH_P28_2016" FT /mol_type="genomic RNA" FT /country="Ethiopia" FT /isolation_source="feces" FT /collection_date="Apr-2016" FT /db_xref="taxon:2039694" FT CDS 702..7823 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:A0A346M139" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:A0A346M139" FT /protein_id="AXQ03895.1" FT /translation="MEGSNGFSSSLAGLSSSRSSLRLLTHFLSLPXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPSVHTLFPDVSPLKIPQSVPAFAH FT LVQRQGLRRQGNSITNIYGNGNDVTTDVGANGMSLPIAVGDMPTASSSEAPLGSNKGGS FT STSPKSTSNGNVVRGSRYSKWWEPAAARALDRALDHAVDATDAVAGAASKGIKAGATKL FT SNKLSGSQTTALLALPGNIAGGAPSATVNANNTSISSQALLPSVNPYPSTPAVSLPNPD FT APTQVGPAADRQWLVDTLSWSETIAPLTVFSGPKALTPGVYPPTIEPNTGVYPLPAALC FT VSHPESVFSTAYNAHAYFNCGFDVTVVVNASQFHGGSLIVLAMAEGLGDVTPADSSTWF FT NFPHAIINLANSNSATLKLPYIGVTPNTSTEGLHNYWTILFAPLTPLAVPTGSPTAVKV FT SLFVSPIDSAFYGLRFPVPFPTPQHWKTRAVPGAGTYGSVVAGQEIPLVGYAPAAPPRD FT YLPGRVHNWLEYAARHSWERNVSWTSAEEVGDQLVSYPIQPEALANTQTNTAFVLSLFS FT QWRGSLQISLIFTGPAQCYGRLLLAYTPPSANPPTTIDEASNGTYDVWDVNGDSTYTFT FT IPFCSQAYWKTVDIGTSSGLVSNNGYFTVFVMNPLVTPGPSPPSATVAAFLHVADDFDV FT RLPQCPAPXLQSGADGAEVQPAPTSDLSDGNPTTXXXXXXXXXYPHHPVDPSTDLAFYF FT SQYRWFGLNDSLTPLDATGGLFYHISLNPINFQQSSLLSVLGAFTYVYANLSFNINVSA FT PSQPCTFYVFYAPPGASVPTVQSLAELSFFTHTATPLNLASPTNITVSIPYSSPQSVLC FT TSFGGFGLQNGGDAGNLHSNTWGTLILYVDLPQSDSVSVSAYISFRDFEAYVPRQTPGV FT GPVPTSNSIVRVARPTHYPHMTRRQGGTLADLILSPESRCFIVAHTTAPYYSILLVNPD FT EEYAIGMSSHGDESILHYSSRAGTRLAPTAPAFFLCAAXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGDLPD FT AAKGLSVALENVAKVAGDANIATSSQAIASSINSLSNSIDGATTFMQNFFSGLAPKNPT FT SPLQHLFAKLIKWVTKIIGSLIIICNNPTPSALIGVSLMLCGDLAEDITEFFSNLGNPL FT AAVFYRCARALGLSPTPQSAAQAAGGRQGVRDYNDIMSALRNTDWFFEKIMTHVKNLLE FT WLGVLVKDDPRTKLNSQHDKILELYTDSVTASSTPPSELSADAIRSNLDLAKQLLTLSH FT AASSVTHIQLCTRAITNYSTALSAVSLAGTPGTRPEPLVVYLYGPPGTGKSLLASLLAS FT TLAQALSGDPNNYYSPSSPDCKFYDGYSGQPVHYIDDIGQDPDGADWADFVNIVSSAPF FT IVPMADVNDKGRFYTSRVIIVTSNFPGPNPRSARCVAALERRLHIRMNVTARNGVAFSA FT AAALQPSDPPSATRYCRFSNPLTQFSMFNLAVDYKSVVLPNTPLTCFDDLVDFILDSLR FT GRASVNSLLSGMVRTDVTRQGGNADAPAPSAAPLPSVVPSVPSQDPFTRAVNENRPVSF FT LSKIWSWRAPIFAASSFLSLVAATLTIVRCLRDLRSTQGAYTGTPVPKPRKKDLPKQPV FT YSGPVRRQGFDPAVMKIMGNVDSFVTLSGTKPIWTMSCLWIGGRNLIAPSHAFVSDEYE FT ITHIRVGSRTLDVSRVTRVDDGELSLLSVPDGPEHKSLIRYIRSASPKSGILASKFSDT FT PVFVSFWNGKPHSTPLPGVVDEKDSFTYRCSSFQGLCGSPMIATDPGGLGILGIHVAGV FT AGYNGFSARLTPERIQAFLSHLATPQSVLHFHPPMGPPAHVSRRSRLHPSPAFGAFPIT FT KEPAALSRKDPRLPEGTDLDAITLAKHDKGDIATPWPCMEEAADWYFSQLPDDLPVLSQ FT EDAIRGLDHMDAIDLSQSPGYPWTTQGRSRRSLFDEDGNPLPELQEAIDSVWDGGSYIY FT QSFLKDELRPTAKARAGKTRIVEAAPIQAIVVGRRLLGSLINHLQGNPLQHGSAVGCNP FT DIHWTQIFHSLTPFSNVWSIDYSCFDATIPSVLLSAIASRIAARSDQPGRVLDYLSYTT FT TSYHVYDSLWYTMIGGNPSGCVGTSILNTIANNIAVISAMMYCNKFDPRDPPVLYCYGD FT DLIWGSNQDFHPRELQAFYQKFTNFVVTPADKASDFPDSSSIFDITFLKRYFVPDDIHP FT HLVHPVMDEQTLTNSIMWLRGGEFEEVLRSLETLAFHSGPKNYSAWCEKIKAKIRENGC FT DATFTPYSVLQRGWVSTCMTGPYPLAG" FT gap 795..940 FT /estimated_length=146 FT gap 3784..4001 FT /estimated_length=218 XX SQ Sequence 7828 BP; 1318 A; 2648 C; 1559 G; 1899 T; 404 other; ggcgggcttg tggacggctt cggcccaccc acagcaagaa tgccatcatc tgtcctcacc 60 cccaatctcc cttttcttcc cctgcaacca ttacgcttac tcgcatgtgc attgagtggt 120 gcatatgttg aacaaacagc tacactcaca tgggggcggg ttttcccgcc ctgcggcctc 180 tcgcgaggcc taccccnnnn nnnnnnntat aactacagtg ctttggcagg taagcatcct 240 gatcccccgc ggaagctgct cacgtggcaa ctgtggggac ccagacaggt tatcaaaggc 300 acccggtctt tccgccttca ggagtatcct cactagtgaa ttctagtggg gctctgcttg 360 gtgccaacct cccccaaatg cgcgctgcgg gagtgctctt ccccaaccca tcctagtatc 420 ctctcatgtg tgtgcttggt cagcatatct gagacgatgt tccgctgtcc cagaccagtc 480 cagtaatgga cgggccagtg tgcgtaatcg tcttccggct tgtccggcgc atgtttggtg 540 aaccggtggg gtaaggttgg tgtgcccaac gcccgtactt tggtgacacc tcaagaccac 600 ccaggaatgc cagggaggta ccccgcctca cggcgggatc tgaccctggg ctaattgtct 660 acggtggttc ttcttgcttc cactcttttc ttctgttcac gatggagggc tctaacggat 720 tctcgagttc gttggctggc ctttcttcat cgcgctcctc acttcgtctc ctcactcatt 780 ttctctccct ccccnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 900 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn cccctctgtc cacactctgt 960 tccctgatgt ttcgcccctc aagattcccc aatctgttcc cgcctttgct caccttgtcc 1020 agcgacaggg gctgcggcga caaggcaatt ccatcaccaa catctacggc aatggtaacg 1080 acgtcaccac tgacgtcggc gccaatggga tgtctctccc catcgccgta ggtgacatgc 1140 ctaccgcctc ttcctctgaa gctcctcttg gttccaacaa aggtggctct tccacttctc 1200 caaaatccac gtccaacggc aacgtcgtcc gcggatcccg ctactccaag tggtgggaac 1260 ccgcggctgc acgcgccctg gaccgtgctc ttgaccatgc tgttgatgca actgacgcag 1320 ttgctggcgc cgcctccaag ggcatcaagg ctggtgccac caagctttcc aacaagcttt 1380 ctggctctca aaccacagct cttcttgctc ttcccggcaa tatcgccggt ggtgccccct 1440 ctgcaacagt caatgccaac aacacttcca tctcttccca agctctttta ccttctgtca 1500 acccttaccc ctccactcct gctgtctcgc ttcccaatcc cgacgccccc actcaagttg 1560 gccccgccgc tgaccgccag tggctcgtcg acaccctctc ttggtctgag acaattgcac 1620 ctcttactgt cttctctgga cccaaagccc tcacccctgg tgtctatccc cctactatcg 1680 aacctaacac tggtgtctac cccctaccag ctgcactctg tgtttcccac cctgaatctg 1740 tcttttccac tgcctacaat gcccacgcct acttcaattg tggcttcgat gtcacagtcg 1800 tcgtgaatgc ttcccagttt cacggcggct cgctgattgt cttggccatg gctgaaggtc 1860 taggcgatgt cactccagct gactcttcca cttggttcaa cttcccccac gctattatta 1920 atctggctaa ctctaattct gctaccctca agcttcctta cattggagtc actcccaaca 1980 cctccactga aggactccac aactattgga ccattctctt tgcccctctg actcctcttg 2040 ctgttccgac tggctcaccc accgctgtca aagtctctct ctttgtctcc cctattgact 2100 cagctttcta tggcctcaga ttccctgtcc ccttcccgac accccagcac tggaaaacac 2160 gtgctgtccc tggtgctggc acctacggct cggtcgtggc cggccaggaa atccccctgg 2220 ttggttacgc ccccgccgcc cctccccgcg attacctccc tgggcgcgta cacaattggc 2280 tcgagtacgc cgcccgccac tcctgggaga ggaatgtaag ctggacttcc gccgaggaag 2340 tcggtgacca gcttgtttcc tatcccatcc aaccagaggc tcttgcaaac acccaaacca 2400 acacagcctt tgttctctcc ctcttctccc agtggcgtgg ctctttgcag atctccctca 2460 tcttcactgg tcctgctcag tgctacggcc gccttcttct tgcctacacc cctccctccg 2520 ccaatcctcc cactaccatc gatgaggcca gcaatggcac atacgatgtt tgggatgtga 2580 acggcgactc tacctacact ttcaccatac ccttctgctc gcaggcctac tggaagactg 2640 tcgacattgg cacgtcgtct ggtctggtct cgaacaatgg gtacttcacc gtctttgtca 2700 tgaaccctct cgtcactcct ggcccctctc ctccttctgc cactgtcgct gctttccttc 2760 atgttgcgga cgacttcgac gttcgcctcc cccagtgccc cgcccctvbc ctccaatcag 2820 gagctgatgg tgcagaagtc caacctgccc ccaccagtga cctctctgat ggtaacccca 2880 ccacnnnnnn nnnnnnnnnn nnnnnnnnnn nctaccccca ccaccctgtt gatccttcca 2940 ctgatctggc tttctacttt tcccagtacc gctggtttgg cctcaatgac tccctcaccc 3000 cattggacgc caccggtggg ctgttttacc acatctctct taaccccatc aacttccagc 3060 aaagctccct cctcagtgtt ctgggtgcat tcacctatgt gtatgccaac ctctctttca 3120 acatcaatgt ctctgcccct tctcagccct gcacattcta tgtcttctac gcccctccag 3180 gcgcatctgt ccctactgtg cagtcccttg cagaactctc tttcttcact cacactgcca 3240 ctcccctcaa cctggcttca ccaactaaca tcactgtctc tatcccttac tcctcccccc 3300 agtctgtcct ctgcacgtcc tttggtggct tcggtctcca aaacggcgga gatgcaggca 3360 acctccactc caacacatgg ggtactctca tcctttatgt tgatctcccc caatctgaca 3420 gtgtttctgt ttctgcttac atttccttcc gtgactttga agcatacgtc cctcgccaaa 3480 cccctggcgt tggccccgtg cccacgagca actccattgt tcgtgtcgcc cgacccaccc 3540 actatcccca catgacccgc cgccaaggcg gcacccttgc agacctcatc ctctcccctg 3600 agagtcggtg cttcattgtt gcccacacca ctgcccccta ctactccatc cttctggtca 3660 accctgacga ggagtatgcc atcggcatgt cctcacatgg tgatgagtca atcctccact 3720 actcgtcacg tgctggcact cgcctcgccc ccaccgcccc agccttcttt ctctgcgctg 3780 ctgnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3840 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3900 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3960 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nggcctcggt gacctcccgg 4020 atgcggcgaa aggtctttcc gtcgcactcg agaatgtagc caaggtcgct ggtgatgcca 4080 atattgccac atcctcccaa gccattgctt cttctatcaa ctctctctca aactccattg 4140 atggtgctac cactttcatg caaaatttct tctctggtct cgctccgaag aatcctacct 4200 ccccccttca gcacctcttt gccaaactca tcaaatgggt gaccaaaatc atcggctcac 4260 tcatcatcat ctgtaacaac cccactcctt cagcattgat tggtgtctcc ctcatgctct 4320 gcggtgacct cgccgaggac atcacagagt tcttctcaaa ccttggaaac cccctcgccg 4380 ctgtcttcta ccgctgtgct agagctcttg gcctctcccc caccccacag tctgctgccc 4440 aggccgccgg tggccgtcag ggcgttcgtg attacaacga catcatgagc gctctgcgga 4500 acaccgactg gttcttcgag aagatcatga ctcacgtcaa aaatcttctc gagtggcttg 4560 gggtcctcgt caaagacgac cccaggacca aactcaactc acagcatgac aagatcctgg 4620 aactctacac cgattctgtt actgcttctt caaccccccc ctctgagctt tctgcggacg 4680 ccatccgatc caacctggac ttggctaagc aactcctcac cctctcacac gctgccagct 4740 ctgtcactca catccagctg tgcacgcgtg ctatcaccaa ctactccact gccctctctg 4800 ccgtctccct cgctggcacg cctgggacgc ggccagagcc actggtcgta tacctgtacg 4860 ggcctcctgg gactggcaag tcccttcttg cttctcttct cgcttccacc cttgctcagg 4920 ccctctctgg tgaccccaac aactattact caccttcctc ccctgactgc aagttctacg 4980 atggttactc tggccagccc gtccactaca tcgacgacat cgggcaagac cctgatggcg 5040 ccgattgggc tgactttgtg aacatcgttt cctctgctcc tttcattgtt cccatggctg 5100 acgttaatga caagggacgt ttctatacct ctcgtgtcat catcgtcact tccaactttc 5160 ccggccccaa tccccgctcc gcgcgttgtg tggctgcgct ggaacgtcgc ctgcacattc 5220 gcatgaatgt gacggcacgc aacggtgtgg ccttctcggc ggcggccgct ctccagccct 5280 ccgacccccc ctccgcaaca cgctattgca gattctccaa ccctcttacc cagttctcca 5340 tgttcaatct ggctgttgac tacaaatcag ttgtcctccc caacaccccc ctcacctgct 5400 ttgatgatct ggttgacttt attctggact ctctccgcgg ccgggcctcg gtgaactcgc 5460 tcctctctgg catggtgcgc actgacgtca cacgccaggg cgggaatgcc gatgcccccg 5520 ctccctctgc agctcctctc ccttctgtag tcccatctgt cccctcccag gaccccttta 5580 ctcgcgcggt caacgagaac cgccctgtct ctttcctctc taagatctgg tcgtggcgag 5640 ctcctatttt cgccgcttcc tctttccttt ctctcgttgc tgcaactctc accattgtcc 5700 gctgtcttcg tgacttgcgg tctacccagg gtgcatatac cgggactcct gttccgaaac 5760 cacggaaaaa ggaccttccc aaacaacctg tgtactctgg gccagtccgc cggcagggtt 5820 tcgaccccgc cgtcatgaag atcatgggca atgtggactc ttttgtcacc ctctcgggta 5880 ctaagcccat ttggaccatg tcctgccttt ggattggcgg tcgcaatctg attgctcctt 5940 cccacgcatt cgtctccgac gagtacgaga tcacccacat ccgcgttgga tcgcgcacgc 6000 ttgacgtgtc gcgtgttacg cgggtggatg acggtgagtt atctctactc tcggtgccgg 6060 atgggccgga gcataagagt ctgatccgct atatccgctc tgcctctcct aaatctggta 6120 ttctggcctc caaattctct gacactcctg tctttgtctc tttctggaat ggcaagcccc 6180 actccacccc tctccctggg gtcgtggacg agaaagactc gttcacgtac cgctgctctt 6240 ctttccaggg cttgtgcggt tcgccgatga ttgccactga tcctggcggt ttgggtatcc 6300 tcggtatcca cgtcgccggg gtggctggct acaacggctt ctccgcacgc ctcacccctg 6360 agcgcatcca agctttcctt tctcacctgg caacacccca atctgtcctc cacttccacc 6420 cacccatggg cccgcctgcg cacgtctctc gtcgcagtcg actccatccc tcccctgctt 6480 ttggtgcctt ccccatcacc aaagaaccag cagccctctc caggaaggac cctcgcctcc 6540 ctgagggcac cgacctggat gccatcaccc tcgccaagca cgacaagggc gacatcgcga 6600 cgccctggcc ttgcatggaa gaggcggctg actggtactt ctcccagctc cctgatgacc 6660 tcccagtcct ctcccaggaa gatgccattc gtggtctcga ccacatggat gccattgacc 6720 tctcccaatc ccctggctac ccttggacaa cacagggccg gtcccgccgg tctctgtttg 6780 acgaggatgg caaccctctc cctgagctcc aggaagccat cgactccgtg tgggacggtg 6840 gctcctacat ctaccaatct ttcctcaagg atgagttgcg ccccacggcg aaagccagag 6900 ctggaaaaac ccggattgtg gaggcggctc cgatacaagc aattgtggtc ggccgtcgcc 6960 ttctcggttc tctcatcaac cacctccagg gtaaccccct ccagcatggc agcgccgttg 7020 gatgcaaccc cgacatccac tggactcaaa tctttcactc tctcacccct ttctctaatg 7080 tctggtctat cgactactct tgctttgatg ccactatccc ttctgtcctt ctctctgcaa 7140 tcgcatctcg cattgctgcc cgctctgacc aacctggtcg tgttctggac tatctctctt 7200 acactactac ttcctaccat gtctatgact ccctgtggta caccatgatc ggtggcaatc 7260 cctctgggtg tgttgggacc tccatcctca acacgattgc aaacaacatt gcggtcatct 7320 ccgcgatgat gtattgcaac aaatttgacc cgcgggatcc tccggtcttg tactgctacg 7380 gggacgactt gatatggggc tccaatcaag actttcaccc tcgtgaactc caggccttct 7440 atcagaaatt cactaacttt gttgtcaccc ctgccgacaa ggcttctgac tttcctgact 7500 cttcttccat ctttgacatc accttcctta aacgctactt tgtccctgat gatatccacc 7560 cccacctcgt ccatcctgtg atggatgagc aaactctcac caactcaatc atgtggttgc 7620 gcggcgggga gtttgaggag gtgttgcggt cactcgagac tctggccttt cactccggac 7680 cgaagaacta ttcggcttgg tgtgagaaga tcaaggctaa gattcgagag aacggctgcg 7740 acgccacctt cactccctac tccgtcctcc aacgtggttg ggtttccacc tgcatgactg 7800 gaccctaccc cctcgccggg tagccccc 7828 // ID MG026727; SV 1; linear; genomic DNA; STD; VRL; 5100 BP. XX AC MG026727; XX DT 31-JAN-2018 (Rel. 135, Created) DT 31-JAN-2018 (Rel. 135, Last updated, Version 1) XX DE Bovine parvovirus 3 isolate ujs1794 nonstructural protein and structural DE protein genes, complete cds. XX KW . XX OS Bovine parvovirus 3 OC Viruses; Parvoviridae; Parvovirinae; Erythroparvovirus. XX RN [1] RC Publication Status: Online-Only RP 1-5100 RX DOI; .1186/s12985-018-0923-9. RX PUBMED; 29334978. RA Wang H., Li S., Mahmood A., Yang S., Wang X., Shen Q., Shan T., Deng X., RA Li J., Hua X., Cui L., Delwart E., Zhang W.; RT "Plasma virome of cattle from forest region revealed diverse small circular RT ssDNA viral genomes"; RL Virol J 15(1):11-11(2018). XX RN [2] RP 1-5100 RA Wang H., Zhang W.; RT ; RL Submitted (01-OCT-2017) to the INSDC. RL School of Medicine, Jiangsu University, 301 Xuefu Road, Zhenjiang, Jiangsu RL 212013, China XX DR MD5; 6e3c98a6409aafdcb8e5804e7454e583. DR EuropePMC; PMC5769433; 29334978. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. 8.1 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..5100 FT /organism="Bovine parvovirus 3" FT /isolate="ujs1794" FT /mol_type="genomic DNA" FT /country="China" FT /isolation_source="Cattle blood" FT /collection_date="2015" FT /db_xref="taxon:172297" FT CDS 26..1981 FT /codon_start=1 FT /product="nonstructural protein" FT /note="NS" FT /db_xref="GOA:A0A2I7YUU4" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR014835" FT /db_xref="InterPro:IPR027417" FT /db_xref="UniProtKB/TrEMBL:A0A2I7YUU4" FT /protein_id="AUS83805.1" FT /translation="MESYSRAVIRLPWENIYEAIQEAAWPSLAAVEPQRPGDLPYDWPL FT LYDEDRRYVVACDALWSILQRRAAVFGRWAGYLQLEPSQAGGPGRHLHLLLSAPGIRGR FT SWTAFLRNAVAEWARTTVHLNYVDAIDIPRNTHGRILEADADFVFRYLAPKLPLREVTW FT AWTNEDQFKPFALCEPKRRELIQRATTQDRANGLDGPPAKRSRAADEFHQLVHFLADKG FT IVDPDKWMALFPDSYITWSSSAQGRQQVNSACELALQIILTRGVLSRFLAPNPSNIFPE FT NNRAVELLRMQGHDPVSFGQLVLAWADKQLGKRNTLWFWGPPSTGKTNLALAIARALPR FT FGMVNWTNENFPFNDAPHKCVLVWDEGRITAKIVEAVKSILGGQAVRVDQKCKGSVSLS FT PTPVLITSNADIRYVRDGNIVTGDHVKALSERMVIVHFSTPCPANFGLLKAEEIVDWLN FT YVKSCPGSITADTVQATWGTRSAPNLFEIKRKAPQTASPLGPQAEEQEEAAAYRCPSSP FT ASSRSSSPDIFGITKSPAPLEDLSSDSSSECSLPFTPSNAAWFTPMPPARPLQPPLFGV FT DWIYSTQWKQPVCCLDHETEPCNLCIDIAERCVLFRVSEPDLLRCPDHRHEENPFDVLL FT CRHCQALSGLETLQSA" FT CDS 1984..4827 FT /codon_start=1 FT /product="structural protein" FT /note="VP" FT /db_xref="GOA:A0A2I7YUT4" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR016184" FT /db_xref="InterPro:IPR036952" FT /db_xref="UniProtKB/TrEMBL:A0A2I7YUT4" FT /protein_id="AUS83806.1" FT /translation="MAFNPLTMSSRLLVPVTPVSKLDLLKKKWFAFPDVSKILLEALSH FT SGFGDPKKWKEADADIIEALLDEALRLGPRLEKPAWFYDLQRAIGLARFSASLEQTVFL FT NEMLIKLTRGPVVPKYPEPDIVIRDPAPLTPEVEAPTSTPENSPDQSSVASDPVEMEEG FT SSTPIPDPVPQSEDMETEETTIPDQPPPPSPQIVDEVEDMAMGVEDLSIVEDASEQHQS FT PAGEPTPDITSSVGNRDDESREESREADLQDLSAGLGAAGGSAIAALGSGLIPAATVAT FT AYPRPDQFLRDYLARYDQMYPSGSRYPPRWEQLKSLYDKGMTVKEVWDLLNKNSNNSNL FT QAKDTDKKQTAPSSSSAPQESAAAMASGDKSGVNPSGGSAPLSATVWASGAQFEADHVI FT THMSRTVFIPFQQAHRYEPIVWRGRRTADGWLSFWPDHPVIGYKTPWFYLDVNAINRHF FT SPGEWQEVLERYGSIVPESMEIILSDFCIKDVSVVDGKTTVTDSSTGGVCVFVDDGYKF FT PYVLGHSQNTLPGPLPTDIYSPPQYAYLTTGKKTKVAAYASGEGPMPMDSIAIPSQETA FT FYVLENSFYTIQRAGGGFAHSYNFPSLKPISLEGFSQHWMLMDNPLYPSRLWVPEKVGG FT ASKWGAVKNDDYGKKPLNWMPGPNIPSHTIEQSDQAGQRVELDRDVEGQKVWTGTSFGS FT RPENRWSMRPLGVNQPYAYDAYEDETDKIVTVDAIGYGTAKASAALGQDTGEVPENASV FT GRVPDDTECNKQGGGGNHLFQVKSLAHNNFTEQMKNQTVPLMPGSVWQNRALHYESQIW FT AKIPNVDGEFMCERPALGGWGMHDPPPQIFMKMQPVPAPKSLNSTTEAGFPSEHYLHQY FT AYCVMTVRMRWKTTTRTGPTRWNPQPTFGPPEATDHIPYILYDRLSTIHKTRGQFTNAY FT YEEPESVWTARGRVRHL" XX SQ Sequence 5100 BP; 1249 A; 1316 C; 1307 G; 1228 T; 0 other; gctgtaacac tgcctcctcg ctgcgatgga gtcctattcc cgcgctgtga ttcggctacc 60 atgggaaaac atctacgaag ccattcaaga ggcggcgtgg ccatccttgg ctgctgtgga 120 acctcaacga ccgggagatt tgccctatga ttggcctctg ctctatgacg aagaccgtcg 180 atacgtggtg gcctgtgatg ctctgtggtc aatcctccaa agacgcgcgg cagtcttcgg 240 tcgctgggct ggctacttac aactagagcc atctcaggcc ggcgggcccg gccgacatct 300 acacctgctt ctaagtgctc ctggcattcg aggacgaagt tggacggcat ttctccgcaa 360 cgctgtcgca gagtgggcgc gcaccaccgt tcatctcaac tacgttgatg ccatagacat 420 cccacggaac acgcacggaa gaattcttga agcggacgcg gactttgtat tcaggtacct 480 ggctcctaaa cttccactta gagaggtcac atgggcatgg actaatgagg atcaatttaa 540 accctttgct ctctgcgaac ctaagcgccg agagcttatt caacgagcta cgacccaaga 600 tcgagctaat ggcttagacg ggccacccgc taaaagaagc cgagcggccg atgaatttca 660 tcaattggtc cactttttag ctgataaggg cattgtggat ccagataaat ggatggcttt 720 attccccgat agctatatta cctggagtag ctctgctcag ggcaggcaac aggttaatag 780 tgcttgtgaa ttggctttac agatcatcct gacgcgtggc gtcttgtcca gatttttggc 840 gccgaatccg tccaacattt tccccgaaaa taatagagct gtggagcttc tgcgcatgca 900 gggtcatgat cctgtttctt ttgggcaatt agtgctagct tgggcggaca agcaactagg 960 caagcgcaat actttgtggt tttggggtcc cccgagcacc ggaaaaacta atcttgcctt 1020 agctattgcc agagctttgc cgcggtttgg gatggtcaat tggaccaatg aaaacttccc 1080 cttcaatgat gcgccgcata aatgtgtctt ggtgtgggac gagggtcgaa taacggccaa 1140 aattgttgag gctgtaaaga gtattctggg gggccaggcg gtacgggtgg atcagaaatg 1200 caagggctct gtaagtttgt ctcccactcc cgtattaatt acgtctaatg ctgacattcg 1260 atatgtacgt gatggaaaca ttgttactgg ggatcatgta aaggctttaa gtgagaggat 1320 ggtcattgtg catttctcta ctccatgccc cgccaatttt gggcttctga aagcggagga 1380 gattgttgat tggctaaact atgtaaagtc atgtcctggg agtatcactg ctgataccgt 1440 tcaggccacg tggggaacac gctccgcccc caacctattt gagataaagc ggaaagcccc 1500 acagacggcc agcccacttg gacctcaggc ggaggaacaa gaagaagcag ccgcatatcg 1560 ctgtcccagc agtcccgcga gcagtcgcag cagctcgccg gacatctttg gaatcacgaa 1620 gagccccgct cctctggagg acctttccag cgacagtagc agtgagtgca gcttaccttt 1680 cactcccagt aatgcggcat ggttcacccc tatgccgcct gcccgcccct tacaaccccc 1740 tctttttgga gtagattgga tctactccac acaatggaaa caacctgtgt gttgcttgga 1800 tcatgaaact gaaccttgta atttgtgcat agatatagcg gagcgctgcg tcttgtttcg 1860 ggtttccgaa ccagatcttc tgcggtgccc ggatcaccgg cacgaagaga acccgtttga 1920 cgtcttgctc tgccgccact gtcaagctct ttctgggtta gaaacgctgc aatctgctta 1980 ggtatggcgt tcaaccctct gactatgtcc tctcgcttgc tagtacctgt cacgcctgta 2040 agcaaattag acctgctcaa aaaaaagtgg tttgcttttc cggatgtatc gaaaatttta 2100 ctcgaggcat tgtcgcattc tgggtttggg gatcctaaaa aatggaaaga ggccgatgct 2160 gatatcattg aggctttgct tgatgaagcg ctgcgcttag ggccaagact agaaaagcca 2220 gcctggtttt atgatttgca aagagctatt gggttggcca ggttttctgc ttccttggag 2280 cagaccgtgt ttcttaatga aatgctaata aagcttacgc gtggtcctgt tgtaccgaaa 2340 tacccagagc cagatatcgt cattcgggac cctgcccctt tgactccaga ggtcgaggct 2400 ccaacctcta cccccgaaaa ctccccagat cagtcttcag ttgcatcaga cccagtagaa 2460 atggaagagg gttcctctac tcctattccg gacccagtgc cccagtcgga agatatggag 2520 acagaagaga caaccattcc cgatcaaccc ccccctcctt ctccccaaat agtagacgag 2580 gtagaggata tggcgatggg agtagaagac ctttctattg tagaagatgc ctccgagcaa 2640 caccagtcac ctgcagggga gcctacccca gatatcacat cttctgtcgg aaacagagat 2700 gatgaatcta gagaagagtc tagagaggct gatttacaag atctgtctgc tggcctggga 2760 gctgccgggg gtagcgctat tgctgctctt ggtagtgggc ttattcctgc ggcgacggtg 2820 gccacagcat atcctcggcc tgatcagttc ttgcgggact acttggcccg gtatgatcaa 2880 atgtatccta gtgggtcccg gtatccccca cgatgggagc agttaaagtc cttgtatgac 2940 aaggggatga cagttaagga ggtctgggac cttctcaaca aaaattccaa taactctaac 3000 ttacaggcaa aggataccga caaaaaacag acggccccgt cctcgtccag tgcccctcaa 3060 gagagtgcag cggccatggc ttcgggagat aagtcgggcg taaaccctag cggtgggagt 3120 gcccctttat cggccactgt atgggcctcc ggagctcagt tcgaggctga tcacgtgatc 3180 acccacatga gccgcaccgt cttcatccct tttcagcaag cacaccgcta tgagcctata 3240 gtttggcgcg ggagacgcac cgcagacggc tggctgtcat tttggcctga tcaccctgtc 3300 atcggctata aaaccccatg gttctacttg gacgtcaacg ccatcaatcg ccatttttct 3360 cctggcgaat ggcaagaggt actcgaaaga tatggtagca ttgtaccaga gagcatggaa 3420 ataatactgt ccgatttctg tattaaagat gtgagtgtgg tggacggaaa gaccacagtg 3480 actgacagca gcacgggcgg ggtgtgcgtg tttgtagatg acggctacaa atttccctat 3540 gtgctaggtc atagtcaaaa cactttaccc ggcccattac ccacagatat atattctccg 3600 cctcagtacg cctaccttac tacaggaaaa aaaactaagg tagccgcata tgcctctgga 3660 gaaggcccaa tgcccatgga ttccattgca atcccctctc aagaaactgc cttttatgtc 3720 ttagaaaact ccttttacac cattcaacgt gccggggggg gatttgccca ctcttataac 3780 ttcccctcct taaagccaat ttccttagaa ggcttttctc aacactggat gcttatggac 3840 aaccctctat atccctcccg tctgtgggtg cctgaaaaag tggggggcgc ttctaaatgg 3900 ggagcagtga aaaacgacga ttacgggaaa aaaccattaa attggatgcc tggtcccaac 3960 attccctctc acaccataga gcagagtgat caggctggac agagggttga attggatcga 4020 gacgtcgaag gtcaaaaagt gtggactggc acctcctttg gcagccgtcc agaaaataga 4080 tggtctatgc ggccgctggg cgttaatcag ccctatgcat atgatgctta tgaagatgaa 4140 acagacaaaa tagtgacagt ggatgccatt ggctacggaa ccgctaaggc ctcagcggct 4200 ttaggccaag atacagggga agttccagaa aatgcatccg tgggtcgcgt cccggatgac 4260 actgaatgta ataaacaagg gggcggggga aatcacctat ttcaagttaa gtctttggcc 4320 cataataact ttacagagca aatgaaaaac caaacagtac ccctgatgcc tggcagcgtt 4380 tggcagaatc gtgctctgca ctatgagtcg caaatttggg ctaaaattcc aaatgtagat 4440 ggtgagttta tgtgtgagcg accagcattg ggtggatggg ggatgcacga ccctccgcct 4500 caaatattca tgaaaatgca acctgtccca gctcccaagt cattaaattc cactacagag 4560 gcagggtttc cctcggagca ttatctgcac cagtatgcgt actgtgtcat gacggtccgc 4620 atgcggtgga agacaacaac ccgcacaggg cccactcgtt ggaaccctca acctaccttc 4680 ggacccccag aggctacaga ccacattccc tacattcttt atgaccgact ctcaaccata 4740 cataaaacac gagggcagtt cactaatgct tattatgaag agccagagag tgtatggacc 4800 gctcggggac gggtgcgtca cctgtgagtt gctttgtctg tttttggaac gcttttcaat 4860 aaactgagtg gccaagagct tatttgcgtc cgcgtgttca cttagggcga ctttgctcat 4920 ttgcatgttc ccgcccagac acgcccacgg gggctggttg taagctacta atcatccccc 4980 tcgcctggcg ccgcgccaag ggcggagcct ggcacagcgc cagcatgacg tcactaaaat 5040 gacgtcactt ccgcttccgg gtcaagggga ggagcttggc agaatgccag gcgcagtctg 5100 // ID MG026728; SV 1; linear; genomic DNA; STD; VRL; 5333 BP. XX AC MG026728; XX DT 31-JAN-2018 (Rel. 135, Created) DT 31-JAN-2018 (Rel. 135, Last updated, Version 1) XX DE Bovine parvovirus 3 isolate ujs497 nonstructural protein and structural DE protein genes, complete cds. XX KW . XX OS Bovine parvovirus 3 OC Viruses; Parvoviridae; Parvovirinae; Erythroparvovirus. XX RN [1] RC Publication Status: Online-Only RP 1-5333 RX DOI; .1186/s12985-018-0923-9. RX PUBMED; 29334978. RA Wang H., Li S., Mahmood A., Yang S., Wang X., Shen Q., Shan T., Deng X., RA Li J., Hua X., Cui L., Delwart E., Zhang W.; RT "Plasma virome of cattle from forest region revealed diverse small circular RT ssDNA viral genomes"; RL Virol J 15(1):11-11(2018). XX RN [2] RP 1-5333 RA Wang H., Zhang W.; RT ; RL Submitted (01-OCT-2017) to the INSDC. RL School of Medicine, Jiangsu University, 301 Xuefu Road, Zhenjiang, Jiangsu RL 212013, China XX DR MD5; 91bc586f3c57e50a4a510effa8a09d93. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. 8.1 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..5333 FT /organism="Bovine parvovirus 3" FT /isolate="ujs497" FT /mol_type="genomic DNA" FT /country="China" FT /isolation_source="Cattle blood" FT /collection_date="2015" FT /db_xref="taxon:172297" FT CDS 210..2165 FT /codon_start=1 FT /product="nonstructural protein" FT /note="NS" FT /db_xref="GOA:A0A2I7YUT1" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR014835" FT /db_xref="InterPro:IPR027417" FT /db_xref="UniProtKB/TrEMBL:A0A2I7YUT1" FT /protein_id="AUS83807.1" FT /translation="MESYSRAVIRLPWENIYEAIQEAAWPSLAAVEPQRPGDLPYDWPL FT LYDEDRRYVVAWDTLWSILKRLAAGFGRWAGYLQLEPSQAGGPRRHLHLLLSAPGIRGR FT SWTAFLRNAVAEWARTTVHLNYADAIDIPRNTHGRILEADADFVFRYLAPKLPLREVTW FT AWTNEDQFKPFALCEPKRRELIQRATTQDRANGLDGPPAKRSRAADEFHQLVHFLADKG FT IVDPDKWMALFPDSYITWSSSAQGRQQVNSACELALQIILTRGVLSRFLAPNPSNIFPE FT NNRAVELLRMQGHDPVSFGQLVLAWADKQLGKRNTLWFWGPPSTGKTNLALAIARALPR FT FGMVNWTNENFPFNDAPHKCVLVWDEGRITAKIVEAVKSILGGQAVRVDQKCKGSVSLS FT PTPVLITSNADIRYVRDGNIVTGDHVKALSERMVIVHFSTPCPANFGLLKAEEIVDWLN FT YVKSCPGSITADTVQATWGTRSAPNLFEIKRKAPQTASPFGPQAEEQEEAAVYRCPSSP FT ASSRSSSPDIFGITKSPAPLEDLSSDSGSECSLPFTPSNAAWFTPMPPAHPLQPPLFGV FT DWIYSTQWKQPVCCLDHETEPCNLCIDIAERCVLFRVSEPDLLRCPDHRHEENPFDVLL FT CRHCQALSGLETLQSA" FT CDS 2168..5011 FT /codon_start=1 FT /product="structural protein" FT /note="VP" FT /db_xref="GOA:A0A2I7YUS5" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR016184" FT /db_xref="InterPro:IPR036952" FT /db_xref="UniProtKB/TrEMBL:A0A2I7YUS5" FT /protein_id="AUS83808.1" FT /translation="MAFNPLTMSSRLLVPVTPVSKLDLLKKKWFAFPDVSKILLEALSH FT SGFGDPKKWKEADADIIEALLDEALRLGPRLEKPAWFYDLQRAIGLARFSASLEQTVFL FT NEMLIKLTRGPVVPKYPEPKIVIRDPAPLTPDVEAPTSTPENSPDQSSVASDPVEMEEG FT SFNPIPDPVPQSEDMETEETTIPDQPPPLSPQIVDEVEDMAMGVEDLSIVEDASEQHQS FT PAGEPTPDITSSVGNRDDESREESREADLQDLSAGLGAAGGSAIAALGGGLIPAATVAT FT AYPRPDQFLRDYLARYDQMYPSGSRYPPRWEQLKSLYDKGMTVKEVWDLLNKNSNNSNL FT QAKDTDKNQTAPSSPSAPQESAAAMASGDKSGVNPSGGSAPLSATVWASGAQFEADHVI FT THMSRTVFIPFQQAHRYEPIVWRGRRTADGWLSFWPDHPVIGYKTPWFYLDVNAINRHF FT SPGEWQEVLERYGSIVPESMEIILSDFCIKDVSVVDGKTTVTDSSTGGVCVFVDDGYKF FT PYVLGHSQNTLPGPLPTDIYSPPQYAYLTTGKKTKVAAYASGEGPMPMDSIAIPSQETA FT FYVLENSFYTIQRAGGGFAHSYNFPSLKPISLEGFSQHWMLMDNPLYPSRLWVPEKVGG FT ASKWGAVKNDDYGKKPLNWMPGPNIPSHTIEQSDQAGQRVELDRDVEGQKVWTGTSFGS FT RPENRWSMRPLGVNQPYAYDAYEDETDKIVTVDAIGYGTAKASAALGQDTGEVPENASV FT GRVPDDTECNKQGGGGNHLFQVKSLAHNNFTEQMKNQTVPLMPGSVWQNRALHYESQIW FT AKIPNVDGEFMCERPALGGWGMHDPPPQIFMKMQPVPAPKSLNSTTEAGFPSEHYLHQY FT AYCVMTVRMRWKTTTRTGPTRWNPQPTFGPPEATDHIPYILYDRLSTIHKTRGQFTNAY FT YEEPESVWTARGRVRHL" XX SQ Sequence 5333 BP; 1311 A; 1396 C; 1361 G; 1265 T; 0 other; cgccaggcga gggggatgat tagtagctta caaccagccc ctgtgggtgt gtctgggcgg 60 ggacatgcaa atgagcaaag tcgccctaag tgaacacgcg gacgcaaata agctcttggc 120 gcaactgtct ctgctctata ccgtcttcgg tactgagctg taacacttga gctgtaacac 180 tcgagctgta acactgccta ctcgctgcga tggagtccta ttcccgggct gtgattcgtc 240 taccatggga aaacatctac gaggccattc aagaggcggc gtggccgtcc ttggctgctg 300 tggaacctca acgaccggga gatttgccct atgattggcc cctactctat gacgaagatc 360 gtcgatacgt ggtggcctgg gatacactgt ggtcaatcct caaaagactc gcggcaggct 420 tcggtcgctg ggctggctac ttacaactag agccatctca ggccggcggc ccccgccgac 480 acctccattt gcttctaagt gcccctggca ttcgaggacg aagttggacg gcatttctcc 540 gcaacgctgt cgcagaatgg gcgcgcacca ccgttcatct caactacgct gacgccatag 600 atatcccacg gaacacacac ggaagaattc ttgaagcgga cgcggacttt gtattcaggt 660 acctggctcc taaacttccg cttagagagg tcacatgggc atggactaat gaggatcaat 720 tcaaaccctt tgctctctgc gaacctaagc gccgagagct tattcaacga gctacaaccc 780 aagatcgagc taatggctta gacgggccgc ccgctaaaag aagccgagcg gccgatgaat 840 ttcatcaact ggtccacttt ttagctgata agggcattgt ggatccagat aaatggatgg 900 ctttattccc cgatagctat attacctgga gtagctccgc tcagggcagg caacaggtta 960 atagtgcttg tgaattggct ctacagatca tcctgacgcg tggcgtcttg tccagatttt 1020 tggcgccgaa tccgtccaat attttccccg aaaataatag agcggtggaa cttctgcgca 1080 tgcagggtca tgatcctgtt tcctttgggc aattagtact agcttgggca gataaacaac 1140 taggcaagcg caatactttg tggttttggg gtcccccgag cacaggaaaa actaatcttg 1200 ccttagctat tgccagagct ttgccgcggt ttgggatggt caattggacc aatgaaaact 1260 ttccgttcaa tgatgcgccg cataaatgtg tcttggtgtg ggatgagggt cgaataacag 1320 ccaaaattgt tgaggctgta aagagcattc tgggaggcca ggcagtgcgg gtggaccaaa 1380 aatgcaaggg ctctgtaagt ttgtctccca ctcccgtatt aatcacgtct aatgctgaca 1440 ttcgatacgt gcgcgatgga aacattgtta ctggggatca tgtaaaggct ttaagtgaga 1500 ggatggtcat tgtgcatttc tccactccat gccccgccaa ttttgggctt ttgaaagcgg 1560 aggagattgt tgattggcta aactatgtaa agtcatgtcc tgggagcatc actgctgata 1620 ccgttcaggc cacgtgggga acacgctccg cccccaactt atttgagata aagcggaaag 1680 ccccacagac ggctagccca tttggacctc aggcggagga acaagaagaa gcagccgtat 1740 atcgctgtcc tagcagtccc gcaagcagtc gcagcagctc gccggacatc tttggaatca 1800 cgaagagccc agctcctctg gaggaccttt ccagcgacag tggcagtgag tgcagcttac 1860 ctttcactcc cagtaatgcg gcatggttca cccctatgcc gcctgcccac cccttacaac 1920 cccctctttt tggagtagat tggatctact ccacacaatg gaaacaacct gtatgttgct 1980 tggatcatga aactgaacct tgtaatttgt gcatagatat agcggagcgc tgtgtcttgt 2040 ttcgggtttc cgaaccagat cttctgcggt gcccggatca ccggcacgaa gagaacccgt 2100 ttgacgtctt gctctgccgc cactgtcaag ctctttctgg gttagaaacg ctgcaatccg 2160 cttaggtatg gcgttcaatc ccctgactat gtcctctcgc ttgctagtac ctgtcacgcc 2220 tgtaagcaaa ttagacctgc tcaaaaaaaa gtggtttgct tttccggatg tatcgaaaat 2280 tttactcgag gcattgtcgc attctgggtt tggggaccct aaaaaatgga aagaggccga 2340 tgctgatatc attgaggctt tgcttgatga agcgctgcgc ttagggccaa gactagagaa 2400 gccagcgtgg ttttatgatt tgcaaagagc tattgggttg gccaggtttt ctgcttccct 2460 ggagcagacc gtgtttctta atgaaatgct aataaagctt acgcgtggtc ctgttgtacc 2520 gaaatacccc gagccaaaaa tcgtcattcg ggaccctgcc cctttgactc cagatgttga 2580 ggctccaacc tctacccccg aaaactcccc agatcagtct tcagttgcat cagacccagt 2640 agaaatggaa gagggttcct ttaatcctat cccggaccca gtaccccagt cggaagatat 2700 ggagacagaa gaaacaacca ttcccgatca gccccccccc ctctctcccc aaatagtaga 2760 cgaggtagag gatatggcga tgggagtaga agacctttct attgtagaag atgcctccga 2820 gcaacaccag tcacctgcag gggagcctac cccagatatc acatcttctg tcggaaacag 2880 agatgatgaa tctagagaag agtctagaga ggctgattta caagatctgt ctgctggcct 2940 gggagctgcc gggggtagcg ctattgctgc tcttggtggt gggcttattc ctgcggcgac 3000 ggtggccaca gcatatcctc ggcctgatca gttcttgcgg gactacttgg cccggtatga 3060 tcaaatgtat cctagtgggt cccggtatcc cccacgatgg gagcagttaa agtccttgta 3120 tgacaaggga atgacagtta aggaggtctg ggaccttctc aacaaaaatt ccaataactc 3180 taacttacag gcaaaggata ccgacaaaaa ccagacggcc ccgtcctcgc ccagtgcccc 3240 tcaagagagt gcagcggcca tggcttcggg agataagtcg ggcgtaaacc ctagcggtgg 3300 gagtgcccct ttatcggcca ctgtatgggc ctccggagct caattcgagg ctgatcatgt 3360 gatcacccac atgagccgca ccgtcttcat cccttttcag caagcacacc gctatgagcc 3420 tatagtttgg cgcgggagac gcaccgcaga cggctggcta tcattttggc ctgaccaccc 3480 tgtcatcggc tataaaaccc catggttcta cctggacgtc aacgccatca atcgccattt 3540 ttctcctggc gaatggcaag aggtactcga aagatatggt agcattgtgc cagagagcat 3600 ggaaataata ctgtccgatt tctgtattaa agatgtgagt gtggtggacg gaaagaccac 3660 agtgactgac agcagcacgg gcggggtgtg cgtgtttgta gatgacggct acaaatttcc 3720 atatgtgcta ggtcatagtc aaaacacttt gcccggccca ttacccacag atatatattc 3780 cccacctcag tacgcttacc ttactacagg gaaaaaaact aaggtagccg catatgcctc 3840 tggagaaggt ccaatgccca tggattccat tgcaatcccc tctcaagaaa ctgcctttta 3900 tgttttagaa aactcctttt acaccattca acgtgccgga gggggatttg cccactccta 3960 taacttcccc tccttaaagc caatttcctt agaaggcttt tctcaacact ggatgctcat 4020 ggacaaccct ttatatccct cccgtctgtg ggtgcctgaa aaagtggggg gcgcttctaa 4080 gtggggggca gtgaaaaacg atgattacgg gaaaaagcca ttaaattgga tgcctggtcc 4140 caacattccc tcccacacta tagagcagag tgatcaggct ggacagaggg ttgaattgga 4200 tcgagacgtc gaaggtcaaa aagtgtggac tggcacgtcc tttggcagcc gtccagaaaa 4260 tagatggtcc atgcggccac tgggcgtcaa tcagccctat gcatatgatg cttatgaaga 4320 tgaaacagat aaaatagtga cagtggacgc cattggctac ggaaccgcta aggcctccgc 4380 ggctctaggc caagatacag gtgaagttcc agaaaatgca tccgtgggtc gcgtcccgga 4440 tgacactgaa tgtaataaac aaggcggcgg gggaaatcac ctatttcaag ttaagtcttt 4500 ggcccataat aactttacag agcaaatgaa gaaccaaaca gtgcccctga tgcccggcag 4560 cgtttggcag aatcgcgctc tgcactatga gtcgcaaatt tgggctaaaa ttccaaatgt 4620 agatggggag tttatgtgtg agcggccagc attgggtggg tggggcatgc acgaccctcc 4680 gcctcaaata tttatgaaaa tgcaacctgt cccagctccc aagtcattaa attctaccac 4740 agaggcaggg tttccctcgg agcattacct gcaccagtat gcgtactgtg tcatgactgt 4800 ccgcatgcgt tggaagacaa caacccgcac agggcccact cgttggaacc ctcaacctac 4860 cttcggaccc ccagaggcta cagaccacat tccctacatt ctgtatgacc gcctgtcaac 4920 catacataaa acacgagggc agttcactaa tgcttattat gaagagccag agagtgtatg 4980 gaccgctcgg ggacgggtgc gtcacctgtg agttgctttg tctgtttttg gaacgctttt 5040 caataaatga gtggccaaga gcttatttgc gtccgcgtgt tcacttaggg cgactttgct 5100 catttgcatg tccccgccca gacacaccca caggggctgg ttgtaagcta ctaatcatcc 5160 ccctcgcctg gcgccgcgcc aagggcggag cttggcacag cgccagcatg acgtcactaa 5220 attgacgtca cttccgcttc cgggtcaagg ggaggagctt ggcagaatgc caggcgcagt 5280 ctgatgacgt cacgccacgc ccccctacgt cacttccggc cacgcccact cca 5333 // ID MG026729; SV 1; linear; genomic DNA; STD; VRL; 5573 BP. XX AC MG026729; XX DT 31-JAN-2018 (Rel. 135, Created) DT 31-JAN-2018 (Rel. 135, Last updated, Version 1) XX DE Bovine parvovirus - 2 isolate ujs2665 nonstructural protein and structural DE protein genes, complete cds. XX KW . XX OS Bovine parvovirus - 2 OC Viruses; Parvoviridae; Parvovirinae; Copiparvovirus. XX RN [1] RC Publication Status: Online-Only RP 1-5573 RX DOI; .1186/s12985-018-0923-9. RX PUBMED; 29334978. RA Wang H., Li S., Mahmood A., Yang S., Wang X., Shen Q., Shan T., Deng X., RA Li J., Hua X., Cui L., Delwart E., Zhang W.; RT "Plasma virome of cattle from forest region revealed diverse small circular RT ssDNA viral genomes"; RL Virol J 15(1):11-11(2018). XX RN [2] RP 1-5573 RA Wang H., Zhang W.; RT ; RL Submitted (01-OCT-2017) to the INSDC. RL School of Medicine, Jiangsu University, 301 Xuefu Road, Zhenjiang, Jiangsu RL 212013, China XX DR MD5; e34f3f67c0bfe659a41ac5d0e279d16c. DR EuropePMC; PMC5769433; 29334978. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. 8.1 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..5573 FT /organism="Bovine parvovirus - 2" FT /isolate="ujs2665" FT /mol_type="genomic DNA" FT /country="China" FT /isolation_source="cattle" FT /collection_date="2015" FT /db_xref="taxon:172296" FT CDS 480..2093 FT /codon_start=1 FT /product="nonstructural protein" FT /note="NS" FT /db_xref="GOA:A0A2I7YUV1" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR014835" FT /db_xref="InterPro:IPR027417" FT /db_xref="UniProtKB/TrEMBL:A0A2I7YUV1" FT /protein_id="AUS83809.1" FT /translation="MSYYTFVVTIPDKIEDYGQAYFDVVNRVALTHRDNEKWVYDPEDR FT EIIFKQAQVYVDCLERRLKQMLVSGSMYNYFIQLEQGEKHKRWHIHVVLDISVGNPRNC FT REVIEKVEYEYNSICYGRPVRQTQINRTSNGSWKIETEDFIINYLLCKIPPKEAKYAWT FT NIKNKIGDACLNIDLRKEISVRPDIDVPMWRQASGDRMRDLVEWCIKNHVFTEDQYMAK FT FEESYYSFACTNQGRHMLQTSLELAAKRTTSMIPLGVYLAGFANIQEAIEEANKTEAAD FT IYNNKVFDLLQYQGYDPVVAGYIIYAWSIRATGRRGALWFYGPGQTGKSIMARAMATCS FT VRYGCVNWTNSNFPFQDLATNCQIGWWEEGVITEDIVESAKALLSGGKIRVDRKCRDSV FT EITPPPFVITSNNDMTLVQGGNQVSFVHKKPLEDRMIKFNFNKRLPANFGIVEREDMKQ FT FFKWSSFIYYNKLLPKEYNYLSDPKLIGHVVPYTSYLRQTAKVEKIVRPFEEQEQEDLR FT ELDSWFADPPYMGQPSKKSE" FT CDS 2440..5535 FT /codon_start=1 FT /product="structural protein" FT /note="VP" FT /db_xref="GOA:A0A2I7YUU3" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR013607" FT /db_xref="InterPro:IPR016184" FT /db_xref="InterPro:IPR036952" FT /db_xref="UniProtKB/TrEMBL:A0A2I7YUU3" FT /protein_id="AUS83810.1" FT /translation="MIQGLGGAAGINLRQELEGKRLYLDKYKYPDNYIFREANQEQEMK FT KIRQLMAETNTDITETPDHKWYFNKYPVEGPLKHSDDVNYHGLSYVRLPSRDLIEGKHL FT VPPEQRMGGEQLIGLGGMPESHDLILEPMTNDPRFHKPKPPGDDIPKPLTHWQLANKIG FT PDYARQLAEMSGKDFTGRQPPIQIPDPGIRAGLQELEEQSKMLERVQAGEGENEPRGGL FT TLPKHRYVGPGGELPAGRPMSKLDEIAARHDIGYHTEIKHGHNPYYWYNFYDEQMIKDI FT KDNMDTIANEGESWLANFILSTWKAKASFMNPLGILLEHIQPDWNTYYDPTNTHHKQWV FT AFQRALTQHGTVSTERPATPPPADPDTSPPPAKKQRTESPNKSASCSAEMTSSCEISST FT QCTNDEDMEYNQTTVCGMPSSTTDSTIGNTGVQGTGDCAGGGGGSQRCTNKWLGGISWG FT NNTFTTYQTRRCILQPFTNKYTFTSSVDSTPGITVTTPWYYIDLNCFYSHIPPSTMQEI FT IETTDGFKPLTLTVTITEIVGKDVSCTTTSSGVPNTVTDSQTATILLHRDDHYELPYVL FT GGGQETVPEHLPGDWYKLPQYCYTTVGMESPWGNWSQRPDDCSGNAWTVQCTNYFSTQD FT SELFLLENLVNTQLHPGCSWTSTYRFPHLPMAYTTQYPWSTRRQDNPLQKQRIVAVRNC FT CSTQSTDVNKTKQVIEVDQDQADMGFFRKPTMWLPANRHRDGDCQIIPPKDRDFHLLPV FT RSGLPPVIVVRQGIFNPLPATGVFGLTSGTEQPGPPTEDKAVRTPGGTTVLTSNTLTVK FT RKHKYKNQNHDVHTVYEGKQEQKRLYQLVIQTQRGVGGPAEPDHVQERIIGNTGEKLPG FT SRYPLQSEITYGQHTGAVEESESGFYEFQIWERNPNTDLGKGGHKPPLAQWAMEKPPPT FT IYLRMLPMPCAPCKNKYTKSPGMKGIINSYVTFQLQYSIKWAYTPRTHTRRWNPTSPAL FT LPPPLPGSTVVYNLDSQKFSTDNQYTLAAESWQFKNRLRHNR" XX SQ Sequence 5573 BP; 1932 A; 1241 C; 1234 G; 1166 T; 0 other; tgtgtgcccc ggcgcgctag tgtttggccg ctggctagcc agcggcctaa gctaggccgg 60 ggcacacact cttctgcgtg tgtgtcgcaa cccagttccg gttccgggtc acaggtcacc 120 ggaagtgacc tttgacccca tgacgtagtt ccggtgacct ttgacctttc cggtacttcc 180 gggtcacacc cggtagttcc gggtcaagtg acgtagttcc ggtgacgtgg cactcacggc 240 aagggggttg ggttacgagt gacgaaaaaa tttacataag aggaaggaat attaatgaga 300 tggtgaaaca atgtctatat atggtaaagc tctgtgattg gctgtctaaa tatggtaaaa 360 cttaatgtga tttgctaccg tcatatccgg ggaacggcta tataacactg aacacggaag 420 taacagctca ctcttctctt gactgtcaga gggagaggac gtctcttggt gagtacaaaa 480 tgagctacta cacttttgtc gttacaatac cagataaaat tgaagactat ggacaagctt 540 actttgatgt tgttaacaga gtagcgttga ctcataggga caatgaaaag tgggtatatg 600 atccagagga cagggaaatt atttttaaac aggcacaggt atatgtagac tgcttagaaa 660 gacgacttaa acagatgtta gtttcaggat ctatgtacaa ctactttatt cagctggaac 720 agggagaaaa gcacaagcgg tggcatattc acgtggttct agacattagc gtggggaatc 780 ctagaaactg tagagaagtg atagaaaaag tagaatatga atacaacagt atctgctacg 840 gaaggccagt tagacagact caaataaaca gaacaagtaa cgggagttgg aaaatagaga 900 ctgaagactt tattataaac tacttactgt gcaagatacc acctaaagaa gcaaagtatg 960 catggacaaa cataaagaac aagataggag acgcatgttt aaacattgac ttaagaaaag 1020 aaataagtgt aagaccagat attgacgtgc caatgtggag acaagctagc ggagacagga 1080 tgagagactt ggtagaatgg tgtattaaaa accatgtatt tactgaagac cagtacatgg 1140 ctaagtttga ggaaagctac tacagctttg catgtacaaa ccaagggcga cacatgctgc 1200 aaacaagctt agaacttgct gctaaaagaa ctactagcat gataccttta ggagtatact 1260 tggcaggctt tgcaaacatt caagaagcaa tagaagaggc aaacaaaaca gaagcagcag 1320 acatatataa caacaaggta tttgacctgc ttcaatacca gggttatgac ccagtagtag 1380 ctggatacat aatatatgca tggagtataa gggctacagg gcgcagagga gcactatggt 1440 tttacggacc aggacaaact ggcaaaagca tcatggctag agcaatggca acttgcagtg 1500 tgcgatacgg gtgtgtgaac tggacaaact caaactttcc atttcaagac ttagctacaa 1560 actgccagat aggatggtgg gaagaagggg taattacaga agacatagta gaaagtgcaa 1620 aagcactttt aagtggaggc aaaataaggg tagatagaaa gtgtagagac agtgtagaaa 1680 ttacaccgcc accgtttgtc ataactagta acaatgacat gaccttagtg caaggaggaa 1740 atcaagtaag ctttgtccat aaaaaacctt tagaggacag gatgattaaa tttaacttta 1800 acaagagact tcctgctaac tttggcatag tagagagaga agacatgaaa caatttttca 1860 aatggagcag ttttatctac tacaacaaac tgctaccaaa agaatacaat tacctcagcg 1920 atccaaaact aataggccac gtagtaccat acactagcta cttaagacaa acagcaaaag 1980 tagaaaagat tgtaaggcca tttgaagaac aggaacaaga agaccttcga gaactagact 2040 cttggtttgc ggacccccca tacatggggc agccctcaaa aaagagtgag tagctatata 2100 aattttgtac tcactttgtg tgcgtgacac gtcactgaca cacaatgctt tttgcagaag 2160 tgcgctttga agacaaaccg ttattgccgc aagaagaact tgaccacatc tgtcacaacc 2220 aagagtgttg ccagtacccg tcgtgcgcag gatttcaaca gtaagtgaaa tcttgcatta 2280 cttattgcgt aacgcagtgg cttgcgggta cttagcaatg catggtttcc cttacaggcc 2340 accgtctacg tcctctcacc ttcctgaact aacagacgaa gaagtctgta gcatactaga 2400 cagtgaggaa gggtgggacg gagacaactt tgcagaacaa tgattcaggg cctagggggc 2460 gcagcaggca ttaacctgcg acaagagcta gaaggaaaaa gactatactt agataaatac 2520 aaatacccag acaattacat atttagagaa gctaatcaag aacaagaaat gaaaaaaatt 2580 cgacagctta tggcagaaac aaacacagac attacagaga caccagatca taaatggtac 2640 tttaacaagt acccagtaga aggaccttta aaacactcag acgacgtgaa ctaccacggt 2700 ctttcttatg taagactccc aagtagggac ttaatagaag gaaaacactt ggtgccccct 2760 gaacagagaa tgggagggga acaactcata ggtcttggag gaatgcctga atcgcacgac 2820 ctcattttag aacctatgac taatgaccct cgcttccaca aaccgaaacc acctggggac 2880 gacataccaa aaccgcttac gcattggcag cttgcaaaca aaatagggcc tgactacgct 2940 agacagctag cagaaatgtc aggaaaagac tttacagggc gacagcctcc aatacaaata 3000 ccagatcctg gaattagagc agggctacaa gaactagaag aacaatctaa aatgctggaa 3060 agagtacaag ccggtgaagg ggaaaacgaa cccagaggcg gtcttactct tcctaaacac 3120 aggtatgtcg gacctggagg ggaactcccg gcagggcgtc cgatgtccaa actggatgaa 3180 attgctgctc gacatgatat tgggtatcac actgaaatta aacacgggca taacccatat 3240 tactggtata acttttatga tgaacaaatg ataaaagaca ttaaggataa catggataca 3300 atagccaatg aaggggaatc atggctagct aactttattt tatccacatg gaaagcaaaa 3360 gcttccttta tgaatccact aggtatacta ctagaacaca tacaaccaga ctggaatact 3420 tactatgacc caactaacac acatcacaaa caatgggttg catttcagcg tgccttaaca 3480 cagcacggga cggtgtcgac ggagagaccc gcgacgccac caccagcaga tccagacacg 3540 agtccaccac cagcgaagaa acagagaact gaaagcccta ataaatctgc ttcttgctct 3600 gcagaaatga cgtcgtcttg tgaaatttct tccacacaat gcaccaacga cgaggacatg 3660 gaatataacc aaactaccgt ctgcggtatg cctagctcta ccacagacag caccattgga 3720 aacaccgggg tgcagggaac aggtgactgc gcagggggag gggggggatc gcaacgctgt 3780 acaaacaaat ggcttggagg catatcgtgg ggaaacaaca catttaccac ataccaaacg 3840 cgccgttgca tacttcaacc tttcactaac aagtacacat tcacaagcag cgtagacagc 3900 acgccaggaa taacggtgac aacgccttgg tactacattg acttaaactg cttttacagt 3960 cacattccac catctacaat gcaagaaata attgaaacca cagacggatt taaaccactt 4020 acactaacgg ttacaattac agaaattgta ggaaaagacg taagttgcac tactactagt 4080 tctggagtcc caaacacagt cacagactca caaacagcaa ccatcctttt acacagggac 4140 gaccactacg aactgccata tgtactggga gggggacaag aaacagttcc agaacactta 4200 ccaggagact ggtacaaact gccacagtac tgctatacta cagtaggcat ggaaagccca 4260 tggggtaact ggtcacagcg acctgacgac tgctcaggaa acgcgtggac agtgcaatgc 4320 acaaactatt ttagcacaca agacagcgaa cttttcctct tagaaaactt ggtaaacacc 4380 caacttcacc cagggtgctc atggacaagc acatacaggt tcccacattt acctatggca 4440 tacaccacac agtacccatg gagcacacga agacaagaca acccactgca aaagcagaga 4500 atagtggcag tacgcaactg ctgcagtaca caatcaacag acgtaaacaa aactaaacaa 4560 gtgatagaag ttgaccagga ccaagcagat atggggtttt ttagaaagcc cacaatgtgg 4620 ctgccagcaa acagacaccg agacggagac tgccaaataa tcccaccaaa agacagagac 4680 tttcacctac tgcctgtgcg ctcgggcctg ccaccagtaa tagtagtacg gcaaggcata 4740 tttaacccgc tgccagcaac gggagtcttt ggactcacca gtggcacaga acaaccagga 4800 cctcctacag aggataaagc agttagaacg ccagggggca ccacagtctt aacttctaat 4860 acactcacag taaaacgaaa acacaaatac aagaatcaaa accacgacgt tcacacagtg 4920 tatgaaggaa aacaagaaca aaaacgacta taccaattgg tcatacaaac acaaagggga 4980 gtagggggtc cagcagagcc tgaccacgtt caagaacgaa taataggaaa cacgggggaa 5040 aaactaccag gcagcagata cccactacaa tcagaaatta catacggaca acacacggga 5100 gcagtagaag aaagcgaatc tgggttttat gagtttcaaa tatgggaaag aaacccaaat 5160 acagacttag gaaaaggagg ccataaaccc ccactggccc aatgggccat ggaaaagccc 5220 ccacctacta tataccttag aatgctccca atgccctgtg ctccatgtaa aaacaaatac 5280 acaaaaagcc caggaatgaa aggaattata aatagttacg taacatttca actgcagtac 5340 tcaattaaat gggcttacac ccccaggacc cacacccgaa ggtggaaccc cacaagtcca 5400 gcacttcttc caccaccact tcctggttct accgtcgtgt acaaccttga ctcccagaaa 5460 ttctccaccg acaatcaata caccctcgca gctgaatcct ggcaattcaa aaacaggtta 5520 agacataaca gatgattatg taattgtagt cattgcctta atgtcaatgt atg 5573 // ID MG027859; SV 1; linear; viral cRNA; STD; VRL; 15232 BP. XX AC MG027859; XX PR Project:PRJNA227457; XX DT 03-JAN-2018 (Rel. 135, Created) DT 03-JAN-2018 (Rel. 135, Last updated, Version 6) XX DE UNVERIFIED: Respiratory syncytial virus type A isolate RSV-A/US/BID-V8366, DE partial genome. XX KW UNVERIFIED. XX OS Respiratory syncytial virus type A OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Pneumoviridae; Orthopneumovirus. XX RN [1] RP 1-15232 RA Newman R.M., Zody M.C., DeVincenzo J.P., Grad Y., Lipsitch M., Murphy R., RA Fitzgerald M., Young S., Gargeya S., Poon T.W., Charlebois P., Weiner B., RA Yang X., Piper M.E., McCowan C., Ireland A., Levin J., Malboeuf C., Qu J., RA Chapman S.B., Murphy C., Wortman J., Nusbaum C., Birren B.; RT "Comparative Genomics of Respiratory Syncytial Virus for Broad Institute RT Viral Genomics Initiative"; RL Unpublished. XX RN [2] RP 1-15232 RA Newman R.M., Zody M.C., DeVincenzo J.P., Grad Y., Lipsitch M., Murphy R., RA Fitzgerald M., Young S., Gargeya S., Poon T.W., Charlebois P., Weiner B., RA Yang X., Piper M.E., McCowan C., Ireland A., Levin J., Malboeuf C., Qu J., RA Chapman S.B., Murphy C., Wortman J., Nusbaum C., Birren B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Infectious Disease Initiative, Broad Institute, 75 Ames Street, Cambridge, RL MA 02142, USA XX DR MD5; 1ade39ead1f2c264359a3e090ffc31e9. DR BioSample; SAMN02646076. XX CC GenBank staff is unable to verify sequence and/or annotation CC provided by the submitter. CC ##Assembly-Data-START## CC Assembly Method :: Vicuna v. 1 CC Assembly Name :: V8366-1 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..15232 FT /organism="Respiratory syncytial virus type A" FT /host="Homo sapiens" FT /isolate="RSV-A/US/BID-V8366" FT /mol_type="viral cRNA" FT /country="USA" FT /db_xref="taxon:1439707" FT 5'UTR 1..65 FT /note="indels in UTR have not been validated" FT gene 66..485 FT /gene="NS1" FT CDS 66..485 FT /codon_start=1 FT /gene="NS1" FT /product="non-structural protein 1" FT /db_xref="InterPro:IPR005099" FT /db_xref="PDB:5VJ2" FT /db_xref="UniProtKB/TrEMBL:X5FN71" FT /protein_id="AUH26155.1" FT /translation="MGSNSLSMIKVRLQNLFDNDEVALLKITCYTDKLIHLTNALAKAV FT IHTIKLNGIVFVHVITSSDICPNNNIVVKSNFTTMPVLQNGGYIWEMMELTHCSQPNGL FT IDDNCEIKFSKKLSDSTMTNYMNQLSELLGFDLNP" FT gene 595..969 FT /gene="NS2" FT CDS 595..969 FT /codon_start=1 FT /gene="NS2" FT /product="non-structural protein 2" FT /db_xref="InterPro:IPR004336" FT /db_xref="UniProtKB/TrEMBL:X5FP42" FT /protein_id="AUH26156.1" FT /translation="MDTTHNGTTPQRLMITDMRPLSLETIITSLTRDIITHRFIYLINH FT ECIVRKLDERQATFTFLVNYEMKLLHKVGSTKYKKYTEYNTKYGTFPMPIFINHDGFLE FT CIGIKPTKHTPIIYKYDLNP" FT gene 1107..2282 FT /gene="N" FT misc_feature 1107..2282 FT /gene="N" FT /note="similar to nucleoprotein" FT gene 2314..3039 FT /gene="P" FT CDS 2314..3039 FT /codon_start=1 FT /gene="P" FT /product="phosphoprotein" FT /db_xref="GOA:X5EYT2" FT /db_xref="InterPro:IPR003487" FT /db_xref="UniProtKB/TrEMBL:X5EYT2" FT /protein_id="AUH26157.1" FT /translation="MEKFAPEFHGEDANNRATKFLESIKGKFTSPKDPKKKDSIISVNS FT IDIEVTKESPITSNSTIINPTNETDDTAGNKPNYQRKPLVSFKEDPTPSDNPFSKLYKE FT TIETFDNNEEESSYSYEEINDQTNDNITARLDRIDEKLSEILGMLHTLVVASAGPTSAR FT DGIRDAMVGLREEMIEKIRTEALMTNDRLEAMARLRNEESEKMAKDTSDEVSLNPTSEK FT LNNLLEGNDSDNDLSLEDF" FT gene 3223..3993 FT /gene="M" FT CDS 3223..3993 FT /codon_start=1 FT /gene="M" FT /product="matrix protein" FT /db_xref="GOA:X5F6K9" FT /db_xref="InterPro:IPR005056" FT /db_xref="UniProtKB/TrEMBL:X5F6K9" FT /protein_id="AUH26158.1" FT /translation="METYVNKLHEGSTYTAAVQYNVLEKDDDPASLTIWVPMFQSSMPA FT DLLIKELANVNILVKQISTPKGPSLRVMINSRSAVLAQMPSKFTICANVSLDERSKLAY FT DVTTPCEIKACSLTCLKSKNMLTTVKDLTMKTLNPTHDIIALCEFENIVTSKKVIIPTY FT LRSISVRNKDLNTLENITTTEFKNAITNAKIIPYSGLLLVITVTDNKGAFKYIKPQSQF FT IVDLGAYLEKESIYYVTTNWKHTATRFAIKPMED" FT gene 4264..4458 FT /gene="SH" FT CDS 4264..4458 FT /codon_start=1 FT /gene="SH" FT /product="small hydrophobic protein" FT /db_xref="GOA:X5FML2" FT /db_xref="InterPro:IPR005327" FT /db_xref="UniProtKB/TrEMBL:X5FML2" FT /protein_id="AUH26159.1" FT /translation="MENTSITIEFSSKFWPYFTLIHMITTIISLLIIISIMIAILNKLC FT EYNVFHNKTFELPRARVNT" FT gene 4649..5545 FT /gene="G" FT CDS 4649..5545 FT /codon_start=1 FT /gene="G" FT /product="attachment protein" FT /db_xref="GOA:X5FQ75" FT /db_xref="InterPro:IPR000925" FT /db_xref="UniProtKB/TrEMBL:X5FQ75" FT /protein_id="AUH26160.1" FT /translation="MSKTKDQRTAKTLEKTWDTLNHLLFISSCLYKLNLKSIAQITLSI FT LAMIISTSLIIVAIIFIASANNKVTLTTAIIQDATSQIKNTTPTYLTQNPQLGISFFNL FT SGTISQTTAILAPTTPSVEPILQSTTVKTKNTTTTQIQPSKLTTKQRQNKPPNKPNDDF FT HFEVFNFVPCSICSNNPTCWAICKRIPSKKPGKKTTTKPTKKQTIKTTKKDLKPQTTKP FT KEAPTTKPTEKPTINITKPNIRTTLLTNSTTGNLEHTSQEETLHSTSSEGNTSPSQIYT FT TSEYLSQPPSPSNITDQ" FT gap 5596..5672 FT /estimated_length=77 FT gene <5673..7346 FT /gene="F" FT CDS <5673..7346 FT /codon_start=1 FT /gene="F" FT /product="fusion protein" FT /db_xref="GOA:A0A2H5CP38" FT /db_xref="InterPro:IPR000776" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP38" FT /protein_id="AUH26161.1" FT /translation="VSLCLASSQNITEEFYQSTCSAVSKGYLSALRTGWYTSVITIELS FT NIKENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQSTPAANNRARRELPRFMNYTLNN FT TKNNNVTLSKKRKRRFLGFLLGVGSAIASGIAVSKVLHLEGEVNKIKSALLSTNKAVVS FT LSNGVSVLTSKVLDLKNYIDKQLLPIVNKQSCSISNIETVIEFQQKNNRLLEITREFSV FT NAGVTTPVSTYMLTNSELLSLINDMPITNDQKKLMSNNVQIVRQQSYSIMSIIKEEVLA FT YVVQLPLYGVIDTPCWKLHTSPLCTTNTKEGSNICLTRTDRGWYCDNAGSVSFFPQAET FT CKVQSNRVFCDTMNSLTLPSEVNLCNIDIFNPKYDCKIMTSKTDVSSSVITSLGAIVSC FT YGKTKCTASNKNRGIIKTFSNGCDYVSNKGVDTVSVGNTLYYVNKQEGKSLYVKGEPII FT NFYDPLVFPSDEFDASISQVNEKINQSLAFIRKSDELLHNVNVGKSTTNIMITTIIIVI FT IVILLLLIAVGLFLYCKARSTPVTLSKDQLSGINNIAFSN" FT gene 7565..8149 FT /gene="M2" FT CDS 7565..8149 FT /codon_start=1 FT /gene="M2" FT /product="matrix protein 2-1" FT /db_xref="GOA:X5F060" FT /db_xref="InterPro:IPR000571" FT /db_xref="InterPro:IPR009452" FT /db_xref="InterPro:IPR036855" FT /db_xref="UniProtKB/TrEMBL:X5F060" FT /protein_id="AUH26162.1" FT /translation="MSRRNPCKFEIRGHCLNGKRCHFSHNYFEWPPHALLVRQNFMLNR FT ILKSMDKSIDTLSEISGAAELDRTEEYALGVVGVLESYIGSINNITKQSACVAMSKLLT FT ELNSDDIKKLRDNEEPNSPKIRVYNTVISYIESNRKNNKQTIHLLKRLPADVLKKTIKN FT TLDIHKSITINNPKESTVNDTNDHAKNNDTT" FT gene 8457..14954 FT /gene="L" FT CDS 8457..14954 FT /codon_start=1 FT /gene="L" FT /product="L polymerase" FT /db_xref="GOA:X5F520" FT /db_xref="InterPro:IPR014023" FT /db_xref="InterPro:IPR016269" FT /db_xref="InterPro:IPR025786" FT /db_xref="InterPro:IPR026890" FT /db_xref="InterPro:IPR039736" FT /db_xref="UniProtKB/TrEMBL:X5F520" FT /protein_id="AUH26163.1" FT /translation="MDPIINGNSANVYLTDSYLKGVISFSECNALGSYIFNGPYLKNDY FT TNLISRQNPLIEHINLKKLNITQSLISKYHKGEIKIEEPTYFQSLLMTYKSMTSSEQIA FT TTNLLKKIIRRAIEISDVKVYAILNKLGLKEKDKIKSNNEQDENNSVITTIIKDDILLA FT VKDNQSHLKAGKNHSTKQKDTIKTTLLKKLMCSMQHPPSWLIHWFNLYTKLNNILTQYR FT SNEVKNHGFILIDNHTLNGFQFILNQYGCIVYHKDLKRITVTTYNQFLTWKDISLSRLN FT VCLITWISNCLNTLNKSLGLRCGFNNVILTQLFLYGDCILKLFHNEGFYIIKEVEGFIM FT SLILNITEEDQFRKRFYNSMLNNITDAANKAQKNLLSRVCHTLLDKTVSDNIINGRWII FT LLSKFLKLIKLAGDNNLNNLSELYFLFRIFGHPMVDERQAMDAVKVNCNETKFYLLSSL FT SMLRGAFIYRIIKGFVNNYNRWPTLRNAIVLPLRWLTYYKLNTYPSLLELTERDLIVLS FT GLRFYREFRLPKKVDLEMIINDKAISPPKNLIWTSFPRNYMPSHIQNYIEHEKLKFSES FT DKSRRVLEYYLRDNKFNECDLYNCVVNQSYLNNPNHVVSLTGKERELSVGRMFAMQPGM FT FRQVQILAEKMIAENILQFFPESLTRYGDLELQKILELKAGISNKSNRYNDNYNNYISK FT CSIITDLSKFNQAFRYETSCICSDVLDELHGVQSLFSWLHLTIPHVTIICTYRHAPPYI FT RDHIVDLNNVDEQSGLYRYHMGGIEGWCQKLWTIEAISLLDLISLKGKFSITALINGDN FT QSIDISKPVRLMEGQTHAQADYLLALNSLKLLYKEYAGIGHKLKGTETYISRDMQFMSK FT TIQHNGVYYPASIKKVLRVGPWINTILDDFKVSLESIGSLTQELEYRGESLLCSLIFRN FT VWLYNQIALQLKNHALCNNKLYLDILKVLKHLKTFFNLDNIDTALTLYMNLPMLFGGGD FT PNLLYRSFYRRTPDFLTEAIVHSVFILSYYTNHDLKDKLQDLSDDRLNKFLTCIITFDK FT NPNAEFVTLMRDPQALGSERQAKITSEINRLAVTEVLSTAPNKIFSKSAQHYTTTEIDL FT NDIMQNIEPTYPHGLRVVYESLPFYKAEKIVNLISGTKSITNILEKTSAIDLTDIDRAT FT EMMRKNITLLIRIFPLDCNRDKREILSMENLSITELSKYVRERSWSLSNIVGVTSPSIM FT YTMDIKYTTSTIASGIIIEKYNVNSLTRGERGPTKPWVGSSTQEKKTMPVYNRQVLTKK FT QRDQIDLLAKLDWVYASIDNKDEFMEELSIGTLGLTYEKAKKLFPQYLSVNYLHRLTVS FT SRPCEFPASIPAYRTTNYHFDTSPINRILTEKYGDEDIDIVFQNCISFGLSLMSVVEQF FT TNVCPNRIILIPKLNEIHLMKPPIFTGDVDIHKLKQVIQKQHMFLPDKISLTQYVELFL FT SNKTLKSGSHVNSNLILAHKISDYFHNTYILSTNLAGHWILIIQLMKDSKGIFEKDWGE FT GYITDHMFINLKVFFNAYKTYLLCFHKGYGRAKLECDMNTSDLLCVLELIDSSYWKSMS FT KVFLEQKVIKYILSQDASLHRVKGCHSFKLWFLKRLNVAEFTVCPWVVNIDYHPTHMKA FT ILTYIDLVRMGLINIDRIYIKNKHKFNDEFYTSNLFYINYNFSDNTHLLTKHIRIANSE FT LENNYNKLYHPTPETLENILTNPVKCDDKKTLNDYCIGKNVDSIMLPLLSNKKLIKSST FT TIRTNYSKQDLYNLFPTVVIDKIIDHSGNTAKSNQLYTTTSHQISLVHNSTSLYCMLPW FT HHINRFNFVFSSTGCKISIEYILKDLKIKDPSCIAFIGEGAGNLLLRTVVELHPDIRYI FT YRSLKDCNDHSLPIEFLRLYNGHINIDYGENLTIPATDATNNIHWSYLHIKFAEPISLF FT VCDAELPVTVNWSKIIIEWSKHVRKCKYCSSVNKCTLIVKYHAQDDIDFKLDNITILKT FT YVCLGSKLKGSEVYLVLTIGPANVFPVFNVVQNAKLILSRTKNFIMPKKADKESIDANI FT KSLIPFLCYPITKKGINTALSKLKSVVSGDILSYSIAGRNEVFSNKLINHKHMNILKWF FT NHVLNFRSTELNYNHLYMVESTYPYLSELLNSLTTNELKKLIKITGSLLYNFHNE" FT 3'UTR 14955..15232 FT /note="indels in UTR have not been validated" XX SQ Sequence 15232 BP; 5878 A; 2668 C; 2364 G; 4245 T; 77 other; atattattat tagggcaaat aagaatttga taagtaccac ttaaatttaa ctcctttggt 60 tagagatggg cagcaattca ttaagtatga taaaagttag attacaaaat ttatttgaca 120 atgatgaagt agcattgtta aaaataacct gctatactga caaattgata catttaacta 180 atgctttagc taaggcagtg atacatacaa tcaaattgaa tggcattgta tttgtgcatg 240 ttattacaag tagtgatatt tgccctaata ataatattgt agtgaaatcc aacttcacaa 300 caatgccagt gttacaaaat ggaggttata tatgggaaat gatggaatta acacactgct 360 ctcaacccaa tggcctaata gatgacaatt gtgaaatcaa attctccaaa aaactaagcg 420 attcaacaat gaccaactat atgaatcaat tatctgaatt acttggattt gatctcaatc 480 cataaattat aacaaatatc aactagcaaa tcaatgtcaa taacaccatt agttaatata 540 aaacttgaca gaagataaaa atggggcaaa taaataaact cagctgaccc aaccatggac 600 acaacacaca atggtactac accacaaaga ctgatgatca cagacatgag accattgtca 660 cttgagacta taataacatc actaaccaga gacatcataa cacacagatt tatatacttg 720 ataaatcatg aatgtatagt gagaaaactt gatgaaagac aggccacatt tacattcctg 780 gtcaactatg aaatgaaact attgcacaaa gtgggaagca ctaaatacaa aaaatatact 840 gaatacaaca caaaatatgg cacttttcct atgccaatat ttatcaatca tgatgggttc 900 ttagaatgca ttggcattaa gcctacaaag cacactccca taatatacaa gtatgatctc 960 aatccatgaa tttcaacaca agattcacac aatctgaaat aacaacttca tgcataacta 1020 cactccatag tccaaatgga gcctgaaaat tatagtaatt taaaattaag gagagacata 1080 atatgaaaga tggggcaaat acaaaaaagg ctcttagcaa agtcaagttg aatgatacac 1140 tcaacaaaga tcaacttctg tcatccagca aatacaccat ccaacggagc acaggagata 1200 gtattgatac tcctaattat gatgtgcaga aacacatcaa caagttatgt ggcatgttat 1260 taatcacaga agatgctaat cataaattca ctggggtaat aggtatgtta tatgctatgt 1320 ctagattagg aagagaagac accataaaaa tactcagaga tgcgggatat catgtaaaag 1380 ctaatggagt ggatgtaaca acacatcgtc aagacattaa tggaaaagaa atgaaatttg 1440 aagtgttaac attggcaagc ttaacaactg aaattcaaat caacattgag atagaatcta 1500 gaaaatccta caaaaaaatg ctaaaagaaa tgggagaggt ggctccagaa tacaggcatg 1560 actctcctga ttgtggaatg ataatattat gtatagcagc attagtaata accaaattag 1620 cagcagggga tagatctggt cttacagccg tgattaggag agctaataat gttctaaaaa 1680 atgaaatgaa acgttataaa ggcttactac caaaggatat agccaacagt ttctatgaag 1740 tgtttgaaaa atatcctcac tttatagatg tttttgttca ttttggtata gcacaatctt 1800 ctaccagagg tggcagtaga gttgaaggga tttttgcagg attgtttatg aatgcctatg 1860 gtgcagggca agtgatgtta cggtggggag tcttagcaaa atcagttaaa aatattatgc 1920 taggacacgc tagtgtgcaa gcagaaatgg aacaagttgt ggaagtttat gaatatgccc 1980 aaaaattggg tggagaagca ggattctacc atatattgaa taacccaaaa gcatcattat 2040 tatctttgac tcaatttccc cacttctcca gtgtagtatt aggcaatgct gctggcctag 2100 gcataatggg agaatacaga ggtacaccaa ggaatcaaga tctatatgat gctgcaaagg 2160 catatgctga acaactcaaa gaaaatggtg tgattaacta cagtgtatta gacttgacag 2220 cagaagaact agaggctatc aaacatcagc ttaatccaaa agataatgat gtagagcttt 2280 gagttaataa aaaagtgggg caaataaatc atcatggaaa agtttgctcc tgaattccat 2340 ggagaagacg caaacaacag agccactaaa ttcctagaat caataaaggg caaattcaca 2400 tcacctaaag atcccaagaa aaaagatagt atcatatctg tcaactcaat agatatagaa 2460 gtaaccaaag aaagccctat aacttcaaat tcaaccatta taaaccctac aaatgagaca 2520 gatgatactg cagggaacaa gcccaattat caaagaaaac ctctagtgag tttcaaagaa 2580 gaccctacgc caagtgataa tcccttttca aaactataca aagaaaccat agaaacattt 2640 gataacaatg aagaagaatc tagctattca tatgaagaaa taaatgatca gacaaatgat 2700 aatataacag caagattaga taggattgat gaaaaattaa gtgaaatact aggaatgctt 2760 cacacactag tagtagcaag tgcaggacct acgtctgctc gggatggtat aagagatgcc 2820 atggttggtt taagagaaga aatgatagaa aaaatcagaa ctgaagcatt aatgaccaat 2880 gatagattag aagctatggc aaggctcagg aatgaggaaa gtgaaaagat ggcaaaagac 2940 acatcagatg aagtgtctct caatccaaca tcagagaaat tgaacaacct gttggaaggg 3000 aatgatagtg acaatgatct atcacttgaa gatttctgat cagttaccaa tctgcacatt 3060 aacacacaac accaacagaa gaccaacaaa caaaacaact cacctatcca accaaacatc 3120 tatctgccaa tcagccaacc agccaaaaaa acacccagcc aatacaaaat tagtcacccg 3180 gaaaaaatcg atactatagt tacaaaaaaa gatggggcaa atatggaaac atacgtgaac 3240 aaacttcacg aaggctccac atacacagct gctgttcaat acaatgtcct agaaaaagac 3300 gatgaccctg catcacttac aatatgggtg cccatgttcc aatcatccat gccagcagat 3360 ttacttataa aagaactagc taatgtcaac atactagtga aacaaatatc cacacccaaa 3420 ggaccttcat taagagtcat gataaactcg agaagtgcag tgctagcaca aatgcccagc 3480 aaattcacta tatgtgccaa tgtgtccttg gatgaaagaa gcaagctggc atatgatgta 3540 accacaccct gcgaaatcaa ggcatgtagt ctgacatgcc taaaatcaaa aaatatgtta 3600 actacagtta aagatctcac tatgaaaaca ctcaacccaa cacatgacat tattgcttta 3660 tgtgaatttg aaaatatagt aacatcaaaa aaagtcataa taccaacata cttaagatcc 3720 atcagtgtca gaaataaaga tctgaacaca cttgaaaata taacaaccac cgaattcaaa 3780 aatgccatca caaatgcaaa aatcatccct tactcaggat tactgttagt catcacagtg 3840 actgacaaca aaggagcatt caaatacata aagccacaaa gtcaattcat agtagatctt 3900 ggagcttacc tggaaaaaga aagtatatat tatgttacaa caaattggaa gcacacagct 3960 acacgatttg caatcaaacc catggaggat taaccttttt cctctacatc agttagttga 4020 ttcatacaca ctttccacct acattcctca cctcacaatc acaatcacca accctctgtg 4080 gtttaaccaa tcaaacaaaa cttatctgga gtctcagatc atcccaagtc attgttcatc 4140 agatctagta cccaaataag ttaataaaaa tacccacatg gggcaaataa tcatcggagg 4200 aaatccatcc aatcacaata tctgtcaaca tagaccagtc aacacgccaa acaaaataaa 4260 ccaatggaaa atacatccat aacaatagaa ttctcaagca aattctggcc ttactttaca 4320 ctaatacata tgatcacaac aataatctct ttgctaatca taatctccat catgattgca 4380 atactaaaca aactctgtga atataacgta ttccataaca aaacctttga gctaccaaga 4440 gctcgagtca atacatagca ttcaccaatc tgatggctca aaacagtaac cttgcatttg 4500 taagtgaaca atcttcacct ttttacaaaa tcacatcaac atctcaccat gcaagccatc 4560 atccatacta taaagtagtt aattaaaaat agtcataaca atgaactaag atattaagac 4620 taacaacaac gttggggcaa atgcaaacat gtccaaaacc aaggaccaac gcaccgccaa 4680 gacactagaa aagacctggg acactctcaa tcatctatta ttcatatcat cgtgcttata 4740 caagttaaat cttaaatcta tagcacaaat cacattatcc attctggcaa tgataatatc 4800 aacttcactt ataattgtag ctatcatatt catagcctca gcaaacaaca aagtcacact 4860 aacaactgca atcatacaag atgcaacaag ccagatcaag aacacaaccc caacatacct 4920 gacccagaat ccccagcttg gaatcagctt cttcaatctg tctggaacta tatcacaaac 4980 caccgccata ctagctccaa caacaccaag tgtcgagcca atcctgcaat ctacaacagt 5040 caagaccaaa aacacaacaa caacccaaat acaacccagc aagctcacca caaaacaacg 5100 ccaaaacaaa ccaccaaaca aacccaacga tgattttcac tttgaagtgt tcaactttgt 5160 accctgcagc atatgcagca acaatccaac ttgctgggcc atctgcaaaa gaataccaag 5220 caaaaaacct ggaaagaaaa ccaccaccaa gcccacgaaa aaacaaacca tcaagacaac 5280 caaaaaagat ctcaaacctc aaactacaaa accaaaggaa gcacctacca ccaagcccac 5340 agaaaagcca accatcaaca tcaccaaacc aaacatcaga actacactgc tcaccaacag 5400 taccacagga aatctagaac acacaagtca agaggagacc ctccattcaa cctcctccga 5460 aggcaataca agcccttcac aaatctatac aacatccgag tacctatcac aacctccatc 5520 tccatccaac ataacagacc agtagtcatt aaaaagcgta ttattgcaaa aaaccatgac 5580 caaatcaaac agaatnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5640 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngtctcact ctgtttagct tccagtcaaa 5700 acatcactga agaattttat caatcaacat gcagtgcagt tagcaaaggc tatcttagtg 5760 ctttaagaac tggttggtat actagtgtta taactataga attaagtaat atcaaggaaa 5820 ataagtgtaa tggaacagac gctaaggtaa aattgataaa acaagaatta gataaatata 5880 aaaatgctgt aacagaattg cagttgctca tgcaaagcac accagcagcc aacaatcgag 5940 ccagaagaga actaccaagg tttatgaatt atacactcaa caataccaaa aataacaatg 6000 taacattaag caagaaaagg aaaagaagat ttcttggctt tttgttaggt gttggatctg 6060 caatcgccag tggcattgct gtatctaaag tcctgcacct agaaggggaa gtgaacaaaa 6120 tcaaaagtgc tctactatcc acaaacaagg ctgtagtcag cttatcaaat ggagttagtg 6180 tcttaaccag caaagtgtta gacctcaaaa actatataga taaacagttg ttacccattg 6240 tgaacaagca aagctgcagc atatcaaaca ttgaaactgt gatagaattc caacaaaaga 6300 acaacagact actagagatt accagggaat ttagtgttaa tgcaggtgta actacacctg 6360 taagcactta tatgttaaca aatagtgaat tattatcatt aatcaatgat atgcctataa 6420 caaatgatca gaaaaagtta atgtccaaca atgttcaaat agttagacag caaagttact 6480 ctatcatgtc cataataaag gaggaagtct tagcatatgt agtacaatta ccactatatg 6540 gtgtaataga tacaccttgt tggaaactac acacatcccc tctatgcaca accaacacaa 6600 aggaagggtc caacatctgt ttaacaagaa ccgacagagg atggtactgt gacaatgcag 6660 gatcagtttc tttcttccca caagctgaaa catgcaaagt tcaatcgaat cgagtatttt 6720 gtgacacaat gaacagttta acattaccaa gtgaagtaaa tctctgcaac attgacatat 6780 tcaaccctaa atatgattgc aaaattatga cttcaaaaac agatgtaagc agctccgtta 6840 tcacatctct aggagccatt gtgtcatgct atggcaaaac taaatgtaca gcatccaata 6900 aaaatcgtgg aatcataaag acattttcta acgggtgtga ttatgtatca aataaggggg 6960 tggacactgt atctgtaggt aatacattat attatgtaaa taagcaagaa ggaaaaagtc 7020 tctatgtaaa aggtgaacca ataataaatt tctatgaccc attagtgttc ccttctgatg 7080 aatttgatgc atcaatatct caagtcaatg agaagattaa ccagagccta gcatttattc 7140 gtaaatccga tgaattatta cataatgtaa atgttggtaa atccaccaca aatatcatga 7200 taactactat aattatagtg attatagtaa tattgttatt attaattgca gttgggctgt 7260 tcctatactg caaggccaga agcacaccag tcacactaag caaggatcaa ctgagtggta 7320 taaataatat tgcatttagt aactgaataa gaatagtacc taatcatgtt cttacaatgg 7380 ttcatcatcc gaccatagat gacccatcta tcattggatt ttcttcaagt ctgaacttca 7440 tcgcaactct catctataaa ccatctcact tacactattt aagtagattc ctattttata 7500 gttatataaa actactgagt accagattaa ctcactattt gtaaaaatta gaaatggggc 7560 aaatatgtca cgaaggaatc cttgcaaatt tgaaattcga ggtcattgct tgaatggtaa 7620 gaggtgtcat tttagtcata attattttga atggccacct catgcactgc ttgtaagaca 7680 aaactttatg ttaaacagaa tacttaagtc tatggataaa agcatagata ctttatcaga 7740 aataagtgga gctgcagagt tggacagaac tgaagagtat gccctcggtg tagttggagt 7800 gctagagagt tatataggat caataaataa tataactaaa caatcagcat gtgttgccat 7860 gagcaaactc ctcactgaac tcaacagtga tgacatcaag aaactaagag acaatgaaga 7920 gccaaattca cctaagataa gagtgtacaa tactgtcata tcatatattg aaagcaacag 7980 gaaaaacaat aaacaaacta tccatctgtt aaaaagattg ccagcagacg tattgaagaa 8040 aaccatcaaa aacacattgg atatccacaa gagcataacc atcaacaacc caaaagaatc 8100 aactgttaat gatacaaacg accatgccaa aaataatgat actacctgac aaatatcctt 8160 gtagtataaa ttccatacta ataacaagta gttgtagagt tactatgtat aatcaaaaga 8220 acacactata tttcaatcaa aacaaccaac ataaccatac atactcacca aatcaaccat 8280 tcaatgaaat ccattggacc tctcaagact taattgatgc aattcaaaat tttctacaac 8340 atctaggtat tactgatgac atatatacaa tatatatatt agtgtcataa cactcaatac 8400 caatacttac cacatcatca aattattaac tcaaacaatt caaaccatgg gacaaaatgg 8460 atcccattat taatggaaat tctgctaatg tttatctaac cgatagttat ttaaaaggtg 8520 ttatttcttt ctcagaatgt aatgctttag gaagttacat attcaatggt ccttatctca 8580 aaaatgatta caccaactta attagtagac aaaatccatt aatagaacac ataaatctaa 8640 agaaattaaa tataacacag tctttaatat ctaagtatca taaaggtgaa ataaaaatag 8700 aagaacctac ttattttcag tcattactta tgacatacaa gagtatgacc tcgtcagaac 8760 agattgctac tactaattta cttaaaaaga taataagaag agctatagaa attagtgatg 8820 tcaaagtcta tgctatattg aataaactgg ggcttaaaga aaaagacaag attaaatcca 8880 acaatgaaca agatgaaaac aactcagtta ttacaaccat aatcaaagat gatatacttt 8940 tagctgttaa ggataatcaa tctcatctta aagcaggcaa aaatcactct acaaaacaaa 9000 aagatactat caaaacaaca ctcttgaaaa aattaatgtg ttcgatgcaa catcctccat 9060 catggttaat acattggttt aatttataca caaaattaaa caacatatta acacagtatc 9120 gatcaaatga ggtaaaaaac catggtttta tattgataga taatcatact ctcaatggat 9180 tccaatttat tttgaatcaa tatggttgta tagtttatca taaggatctc aaaagaatta 9240 ctgtgacaac ctataatcaa ttcttgacat ggaaagatat tagccttagt agattaaatg 9300 tttgtttaat tacatggatt agtaactgtt tgaacacatt aaacaaaagc ttaggcttaa 9360 gatgtggatt caataatgtt atcttgacac aactattcct ttatggagat tgtatattaa 9420 aactattcca caatgaaggg ttctacataa taaaagaggt agagggtttt attatgtctc 9480 taattttaaa tataacagaa gaagatcaat tcagaaaacg gttttataat agtatgctca 9540 acaacatcac agatgctgct aataaagctc agaaaaatct gctatcaaga gtatgtcata 9600 cattattaga taagacagta tccgataata taataaatgg cagatggata attctattaa 9660 gtaagtttct taaattaatt aaacttgcag gtgacaataa ccttaacaat ctgagtgaat 9720 tatatttttt attcagaata tttggacacc caatggtaga tgaaagacaa gccatggatg 9780 ctgttaaagt taattgcaac gagaccaaat tttacttgtt aagcagtttg agtatgttaa 9840 gaggtgcctt tatatataga attataaaag ggtttgtaaa taattacaac agatggccta 9900 ctttaaggaa tgctattgtt ttacccttaa gatggttaac ttactataaa ctaaacactt 9960 atccttcctt attggaactt acagaaagag atttgattgt tttatcagga ctacgtttct 10020 atcgtgagtt tcggttgcct aaaaaagtgg atcttgaaat gatcataaat gataaggcta 10080 tatcacctcc taaaaatttg atatggacta gtttccctag aaattatatg ccgtcacaca 10140 tacaaaatta tatagaacat gaaaaattaa aattttccga gagtgataaa tcaagaagag 10200 tattagagta ctatttaaga gataacaaat tcaatgaatg tgatttatat aactgtgtag 10260 ttaatcaaag ctatcttaac aaccctaatc atgtggtatc attgactggc aaagaaagag 10320 aactcagtgt aggtagaatg tttgcaatgc aaccaggaat gttcagacaa gttcaaatat 10380 tagcagagaa aatgatagct gaaaacattt tacaattctt tcctgaaagt cttacaagat 10440 atggtgatct agaattacag aaaatattag aattgaaagc gggaataagt aacaaatcaa 10500 atcgttacaa tgacaattac aacaattaca tcagtaagtg ctctatcatc acagatctca 10560 gcaaattcaa tcaagcattc cggtatgaaa catcatgtat ttgtagtgat gtactggatg 10620 aactgcatgg tgtacaatct ctattttcct ggttacattt aactattcct catgtcacaa 10680 taatatgcac atataggcat gcacccccct atataagaga tcacattgta gatcttaaca 10740 atgtagatga acaaagtgga ttatatagat atcatatggg tggtatcgaa gggtggtgtc 10800 aaaaactatg gaccatagaa gctatatcac tattggatct aatatctctc aaagggaaat 10860 tctcaattac tgccttaatt aatggtgaca atcaatcaat agatataagc aaaccagtca 10920 gactcatgga aggtcaaact catgctcaag cagattattt gctagcatta aatagtctta 10980 aattgctgta taaagagtat gcaggcatag gccacaaatt aaaaggaact gagacttata 11040 tatcaagaga tatgcaattt atgagtaaaa caattcaaca taacggtgta tattacccag 11100 ctagtataaa gaaagtccta agagtgggac catggataaa cactatactt gatgatttca 11160 aagtgagtct agaatctata ggtagtttga cacaagaatt agaatataga ggtgaaagtc 11220 tattatgcag tttaatattt agaaatgtgt ggttatataa tcaaattgct ttacaactaa 11280 aaaatcatgc attatgtaac aataaattat atttggacat attaaaggtt ctgaaacact 11340 taaaaacctt ttttaatctt gataatattg atacagcatt aacattgtat atgaatttgc 11400 ccatgttatt tggtggtggt gatcccaact tgttatatcg aagtttctat agaagaactc 11460 ctgatttcct cacagaggct atagttcact ctgtgttcat acttagttat tatacaaacc 11520 atgatttaaa ggataaactt caagatctgt cagacgatag attgaataag ttcttaacat 11580 gcataatcac gtttgacaaa aaccctaatg ctgaattcgt aacattgatg agagatcctc 11640 aagctttagg gtctgagagg caagctaaaa ttactagcga aatcaataga ctggcagtta 11700 ctgaggtttt gagcacagct ccaaacaaaa tattctccaa aagtgcacaa cactatacca 11760 ctacagagat agatctaaat gatattatgc aaaatataga acctacatat cctcatgggc 11820 taagagttgt ttatgaaagt ttaccctttt ataaagcaga gaaaatagta aatcttatat 11880 ccggtacaaa atctataact aacatactgg aaaagacttc tgccatagac ttaacagata 11940 ttgatagagc cactgagatg atgaggaaaa acataacttt gcttataagg atatttccat 12000 tagattgtaa cagagacaaa agagaaatat tgagtatgga aaacctaagt attactgaat 12060 taagcaaata tgttagagaa agatcttggt ctttatccaa tatagttggt gttacatcac 12120 ccagtatcat gtatacaatg gacatcaaat atacaacaag cactatagct agtggcataa 12180 tcatagagaa atataatgtc aacagtttaa cacgtggtga gagaggaccc actaaaccat 12240 gggttggttc atctacacaa gagaaaaaaa caatgccagt ttataataga caagttttaa 12300 ccaaaaaaca gagagatcaa attgatctat tagcaaaatt ggattgggtg tatgcatcta 12360 tagataacaa ggatgaattc atggaagaac tcagcatagg aactcttggg ttaacatatg 12420 agaaagccaa aaaattattt ccacaatatt taagtgttaa ctatttgcat cgccttacag 12480 tcagtagtag accatgtgaa ttccctgcat caataccagc ttatagaact acaaattatc 12540 actttgatac tagccctatt aatcgcatat taacagaaaa gtatggtgat gaagatattg 12600 atatagtatt ccaaaactgt ataagttttg gccttagctt aatgtcagta gtagagcaat 12660 ttaccaatgt atgtcctaac agaattattc tcatacccaa gcttaatgag atacatttga 12720 tgaaacctcc catattcaca ggtgatgttg atattcacaa gttaaaacaa gtgatccaaa 12780 aacagcatat gtttttacca gacaaaataa gtttgactca atatgtggaa ttattcttaa 12840 gtaataaaac actcaaatct ggatctcatg ttaattctaa tttaatattg gcacataaga 12900 tatctgacta ttttcataat acttacattt taagtactaa tttagctgga cattggattc 12960 tgattataca acttatgaaa gattctaaag gtatttttga aaaagattgg ggagagggat 13020 atataactga tcatatgttc attaatttga aagttttctt caatgcttat aagacctatc 13080 tcttgtgttt tcataaaggt tacggcagag caaagctgga gtgtgatatg aatacttcag 13140 atctcctatg tgtattggaa ttaatagaca gtagttattg gaagtctatg tctaaggtat 13200 ttttagaaca aaaagttatc aaatacattc tcagccagga tgcaagttta catagagtaa 13260 aaggatgtca tagcttcaaa ctatggtttc ttaaacgtct taatgtagca gaattcacag 13320 tttgcccttg ggttgttaac atagattatc atccaacaca catgaaagca atattaactt 13380 atatagatct tgttagaatg ggattgataa atatagatag aatatacatt aaaaataaac 13440 acaaattcaa tgatgaattt tatacttcta atctctttta cattaattat aacttctcag 13500 ataatactca tctattaact aaacatataa ggattgctaa ttctgaatta gaaaataatt 13560 acaacaaatt atatcatcct acacctgaaa ctctagaaaa tatactaacc aatccggtta 13620 aatgtgatga caaaaagaca ctgaatgact attgtatagg taaaaatgtt gactcaataa 13680 tgttaccatt gttatctaat aagaagctta ttaaatcgtc tacaacgatt agaaccaatt 13740 acagcaaaca agatttgtat aatttatttc ctacggttgt gattgataaa attatagatc 13800 attcaggtaa tacagccaaa tctaaccaac tttacactac tacttctcat caaatatctt 13860 tagtacacaa tagcacatca ctttattgca tgcttccttg gcatcatatt aatagattca 13920 attttgtgtt tagttctaca ggttgtaaaa ttagtataga gtatatttta aaagacctta 13980 aaattaaaga tcctagttgt atagcattca taggtgaagg agcagggaat ttattgttgc 14040 gtacagtagt ggaacttcat cctgatataa gatatattta cagaagtctg aaagattgca 14100 atgatcatag tttacctatt gagtttttaa ggctatacaa tggacatatt aacattgatt 14160 atggtgaaaa tttgaccatt cccgctacag atgcaaccaa caacattcat tggtcttatt 14220 tgcatataaa gtttgctgaa cctatcagtc tttttgtttg tgatgctgaa ttgcctgtaa 14280 cagtcaactg gagtaaaatt ataatagagt ggagcaagca tgtaagaaaa tgcaagtact 14340 gttcctcagt taataaatgt acgttaatag taaaatacca tgctcaagat gatatcgatt 14400 tcaaattaga caacataact atattaaaaa cttatgtatg cttgggcagt aagttaaagg 14460 ggtctgaagt ttacttagtc cttacaatag gtcctgcaaa tgtgttccca gtatttaatg 14520 tagtacaaaa tgctaaattg atactatcaa gaaccaaaaa tttcatcatg cctaagaagg 14580 ctgataaaga gtctattgat gcaaatatta aaagtttgat accctttctt tgttacccta 14640 taacaaaaaa aggaattaat actgcattat caaaactaaa gagtgttgtt agtggagata 14700 tactatcata ttctatagct ggacgtaatg aagttttcag caataaactt ataaatcata 14760 agcatatgaa catcttaaag tggttcaacc atgttttaaa tttcagatca acagaactta 14820 actataatca tttatatatg gtagaatcca catatcctta tctaagtgaa ttgttaaaca 14880 gcttgacaac taatgaactt aaaaaactga ttaaaatcac aggtagtttg ttatacaact 14940 ttcataatga ataatgaata aaaatcttat attaaaaaat tcccacagct acacactaac 15000 actgtattca attatagtta tttaaaatta aaaattatat aatttttaat aacttttagt 15060 ggactaatcc taaaattatc attttgatct aggaggaata aatttaaatc caaatctaat 15120 tggtttatat gtatattaac taaactacct gtgattttaa tcagtttttt aagttcatta 15180 gttgtcaagc tgtttaacaa ttcacttaga taagggatat ttggttccta ca 15232 // ID MG027860; SV 1; linear; viral cRNA; STD; VRL; 15334 BP. XX AC MG027860; XX PR Project:PRJNA227457; XX DT 03-JAN-2018 (Rel. 135, Created) DT 03-JAN-2018 (Rel. 135, Last updated, Version 6) XX DE UNVERIFIED: Respiratory syncytial virus type A isolate DE RSV-A/US/BID-V8534/2003, partial genome. XX KW UNVERIFIED. XX OS Respiratory syncytial virus type A OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Pneumoviridae; Orthopneumovirus. XX RN [1] RP 1-15334 RA Newman R.M., Zody M.C., DeVincenzo J.P., Grad Y., Lipsitch M., Murphy R., RA Fitzgerald M., Young S., Gargeya S., Poon T.W., Charlebois P., Weiner B., RA Yang X., Piper M.E., McCowan C., Ireland A., Levin J., Malboeuf C., Qu J., RA Chapman S.B., Murphy C., Wortman J., Nusbaum C., Birren B.; RT "Comparative Genomics of Respiratory Syncytial Virus for Broad Institute RT Viral Genomics Initiative"; RL Unpublished. XX RN [2] RP 1-15334 RA Newman R.M., Zody M.C., DeVincenzo J.P., Grad Y., Lipsitch M., Murphy R., RA Fitzgerald M., Young S., Gargeya S., Poon T.W., Charlebois P., Weiner B., RA Yang X., Piper M.E., McCowan C., Ireland A., Levin J., Malboeuf C., Qu J., RA Chapman S.B., Murphy C., Wortman J., Nusbaum C., Birren B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Infectious Disease Initiative, Broad Institute, 75 Ames Street, Cambridge, RL MA 02142, USA XX DR MD5; ed8c23fb02b0c572bd45c1ae95acf04e. DR BioSample; SAMN06677517. XX CC GenBank staff is unable to verify sequence and/or annotation CC provided by the submitter. CC ##Assembly-Data-START## CC Assembly Method :: Vicuna v. 1 CC Assembly Name :: V8534-1 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..15334 FT /organism="Respiratory syncytial virus type A" FT /host="Homo sapiens" FT /isolate="RSV-A/US/BID-V8534/2003" FT /mol_type="viral cRNA" FT /country="USA" FT /collection_date="2003" FT /db_xref="taxon:1439707" FT 5'UTR 1..65 FT /note="indels in UTR have not been validated" FT gene 66..485 FT /gene="NS1" FT CDS 66..485 FT /codon_start=1 FT /gene="NS1" FT /product="non-structural protein 1" FT /db_xref="InterPro:IPR005099" FT /db_xref="PDB:5VJ2" FT /db_xref="UniProtKB/TrEMBL:X5FN71" FT /protein_id="AUH26164.1" FT /translation="MGSNSLSMIKVRLQNLFDNDEVALLKITCYTDKLIHLTNALAKAV FT IHTIKLNGIVFVHVITSSDICPNNNIVVKSNFTTMPVLQNGGYIWEMMELTHCSQPNGL FT IDDNCEIKFSKKLSDSTMTNYMNQLSELLGFDLNP" FT gene 595..969 FT /gene="NS2" FT CDS 595..969 FT /codon_start=1 FT /gene="NS2" FT /product="non-structural protein 2" FT /db_xref="InterPro:IPR004336" FT /db_xref="UniProtKB/TrEMBL:X5FP42" FT /protein_id="AUH26165.1" FT /translation="MDTTHNGTTPQRLMITDMRPLSLETIITSLTRDIITHRFIYLINH FT ECIVRKLDERQATFTFLVNYEMKLLHKVGSTKYKKYTEYNTKYGTFPMPIFINHDGFLE FT CIGIKPTKHTPIIYKYDLNP" FT gene 1107..2282 FT /gene="N" FT CDS 1107..2282 FT /codon_start=1 FT /gene="N" FT /product="nucleoprotein" FT /db_xref="GOA:A0A2H5CP39" FT /db_xref="InterPro:IPR004930" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP39" FT /protein_id="AUH26166.1" FT /translation="MALSKVKLNDTLNKDQLLSSSKYTIQRSIRDXXXXXXXXVQKHIN FT KLCGMLLITEDANHKFTGVIGMLYAMSRLGREDTIKILRDAGYHVKANGVDVTTHRQDI FT NGKEMKFEVLTLASLTTEIQINIEIESRKSYKKMLKEMGEVAPEYRHDSPDCGMIILCI FT AALVITKLAAGDRSGLTAVIRRANNVLKNEMKRYKGLLPKDIANSFYEVFEKYPHFIDV FT FVHFGIAQSSTRGGSRVEGIFAGLFMNAYGAGQVMLRWGVLAKSVKNIMLGHASVQAEM FT EQVVEVYEYAQKLGGEAGFYHILNNPKASLLSLTQFPHFSSVVLGNAAGLGIMGEYRGT FT PRNQDLYDAAKAYAEQLKENGVINYSVLDLTAEELEAIKHQLNPKDNDVEL" FT gap 1202..1223 FT /estimated_length=22 FT gene 2313..3038 FT /gene="P" FT CDS 2313..3038 FT /codon_start=1 FT /gene="P" FT /product="phosphoprotein" FT /db_xref="GOA:X5F0K8" FT /db_xref="InterPro:IPR003487" FT /db_xref="UniProtKB/TrEMBL:X5F0K8" FT /protein_id="AUH26167.1" FT /translation="MEKFAPEFHGEDANNRATKFLESIKGKFTSPKDPKKKDSIISVNS FT IDIEVTKESPITSNSTIINSTNETDDTAGNKPNYQRKPLVSFKEDPTPSDNPFSKLYKE FT TIETFDNNEEESSYSYEEINDQTNDNITARLDRIDEKLSEILGMLHTLVVASAGPTSAR FT DGIRDAMVGLREEMIEKIRTEALMTNDRLEAMARLRNEESEKMAKDTSDEVSLNPTSEK FT LNNLLEGNDSDNDLSLEDF" FT gene 3222..3992 FT /gene="M" FT CDS 3222..3992 FT /codon_start=1 FT /gene="M" FT /product="matrix protein" FT /db_xref="GOA:X5F6K9" FT /db_xref="InterPro:IPR005056" FT /db_xref="UniProtKB/TrEMBL:X5F6K9" FT /protein_id="AUH26168.1" FT /translation="METYVNKLHEGSTYTAAVQYNVLEKDDDPASLTIWVPMFQSSMPA FT DLLIKELANVNILVKQISTPKGPSLRVMINSRSAVLAQMPSKFTICANVSLDERSKLAY FT DVTTPCEIKACSLTCLKSKNMLTTVKDLTMKTLNPTHDIIALCEFENIVTSKKVIIPTY FT LRSISVRNKDLNTLENITTTEFKNAITNAKIIPYSGLLLVITVTDNKGAFKYIKPQSQF FT IVDLGAYLEKESIYYVTTNWKHTATRFAIKPMED" FT gene 4263..4457 FT /gene="SH" FT CDS 4263..4457 FT /codon_start=1 FT /gene="SH" FT /product="small hydrophobic protein" FT /db_xref="GOA:X5FNF2" FT /db_xref="InterPro:IPR005327" FT /db_xref="UniProtKB/TrEMBL:X5FNF2" FT /protein_id="AUH26169.1" FT /translation="MENTSITIEFSSKFWPYFTLIHMITTIISLLIIISIMIAILNKLC FT EYNLFHNKTFELPRARVNT" FT gene 4648..5544 FT /gene="G" FT CDS 4648..5544 FT /codon_start=1 FT /gene="G" FT /product="attachment protein" FT /db_xref="GOA:X5F5V2" FT /db_xref="InterPro:IPR000925" FT /db_xref="UniProtKB/TrEMBL:X5F5V2" FT /protein_id="AUH26170.1" FT /translation="MSKTKDQRTAKTLERTWDTLNHLLFISSCLYKLNLKSIAQITLSI FT LAMIISTSLIIVAIIFIASANNKVTLTTAIIQDATSQIKNTTPTYLTQNPQLGISFFNL FT SGTISQTTAILALTTPSVESILQSTTVKTKNTTTTQIQPSKPTTKQRQNKPPNKPNDDF FT HFEVFNFVPCSICSNNPTCWAICKRIPSKKPGKKTTTKPTKKPTIKITKKDLKPQTTKP FT KEAPTTKPTDKPTINITKLNIRTTLLTNSTTGNLEHTSQEETLHSTSSEGNTSPSQIYT FT TSEYLSQPPSPSNITDQ" FT gene 5621..7345 FT /gene="F" FT CDS 5621..7345 FT /codon_start=1 FT /gene="F" FT /product="fusion protein" FT /db_xref="GOA:X5F7V2" FT /db_xref="InterPro:IPR000776" FT /db_xref="UniProtKB/TrEMBL:X5F7V2" FT /protein_id="AUH26171.1" FT /translation="MELPILKTNAITTILAAVTLCFASSQNITEEFYQSTCSAVSKGYL FT SALRTGWYTSVITIELSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQSTPASN FT NRARRELPRFMNYTLNNTKNNNVTLSKKRKRRFLGFLLGVGSAIASGIAVSKVLHLEGE FT VNKIKSALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDKQLLPIVNKQSCSISNIETVIE FT FQQKNNRLLEITREFSVNAGVTTPVSTYMLTNSELLSLINDMPITNDQKKLMSNNVQIV FT RQQSYSIMSIIKEEVLAYVVQLPLYGVIDTPCWKLHTSPLCTTNTKEGSNICLTRTDRG FT WYCDNAGSVSFFPQAETCKVQSNRVFCDTMNSLTLPSEVNLCNIDIFNPKYDCKIMTSK FT TDVSSSVITSLGAIVSCYGKTKCTASNKNRGIIKTFSNGCDYVSNKGVDTVSVGNTLYY FT VNKQEGKSLYVKGEPIINFYDPLVFPSDEFDASISQVNEKINQSLAFIRKSDELLHNVN FT VGKSTTNIMITTIIIVIIVILLLLIAVGLFLYCKARSTPVTLSKDQLSGINNIAFSN" FT gene 7564..8148 FT /gene="M2" FT CDS 7564..8148 FT /codon_start=1 FT /gene="M2" FT /product="matrix protein 2-1" FT /db_xref="GOA:X5F060" FT /db_xref="InterPro:IPR000571" FT /db_xref="InterPro:IPR009452" FT /db_xref="InterPro:IPR036855" FT /db_xref="UniProtKB/TrEMBL:X5F060" FT /protein_id="AUH26172.1" FT /translation="MSRRNPCKFEIRGHCLNGKRCHFSHNYFEWPPHALLVRQNFMLNR FT ILKSMDKSIDTLSEISGAAELDRTEEYALGVVGVLESYIGSINNITKQSACVAMSKLLT FT ELNSDDIKKLRDNEEPNSPKIRVYNTVISYIESNRKNNKQTIHLLKRLPADVLKKTIKN FT TLDIHKSITINNPKESTVNDTNDHAKNNDTT" FT gene 8456..14947 FT /gene="L" FT misc_feature 8456..14947 FT /gene="L" FT /note="similar to L polymerase" FT 3'UTR 14948..15334 FT /note="indels in UTR have not been validated" XX SQ Sequence 15334 BP; 5932 A; 2688 C; 2393 G; 4299 T; 22 other; atattattaa tggggcaaat aagaatttga taagtaccac ttaaaattaa ctcctttggt 60 ttgagatggg cagcaattca ttgagtatga taaaagttag attacaaaat ttatttgaca 120 atgatgaagt agcattgtta aaaataacct gctatactga caaattgata catttaacta 180 atgctttggc taaggcagtg atacatacaa tcaaattgaa tggcattgta tttgtgcatg 240 ttattacaag tagtgatatt tgccctaata ataatattgt agtgaaatcc aacttcacaa 300 caatgccagt gttacaaaat ggaggttata tatgggaaat gatggaatta acacactgct 360 ctcaacccaa tggcctaata gatgacaatt gtgaaatcaa attctccaaa aaactaagcg 420 attcaacaat gaccaactat atgaatcaat tatctgaatt acttggattt gatctcaatc 480 cataaattat aataaatatc aactagcaaa tcaatgtcac taacaccatt agttaatata 540 aaacttgaca gaagataaaa atggggcaaa taaataaact cagctgaccc aaccatggac 600 acaacacaca atggtactac accacaaaga ctgatgatca cggacatgag accattgtca 660 cttgagacta taataacatc actaaccaga gacatcataa cacacagatt tatatacttg 720 ataaatcatg aatgtatagt gagaaaactt gatgaaagac aggccacatt tacattcctg 780 gtcaactatg aaatgaaact attgcacaaa gtgggaagca ctaaatacaa aaaatatact 840 gaatacaaca caaaatatgg cacttttcct atgccaatat ttatcaatca tgatgggttc 900 ttagaatgca ttggcattaa gcctacaaag cacactccca taatatacaa gtatgatctc 960 aatccatgaa tttcaacaca agattcacac aatctgaaac aacaacctca tgcataacta 1020 cactccatag tccaaatgga gcctgaaaat tatagtaatt taaaattaag gagagacata 1080 agatgaaaga tggggcaaat acaaaaatgg ctcttagcaa agtcaagttg aacgatacac 1140 tcaacaaaga tcaacttctg tcatccagca aatacaccat ccaacggagt ataagagaca 1200 gnnnnnnnnn nnnnnnnnnn nnngtgcaga aacacatcaa caagttatgt ggcatgttat 1260 taatcacaga agatgctaat cataaattca ctggggtaat aggtatgtta tatgctatgt 1320 ctagattagg aagagaagac accataaaaa tactcagaga tgcaggatat catgtaaaag 1380 caaatggagt ggatgtaaca acacatcgtc aagacattaa tggaaaagaa atgaaatttg 1440 aagtgttaac attagcaagc ttaacaactg aaattcaaat caacattgag atagaatcta 1500 ggaaatccta caaaaaaatg ctaaaagaaa tgggagaggt ggctccagaa tacaggcatg 1560 actcacctga ttgtggaatg ataatattat gtatagcagc attagtaata accaaattag 1620 cagcagggga tagatctggt cttacagctg tgattaggag agctaataat gttctaaaaa 1680 atgaaatgaa acgttataaa ggcttactac caaaggatat agccaacagc ttctatgaag 1740 tgtttgaaaa atatcctcac tttatagatg tttttgttca ttttggtata gcacaatctt 1800 ctaccagagg aggcagtaga gttgaaggga tttttgcagg attgtttatg aatgcctatg 1860 gtgcagggca agtgatgtta cggtggggag tcttagcaaa atcagttaaa aatattatgc 1920 taggacacgc tagtgtgcaa gcagaaatgg aacaagttgt ggaagtttat gaatatgccc 1980 aaaaattggg tggagaagca ggattctacc atatattgaa caacccaaaa gcatcattat 2040 tatctttgac tcaatttccc cacttctcca gtgtagtatt gggcaatgct gctggcctag 2100 gcataatggg agaatacaga ggtacaccaa ggaatcaaga tctatatgat gctgcaaagg 2160 catatgctga acaactcaaa gaaaatggtg tgattaacta cagtgtatta gacttgacag 2220 cagaagaact agaggctatc aaacatcagc ttaatccaaa agataatgat gtagagcttt 2280 gagttaataa aaaatggggc aaataaatca tcatggaaaa gtttgctcct gaattccatg 2340 gagaagacgc aaacaacaga gccactaaat tcctagaatc aataaagggc aaattcacat 2400 cacctaaaga tcctaagaaa aaagatagta tcatatctgt caactcaata gatatagaag 2460 taaccaaaga aagccctata acttcaaatt caaccattat aaactctaca aatgagacag 2520 atgatactgc agggaacaag cccaattatc aaagaaaacc tctagtgagt ttcaaagaag 2580 accctacgcc aagtgataat cctttttcaa aactatacaa agaaaccata gaaacatttg 2640 ataacaatga agaagaatct agctattcat atgaagaaat aaatgatcag acaaacgata 2700 atataacagc aagattagat aggattgatg aaaaattaag tgaaatacta ggaatgcttc 2760 acacactagt ggtagcaagt gcaggaccta catctgctcg ggatggtata agagatgcca 2820 tggttggttt aagagaagaa atgatagaaa aaatcagaac tgaagcatta atgaccaatg 2880 atagattaga agctatggca agactcagga atgaggaaag tgaaaagatg gcaaaagaca 2940 catcagatga agtatctctc aatccaacat cagagaaatt gaacaacctg ttggaaggga 3000 atgatagtga caatgatcta tcacttgaag atttctgatc agttaccaat ctgcacatca 3060 acacacaaca ccaacagaag accaacaaac aaaacaactc acctatccaa ccaaacatct 3120 atctgccaat cagccaacca gccaaaaaaa cacccagcca atccaaaatt agtcatccgg 3180 aaaaaatcga tactatagtt acaaaaaaag atggggcaaa tatggaaaca tacgtgaaca 3240 aacttcacga aggctccaca tacacagctg ctgttcaata caatgtccta gaaaaagacg 3300 atgaccctgc atcacttaca atatgggtgc ccatgttcca atcatccatg ccagcagatt 3360 tacttataaa agaactagct aatgtcaaca tactagtgaa acaaatatcc acacccaaag 3420 gaccttcatt aagagtcatg ataaactcaa gaagtgcagt gctagcacaa atgcccagca 3480 aattcaccat atgtgccaat gtgtccttgg atgaaagaag caagctggca tatgatgtaa 3540 ccacaccctg cgaaatcaaa gcatgtagtc taacatgcct aaaatcaaaa aatatgttaa 3600 ctacagttaa agatctcact atgaaaacac tcaacccaac acatgacatc attgctttat 3660 gtgaatttga aaatatagta acatcaaaaa aagtcataat accaacatac ttaagatcca 3720 tcagtgtcag aaataaagat ctgaacacac ttgaaaatat aacaaccact gaattcaaaa 3780 atgccatcac aaatgcaaaa atcatccctt actcaggatt actgttagtc atcacagtga 3840 ctgacaacaa aggagcattc aaatacataa agccacaaag tcaattcata gtagatcttg 3900 gagcttacct agaaaaagaa agtatatatt atgttacaac aaattggaag cacacagcta 3960 cacgatttgc aatcaaaccc atggaagatt aacctttttc ctctacatca gttagttgat 4020 ctatacacac tttctaccta cattcttcac ttcacgatca caatcaccaa ccctctgtgg 4080 cttaaccaat caaacaaaac ttatctggag tctcggatca tcccaagtca ttgttcatca 4140 gatctagtac tcaaataagt taataaaaat acccacatgg ggcaaataat catcggagga 4200 aatccaacca atcacaatat ctgttaacat agaccagtca actcgccaaa caaaataaac 4260 caatggaaaa tacatccata acaatagaat tctcaagcaa attctggcct tactttacac 4320 taatacatat gataacaaca ataatctctt tgctaatcat aatctccatc atgattgcaa 4380 tactaaacaa actctgtgaa tataacttat tccacaacaa aacctttgag ctaccaagag 4440 ctcgagtcaa tacatagcat tcaccaatct gatggctcaa aacagtaacc ttgcatttgt 4500 aagtgaacta ttttcacctt tttacaaaat cacatcaaca tctcaccatg caagccatta 4560 tccatactat aaagtagtta attaaaaata gtcataacaa tgaactaaga tattaagact 4620 aacaacaacg ttggggcaaa tgcaaacatg tccaaaacca aggaccaacg caccgccaag 4680 acactagaaa ggacctggga cactctcaat catctattat tcatatcatc atgcttatac 4740 aagttaaatc ttaaatctat agcacaaatc acattatcca ttctggcaat gataatctca 4800 acttcactta taattgtagc tatcatattc atagcctcag caaacaacaa agtcacacta 4860 acaactgcaa tcatacaaga tgcaacaagc cagatcaaga acacaacccc aacatacctg 4920 acccagaatc cccagcttgg aatcagcttc ttcaatctgt ctggaactat atcacaaacc 4980 accgccatac tagctttaac aacaccaagt gtcgagtcaa tcctgcaatc tacaacagtc 5040 aagaccaaaa acacaacaac aacccaaata caacccagca agcccaccac aaaacaacgc 5100 caaaacaaac caccaaacaa acccaatgat gattttcact ttgaagtgtt caactttgta 5160 ccctgcagca tatgcagcaa caatccaact tgctgggcca tctgcaaaag aataccaagc 5220 aaaaaacctg gaaagaaaac caccaccaag cccacgaaaa aaccaaccat caagataacc 5280 aaaaaagatc tcaaacctca aaccacaaaa ccaaaggaag cacctaccac caagcccaca 5340 gataagccaa ccatcaacat caccaaacta aacatcagaa ctacactgct caccaacagt 5400 accacaggaa atctagaaca cacaagtcaa gaggaaaccc tccattcaac ctcctccgaa 5460 ggcaatacaa gcccttcaca aatctataca acatccgagt acctatcaca acctccatct 5520 ccatccaaca taacagacca gtagtcatta aaaagcgtat tattgcaaaa aaccatgacc 5580 aaatcaaaca gaatcaaaat aagctctggg gcaaataaca atggagttgc caatcctcaa 5640 aacaaatgca attaccacaa tccttgctgc agtcacactc tgtttcgctt ccagtcaaaa 5700 catcactgaa gaattttatc aatcaacatg cagtgcagtt agcaaaggct atcttagtgc 5760 tttaagaact ggttggtata ctagtgttat aactatagaa ttaagtaata tcaaagaaaa 5820 taagtgtaat ggaacagatg ctaaggtaaa attgataaaa caagaattag ataaatataa 5880 aaatgctgta acagaattgc agttgctcat gcaaagcaca ccagcatcca acaatcgagc 5940 cagaagagaa ctaccaaggt ttatgaatta tacactcaac aataccaaaa ataacaatgt 6000 aacattaagc aaaaaaagga aaagaagatt tcttggcttt ttgttaggtg ttggatctgc 6060 aatcgccagt ggcattgctg tgtctaaagt cctgcaccta gaaggggaag tgaacaaaat 6120 caaaagtgct ctactatcca caaacaaggc tgtagtcagc ttatcaaatg gagttagtgt 6180 cttaaccagc aaagtgttag acctcaaaaa ctatatagat aaacagttgt tacccattgt 6240 gaacaagcaa agctgcagca tatcaaacat tgaaactgtg atagaattcc aacaaaagaa 6300 caacaggcta ctagagatta ccagggaatt tagtgttaat gcaggtgtaa ctacacctgt 6360 aagcacttat atgttaacaa acagtgaatt attatcatta atcaatgata tgcctataac 6420 aaatgatcag aaaaagttaa tgtccaacaa tgttcaaata gttagacagc agagttactc 6480 tatcatgtcc ataataaagg aggaagtctt agcatatgtc gtacaattac cactatatgg 6540 tgtaatagat acaccttgtt ggaaactaca cacatcccct ctatgcacaa ccaacacaaa 6600 ggaagggtcc aacatctgtt taacaagaac cgacagagga tggtactgtg acaatgcagg 6660 atcagtgtct ttcttcccac aagctgaaac atgcaaagtt caatcgaatc gagtattttg 6720 tgacacaatg aacagtttaa cattaccaag tgaagtaaat ctctgcaaca ttgacatatt 6780 caaccctaaa tatgattgca aaattatgac ttcaaaaaca gatgtgagca gctccgttat 6840 cacatctcta ggagccattg tgtcatgcta tggcaaaact aaatgtacag catccaataa 6900 aaatcgtgga atcataaaga cattttctaa cgggtgtgat tatgtatcaa ataagggagt 6960 ggacactgta tctgtaggta atacattata ttatgtaaat aagcaagaag gaaaaagtct 7020 ctatgtaaaa ggtgaaccaa taataaattt ctatgaccca ttagtgttcc cttctgatga 7080 atttgatgca tcaatatctc aagtcaatga gaagattaac cagagtctag catttattcg 7140 taaatccgat gaattattac ataatgtaaa tgttggtaaa tccaccacaa atatcatgat 7200 aactactata attatagtga ttatagtaat attgttatta ttaattgcag ttgggctgtt 7260 cctatactgc aaggccagaa gcacaccagt cacactaagc aaggatcaac tgagtggtat 7320 aaataatatt gcatttagta actgaataaa aatagtacct aatcatgttc ttacaatggt 7380 tcactatccg accatagacg acccatctat cattggattt tcttaaagtc tgaacttcat 7440 cgcaactctc atctataaac catctcactt acactattta agtagattcc tattttatag 7500 ttatataaaa ctactgagtg ccagattaac tcactatttg taaaaattag aaatggggca 7560 aatatgtcac gaaggaatcc ttgcaaattt gaaattcgag gtcattgctt gaatggtaag 7620 aggtgtcatt ttagtcataa ttattttgaa tggccacccc atgcactgct tgtaagacaa 7680 aactttatgt taaacagaat acttaagtct atggataaaa gcatagatac tttatcagaa 7740 ataagtggag ctgcagagtt ggacagaact gaagagtatg ccctcggtgt agttggagtg 7800 ctagagagtt atataggatc aataaataat ataactaaac aatcagcatg tgttgccatg 7860 agcaaactcc tcactgaact caacagtgat gacatcaaaa aactaagaga taatgaagag 7920 ccaaattcac ctaagataag agtgtacaat actgtcatat catatattga aagcaacagg 7980 aaaaacaata aacaaactat ccatctgtta aaaagattgc cagcagacgt attgaagaaa 8040 accatcaaaa acacattgga tatccacaag agcataacca tcaacaaccc aaaagaatca 8100 actgttaatg atacaaacga ccatgccaaa aataatgata ctacctgaca aatatccttg 8160 tagtataaat tccatactaa taacaagtag ttgtagagtt actatgtata atcaaaagaa 8220 cacactatat ttcaatcaaa acaaccaaaa taaccataca tactcaccaa atcaaccatt 8280 caatgaaatc cattggacct ctcaagactt gattgatgca attcaaaatt ttctacaaca 8340 tctaggtatt actgatgata tatatacaat atatatatta gtgtcataac actcaatacc 8400 aatactcacc acatcatcaa actatcaact caaacaattc aaaccatggg acaaaatgga 8460 tcccattatt aatggaaatt ctgctaatgt ttatctaacc gatagttatt taaaaggtgt 8520 tatttctttc tcagaatgta atgctttagg aagctacata ttcaatggtc cttatctcaa 8580 aaatgattac accaacttaa ttagtagaca aaatccatta atagaacaca taaatctaaa 8640 gaaattaaat ataacacagt ctttaatatc taagtatcat aaaggtgaaa taaaaataga 8700 agaacctact tattttcagt cattacttat gacatacaag agtatgacct cgtcagaaca 8760 gattactacc actaatttac ttaaaaagat aataagaaga gcaatagaaa ttagtgatgt 8820 caaagtctat gctatattga ataaactggg gcttaaagaa aaagacaaga ttaaatccaa 8880 caatggacaa gatgaaaaca actcagttat tacaaccata atcaaagatg atatactttt 8940 agctgttaag gataatcaat ctcatcttaa agcaggcaaa aatcactcca caaaacaaaa 9000 agatactatc aaaacaacac tcttgaaaaa attaatgtgt tcgatgcaac atcctccatc 9060 atggttaata cattggttta atttatacac aaaattaaac aacatattaa cacagtatcg 9120 atcaaatgag gtaaaaaacc atggttttat attgatagat aatcatactc tcaatggatt 9180 ccaatttatt ttgaatcaat atggttgtat agtttatcat aagggactca aaagaattac 9240 tgtgacaacc tataatcaat tcttgacatg gaaagatatt agccttagta gattaaatgt 9300 ttgtttaatt acatggatta gtaactgttt gaacacatta aacaaaagct taggcttaag 9360 atgtggattc aataatgtta tcttgacaca actattcctt tatggagatt gcatattaaa 9420 actattccac aatgaagggt tctacataat aaaagaggta gagggtttta ttatgtctct 9480 aattttaaac ataacagaag aagatcaatt cagaaaacgg ttttataata gtatgctcaa 9540 caacatcaca gatgctgcta ataaagctca gaaaaatctg ctatcaagag tatgtcatac 9600 attattagat aagacagtat ctgataatat aataaatggc agatggataa ttctattaag 9660 taagtttctt aaattaatta agcttgcagg tgacaataac cttaacaatc tgagtgaatt 9720 atatttttta ttcagaatat ttggacaccc aatggtagat gaaagacaag ccatggatgc 9780 tgttaaagtt aattgcaacg agaccaaatt ttacttgtta agcggtttga gtatgttaag 9840 aggtgccttt atatatagaa ttataaaagg gtttgtaaat aattacaaca gatggcctac 9900 tttaaggaat gctgttgttt tacccttaag atggttaact tactataaac taaacactta 9960 tccttcctta ttggaactta cagaaagaga tttgattgtt ttatcaggac tacgtttcta 10020 tcgtgagttt cggttgccta aaaaagtgga tcttgaaatg atcataaatg ataaggctat 10080 atcacctcct aaaaatttga tatggactag tttccctaga aattatatgc cgtcacacat 10140 acaaaattat atagaacatg aaaaattaaa attttccgaa agtgataaat caagaagagt 10200 attagagtac tatttaagag ataacaaatt caatgaatgt gatttatata actgtgtagt 10260 taatcaaagc tatctaaaca accctaatca tgtggtatca ttgactggca aagaaagaga 10320 actcagtgta ggtagaatgt ttgcaatgca accaggaatg ttcagacaag ttcaaatatt 10380 agcagagaaa atgatagctg aaaacatttt acaattcttt cctgaaagtc ttacaagata 10440 tggtgatcta gaattacaga aaatattaga attgaaagcg ggaataagta acaaatcaaa 10500 tcgttacaat gacaattaca acaattacat cagtaagtgc tctatcatca cagatctcag 10560 caaattcaat caagcattcc ggtatgaaac atcatgtatt tgtagtgatg tactagatga 10620 actgcatggt gtacaatctc tattttcctg gttacattta actattcctc atgtcacaat 10680 aatatgcaca tataggcatg caccccccta tataagagat cacattgtag atcttaacaa 10740 tgtagatgaa caaagtggat tatatagata tcatatgggt ggtatcgaag ggtggtgtca 10800 aaaactatgg accatagaag ctatatcact attggatcta atatctctca aaggaaaatt 10860 ctcaattact gccttaatta atggtgacaa tcaatcaata gatataagca aaccagtcag 10920 actcatggaa ggtcaaactc atgctcaagc agattatttg ctagcattaa atagtcttaa 10980 attgctgtat aaagagtatg caggcatagg ccacaaatta aaaggaactg agacttatat 11040 atcaagagat atgcaattta tgagtaaaac aattcaacat aacggtgtat attacccagc 11100 tagtataaag aaagtcctaa gagtgggacc atggataaac actatacttg atgatttcaa 11160 agtgagtcta gagtctatag gtagtttgac acaagaatta gaatatagag gtgaaagtct 11220 attatgcagt ttaatattta gaaatgtgtg gttatataat caaattgctt tacaactaaa 11280 aaatcatgca ttatgtaaca ataaattata tttggacata ttaaaggttc tgaaacactt 11340 aaaaaccttt tttaatcttg ataatattga tacagcatta acattgtata tgaatttgcc 11400 catgttattt ggtggtggtg atcccaactt gttatatcga agtttctata gaagaactcc 11460 tgatttcctc acagaggcta tagttcactc tgtgttcata cttagttatt atacaaacca 11520 tgatttaaag gataaacttc aagatctgtc agacgataga ttgaataagt tcttaacatg 11580 cataatcacg tttgacaaaa accctaatgc tgaattcgta acattgatga gagatcctca 11640 ggctttaggg tctgagaggc aagctaaaat tactagcgaa atcaatagac tggcagttac 11700 tgaggttttg agcacagctc caaacaaaat attctccaaa agtgcacaac actataccac 11760 tacagagata gatctaaatg atattatgca aaatatagaa cccacatatc ctcatgggct 11820 aagagttgtt tatgaaagtt taccctttta taaagcagag aaaatagtaa atcttatatc 11880 tggtacaaaa tctataacta acatactgga aaagacttct gccatagact taacagatat 11940 tgatagagcc actgagatga tgaggaaaaa cataactttg cttataagga tatttccatt 12000 agattgtaac agagataaaa gagaaatatt gagtatggaa aacctaagta ttactgaatt 12060 aagcaaatat gttagagaaa gatcttggtc tttatccaat atagttggtg ttacatcacc 12120 cagtatcatg tatacaatgg acatcaaata tacaacaagc actatagcta gtggcataat 12180 catagagaaa tataatgtca acagtttaac acgtggtgag agaggaccca ctaaaccatg 12240 ggttggttca tctacacaag agaaaaaaac aatgccagtt tataatagac aagttttaac 12300 caaaaaacag agagatcaaa ttgatctatt agcaaaattg gactgggtgt atgcttctat 12360 agataacaag gatgaattca tggaagaact cagcatagga actcttgggt taacatatga 12420 gaaagccaaa aaattatttc cacaatattt aagtgttaac tatttgcatc gccttacagt 12480 cagtagtaga ccatgtgaat tccctgcatc aataccagct tatagaacta caaattatca 12540 ttttgatact agccctatta atcgcatatt aacagaaaag tatggtgatg aagatattga 12600 tatagtattc caaaactgta taagttttgg ccttagctta atgtcagtag tagagcaatt 12660 taccaatgta tgtcctaaca gaattattct catacccaag cttaatgaga tacatttgat 12720 gaaacctccc atattcacag gtgatgttga tattcacaag ttaaaacaag tgatccaaaa 12780 acagcatatg tttttaccag acaaaataag tttgactcaa tatgtggaat tattcttaag 12840 taataaaaca ctcaaatctg gatctcatgt taattctaat ttaatattgg cacataagat 12900 atctgactat tttcataata cttacatttt aagtactaat ttagctggac attggattct 12960 gattatacaa cttatgaaag attctaaagg tatttttgaa aaagactggg gagagggata 13020 tataactgat catatgttca ttaatttgaa agttttcttc aatgcttata agacctatct 13080 cttgtgtttt cataaaggtt acggcagagc aaagctggag tgcgatatga atacttcaga 13140 tctcctatgt gtattggaat taatagacag tagttattgg aagtccatgt ctaaggtatt 13200 tttagaacaa aaagttatca aatacattct cagccaggat gcaagtttac atagagtaaa 13260 aggatgtcat agcttcaaac tatggtttct taaacgtctt aatgtagcag aattcacagt 13320 ttgcccttgg gttgttaaca tagattatca tccaacacat atgaaagcaa tattaactta 13380 tatagatctt gttagaatgg gattgataaa tatagataga atatacatta aaaataaaca 13440 caaattcaat gatgaatttt atacttctaa tctcttttac attaattata acttctcaga 13500 taatactcat ctattaacta aacatataag gattgctaat tctgaattag aaaataatta 13560 caacaaatta tatcatccta cacctgaaac tctagaaaat atactaacca atctggttaa 13620 atgtgatgac aaaaagacac tgaatgacta ttgtataggt aaaaatgttg actcaataat 13680 gttaccattg ttatctaata agaaacttat taaatcgtct acaatgatta gaaccaatta 13740 cagcaaacaa gatttgtata atttatttcc tacggttgtg attgataaaa ttatagatca 13800 ttcaggtaat acagccaaat ctaaccaact ttacactact acttctcatc aaatatcttt 13860 agtacacaat agcacatcac tttattgcat gcttccttgg catcatatta atagattcaa 13920 ttttgtgttt agttctacag gttgtaaaat tagtatagag tatattttaa aagaccttaa 13980 aattaaagat cctaattgta tagcattcat aggtgaagga gcagggaatt tattgttgcg 14040 tacagtagtg gaacttcatc ctgatataag atatatttac agaagtctga aagattgcaa 14100 tgatcatagt ttacctattg agtttttaag gctgtacaat gggcatatca acattgatta 14160 tggtgaaaat ttgaccattc ccgctacaga tgcaaccaac aacattcatt ggtcttattt 14220 gcatataaag tttgctgaac ctatcagtct ttttgtttgt gatgctgaat tacctgtaac 14280 agtcaactgg agtaaaatta taatagagtg gagcaagcat gtaagaaaat gcaagtactg 14340 ttcctcagtt aataaatgta cgttaatagt aaaatatcat gctcaagatg atatcgattt 14400 caaattagac aacataacta tattaaaaac ttatgtatgc ttaggcagta agttaaaggg 14460 gtctgaagtt tacttagtcc ttacaatagg tcctgcaaat gtgttcccag tatttaatgt 14520 agtacaaaat gctaaattga tactatcaag aaccaaaaat ttcatcatgc ctaagaaggc 14580 tgataaggag tctattgatg caaatattaa aagtttgata ccatttcttt gttaccctat 14640 aacaaaaaaa ggaattaata ctgcattatc aaaactaaag agtgttgtta gtggagatat 14700 actatcatat tctatagctg gacgtaatga agttttcagc aataaactta taaatcataa 14760 gcatatgaac atcttaaagt ggttcaacca tgttttaaat ttcagatcaa cagaacttaa 14820 ctataatcat ttatatatgg tagaatccac atatccttat ctaagtgaat tgttaaacag 14880 cttgacaact aatgaactta aaaaactgat taaaatctgt ctcttataca atatacatat 14940 aaaccaatta gatttggatt taaatttgtt cctcctagat caaaatgata attttaggat 15000 tagtccacta aaagttatta aaaaatatat aatttttaat tttaaataac tataattgaa 15060 tacagtgtta gtgtgtagct gaatttttaa tataagattt ttattcatta ttcattatga 15120 aagttgtata acaaactacc tgtgatttta atcagttttt taagttcatt agttgtcaag 15180 ctgtttaaca attcacttag ataaggatat gtggattcta ccatatataa atgattatag 15240 ttaagttctg ttgatctgaa atttaaaaca tggttgaacc actttaagat gttcatatgc 15300 ttatgattta taagtttatt gctgaaaact gtca 15334 // ID MG027861; SV 1; linear; viral cRNA; STD; VRL; 14901 BP. XX AC MG027861; XX PR Project:PRJNA227457; XX DT 03-JAN-2018 (Rel. 135, Created) DT 03-JAN-2018 (Rel. 135, Last updated, Version 6) XX DE Respiratory syncytial virus type A isolate RSV-A/US/BID-V8368/2003, partial DE genome. XX KW . XX OS Respiratory syncytial virus type A OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Pneumoviridae; Orthopneumovirus. XX RN [1] RP 1-14901 RA Newman R.M., Zody M.C., DeVincenzo J.P., Grad Y., Lipsitch M., Murphy R., RA Fitzgerald M., Young S., Gargeya S., Poon T.W., Charlebois P., Weiner B., RA Yang X., Piper M.E., McCowan C., Ireland A., Levin J., Malboeuf C., Qu J., RA Chapman S.B., Murphy C., Wortman J., Nusbaum C., Birren B.; RT "Comparative Genomics of Respiratory Syncytial Virus for Broad Institute RT Viral Genomics Initiative"; RL Unpublished. XX RN [2] RP 1-14901 RA Newman R.M., Zody M.C., DeVincenzo J.P., Grad Y., Lipsitch M., Murphy R., RA Fitzgerald M., Young S., Gargeya S., Poon T.W., Charlebois P., Weiner B., RA Yang X., Piper M.E., McCowan C., Ireland A., Levin J., Malboeuf C., Qu J., RA Chapman S.B., Murphy C., Wortman J., Nusbaum C., Birren B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Infectious Disease Initiative, Broad Institute, 75 Ames Street, Cambridge, RL MA 02142, USA XX DR MD5; dd0291685d1d235245364d88810fbcf5. DR BioSample; SAMN06677487. XX CC ##Assembly-Data-START## CC Assembly Method :: Vicuna v. 1 CC Assembly Name :: V8368-1 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..14901 FT /organism="Respiratory syncytial virus type A" FT /host="Homo sapiens" FT /isolate="RSV-A/US/BID-V8368/2003" FT /mol_type="viral cRNA" FT /country="USA" FT /collection_date="2003" FT /db_xref="taxon:1439707" FT 5'UTR 1..7 FT /note="indels in UTR have not been validated" FT gene 8..427 FT /gene="NS1" FT CDS 8..427 FT /codon_start=1 FT /gene="NS1" FT /product="non-structural protein 1" FT /db_xref="InterPro:IPR005099" FT /db_xref="PDB:5VJ2" FT /db_xref="UniProtKB/TrEMBL:X5FN71" FT /protein_id="AUH26173.1" FT /translation="MGSNSLSMIKVRLQNLFDNDEVALLKITCYTDKLIHLTNALAKAV FT IHTIKLNGIVFVHVITSSDICPNNNIVVKSNFTTMPVLQNGGYIWEMMELTHCSQPNGL FT IDDNCEIKFSKKLSDSTMTNYMNQLSELLGFDLNP" FT gene 537..911 FT /gene="NS2" FT CDS 537..911 FT /codon_start=1 FT /gene="NS2" FT /product="non-structural protein 2" FT /db_xref="InterPro:IPR004336" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP42" FT /protein_id="AUH26174.1" FT /translation="MDTTHNGTTPQRLMITDMRPLSLETIITSLTRDIITHKFIYLINH FT ECIVRKLDERQATFTFLVNYEMKLLHKVGSTKYKKYTEYNTKYGTFPMPIFINHDGFLE FT CIGIKPTKHTPIIYKYDLNP" FT gap 1024..1705 FT /estimated_length=682 FT gene <1706..2224 FT /gene="N" FT CDS <1706..2224 FT /codon_start=1 FT /gene="N" FT /product="nucleoprotein" FT /db_xref="GOA:A0A2H5CP49" FT /db_xref="InterPro:IPR004930" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP49" FT /protein_id="AUH26175.1" FT /translation="IDVFVHFGIAQSSTRGGSRVEGIFAGLFMNAYGAGQVMLRWGVLA FT KSVKNIMLGHASVQAEMEQVVEVYEYAQKLGGEAGFYHILNNPKASLLSLTQFPHFSSV FT VLGNAAGLGIMGEYRGTPRNQDLYDAAKAYAEQLKENGVINYSVLDLTAEELEAIKHQL FT NPKDNDVEL" FT gene 2256..2981 FT /gene="P" FT CDS 2256..2981 FT /codon_start=1 FT /gene="P" FT /product="phosphoprotein" FT /db_xref="GOA:X5F0A5" FT /db_xref="InterPro:IPR003487" FT /db_xref="UniProtKB/TrEMBL:X5F0A5" FT /protein_id="AUH26176.1" FT /translation="MEKFAPEFHGEDANNRATKFLESIKGKFTSPKDPKKKDSIISVNS FT IDIEVTKESPITSNSTIINPTNETDDTAGNKPNYQRKPLVSFKDDPTPSDNPFSKLYKE FT TIETFDNNEEESSYSYEEINDQTNDNITARLDRIDEKLSEILGMLHTLVVASAGPTSAR FT DGIRDAMVGLREEMIEKIRTEALMTNDRLEAMARLRNEESEKMAKDTSDEVSLNPTSEK FT LNNLLEGNDSDNDLSLEDF" FT gene 3165..>3796 FT /gene="M" FT CDS 3165..>3796 FT /codon_start=1 FT /gene="M" FT /product="matrix protein" FT /db_xref="GOA:A0A2H5CP48" FT /db_xref="InterPro:IPR005056" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP48" FT /protein_id="AUH26177.1" FT /translation="METYVNKLHEGSTYTAAVQYNVLEKDDDPASLTIWVPMFQSSMPA FT DLLIKELANVNILVKQISTPKGPSLRVMINSRSAVLAQMPSKFTICANVSLDERSKLAY FT DVTTPCEIKACSLTCLKSKNMLTTVKDLTMKTLNPTHDIIALCEFENIVTSKKVIIPTY FT LRSISVRNKDLNTLENITTTEFKNAITNAKIIPYSGLLLVITVTDNKG" FT gap 3797..4487 FT /estimated_length=691 FT gene 4592..5488 FT /gene="G" FT CDS 4592..5488 FT /codon_start=1 FT /gene="G" FT /product="attachment protein" FT /db_xref="GOA:X5FQ75" FT /db_xref="InterPro:IPR000925" FT /db_xref="UniProtKB/TrEMBL:X5FQ75" FT /protein_id="AUH26178.1" FT /translation="MSKTKDQRTAKTLEKTWDTLNHLLFISSCLYKLNLKSIAQITLSI FT LAMIISTSLIIVAIIFIASANNKVTLTTAIIQDATSQIKNTTPTYLTQNPQLGISFFNL FT SGTISQTTAILAPTTPSVEPILQSTTVKTKNTTTTQIQPSKLTTKQRQNKPPNKPNDDF FT HFEVFNFVPCSICSNNPTCWAICKRIPSKKPGKKTTTKPTKKQTIKTTKKDLKPQTTKP FT KEAPTTKPTEKPTINITKPNIRTTLLTNSTTGNLEHTSQEETLHSTSSEGNTSPSQIYT FT TSEYLSQPPSPSNITDQ" FT gap 5491..6019 FT /estimated_length=529 FT gene <6020..7288 FT /gene="F" FT CDS <6020..7288 FT /codon_start=1 FT /gene="F" FT /product="fusion protein" FT /db_xref="GOA:A0A2H5CP64" FT /db_xref="InterPro:IPR000776" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP64" FT /protein_id="AUH26179.1" FT /translation="AVSKVLHLEGEVNKIKSALLSTNKAVVSLSNGVSVLTSKVLDLKN FT YIDKQLLPIVNKQSCSISNIETVIEFQQKNNRLLEITREFSVNAGVTTPVSTYMLTNSE FT LLSLINDMPITNDQKKLMSNNVQIVRQQSYSIMSIIKEEVLAYVVQLPLYGVIDTPCWK FT LHTSPLCTTNTKEGSNICLTRTDRGWYCDNAGSVSFFPQAETCKVQSNRVFCDTMNSLT FT LPSEVNLCNIDIFNPKYDCKIMTSKTDVSSSVITSLGAIVSCYGKTKCTASNKNRGIIK FT TFSNGCDYVSNKGVDTVSVGNTLYYVNKQEGKSLYVKGEPIINFYDPLVFPSDEFDASI FT SQVNEKINQSLAFIRKSDELLHNVNVGKSTTNIMITTIIIVIIVILLLLIAVGLFLYCK FT ARSTPVTLSKDQLSGINNIAFSN" FT gap 7442..7595 FT /estimated_length=154 FT gene <7596..8093 FT /gene="M2" FT CDS <7596..8093 FT /codon_start=1 FT /gene="M2" FT /product="matrix protein 2-1" FT /db_xref="GOA:A0A2H5CP56" FT /db_xref="InterPro:IPR009452" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP56" FT /protein_id="AUH26180.1" FT /translation="WPPHALLVRQNFMLNRILKSMDKSIDTLSEISGAAELDRTEEYAL FT GVVGVLESYIGSINNITKQSACVAMSKLLTELNSDDIKKLRDNEEPNSPKIRVYNTVIS FT YIESNRKNNKQTIHLLKRLPADVLKKTIKNTLDIHKSITINNPKESTVNDTNDHAKNND FT TT" FT gene 8402..14899 FT /gene="L" FT CDS 8402..14899 FT /codon_start=1 FT /gene="L" FT /product="L polymerase" FT /db_xref="GOA:A0A2H5CP59" FT /db_xref="InterPro:IPR014023" FT /db_xref="InterPro:IPR025786" FT /db_xref="InterPro:IPR026890" FT /db_xref="InterPro:IPR039736" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP59" FT /protein_id="AUH26181.1" FT /translation="MDPIINGNSANVYLTDSYLKGVISFSECNALGSYIFXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXL FT SMLRGAFIYRIIKGFVNNYNRWPTLRNAIVLPLRWLTYYKLNTYPSLLELTERDLIVLS FT GLRFYREFRLPKKVDLEMIINDKAISPPKNLIWTSFPRNYMPSHIQNYIEHEKLKFSES FT DKSRRVLEYYLRDNKFNECDLYNCVVNQSYLNNPNHVVSLTGKERELSVGRMFAMQPGM FT FRQVQILAEKMIAENILQFFPESLTRYGDLELQKILELKAGISNKSNRYNDNYNNYISK FT CSIITDLSKFNQAFRYETSCICSDVLDELHGVQSLFSWLHLTIPHVTIICTYRHAPPYI FT RDHIVDLNNVDEQSGLYRYHMGGIEGWCQKLWTXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELEYRGESLLCSLIFRN FT VWLYNQIALQLKNHALCNNKLYLDILKVLKHLKTFFNLDNIDTALTLYMNLPMLFGGGD FT PNLLYRSFYRRTPDFLTEAIVHSVFILSYYTNHDLKDKLQDLSDDRLNKFLTCIITFDK FT NPNAEFVTLMRDPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHYTTTEIDL FT NDIMQNIEPTYPHGLRVVYESLPFYKAEKIVNLISGTKSITNILEKTSAIDLTDIDRAT FT EMMRKNITLLIRIFPLDCNRDKREILSMENLSITELSKYVRERSWSLSNIVGVTSPSIM FT YTMDIKYTTSTIASGIIIEKYNVNSLTRGERGPTKPWVGSSTQEKKTMPVYNRQVLTKK FT QRDQIDLLAKLDWVYASIDNKDEFMEELSIGTLGLTYEKAKKLFPQYLSVNYLHRLTVS FT SRPCEFPASIPAYRTTNYHFDTSPINRILTEKYGDEDIDIVFQNCISFGLSLMSVVEQF FT TNVCPNRIILIPKLNEIHLMKPPIFTGDVDIHKLKQVIQKXXXXXXXXXXXXXXXXXXX FT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDWGE FT GYITDHMFINLKVFFNAYKTYLLCFHKGYGRAKLECDMNTSDLLCVLELIDSSYWKSMS FT KVFLEQKVIKYILSQDASLHRVKGCHSFKLWFLKRLNVAEFTVCPWVVNIDYHPTHMKA FT ILTYIDLVRMGLINIDRIYIKNKHKFNDEFYTSNLFYINYNFSDNTHLLTKHIRIANSE FT LENNYNKLYHPTPETLENILTNPVKCDDKKTLNDYCIGKNVDSIMLPLLSNKKLIKSST FT TIRTNYSKQDLYNLFPTVVIDKIIDHSGNTAKSNQLYTTTSHQISLVHNSTSLYCMLPW FT HHINRFNFVFSSTGCKISIEYILKDLKIKDPSCIAFIGEGAGNLLLRTVVELHPDIRYI FT YRSLKDCNDHSLPIEFLRLYNGHINIDYGENLTIPATDATNNIHWSYLHIKFAEPISLF FT VCDAELPVTVNWSKIIIEWSKHVRKCKYCSSVNKCTLIVKYHAQDDIDFKLDNITILKT FT YVCLGSKLKGSEVYLVLTIGPANVFPVFNVVQNAKLILSRTKNFIMPKKADKESIDANI FT KSLIPFLCYPITKKGINTALSKLKSVVSGDILSYSIAGRNEVFSNKLINHKHMNILKWF FT NHVLNFRSTELNYNHLYMVESTYPYLSELLNSLTTNELKKLIKITGSLLYNFHNE" FT gap 8511..9770 FT /estimated_length=1260 FT gap 10759..11140 FT /estimated_length=382 FT gap 11585..11694 FT /estimated_length=110 FT gap 12728..12949 FT /estimated_length=222 XX SQ Sequence 14901 BP; 4194 A; 1971 C; 1709 G; 2997 T; 4030 other; gttttagatg ggcagcaatt cattaagtat gataaaagtt agattacaaa atttatttga 60 caatgatgaa gtagcattgt taaaaataac ctgctatact gacaaattga tacatttaac 120 taatgctttg gctaaggcag tgatacatac aatcaaattg aatggcattg tatttgtgca 180 tgttattaca agtagtgata tttgccctaa taataatatt gtagtgaaat ccaacttcac 240 aacaatgcca gtgttacaaa atggaggtta tatatgggaa atgatggaat taacacactg 300 ctctcaaccc aatggcctaa tagatgacaa ttgtgaaatc aaattctcca aaaaactaag 360 cgattcaaca atgaccaact atatgaatca attatctgaa ttacttggat ttgatctcaa 420 tccataaatt ataacaaata tcaactagca aatcaatgtc aataacacca ttagttaata 480 taaaacttga cagaagataa aaatggggca aataaataaa ctcagctgac ccaaccatgg 540 acacaacaca caatggtact acaccacaaa gactgatgat cacagacatg agaccattgt 600 cacttgagac tataataaca tcactaacca gagacatcat aacacacaaa tttatatact 660 tgataaatca tgaatgtata gtgagaaaac ttgatgaaag acaggccaca tttacattcc 720 tggtcaacta tgaaatgaaa ctattgcaca aagtgggaag cactaaatac aaaaaatata 780 ctgaatacaa cacaaaatat ggcacttttc ctatgccaat atttatcaat catgatgggt 840 tcttagaatg cattggcatt aagcctacaa agcacactcc cataatatac aagtatgatc 900 tcaatccatg aatttcaaca caagattcac acaatctgaa ataacaactt catgcataac 960 tacactccat agtccaaatg gagcctgaaa attatagtaa tttaaaatta aggagagaca 1020 taannnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1200 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1260 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1320 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1380 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1440 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1500 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1560 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1620 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1680 nnnnnnnnnn nnnnnnnnnn nnnnnataga tgtttttgtt cattttggta tagcacaatc 1740 ttctaccaga ggtggcagta gagttgaagg gatttttgca ggattgttta tgaatgccta 1800 tggtgcaggg caagtgatgt tacggtgggg agtcttagca aaatcagtta aaaatattat 1860 gctaggacac gctagtgtgc aagcagaaat ggaacaagtt gtggaagttt atgaatatgc 1920 ccaaaaattg ggtggagaag caggattcta ccatatattg aataacccaa aagcatcatt 1980 attatctttg actcaatttc cccacttctc cagtgtagta ttaggcaatg ctgctggcct 2040 aggcataatg ggagaataca gaggtacacc aaggaatcaa gatctatatg atgctgcaaa 2100 ggcatatgct gaacaactca aagaaaatgg tgtgattaac tacagtgtat tagacttgac 2160 agcagaagaa ctagaggcta tcaaacatca gcttaatcca aaagataatg atgtagagct 2220 ttgagttaat aaaaaagtgg ggcaaataaa tcatcatgga aaagtttgct cctgaattcc 2280 atggagaaga cgcaaacaac agagccacta aattcctaga atcaataaag ggcaaattca 2340 catcacctaa agatcccaag aaaaaagata gtatcatatc tgtcaactca atagatatag 2400 aagtaaccaa agaaagccct ataacttcaa attcaaccat tataaaccct acaaatgaga 2460 cagatgatac tgcagggaac aagcccaatt atcaaagaaa acctctagtg agtttcaaag 2520 atgaccctac gccaagtgat aatccctttt caaaactata caaagaaacc atagaaacat 2580 ttgataacaa tgaagaagaa tctagctatt catatgaaga aataaatgat cagacaaatg 2640 ataatataac agcaagatta gataggattg atgaaaaatt aagtgaaata ctaggaatgc 2700 ttcacacact agtagtagca agtgcaggac ctacatctgc tcgggatggt ataagagatg 2760 ccatggttgg tttaagagaa gaaatgatag aaaaaatcag aactgaagca ttaatgacca 2820 atgatagatt agaagctatg gcaaggctca ggaatgagga aagtgaaaag atggcaaaag 2880 acacgtcaga tgaagtgtct ctcaatccaa catcagagaa attgaacaac ctgttggaag 2940 ggaatgatag tgacaatgat ctatcacttg aagatttctg atcagttacc aatctgcaca 3000 ttaacacaca acaccaacag aagaccaaca aacaaaacaa ctcacctatc caaccaaaca 3060 tctatctgcc aatcagccaa ccagccaaaa aaacacccag ccaatacaaa attagtcacc 3120 cggaaaaaat caatactata gttacaaaaa aagatggggc aaatatggaa acatacgtga 3180 acaaacttca cgaaggctcc acatacacag ctgctgttca atacaatgtc ctagaaaaag 3240 acgatgaccc tgcatcactt acaatatggg tgcccatgtt ccaatcatcc atgccagcag 3300 atttacttat aaaagaacta gctaatgtca acatactagt gaaacaaata tccacaccca 3360 aaggaccttc attaagagtc atgataaact cgagaagtgc agtgctagca caaatgccca 3420 gcaaattcac tatatgtgcc aatgtgtcct tggatgaaag aagcaagctg gcatatgatg 3480 taaccacacc ctgcgaaatc aaggcatgta gtctaacatg cctaaaatca aaaaatatgt 3540 taactacagt taaagatctc actatgaaaa cactcaaccc aacacatgac attattgctt 3600 tatgtgaatt tgaaaatata gtaacatcaa aaaaagtcat aataccaaca tacttaagat 3660 ccatcagtgt cagaaataaa gatctgaaca cacttgaaaa tataacaacc accgaattca 3720 aaaatgccat cacaaatgca aaaatcatcc cttactcagg attactgtta gtcatcacag 3780 tgactgacaa caaaggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3840 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3900 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3960 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4020 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4080 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4140 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4200 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4260 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4320 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4380 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4440 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnncac catgcaagcc 4500 atcatccata ctataaagta gttaattaaa aatagtcata acaatgaact aagatattaa 4560 gactaacaac aacgttgggg caaatgcaaa catgtccaaa accaaggacc aacgcaccgc 4620 caagacacta gaaaagacct gggacactct caatcatcta ttattcatat catcgtgctt 4680 atacaagtta aatcttaaat ctatagcaca aatcacatta tccattctgg caatgataat 4740 atcaacttca cttataattg tagctatcat attcatagcc tcagcaaaca acaaagtcac 4800 actaacaact gcaatcatac aagatgcaac aagccagatc aagaacacaa ccccaacata 4860 cctgacccag aatccccagc ttggaatcag cttcttcaat ctgtctggaa ctatatcaca 4920 aaccaccgcc atactagctc caacaacacc aagtgtcgag ccaatcctgc aatctacaac 4980 agtcaagacc aaaaacacaa caacaaccca aatacaaccc agcaagctca ccacaaaaca 5040 acgccaaaac aaaccaccaa acaaacccaa cgatgatttt cactttgaag tgttcaactt 5100 tgtaccctgc agcatatgca gcaacaatcc aacttgctgg gccatctgca aaagaatacc 5160 aagcaaaaaa cctggaaaga aaaccaccac caagcccacg aaaaaacaaa ccatcaagac 5220 aaccaaaaaa gatctcaaac ctcaaactac aaaaccaaag gaagcaccta ccaccaagcc 5280 cacagaaaag ccaaccatca acatcaccaa accaaacatc agaactacac tgctcaccaa 5340 cagtaccaca ggaaatctag aacacacaag tcaagaggag accctccatt caacctcctc 5400 cgaaggcaat acaagccctt cacaaatcta tacaacatcc gagtacctat cacaacctcc 5460 atctccatcc aacataacag accagtagtc nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5520 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5580 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5640 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5700 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5760 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5820 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5880 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5940 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6000 nnnnnnnnnn nnnnnnnnng ctgtatctaa agtcctgcac ctagaagggg aagtgaacaa 6060 aatcaaaagt gctctactat ccacaaacaa ggctgtagtc agcttatcaa atggagttag 6120 tgtcttaacc agcaaagtgt tagacctcaa aaactatata gataaacagt tgttacccat 6180 tgtgaacaag caaagctgca gcatatcaaa cattgaaact gtgatagaat tccaacaaaa 6240 gaacaacaga ctactagaga ttaccaggga atttagtgtt aatgcaggtg taactacacc 6300 tgtaagcact tatatgttaa caaatagtga attattatca ttaatcaatg atatgcctat 6360 aacaaatgat cagaaaaagt taatgtccaa caatgttcaa atagttagac agcaaagtta 6420 ctctatcatg tccataataa aggaggaagt cttagcatat gtagtacaat taccactata 6480 tggtgtaata gatacacctt gttggaaact acacacatcc cctctatgca caaccaacac 6540 aaaggaaggg tccaacatct gtttaacaag aaccgacaga ggatggtact gtgacaatgc 6600 aggatcagtt tctttcttcc cacaagctga aacatgcaaa gttcaatcga atcgagtatt 6660 ttgtgacaca atgaacagtt taacattacc aagtgaagta aatctctgca acattgacat 6720 attcaaccct aaatatgatt gcaaaattat gacttcaaaa acagatgtaa gcagctccgt 6780 tatcacatct ctaggagcca ttgtgtcatg ctatggcaaa actaaatgta cagcatccaa 6840 taaaaatcgt ggaatcataa agacattttc taacgggtgt gattatgtat caaataaggg 6900 ggtggacact gtatctgtag gtaatacatt atattatgta aataagcaag aaggaaaaag 6960 tctctatgta aaaggtgaac caataataaa tttctatgac ccattagtgt tcccttctga 7020 tgaatttgat gcatcaatat ctcaagtcaa tgagaagatt aaccagagcc tagcatttat 7080 tcgtaaatcc gatgaattat tacataatgt aaatgttggt aaatccacca caaatatcat 7140 gataactact ataattatag tgattatagt aatattgtta ttattaattg cagttgggct 7200 gttcctatac tgcaaggcca gaagcacacc agtcacacta agcaaggatc aactgagtgg 7260 tataaataat attgcattta gtaactgaat aagaatagta cctaatcatg ttcttacaat 7320 ggttcatcat ccgaccatag atgacccatc tatcattgga ttttcttaaa gtctgaactt 7380 catcgcaact ctcatctata aaccatctca cttacactat ttaagtagat tcctatttta 7440 tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7500 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7560 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnntggcc acctcatgca ctgcttgtaa 7620 gacaaaactt tatgttaaac agaatactta agtctatgga taaaagcata gatactttat 7680 cagaaataag tggagctgca gagttggaca gaactgaaga gtatgccctc ggtgtagttg 7740 gagtgctaga gagttatata ggatcaataa ataatataac taaacaatca gcatgtgttg 7800 ccatgagcaa actcctcact gaactcaaca gtgatgacat caagaaacta agagacaatg 7860 aagagccaaa ttcacctaag ataagagtgt acaatactgt catatcatat attgaaagca 7920 acaggaaaaa caataaacaa actatccatc tgttaaaaag attgccagca gacgtattga 7980 agaaaaccat caaaaacaca ttggatatcc acaagagcat aaccatcaac aacccaaaag 8040 aatcaactgt taatgataca aacgaccatg ccaaaaataa tgatactacc tgacaaatat 8100 ccttgtagta taaattccat actaataaca agtagttgta gagttactat gtataatcaa 8160 aagaacacac tatatttcaa tcaaaacaac caacataacc atacatactc accaaatcaa 8220 ccattcaatg aaatccattg gaccctctca agacttaatt gatgcaattc aaaattttct 8280 acaacatcta ggtattactg atgacatata tacaatatat atattagtgt cataacactc 8340 aataccaata cttaccacat catcaaatta ttaactcaaa caattcaaat catgggacaa 8400 aatggatccc attattaatg gaaattctgc taatgtttat ctaaccgata gttatttaaa 8460 aggtgttatt tctttctcag aatgtaatgc tttaggaagt tacatattcg nnnnnnnnnn 8520 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8580 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8640 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8700 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8760 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8820 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8880 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8940 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9000 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9060 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9120 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9180 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9240 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9300 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9360 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9420 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9480 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9540 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9600 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9660 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9720 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn gtttgagtat 9780 gttaagaggt gcctttatat atagaattat aaaagggttt gtaaataatt acaacagatg 9840 gcctacttta aggaatgcta ttgttttacc cttaagatgg ttaacttact ataaactaaa 9900 cacttatcct tccttattgg aacttacaga aagagatttg attgttttat caggactacg 9960 tttctatcgt gagtttcggt tgcctaaaaa agtggatctt gaaatgatca taaatgataa 10020 ggctatatca cctcctaaaa atttgatatg gactagtttc cctagaaatt atatgccgtc 10080 acacatacaa aattatatag aacatgaaaa attaaaattt tccgagagtg ataaatcaag 10140 aagagtatta gagtactatt taagagataa caaattcaat gaatgtgatt tatataactg 10200 tgtagttaat caaagctatc ttaacaaccc taatcatgtg gtatcattga ctggcaaaga 10260 aagagaactc agtgtaggta gaatgtttgc aatgcaacca ggaatgttca gacaagttca 10320 aatattagca gagaaaatga tagctgaaaa cattttacaa ttctttcctg aaagtcttac 10380 aagatatggt gatctagaat tacagaaaat attagaattg aaagcgggaa taagtaacaa 10440 atcaaatcgt tacaatgaca attacaacaa ttacatcagt aagtgctcta tcatcacaga 10500 tctcagcaaa ttcaatcaag cattccggta tgaaacatca tgtatttgta gtgatgtact 10560 ggatgaactg catggtgtac aatctctatt ttcctggtta catttaacta ttcctcatgt 10620 cacaataata tgcacatata ggcatgcacc cccctatata agagatcaca ttgtagatct 10680 taacaatgta gatgaacaaa gtggattata tagatatcat atgggtggta tcgaagggtg 10740 gtgtcaaaaa ctatggacnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10800 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10860 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10920 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10980 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11040 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11100 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn gaattagaat atagaggtga 11160 aagtctatta tgcagtttaa tatttagaaa tgtgtggtta tataatcaaa ttgctttaca 11220 actaaaaaat catgcattat gtaacaataa attatatttg gacatattaa aggttctgaa 11280 acacttaaaa acctttttta atcttgataa tattgataca gcattaacat tgtatatgaa 11340 tttgcccatg ttatttggtg gtggtgatcc caacttgtta tatcgaagtt tctatagaag 11400 aactcctgat ttcctcacag aggctatagt tcactctgtg ttcatactta gttattatac 11460 aaaccatgat ttaaaggata aacttcaaga tctgtcagac gatagattga ataagttctt 11520 aacatgcata atcacgtttg acaaaaaccc taatgctgaa ttcgtaacat tgatgagaga 11580 tcctnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11640 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnacacta 11700 taccactaca gagatagatc taaatgatat tatgcaaaat atagaaccta catatcctca 11760 tgggctaaga gttgtttatg aaagtttacc cttttataaa gcagagaaaa tagtaaatct 11820 tatatccggt acaaaatcta taactaacat actggaaaag acttctgcca tagacttaac 11880 agatattgat agagccactg agatgatgag gaaaaacata actttgctta taaggatatt 11940 tccattagat tgtaacagag acaaaagaga aatattgagt atggaaaacc taagtattac 12000 tgaattaagc aaatatgtta gagaaagatc ttggtcttta tccaatatag ttggtgttac 12060 atcacccagt atcatgtata caatggacat caaatataca acaagcacta tagctagtgg 12120 cataatcata gagaaatata atgtcaacag tttaacacgt ggtgagagag gacccactaa 12180 accatgggtt ggttcatcta cacaagagaa aaaaacaatg ccagtttata atagacaagt 12240 tttaaccaaa aaacagagag atcaaattga tctattagca aaattggatt gggtgtatgc 12300 atctatagat aacaaggatg aattcatgga agaactcagc ataggaactc ttgggttaac 12360 atatgagaaa gccaaaaaat tatttccaca atatttaagt gttaactatt tgcatcgcct 12420 tacagtcagt agtagaccat gtgaattccc tgcatcaata ccagcttata gaactacaaa 12480 ttatcacttt gatactagcc ctattaatcg catattaaca gaaaagtatg gtgatgaaga 12540 tattgatata gtattccaaa actgtataag ttttggcctt agcttaatgt cagtagtaga 12600 gcaatttacc aatgtatgtc ctaacagaat tattctcata cccaagctta atgagataca 12660 tttgatgaaa cctcccatat tcacaggtga tgttgatatt cacaagttaa aacaagtgat 12720 ccaaaaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12780 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12840 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12900 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnng attggggaga 12960 gggatatata actgatcata tgttcattaa tttgaaagtt ttcttcaatg cttataagac 13020 ctatctcttg tgttttcata aaggttacgg cagagcaaag ctggagtgtg atatgaatac 13080 ttcagatctc ctatgtgtat tggaattaat agacagtagt tattggaagt ctatgtctaa 13140 ggtattttta gaacaaaaag ttatcaaata cattctcagc caggatgcaa gtttacatag 13200 agtaaaagga tgtcatagct tcaaactatg gtttcttaaa cgtcttaatg tagcagaatt 13260 cacagtttgc ccttgggttg ttaacataga ttatcatcca acacacatga aagcaatatt 13320 aacttatata gatcttgtta gaatgggatt gataaatata gatagaatat acattaaaaa 13380 taaacacaaa ttcaatgatg aattttatac ttctaatctc ttttacatta attataactt 13440 ctcagataat actcatctat taactaaaca tataaggatt gctaattctg aattagaaaa 13500 taattacaac aaattatatc atcctacacc tgaaactcta gaaaatatac taaccaatcc 13560 ggttaaatgt gatgacaaaa agacactgaa tgactattgt ataggtaaaa atgttgactc 13620 aataatgtta ccattgttat ctaataagaa gcttattaaa tcgtctacaa cgattagaac 13680 caattacagc aaacaagatt tgtataattt atttcctacg gttgtgattg ataaaattat 13740 agatcattca ggtaatacag ccaaatctaa ccaactttac actactactt ctcatcaaat 13800 atctttagta cacaatagca catcacttta ttgcatgctt ccttggcatc atattaatag 13860 attcaatttt gtgtttagtt ctacaggttg taaaattagt atagagtata ttttaaaaga 13920 ccttaaaatt aaagatccta gttgtatagc attcataggt gaaggagcag ggaatttatt 13980 gttgcgtaca gtagtggaac ttcatcctga tataagatat atttacagaa gtctgaaaga 14040 ttgcaatgat catagtttac ctattgagtt tttaaggcta tacaatggac atattaacat 14100 tgattatggt gaaaatttga ccattcccgc tacagatgca accaacaaca ttcattggtc 14160 ttatttgcat ataaagtttg ctgaacctat cagtcttttt gtttgtgatg ctgaattgcc 14220 tgtaacagtc aactggagta aaattataat agagtggagc aagcatgtaa gaaaatgcaa 14280 gtactgttcc tcagttaata aatgtacgtt aatagtaaaa taccatgctc aagatgatat 14340 cgatttcaaa ttagacaaca taactatatt aaaaacttat gtatgcttgg gcagtaagtt 14400 aaaggggtct gaagtttact tagtccttac aataggtcct gcaaatgtgt tcccagtatt 14460 taatgtagta caaaatgcta aattgatact atcaagaacc aaaaatttca tcatgcctaa 14520 gaaggctgat aaagagtcta ttgatgcaaa tattaaaagt ttgataccct ttctttgtta 14580 ccctataaca aaaaaaggaa ttaatactgc attatcaaaa ctaaagagtg ttgttagtgg 14640 agatatacta tcatattcta tagctggacg taatgaagtt ttcagcaata aacttataaa 14700 tcataagcat atgaacatct taaagtggtt caaccatgtt ttaaatttca gatcaacaga 14760 acttaactat aatcatttat atatggtaga atccacatat ccttatctaa gtgaattgtt 14820 aaacagcttg acaactaatg aacttaaaaa actgattaaa atcacaggta gtttgttata 14880 caactttcat aatgaataat g 14901 // ID MG027862; SV 1; linear; viral cRNA; STD; VRL; 15171 BP. XX AC MG027862; XX PR Project:PRJNA227457; XX DT 03-JAN-2018 (Rel. 135, Created) DT 03-JAN-2018 (Rel. 135, Last updated, Version 6) XX DE Respiratory syncytial virus type A isolate RSV-A/US/BID-V8392/2003, DE complete genome. XX KW . XX OS Respiratory syncytial virus type A OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Pneumoviridae; Orthopneumovirus. XX RN [1] RP 1-15171 RA Newman R.M., Zody M.C., DeVincenzo J.P., Grad Y., Lipsitch M., Murphy R., RA Fitzgerald M., Young S., Gargeya S., Poon T.W., Charlebois P., Weiner B., RA Yang X., Piper M.E., McCowan C., Ireland A., Levin J., Malboeuf C., Qu J., RA Chapman S.B., Murphy C., Wortman J., Nusbaum C., Birren B.; RT "Comparative Genomics of Respiratory Syncytial Virus for Broad Institute RT Viral Genomics Initiative"; RL Unpublished. XX RN [2] RP 1-15171 RA Newman R.M., Zody M.C., DeVincenzo J.P., Grad Y., Lipsitch M., Murphy R., RA Fitzgerald M., Young S., Gargeya S., Poon T.W., Charlebois P., Weiner B., RA Yang X., Piper M.E., McCowan C., Ireland A., Levin J., Malboeuf C., Qu J., RA Chapman S.B., Murphy C., Wortman J., Nusbaum C., Birren B.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Infectious Disease Initiative, Broad Institute, 75 Ames Street, Cambridge, RL MA 02142, USA XX DR MD5; 9f301899fc366939094fdf9f89e39070. DR BioSample; SAMN06677492. XX CC ##Assembly-Data-START## CC Assembly Method :: Vicuna v. 1 CC Assembly Name :: V8392-1 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..15171 FT /organism="Respiratory syncytial virus type A" FT /host="Homo sapiens" FT /isolate="RSV-A/US/BID-V8392/2003" FT /mol_type="viral cRNA" FT /country="USA" FT /collection_date="2003" FT /db_xref="taxon:1439707" FT 5'UTR 1..65 FT /note="indels in UTR have not been validated" FT gene 66..485 FT /gene="NS1" FT CDS 66..485 FT /codon_start=1 FT /gene="NS1" FT /product="non-structural protein 1" FT /db_xref="InterPro:IPR005099" FT /db_xref="PDB:5VJ2" FT /db_xref="UniProtKB/TrEMBL:X5FN71" FT /protein_id="AUH26182.1" FT /translation="MGSNSLSMIKVRLQNLFDNDEVALLKITCYTDKLIHLTNALAKAV FT IHTIKLNGIVFVHVITSSDICPNNNIVVKSNFTTMPVLQNGGYIWEMMELTHCSQPNGL FT IDDNCEIKFSKKLSDSTMTNYMNQLSELLGFDLNP" FT gene 595..969 FT /gene="NS2" FT CDS 595..969 FT /codon_start=1 FT /gene="NS2" FT /product="non-structural protein 2" FT /db_xref="InterPro:IPR004336" FT /db_xref="UniProtKB/TrEMBL:X5FP42" FT /protein_id="AUH26183.1" FT /translation="MDTTHNGTTPQRLMITDMRPLSLETIITSLTRDIITHRFIYLINH FT ECIVRKLDERQATFTFLVNYEMKLLHKVGSTKYKKYTEYNTKYGTFPMPIFINHDGFLE FT CIGIKPTKHTPIIYKYDLNP" FT gene 1107..2282 FT /gene="N" FT CDS 1107..2282 FT /codon_start=1 FT /gene="N" FT /product="nucleoprotein" FT /db_xref="GOA:X5F650" FT /db_xref="InterPro:IPR004930" FT /db_xref="UniProtKB/TrEMBL:X5F650" FT /protein_id="AUH26184.1" FT /translation="MALSKVKLNDTLNKDQLLSSSKYTIQRSTGDSIDTPNYDVQKHIN FT KLCGMLLITEDANHKFTGVIGMLYAMSRLGREDTIKILRDAGYHVKANGVDVTTHRQDI FT NGKEMKFEVLTLASLTTEIQINIEIESRKSYKKMLKEMGEVAPEYRHDSPDCGMIILCI FT AALVITKLAAGDRSGLTAVIRRANNVLKNEMKRYKGLLPKDIANSFYEVFEKYPHFIDV FT FVHFGIAQSSTRGGSRVEGIFAGLFMNAYGAGQVMLRWGVLAKSVKNIMLGHASVQAEM FT EQVVEVYEYAQKLGGEAGFYHILNNPKASLLSLTQFPHFSSVVLGNAAGLGIMGEYRGT FT PRNQDLYDAAKAYAEQLKENGVINYSVLDLTAEELEAIKHQLNPKDNDVEL" FT gene 2314..3039 FT /gene="P" FT CDS 2314..3039 FT /codon_start=1 FT /gene="P" FT /product="phosphoprotein" FT /db_xref="GOA:X5EYQ0" FT /db_xref="InterPro:IPR003487" FT /db_xref="UniProtKB/TrEMBL:X5EYQ0" FT /protein_id="AUH26185.1" FT /translation="MEKFAPEFHGEDANNRATKFLESIKGKFTSPKDPKKKDSIISVNS FT IDIEVTKESPITSNSTIINPTNETDDTSGNKPNYQRKPLVSFKEDPTPSDNPFSKLYKE FT TIETFDNNEEESSYSYEEINDQTNDNITARLDRIDEKLSEILGMLHTLVVASAGPTSAR FT DGIRDAMVGLREEMIEKIRTEALMTNDRLEAMARLRNEESEKMAKDTSDEVSLNPTSEK FT LNNLLEGNDSDNDLSLEDF" FT gene 3223..3993 FT /gene="M" FT CDS 3223..3993 FT /codon_start=1 FT /gene="M" FT /product="matrix protein" FT /db_xref="GOA:X5F6K9" FT /db_xref="InterPro:IPR005056" FT /db_xref="UniProtKB/TrEMBL:X5F6K9" FT /protein_id="AUH26186.1" FT /translation="METYVNKLHEGSTYTAAVQYNVLEKDDDPASLTIWVPMFQSSMPA FT DLLIKELANVNILVKQISTPKGPSLRVMINSRSAVLAQMPSKFTICANVSLDERSKLAY FT DVTTPCEIKACSLTCLKSKNMLTTVKDLTMKTLNPTHDIIALCEFENIVTSKKVIIPTY FT LRSISVRNKDLNTLENITTTEFKNAITNAKIIPYSGLLLVITVTDNKGAFKYIKPQSQF FT IVDLGAYLEKESIYYVTTNWKHTATRFAIKPMED" FT gene 4264..4458 FT /gene="SH" FT CDS 4264..4458 FT /codon_start=1 FT /gene="SH" FT /product="small hydrophobic protein" FT /db_xref="GOA:A0A2H5CP71" FT /db_xref="InterPro:IPR005327" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP71" FT /protein_id="AUH26187.1" FT /translation="MENTSITIEFSSKFWPYFTLIHMITTIISLLIIISIMIAILNKLC FT EYNVFHNKTFEPPRARVNT" FT gene 4649..5545 FT /gene="G" FT CDS 4649..5545 FT /codon_start=1 FT /gene="G" FT /product="attachment protein" FT /db_xref="GOA:A0A2H5CP58" FT /db_xref="InterPro:IPR000925" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP58" FT /protein_id="AUH26188.1" FT /translation="MSKTKDQRTAKTLEKTWDTLNHLLFISSCLYKLNLKSIAQITLSI FT LAMIISTSLIIVAIIFIASANNKVTLTTAIIQDATSQIKNTTPTHLTQNPQLGISFFNL FT SGTISQTTAILAPTTPSVEPILQSTTVKTKNTTTTQIQPSKLTTKQRQNKPPNKPNDDF FT HFEVFNFVPCSICSNNPTCWAICKRIPSKKPGKKTTTKPTKKQTIKTTKKDLKPQTTKP FT KEAPTTKPTEKPTINITKPNIRTTLLTNSTTGNLEHTSQEETLHSTSSDGNTSPSQIYT FT TSEYLSQPPSPSNITDQ" FT gene 5622..7346 FT /gene="F" FT CDS 5622..7346 FT /codon_start=1 FT /gene="F" FT /product="fusion protein" FT /db_xref="GOA:A0A2H5CP76" FT /db_xref="InterPro:IPR000776" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP76" FT /protein_id="AUH26189.1" FT /translation="MDLPILKTNAITAILAAVLLCFASSQNITEEFYQSTCSAVSKGYL FT SALRTGWYTSVITIELSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQSTPAAN FT NRARRELPRFMNYTLNNTKNNNVTLSKKRKRRFLGFLLGVGSAIASGIAVSKVLHLEGE FT VNKIKSALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDKRLLPIVNKQSCSISNIETVIE FT FQQKNNRLLEITREFSVNAGVTTPVSTYMLTNSELLSLINDMPITNDQKKLMSNNVQIV FT RQQSYSIMSIIKEEVLAYVVQLPLYGVIDTPCWKLHTSPLCTTNTKEGSNICLTRTDRG FT WYCDNAGSVSFFPQAETCKVQSNRVFCDTMNSLTLPSEVNLCNIDIFNPKYDCKIMTSK FT TDVSSSVITSLGAIVSCYGKTKCTASNKNRGIIKTFSNGCDYVSNKGVDTVSVGNTLYY FT VNKQEGKSLYVKGEPIINFYDPLVFPSDEFDASISQVNEKINQSLAFIRKSDELLHNVN FT VGKSTTNIMITTIIIVIIVILLLLIAVGLFLYCKARSTPVTLSKDQLSGINNIAFSN" FT gene 7565..8149 FT /gene="M2" FT CDS 7565..8149 FT /codon_start=1 FT /gene="M2" FT /product="matrix protein 2-1" FT /db_xref="GOA:A0A2H5CP69" FT /db_xref="InterPro:IPR000571" FT /db_xref="InterPro:IPR009452" FT /db_xref="InterPro:IPR036855" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP69" FT /protein_id="AUH26190.1" FT /translation="MSRRNPCKFEIRGHCLNGKRCHFSHNYFEWPPHALLVRQNFMLNR FT ILKSMDKSIDTLSEISGAAELDRTEEYALGVVGVLESYIGSINNITKQSACVAMSKLLT FT ELNSDDIKKIRDNEEPNSPKIRVYNTVISYIESNRKNNKQTIHLLKRLPADVLKKTIKN FT TLDIHKSITINNPKESTVNDTNDHAKNNDTT" FT gene 8457..14954 FT /gene="L" FT CDS 8457..14954 FT /codon_start=1 FT /gene="L" FT /product="L polymerase" FT /db_xref="GOA:A0A2H5CP75" FT /db_xref="InterPro:IPR014023" FT /db_xref="InterPro:IPR016269" FT /db_xref="InterPro:IPR025786" FT /db_xref="InterPro:IPR026890" FT /db_xref="InterPro:IPR039736" FT /db_xref="UniProtKB/TrEMBL:A0A2H5CP75" FT /protein_id="AUH26191.1" FT /translation="MDPIINGNSANVYLTDSYLKGVISFSECNALGSYIFNGPYLKNDY FT TNLISRQNPLIEHINLKKLNITQSLISKYHKGEIKIEEPTYFQSLLMTYKSMTSSEQIA FT TTNLLKKIIRRAIEISDVKVYAILNKLGLKEKDKIKSNNEQDENNSVITTIIKDDILLA FT VKDNQSHLKAGKNHSTKQKDTIKTTLLKKLMCSMQHPPSWLIHWFNLYTKLNNILTQYR FT SNEVKNHGFILIDNHTLNGFQFILNQYGCIVYHKDLKRITVTTYNQFLTWKDISLSRLN FT VCLITWISNCLNTLNKSLGLRCGFNNVILTQLFLYGDCILKLFHNEGFYIIKEVEGFIM FT SLILNITEEDQFRKRFYNSMLNNITDAANKAQKNLLSRVCHTLLDKTVSDNIINGRWII FT LLSKFLKLIKLAGDNNLNNLSELYFLFRIFGHPMVDERQAMDAVKVNCNETKFYLLSSL FT SMLRGAFIYRIIKGFVNNYNRWPTLRNAIVLPLRWLTYYKLNTYPSLLELTERDLIVLS FT GLRFYREFRLPKKVDLEMIINDKAISPPKNLIWTSFPRNYMPSHIQNYIEHEKLKFSES FT DKSRRVLEYYLRDNKFNECDLYNCVVNQSYLNNPNHVVSLTGKERELSVGRMFAMQPGM FT FRQVQILAEKMIAENILQFFPESLTRYGDLELQKILELKAGISNKSNRYNDNYNNYISK FT CSIITDLSKFNQAFRYETSCICSDVLDELHGVQSLFSWLHLTIPHVTIICTYRHAPPYI FT RDHIVDLNNVDEQSGLYRYHMGGIEGWCQKLWTIEAISLLDLISLKGKFSITALINGDN FT QSIDISKPVRLMEGQTHAQADYLLALNSLKLLYKEYAGIGHKLKGTETYISRDMQFMSK FT TIQHNGVYYPASIKKVLRVGPWINTILDDFKVSLESIGSLTQELEYRGESLLCSLIFRN FT VWLYNQIALQLKNHALCNNKLYLDILKVLKHLKTFFNLDNIDTALTLYMNLPMLFGGGD FT PNLLYRSFYRRTPDFLTEAIVHSVFILSYYTNHDLKDKLQDLSDDRLNKFLTCIITFDK FT NPNAEFVTLMRDPQALGSERQAKITSEINRLAVTEVLSTAPNKIFSKSAQHYTTTEIDL FT NDIMQNIEPTYPHGLRVVYESLPFYKAEKIVNLISGTKSITNILEKTSAIDLTDIDRAT FT EMMRKNITLLIRIFPLDCNRDKREILSMENLSITELSKYVRERSWSLSNIVGVTSPSIM FT YTMDIKYTTSTIASGIIIEKYNVNSLTRGERGPTKPWVGSSTQEKKTMPVYNRQVLTKK FT QRDQIDLLAKLDWVYASIDNKDEFMEELSIGTLGLTYEKAKKLFPQYLSVNYLHRLTVS FT SRPCEFPASIPAYRTTNYHFDTSPINRILTEKYGDEDIDIVFQNCISFGLSLMSVVEQF FT TNVCPNRIILIPKLNEIHLMKPPIFTGDVDIHKLKQVIQKQHMFLPDKISLTQYVELFL FT SNKTLKSGSHVNSNLILAHKISDYFHNTYILSTNLAGHWILIIQLMKDSKGIFEKDWGE FT GYITDHMFINLKVFFNAYKTYLLCFHKGYGRAKLECDMNTSDLLCVLELIDSSYWKSMS FT KVFLEQKVINPILSQDASLHRVKGCHSFKLWFLKRLNVAEFTVCPWVVNIDYHPTHMKA FT ILTYIDLVRMGLINIDRIYIKNKHKFNDEFYTSNLFYINYNFSDNTHLLTKHIRIANSE FT LENNYNKLYHPTPETLENILTNPVKCDDKKTLNDYCIGKNVDSIMLPLLSNKKLIKSST FT TIRTNYSKQDLYNLFPTVVIDKIIDHSGNTAKSNQLYTTTSHQISLVHNSTSLYCMLPW FT HHINRFNFVFSSTGCKISIEYILKDLKIKDPSCIAFIGEGAGNLLLRTVVELHPDIRYI FT YRSLKDCNDHSLPIEFLRLYNGHINIDYGENLTIPATDATNNIHWSYLHIKFAEPISLF FT VCDAELPVTVNWSKIIIEWSKHVRKCKYCSSVNKCTLIVKYHAQDDIDFKLDNITILKT FT YVCLGSKLKGSEVYLVLTIGPANVFPVFNVVQNAKLILSRTKNFIMPKKADKESIDANI FT KSLIPFLCYPITKKGINTALSKLKSVVSGDILSYSIAGRNEVFSNKLINHKHMNILKWF FT NHVLNFRSTELNYNHLYMVESTYPYLSELLNSLTTNELKKLIKITGSLLYNFHNE" FT 3'UTR 14955..15171 FT /note="indels in UTR have not been validated" XX SQ Sequence 15171 BP; 5885 A; 2674 C; 2364 G; 4248 T; 0 other; atatttttat gggggcaaat aagaatttga taagtaccac ttaaatttaa ctcctttggt 60 tagagatggg cagcaattca ttaagtatga taaaagttag attacaaaat ttatttgaca 120 atgatgaagt agcattgtta aaaataacct gctatactga caaattgata catttaacta 180 atgctttggc taaggcagtg atacatacaa tcaaattgaa tggcattgta tttgtgcatg 240 ttattacaag tagtgatatt tgccctaata ataatattgt agtgaaatcc aacttcacaa 300 caatgccagt gttacaaaat ggaggttata tatgggaaat gatggaatta acacactgct 360 ctcaacccaa tggcctaata gatgacaatt gtgaaataaa attctccaaa aaactaagcg 420 attcaacaat gaccaactat atgaatcaat tatctgaatt acttggattt gatctcaatc 480 cataaattat aataaatatc aactagcaaa tcaatgtcaa taacaccatt agttaatata 540 aaacttgaca gaagataaaa atggggcaaa taaataaact cagctgaccc aaccatggac 600 acaacacaca atggtactac accacaaaga ctgatgatca cagacatgag accattgtca 660 cttgagacta taataacatc actaaccaga gacatcataa cacacagatt tatatacttg 720 ataaatcatg aatgtatagt gagaaaactt gatgaaagac aggccacatt tacattcctg 780 gtcaactatg aaatgaaact attgcacaaa gtgggaagca ctaaatacaa aaaatatact 840 gaatacaaca caaaatatgg cacttttcct atgccaatat ttatcaatca tgatgggttc 900 ttagaatgca ttggcattaa gcctacaaag cacactccca taatatacaa gtatgatctc 960 aatccatgaa tttcaacaca agattcacac aatctgaaat aacaacttca tgcataacta 1020 cactccatag tccaaatgga gcctgaaaat tatagtaatt taaaattaag gagagacata 1080 agatgaaaga tggggcaaat acaaaaatgg ctcttagcaa agtcaagttg aacgatacac 1140 tcaacaaaga tcaacttttg tcatccagca aatacaccat ccaacggagc acaggagata 1200 gtattgatac tcctaattat gatgtgcaga aacacatcaa caagttatgt ggcatgttat 1260 taatcacaga agatgctaat cataaattca ctggggtaat aggtatgtta tatgctatgt 1320 ctagattagg aagagaagac accataaaaa tactcagaga tgcgggatat catgtaaaag 1380 ctaatggagt ggatgtaaca acacatcgtc aagatattaa tggaaaagag atgaaatttg 1440 aagtgttaac attggcaagc ttaacaactg aaattcaaat caacattgag atagaatcta 1500 gaaaatccta caaaaaaatg ctaaaagaaa tgggagaggt ggctccagaa tacaggcatg 1560 actctcctga ttgtggaatg ataatattat gtatagcagc attagtaata accaaattag 1620 cagcagggga tagatctggt cttacagccg tgattaggag agctaataat gttctaaaaa 1680 atgaaatgaa acgttataaa ggcttactac caaaggatat agccaacagt ttctatgaag 1740 tgtttgaaaa atatcctcac tttatagatg tttttgttca ttttggtata gcacaatctt 1800 ctaccagagg tggcagtaga gttgaaggga tttttgcagg actgttcatg aatgcctatg 1860 gtgcagggca agtgatgtta cggtggggag tcttagcaaa atcagttaaa aatattatgc 1920 taggacacgc tagtgtgcaa gcagaaatgg aacaagttgt ggaagtttat gaatatgccc 1980 aaaaattggg tggagaagca ggattctacc atatattgaa taacccaaaa gcatcattat 2040 tatctttgac tcaatttccc cacttctcca gtgtagtatt aggcaatgct gctggcttgg 2100 gcataatggg agaatacaga ggtacaccaa ggaatcaaga tctatatgat gctgcaaagg 2160 catatgctga acaactcaaa gaaaatggtg tgattaacta cagtgtatta gacttgacag 2220 cagaagaact agaggctatc aaacatcagc ttaatccaaa agataatgat gtagagcttt 2280 gagttaataa aaaagtgggg caaataaatc atcatggaaa agtttgctcc tgaatttcat 2340 ggagaagacg caaacaacag agccactaaa ttcctagaat caataaaggg caaattcaca 2400 tcacctaaag atcccaagaa aaaagatagt atcatatctg tcaactcaat agatatagaa 2460 gtaaccaaag aaagccctat aacttcaaat tcaaccatta taaaccctac aaatgagaca 2520 gatgatactt cagggaacaa gcccaattat caaagaaaac ctctagtgag tttcaaagaa 2580 gaccctacgc caagtgataa tcccttttca aaactataca aagaaaccat agaaacattt 2640 gataacaatg aagaagaatc tagctattca tatgaagaaa taaatgatca gacaaatgat 2700 aatataacag caagattaga taggattgat gaaaaattaa gtgaaatact aggaatgctt 2760 cacacactag tagttgcaag tgcaggacct acatctgctc gggatggtat aagagatgcc 2820 atggttggtt taagagaaga aatgatagaa aaaatcagaa ctgaagcatt aatgaccaat 2880 gatagattag aagctatggc aaggctcagg aatgaggaaa gtgaaaagat ggcaaaagac 2940 acatcagatg aagtgtctct caatccaaca tcagagaaat tgaacaacct gttggaagga 3000 aatgatagtg acaatgatct atcacttgaa gatttctgat cagttaccaa tctgcacatc 3060 aacacacaac accaacagaa gaccaacaaa caaaacaact cacctatcca accaaacatc 3120 catctgccaa tcagccaacc agccaaaaaa acaaccagcc aatacaaaat tagtcacccg 3180 gaaaaaatcg atactatagt tacaaaaaaa gatggggcaa atatggaaac atacgtgaac 3240 aaacttcacg aaggctccac atacacagct gctgttcaat acaatgtcct agaaaaagac 3300 gatgatcctg catcacttac aatatgggtg cccatgttcc aatcatccat gccagcagat 3360 ttacttataa aagaactagc taatgtcaac atactagtga aacaaatatc cacacccaaa 3420 ggaccttcat taagagtcat gataaactcg agaagtgcag tgctagcaca aatgcccagc 3480 aaattcacta tatgtgccaa tgtgtccttg gatgaaagaa gcaagctggc atatgatgta 3540 accacaccct gcgaaatcaa ggcatgtagt ctaacatgcc taaaatcaaa aaatatgtta 3600 actacagtta aagatctcac tatgaaaaca ctcaacccaa cacatgacat cattgcttta 3660 tgtgaatttg aaaatatagt aacatctaaa aaagtcataa taccaacata cttaagatcc 3720 atcagtgtca gaaataaaga tctgaacaca cttgaaaata taacaaccac cgaattcaaa 3780 aatgccatca caaatgcaaa aatcatccct tactcaggat tactgttagt catcacagtg 3840 actgacaaca aaggagcatt caaatacata aagccacaaa gtcaattcat agtagatctt 3900 ggagcttacc tagaaaaaga aagtatatat tatgttacaa caaattggaa gcacacagct 3960 acacgatttg caatcaaacc catggaagat taaccttttt cttctacatc agttagttga 4020 ttcatacaca ctttctacct acattcttca cttcacaatc ataatcacca accctctgtg 4080 gtttaaccaa tcaaacaaaa cttatctgga gtctcagatc atcccaagtc attgttcatc 4140 agatctagta ctcaaataag ttaataaaaa tacccacatg gggcaaataa tcatcggagg 4200 aaatccaacc aatcacaata tctgtcaaca tagaccagtc aacacgccaa acaaaataaa 4260 ccaatggaaa atacatccat aacaatagaa ttctcaagca aattctggcc ttactttaca 4320 ctaatacata tgatcacaac aataatctct ttgctaatca taatctccat catgattgca 4380 atactaaaca aactctgtga atataacgta ttccataaca aaacctttga gccaccaaga 4440 gctcgagtca atacatagca ttcaccaatc tgatggctca aaacagtaac cttgcatttg 4500 taagtgaaca atcttcacct ttttacaaaa tcacatcaac atctcaccat gcaagccatc 4560 atccatacta taaagtagtt aattaaaaat agtcataaca atgaactaag atattaagac 4620 taacaacaac gttggggcaa atgcaaacat gtccaaaacc aaggaccaac gcaccgccaa 4680 gacactagaa aagacctggg acactctcaa tcatctatta ttcatatcat cgtgcttata 4740 caagttaaat cttaaatcta tagcacaaat cacattatcc attctggcaa tgataatctc 4800 aacttcactt ataattgtag ctatcatatt catagcctca gcaaacaaca aagtcacact 4860 aacaactgca atcatacaag atgcaacaag ccagatcaag aacacaaccc caacacacct 4920 gacccagaat ccccagcttg gaatcagctt cttcaatctg tctggaacta tatcacaaac 4980 caccgccata ctagctccaa caacaccaag tgtcgagcca atcctgcaat ctacaacagt 5040 caagaccaaa aacacaacaa caacccaaat acaacccagc aagctcacca caaaacaacg 5100 ccaaaacaaa ccaccaaaca aacccaatga tgattttcac tttgaagtgt tcaactttgt 5160 accctgcagc atatgcagca acaatccaac ttgctgggcc atctgcaaaa gaataccaag 5220 caaaaaacct ggaaagaaaa ccaccaccaa gcccacgaaa aaacaaacca tcaagacaac 5280 caaaaaagat ctcaaacctc aaactacaaa accaaaggaa gcacctacca ccaagcccac 5340 agaaaaacca accatcaaca tcaccaaacc aaacatcaga accacactgc tcaccaacag 5400 taccacagga aatctagaac acacaagtca agaggaaacc ctccattcaa cctcctccga 5460 tggcaataca agcccttcac aaatctatac aacatccgag tacctatcac aacctccatc 5520 tccatccaac ataacagacc agtagtcatt aaaaagcgta ttattgcaaa aaaccatgac 5580 caaatcaaac agaatcaaaa taagctctgg ggcaaataac aatggatttg ccaatcctca 5640 aaacaaatgc aattaccgca atccttgctg cagtcttact ctgtttcgct tccagtcaaa 5700 acatcactga agaattttat caatcaacat gcagtgcagt tagcaaaggc tatcttagtg 5760 ctttaagaac tggttggtat actagtgtta taactataga attaagtaat atcaaggaaa 5820 ataagtgtaa tggaacagac gctaaggtaa aattgataaa acaagaatta gataaatata 5880 aaaatgctgt aacagaattg cagttgctca tgcaaagcac accagcagcc aacaatcgag 5940 ccagaagaga actaccaagg tttatgaatt atacactcaa caataccaaa aataacaatg 6000 taacattaag caagaaaagg aaaagaagat ttcttggctt tttgttaggt gttggatctg 6060 caatcgccag tggcattgct gtgtctaaag tcctgcacct agaaggggaa gtgaacaaaa 6120 tcaaaagtgc tctactatcc acaaacaagg ctgtagtcag cttatcaaat ggagttagtg 6180 tcttaaccag caaagtgtta gacctcaaaa actatataga taaacggttg ttacccattg 6240 tgaacaagca aagctgcagc atatcaaaca ttgaaactgt gatagaattc caacaaaaga 6300 acaacagatt actagagatt accagggaat ttagtgttaa tgcaggtgta actacacctg 6360 taagcactta tatgttaaca aatagtgaat tattatcatt aatcaatgat atgcctataa 6420 caaatgatca gaaaaagtta atgtccaaca atgttcaaat agttagacag caaagttact 6480 ctatcatgtc cataataaag gaggaagtct tagcatatgt agtacaatta ccactatatg 6540 gtgtaataga tacaccttgt tggaaactac acacatcccc tctatgcaca accaacacaa 6600 aggaagggtc caacatctgt ttaacaagaa ccgacagagg atggtactgt gacaatgcag 6660 gatcagtttc tttcttccca caagctgaaa catgcaaagt tcaatcgaat cgagtatttt 6720 gtgacacaat gaacagttta acattaccaa gtgaagtaaa tctctgcaat attgacatat 6780 tcaaccctaa atatgattgc aaaattatga cttcaaaaac agatgtaagc agctccgtta 6840 tcacatctct aggagccatt gtgtcatgct atggcaaaac taaatgtaca gcatcaaata 6900 aaaatcgtgg aatcataaag acattttcta acgggtgtga ttatgtatct aataaggggg 6960 tggacactgt atctgtaggt aatacattat attatgtaaa taagcaagaa ggaaaaagtc 7020 tctatgtaaa aggtgaacca ataataaatt tctatgaccc attagtgttc ccttctgatg 7080 aatttgatgc atcaatatct caagtcaatg agaagattaa ccagagccta gcatttattc 7140 gtaaatccga tgaattatta cataatgtaa atgttggtaa atccaccaca aatatcatga 7200 taactactat aattatagtg attatagtaa tattgttatt attaattgca gttgggctgt 7260 tcctatactg caaagccaga agcacaccag tcacactaag caaggatcaa ctgagtggta 7320 taaataatat tgcatttagt aactgaataa aaatagtacc taatcatgtt cttacaatgg 7380 ttcatcatcc gaccatagat gacccatcta tcattggatt ttcttaaagt ctgaacttca 7440 tcgcaactct catctataaa ccatctcact tacactattt aagtagattc ctattttata 7500 gttatataaa actgctgagt accagattaa ctcactattt gtaaaaatta gaaatggggc 7560 aaatatgtca cgaaggaatc cttgcaaatt tgaaattcga ggtcattgct tgaatggtaa 7620 gaggtgtcat tttagtcata attattttga atggccacct catgcactgc ttgtaagaca 7680 aaactttatg ttaaacagaa tacttaagtc tatggataaa agcatagata ctttatcaga 7740 aataagtgga gctgcagagt tggacagaac tgaagagtat gccctcggtg tagttggagt 7800 gctagagagt tatataggat caataaataa tataactaaa caatcagcat gtgttgccat 7860 gagcaaactc ctcactgaac tcaacagtga tgacatcaaa aaaataagag acaatgaaga 7920 gccaaattca cctaagataa gagtgtacaa tactgtcata tcatatattg aaagcaacag 7980 gaaaaacaat aaacaaacta tccatctgtt aaaaagattg ccagcagacg tattgaagaa 8040 aaccatcaaa aacacattgg atatccacaa gagcataacc atcaacaacc caaaagaatc 8100 aactgttaat gatacaaacg accatgccaa aaataatgat actacctgac aaatatcctt 8160 gtagtataaa ttccatacta ataacaagtg gttgtagagt tactatgtat aatcaaaaga 8220 acacactata tttcaatcaa aacaaccaaa ataaccatac atactcatca aatcaaccat 8280 tcaatgaaat ccattggacc tctcaagact tgattgatgc aattcaaaat tttctacaac 8340 atctaggtat tactgatgat atatatacaa tatatatatt agtgtcataa cactcaatac 8400 caatacttac cacatcatca aactattaac tcaaacaatt caaaccatgg gacaaaatgg 8460 atcccattat taatggaaat tctgctaatg tttatctaac cgatagttat ttaaaaggtg 8520 ttatttcttt ctcagaatgt aatgctttag gaagttacat attcaatggt ccttatctca 8580 aaaatgatta caccaactta attagtagac aaaatccatt aatagaacat ataaatctaa 8640 agaaattaaa tataacacag tctttaatat ctaagtatca taaaggtgaa ataaaaatag 8700 aagaacctac ttattttcag tcattactca tgacatacaa gagtatgacc tcgtcagaac 8760 aaattgctac cactaattta cttaaaaaga taataagaag agctatagaa attagtgatg 8820 tcaaagtcta tgctatattg aataaactgg ggcttaaaga aaaagacaag attaaatcca 8880 acaatgaaca agatgaaaac aactcagtta ttacaaccat aatcaaagat gatatacttt 8940 tagctgttaa ggataatcaa tctcatctta aagcaggcaa aaatcactct acaaaacaaa 9000 aagatactat caaaacaaca ctcttgaaaa aattaatgtg ttcgatgcaa catcctccat 9060 catggttaat acattggttt aatttataca caaaattaaa caacatatta acacagtatc 9120 gatcaaatga ggtaaaaaac catggtttta tattgataga taatcatact ctcaatggat 9180 tccaatttat tttgaatcaa tatggttgta tagtttatca taaggatctc aaaagaatta 9240 ctgtgacaac ctataatcaa ttcttgacat ggaaagatat tagccttagt agattaaatg 9300 tttgtttaat tacatggatt agtaactgtt tgaacacatt aaacaaaagc ttaggcttaa 9360 gatgtggatt caataatgtt atcttgacac aactattcct ttatggagat tgtatattaa 9420 aactattcca caatgaaggg ttctacataa taaaagaggt agagggtttt attatgtctc 9480 taattttaaa cataacagaa gaagatcaat tcagaaaacg gttttataat agtatgctca 9540 acaacatcac agatgctgct aataaagctc agaaaaatct gctatcaaga gtatgtcata 9600 cattattaga taagacagta tccgataata taataaatgg cagatggata attctattaa 9660 gtaagtttct taaattaatt aagcttgcag gtgacaataa ccttaacaat ctgagtgaat 9720 tatatttttt attcagaata tttggacacc caatggtaga tgaaagacaa gccatggatg 9780 ctgttaaagt taattgcaac gagaccaaat tttacttgtt aagcagtttg agtatgttaa 9840 gaggtgcctt tatatataga attataaaag ggtttgtaaa taattacaac agatggccta 9900 ctttaaggaa tgctattgtt ttacccttaa gatggttaac ttactataaa ctaaacactt 9960 atccttcctt attggaactt acagaaagag atttgattgt tttatcagga ctacgtttct 10020 atcgtgagtt tcggttgcct aaaaaagtgg atcttgaaat gatcataaat gataaggcta 10080 tatcacctcc taaaaatttg atatggacta gtttccctag aaattatatg ccgtcacaca 10140 tacaaaatta tatagaacat gaaaaattaa aattttccga aagtgataaa tcgagaagag 10200 tattagagta ctatttaaga gataacaaat tcaatgaatg tgatttatat aactgtgtag 10260 ttaatcaaag ctatcttaac aaccctaatc atgtggtatc attgactggc aaagaaagag 10320 aactcagtgt aggtagaatg tttgcaatgc aaccaggaat gttcagacaa gttcaaatat 10380 tagcagagaa aatgatagcc gaaaacattt tacaattctt tcctgaaagt cttacaagat 10440 atggtgatct agaattacag aaaatattag aattgaaagc gggaataagt aacaaatcaa 10500 atcgttacaa tgacaattac aacaattaca tcagtaagtg ctctatcatc acagatctca 10560 gcaaattcaa tcaagcattc cggtatgaaa catcatgtat ttgtagtgat gtactggatg 10620 aactgcatgg tgtacaatct ctattttcct ggttacattt aactattcct catgtcacaa 10680 taatatgcac atataggcat gctcccccct atataagaga tcacattgta gatcttaaca 10740 atgtagatga acaaagtgga ttatatagat atcatatggg tggtatcgaa gggtggtgtc 10800 aaaaactatg gaccatagaa gctatatcac tattggatct aatatctctc aaagggaaat 10860 tctcaattac tgccttaatt aatggtgaca atcaatcaat agatataagc aaaccagtca 10920 gactcatgga aggtcaaact catgctcaag cagattattt gctagcatta aatagtctta 10980 aattgctgta taaagagtat gcaggcatag gccacaaatt aaaaggaact gagacttata 11040 tatcaagaga tatgcaattt atgagtaaaa caattcaaca taacggtgta tattacccag 11100 ctagtataaa gaaagtccta agagtgggac catggataaa cactatactt gatgatttca 11160 aagtgagtct agaatctata ggtagtttga cacaagaatt agaatataga ggtgaaagtc 11220 tattatgcag tttaatattt agaaatgtgt ggttatataa tcaaattgct ttacaactaa 11280 aaaatcatgc attatgtaac aataaattat atttggacat attaaaggtt ctgaaacact 11340 taaaaacctt ttttaatctt gataatattg atacagcatt aacattgtat atgaatttgc 11400 ccatgttatt tggtggtggt gatcccaact tgttatatcg aagtttctat agaagaactc 11460 ctgatttcct cacagaggct atagttcact ctgtgttcat acttagttat tatacaaacc 11520 atgatttaaa ggataaactt caagatctgt cagacgatag attgaataag ttcttaacat 11580 gcataatcac gtttgacaaa aaccctaatg ctgaattcgt aacattgatg agagatcctc 11640 aagctttagg gtctgagagg caagctaaaa ttactagcga aatcaataga ctggcagtta 11700 ctgaggtttt gagcacagct ccaaacaaaa tattctccaa aagtgcacaa cactatacca 11760 ctacagagat agatctaaat gatattatgc aaaatataga acctacatat cctcatgggc 11820 taagagttgt ttatgaaagt ttaccctttt ataaagcaga gaaaatagta aatcttatat 11880 ccggtacaaa atctataact aacatactgg aaaagacttc tgccatagac ttaacagata 11940 ttgatagagc cactgagatg atgaggaaaa acataacttt gcttataagg atatttccat 12000 tagattgtaa cagagataaa agagaaatat tgagtatgga aaacctaagt attactgaat 12060 taagcaaata tgttagagaa agatcttggt ctttatccaa tatagttggt gttacatcac 12120 ccagtatcat gtatacaatg gacatcaaat atacaacaag cactatagct agtggcataa 12180 tcatagagaa atataatgtc aacagtttaa cacgtggtga gagaggaccc actaaaccat 12240 gggttggttc atctacacaa gagaaaaaaa caatgccagt ttataacaga caagttttaa 12300 ccaaaaaaca gagagatcaa attgatctat tagcaaaatt ggattgggtg tatgcatcta 12360 tagataacaa ggatgaattc atggaagaac tcagcatagg aactcttggg ttaacatatg 12420 agaaagccaa aaaattattt ccacaatatt taagtgttaa ctatttgcat cgccttacag 12480 tcagtagcag accatgtgaa ttccctgcat caataccagc ttatagaact acaaattatc 12540 actttgatac tagccctatt aatcgcatat taacagaaaa gtatggtgat gaagatattg 12600 atatagtatt ccaaaactgt ataagttttg gccttagctt aatgtcagta gtagagcaat 12660 ttaccaatgt atgtcctaac agaattattc tcatacccaa gcttaatgag atacatttga 12720 tgaaacctcc catattcaca ggtgatgttg atattcacaa gttaaaacaa gttatccaaa 12780 aacagcatat gtttttacca gacaaaataa gtttgactca atatgtggaa ttattcttaa 12840 gtaataaaac actcaaatct ggatctcatg ttaattctaa tttaatattg gcacataaga 12900 tatctgacta ttttcataat acttacattt taagtactaa tttagctgga cattggattc 12960 tgattataca acttatgaaa gattctaaag gtatttttga aaaagattgg ggagagggat 13020 acataactga tcatatgttc attaatttga aagttttctt caatgcttat aagacctatc 13080 tcttgtgttt tcataaaggt tacggcagag caaagctgga gtgtgatatg aatacttcag 13140 atctcctatg tgtattggaa ttaatagaca gtagttattg gaagtctatg tctaaggtat 13200 ttttagaaca aaaagttatc aatcccattc tcagccagga tgcaagttta catagagtaa 13260 aaggatgtca tagcttcaaa ctatggtttc ttaaacgtct taatgtagca gaattcacag 13320 tttgcccttg ggttgttaac atagattatc atccaacaca catgaaagca atattaactt 13380 atatagatct tgttagaatg ggattgataa atatagatag aatatacatt aaaaataaac 13440 acaaattcaa tgatgaattt tatacttcta atctctttta cattaattat aacttctcag 13500 ataatactca tctattaact aaacatataa ggattgctaa ttctgaatta gaaaataatt 13560 acaacaaatt atatcatcct acacctgaaa ctctagaaaa tatactaacc aatccggtta 13620 aatgtgatga caaaaagaca ctgaatgact attgtatagg taaaaatgtt gactcaataa 13680 tgttaccatt gttatctaat aagaagctta ttaaatcgtc tacaacgatt agaaccaatt 13740 acagcaaaca agatttgtat aatttatttc ctacggttgt gattgataaa attatagatc 13800 attcaggtaa tacagccaaa tctaaccaac tttacactac tacttctcat caaatatctt 13860 tagtacacaa tagcacatca ctttattgca tgcttccttg gcatcatatt aatagattca 13920 attttgtgtt tagttctaca ggttgtaaaa ttagtataga gtatatttta aaagacctta 13980 aaattaaaga tcctagttgt atagcattca taggtgaagg agcagggaat ttattgttgc 14040 gtacagtagt ggaacttcat cctgatataa gatatattta cagaagtctg aaagattgca 14100 atgatcatag tttacctatt gagtttttaa ggctatacaa tggacatatc aacattgatt 14160 atggtgaaaa tttgaccatt cccgctacag atgcaaccaa caacattcat tggtcttatt 14220 tgcatataaa gtttgctgaa cctatcagtc tttttgtttg tgatgctgaa ttgcctgtta 14280 cagtcaactg gagtaaaatt ataatagagt ggagcaagca tgtaagaaaa tgcaagtact 14340 gttcctcagt taataaatgt acgttaatag taaaatacca tgctcaagat gatatcgatt 14400 tcaaattaga caacataact atattaaaaa cttatgtatg cttaggcagt aagttaaagg 14460 ggtctgaagt ttacttagtc cttacaatag gtcctgcaaa tgtgttccca gtatttaatg 14520 tagtacaaaa tgctaaattg atactatcaa gaaccaaaaa tttcatcatg cctaagaagg 14580 ctgataaaga gtctattgat gcaaatatta agagtttgat accctttctt tgttacccta 14640 taacaaaaaa aggaattaat actgcattat caaaactaaa gagtgttgtt agtggagata 14700 tactatcata ttctatagct ggacgtaatg aagttttcag caataaactt ataaatcata 14760 agcatatgaa catcttaaag tggttcaacc atgttttaaa tttcagatca acagaactta 14820 actataatca tttatatatg gtagaatcca catatcctta tctaagtgaa ttgttaaaca 14880 gcttgacaac taatgaactt aaaaaactga ttaaaatcac aggtagtttg ttatacaact 14940 ttcataatga ataatgaata aaaatcttat attaaaaaat tcccacagct acacactaac 15000 actgtattca attatagtta tttaaaatta aaaattatat aatttttaat aacttttagt 15060 ggactaatcc taaaattatc attttgatct aggaggaata aatttaaatc caaatctaat 15120 tggtttatat gtatattaac taaactacct gtgattttaa tcagtttttt a 15171 // ID MG027894; SV 1; linear; genomic RNA; STD; VRL; 315 BP. XX AC MG027894; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Norovirus GII isolate Hu/GII.2/Taoyuan/25-16/2017/TW nonstructural protein DE and major structural protein genes, partial cds. XX KW . XX OS Norovirus GII OC Viruses; Riboviria; Caliciviridae; Norovirus. XX RN [1] RP 1-315 RA Wu F.-T., Kuo T.-Y.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Centers for Disease Control, Center for Research & Diagnostic, No. 161, RL Kunyang St., Nangang Dist., Taipei, Taiwan 11561, (R.O.C.) XX DR MD5; 7c05cf2ccddb5b0bcd48ed973e35aa76. XX FH Key Location/Qualifiers FH FT source 1..315 FT /organism="Norovirus GII" FT /host="Homo sapiens; 20-year old male with diarrhea" FT /isolate="Hu/GII.2/Taoyuan/25-16/2017/TW" FT /mol_type="genomic RNA" FT /country="Taiwan" FT /isolation_source="stool" FT /collection_date="Sep-2017" FT /db_xref="taxon:122929" FT CDS <1..48 FT /codon_start=1 FT /product="nonstructural protein" FT /note="ORF1; RNA-dependent RNA polymerase" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CQM4" FT /protein_id="ATN44837.1" FT /translation="DRNLAPNFVNEDGVE" FT CDS 29..>315 FT /codon_start=1 FT /product="major structural protein" FT /note="VP1" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CRS3" FT /protein_id="ATN44838.1" FT /translation="MKMASNDAAPSTDGAAGLVPESNNEVMALEPVAGAALAAPVTGQT FT NIIDPWIRANFVQAPNGEFTVSPRNAPGEVLLNLELGPELNPYLAHLARM" XX SQ Sequence 315 BP; 79 A; 80 C; 77 G; 79 T; 0 other; gatcgcaatc tggctcccaa ttttgtgaat gaagatggcg tcgaatgacg ccgctccatc 60 tactgatggt gcagccggcc tcgtgccaga aagtaacaat gaggtcatgg ctcttgaacc 120 cgtggctggt gccgccttgg cagccccggt caccggtcaa acaaatatta tagacccttg 180 gattagagca aattttgtcc aggcccccaa tggtgaattt acagtctctc cccgaaatgc 240 ccctggtgaa gtgctactga atctagagtt gggtccagaa ttaaatcctt atctggcaca 300 tttagcaaga atgta 315 // ID MG027898; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027898; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/01/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/01/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; 2a374f166baf8801e7fc28c207ee35cf. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/01/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/01/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="05-Nov-2015" FT /db_xref="taxon:2042262" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CQI4" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CQI4" FT /protein_id="ATN44679.1" FT /translation="MNPNQKIITIGSICMTIGMANLILQIGNIISIWVSHSIQIGNQSQ FT IETCHQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSQDN FT SVRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDSGAVAVLKYNGIITDTIKSWRNN FT ILRTQESECACVNGSCFTIMTDGPSDGQASYKIFRIEKGKIIKSVEMKAPNYHYEECSC FT YPDSSEITCVCRDNWHGSNRPWVSFNQNLEYQMGYICRGVFGDNPRPNDKTGSCGPVSS FT NGANGVKGFSFKYGNGVWIGRTKSISSRNGFEMIWDPNGWTGTDNKFSIKQDIVGINEW FT SGYSGSFVQHPELTGLDCIRPCFWVELIRGRPEENTIWTSGSSISFCGVNSDTVGWSWP FT DGAELPFTIDK" XX SQ Sequence 1410 BP; 444 A; 263 C; 336 G; 367 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcgatct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg aaacataatc tcaatatggg ttagccactc aattcaaatt 120 ggaaatcaaa gccagattga aacatgccat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcagtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtcaagaca acagtgtaag aatcggttcc aagggggatg tgtttgtcat aagggaacca 360 ttcatatcat gctctccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 540 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg cccagacagt 600 ggggcagtgg ctgtgttaaa gtacaatggc ataataacag acactatcaa gagttggagg 660 aacaatatat tgagaacaca agagtctgaa tgtgcatgtg taaatggttc ttgctttacc 720 ataatgaccg atggaccaag tgatggacag gcctcataca aaatcttcag gatagaaaag 780 ggaaagataa tcaaatcagt cgaaatgaaa gcccctaatt atcactatga ggaatgctcc 840 tgttaccctg attctagtga aatcacatgt gtgtgcaggg ataactggca tggctcgaat 900 cgaccgtggg tgtctttcaa ccagaatctg gaatatcaga tgggatacat atgccgtggg 960 gttttcggag acaatccacg ccctaatgat aagacaggca gttgtggtcc agtatcgtct 1020 aatggagcaa atggagtaaa aggattttca ttcaaatacg gcaatggtgt ttggataggg 1080 agaactaaaa gcattagttc aagaaacggt tttgagatga tttgggatcc gaatggatgg 1140 actgggactg acaataaatt ctcaataaag caagatatcg taggaataaa tgagtggtca 1200 gggtatagcg ggagttttgt tcagcatcca gaactaacag ggctggattg tataagacct 1260 tgcttctggg ttgaactaat aagagggcga cccgaagaga acacaatctg gactagcggg 1320 agcagcatat ccttttgtgg tgtaaacagt gacactgtgg gttggtcttg gccagacggt 1380 gctgagttgc catttaccat tgacaagtag 1410 // ID MG027899; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027899; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/05/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/05/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; c78c587d31ee3818fe9527feeb361926. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/05/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/05/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="04-Feb-2015" FT /db_xref="taxon:2042265" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CQI5" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CQI5" FT /protein_id="ATN44680.1" FT /translation="MNPNQKIITIGSVCMTIGMANLILQIGNIISIWVSHSIQIGNQSQ FT IETCNQSVITYENNTWVNQTYINISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSKDN FT SVRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDNGAVAVLKYNGIITDTIKSWRNN FT ILRTQESECACVNGSCFTIMTDGPSDGQASYKIFRIEKGKIIKSVEMKAPNYHYEECSC FT YPDSSEITCVCRDNWHGSNRPWVSFNQNLEYQIGYICSGVFGDNPRPNDKTGSCGPVSS FT NGANGVKGFSFKYGNGVWIGRTKSISSRKGFEMIWDPNGWTGTDNKFSIKQDIVGINEW FT SGYSGSFVQHPELTGLDCIRPCFWVELIRGRPEENTIWTSGSSISFCGVNSDTVGWSWP FT DGAELPFTIDK" XX SQ Sequence 1410 BP; 452 A; 261 C; 332 G; 365 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcggtct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg gaacataatc tcaatatggg ttagccactc aattcaaatt 120 ggaaatcaaa gtcagattga aacatgcaat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatat taacatcagc aacaccaact ttgctgctgg acagtcagtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtaaagaca acagtgtaag aatcggttcc aagggggatg tgtttgtcat aagggaacca 360 ttcatatcat gctccccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 540 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg cccagacaat 600 ggggcagtgg ctgtgttaaa gtacaacggc ataataacag acactatcaa gagttggaga 660 aacaatatat tgagaacaca agagtctgaa tgtgcatgtg taaatggttc ttgctttacc 720 ataatgaccg atggaccaag tgatggacag gcctcataca agatcttcag aatagaaaag 780 ggaaagataa tcaaatcagt cgaaatgaaa gcccctaatt atcactatga ggaatgctcc 840 tgttaccctg attctagtga aatcacatgt gtgtgcaggg ataactggca tggctcgaat 900 cgaccgtggg tgtctttcaa ccagaatctg gaatatcaga taggatacat atgcagtggg 960 gttttcggag acaatccacg ccctaatgat aagacaggca gttgtggtcc agtatcgtct 1020 aatggagcaa atggagtaaa aggattttca ttcaaatacg gcaatggagt ttggataggg 1080 agaactaaaa gcattagttc aagaaaaggt tttgagatga tttgggatcc gaatggatgg 1140 actgggactg acaataaatt ctcaataaag caagatatcg taggaataaa tgagtggtca 1200 ggatatagcg ggagttttgt tcagcatcca gaactaacag ggctggattg tataagacct 1260 tgcttctggg ttgaactaat cagagggcga cccgaagaga acacaatctg gactagcggg 1320 agcagcatat ccttttgtgg tgtaaacagt gacactgtgg gttggtcttg gccagacggt 1380 gctgagttgc catttaccat tgacaagtaa 1410 // ID MG027900; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027900; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/06/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/06/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; d9e0cf71e3b095b594becdcb5e7ef484. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/06/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/06/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="05-Nov-2015" FT /db_xref="taxon:2042266" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CQI2" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CQI2" FT /protein_id="ATN44681.1" FT /translation="MNPNQKIITIGSVCMTIGMANLILQIGNIISIWVSHSIQIGNQSQ FT IETCNQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSKDN FT SVRIGSRGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDSGAVAVLKYNGIITDTIKSWRNN FT ILRTQESECACVNGSCFTIMTDGPSDGQASYKIFRIEKGKIIKSVEMKAPNYHYEECSC FT YPDSSEITCVCRDNWHGSNRPWVSFNQNLEYQMGYICSGVFGDNPRPNDKTGSCGPVSS FT NGANGVKGFSFKYGNGVWIGRTKSISSRKGFEMIWDPNGWTGTDNKFSIKQDIVGINEW FT SGYSGSFVQHPELTGLDCIRPCFWVELIRGRPEENTIWTSGSSISFCGVNSDTVGWSWP FT DGAELPFTIDK" XX SQ Sequence 1410 BP; 448 A; 257 C; 336 G; 369 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcggtct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg aaacataatc tcaatatggg ttagccactc aattcaaatt 120 ggaaatcaaa gccagattga aacatgcaat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcagtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtaaagaca acagtgtaag aatcggttcc aggggggatg tgtttgtcat aagggaacca 360 ttcatatcat gctctccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatat aactcaagat ttgagtcagt tgcttggtca 540 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg cccagacagt 600 ggggcagtgg ctgtgttaaa gtacaatggc ataataacag acactatcaa gagttggagg 660 aacaatatat tgagaacaca agagtctgaa tgtgcatgtg taaatggttc ttgctttacc 720 ataatgaccg atggaccaag tgatggacag gcctcataca aaatcttcag aatagaaaag 780 ggaaagataa tcaaatcagt cgaaatgaaa gcccctaatt atcactatga ggaatgctcc 840 tgttaccctg attctagtga aatcacatgt gtgtgcaggg ataactggca tggctcgaat 900 cgaccgtggg tgtctttcaa ccagaatctg gaatatcaga tgggatacat atgcagtggg 960 gttttcggag acaatccacg ccctaatgat aagacaggca gttgtggtcc agtatcgtct 1020 aatggagcaa atggagtaaa aggattttca ttcaaatacg gcaatggtgt ttggataggg 1080 agaactaaaa gcattagttc aagaaaaggt tttgagatga tttgggatcc gaatggatgg 1140 actgggactg acaataaatt ctcaataaag caagatatcg taggaataaa tgagtggtca 1200 gggtatagcg ggagttttgt tcagcatcca gaactaacag ggctggattg tataagacct 1260 tgcttctggg ttgaactaat aagagggcga cccgaagaga acacaatctg gactagcggg 1320 agcagcatat ccttttgtgg tgtaaacagt gacactgtgg gttggtcttg gccagacggt 1380 gctgagttgc catttaccat tgacaagtaa 1410 // ID MG027901; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027901; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/018/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/018/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; f0cca67a18f00c0e48762994167b0dd0. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/018/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/018/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="05-Nov-2015" FT /db_xref="taxon:2042263" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CQI7" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CQI7" FT /protein_id="ATN44682.1" FT /translation="MNPNQKIITIGSICMTIGMANLILQIGNIISIWVSHSIQIGNQSQ FT IETCHQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSQDN FT SVRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDSGAVAVLKYNGIITDTIKSWRNN FT ILRTQESECACVNGSCFTIMTDGPSDGQASYKIFRIEKGKIIKSVEMKAPNYHYEECSC FT YPDSSEITCVCRDNWHGSNRPWVSFNQNLEYQMGYICRGVFGDNPRPNDKTGSCGPVSS FT NGANGVKGFSFKYGNGVWIGRTKSISSRNGFEMIWDPNGWTGTDNKFSIKQDIVGINEW FT SGYSGSFVQHPELTGLDCIRPCFWVELIRGRPEENTIWTSGSSISFCGVNSDTVGWSWP FT DGAELPFTIDK" XX SQ Sequence 1410 BP; 445 A; 263 C; 335 G; 367 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcgatct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg aaacataatc tcaatatggg ttagccactc aattcaaatt 120 ggaaatcaaa gccagattga aacatgccat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcagtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtcaagaca acagtgtaag aatcggttcc aagggggatg tgtttgtcat aagggaacca 360 ttcatatcat gctctccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 540 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg cccagacagt 600 ggggcagtgg ctgtgttaaa gtacaatggc ataataacag acactatcaa gagttggagg 660 aacaatatat tgagaacaca agagtctgaa tgtgcatgtg taaatggttc ttgctttacc 720 ataatgaccg atggaccaag tgatggacag gcctcataca aaatcttcag gatagaaaag 780 ggaaagataa tcaaatcagt cgaaatgaaa gcccctaatt atcactatga ggaatgctcc 840 tgttaccctg attctagtga aatcacatgt gtgtgcaggg ataactggca tggctcgaat 900 cgaccgtggg tgtctttcaa ccagaatctg gaatatcaga tgggatacat atgccgtggg 960 gttttcggag acaatccacg ccctaatgat aagacaggca gttgtggtcc agtatcgtct 1020 aatggagcaa atggagtaaa aggattttca ttcaaatacg gcaatggtgt ttggataggg 1080 agaactaaaa gcattagttc aagaaacggt tttgagatga tttgggatcc gaatggatgg 1140 actgggactg acaataaatt ctcaataaag caagatatcg taggaataaa tgagtggtca 1200 gggtatagcg ggagttttgt tcagcatcca gaactaacag ggctggattg tataagacct 1260 tgcttctggg ttgaactaat aagagggcga cccgaagaga acacaatctg gactagcggg 1320 agcagcatat ccttttgtgg tgtaaacagt gacactgtgg gttggtcttg gccagacggt 1380 gctgagttgc catttaccat tgacaagtaa 1410 // ID MG027902; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027902; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/025/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/025/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; 91e4ff8ae8abf7990234aa0627112fe1. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/025/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/025/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="05-Nov-2015" FT /db_xref="taxon:2042264" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CRN5" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CRN5" FT /protein_id="ATN44683.1" FT /translation="MNPNQKIITIGSICMTIGMANLILQIGNIISIWVSHSIQIGNQSQ FT IETCHQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSQDN FT SVRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDNGAVAVLKYNGIITETIKSWKKR FT ILRTQESECVCVNGSCFTIMTDGPSNGAASYKIFKIEKGKVTKSIELNAPNFHYEECSC FT YPDTGTVMCVCRDNWHGSNRPWVSFNQNLDYQIGYICSGVFGDNPRPKDGEGSCNPVTV FT DGADGVKGFSYKYGNGVWIGRTKSNRLRKGFEMIWDPNGWTDTDSDFSVKQDVVAITDW FT SGYSGSFVQHPELTGLDCIRPCFWVELVRGLPRENTIWTSGSSISFCGVNSDTANWSWP FT DGALFPFTIDK" XX SQ Sequence 1410 BP; 437 A; 261 C; 335 G; 377 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcgatct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg aaacataatc tcaatatggg ttagccactc aattcaaatt 120 ggaaatcaaa gccagattga aacatgccat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcagtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtcaagaca acagtgtaag aatcggttcc aagggggatg tgtttgtcat aagggaacca 360 ttcatatcat gctctccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 540 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg tccagacaat 600 ggagctgtgg ctgtactaaa atacaacggc ataataactg aaaccataaa aagttggaaa 660 aagcgaatat taagaacaca agagtctgaa tgtgtctgtg tgaacgggtc atgtttcacc 720 ataatgaccg atggcccgag taatggggcc gcctcgtaca aaatcttcaa gatcgaaaag 780 gggaaggtta ctaaatcaat agagttgaat gcacccaatt ttcattatga ggaatgttcc 840 tgttacccag acactggcac agtgatgtgt gtatgcaggg acaactggca tggttcaaat 900 cgaccttggg tgtcttttaa tcaaaacctg gattatcaaa taggatacat ctgcagtggg 960 gtgttcggtg acaatccgcg tcccaaagat ggagagggca gctgtaatcc agtgactgtt 1020 gatggagcag acggagtaaa ggggttttca tacaaatatg gtaatggtgt ttggatagga 1080 aggactaaaa gtaacagact tagaaagggg tttgagatga tttgggatcc taatggatgg 1140 acagataccg acagtgattt ctcagtgaaa caggatgttg tggcaataac tgattggtca 1200 gggtacagcg gaagtttcgt tcaacatcct gagttaacag gattggactg tataagacct 1260 tgcttctggg ttgagttagt cagaggactg cctagagaaa atacaatctg gactagtggg 1320 agcagcattt ctttttgtgg cgtaaatagt gatactgcaa actggtcttg gccagacggt 1380 gctttgtttc catttaccat tgacaagtaa 1410 // ID MG027903; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027903; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/40/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/40/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; 845f49c0ef181ddc36e6de77c06c8e7c. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/40/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/40/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="07-Nov-2015" FT /db_xref="taxon:2042267" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CQI6" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CQI6" FT /protein_id="ATN44684.1" FT /translation="MNPNQKIITIGSVCMTIGMANLILQIGNIISIWVSHSIQIGNQSQ FT IETCNQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSKDN FT SVRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDNGAVAVLKYNGIITETIKSWKKR FT ILRTQESECVCVNGSCFTIMTDGPSNGAASYKIFKIEKGKVTKSIELNAPNFHYEECSC FT YPDTGTVMCVCRDNWHGSNRPWVSFNQNLDYQIGYICSGVFGDNPRPKDGEGSCNPVTV FT DGADGVKGFSYKYGNGVWIGRTKSNRLRKGFEMIWDPNGWTDTDSDFSVKQDIVGINEW FT SGYSGSFVQHPELTGLDCIRPCFWVELIRGRPEENTIWTSGSSISFCGVNSDTVGWSWP FT DGAELPFTIDK" XX SQ Sequence 1410 BP; 442 A; 260 C; 342 G; 366 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcggtct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg aaacataatc tcaatatggg ttagccactc aattcaaatt 120 ggaaatcaaa gccagattga aacatgcaat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcagtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtaaagaca acagtgtaag aatcggttcc aagggggatg tgtttgtcat aagggaacca 360 ttcatatcat gctctccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 540 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg tccagacaat 600 ggagctgtgg ctgtactaaa atacaacggc ataataactg aaaccataaa aagttggaaa 660 aagcgaatat taagaacaca agagtctgaa tgtgtctgtg tgaacgggtc atgtttcacc 720 ataatgaccg atggcccgag taatggggcc gcctcgtaca aaatcttcaa gatcgaaaag 780 gggaaggtta ctaaatcaat agagttgaat gcacccaatt ttcattatga ggaatgttcc 840 tgttacccag acactggcac agtgatgtgt gtatgcaggg acaactggca tggttcaaat 900 cgaccttggg tgtcttttaa tcaaaacctg gattatcaaa taggatacat ctgcagtggg 960 gtgttcggtg acaatccgcg tcccaaagat ggagagggca gctgtaatcc agtgactgtt 1020 gatggagcag acggagtaaa ggggttttca tacaaatatg gtaatggtgt ttggatagga 1080 aggactaaaa gtaacagact tagaaagggg tttgagatga tttgggatcc taatggatgg 1140 acagataccg acagtgattt ctcagtgaaa caagatatcg taggaataaa tgagtggtca 1200 gggtatagcg ggagttttgt tcagcatcca gaactaacag ggctggattg tataagacct 1260 tgcttctggg ttgaactaat aagagggcga cccgaagaga acacaatctg gactagcggg 1320 agcagcatat ccttttgtgg tgtaaacagt gacactgtgg gttggtcttg gccagacggt 1380 gctgagttgc catttaccat tgacaagtaa 1410 // ID MG027904; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027904; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/41/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/41/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; 8cadd31d6bd968abb08a6cfde317ae8f. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/41/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/41/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="06-Nov-2015" FT /db_xref="taxon:2042268" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CQI0" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CQI0" FT /protein_id="ATN44685.1" FT /translation="MNPNQKIITIGSVCMTIGMANLILQIGNIISIWVSHSIQIGNQSQ FT IETCNQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSKDN FT SVRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDSGAVAVLKYNGIITDTIKSWRNN FT ILRTQESECACVNGSCFTIMTDGPSDGQASYKIFRIEKGKIIKSVEMKAPNYHYEECSC FT YPDSSEITCVCRDNWHGSNRPWVSFNQNLEYQMGYICSGVFGDNPRPNDKTGSCGPVSS FT NGANGVKGFSFKYGNGVWIGRTKSISSRKGFEMIWDPNGWTGTDNKFSIKQDIVGINEW FT SGYSGSFVQHPELTGLDCIRPCFWVELIRGRPEENTIWTSGSSISFCGVNSDTVGWSWP FT DGAELPFTIDK" XX SQ Sequence 1410 BP; 449 A; 259 C; 335 G; 367 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcggtct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg aaacataatc tcaatatggg ttagccactc aattcaaatt 120 ggaaatcaaa gccagattga aacatgcaat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcagtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtaaagaca acagtgtaag aatcggttcc aagggggatg tgtttgtcat aagggaacca 360 ttcatatcat gctctccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 540 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg cccagacagt 600 ggggcagtgg ctgtgttaaa gtacaatggc ataataacag acactatcaa gagttggagg 660 aacaatatat tgagaacaca agagtctgaa tgtgcatgtg taaatggttc ttgctttacc 720 ataatgaccg atggaccaag tgatggacag gcctcataca aaatcttcag aatagaaaag 780 ggaaagataa tcaaatcagt cgaaatgaaa gcccctaatt atcactatga ggaatgctcc 840 tgttaccctg attctagtga aatcacatgt gtgtgcaggg ataactggca tggctcgaat 900 cgaccgtggg tgtctttcaa ccagaatctg gaatatcaga tgggatacat atgcagtggg 960 gttttcggag acaatccacg ccctaatgat aagacaggca gttgtggtcc agtatcgtct 1020 aatggagcaa atggagtaaa aggattttca ttcaaatacg gcaatggtgt ttggataggg 1080 agaactaaaa gcattagttc aagaaaaggt tttgagatga tttgggatcc gaatggatgg 1140 actgggactg acaataaatt ctcaataaag caagatatcg taggaataaa tgagtggtca 1200 gggtatagcg ggagttttgt tcagcatcca gaactaacag ggctggattg tataagacct 1260 tgcttctggg ttgaactaat aagagggcga cccgaagaga acacaatctg gactagcggg 1320 agcagcatat ccttttgtgg tgtaaacagt gacactgtgg gttggtcttg gccagacggt 1380 gctgagttgc catttaccat tgacaagtaa 1410 // ID MG027905; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027905; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/50/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/50/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; 9ddbafc72441eab82aae008fd91e0316. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/50/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/50/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="04-Nov-2015" FT /db_xref="taxon:2042269" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CRZ8" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CRZ8" FT /protein_id="ATN44686.1" FT /translation="MNPNQKIITIGSVCMTIGMANLILQIGNIISIWISHSIQIGNQSQ FT IETCNQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSKDN FT SVRIGSKGDVFVMREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDSGAVAVLKYNGIITDTIKSWRNN FT ILRTQESECACVNGSCFTIMTDGPSDGQASYKIFRIEKGKIVKSVEMNAPNYHYEECSC FT YPDSSEITCVCRDNWHGSNRPWVSFNQNLEYQIGYICSGVFGDNPRPNDKTGSCGPVSS FT NGANGVKGFSFKYGNGVWIGRTKSTSSRKGFEMIWDPNGWTGTDNKFSIKQDIVGINEW FT SGYSGSFVQHPELTGLDCIRPCFWVELIRGRPEENTIWTSGSGISFCGVNSDTVGWSWP FT DGAELPFTIDK" XX SQ Sequence 1410 BP; 447 A; 260 C; 335 G; 368 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcggtct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg aaacataatc tcaatatgga ttagccactc aattcaaatt 120 ggaaatcaaa gtcagattga aacatgcaat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcggtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtaaagaca acagtgtaag aatcggttcc aagggggatg tgtttgtcat gagggaacca 360 ttcatatcat gctctccctt ggaatgcaga accttcttct tgactcaagg ggccttacta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 540 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg cccagacagt 600 ggggcagtgg ctgtgttaaa gtacaatggc ataataacag acactatcaa gagttggagg 660 aacaacatat tgagaacaca agaatctgaa tgtgcatgtg taaatggttc ttgctttacc 720 ataatgaccg atggaccaag tgatggacag gcctcataca agatcttcag aatagaaaag 780 ggaaagatag tcaaatcagt cgaaatgaat gcccctaatt atcactatga ggaatgctcc 840 tgttatcctg attctagtga aatcacatgt gtgtgcaggg ataactggca tggctcgaat 900 cgaccgtggg tgtctttcaa ccagaatctg gaatatcaga taggatacat atgcagtggg 960 gttttcggag acaatccacg ccctaatgat aagacaggca gttgtggtcc agtatcgtct 1020 aatggagcaa atggagtaaa aggattttca ttcaaatacg gcaatggtgt ttggataggg 1080 agaactaaaa gcactagttc aagaaaaggt tttgagatga tttgggatcc gaatggatgg 1140 actgggactg acaataaatt ctcaataaag caagatatcg taggaataaa tgagtggtca 1200 ggatatagcg ggagttttgt tcagcatcca gaactaacag ggctggattg tataagacct 1260 tgcttctggg ttgaactaat cagagggcga cccgaagaga acacaatctg gactagcggg 1320 agcggcatat ccttttgtgg tgtaaacagt gacactgtgg gttggtcttg gccagacggt 1380 gctgagttgc catttaccat tgacaagtaa 1410 // ID MG027906; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027906; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/61/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/61/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; 8cadd31d6bd968abb08a6cfde317ae8f. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/61/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/61/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="08-Nov-2015" FT /db_xref="taxon:2042270" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CQH7" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CQH7" FT /protein_id="ATN44687.1" FT /translation="MNPNQKIITIGSVCMTIGMANLILQIGNIISIWVSHSIQIGNQSQ FT IETCNQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSKDN FT SVRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDSGAVAVLKYNGIITDTIKSWRNN FT ILRTQESECACVNGSCFTIMTDGPSDGQASYKIFRIEKGKIIKSVEMKAPNYHYEECSC FT YPDSSEITCVCRDNWHGSNRPWVSFNQNLEYQMGYICSGVFGDNPRPNDKTGSCGPVSS FT NGANGVKGFSFKYGNGVWIGRTKSISSRKGFEMIWDPNGWTGTDNKFSIKQDIVGINEW FT SGYSGSFVQHPELTGLDCIRPCFWVELIRGRPEENTIWTSGSSISFCGVNSDTVGWSWP FT DGAELPFTIDK" XX SQ Sequence 1410 BP; 449 A; 259 C; 335 G; 367 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcggtct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg aaacataatc tcaatatggg ttagccactc aattcaaatt 120 ggaaatcaaa gccagattga aacatgcaat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcagtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtaaagaca acagtgtaag aatcggttcc aagggggatg tgtttgtcat aagggaacca 360 ttcatatcat gctctccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 540 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg cccagacagt 600 ggggcagtgg ctgtgttaaa gtacaatggc ataataacag acactatcaa gagttggagg 660 aacaatatat tgagaacaca agagtctgaa tgtgcatgtg taaatggttc ttgctttacc 720 ataatgaccg atggaccaag tgatggacag gcctcataca aaatcttcag aatagaaaag 780 ggaaagataa tcaaatcagt cgaaatgaaa gcccctaatt atcactatga ggaatgctcc 840 tgttaccctg attctagtga aatcacatgt gtgtgcaggg ataactggca tggctcgaat 900 cgaccgtggg tgtctttcaa ccagaatctg gaatatcaga tgggatacat atgcagtggg 960 gttttcggag acaatccacg ccctaatgat aagacaggca gttgtggtcc agtatcgtct 1020 aatggagcaa atggagtaaa aggattttca ttcaaatacg gcaatggtgt ttggataggg 1080 agaactaaaa gcattagttc aagaaaaggt tttgagatga tttgggatcc gaatggatgg 1140 actgggactg acaataaatt ctcaataaag caagatatcg taggaataaa tgagtggtca 1200 gggtatagcg ggagttttgt tcagcatcca gaactaacag ggctggattg tataagacct 1260 tgcttctggg ttgaactaat aagagggcga cccgaagaga acacaatctg gactagcggg 1320 agcagcatat ccttttgtgg tgtaaacagt gacactgtgg gttggtcttg gccagacggt 1380 gctgagttgc catttaccat tgacaagtaa 1410 // ID MG027907; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027907; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/63/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/63/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; 843d808e73ff98562d587f107830ce53. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/63/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/63/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="08-Nov-2015" FT /db_xref="taxon:2042271" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CQJ2" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CQJ2" FT /protein_id="ATN44688.1" FT /translation="MNPNQKIITIGSICMTIGMANLILQIGNIISIWVSHSIQIGNQSQ FT IETCHQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSQDN FT SVRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDSGAVAVLKYNGIITDTIKSWRNN FT ILRTQESECACVNGSCFTIMTDGPSDGQASYKIFRIEKGKIIKSVEMKAPNYHYEECSC FT YPDSSEITCVCRDNWHGSNRPWVSFNQNLEYQMGYICRGVFGDNPRPNDKTGSCGPVSS FT NGANGVKGFSFKYGNGVWIGRTKSISSRNGFEMIWDPNGWTGTDNKFSIKQDIVGINEW FT SGYSGSFVQHPELTGLDCIRPCFWVELIRGRPEENTIWTSGSSISFCGVNSDTVGWSWP FT DGAELPVTIDK" XX SQ Sequence 1410 BP; 443 A; 263 C; 338 G; 366 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcgatct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg aaacataatc tcaatatggg ttagccactc aattcaaatt 120 ggaaatcaaa gccagattga aacatgccat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcagtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtcaagaca acagtgtaag aatcggttcc aagggggatg tgtttgtcat aagggaacca 360 ttcatatcat gctctccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 540 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg cccagacagt 600 ggggcagtgg ctgtgttaaa gtacaatggc ataataacag acactatcaa gagttggagg 660 aacaatatat tgagaacaca agagtctgaa tgtgcatgtg taaatggttc ttgctttacc 720 ataatgaccg atggaccaag tgatggacag gcctcataca aaatcttcag gatagaaaag 780 ggaaagataa tcaaatcagt cgaaatgaaa gcccctaatt atcactatga ggaatgctcc 840 tgttaccctg attctagtga aatcacatgt gtgtgcaggg ataactggca tggctcgaat 900 cgaccgtggg tgtctttcaa ccagaatctg gaatatcaga tgggatacat atgccgtggg 960 gttttcggag acaatccacg ccctaatgat aagacaggca gttgtggtcc agtatcgtct 1020 aatggagcaa atggagtaaa aggattttca ttcaaatacg gcaatggtgt ttggataggg 1080 agaactaaaa gcattagttc aagaaacggt tttgagatga tttgggatcc gaatggatgg 1140 actgggactg acaataaatt ctcaataaag caagatatcg taggaataaa tgagtggtca 1200 gggtatagcg ggagttttgt tcagcatcca gaactaacag ggctggattg tataagacct 1260 tgcttctggg ttgaactaat aagagggcga cccgaagaga acacaatctg gactagcggg 1320 agcagcatat ccttttgtgg tgtaaacagt gacactgtgg gttggtcttg gccagacggt 1380 gctgagttgc cggttaccat tgacaagtag 1410 // ID MG027908; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027908; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/68/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/68/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; 2a374f166baf8801e7fc28c207ee35cf. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/68/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/68/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="08-Nov-2015" FT /db_xref="taxon:2042272" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CQJ7" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CQJ7" FT /protein_id="ATN44689.1" FT /translation="MNPNQKIITIGSICMTIGMANLILQIGNIISIWVSHSIQIGNQSQ FT IETCHQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSQDN FT SVRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDSGAVAVLKYNGIITDTIKSWRNN FT ILRTQESECACVNGSCFTIMTDGPSDGQASYKIFRIEKGKIIKSVEMKAPNYHYEECSC FT YPDSSEITCVCRDNWHGSNRPWVSFNQNLEYQMGYICRGVFGDNPRPNDKTGSCGPVSS FT NGANGVKGFSFKYGNGVWIGRTKSISSRNGFEMIWDPNGWTGTDNKFSIKQDIVGINEW FT SGYSGSFVQHPELTGLDCIRPCFWVELIRGRPEENTIWTSGSSISFCGVNSDTVGWSWP FT DGAELPFTIDK" XX SQ Sequence 1410 BP; 444 A; 263 C; 336 G; 367 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcgatct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg aaacataatc tcaatatggg ttagccactc aattcaaatt 120 ggaaatcaaa gccagattga aacatgccat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcagtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtcaagaca acagtgtaag aatcggttcc aagggggatg tgtttgtcat aagggaacca 360 ttcatatcat gctctccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 540 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg cccagacagt 600 ggggcagtgg ctgtgttaaa gtacaatggc ataataacag acactatcaa gagttggagg 660 aacaatatat tgagaacaca agagtctgaa tgtgcatgtg taaatggttc ttgctttacc 720 ataatgaccg atggaccaag tgatggacag gcctcataca aaatcttcag gatagaaaag 780 ggaaagataa tcaaatcagt cgaaatgaaa gcccctaatt atcactatga ggaatgctcc 840 tgttaccctg attctagtga aatcacatgt gtgtgcaggg ataactggca tggctcgaat 900 cgaccgtggg tgtctttcaa ccagaatctg gaatatcaga tgggatacat atgccgtggg 960 gttttcggag acaatccacg ccctaatgat aagacaggca gttgtggtcc agtatcgtct 1020 aatggagcaa atggagtaaa aggattttca ttcaaatacg gcaatggtgt ttggataggg 1080 agaactaaaa gcattagttc aagaaacggt tttgagatga tttgggatcc gaatggatgg 1140 actgggactg acaataaatt ctcaataaag caagatatcg taggaataaa tgagtggtca 1200 gggtatagcg ggagttttgt tcagcatcca gaactaacag ggctggattg tataagacct 1260 tgcttctggg ttgaactaat aagagggcga cccgaagaga acacaatctg gactagcggg 1320 agcagcatat ccttttgtgg tgtaaacagt gacactgtgg gttggtcttg gccagacggt 1380 gctgagttgc catttaccat tgacaagtag 1410 // ID MG027909; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC MG027909; XX DT 31-OCT-2017 (Rel. 134, Created) DT 31-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/Shiraz/75/2015(H1N1)) segment 6 neuraminidase (NA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Shiraz/75/2015(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RA Rashidi O., Pirbonyeh N., Emami A., Edalat F., Tavakoli Movaghar N., RA Moattari A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Bacteriology & Virology, Shiraz University of Medical Sciences, Shiraz RL Medical School, Zand Street, Setad SQ, Shiraz, Fars 7134845794, Iran XX DR MD5; 336d416068db5503a9a1f260e6bf8d44. XX CC ##Assembly-Data-START## CC Assembly Method :: Chromas v. 2.6.1; CLC v. 6; Mega v. 6 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus (A/Shiraz/75/2015(H1N1))" FT /segment="6" FT /host="Homo sapiens" FT /strain="A/Shiraz/75/2015" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Iran" FT /isolation_source="throat swab" FT /collection_date="08-Nov-2015" FT /db_xref="taxon:2042273" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A2D1CQJ4" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A2D1CQJ4" FT /protein_id="ATN44690.1" FT /translation="MNPNQKIITIGSVCMTIGMANLILQIGNIISIWVSHSIQIGNQSQ FT IETCNQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSKDN FT SVRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMSCPI FT GEVPSPYNSRFESVAWSASACHDGIVCMTIGISGPDNGAVAVLKYNGIITETIKSWKKR FT ILRTQESECVCVNGSCFTIMTDGPSNGAASYKIFKIEKGKVTKSIELNAPNFHYEECSC FT YPDTGTVMCVCRDNWHGSNRPWVSFNQNLDYQIGYICSGVFGDNPRPKDGEGSCNPVTV FT DGADGVKGFSYKYGNGVWIGRTKSNRLRKGFEMIWDPNGWTDTDSDFSVKQDVVAITDW FT SGYSGSFVQHPELTGLDCIRPCFWVELVKGLPRENTIWTSGSSISFCGVNIDTANWSWP FT DGALFPFTIDK" XX SQ Sequence 1410 BP; 437 A; 259 C; 335 G; 379 T; 0 other; atgaatccaa accaaaagat aataaccatt ggttcggtct gtatgacaat tggaatggct 60 aacttaatat tacaaattgg aaacataatc tcaatatggg ttagccactc aattcaaatt 120 ggaaatcaaa gccagattga aacatgcaat caaagcgtca ttacttatga aaacaacact 180 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcagtg 240 gtttccgtga aattagcggg caattcctct ctctgccctg ttagtggatg ggctatatac 300 agtaaagaca acagtgtaag aatcggttcc aagggggatg tgtttgtcat aagggaacca 360 ttcatatcat gctctccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 420 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 480 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 540 gcaagtgctt gtcatgatgg cattgtctgt atgacaatcg gaatttctgg tccagacaat 600 ggagctgtgg ctgtactaaa atacaacggc ataataactg aaaccataaa aagttggaaa 660 aagcgaatat taagaacaca agagtctgaa tgtgtctgtg tgaacgggtc atgtttcacc 720 ataatgaccg atggcccgag taatggggcc gcctcgtaca aaatcttcaa gatcgaaaag 780 gggaaggtta ctaaatcaat agagttgaat gcacccaatt ttcattatga ggaatgttcc 840 tgttacccag acactggcac agtgatgtgt gtatgcaggg acaactggca tggttcaaat 900 cgaccttggg tgtcttttaa tcaaaacctg gattatcaaa taggatacat ctgcagtggg 960 gtgttcggtg acaatccgcg tcccaaagat ggagagggca gctgtaatcc agtgactgtt 1020 gatggagcag acggagtaaa ggggttttca tacaaatatg gtaatggtgt ttggatagga 1080 aggactaaaa gtaacagact tagaaagggg tttgagatga tttgggatcc taatggatgg 1140 acagataccg acagtgattt ctcagtgaaa caggatgttg tggcaataac tgattggtca 1200 gggtacagcg gaagtttcgt tcaacatcct gagttaacag gattggactg tataagacct 1260 tgcttctggg ttgagttagt caaaggactg cctagagaaa atacaatctg gactagtggg 1320 agcagcattt ctttttgtgg cgtaaatatt gatactgcaa actggtcttg gccagacggt 1380 gctttgtttc catttaccat tgacaagtaa 1410 // ID MG027911; SV 1; linear; viral cRNA; STD; VRL; 890 BP. XX AC MG027911; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Mutant Influenza A virus (A/California/MA_07/2009(H1N1)) segment 8 nuclear DE export protein (NEP) and nonstructural protein 1 (NS1) genes, complete cds. XX KW . XX OS Influenza A virus (A/California/MA_07/2009(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RC Publication Status: Online-Only RP 1-890 RX PUBMED; 29783694. RA Slaine P.D., MacRae C., Kleer M., Lamoureux E., McAlpine S., Warhuus M., RA Comeau A.M., McCormick C., Hatchette T., Khaperskyy D.A.; RT "Adaptive Mutations in Influenza A/California/07/2009 Enhance Polymerase RT Activity and Infectious Virion Production"; RL Viruses 10(5):E272-E272(2018). XX RN [2] RP 1-890 RA Slaine P., MacRae C., Kleer M., Lamoeureoux E., McAlpine S., Warhuus M., RA Comeau A., Khaperskyy D., Hachette T., McCormick C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Microbiology and Immunology, Dalhousie University, 5850 College Street, RL Halifax, Nova Scotia B3H 4R2, Canada XX DR MD5; 0f012c06e8fbe9132f825d33a7138ca7. DR EuropePMC; PMC5977265; 29783694. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R 8.1.8 CC Coverage :: >20 000 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..890 FT /organism="Influenza A virus FT (A/California/MA_07/2009(H1N1))" FT /segment="8" FT /strain="A/California/MA_07/2009" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /note="California/07/2009 passaged in Swiss-webster mice 10 FT times" FT /db_xref="taxon:2042258" FT gene 27..864 FT /gene="NEP" FT /gene_synonym="NS2" FT CDS join(27..56,529..864) FT /codon_start=1 FT /gene="NEP" FT /gene_synonym="NS2" FT /product="nuclear export protein" FT /note="nonstructural protein 2" FT /db_xref="GOA:A0A343LDT2" FT /db_xref="InterPro:IPR000968" FT /db_xref="UniProtKB/TrEMBL:A0A343LDT2" FT /protein_id="ATN44692.1" FT /translation="MDSNTMSSFQDILMRMSKMQLGSSSEDLNGMVTRFESLKIYRDSL FT GETVMRMGDLHYLQSRNEKWREQLGQKFEEIRWLIEEMRHRLKATENSFEQITFMQALQ FT LLLEVEQEIRAFSFQLI" FT gene 27..686 FT /gene="NS1" FT CDS 27..686 FT /codon_start=1 FT /gene="NS1" FT /product="nonstructural protein 1" FT /db_xref="GOA:A0A343LDT1" FT /db_xref="InterPro:IPR000256" FT /db_xref="InterPro:IPR004208" FT /db_xref="InterPro:IPR009068" FT /db_xref="InterPro:IPR038064" FT /db_xref="UniProtKB/TrEMBL:A0A343LDT1" FT /protein_id="ATN44691.1" FT /translation="MDSNTMSSFQVDCFLWHIRKRFADNGLGDAPFLDRLRRDQKSLKG FT RGNTLGLDIETATLVGKQIVEWILKEESSETLRMTIASVPTSRYLSDMTLEEMSRDWFM FT LMPRQKIIGPLCVRLDQAIMEKNIVLKANFSVIFNRLETLILLRAFTEEGAIVGEISPL FT PSLPGHTYEDVKNAVGVLIGGLEWNGNTVRVSENIQRFAWRNCDENGRPSLPPEQK" XX SQ Sequence 890 BP; 285 A; 178 C; 210 G; 217 T; 0 other; agcgaaagca gggtgacaaa aacataatgg actccaacac catgtcaagc tttcaggtag 60 actgtttcct ttggcatatc cgcaagcgat ttgcagacaa tggattgggt gatgccccat 120 tccttgatcg gctccgccga gatcaaaagt ccttaaaagg aagaggcaac acccttggcc 180 tcgatatcga aacagccact cttgttggga aacaaatcgt ggaatggatc ttgaaagagg 240 aatccagcga gacacttaga atgacaattg catctgtacc tacttcgcgc tacctttctg 300 acatgaccct cgaggaaatg tcacgagact ggttcatgct catgcctagg caaaagataa 360 taggccctct ttgcgtgcga ttggaccagg cgatcatgga aaagaacata gtactgaaag 420 cgaacttcag tgtaatcttt aaccgattag agaccttgat actactaagg gctttcactg 480 aggagggagc aatagttggg gaaatttcac cattaccttc tcttccagga catacttatg 540 aggatgtcaa aaatgcagtt ggggtcctca tcggaggact tgaatggaat ggtaacacgg 600 ttcgagtctc tgaaaatata cagagattcg cttggagaaa ctgtgatgag aatgggagac 660 cttcactacc tccagagcag aaatgaaaag tggcgagagc aattgggaca gaaatttgag 720 gaaataaggt ggttaattga agaaatgcgg cacagattga aagcgacaga gaatagtttc 780 gaacaaataa catttatgca agccttacaa ctactgcttg aagtagaaca agagataaga 840 gctttctcgt ttcagcttat ttaatgataa aaaacaccct tgtttctact 890 // ID MG027912; SV 1; linear; viral cRNA; STD; VRL; 2236 BP. XX AC MG027912; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Mutant Influenza A virus (A/California/MA_07/2009(H1N1)) segment 3 DE polymerase PA (PA) and PA-X protein (PA-X) genes, complete cds. XX KW . XX OS Influenza A virus (A/California/MA_07/2009(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RC Publication Status: Online-Only RP 1-2236 RX PUBMED; 29783694. RA Slaine P.D., MacRae C., Kleer M., Lamoureux E., McAlpine S., Warhuus M., RA Comeau A.M., McCormick C., Hatchette T., Khaperskyy D.A.; RT "Adaptive Mutations in Influenza A/California/07/2009 Enhance Polymerase RT Activity and Infectious Virion Production"; RL Viruses 10(5):E272-E272(2018). XX RN [2] RP 1-2236 RA Slaine P., MacRae C., Kleer M., Lamoeureoux E., McAlpine S., Warhuus M., RA Comeau A., Khaperskyy D., Hachette T., McCormick C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Microbiology and Immunology, Dalhousie University, 5850 College Street, RL Halifax, Nova Scotia B3H 4R2, Canada XX DR MD5; a8d413af0ef965283793d065148c7ab1. DR EuropePMC; PMC5977265; 29783694. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R8.1.8 CC Coverage :: >400 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..2236 FT /organism="Influenza A virus FT (A/California/MA_07/2009(H1N1))" FT /segment="3" FT /strain="A/California/MA_07/2009" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /note="California/07/2009 passaged 10 times in FT swiss-webster mice" FT /db_xref="taxon:2042258" FT gene 28..2178 FT /gene="PA" FT CDS 28..2178 FT /codon_start=1 FT /gene="PA" FT /product="polymerase PA" FT /db_xref="GOA:A0A343LDT3" FT /db_xref="InterPro:IPR001009" FT /db_xref="InterPro:IPR037534" FT /db_xref="InterPro:IPR038372" FT /db_xref="UniProtKB/TrEMBL:A0A343LDT3" FT /protein_id="ATN44693.1" FT /translation="MEDFVRQCFNPMIVELAGKAMKEYGEDPKIETNKFAAICTHLEVC FT FMYSDFHFIDERGESIIVESGDPNALLKHRFEIIEGRDRIMAWTVVNSICNTTGVEKPK FT FLPDLYDYKENRFIEIGVTRREVHIYYLEKANKIKSEKTHIHIFSFTGEEMATKADYTL FT DEESRARIKTRLFTIRQEMASRSLWDSFRQSERGEETIEEKFEITGTMRKLADQSLPPN FT FPSLENFRAYVDGFEPNGCIEGKLSQMSKEVNAKIEPFLRTTPRPLRLPDGPLCHQRSK FT FLLMDALKLSIEDPSHEGEGIPLYDAIKCMKTFFGWKEPNIVKPHEKGINPNYLMAWKQ FT VLAELQDIGNEEKIPRTKNMKRTSQLKWALGENMAPEKVDFDDCKDVGDLKQYDSDEPE FT PRSLASWVQNEFNKACELTDSSWIELDEIGEDVAPIEHIASMRRNYFTAEVSHCRATEY FT IMKGVYINTALLNASCAAMDDFQLIPMISKCRTKEGRRKTNLYGFIIKGRSHLRNDTDV FT VNFVSMEFSLTDPRLEPHKWEKYCVLEIGDMLLRTAIGQVSRPMFLYVRTNGTSKIKMK FT WGMEMRRCLLQSLQQIESMIEAESSVKEKDMTKEFFENKSETWPIGESPRGVEEGSIGK FT VCRTLLAKSVFNSLYASPQLEGFSAESRKLLLIVQALRDNLEPGTFDLGGLYEAIEECL FT INDPWVLLNASWFNSFLTHALK" FT gene 28..727 FT /gene="PA-X" FT CDS join(28..597,599..727) FT /codon_start=1 FT /ribosomal_slippage FT /gene="PA-X" FT /product="PA-X protein" FT /db_xref="GOA:A0A343LDT4" FT /db_xref="InterPro:IPR001009" FT /db_xref="InterPro:IPR038372" FT /db_xref="UniProtKB/TrEMBL:A0A343LDT4" FT /protein_id="ATN44694.1" FT /translation="MEDFVRQCFNPMIVELAGKAMKEYGEDPKIETNKFAAICTHLEVC FT FMYSDFHFIDERGESIIVESGDPNALLKHRFEIIEGRDRIMAWTVVNSICNTTGVEKPK FT FLPDLYDYKENRFIEIGVTRREVHIYYLEKANKIKSEKTHIHIFSFTGEEMATKADYTL FT DEESRARIKTRLFTIRQEMASRSLWDSFVSPKEAKRQLKKNLRLQELCASLPTKVSHRT FT SPALKTLEPM" XX SQ Sequence 2236 BP; 730 A; 440 C; 543 G; 523 T; 0 other; agcgaaagca ggtcaaatat attcaatatg gaagactttg tgcgacaatg cttcaatcca 60 atgatcgtcg agcttgcggg aaaggcaatg aaagaatatg gggaagatcc gaaaatcgaa 120 actaacaagt ttgctgcaat atgcacacat ttggaagttt gtttcatgta ttcggatttc 180 catttcatcg acgaacgggg tgaatcaata attgtagaat ctggtgaccc gaatgcacta 240 ttgaagcacc gatttgagat aattgaagga agagaccgaa tcatggcctg gacagtggtg 300 aacagtatat gtaacacaac aggggtagag aagcctaaat ttcttcctga tttgtatgat 360 tacaaagaga accggttcat tgaaattgga gtaacacgga gggaagtcca catatattac 420 ctagagaaag ccaacaaaat aaaatctgag aagacacaca ttcacatctt ttcattcact 480 ggagaggaga tggccaccaa agcggactac acccttgacg aagagagcag ggcaagaatc 540 aaaactaggc ttttcactat aagacaagaa atggccagta ggagtctatg ggattccttt 600 cgtcagtccg aaagaggcga agagacaatt gaagaaaaat ttgagattac aggaactatg 660 cgcaagcttg ccgaccaaag tctcccaccg aacttcccca gccttgaaaa ctttagagcc 720 tatgtagatg gattcgagcc gaacggctgc attgagggca agctttccca aatgtcaaaa 780 gaagtgaacg ccaaaattga accattcttg aggacgacac cacgccccct cagattgcct 840 gatgggcctc tttgccatca gcggtcaaag ttcctgctga tggatgctct gaaattaagt 900 attgaagacc cgagtcacga gggggaggga ataccactat atgatgcaat caaatgcatg 960 aagacattct ttggctggaa agagcctaac atagtcaaac cacatgagaa aggcataaat 1020 cccaattacc tcatggcttg gaagcaggtg ctagcagagc tacaggacat tggaaatgaa 1080 gagaagatcc caaggacaaa gaacatgaag agaacaagcc aattgaagtg ggcactcggt 1140 gaaaatatgg caccagaaaa agtagacttt gatgactgca aagatgttgg agaccttaaa 1200 cagtatgaca gtgatgagcc agagcccaga tctctagcaa gctgggtcca aaatgaattc 1260 aataaggcat gtgaattgac tgattcaagc tggatagaac ttgatgaaat aggagaagat 1320 gttgccccga ttgaacatat cgcaagcatg aggaggaact attttacagc agaagtgtcc 1380 cactgcaggg ctactgaata cataatgaag ggagtgtaca taaatacggc cttgctcaat 1440 gcatcctgtg cagccatgga tgactttcag ctgatcccaa tgataagcaa atgtaggacc 1500 aaagaaggaa gacggaaaac aaacctgtat gggttcatta taaaaggaag gtctcatttg 1560 agaaatgata ctgatgtggt gaactttgta agtatggagt tctcactcac tgacccgaga 1620 ctggagccac acaaatggga aaaatactgt gttcttgaaa taggagacat gctcttgagg 1680 actgcgatag gccaagtgtc gaggcccatg ttcctatatg tgagaaccaa tggaacctcc 1740 aagatcaaga tgaaatgggg catggaaatg aggcgctgcc ttcttcagtc tcttcagcag 1800 attgagagca tgattgaggc cgagtcttct gtcaaagaga aagacatgac caaggaattc 1860 tttgaaaaca aatcggaaac atggccaatc ggagagtcac ccaggggagt ggaggaaggc 1920 tctattggga aagtgtgcag gaccttactg gcaaaatctg tattcaacag tctatatgcg 1980 tctccacaac ttgaggggtt ttcggctgaa tctagaaaat tgcttctcat tgttcaggca 2040 cttagggaca acctggaacc tggaaccttc gatcttgggg ggctatatga agcaatcgag 2100 gagtgcctga ttaatgatcc ctgggttttg cttaatgcat cttggttcaa ctccttcctc 2160 acacatgcac tgaagtagtt gtggcaatgc tactatttgc tatccatact gtccaaaaaa 2220 gtaccttgtt tctact 2236 // ID MG027913; SV 1; linear; viral cRNA; STD; VRL; 2341 BP. XX AC MG027913; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Mutant Influenza A virus (A/California/MA_07/2009(H1N1)) segment 1 DE polymerase PB2 (PB2) gene, complete cds. XX KW . XX OS Influenza A virus (A/California/MA_07/2009(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RC Publication Status: Online-Only RP 1-2341 RX PUBMED; 29783694. RA Slaine P.D., MacRae C., Kleer M., Lamoureux E., McAlpine S., Warhuus M., RA Comeau A.M., McCormick C., Hatchette T., Khaperskyy D.A.; RT "Adaptive Mutations in Influenza A/California/07/2009 Enhance Polymerase RT Activity and Infectious Virion Production"; RL Viruses 10(5):E272-E272(2018). XX RN [2] RP 1-2341 RA Slaine P., MacRae C., Kleer M., Lamoeureoux E., McAlpine S., Warhuus M., RA Comeau A., Khaperskyy D., Hachette T., McCormick C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Microbiology and Immunology, Dalhousie University, 5850 College Street, RL Halifax, Nova Scotia B3H 4R2, Canada XX DR MD5; bb367965d20e2aeaeaf1d7891ad99dda. DR EuropePMC; PMC5977265; 29783694. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R 8.1.8 CC Coverage :: >1000 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..2341 FT /organism="Influenza A virus FT (A/California/MA_07/2009(H1N1))" FT /segment="1" FT /strain="A/California/MA_07/2009" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /note="California/07/2009 passaged in Swiss-Webster mice 10 FT times" FT /db_xref="taxon:2042258" FT gene 28..2307 FT /gene="PB2" FT CDS 28..2307 FT /codon_start=1 FT /gene="PB2" FT /product="polymerase PB2" FT /db_xref="GOA:A0A343LDT5" FT /db_xref="InterPro:IPR001591" FT /db_xref="InterPro:IPR037258" FT /db_xref="UniProtKB/TrEMBL:A0A343LDT5" FT /protein_id="ATN44695.1" FT /translation="MERIKELRDLMSQSRTREILTKTTVDHMAIIKKYTSGRQEKNPAL FT RMKWMMAMRYPITADKRIMDMIPERNEQGQTLWSKTNDAGSDRVMVSPLAVTWWNRNGP FT TTSTVHYPKVYKTYFEKVERLKHGTFGPVHFRNQVKIRRRVDTNPGHADLSAKEAQDVI FT MEVVFPNEVGARILTSESQLAITKEKKEELQDCKIAPLMVAYMLERELVRKTRFLPVAG FT GTGSVYIEVLHLTQGTCWEQMYTPGGEVRNDDVDQSLIIAARNIVRRAAVSADPLASLL FT EMCHSTQIGGVRMVDILRQNPTEEQAVDICKAAIGLRISSSFSFGGFTFKRTSGSSVKK FT EEEVLTGNLQTLKIRVHEGYEEFTMVGRRATAILRKATRRLIQLIVSGRDEQSIAEAII FT VAMVFSQEDCMIKAVRGDLNFVNRANQRLNPMHQLLRHFQKDAKVLFQNWGIESIDNVM FT GMIGILPDMTPSTEMSLRGIRVSKMGVDEYSSTERVVVSIDRFLRVRDQRGNVLLSPEE FT VSETQGTEKLTITYSSSMMWEINGPESVLVNTYQWIIRNWEIVKIQWSQDPTMLYNKME FT FEPFQSLVPKATRSRYSGFVRTLFQQMRDVLGTFDTVQIIKLLPFAAAPPEQSRMQFSS FT LTVNVRGSGLRILVRGNSPVFNYNKATKRLTVLGKDAGALTEDPDEGTSGVESAVLRGF FT LILGKEDKRYGPALSINELSNLAKGEKANVLIGQGDVVLVMKRKRDSSILTDSQTATKR FT IRMAIN" XX SQ Sequence 2341 BP; 789 A; 443 C; 599 G; 510 T; 0 other; agcgaaagca ggtcaaatat attcaatatg gagagaataa aagaactgag agatctaatg 60 tcgcagtccc gcactcgcga gatactcact aagaccactg tggaccatat ggccataatc 120 aaaaagtaca catcaggaag gcaagagaag aaccccgcac tcagaatgaa gtggatgatg 180 gcaatgagat acccaattac agcagacaag agaataatgg acatgattcc agagaggaat 240 gaacaaggac aaaccctctg gagcaaaaca aacgatgctg gatcagaccg agtgatggta 300 tcacctctgg ccgtaacatg gtggaatagg aatggcccaa caacaagtac agttcattac 360 cctaaggtat ataaaactta tttcgaaaag gtcgaaaggt tgaaacatgg taccttcggc 420 cctgtccact tcagaaatca agttaaaata aggaggagag ttgatacaaa ccctggccat 480 gcagatctca gtgccaagga ggcacaggat gtgattatgg aagttgtttt cccaaatgaa 540 gtgggggcaa gaatactgac atcagagtca cagctggcaa taacaaaaga gaagaaagaa 600 gagctccagg attgtaaaat tgctcccttg atggtggcgt acatgctaga aagagaattg 660 gtccgtaaaa caaggtttct cccagtagcc ggcggaacag gcagtgttta tattgaagtg 720 ttgcacttaa cccaagggac gtgctgggag cagatgtaca ctccaggagg agaagtgaga 780 aatgatgatg ttgaccaaag tttgattatc gctgctagaa acatagtaag aagagcagca 840 gtgtcagcag acccattagc atctctcttg gaaatgtgcc acagcacaca gattggagga 900 gtaaggatgg tggacatcct tagacagaat ccaactgagg aacaagccgt agacatatgc 960 aaggcagcaa tagggttgag gattagctca tctttcagtt ttggtgggtt cactttcaaa 1020 aggacaagcg gatcatcagt caagaaagaa gaagaagtgc taacgggcaa cctccaaaca 1080 ctgaaaataa gagtacatga agggtatgaa gaattcacaa tggttgggag aagagcaaca 1140 gctattctca gaaaggcaac caggagattg atccagttga tagtaagcgg gagagacgag 1200 cagtcaattg ctgaggcaat aattgtggcc atggtattct cacaggagga ttgcatgatc 1260 aaggcagtta ggggcgatct gaactttgtc aatagggcaa accagcgact gaaccccatg 1320 caccaactct tgaggcattt ccaaaaagat gcaaaagtgc ttttccagaa ctggggaatt 1380 gaatccatcg acaatgtgat gggaatgatc ggaatactgc ccgacatgac cccaagcacg 1440 gagatgtcgc tgagagggat aagagtcagc aaaatgggag tagatgaata ctccagcacg 1500 gagagagtgg tagtgagtat tgaccgattt ttaagggtta gagatcaaag agggaacgta 1560 ctattgtctc ccgaagaagt cagtgaaacg caaggaactg agaagttgac aataacttat 1620 tcgtcatcaa tgatgtggga gatcaatggc cctgagtcag tgctagtcaa cacttatcaa 1680 tggataatca ggaactggga aattgtgaaa attcaatggt cacaagatcc cacaatgtta 1740 tacaacaaaa tggaatttga accatttcag tctcttgtcc ctaaggcaac cagaagccgg 1800 tacagtggat tcgtaaggac actgttccag caaatgcggg atgtgcttgg gacatttgac 1860 actgtccaaa taataaaact tctccccttt gctgctgccc caccagaaca gagtaggatg 1920 caattttcct cattgactgt gaatgtgaga ggatcagggt tgaggatact ggtaagaggc 1980 aattctccag tattcaatta caacaaggca accaaacgac ttacagttct tggaaaggat 2040 gcaggtgcat tgactgaaga tccagatgaa ggcacatctg gggtggagtc tgctgtcctg 2100 agaggatttc tcattttggg caaagaagac aagagatatg gcccagcatt aagcatcaat 2160 gaactgagca atcttgcaaa aggagagaag gctaatgtgc taattgggca aggggacgta 2220 gtgttggtaa tgaaacgaaa acgggactct agcatactta ctgacagcca gacagcgacc 2280 aaaagaattc ggatggccat caattagtgt cgaattgttt aaaaacgacc ttgtttctac 2340 t 2341 // ID MG027914; SV 1; linear; viral cRNA; STD; VRL; 1777 BP. XX AC MG027914; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Mutant Influenza A virus (A/California/MA_07/2009(H1N1)) segment 4 DE hemagglutinin (HA) gene, complete cds. XX KW . XX OS Influenza A virus (A/California/MA_07/2009(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RC Publication Status: Online-Only RP 1-1777 RX PUBMED; 29783694. RA Slaine P.D., MacRae C., Kleer M., Lamoureux E., McAlpine S., Warhuus M., RA Comeau A.M., McCormick C., Hatchette T., Khaperskyy D.A.; RT "Adaptive Mutations in Influenza A/California/07/2009 Enhance Polymerase RT Activity and Infectious Virion Production"; RL Viruses 10(5):E272-E272(2018). XX RN [2] RP 1-1777 RA Slaine P., MacRae C., Kleer M., Lamoeureoux E., McAlpine S., Warhuus M., RA Comeau A., Khaperskyy D., Hachette T., McCormick C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Microbiology and Immunology, Dalhousie University, 5850 College Street, RL Halifax, Nova Scotia B3H 4R2, Canada XX DR MD5; 53d38acb65fef8484c0141fa3e49d33a. DR EuropePMC; PMC5977265; 29783694. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R 8.1.8 CC Coverage :: >4000 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1777 FT /organism="Influenza A virus FT (A/California/MA_07/2009(H1N1))" FT /segment="4" FT /strain="A/California/MA_07/2009" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /note="California/07/2009 passaged 10 times in FT Swiss-Webster mice" FT /db_xref="taxon:2042258" FT gene 33..1733 FT /gene="HA" FT CDS 33..1733 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /function="receptor binding and fusion protein" FT /db_xref="GOA:A0A343LDT6" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:A0A343LDT6" FT /protein_id="ATN44696.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETPSSDN FT GTCYPGDFIDYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGDSYPKLSKSYINDKGKEVLVLWGIHHPPTSADQQSLYQNADAYVFVGSSR FT YSKKFKPEIAIRPKVRGQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNIPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDEITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" FT sig_peptide 33..83 FT /gene="HA" FT mat_peptide 84..1064 FT /gene="HA" FT /product="HA1" FT mat_peptide 1065..1730 FT /gene="HA" FT /product="HA2" XX SQ Sequence 1777 BP; 633 A; 331 C; 394 G; 419 T; 0 other; agcgaaagca ggggaaaaca aaagcaacaa aaatgaaggc aatactagta gttctgctat 60 atacatttgc aaccgcaaat gcagacacat tatgtatagg ttatcatgcg aacaattcaa 120 cagacactgt agacacagta ctagaaaaga atgtaacagt aacacactct gttaaccttc 180 tagaagacaa gcataacggg aaactatgca aactaagagg ggtagcccca ttgcatttgg 240 gtaaatgtaa cattgctggc tggatcctgg gaaatccaga gtgtgaatca ctctccacag 300 caagctcatg gtcctacatt gtggaaacac ctagttcaga caatggaacg tgttacccag 360 gagatttcat cgattatgag gagctaagag agcaattgag ctcagtgtca tcatttgaaa 420 ggtttgagat attccccaag acaagttcat ggcccaatca tgactcgaac aaaggtgtaa 480 cggcagcatg tcctcatgct ggagcaaaaa gcttctacaa aaatttaata tggctagtta 540 aaaaaggaga ttcataccca aagctcagca aatcctacat taatgataaa gggaaagaag 600 tcctcgtgct atggggcatt caccatccac ctactagtgc tgaccaacaa agtctctatc 660 agaatgcaga tgcatatgtt tttgtggggt catcaagata cagcaagaag ttcaagccgg 720 aaatagcaat aagacccaaa gtgaggggtc aagaagggag aatgaactat tactggacac 780 tagtagagcc gggagacaaa ataacattcg aagcaactgg aaatctagtg gtaccgagat 840 atgcattcgc aatggaaaga aatgctggat ctggtattat catttcagat acaccagtcc 900 acgattgcaa tacaacttgt caaacaccca agggtgctat aaacaccagc ctcccatttc 960 agaatataca tccgatcaca attggaaaat gtccaaaata tgtaaaaagc acaaaattga 1020 gactggccac aggattgagg aatatcccgt ctattcaatc tagaggccta tttggggcca 1080 ttgccggttt cattgaaggg gggtggacag ggatggtaga tggatggtac ggttatcacc 1140 atcaaaatga gcaggggtca ggatatgcag ccgacctgaa gagcacacag aatgccattg 1200 acgagattac taacaaagta aattctgtta ttgaaaagat gaatacacag ttcacagcag 1260 taggtaaaga gttcaaccac ctggaaaaaa gaatagagaa tttaaataaa aaagttgatg 1320 atggtttcct ggacatttgg acttacaatg ccgaactgtt ggttctattg gaaaatgaaa 1380 gaactttgga ctaccacgat tcaaatgtga agaacttata tgaaaaggta agaagccagc 1440 taaaaaacaa tgccaaggaa attggaaacg gctgctttga attttaccac aaatgcgata 1500 acacgtgcat ggaaagtgtc aaaaatggga cttatgacta cccaaaatac tcagaggaag 1560 caaaattaaa cagagaagaa atagatgggg taaagctgga atcaacaagg atttaccaga 1620 ttttggcgat ctattcaact gtcgccagtt cattggtact ggtagtctcc ctgggggcaa 1680 tcagtttctg gatgtgctct aatgggtctc tacagtgtag aatatgtatt taacattagg 1740 atttcagaag catgagaaaa acacccttgt ttctact 1777 // ID MG027915; SV 1; linear; viral cRNA; STD; VRL; 1565 BP. XX AC MG027915; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Mutant Influenza A virus (A/California/MA_07/2009(H1N1)) segment 5 DE nucleocapsid protein (NP) gene, complete cds. XX KW . XX OS Influenza A virus (A/California/MA_07/2009(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RC Publication Status: Online-Only RP 1-1565 RX PUBMED; 29783694. RA Slaine P.D., MacRae C., Kleer M., Lamoureux E., McAlpine S., Warhuus M., RA Comeau A.M., McCormick C., Hatchette T., Khaperskyy D.A.; RT "Adaptive Mutations in Influenza A/California/07/2009 Enhance Polymerase RT Activity and Infectious Virion Production"; RL Viruses 10(5):E272-E272(2018). XX RN [2] RP 1-1565 RA Slaine P., MacRae C., Kleer M., Lamoeureoux E., McAlpine S., Warhuus M., RA Comeau A., Khaperskyy D., Hachette T., McCormick C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Microbiology and Immunology, Dalhousie University, 5850 College Street, RL Halifax, Nova Scotia B3H 4R2, Canada XX DR MD5; 78dd52c6c4ad402115d58cb7b3d1a4bb. DR EuropePMC; PMC5977265; 29783694. XX CC ##Assembly-Data-START## CC Assembly Method :: Geneious v. R 8.1.8 CC Coverage :: >6500 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1565 FT /organism="Influenza A virus FT (A/California/MA_07/2009(H1N1))" FT /segment="5" FT /strain="A/California/MA_07/2009" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /note="California/07/2009 passaged 10 times in FT Swiss-Webster mice" FT /db_xref="taxon:2042258" FT gene 46..1542 FT /gene="NP" FT CDS 46..1542 FT /codon_start=1 FT /gene="NP" FT /product="nucleocapsid protein" FT /db_xref="GOA:A0A343LDT7" FT /db_xref="InterPro:IPR002141" FT /db_xref="UniProtKB/TrEMBL:A0A343LDT7" FT /protein_id="ATN44697.1" FT /translation="MASQGTKRSYEQMETGGERQDATEIRASVGRMIGGIGRFYIQMCT FT ELKLSDYEGRLIQNSITIERMVLSAFDERRNKYLEEHPSAGKDPKKTGGPIYRRVDGKW FT MRELILYDKEEIRRVWRQANNGEDATAGLTHIMIWHSNLNDATYQRTRALVRTGMDPRM FT CSLMQGSTLPRRSGAAGAAVKGVGTIAMELIRMIKRGINDRNFWRGENGRRTRVAYERM FT CNILKGKFQTAAQRAMMDQVRESRNPGNAEIEDLIFLARSALILRGSVAHKSCLPACVY FT GLAVASGHDFEREGYSLVGIDPFKLLQNSQVVSLMRPNENPAHKSQLVWMACHSAAFED FT LRVSSFIRGKKVIPRGKLSTRGVQIASNENVETMDSNTLELRSRYWAIRTRSGGNTNQQ FT KASAGQISVQPTFSVQRNLPFERATVMAAFSGNNEGRTSDMRTEVIRMMESAKPEDLSF FT QGRGVFELSDEKATNPIVPSFDMSNEGSYFFGDNAEEYDS" XX SQ Sequence 1565 BP; 517 A; 308 C; 414 G; 326 T; 0 other; agcgaaagca gggtagataa tcactcaatg agtgacatcg aagccatggc gtctcaaggc 60 accaaacgat catatgaaca aatggagact ggtggggagc gccaggatgc cacagaaatc 120 agagcatctg tcggaagaat gattggtgga atcgggagat tctacatcca aatgtgcact 180 gaactcaaac tcagtgatta tgagggacga ctaatccaga atagcataac aatagagagg 240 atggtgcttt ctgcttttga tgagagaaga aataaatacc tagaagagca tcccagtgct 300 gggaaggacc ctaagaaaac aggaggaccc atatatagaa gagtagacgg aaagtggatg 360 agagaactca tcctttatga caaagaagaa ataaggagag tttggcgcca agcaaacaat 420 ggcgaagatg caacagcagg tcttactcat atcatgattt ggcattccaa cctgaatgat 480 gccacatatc agagaacaag agcgcttgtt cgcaccggaa tggatcccag aatgtgctct 540 ctaatgcaag gttcaacact tcccagaagg tctggtgccg caggtgctgc ggtgaaagga 600 gttggaacaa tagcaatgga gttaatcaga atgatcaaac gtggaatcaa tgaccgaaat 660 ttctggaggg gtgaaaatgg acgaaggaca agggttgctt atgaaagaat gtgcaatatc 720 ctcaaaggaa aatttcaaac agctgcccag agggcaatga tggatcaagt aagagaaagt 780 cgaaacccag gaaacgctga gattgaagac ctcattttcc tggcacggtc agcactcatt 840 ctgaggggat cagttgcaca taaatcctgc ctgcctgctt gtgtgtatgg gcttgcagta 900 gcaagtgggc atgactttga aagggaaggg tactcactgg tcgggataga cccattcaaa 960 ttactccaaa acagccaagt ggtcagcctg atgagaccaa atgaaaaccc agctcacaag 1020 agtcaattgg tgtggatggc atgccactct gctgcatttg aagatttaag agtatcaagt 1080 ttcataagag gaaagaaagt gattccaaga ggaaagcttt ccacaagagg ggtccagatt 1140 gcttcaaatg agaatgtgga aaccatggac tccaataccc tggaactgag aagcagatac 1200 tgggccataa ggaccaggag tggaggaaat accaatcaac aaaaggcatc cgcaggccag 1260 atcagtgtgc agcctacatt ctcagtgcag cggaatctcc cttttgaaag agcaaccgtt 1320 atggcagcat tcagcgggaa caatgaagga cggacatccg acatgcgaac agaagttata 1380 agaatgatgg aaagtgcaaa gccagaagat ttgtccttcc aggggcgggg agtcttcgag 1440 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaaggg 1500 tcttatttct tcggagacaa tgcagaggag tatgacagtt gaggaaaaat acccttgttt 1560 ctact 1565 // ID MG029094; SV 1; linear; genomic RNA; STD; VRL; 786 BP. XX AC MG029094; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCLS-2-1/2017/G9P[23]I1 VP6 (VP6) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-786 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; 672151e712de1e40ca8b18cb5b3b256c. XX FH Key Location/Qualifiers FH FT source 1..786 FT /organism="Porcine rotavirus A" FT /segment="6" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCLS-2-1/2017/G9P[23]I1" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="May-2017" FT /note="genotype: I1" FT /db_xref="taxon:10967" FT gene <1..>786 FT /gene="VP6" FT CDS <1..>786 FT /codon_start=3 FT /gene="VP6" FT /product="VP6" FT /db_xref="GOA:A0A2P1M9K4" FT /db_xref="InterPro:IPR001385" FT /db_xref="InterPro:IPR008935" FT /db_xref="InterPro:IPR008980" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9K4" FT /protein_id="AVP27310.1" FT /translation="ESQRNGVAPQSEALRKLAGIKFKRINFDNSSEYIENWNLQNRRQR FT TGFVFHKPNIFPYSASFTLNRSQPMHDNLMGTMWLNAGSEIQVAGFDYSCAINAPANIQ FT QFEHIVQLRRALTTATITLLPDAERFSFPRVINSADGATTWFFNPVILRPNNVEVEFLL FT NGQIINTYQARFGTIIARNFDTIRLSFQLMRPPNMTPAVNALFPQAQPFQHHATVGLTL FT RIEPAVCESVLADANETLLANVTAVRQEYAIPVGPVFPP" XX SQ Sequence 786 BP; 258 A; 161 C; 145 G; 222 T; 0 other; gagaatcaca acgaaatgga gtagctccac aatctgaagc gttgaggaag ttggcaggca 60 ttaaattcaa gagaataaat tttgataatt catcagagta catagagaat tggaatttac 120 aaaacagaag acagcgcact ggatttgttt ttcataaacc taacatattt ccatactcag 180 cttcatttac tctaaacaga tctcaaccaa tgcatgataa cttaatggga actatgtggc 240 ttaatgctgg atcagaaatt caagtggctg gatttgatta ctcatgcgcc ataaatgcac 300 cagcgaacat acagcagttt gagcacatcg tccaactcag acgtgcactg actacagcta 360 ctataacttt gttacctgat gcagaaagat ttagctttcc aagagtcatt aactcagctg 420 atggcgcgac tacatggttc ttcaatccag ttattctaag accaaataat gtagaagttg 480 aatttttatt gaatggacaa attattaata catatcaagc tagatttggc actattattg 540 caagaaactt cgatacaatt cgtttgtcat ttcaattaat gcgtccacca aacatgacac 600 cagctgttaa cgcactattc ccgcaagcac aaccttttca acaccatgca acagttggac 660 ttacattacg tattgaacct gcagtttgtg aatcagtgct tgcggatgca aatgaaactc 720 tgttggcgaa tgtgaccgca gtacgtcaag agtatgctat accagttgga ccagtatttc 780 caccag 786 // ID MG029095; SV 1; linear; genomic RNA; STD; VRL; 883 BP. XX AC MG029095; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCLS-2-1/2017/G9P[23]I1 VP7 (VP7) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-883 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; 33e944ac58beb2b86ce427df095ff465. XX FH Key Location/Qualifiers FH FT source 1..883 FT /organism="Porcine rotavirus A" FT /segment="9" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCLS-2-1/2017/G9P[23]I1" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="May-2017" FT /note="genotype: G9" FT /db_xref="taxon:10967" FT gene <1..>883 FT /gene="VP7" FT CDS <1..>883 FT /codon_start=1 FT /gene="VP7" FT /product="VP7" FT /db_xref="GOA:A0A2P1M9L1" FT /db_xref="InterPro:IPR001963" FT /db_xref="InterPro:IPR042207" FT /db_xref="InterPro:IPR042210" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9L1" FT /protein_id="AVP27318.1" FT /translation="MYGIEYTTVLTFLISIILLNYILKSLTSAMDFIIYRFLLLVVIAS FT PFVKTQNYGINLPITGSMDTAYANSSQQETFLTSTLCLYYPTEASTQIGDTEWKDTLSQ FT LFLTKGWPTGSVYFKEYTNIASFSIDPQLYCDYNVVLMKYDSTLELDMSELADLILNEW FT LCNPMDITLYYYQQTDEANKWISKGQSCTIKVCPLNTQTLGIGCITTNTATFEEVATSE FT KLVITDVVDGVNHKLDVTTNTCTIRNCKKLGPRENIAIIQVGGSDVLDITADPTTAPQT FT ERMMRVNWKKWWQ" XX SQ Sequence 883 BP; 310 A; 136 C; 161 G; 276 T; 0 other; atgtatggta ttgaatatac cacagttcta acctttctga tatcaataat tttattgaac 60 tatatattaa aatcactaac tagtgcgatg gactttataa tttatagatt tcttttactt 120 gttgttattg catcaccatt tgttaaaaca caaaattatg gaattaattt accgatcact 180 ggctccatgg acacagcata tgcaaattct tcacagcaag aaacattttt gacttcaacg 240 ctatgcttat attatcctac agaagcatca actcaaattg gagatacgga atggaaggat 300 actctgtccc aattattctt gactaagggg tggccaactg gatcagttta ttttaaagaa 360 tacactaata tcgcttcatt ctcaattgat ccgcaacttt attgtgatta taatgttgta 420 ctgatgaagt atgattcaac gttagagcta gatatgtctg aattagctga tttaattcta 480 aatgaatggt tatgtaaccc aatggatata acattatatt attatcagca aacagatgaa 540 gcgaataaat ggatatcgaa gggacagtct tgtaccataa aagtatgtcc attaaatacg 600 cagactttag gaataggttg tattactacg aatacagcaa catttgaaga ggtagctaca 660 agtgaaaaat tagtaataac cgatgttgtt gatggcgtga accataaact tgatgtaact 720 acaaatacct gtacaattag gaattgtaag aagttaggac caagagaaaa tatagcgatt 780 atacaagtcg gtggttcaga tgtgttagat attacagcgg atccaactac tgcaccacaa 840 actgagcgta tgatgcgagt aaattggaaa aagtggtggc aag 883 // ID MG029096; SV 1; linear; genomic RNA; STD; VRL; 786 BP. XX AC MG029096; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCLS-2-3/2017/G9P[23]I1 VP6 (VP6) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-786 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; 428edcdb0fc57f8e3284fff05a7f9d5b. XX FH Key Location/Qualifiers FH FT source 1..786 FT /organism="Porcine rotavirus A" FT /segment="6" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCLS-2-3/2017/G9P[23]I1" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="May-2017" FT /note="genotype: I1" FT /db_xref="taxon:10967" FT gene <1..>786 FT /gene="VP6" FT CDS <1..>786 FT /codon_start=3 FT /gene="VP6" FT /product="VP6" FT /db_xref="GOA:A0A2P1M9K5" FT /db_xref="InterPro:IPR001385" FT /db_xref="InterPro:IPR008935" FT /db_xref="InterPro:IPR008980" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9K5" FT /protein_id="AVP27311.1" FT /translation="ESQRNGVAPQSEALRKLAGIKFKRISFDNSSEYIENWNLQNRRQR FT TGFVFHKPNIFPYSASFTLNRSQPMHDNLMGTMWLNAGSEIQVAGFDYSCAINAPANIQ FT QFEHIIQLRRALTTATVTLLPDAERFSFPRVINSADGATTWFFNPVILRPNNVEVEFLL FT NGQIINTYQARFGTIIARNFDTIRLSFQLMRPPNMTPAVNALFPQAQPFQHHATVGLTL FT RIESAVCESVLADANETLLANVTAVRQEYAIPVGPVFPP" XX SQ Sequence 786 BP; 256 A; 160 C; 147 G; 223 T; 0 other; gagaatcaca gcgaaatgga gtagctccac aatctgaagc gttgaggaag ttggcaggca 60 ttaaattcaa gagaataagt tttgataatt catcagagta catagagaat tggaatttac 120 aaaacagaag acagcgcact ggatttgttt ttcataaacc taacatattt ccatactcag 180 cttcatttac tctaaacaga tctcaaccaa tgcatgataa cttaatggga actatgtggc 240 ttaatgctgg atcagaaatt caagtggctg gatttgatta ctcatgcgcc ataaatgcac 300 cagcgaacat acagcagttt gagcacatca tccaactcag acgtgcactg actacagcta 360 ctgtaacttt gttacctgat gcagaaagat ttagctttcc aagagtcatt aactcagctg 420 atggcgcgac tacatggttc ttcaatccag ttattctaag accaaataat gtagaagttg 480 aatttttatt gaatggacaa attattaata catatcaagc tagatttggc actattattg 540 caagaaactt cgatacaatt cgtttgtcat ttcaattaat gcgtccacca aacatgacac 600 cagctgttaa cgcactattc ccgcaagcac aaccttttca acaccatgca acagttggac 660 ttacattacg tattgaatct gcagtttgtg aatcagtgct tgcggatgca aatgaaactc 720 tgttggcgaa tgtgaccgca gtacgtcaag agtatgctat accagttgga ccagtatttc 780 caccag 786 // ID MG029097; SV 1; linear; genomic RNA; STD; VRL; 883 BP. XX AC MG029097; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCLS-2-3/2017/G9P[23]I1 VP7 (VP7) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-883 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; 4e9a3ea4d0746c398706715bdbdc1d22. XX FH Key Location/Qualifiers FH FT source 1..883 FT /organism="Porcine rotavirus A" FT /segment="9" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCLS-2-3/2017/G9P[23]I1" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="May-2017" FT /note="genotype: G9" FT /db_xref="taxon:10967" FT gene <1..>883 FT /gene="VP7" FT CDS <1..>883 FT /codon_start=1 FT /gene="VP7" FT /product="VP7" FT /db_xref="GOA:A0A2P1M9L5" FT /db_xref="InterPro:IPR001963" FT /db_xref="InterPro:IPR042207" FT /db_xref="InterPro:IPR042210" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9L5" FT /protein_id="AVP27319.1" FT /translation="MYGIEYTTVLTFLISIILLNYILKSLTSAMDFIIYRFLLLVVIAS FT PFVKTQNYGINLPITGSMDTAYANSSQQETFLTSTLCLYYPTEASTQIGDTEWKDTLSQ FT LFLTKGWPTGSVYFKEYTNIASFSIDPQLYCDYNVVLMKYDSTLELDMSELADLILNEW FT LCNPMDITLYYYQQTDEANKWISMGQSCTIKVCPLNTQTLGIGCITTNTATFEEVATSE FT KLVITDVVDGVNHKLDVTTNTCTIRNCKKLGPRENIAIIQVGGSDVLDITADPTTAPQT FT ERMMRVNWKKWWQ" XX SQ Sequence 883 BP; 309 A; 136 C; 161 G; 277 T; 0 other; atgtatggta ttgaatatac cacagttcta acctttctga tatcaataat tttattgaac 60 tatatattaa aatcactaac tagtgcgatg gactttataa tttatagatt tcttttactt 120 gttgttattg catcaccatt tgttaaaaca caaaattatg gaattaattt accgatcact 180 ggctccatgg acacagcata tgcaaattct tcacagcaag aaacattttt gacttcaacg 240 ctatgcttat attatcctac agaagcatca actcaaattg gagatacgga atggaaggat 300 actctgtccc aattattctt gactaagggg tggccaactg gatcagttta ttttaaagaa 360 tacactaata tcgcttcatt ctcaattgat ccgcaacttt attgtgatta taatgttgta 420 ctgatgaagt atgattcaac gttagagcta gatatgtctg aattagctga tttaattcta 480 aatgaatggt tatgtaaccc aatggatata acattatatt attatcagca aacagatgaa 540 gcgaataaat ggatatcgat gggacagtct tgtaccataa aagtatgtcc attaaatacg 600 cagactttag gaataggttg tattactacg aatacagcaa catttgaaga ggtagctaca 660 agtgaaaaat tagtaataac cgatgttgtt gatggcgtga accataaact tgatgtaact 720 acaaatacct gtacaattag gaattgtaag aagttaggac caagagaaaa tatagcgatt 780 atacaagtcg gtggttcaga tgtgttagat attacagcgg atccaactac tgcaccacaa 840 actgagcgta tgatgcgagt aaattggaaa aagtggtggc aag 883 // ID MG029098; SV 1; linear; genomic RNA; STD; VRL; 786 BP. XX AC MG029098; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCLS-2-4/2017/I1 VP6 (VP6) gene, DE partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-786 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; e1ba86250128d8c82e6206fe8d469a79. XX FH Key Location/Qualifiers FH FT source 1..786 FT /organism="Porcine rotavirus A" FT /segment="6" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCLS-2-4/2017/I1" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="May-2017" FT /note="genotype: I1" FT /db_xref="taxon:10967" FT gene <1..>786 FT /gene="VP6" FT CDS <1..>786 FT /codon_start=3 FT /gene="VP6" FT /product="VP6" FT /db_xref="GOA:A0A2P1M9K2" FT /db_xref="InterPro:IPR001385" FT /db_xref="InterPro:IPR008935" FT /db_xref="InterPro:IPR008980" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9K2" FT /protein_id="AVP27312.1" FT /translation="ESQRNGVAPQSEALRKLAGIKFKRINFDNSSEYIENWNLQNRRQR FT TGFVFHKPNIFPYSASFTLNRSQPMHDNLMGTMWLNAGSEIQVAGFDYSCAINAPANIQ FT QFEHIVQLRRALTTATITLLPDAERFSFPRVINSADGATTWFFNPVILRPNNVEVEFLL FT NGQIINTYQARFGTIIARNFDTIRLSFQLMRPPNMTPAVNALFPQAQPFQHHATVGLTL FT RIESAVCESVLADANETLLANVTAVRQEYAIPVGPVFPP" XX SQ Sequence 786 BP; 258 A; 161 C; 145 G; 222 T; 0 other; gagaatcaca acgaaacgga gtagctccac aatctgaagc gttgaggaag ttggcaggca 60 ttaaattcaa gagaataaat tttgataatt catcagagta catagagaat tggaatttac 120 aaaacagaag acagcgcact ggatttgttt ttcataaacc taacatattt ccatactcag 180 cttcatttac tctaaacaga tctcaaccaa tgcatgataa cttaatggga actatgtggc 240 ttaatgctgg atcagaaatt caagtggctg gatttgatta ctcatgcgcc ataaatgcac 300 cagcgaacat acagcagttt gagcacatcg tccaactcag acgtgcactg actacagcta 360 ctataacttt gttacctgat gcagaaagat ttagctttcc aagagtcatt aactcagctg 420 atggcgcgac tacatggttc ttcaatccag ttattctaag accaaataat gtagaagttg 480 aatttttatt gaatggacaa attattaata catatcaagc tagatttggc actattattg 540 caagaaactt cgatacaatt cgtttgtcat ttcaattaat gcgtccacca aacatgacac 600 cagctgttaa cgcactattc ccgcaagcac aaccttttca acaccatgca acagttggac 660 ttacattacg tattgaatct gcagtttgtg aatcagtgct tgcggatgca aatgaaactc 720 tgttggcgaa tgtgaccgca gtacgtcaag agtatgctat accagttgga ccagtatttc 780 caccag 786 // ID MG029099; SV 1; linear; genomic RNA; STD; VRL; 786 BP. XX AC MG029099; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCLS-2-6/2017/G9P[23]I1 VP6 (VP6) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-786 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; 2deea3a8e0de9642c15b0ca47baf752f. XX FH Key Location/Qualifiers FH FT source 1..786 FT /organism="Porcine rotavirus A" FT /segment="6" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCLS-2-6/2017/G9P[23]I1" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="May-2017" FT /note="genotype: I1" FT /db_xref="taxon:10967" FT gene <1..>786 FT /gene="VP6" FT CDS <1..>786 FT /codon_start=3 FT /gene="VP6" FT /product="VP6" FT /db_xref="GOA:A0A2P1M9K3" FT /db_xref="InterPro:IPR001385" FT /db_xref="InterPro:IPR008935" FT /db_xref="InterPro:IPR008980" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9K3" FT /protein_id="AVP27313.1" FT /translation="ESQRNGVAPQSEALRKLAGIKFKRINFDNSSEYIENWNLQNRRQR FT TGFVFHKPNIFPYSASFTLNRSQPMHDNLMGTMWLNAGSEIQVAGFDYSCAINAPANIQ FT QFEHIVQLRRALTTATITLLPDAERFSFPRVINSADGATTWFFNPVILRPNNVEVEFLL FT NGQIINTYQARFGTIIARNFDTIRLSFQLMRPPNMTPAVNALFPQAQPFQHHATVGLTL FT RIESAVCESVLADANETLLANVTAVRQEYAIPVGPVFPP" XX SQ Sequence 786 BP; 258 A; 161 C; 145 G; 222 T; 0 other; gagaatcaca acgaaatgga gtagctccac aatctgaagc gttgaggaag ttggcaggca 60 ttaaattcaa gagaataaat tttgataatt catcagagta catagagaat tggaatttac 120 aaaacagaag acagcgcact ggatttgttt ttcataaacc taacatattt ccatactcag 180 cttcatttac tctaaacaga tctcaaccaa tgcatgataa cttaatggga actatgtggc 240 ttaatgctgg atcagaaatt caagtggctg gatttgatta ctcatgcgcc ataaatgcac 300 cagcgaacat acagcagttt gagcacatcg tccaactcag acgtgcactg actacagcta 360 ctataacttt gttacctgat gcagaaagat ttagctttcc aagagtcatt aactcagctg 420 atggcgcgac tacatggttc ttcaatccag ttattctaag accaaataat gtagaagttg 480 aatttttatt gaatggacaa attattaata catatcaagc tagatttggc actattattg 540 caagaaactt cgatacaatt cgtttgtcat ttcaattaat gcgtccacca aacatgacac 600 cagctgttaa cgcactattc ccgcaagcac aaccttttca acaccatgca acagttggac 660 ttacattacg tattgaatct gcagtttgtg aatcagtgct tgcggatgca aatgaaaccc 720 tgttggcgaa tgtgaccgca gtacgtcaag agtatgctat accagttgga ccagtatttc 780 caccag 786 // ID MG029100; SV 1; linear; genomic RNA; STD; VRL; 883 BP. XX AC MG029100; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCLS-2-6/2017/G9P[23]I1 VP7 (VP7) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-883 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; 4e9a3ea4d0746c398706715bdbdc1d22. XX FH Key Location/Qualifiers FH FT source 1..883 FT /organism="Porcine rotavirus A" FT /segment="9" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCLS-2-6/2017/G9P[23]I1" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="May-2017" FT /note="genotype: G9" FT /db_xref="taxon:10967" FT gene <1..>883 FT /gene="VP7" FT CDS <1..>883 FT /codon_start=1 FT /gene="VP7" FT /product="VP7" FT /db_xref="GOA:A0A2P1M9L2" FT /db_xref="InterPro:IPR001963" FT /db_xref="InterPro:IPR042207" FT /db_xref="InterPro:IPR042210" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9L2" FT /protein_id="AVP27320.1" FT /translation="MYGIEYTTVLTFLISIILLNYILKSLTSAMDFIIYRFLLLVVIAS FT PFVKTQNYGINLPITGSMDTAYANSSQQETFLTSTLCLYYPTEASTQIGDTEWKDTLSQ FT LFLTKGWPTGSVYFKEYTNIASFSIDPQLYCDYNVVLMKYDSTLELDMSELADLILNEW FT LCNPMDITLYYYQQTDEANKWISMGQSCTIKVCPLNTQTLGIGCITTNTATFEEVATSE FT KLVITDVVDGVNHKLDVTTNTCTIRNCKKLGPRENIAIIQVGGSDVLDITADPTTAPQT FT ERMMRVNWKKWWQ" XX SQ Sequence 883 BP; 309 A; 136 C; 161 G; 277 T; 0 other; atgtatggta ttgaatatac cacagttcta acctttctga tatcaataat tttattgaac 60 tatatattaa aatcactaac tagtgcgatg gactttataa tttatagatt tcttttactt 120 gttgttattg catcaccatt tgttaaaaca caaaattatg gaattaattt accgatcact 180 ggctccatgg acacagcata tgcaaattct tcacagcaag aaacattttt gacttcaacg 240 ctatgcttat attatcctac agaagcatca actcaaattg gagatacgga atggaaggat 300 actctgtccc aattattctt gactaagggg tggccaactg gatcagttta ttttaaagaa 360 tacactaata tcgcttcatt ctcaattgat ccgcaacttt attgtgatta taatgttgta 420 ctgatgaagt atgattcaac gttagagcta gatatgtctg aattagctga tttaattcta 480 aatgaatggt tatgtaaccc aatggatata acattatatt attatcagca aacagatgaa 540 gcgaataaat ggatatcgat gggacagtct tgtaccataa aagtatgtcc attaaatacg 600 cagactttag gaataggttg tattactacg aatacagcaa catttgaaga ggtagctaca 660 agtgaaaaat tagtaataac cgatgttgtt gatggcgtga accataaact tgatgtaact 720 acaaatacct gtacaattag gaattgtaag aagttaggac caagagaaaa tatagcgatt 780 atacaagtcg gtggttcaga tgtgttagat attacagcgg atccaactac tgcaccacaa 840 actgagcgta tgatgcgagt aaattggaaa aagtggtggc aag 883 // ID MG029101; SV 1; linear; genomic RNA; STD; VRL; 786 BP. XX AC MG029101; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCQL-2-1/2017/G9P[13]I5 VP6 (VP6) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-786 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; 9eb82ee93eb972ba9f9958c9aa50d750. XX FH Key Location/Qualifiers FH FT source 1..786 FT /organism="Porcine rotavirus A" FT /segment="6" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCQL-2-1/2017/G9P[13]I5" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="Mar-2017" FT /note="genotype: I5" FT /db_xref="taxon:10967" FT gene <1..>786 FT /gene="VP6" FT CDS <1..>786 FT /codon_start=3 FT /gene="VP6" FT /product="VP6" FT /db_xref="GOA:A0A2P1M9J9" FT /db_xref="InterPro:IPR001385" FT /db_xref="InterPro:IPR008935" FT /db_xref="InterPro:IPR008980" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9J9" FT /protein_id="AVP27314.1" FT /translation="ESQRNGIAPQSEALRKLSGIKFKRINFDNSSDYIENWNLQNRRQR FT TGFVFHKPNILPYSASFTLNRSQPAHDNLMGTMWINAGSEIQVAGFDYSCALNAPANIQ FT QFEHVVPLRRALTTATITLLPDAERFSSPRVINSADGTTTWYFNPVVLRPSNVEVEFLL FT NGQIINTYQARFGTIVARNFDTIRLSFQLVRPPNMTPAVAHLFPQAPPFIFHATVGLTM FT RIESAVCESVLADASETLLANVTSVRQEYAIPAGPVFPP" XX SQ Sequence 786 BP; 254 A; 157 C; 145 G; 230 T; 0 other; gagaatcaca gcgaaatggg atagcaccac aatctgaagc actgagaaag ctgtcgggta 60 ttaaatttaa gagaattaat tttgataatt catctgatta tattgaaaat tggaatctac 120 agaataggcg acagcgcact ggatttgtat ttcataaacc aaatatactt ccatattcag 180 catcatttac tctaaatcga tcacagccag ctcatgataa tttgatggga actatgtgga 240 ttaacgctgg atcagaaatc caagtagctg gattcgatta ttcatgtgct ttaaacgcac 300 cagcaaacat tcagcagttt gaacacgttg taccgctaag acgtgccctc acaacagcta 360 caattacttt gctaccagac gcagaaagat ttagttcacc tagagtaatt aattcagctg 420 atggtactac tacatggtat tttaatccag ttgttctaag accaagtaat gtagaagttg 480 aatttctact gaatggacag ataattaaca catatcaagc acgatttgga actatcgtag 540 ctagaaattt tgatactatt cgtttatcat ttcaattagt acgtccaccg aatatgacac 600 cagcagttgc acacttattt ccgcaagcac caccatttat atttcacgct acagttggac 660 ttacaatgcg cattgaatct gcagtttgtg aatctgtgct tgcggacgct tcagaaactt 720 tgttggcaaa tgtaacatca gtacgtcagg agtatgcaat accagcagga ccagtttttc 780 caccag 786 // ID MG029102; SV 1; linear; genomic RNA; STD; VRL; 883 BP. XX AC MG029102; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCQL-2-1/2017/G9P[13]I5 VP7 (VP7) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-883 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; 91166f1d5a251171a916f5fb2a2d2656. XX FH Key Location/Qualifiers FH FT source 1..883 FT /organism="Porcine rotavirus A" FT /segment="9" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCQL-2-1/2017/G9P[13]I5" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="Mar-2017" FT /note="genotype: G9" FT /db_xref="taxon:10967" FT gene <1..>883 FT /gene="VP7" FT CDS <1..>883 FT /codon_start=1 FT /gene="VP7" FT /product="VP7" FT /db_xref="GOA:A0A2P1M9L4" FT /db_xref="InterPro:IPR001963" FT /db_xref="InterPro:IPR042207" FT /db_xref="InterPro:IPR042210" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9L4" FT /protein_id="AVP27321.1" FT /translation="MYGIEYTTVLTFLISIILLNYILKSLTSAMDFIIYRFLLLVVIAS FT PFVKTQNYGINLPITGSMDTAYANSSQQETFLTSTLCLYYPTEASTQIGDTEWKDTLSQ FT LFLTKGWPTGSVYFKEYTNIASFSIDPQLYCDYNVVLMKYDSTLELDMSELADLILNEW FT LCNPMDITLYYYQQTDEANKWISMGQSCTIKVCPLNTQTLGIGCITTNTATFEEVATSE FT KLVITDVVDGVNHKLDVTTNTCTIRNCKKLGPRENIAIIQVGGSDVLDITADPTTAPQT FT ERMMRVNWKKWWQ" XX SQ Sequence 883 BP; 308 A; 136 C; 162 G; 277 T; 0 other; atgtatggta ttgaatatac cacagttcta acctttctga tatcaataat tttattgaac 60 tatatattaa aatcactaac tagtgcgatg gactttataa tttatagatt tcttttactt 120 gttgttattg catcaccatt tgttaaaaca caaaattatg gaattaattt accgatcact 180 ggctccatgg acacagcata tgcaaattct tcacagcaag aaacattttt gacttcaacg 240 ctatgcttat attatcctac agaagcatca actcaaattg gagatacgga atggaaggat 300 actctgtccc aattattctt gactaagggg tggccaactg gatcagttta ttttaaagaa 360 tacactaata tcgcttcatt ctcaattgat ccgcaacttt attgtgatta taatgttgta 420 ctgatgaagt atgattcaac gttagagcta gatatgtctg aattagctga tttaattcta 480 aatgaatggt tatgtaaccc aatggatata acattatatt attatcagca aacagatgaa 540 gcgaataaat ggatatcgat gggacagtct tgtaccataa aagtatgtcc attaaatacg 600 cagactttag gaataggttg tattactacg aatacagcaa catttgaaga ggtagctaca 660 agtgaaaaat tagtaataac cgatgttgtt gatggcgtga accataaact tgatgtaact 720 acaaatacct gtacaattag gaattgtaag aagttaggac caagagaaaa tatagcgatt 780 atacaagtcg gtggttcaga tgtgttagat attacagcgg atccaactac tgcaccacaa 840 actgagcgta tgatgcgagt aaattggaag aagtggtggc aag 883 // ID MG029103; SV 1; linear; genomic RNA; STD; VRL; 786 BP. XX AC MG029103; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCQL-5-1/2017/G3P[13]I5 VP6 (VP6) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-786 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; b22eb51b70a64cf6ec57d8b82ee99070. XX FH Key Location/Qualifiers FH FT source 1..786 FT /organism="Porcine rotavirus A" FT /segment="6" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCQL-5-1/2017/G3P[13]I5" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="May-2017" FT /note="genotype: I5" FT /db_xref="taxon:10967" FT gene <1..>786 FT /gene="VP6" FT CDS <1..>786 FT /codon_start=3 FT /gene="VP6" FT /product="VP6" FT /db_xref="GOA:A0A2P1M9K9" FT /db_xref="InterPro:IPR001385" FT /db_xref="InterPro:IPR008935" FT /db_xref="InterPro:IPR008980" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9K9" FT /protein_id="AVP27315.1" FT /translation="ESQRNGIAPQSEALRKLSGIKFKRINFDNSSDYIENWNLQNRRQR FT TGFVFHKPNILPYSASFTLNRSQPAHDNLMGTMWINAGSEIQVAGFDYSCAINAPANIQ FT QFEHVVPLRRALTTATITLLPDAERFSFQRVINSADGTTTWHFNPVILRPSNVEVEFLL FT NGQTINTYQARFGTIIARNFDTIRLSFQLVRPPNMTPAVAQLFPQAPPFIFHATVGLTM FT RIESAVCESVLADASETLLANVTSVRQEYAIPVGPVFPP" XX SQ Sequence 786 BP; 258 A; 164 C; 144 G; 220 T; 0 other; gagaatcaca gcgaaatgga atagcaccac aatctgaagc actaagaaag ctgtcaggta 60 ttaaatttaa aagaattaat tttgataatt cgtctgatta tattgaaaat tggaatctac 120 agaatagacg gcagcgcact gggtttgtat ttcataaacc aaatatactt ccatattcgg 180 catcattcac cctaaatcgg tcacagccag ctcatgataa tttgatggga actatgtgga 240 ttaacgctgg atcagaaatt caagtagctg gattcgatta ttcatgtgct attaatgcac 300 cagcaaacat tcaacagttt gaacacgtcg taccgctaag acgcgctctc acaacagcta 360 cgattactct actgccagat gcagaaagat tcagtttcca gagagtaatt aattcagccg 420 atggtactac cacatggcac ttcaatccag ttatcctaag accaagtaat gtagaggttg 480 aatttctatt aaatggacag acaattaata catatcaagc acgatttgga accatcatag 540 ctagaaattt tgacaccatc cgtttatcat ttcaattggt acgtccaccg aatatgacac 600 cagcagttgc acaactattt ccgcaagcac caccatttat atttcatgct acagttggac 660 ttacaatgcg cattgaatct gcagtttgtg aatctgtgct tgcggatgct tcagaaactt 720 tgttagcaaa tgtaacatca gtacgtcagg agtatgcaat accagtagga ccagtatttc 780 caccag 786 // ID MG029104; SV 1; linear; genomic RNA; STD; VRL; 883 BP. XX AC MG029104; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCQL-5-1/2017/G3P[13]I5 VP7 (VP7) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-883 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; 8b1c469a995959dea3b3d9efe34fcef9. XX FH Key Location/Qualifiers FH FT source 1..883 FT /organism="Porcine rotavirus A" FT /segment="9" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCQL-5-1/2017/G3P[13]I5" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="May-2017" FT /note="genotype: G3" FT /db_xref="taxon:10967" FT gene <1..>883 FT /gene="VP7" FT CDS <1..>883 FT /codon_start=1 FT /gene="VP7" FT /product="VP7" FT /db_xref="GOA:A0A2P1M9L6" FT /db_xref="InterPro:IPR001963" FT /db_xref="InterPro:IPR042207" FT /db_xref="InterPro:IPR042210" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9L6" FT /protein_id="AVP27322.1" FT /translation="MYGIEYTTVLTFLISVILLNYVLKSLTRMMDYIIYRFLFIIVILS FT PLLNAQNYGINLPITGSMDTPYANSTQGEVFLTSTLCLYYPTEAATQINDNSWKDTLSQ FT LFLTKGWPTGSIYFKDYADIASFSVDPQLYCDYNLVLMKYDAALQLDMSELADLILNEW FT LCNPMDITLYYYQQTDETNKWISMGSSCTIKVCPLNTQTLGIGCLTTDTNTFEEVATAE FT KLVITDVVDGANHKLSVTTNTCTIRNCKKLGPRENVAIIQVGGSDVLDITADPTTAPQT FT ERMMRVNWKKWWQ" XX SQ Sequence 883 BP; 315 A; 138 C; 156 G; 274 T; 0 other; atgtatggta ttgaatatac cacagtttta acctttttga tatcggttat attgttgaat 60 tacgtactta aatcattaac tagaatgatg gactacatta tctatagatt tcttttcatt 120 atagttatac tgtcaccact tcttaatgca caaaactatg gaataaatct tccaattact 180 ggatcaatgg atactccata tgcaaactcg acacaaggag aagtgtttct aacatcaact 240 ctatgtttgt attatccaac tgaagctgcg acacaaataa atgacaattc atggaaagat 300 acactttctc aactattttt aactaaagga tggccaacag gatctatcta ttttaaagat 360 tatgcagata ttgcttcatt ttcagttgat ccacaattat actgtgatta taatttagtg 420 ttaatgaagt acgacgccgc attacagtta gatatgtcag aattagcaga tttgatactt 480 aatgagtggt tatgtaatcc tatggatatt actttatatt attatcaaca aactgatgag 540 acaaacaagt ggatttcgat gggatcatct tgtaccataa aagtatgtcc attgaataca 600 caaacattag gaattgggtg tctgactact gatacaaata cgtttgaaga agttgcaaca 660 gctgaaaaat tagtaattac tgacgttgta gatggggcca atcataaact aagcgtgacg 720 acaaatactt gtacaattag gaactgtaaa aaattaggac caagagaaaa cgtagcaatt 780 atacaggttg gtggttcaga tgtacttgac ataacagctg atccaacgac agctccacaa 840 acagaaagaa tgatgcgagt aaattggaaa aagtggtggc aag 883 // ID MG029105; SV 1; linear; genomic RNA; STD; VRL; 786 BP. XX AC MG029105; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCQL-5-2/2017/G3P[13]I5 VP6 (VP6) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-786 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; c412d5d73a63f630f139c1161b33ec4c. XX FH Key Location/Qualifiers FH FT source 1..786 FT /organism="Porcine rotavirus A" FT /segment="6" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCQL-5-2/2017/G3P[13]I5" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="May-2017" FT /note="genotype: I5" FT /db_xref="taxon:10967" FT gene <1..>786 FT /gene="VP6" FT CDS <1..>786 FT /codon_start=3 FT /gene="VP6" FT /product="VP6" FT /db_xref="GOA:A0A2P1M9M6" FT /db_xref="InterPro:IPR001385" FT /db_xref="InterPro:IPR008935" FT /db_xref="InterPro:IPR008980" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9M6" FT /protein_id="AVP27316.1" FT /translation="ESQRNGIAPQSEALRKLSGIKFKRINFDNSSDYIENWNLQNRRQR FT TGFVFHKPNILPYSASFTLNRSQPAHDNLMGTMWINAGSEIQVAGFDYSCAINAPANIQ FT QFEHVVPLRRALTTATITLLPDAERFSFQRVVNSADGTTTWHFNPVILRPSNVEVEFLL FT NGQTINTYQARFGTIIARNFDTIRLSFQLVRPPNMTPAVAQLFPQAPPFIFHATVGLTM FT RIESAVCESVLADASETLLANVTSVRQEYAIPVGPVFPP" XX SQ Sequence 786 BP; 258 A; 165 C; 144 G; 219 T; 0 other; gagaatcaca acgaaatgga atagcaccac aatctgaagc actaagaaag ctgtcaggta 60 ttaaatttaa aagaattaat tttgataatt cgtctgatta tattgaaaat tggaatctac 120 agaatagacg gcagcgcact gggtttgtat ttcataaacc aaatatactt ccatattcgg 180 catcattcac cctaaatcgg tcacagccag ctcatgataa tttgatggga actatgtgga 240 ttaacgctgg atcagaaatt caagtagctg gattcgatta ttcatgtgct attaatgcac 300 cagcaaacat tcaacagttt gaacacgtcg taccgctaag acgcgctctc acaacagcta 360 cgattactct actgccagat gcagaaagat tcagtttcca gagagtagtt aattcagccg 420 atggtactac cacatggcac ttcaatccag ttatcctaag accaagtaat gtagaggttg 480 aatttctatt aaatggacag acaattaata catatcaagc acgatttgga accatcatag 540 ctagaaattt tgacaccatc cgtttatcat ttcaattggt acgtccaccg aatatgacac 600 cagcagttgc acaactattt ccgcaagcac caccatttat atttcatgct acagttggac 660 ttacaatgcg cattgaatct gcagtttgtg aatctgtgct tgcggatgct tcagaaactt 720 tgttagcaaa tgtaacatca gtacgtcagg agtatgcaat accagtagga ccagtattcc 780 caccag 786 // ID MG029106; SV 1; linear; genomic RNA; STD; VRL; 883 BP. XX AC MG029106; XX DT 27-MAR-2018 (Rel. 136, Created) DT 27-MAR-2018 (Rel. 136, Last updated, Version 1) XX DE Porcine rotavirus A strain RVA/Pig-wt/CHN/SCQL-5-2/2017/G3P[13]I5 VP7 (VP7) DE gene, partial cds. XX KW . XX OS Porcine rotavirus A OC Viruses; Riboviria; Reoviridae; Sedoreovirinae; Rotavirus; Rotavirus A. XX RN [1] RP 1-883 RA Hu H., Yue H., Tang C.; RT ; RL Submitted (29-SEP-2017) to the INSDC. RL College of Life Science and Technology, Southwest Minzu University, No. 16, RL South 4th Section 1st Ring Road, Chengdu, Sichuan 610041, China XX DR MD5; cbfd69020bf82dce08cf2a92ed0555e3. XX FH Key Location/Qualifiers FH FT source 1..883 FT /organism="Porcine rotavirus A" FT /segment="9" FT /host="pig" FT /strain="RVA/Pig-wt/CHN/SCQL-5-2/2017/G3P[13]I5" FT /mol_type="genomic RNA" FT /country="China" FT /isolation_source="fecal sample" FT /collected_by="Hu Huan" FT /collection_date="May-2017" FT /note="genotype: G3" FT /db_xref="taxon:10967" FT gene <1..>883 FT /gene="VP7" FT CDS <1..>883 FT /codon_start=1 FT /gene="VP7" FT /product="VP7" FT /db_xref="GOA:A0A2P1M9L3" FT /db_xref="InterPro:IPR001963" FT /db_xref="InterPro:IPR042207" FT /db_xref="InterPro:IPR042210" FT /db_xref="UniProtKB/TrEMBL:A0A2P1M9L3" FT /protein_id="AVP27323.1" FT /translation="MYGIEYTTVLTFLISVILLNYVLKSLTRMMDYIIYRFLFIIVILS FT PLLNAQNYGINLPITGSMDTPYANSTQGEVFLTSTLCLYYPTEAATQINDNSWKDTLSQ FT LFLTKGWPTGSIYFKDYADIASFSVDPQLYCDYNLVLMKYDAALQLDMSELADLILNEW FT LCNPMDITLYYYQQTDETNKWISMGSSCTIKVCPLNTQTLGIGCLTTDTNTFEEVATAE FT KLVITDVVDGVNHKLSVTTNTCTIRNCKKLGPRENVAIIQVGGSDVLDITADPTTAPQT FT ERMMRVNWKKWWQ" XX SQ Sequence 883 BP; 315 A; 137 C; 156 G; 275 T; 0 other; atgtatggta ttgaatatac cacagtttta acctttttga tatcggttat attgttgaat 60 tacgtactta aatcattaac tagaatgatg gactacatta tctatagatt tcttttcatt 120 atagttatac tgtcaccact tcttaatgca caaaactatg gaataaatct tccaattact 180 ggatcaatgg atactccata tgcaaactcg acacaaggag aagtgtttct aacatcaact 240 ctatgtttgt attatccaac tgaagctgcg acacaaataa atgacaattc atggaaagat 300 acactttctc aactattttt aactaaagga tggccaacag gatctatcta ttttaaagat 360 tatgcagata ttgcttcatt ttcagttgat ccacaattat actgtgatta taatttagtg 420 ttaatgaagt acgacgccgc attacagtta gatatgtcag aattagcaga tttgatactt 480 aatgagtggt tatgtaatcc tatggatatt actttatatt attatcaaca aactgatgag 540 acaaacaagt ggatttcgat gggatcatct tgtaccataa aagtatgtcc attgaataca 600 caaacattag gaattgggtg tctgactact gatacaaata cgtttgaaga agttgcaaca 660 gctgaaaaat tagtaattac tgacgttgta gatggggtca atcataaact aagcgtgacg 720 acaaatactt gtacaattag gaactgtaaa aaattaggac caagagaaaa cgtagcaatt 780 atacaggttg gtggttcaga tgtacttgac ataacagctg atccaacgac agctccacaa 840 acagaaagaa tgatgcgagt aaattggaaa aagtggtggc aag 883 // ID MG029107; SV 1; linear; genomic DNA; STD; VRL; 451 BP. XX AC MG029107; XX DT 15-MAR-2018 (Rel. 136, Created) DT 15-MAR-2018 (Rel. 136, Last updated, Version 3) XX DE Fowl aviadenovirus E isolate SA1 hexon gene, partial cds. XX KW . XX OS Fowl aviadenovirus E OC Viruses; Adenoviridae; Aviadenovirus. XX RN [1] RP 1-451 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT "Molecular Characterization of Fowl Adenoviruses Type D and E associated RT with inclusion bodies hepatitis in Chickens and Falcons indicates possible RT cross species transmission"; RL Unpublished. XX RN [2] RP 1-451 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Central Biotechnology Lab. - Departrment of Clinical Studies, Collage of RL Veterinary Medicine - King Faisal University, Qatar road, Hofuf, Al-Hassa RL 31982, Saudi Arabia XX DR MD5; e7c636674c82b0be95a84edf3fa01651. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..451 FT /organism="Fowl aviadenovirus E" FT /host="falcon" FT /isolate="SA1" FT /mol_type="genomic DNA" FT /country="Saudi Arabia" FT /isolation_source="liver" FT /collection_date="02-Jan-2015" FT /note="FAdV/Falcon/Saudi Arabia" FT /db_xref="taxon:190065" FT CDS <1..>451 FT /codon_start=1 FT /product="hexon" FT /note="surface protein" FT /db_xref="GOA:A0A2P1DTS8" FT /db_xref="InterPro:IPR016107" FT /db_xref="InterPro:IPR016111" FT /db_xref="InterPro:IPR016112" FT /db_xref="UniProtKB/TrEMBL:A0A2P1DTS8" FT /protein_id="AVK71662.1" FT /translation="MGALTPTLAAQVGLAGRFAKVSDENTRLAYGAYVKPLKDDGSQSL FT GTTPYYVLDTTAQKYLGVMGVEDFTQSLTYPDSLLIPPPSEYGAVNSGVMKANRPNYIG FT FRDNFINLLYHDTGVCSGTLNSERSGMNVVVELQDRNTELSYQYML" XX SQ Sequence 451 BP; 110 A; 129 C; 121 G; 91 T; 0 other; atgggagccc tcaccccgac actagcagcg caggtcggtc tggccggtcg gtttgccaag 60 gtgtcggatg agaacacgcg cctggcttat ggagcgtatg tgaagcctct aaaagacgac 120 ggctcccagt cacttggaac aacgccttac tacgtgttag acaccaccgc acagaaatac 180 ttgggcgtca tgggggtaga agactttacg caaagtctta cgtacccaga cagtctgtta 240 atcccccctc cttctgagta cggagcggtt aacagcgggg tgatgaaagc caacagaccc 300 aattacatcg ggttccgtga caatttcatc aacctcctgt accacgatac cggcgtgtgc 360 tccggcaccc tgaactccga gcggtcaggc atgaacgtgg tggtggaatt gcaggaccgt 420 aacaccgaac tcagctacca gtacatgctc g 451 // ID MG029108; SV 1; linear; genomic DNA; STD; VRL; 652 BP. XX AC MG029108; XX DT 15-MAR-2018 (Rel. 136, Created) DT 15-MAR-2018 (Rel. 136, Last updated, Version 3) XX DE Fowl aviadenovirus D isolate SA3 hexon gene, partial cds. XX KW . XX OS Fowl aviadenovirus D OC Viruses; Adenoviridae; Aviadenovirus. XX RN [1] RP 1-652 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT "Molecular Characterization of Fowl Adenoviruses Type D and E associated RT with inclusion bodies hepatitis in Chickens and Falcons indicates possible RT cross species transmission"; RL Unpublished. XX RN [2] RP 1-652 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Central Biotechnology Lab. - Departrment of Clinical Studies, Collage of RL Veterinary Medicine - King Faisal University, Qatar road, Hofuf, Al-Hassa RL 31982, Saudi Arabia XX DR MD5; eade54985e308b046f21944e31d82a7e. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..652 FT /organism="Fowl aviadenovirus D" FT /host="falcon" FT /isolate="SA3" FT /serotype="D" FT /mol_type="genomic DNA" FT /country="Saudi Arabia" FT /isolation_source="liver" FT /collection_date="15-Mar-2015" FT /note="FAdV/Falcon/Saudi Arabia" FT /db_xref="taxon:190064" FT CDS <1..>652 FT /codon_start=1 FT /product="hexon" FT /note="surface protein" FT /db_xref="GOA:A0A2P1DTP9" FT /db_xref="InterPro:IPR016107" FT /db_xref="InterPro:IPR016111" FT /db_xref="InterPro:IPR016112" FT /db_xref="UniProtKB/TrEMBL:A0A2P1DTP9" FT /protein_id="AVK71663.1" FT /translation="MGASYFHIKGVLYRGPSFKPYGGTAYNPLAPREAFFNDWVDTYAS FT KTVITGQMTTPYENVQGAKDKTAAIVAALSGVYPDPNIGTAINEMGALNATSAAQVGLA FT ARFSKVSSDNTRLSYRAYVKPLKNDGSQSINPTPYWVMDSNATNYLGVMGVEDFSASLT FT YPDTLLIPPPTEYSEVNTGVMKANRPNYIGFRDNFINLLYHDTGVCSGTLNSER" XX SQ Sequence 652 BP; 169 A; 192 C; 149 G; 142 T; 0 other; atgggagcct cctacttcca catcaagggc gtcctataca gaggaccttc ttttaaaccg 60 tatggaggaa ccgcatacaa tcccctcgcg ccccgcgaag cctttttcaa cgattgggtt 120 gacacatatg cgagcaaaac cgtcatcacg ggtcagatga caactcccta cgaaaacgtc 180 cagggcgcta aagacaagac tgccgcgatc gtcgccgctc tttcaggggt ttatcccgat 240 cccaatatcg gtaccgccat caacgagatg ggcgccttaa acgcgacgtc ggcagcccaa 300 gtcggattgg ctgcccgatt ctcgaaagta tcgagcgata acacgcgcct atcctataga 360 gcctacgtta aaccgctcaa gaacgacggt tctcaatcga ttaaccccac tccttactgg 420 gtcatggaca gcaacgccac aaactatctc ggagtcatgg gagtcgaaga ctttagcgcc 480 tcgctaacct atcccgatac gctccttatt cccccgccaa ccgaatactc agaagtgaat 540 accggcgtca tgaaggcaaa caggccgaat tacatcggat ttagggacaa ttttatcaac 600 ctgctctatc atgatacggg tgtgtgctcg ggtactctga attcggagcg tt 652 // ID MG029109; SV 1; linear; genomic DNA; STD; VRL; 575 BP. XX AC MG029109; XX DT 15-MAR-2018 (Rel. 136, Created) DT 15-MAR-2018 (Rel. 136, Last updated, Version 3) XX DE Fowl aviadenovirus D isolate SA4 hexon gene, partial cds. XX KW . XX OS Fowl aviadenovirus D OC Viruses; Adenoviridae; Aviadenovirus. XX RN [1] RP 1-575 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT "Molecular Characterization of Fowl Adenoviruses Type D and E associated RT with inclusion bodies hepatitis in Chickens and Falcons indicates possible RT cross species transmission"; RL Unpublished. XX RN [2] RP 1-575 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Central Biotechnology Lab. - Departrment of Clinical Studies, Collage of RL Veterinary Medicine - King Faisal University, Qatar road, Hofuf, Al-Hassa RL 31982, Saudi Arabia XX DR MD5; f13328f9d1dbb5dd0b4d6cfe2fe09f5c. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..575 FT /organism="Fowl aviadenovirus D" FT /host="chickens" FT /isolate="SA4" FT /mol_type="genomic DNA" FT /country="Saudi Arabia" FT /collection_date="12-Jun-2016" FT /note="FAdV/Chicken/Saudi Arabia" FT /db_xref="taxon:190064" FT CDS <1..>575 FT /codon_start=1 FT /product="hexon" FT /note="surface protein" FT /db_xref="GOA:A0A2P1DTP4" FT /db_xref="InterPro:IPR016107" FT /db_xref="InterPro:IPR016111" FT /db_xref="InterPro:IPR016112" FT /db_xref="UniProtKB/TrEMBL:A0A2P1DTP4" FT /protein_id="AVK71664.1" FT /translation="MGATYFDIKGVLDRGPSFKPYGGTAYNPLAPREAFFNNWVDTEAS FT KTVITGQMTTPYENVQGAKDKTAAIVAALSGVYPGPNIGTAISEMGAFNATSAAQVGLA FT ARFAKVSSDNTRLAYGAYVKPLKNDGSQSINPTPYWVMDSNATNYLGVMGVEDFSASLT FT YPDTLLIPPPTEYSEVNTGVMKANRPNY" XX SQ Sequence 575 BP; 150 A; 177 C; 137 G; 111 T; 0 other; atgggagcca cctacttcga catcaagggc gtcctagaca gaggaccttc ttttaaaccg 60 tatggaggaa ccgcatacaa tcccctcgcg ccccgcgaag cctttttcaa caattgggtt 120 gacacagagg cgagcaagac cgtcatcacg ggtcagatga caactcccta cgaaaacgtc 180 cagggcgcta aagacaagac tgccgcgatc gtcgccgctc tttcaggggt ttatcccggt 240 cccaatatcg gtaccgccat cagcgagatg ggcgccttca acgcgacgtc ggcagcccaa 300 gtcggattgg ctgcccgatt cgcgaaagta tcgagcgata acacgcgtct agcctacgga 360 gcctacgtta aaccgctcaa gaacgacggt tctcaatcga ttaaccccac tccttactgg 420 gtcatggaca gcaacgccac aaactatctc ggagtcatgg gagtcgaaga ctttagcgcc 480 tcgctaacct atcccgatac gctccttatt cccccgccaa ccgaatactc agaagtgaat 540 accggcgtca tgaaggcaaa caggccgaat tacat 575 // ID MG029110; SV 1; linear; genomic DNA; STD; VRL; 717 BP. XX AC MG029110; XX DT 15-MAR-2018 (Rel. 136, Created) DT 15-MAR-2018 (Rel. 136, Last updated, Version 3) XX DE Fowl aviadenovirus D isolate SA19 hexon gene, partial cds. XX KW . XX OS Fowl aviadenovirus D OC Viruses; Adenoviridae; Aviadenovirus. XX RN [1] RP 1-717 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT "Molecular Characterization of Fowl Adenoviruses Type D and E associated RT with inclusion bodies hepatitis in Chickens and Falcons indicates possible RT cross species transmission"; RL Unpublished. XX RN [2] RP 1-717 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Central Biotechnology Lab. - Departrment of Clinical Studies, Collage of RL Veterinary Medicine - King Faisal University, Qatar road, Hofuf, Al-Hassa RL 31982, Saudi Arabia XX DR MD5; 08062b1ac7e9a69f9d9842d4a4e602ac. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..717 FT /organism="Fowl aviadenovirus D" FT /host="chicken" FT /isolate="SA19" FT /mol_type="genomic DNA" FT /country="Saudi Arabia" FT /collection_date="10-Apr-2017" FT /note="FAdV/Chicken/Saudi Arabia" FT /db_xref="taxon:190064" FT CDS <1..>717 FT /codon_start=1 FT /product="hexon" FT /note="surface protein" FT /db_xref="GOA:A0A2P1DTP2" FT /db_xref="InterPro:IPR016107" FT /db_xref="InterPro:IPR016111" FT /db_xref="InterPro:IPR016112" FT /db_xref="UniProtKB/TrEMBL:A0A2P1DTP2" FT /protein_id="AVK71665.1" FT /translation="MGATYFDIKGVLDRGPSFKPYGGTAYNPLAPREAFFNNWVDTEAS FT KTVITGQMTTPYENVQGAKDKTAAIVAALSGVYPDPNIGTAISEMGALNATSAAQVGLA FT ARFAKVSSDNTRLAYGAYVKPLKNDGSQSINPTPYWVMDSNATNYLGVMGVEDFSASLT FT YPDTLLIPPPTEYSEVNTGVMKANRPNYIGFRDNFINLLYHDTGVCSGTLNSERSGMNV FT VVELQDRNTELSYQYML" XX SQ Sequence 717 BP; 186 A; 206 C; 175 G; 150 T; 0 other; atgggagcca cctacttcga catcaagggc gtcctagaca gaggaccttc ttttaaaccg 60 tatggaggaa ccgcatacaa tcccctcgcg ccccgcgaag cctttttcaa caattgggtt 120 gacacagagg cgagcaagac cgtcatcacg ggtcagatga caactcccta cgaaaacgtc 180 cagggcgcta aagacaagac tgccgcgatc gtcgccgctc tttcaggggt ttatcccgat 240 cccaatatcg gtaccgccat cagcgagatg ggcgccttaa acgcgacgtc ggcagcccaa 300 gtcggattgg ctgcccgatt cgcgaaagta tcgagcgata acacgcgtct agcctacgga 360 gcctacgtta aaccgctcaa gaacgacggt tctcaatcga ttaaccccac tccttactgg 420 gtcatggaca gcaacgccac aaactatctc ggagtcatgg gagtcgaaga ctttagcgcc 480 tcgctaacct atcccgatac gctccttatt cccccgccaa ccgaatactc agaagtgaat 540 accggcgtca tgaaggcaaa caggccgaat tacatcggat ttagggacaa ttttatcaac 600 ctgctctatc atgatacggg tgtgtgctcg ggtactctga attcggagcg ttcgggtatg 660 aacgtcgtcg tcgagctcca ggacagaaac acggaactta gttaccagta catgttg 717 // ID MG029111; SV 1; linear; genomic DNA; STD; VRL; 714 BP. XX AC MG029111; XX DT 15-MAR-2018 (Rel. 136, Created) DT 15-MAR-2018 (Rel. 136, Last updated, Version 3) XX DE Fowl aviadenovirus D isolate SA20 hexon gene, partial cds. XX KW . XX OS Fowl aviadenovirus D OC Viruses; Adenoviridae; Aviadenovirus. XX RN [1] RP 1-714 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT "Molecular Characterization of Fowl Adenoviruses Type D and E associated RT with inclusion bodies hepatitis in Chickens and Falcons indicates possible RT cross species transmission"; RL Unpublished. XX RN [2] RP 1-714 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Central Biotechnology Lab. - Departrment of Clinical Studies, Collage of RL Veterinary Medicine - King Faisal University, Qatar road, Hofuf, Al-Hassa RL 31982, Saudi Arabia XX DR MD5; 156c9ab64a059a3a55484bfb3fe109bc. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..714 FT /organism="Fowl aviadenovirus D" FT /host="chicken" FT /isolate="SA20" FT /mol_type="genomic DNA" FT /country="Saudi Arabia" FT /collection_date="10-May-2017" FT /note="FAdV/Falcon/Saudi Arabia" FT /db_xref="taxon:190064" FT CDS <1..>714 FT /codon_start=1 FT /product="hexon" FT /note="surface protein" FT /db_xref="GOA:A0A2P1DTP5" FT /db_xref="InterPro:IPR016107" FT /db_xref="InterPro:IPR016111" FT /db_xref="InterPro:IPR016112" FT /db_xref="UniProtKB/TrEMBL:A0A2P1DTP5" FT /protein_id="AVK71666.1" FT /translation="MGATYFDIKGVLDRGPSFKPYGGTAYNPLAPREAFFNNWVDTEAS FT KTVITGQMTTPYENVQGAKDKTAAIVAALSGVYPDPNIGTAISEMGALNATSAAQVGLA FT ARFAKVSSDNTRLAYGAYVKPLKNDGSQSINPTPYWVMDSNATNYLGVMGVEDFSASLT FT YPDTLLIPPPTEYSEVNTGVMKANRPNYIGFRDNFINLLYHDTGVCSGTLNSERSGMNV FT VVELQDRNTELSYQYM" XX SQ Sequence 714 BP; 186 A; 206 C; 174 G; 148 T; 0 other; atgggagcca cctacttcga catcaagggc gtcctagaca gaggaccttc ttttaaaccg 60 tatggaggaa ccgcatacaa tcccctcgcg ccccgcgaag cctttttcaa caattgggtt 120 gacacagagg cgagcaagac cgtcatcacg ggtcagatga caactcccta cgaaaacgtc 180 cagggcgcta aagacaagac tgccgcgatc gtcgccgctc tttcaggggt ttatcccgat 240 cccaatatcg gtaccgccat cagcgagatg ggcgccttaa acgcgacgtc ggcagcccaa 300 gtcggattgg ctgcccgatt cgcgaaagta tcgagcgata acacgcgtct agcctacgga 360 gcctacgtta aaccgctcaa gaacgacggt tctcaatcga ttaaccccac tccttactgg 420 gtcatggaca gcaacgccac aaactatctc ggagtcatgg gagtcgaaga ctttagcgcc 480 tcgctaacct atcccgatac gctccttatt cccccgccaa ccgaatactc agaagtgaat 540 accggcgtca tgaaggcaaa caggccgaat tacatcggat ttagggacaa ttttatcaac 600 ctgctctatc atgatacggg tgtgtgctcg ggtactctga attcggagcg ttcgggtatg 660 aacgtcgtcg tcgagctcca ggacagaaac acggaactta gttaccagta catg 714 // ID MG029112; SV 1; linear; genomic DNA; STD; VRL; 698 BP. XX AC MG029112; XX DT 15-MAR-2018 (Rel. 136, Created) DT 15-MAR-2018 (Rel. 136, Last updated, Version 3) XX DE Fowl aviadenovirus E isolate SA2 hexon gene, partial cds. XX KW . XX OS Fowl aviadenovirus E OC Viruses; Adenoviridae; Aviadenovirus. XX RN [1] RP 1-698 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT "Molecular characterization of fowl adenoviruses type D and E associated RT with IBH in chickens and Falcons indicates possible cross species RT transmission"; RL Unpublished. XX RN [2] RP 1-698 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Central Biotechnology Lab. - Departrment of Clinical Studies, Collage of RL Veterinary Medicine - King Faisal University, Qatar road, Hofuf, Al-Hassa RL 31982, Saudi Arabia XX DR MD5; 6f559c7690b6ae9201de9c73ffcfdf5d. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..698 FT /organism="Fowl aviadenovirus E" FT /host="falcon" FT /isolate="SA2" FT /mol_type="genomic DNA" FT /country="Saudi Arabia" FT /collection_date="10-Apr-2016" FT /note="FAdV/Falcon/Saudi Arabia" FT /db_xref="taxon:190065" FT CDS <1..>698 FT /codon_start=1 FT /product="hexon" FT /note="surface protein" FT /db_xref="GOA:A0A2P1DTQ0" FT /db_xref="InterPro:IPR016107" FT /db_xref="InterPro:IPR016111" FT /db_xref="InterPro:IPR016112" FT /db_xref="UniProtKB/TrEMBL:A0A2P1DTQ0" FT /protein_id="AVK71667.1" FT /translation="MGATYFDIKGVLDRGPSFMPYGGTAYNPLAPREAFFNNWIEDEQN FT NTTITGQMTNPYKNEAQNTATATAAAIASVSGSYPNPNVGPAISEMGALTPTLAAQVGL FT AGRFAKVSDENTRLAYGAYVKPLKDDGSQSLGTTPYYVLDTTAQKYLGVMGVEDFTQSL FT TYPDSLLIPPPSEYGAVNSGVMKANRPKYIGFRDNFINLLYHDTGVCSGTLNSERSGMN FT VVVELQDRNT" XX SQ Sequence 698 BP; 178 A; 204 C; 189 G; 127 T; 0 other; atgggagcga cgtacttcga catcaaaggg gtgctcgaca gaggtccttc cttcatgccg 60 tacggcggca cggcgtacaa ccccctggcc cctcgcgaag ccttctttaa caactggatc 120 gaggacgaac aaaacaacac aacgatcacc gggcaaatga ccaatccgta caagaacgag 180 gcgcaaaaca cagctacggc aactgctgcg gcaatcgcca gcgtttcagg ctcgtatcct 240 aaccctaacg tggggccggc cattagcgaa atgggagccc tcaccccgac actagcagcg 300 caggtcggtc tggccggtcg gtttgccaag gtgtcggatg agaacacgcg cctggcgtat 360 ggagcgtatg tgaagcctct aaaagacgac ggctcccagt cacttggaac aacgccgtat 420 tacgtgttag acaccaccgc acagaaatac ttgggcgtca tgggggtaga agactttacg 480 caaagtctta cgtacccaga cagtctgtta atcccccctc cttctgagta cggagcggtt 540 aacagcgggg tgatgaaagc caacagaccc aagtacatcg ggttccgtga caatttcatc 600 aacctcctgt accacgatac cggcgtgtgc tccggcaccc tgaactccga gcggtcaggc 660 atgaacgtgg tggtggaatt gcaggaccgt aacaccga 698 // ID MG029113; SV 1; linear; genomic DNA; STD; VRL; 713 BP. XX AC MG029113; XX DT 15-MAR-2018 (Rel. 136, Created) DT 15-MAR-2018 (Rel. 136, Last updated, Version 3) XX DE Fowl aviadenovirus E isolate SA5 hexon gene, partial cds. XX KW . XX OS Fowl aviadenovirus E OC Viruses; Adenoviridae; Aviadenovirus. XX RN [1] RP 1-713 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT "Molecular Characterization of Fowl Adenoviruses Type D and E associated RT with inclusion bodies hepatitis in Chickens and Falcons indicates possible RT cross species transmission"; RL Unpublished. XX RN [2] RP 1-713 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Central Biotechnology Lab. - Departrment of Clinical Studies, Collage of RL Veterinary Medicine - King Faisal University, Qatar road, Hofuf, Al-Hassa RL 31982, Saudi Arabia XX DR MD5; 35fa3b9948161d64d4622270990137a7. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..713 FT /organism="Fowl aviadenovirus E" FT /host="chicken" FT /isolate="SA5" FT /mol_type="genomic DNA" FT /country="Saudi Arabia" FT /collection_date="20-May-2016" FT /note="FAdV/Falcon/Saudi Arabia" FT /db_xref="taxon:190065" FT CDS <1..>713 FT /codon_start=1 FT /product="hexon" FT /note="surface protein" FT /db_xref="GOA:A0A2P1DTP6" FT /db_xref="InterPro:IPR016107" FT /db_xref="InterPro:IPR016111" FT /db_xref="InterPro:IPR016112" FT /db_xref="UniProtKB/TrEMBL:A0A2P1DTP6" FT /protein_id="AVK71668.1" FT /translation="MGATYFDIKGVVDRGPSFKPYGGTAYNALAPREAFFNNWIEDDGN FT NTTITGQITNPYKNEAQNTATATAAAIASVSGSYPNPNVGPAISEIRALTPTLAAQVGL FT AGRYAKVSNENTRLAYGAYVNPLKDDGSQSLGTTPYYVLDTTAQKYLGVMGVEDFTQSL FT TYPDSLIIPPPSEYGEVNSGEMKAFRPNYIGFRDNFINLLYHDTGVCSGTLNSERSGMN FT VVVELQDRNTELSYH" XX SQ Sequence 713 BP; 192 A; 213 C; 175 G; 133 T; 0 other; atgggagcga cctacttcga catcaaaggg gtcgtcgaca gaggtccttc cttcaagccc 60 tacggcggca cggcttacaa cgccctggcc cctcgcgaag ccttctttaa caactggatc 120 gaggacgatg gaaacaacac aaccatcacg ggacaaataa ccaatccgta caagaacgag 180 gcgcaaaaca cagctacggc aacagctgca gcaatcgcca gcgtttcagg ctcttatcct 240 aaccctaacg tggggccggc cattagcgaa atccgagccc tcaccccgac actagcagca 300 caggtcggtc tggccggtcg ctatgccaag gtgtcgaatg agaacacgcg cctggcttat 360 ggagcgtatg tgaatcctct aaaagacgac ggctctcagt cacttggaac aacgccttac 420 tacgtgttag acaccaccgc acagaaatac ttgggcgtaa tgggggtaga agactttacg 480 caaagtctta cctaccctga cagtctgata atcccccctc cttctgagta cggagaggtt 540 aacagcgggg agatgaaagc gttcagaccc aactacatcg ggttccgtga caatttcatc 600 aacctcctgt accacgatac cggcgtctgc tccgggaccc tcaactccga acggtcaggc 660 atgaacgtgg tggtggaatt gcaggaccga aacaccgaac tcagctacca cta 713 // ID MG029114; SV 1; linear; genomic DNA; STD; VRL; 844 BP. XX AC MG029114; XX DT 15-MAR-2018 (Rel. 136, Created) DT 15-MAR-2018 (Rel. 136, Last updated, Version 3) XX DE Fowl aviadenovirus D isolate SA21 hexon gene, partial cds. XX KW . XX OS Fowl aviadenovirus D OC Viruses; Adenoviridae; Aviadenovirus. XX RN [1] RP 1-844 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT "Molecular Characterization of Fowl Adenoviruses Type D and E associated RT with inclusion bodies hepatitis in Chickens and Falcons indicates possible RT cross species transmission"; RL Unpublished. XX RN [2] RP 1-844 RA Mohamed M.H.A., El-Sabagh I.M., Al-Ankari A.S.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Central Biotechnology Lab. - Departrment of Clinical Studies, Collage of RL Veterinary Medicine - King Faisal University, Qatar road, Hofuf, Al-Hassa RL 31982, Saudi Arabia XX DR MD5; 968800fd653a0935d07204d1893a4129. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..844 FT /organism="Fowl aviadenovirus D" FT /host="chickens" FT /isolate="SA21" FT /mol_type="genomic DNA" FT /country="Saudi Arabia" FT /collection_date="20-May-2017" FT /note="FAdV/Chicken/Saudi Arabia" FT /db_xref="taxon:190064" FT CDS <1..>844 FT /codon_start=1 FT /product="hexon" FT /note="surface protein" FT /db_xref="GOA:A0A2P1DTQ2" FT /db_xref="InterPro:IPR016107" FT /db_xref="InterPro:IPR016111" FT /db_xref="InterPro:IPR016112" FT /db_xref="UniProtKB/TrEMBL:A0A2P1DTQ2" FT /protein_id="AVK71669.1" FT /translation="MYTEKAQRLQIRFYPTQTDDTPNSYRVRYSLNVGDSWVLDMGATY FT FDIKGVLDRGPSFKPYGGTAYNPLAPREAFFNNWVDTEASKTVITGQMTTPYENVQGAK FT DKTAAIVAALSGVYPDPNIGTAISEMGALNATSAAQVGLAARFAKVSSDNTRLAYGAYV FT KPLKNDGSQSINPTPYWVMDSNATNYLGVMGVEDFSASLTYPDTLLIPPPTEYSEVNTG FT VMKANRPNYIGFRDNFINLLYHDTGVCSGTLNSERSGMNVVVELQDRNTELSYQYMLAD FT " XX SQ Sequence 844 BP; 218 A; 242 C; 211 G; 173 T; 0 other; atgtacaccg agaaggccca gaggctccag atcaggtttt acccgacgca gaccgacgac 60 acgcccaaca gttaccgcgt gcggtacagc ttaaacgtgg gtgacagttg ggttctggac 120 atgggagcca cctacttcga catcaagggc gtcctagaca gaggaccttc ttttaaaccg 180 tatggaggaa ccgcatacaa tcccctcgcg ccccgcgaag cctttttcaa caattgggtt 240 gacacagagg cgagcaagac cgtcatcacg ggtcagatga caactcccta cgaaaacgtc 300 cagggcgcta aagacaagac tgccgcgatc gtcgccgctc tttcaggggt ttatcccgat 360 cccaatatcg gtaccgccat cagcgagatg ggcgccttaa acgcgacgtc ggcagcccaa 420 gtcggattgg ctgcccgatt cgcgaaagta tcgagcgata acacgcgtct agcctacgga 480 gcctacgtta aaccgctcaa gaacgacggt tctcaatcga ttaaccccac tccttactgg 540 gtcatggaca gcaacgccac aaactatctc ggagtcatgg gagtcgaaga ctttagcgcc 600 tcgctaacct atcccgatac gctccttatt cccccgccaa ccgaatactc agaagtgaat 660 accggcgtca tgaaggcaaa caggccgaat tacatcggat ttagggacaa ttttatcaac 720 ctgctctatc atgatacggg tgtgtgctcg ggtactctga attcggagcg ttcgggtatg 780 aacgtcgtcg tcgagctcca ggacagaaac acggaactta gttaccagta catgttagcc 840 gatt 844 // ID MG029119; SV 1; linear; viral cRNA; STD; VRL; 507 BP. XX AC MG029119; XX DT 09-SEP-2018 (Rel. 138, Created) DT 09-SEP-2018 (Rel. 138, Last updated, Version 1) XX DE Avian avulavirus 1 isolate NDV/Chicken/USC/Egypt/Behera/2014 fusion protein DE (F) gene, partial cds. XX KW . XX OS Avian avulavirus 1 OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Paramyxoviridae; Avulavirus. XX RN [1] RP 1-507 RA Sultan H.A., Kutkat M.A., Talaat S.M., Amer S.A.; RT "Isolation and molecular characterization of recent outbreaks of velogenic RT newcastle disease virus genotype VII in Egypt"; RL Unpublished. XX RN [2] RP 1-507 RA Sultan H.A., Kutkat M.A., Talaat S.M., Amer S.A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Birds and Rabbit Medicine Department, Faculty of Veterinary Medicine, RL University of Sadat City, Area No 1, Sadat, Menoufiya 32511, Egypt XX DR MD5; 358b37a5d5351d36fc472dea3916c2ed. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..507 FT /organism="Avian avulavirus 1" FT /host="chicken" FT /isolate="NDV/Chicken/USC/Egypt/Behera/2014" FT /mol_type="viral cRNA" FT /country="Egypt" FT /collection_date="2014" FT /note="genotype: VIId" FT /db_xref="taxon:11176" FT gene <1..>507 FT /gene="F" FT CDS <1..>507 FT /codon_start=1 FT /gene="F" FT /product="fusion protein" FT /db_xref="GOA:A0A346RRH8" FT /db_xref="InterPro:IPR000776" FT /db_xref="UniProtKB/TrEMBL:A0A346RRH8" FT /protein_id="AXS76169.1" FT /translation="PAPLMLITRIMLTLSCIRLTSSLDGRPLAAAGIVVTGDKAVNVYT FT SSQTGSIIVKLLPNMPRDKEACAKAPLEAYNRTLTTLLTPLGDSIRKIQGSVSTSGGRR FT QKRFIGAVIGSVALGVATAAQITAAAALIQAKQNAANILRLKESIAATNEAVHEVTDGL FT SQLAVA" XX SQ Sequence 507 BP; 142 A; 129 C; 125 G; 111 T; 0 other; ccagcacctc taatgctcat cactcggatt atgctgacat tgagctgcat ccggttgaca 60 agctctcttg acggcaggcc ccttgcagct gcaggaattg tagtaacggg agataaggca 120 gtcaatgtat acacctcgtc tcagacaggg tcaatcatag tcaagttgct cccgaatatg 180 cccagagata aggaggcatg tgcaaaagcc ccattggagg catataacag aacactgact 240 actctgctca ctcctcttgg tgactccatc cgcaagatcc aagggtctgt atccacgtcc 300 ggaggaagga gacaaaaacg ttttataggt gctgttattg gcagtgtagc tcttggagtt 360 gcaacagcgg cacagataac agcagctgcg gccctgatac aagccaaaca gaatgctgcc 420 aacatcctcc ggcttaagga gagcattgct gcaaccaatg aagctgtgca tgaagtcact 480 gacggattat cacaactagc agtggca 507 // ID MG029120; SV 1; linear; viral cRNA; STD; VRL; 498 BP. XX AC MG029120; XX DT 09-SEP-2018 (Rel. 138, Created) DT 09-SEP-2018 (Rel. 138, Last updated, Version 1) XX DE Avian avulavirus 1 isolate NDV/Chicken/USC/Egypt/El Gherbia/2015 fusion DE protein (F) gene, partial cds. XX KW . XX OS Avian avulavirus 1 OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Paramyxoviridae; Avulavirus. XX RN [1] RP 1-498 RA Sultan H.A., Kutkat M.A., Talaat S.M., Amer S.A.; RT "Isolation and molecular characterization of recent outbreaks of velogenic RT newcastle disease virus genotype VII in Egypt"; RL Unpublished. XX RN [2] RP 1-498 RA Sultan H.A., Kutkat M.A., Talaat S.M., Amer S.A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Birds and Rabbit Medicine Department, Faculty of Veterinary Medicine, RL University of Sadat City, Area No 1, Sadat, Menoufiya 32511, Egypt XX DR MD5; b9073e4a9d1161520f6fe8fcd7914d54. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..498 FT /organism="Avian avulavirus 1" FT /host="chicken" FT /isolate="NDV/Chicken/USC/Egypt/El Gherbia/2015" FT /mol_type="viral cRNA" FT /country="Egypt" FT /collection_date="2015" FT /note="genotype: VIId" FT /db_xref="taxon:11176" FT gene <1..>498 FT /gene="F" FT CDS <1..>498 FT /codon_start=1 FT /gene="F" FT /product="fusion protein" FT /db_xref="GOA:A0A346RRH9" FT /db_xref="InterPro:IPR000776" FT /db_xref="UniProtKB/TrEMBL:A0A346RRH9" FT /protein_id="AXS76170.1" FT /translation="LMLITRIMLTLSCIRLTSSLDGRPLAAAGIVVTGDKAVNVYTSSQ FT TGSIIVKLLPNMPRDKEACAKAPLEAYNRTLTTLLTPLGDSIRKIQGSVSTSGGRRQKR FT FIGAVIGSVALGVATAAQITAAAALIQAKQNAANILRLKESIAATNEAVHEVTDGLSQL FT AVA" XX SQ Sequence 498 BP; 140 A; 124 C; 124 G; 110 T; 0 other; ctaatgctca tcactcggat tatgctgaca ttgagctgca tccggttgac aagctctctt 60 gacggcaggc cccttgcagc tgcaggaatt gtagtaacgg gagataaggc agtcaatgta 120 tacacctcgt ctcagacagg gtcaatcata gtcaagttgc tcccgaatat gcccagagat 180 aaggaggcat gtgcaaaagc cccattggag gcatataaca gaacactgac tactctgctc 240 actcctcttg gtgactccat ccgcaagatc caagggtctg tatccacgtc cggaggaagg 300 agacaaaaac gttttatagg tgctgttatt ggcagtgtag ctcttggagt tgcaacagcg 360 gcacagataa cagcagctgc ggccctgata caagccaaac agaatgctgc caacatcctc 420 cggcttaagg agagcattgc tgcaaccaat gaagctgtgc atgaagtcac tgacggatta 480 tcacaactag cagtggca 498 // ID MG029121; SV 1; linear; viral cRNA; STD; VRL; 504 BP. XX AC MG029121; XX DT 09-SEP-2018 (Rel. 138, Created) DT 09-SEP-2018 (Rel. 138, Last updated, Version 1) XX DE Avian avulavirus 1 isolate NDV/Chicken/USC/Egypt/Kafr el shaikh/2015 fusion DE protein (F) gene, partial cds. XX KW . XX OS Avian avulavirus 1 OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Paramyxoviridae; Avulavirus. XX RN [1] RP 1-504 RA Sultan H.A., Kutkat M.A., Talaat S.M., Amer S.A.; RT "Isolation and molecular characterization of recent outbreaks of velogenic RT newcastle disease virus genotype VII in Egypt"; RL Unpublished. XX RN [2] RP 1-504 RA Sultan H.A., Kutkat M.A., Talaat S.M., Amer S.A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Birds and Rabbit Medicine Department, Faculty of Veterinary Medicine, RL University of Sadat City, Area No 1, Sadat, Menoufiya 32511, Egypt XX DR MD5; 4cd5c53ddde8be4281e36667d4062699. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..504 FT /organism="Avian avulavirus 1" FT /host="chicken" FT /isolate="NDV/Chicken/USC/Egypt/Kafr el shaikh/2015" FT /mol_type="viral cRNA" FT /country="Egypt" FT /collection_date="2015" FT /note="genotype: VIId" FT /db_xref="taxon:11176" FT gene <1..>504 FT /gene="F" FT CDS <1..>504 FT /codon_start=1 FT /gene="F" FT /product="fusion protein" FT /db_xref="GOA:A0A346RRI0" FT /db_xref="InterPro:IPR000776" FT /db_xref="UniProtKB/TrEMBL:A0A346RRI0" FT /protein_id="AXS76171.1" FT /translation="APPMLITRIMLTLSCIRLTSSLDGRPLAAAGIVVTGDKAVNVYTS FT SQTGSIIVKLLPNMPRDKEACAKAPLEAYNRTLTTLLTPLGDSIRKIQGSVSTSGGRRQ FT KRFIGAVIGSVALGVATAAQITAAAALIQAKQNAANILRLKESIAATNEAVHEVTDGLS FT QLAVA" XX SQ Sequence 504 BP; 141 A; 129 C; 125 G; 109 T; 0 other; gcacccccaa tgctcatcac tcggattatg ctgacattga gctgcatccg gttgacaagc 60 tctcttgacg gcaggcccct tgcagctgca ggaattgtag taacgggaga taaggcagtc 120 aatgtataca cctcgtctca gacagggtca atcatagtca agttgctccc gaatatgccc 180 agagataagg aggcatgtgc aaaagcccca ttggaggcat ataacagaac actgactact 240 ctgctcactc ctcttggtga ctccatccgc aagatccaag ggtctgtatc cacgtccgga 300 ggaaggagac aaaaacgttt tataggtgct gttattggca gtgtagctct tggagttgca 360 acagcggcac agataacagc agctgcggcc ctgatacaag ccaaacagaa tgctgccaac 420 atcctccggc ttaaggagag cattgctgca accaatgaag ctgtgcatga agtcactgac 480 ggattatcac aactagcagt ggca 504 // ID MG029122; SV 1; linear; viral cRNA; STD; VRL; 504 BP. XX AC MG029122; XX DT 09-SEP-2018 (Rel. 138, Created) DT 09-SEP-2018 (Rel. 138, Last updated, Version 1) XX DE Avian avulavirus 1 isolate NDV/Chicken/USC/Egypt/Menoufia/2015 fusion DE protein (F) gene, partial cds. XX KW . XX OS Avian avulavirus 1 OC Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; OC Mononegavirales; Paramyxoviridae; Avulavirus. XX RN [1] RP 1-504 RA Sultan H.A., Kutkat M.A., Talaat S.M., Amer S.A.; RT "Isolation and molecular characterization of recent outbreaks of velogenic RT newcastle disease virus genotype VII in Egypt"; RL Unpublished. XX RN [2] RP 1-504 RA Sultan H.A., Kutkat M.A., Talaat S.M., Amer S.A.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Birds and Rabbit Medicine Department, Faculty of Veterinary Medicine, RL University of Sadat City, Area No 1, Sadat, Menoufiya 32511, Egypt XX DR MD5; 1f5ea5d62b986c0cea42e6284d634610. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..504 FT /organism="Avian avulavirus 1" FT /host="chicken" FT /isolate="NDV/Chicken/USC/Egypt/Menoufia/2015" FT /mol_type="viral cRNA" FT /country="Egypt" FT /collection_date="2015" FT /note="genotype: VIId" FT /db_xref="taxon:11176" FT gene <1..>504 FT /gene="F" FT CDS <1..>504 FT /codon_start=1 FT /gene="F" FT /product="fusion protein" FT /db_xref="GOA:A0A346RRI1" FT /db_xref="InterPro:IPR000776" FT /db_xref="UniProtKB/TrEMBL:A0A346RRI1" FT /protein_id="AXS76172.1" FT /translation="APLMLITRIMLTLSCIRLTSSLDGRPLAAAGIVVTGDKAVNVYTS FT SQTGSIIVKLLPNMPRDKEACARAPLEAYNRTLTTLLTPLGDSIRKIQGSVSTSGGRRQ FT KRFIGAVIGSVALGVATAAQITAAAALIQAKQNAANILRLKESIAATNEAVHEVTDGLS FT QLAVA" XX SQ Sequence 504 BP; 141 A; 130 C; 124 G; 109 T; 0 other; gcacctctaa tgctcatcac tcggattatg ctgacattga gctgcatccg tttgacaagc 60 tctcttgacg gcaggcccct tgcagctgca ggaattgtag taacgggaga taaggcagtc 120 aatgtataca cctcgtctca gacagggtca atcatagtca agttgctccc gaatatgccc 180 agagataagg aggcatgtgc aagagcccca ttggaggcat ataacagaac actgactact 240 ctgctcactc ctctcggtga ctccatccgc aagatccaag ggtctgtatc cacgtccgga 300 ggaaggagac aaaaacgttt tataggtgct gttattggca gtgtagctct tggagttgca 360 acagcggcac aaataacagc agctgcggcc ctgatacaag ccaaacagaa tgccgccaac 420 atcctccggc ttaaggagag cattgctgca accaatgaag ctgtgcatga agtcaccgac 480 ggattatcac aactagcagt ggca 504 // ID MG029162; SV 1; linear; genomic RNA; STD; VRL; 845 BP. XX AC MG029162; XX DT 08-MAR-2018 (Rel. 136, Created) DT 08-MAR-2018 (Rel. 136, Last updated, Version 6) XX DE Sugarcane mosaic virus isolate RR12-SCMV-Ritshuru polyprotein gene, partial DE cds. XX KW . XX OS Sugarcane mosaic virus OC Viruses; Riboviria; Potyviridae; Potyvirus. XX RN [1] RP 1-845 RA Lukanda M., Oresanya A.O., Ogunsanya P., Bekunda M., Hoeschle-Zeledon I., RA Kumar P.L.; RT "Surveys for distribution and diversity maize viruses in eastern Democratic RT Republic of Congo"; RL Unpublished. XX RN [2] RP 1-845 RA Kumar P.L., Oresanya A.O., Lukanda M.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Virology & Molecular Diagnostics Unit, International Institute of Tropical RL Agriculture (IITA), Oyo Road, Ibadan, Oyo PMB 5320, Nigeria XX DR MD5; 3ad018a70f118d7bf7c4eebf38e6e5c1. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..845 FT /organism="Sugarcane mosaic virus" FT /host="maize" FT /isolate="RR12-SCMV-Ritshuru" FT /mol_type="genomic RNA" FT /country="Democratic Republic of the Congo" FT /collected_by="M Lukanda" FT /collection_date="Mar-2017" FT /db_xref="taxon:12224" FT CDS <1..617 FT /codon_start=3 FT /product="polyprotein" FT /note="coat protein" FT /db_xref="GOA:A0A2P0W912" FT /db_xref="InterPro:IPR001592" FT /db_xref="UniProtKB/TrEMBL:A0A2P0W912" FT /protein_id="AUS91142.1" FT /translation="LHLDFLLTYKPQQQDISNTRATKEEFDRWYDAIKKEYEIDDTQMT FT VVMSGLMVWCIENGCSPNINGNWTMMDGDEQRVFPLKPVIENASPTFRQIMHHFSDAAE FT AYIEYRNSTERYMPRYGLQRNLTDYSLARYAFDFYEMTSRTPARAKEAHMQMKAAAVRG FT SNTRLFGLDGNVGETQENTERHTAGDVSRNMHSLLGVQQHH" XX SQ Sequence 845 BP; 250 A; 178 C; 200 G; 217 T; 0 other; tcttgcatct ggacttcttg cttacataca agccacagca gcaagacata tcaaacacaa 60 gagcaaccaa ggaagagttt gatagatggt acgatgccat aaagaaggaa tatgaaattg 120 atgacacaca aatgacagtt gttatgagcg gtcttatggt gtggtgcatt gagaacggtt 180 gctcaccaaa cataaacgga aattggacaa tgatggatgg agatgaacaa agagtctttc 240 cacttaaacc agttattgaa aacgcatctc caactttccg acaaattatg catcatttta 300 gtgatgcagc tgaagcgtac atagagtata gaaactctac tgagcgatat atgccaagat 360 acggacttca gcgcaatctc accgactata gcttagcacg gtatgcattt gatttctatg 420 aaatgacttc acgcacacct gctagagcta aagaagccca catgcagatg aaagccgcag 480 cagttcgtgg ttcaaacaca cgactgttcg gtctggacgg aaatgtcggc gagacccagg 540 agaatacaga gagacacaca gctggcgacg ttagtcgcaa tatgcactct ctgttgggag 600 tgcagcagca ccactagtct cctggaaacc ctgtttgcag tacctataat atgtactaat 660 atatagtatg tcagtgaggt tttacctcgt ctttactatt tgttatgtat gtatttaaag 720 cgtgaaccag tctgcagcat acagggttgg acccagtgtg ttctggtgta gcgtgtacta 780 gcgtcgagcc acgagatgga ctgcactgag tgtgggcttt gcccacttgt gctgcgagtc 840 tcttg 845 // ID MG029163; SV 1; linear; genomic RNA; STD; VRL; 846 BP. XX AC MG029163; XX DT 08-MAR-2018 (Rel. 136, Created) DT 08-MAR-2018 (Rel. 136, Last updated, Version 6) XX DE Sugarcane mosaic virus isolate NB22-SCMV-Nyiragongo polyprotein gene, DE partial cds. XX KW . XX OS Sugarcane mosaic virus OC Viruses; Riboviria; Potyviridae; Potyvirus. XX RN [1] RP 1-846 RA Lukanda M., Oresanya A.O., Ogunsanya P., Bekunda M., Hoeschle-Zeledon I., RA Kumar P.L.; RT "Surveys for distribution and diversity of maize viruses in eastern RT Democratic Republic of Congo"; RL Unpublished. XX RN [2] RP 1-846 RA Kumar P.L., Oresanya A.O., Lukanda M.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Virology & Molecular Diagnostics Unit, International Institute of Tropical RL Agriculture (IITA), Oyo Road, Ibadan, Oyo PMB 5320, Nigeria XX DR MD5; c7507d3c816947d7c2a55cdb13f8260d. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..846 FT /organism="Sugarcane mosaic virus" FT /host="maize" FT /isolate="NB22-SCMV-Nyiragongo" FT /mol_type="genomic RNA" FT /country="Democratic Republic of the Congo" FT /collected_by="M Lukanda" FT /collection_date="Mar-2017" FT /db_xref="taxon:12224" FT CDS <1..617 FT /codon_start=3 FT /product="polyprotein" FT /note="coat protein" FT /db_xref="GOA:A0A2P0W903" FT /db_xref="InterPro:IPR001592" FT /db_xref="UniProtKB/TrEMBL:A0A2P0W903" FT /protein_id="AUS91143.1" FT /translation="LHLDFLLTYKPQQQDISNTRATKEEFDRWYDAIKKEYEIDDTQMT FT VVMSGLMVWCIENGCSPNINGNWTMMDGDEQRVFPLKPIIENASPTFRQIMHHFSDAAE FT AYIEYRNSTERYMPRYGLQRNLTDYSLARYAFDFYEMTSRTPARAKEAHMQMKAAAVRG FT SNTRLFGLDGNVGETQENTERHTAGDVSRNMHSLLGVQQHH" XX SQ Sequence 846 BP; 250 A; 178 C; 201 G; 217 T; 0 other; tcttgcatct ggacttcttg cttacataca agccacagca gcaagacata tcaaacacaa 60 gagcaaccaa ggaagagttt gatagatggt acgatgccat aaagaaagaa tatgagattg 120 atgacacaca aatgacagtt gttatgagcg gtcttatggt gtggtgcatt gagaacggtt 180 gctcaccaaa cataaacgga aattggacaa tgatggatgg agatgaacaa agagtctttc 240 cacttaaacc aattattgaa aacgcatctc caactttccg acaaattatg catcatttta 300 gtgatgcagc tgaagcgtac atagagtaca gaaactctac tgagcgatac atgccaagat 360 acggacttca gcgcaatctc accgactata gcttagcacg gtatgcattt gatttctatg 420 aaatgacttc acgcacacct gctagagcta aagaagccca catgcagatg aaagccgcag 480 cagttcgtgg ttcaaacaca cgactgttcg gtctggacgg aaatgtcggc gagactcagg 540 agaatacaga gagacacaca gctggcgacg ttagtcgcaa tatgcactct ctgttgggag 600 tgcagcagca ccactagtct cctggaaacc ctgtttgcag tacctataat atgtactaat 660 atatagtatg tcagtgaggt tttacctcgt ctttactatt tgttatgtat gtatttaaag 720 cgtgaaccag tctgcagcat acagggttgg acccagtgtg ttctggtgta gcgtgtacta 780 gcgtcgagcc acgagatgga ctgcactggg tgtggctttg ccacttgtgc tgcgagtctc 840 ttggtg 846 // ID MG029168; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC MG029168; XX DT 13-OCT-2017 (Rel. 134, Created) DT 13-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/poultry/China/XY918.4/2016(H5N6)) segment 4 DE hemagglutinin (HA) gene, complete cds. XX KW . XX OS Influenza A virus (A/poultry/China/XY918.4/2016(H5N6)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RX DOI; .1038/s41598-017-16139-1. RX PUBMED; 29176564. RA Zhao Z.; RT "Avian Influenza H5N6 Viruses Exhibit Differing Pathogenicities and RT Transmissibilities in Mammals"; RL Sci Rep 7(1):16280-16280(2017). XX RN [2] RP 1-1701 RA Zhao Z.; RT ; RL Submitted (04-OCT-2017) to the INSDC. RL Institute of Military Veterinary, Academy of Military Medical Sciences, 666 RL West Liuying Road, Changchun, Jilin 130122, China XX DR MD5; d23892571b1a1b7436ed3b25afcf9272. DR EuropePMC; PMC5701206; 29176564. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus FT (A/poultry/China/XY918.4/2016(H5N6))" FT /segment="4" FT /host="poultry" FT /strain="A/poultry/China/XY918.4/2016" FT /isolate="XY918.4" FT /serotype="H5N6" FT /mol_type="viral cRNA" FT /country="China" FT /isolation_source="lung" FT /collection_date="Sep-2016" FT /db_xref="taxon:2042256" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /function="receptor binding and fusion protein" FT /db_xref="GOA:A0A291IDD7" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:A0A291IDD7" FT /protein_id="ATG88189.1" FT /translation="MEKIVLLLAVVSLVKGDQICIGYHANNSTEQVDTIMEKNVTVTHA FT QDILEKTHNGKLCDLNGVKPLILKDCSVAGWLLGNPMCDEFIRVPEWSYIVERANPAND FT LCYPGNLNEYEELKHLLSRINHFEKTPIIPKSSWPNHTSSGVSAACPYLGKPSFFRNVV FT WLTKKNDAYPTIKMSYNNTNREDLLILWGIHHSNNAEEQTNLYKNPTTYVSVGTSTLNQ FT RVVPKIATRSQVNGQSGRMDFFWTILKPDDAIHFESNGNFIAPEYAYKIVKKGDSTIMK FT SEMEYGNCNTKCQTPIGAINSSMPFHNIHPLTIGECPKYVKSNKLVLATGLRNSPLRER FT RRKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADRESTQKAIDGVTNKVNSI FT IDKMNTQFEAVGREFNNLERRIENLNKKMEDGFLDVWTYNAELLVLMENERTLDFHDSN FT VKNLYDKVRLQLRDNAKELGNGCFEFYHKCDNECMESVRNGTYDYPQYSEEARLKREEI FT SGVKLESIGTYQILSIYSTVAGSLALAIIVAGLSLWMCSNGSLQCRICI" FT sig_peptide 1..48 FT /gene="HA" FT mat_peptide 49..1032 FT /gene="HA" FT /product="HA1" FT mat_peptide 1033..1698 FT /gene="HA" FT /product="HA2" XX SQ Sequence 1701 BP; 591 A; 312 C; 390 G; 408 T; 0 other; atggagaaaa tagtgcttct tcttgcagtg gttagccttg tcaaaggtga tcagatttgc 60 attggttacc atgcaaacaa ctcgactgag caggttgaca cgataatgga aaaaaacgtc 120 actgttacac atgctcaaga catactggaa aagacacaca acgggaagct ctgcgatctg 180 aatggagtga aacctctgat tttaaaggat tgtagtgtag ctggatggct tcttggaaac 240 ccaatgtgcg acgagttcat cagagtgccg gaatggtctt acatagtgga aagggctaac 300 ccagccaatg acctctgtta cccagggaac ctcaatgaat atgaagaact gaaacaccta 360 ttgagcagaa taaatcattt cgagaagact ccgatcatcc ccaagagttc ttggcccaat 420 catacatcat caggggtgag cgcagcatgt ccatacctgg gaaagccctc ctttttcaga 480 aatgtggtat ggcttaccaa gaagaacgat gcatacccaa caataaaaat gagctacaat 540 aacaccaata gggaagatct tttgatactg tgggggattc atcattccaa taatgcagaa 600 gagcagacaa atctctataa aaacccaacc acttatgttt ccgttgggac atcaacatta 660 aaccagagag tggtgcccaa aatagctact agatcccaag taaacgggca aagtggaaga 720 atggatttct tctggacaat tttaaaaccg gatgatgcaa tccacttcga gagtaatgga 780 aattttattg ctccagaata tgcatacaaa attgtcaaga aaggggactc aacaattatg 840 aaaagtgaaa tggaatatgg caattgcaac accaaatgtc aaactccaat aggggcgata 900 aactctagta tgccattcca caatatacac cctctcacta tcggggagtg ccccaaatac 960 gtgaaatcaa acaaattagt ccttgcgact gggctcagaa atagtcctct aagagaaaga 1020 agaagaaaaa gaggactatt tggggccata gcagggttta tagagggagg atggcaagga 1080 atggtagatg gttggtacgg gtaccaccat agcaatgaac aaggaagtgg gtatgctgca 1140 gacagagaat ccacccaaaa ggcaatagat ggagttacca ataaggtcaa ctcgataatt 1200 gacaaaatga acactcaatt tgaggccgtt ggaagggaat ttaataactt agaacggaga 1260 atagagaatt taaataagaa aatggaagac ggattcctag atgtctggac ttataatgct 1320 gaacttttag ttctcatgga aaatgagaga actctagatt tccatgactc aaatgtcaag 1380 aacctttatg acaaagtccg actacagctt agggataatg caaaggagct gggtaatggt 1440 tgtttcgagt tctatcacaa atgtgataat gaatgtatgg aaagtgtgag aaatgggacg 1500 tatgactacc cccagtattc agaagaagca agattaaaaa gggaagaaat aagcggagtg 1560 aaattggaat caataggaac ttaccaaata ctgtcaattt attcaacagt ggcgggttcc 1620 ctagcactgg caatcattgt ggctggtcta tctttatgga tgtgctccaa tgggtcgtta 1680 caatgcagaa tttgcattta g 1701 // ID MG029169; SV 1; linear; viral cRNA; STD; VRL; 1380 BP. XX AC MG029169; XX DT 13-OCT-2017 (Rel. 134, Created) DT 13-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/poultry/China/XY918.6/2016(H5N6)) segment 6 DE neuraminidase (NA) gene, complete cds. XX KW . XX OS Influenza A virus (A/poultry/China/XY918.6/2016(H5N6)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1380 RX DOI; .1038/s41598-017-16139-1. RX PUBMED; 29176564. RA Zhao Z.; RT "Avian Influenza H5N6 Viruses Exhibit Differing Pathogenicities and RT Transmissibilities in Mammals"; RL Sci Rep 7(1):16280-16280(2017). XX RN [2] RP 1-1380 RA Zhao Z.; RT ; RL Submitted (04-OCT-2017) to the INSDC. RL Institute of Military Veterinary, Academy of Military Medical Sciences, 666 RL West Liuying Road, Changchun, Jilin 130122, China XX DR MD5; aea7b6f63c43ff3a52178f320be45d0f. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1380 FT /organism="Influenza A virus FT (A/poultry/China/XY918.6/2016(H5N6))" FT /segment="6" FT /host="poultry" FT /strain="A/poultry/China/XY918.6/2016" FT /isolate="XY918.6" FT /serotype="H5N6" FT /mol_type="viral cRNA" FT /country="China" FT /isolation_source="lung" FT /collection_date="Sep-2016" FT /db_xref="taxon:2042257" FT gene 1..1380 FT /gene="NA" FT CDS 1..1380 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A291IDD9" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A291IDD9" FT /protein_id="ATG88190.1" FT /translation="MNPNQKITCISATGVTLSVVSLLIGIANLGLNIGLHYKVSDSTTI FT NIPNINETNLTTTNIPNIIVNKNEERTFLNLTKPLCEVNSWHILSKDNAIRIGEDAHIL FT VTREPYLSCGPQGCRMFALSQGTTLRGRHANGTIHDRGPFRALISWEMGQAPSPYNTRV FT ECIGWSSTSCHDGISRMSICISGPNNNASAVVWYRGRPVTEIPSWVGNILRTQESECVC FT HKGICPVVMTDGPANNKAATKIIYFKEGKIQKIEELQGNAQHIEECSCYGAAGMIKCVC FT RDNWKGANRPIITIDPEMMTHTSKYLCSKILTDTSRPNDPTNGNCDAPIIGGSPDPGVK FT GFAFLDGENSWLGRTISKDSRSGYEMLKVPNAETDTQSGPTSYQLIVNNQNWSGYSGAF FT IDYWANKGCFNPCFYVELIRGRPKEIDVLWTSSSMVALCGSRERLGSWSWHDGAEIIYF FT K" XX SQ Sequence 1380 BP; 475 A; 287 C; 326 G; 292 T; 0 other; atgaatccaa atcaaaagat aacatgcatt tcagcaacag gagtaacact gtccgtggta 60 agcctactaa taggaatcgc caatttgggc ctaaatatcg ggctacacta caaagtgagt 120 gattcaacaa ctataaacat tccaaacata aatgagacca acctaacaac aacaaacatc 180 cctaacatta tagtgaataa gaacgaagaa agaacatttc tcaacttgac caagccgcta 240 tgtgaagtca actcatggca cattctatca aaagacaatg caataagaat aggtgaggat 300 gctcatatac tggtcacaag ggaaccttac ttgtcctgtg gcccacaggg atgcagaatg 360 tttgctctga gtcaaggcac aacactcaga gggcgacatg caaatggaac catacatgat 420 aggggcccat ttcgagctct tataagttgg gaaatgggtc aggcacccag tccatataat 480 actagggtcg aatgcatagg atggtcaagc acgtcatgcc atgatggcat atcaaggatg 540 tcaatatgca tatcaggacc gaataacaat gcatcggcag tggtgtggta cagggggaga 600 ccagtaacag aaatcccatc atgggtaggg aacattctca ggactcaaga atcagaatgt 660 gtgtgccata aaggaatctg cccagtggtc atgacagatg gtccagcaaa caacaaggca 720 gcaactaaaa taatctactt caaggaggga aagatacaga aaattgaaga actgcaaggg 780 aacgctcaac acatcgaaga gtgttcatgc tacggagcag cagggatgat caaatgtgta 840 tgcagagaca attggaaggg ggcaaataga ccaataatca ctatagatcc cgaaatgatg 900 acccacacaa gcaaatactt gtgttcaaaa atcttaaccg acacaagtcg tcctaatgac 960 cccaccaatg ggaactgcga tgcaccaata ataggaggga gcccagaccc aggggtaaaa 1020 gggtttgcat tcctagatgg ggagaattca tggcttggaa ggacaattag caaagactcc 1080 agatcaggct acgaaatgtt aaaggtccca aatgcagaaa ccgacactca atcagggcca 1140 acctcatacc agctgattgt caacaaccaa aattggtcag ggtactcggg ggcattcata 1200 gattactggg caaacaaggg atgcttcaat ccttgctttt atgtggagct aatcagaggg 1260 agacccaaag agattgatgt actgtggact tccagtagta tggtagctct ctgtggatcc 1320 agggagcgat tgggatcatg gtcctggcat gatggtgcag aaatcatcta ctttaagtag 1380 // ID MG029170; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC MG029170; XX DT 13-OCT-2017 (Rel. 134, Created) DT 13-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/poultry/China/XY165.4/2016(H5N6)) segment 4 DE hemagglutinin (HA) gene, complete cds. XX KW . XX OS Influenza A virus (A/poultry/China/XY165.4/2016(H5N6)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RX DOI; .1038/s41598-017-16139-1. RX PUBMED; 29176564. RA Zhao Z.; RT "Avian Influenza H5N6 Viruses Exhibit Differing Pathogenicities and RT Transmissibilities in Mammals"; RL Sci Rep 7(1):16280-16280(2017). XX RN [2] RP 1-1701 RA Zhao Z.; RT ; RL Submitted (04-OCT-2017) to the INSDC. RL Institute of Military Veterinary, Academy of Military Medical Sciences, 666 RL West Liuying Road, Changchun, Jilin 130122, China XX DR MD5; 16f5cc1184e4a1c3b0e555ca88f1df50. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus FT (A/poultry/China/XY165.4/2016(H5N6))" FT /segment="4" FT /host="poultry" FT /strain="A/poultry/China/XY165.4/2016" FT /isolate="XY165.4" FT /serotype="H5N6" FT /mol_type="viral cRNA" FT /country="China" FT /isolation_source="lung" FT /collection_date="Sep-2016" FT /db_xref="taxon:2042254" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /function="receptor binding and fusion protein" FT /db_xref="GOA:A0A291IDF0" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:A0A291IDF0" FT /protein_id="ATG88191.1" FT /translation="MEKIVLLLAVVSLVKGDQICIGYHANNSTEQVDTIMEKNVTVTHA FT QDILEKTHNGKLCDLNGVKPLILKDCSVAGWLLGNPMCDEFIRVPEWSYIVERANPAND FT LCYPGNLNEYEELKHLLSRINHFEKTPIIPKSSWPNHTSSGVSAACPYLGKPSFFRNVV FT WLTKKNDAYPTIKMSYNNTNREDLLILWGIHHSNNAEEQTNLYKNPTTYVSVGTSTLNQ FT RVVPKIATRSQVNGQSGRMDFFWTILKPDDAIHFESNGNFIAPEYAYKIVKKGDSTIMK FT SEMEYGNCNTKCQTPIGAINSSMPFHNIHPLTIGECPKYVKSNKLVLATGLRNSPLRER FT RRKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADRESTQKAIDGVTNKVNSI FT IDKMNTQFEAVGREFNNLERRIENLNKKMEDGFLDVWTYNAELLVLMENERTLDFHDSN FT VKNLYDKVRLQLRDNAKELGNGCFEFYHKCDNECMESVRNGTYDYPQYSEEARLKREEI FT SGVKLESIGTYQILSIYSTVAGSLALAIIVAGLSLWMCSNGSLQCRICI" FT sig_peptide 1..48 FT /gene="HA" FT mat_peptide 49..1032 FT /gene="HA" FT /product="HA1" FT mat_peptide 1033..1698 FT /gene="HA" FT /product="HA2" XX SQ Sequence 1701 BP; 592 A; 311 C; 390 G; 408 T; 0 other; atggagaaaa tagtgcttct tcttgcagtg gttagccttg tcaaaggtga tcagatttgc 60 attggttacc atgcaaacaa ctcgactgag caggttgaca cgataatgga aaaaaacgtc 120 actgttacac atgctcaaga catactggaa aagacacaca acgggaagct ctgcgatctg 180 aatggagtga aacctctgat tttaaaggat tgtagtgtag ctggatggct tcttggaaac 240 ccaatgtgcg acgagttcat cagagtgccg gaatggtctt acatagtgga aagggctaac 300 ccagccaatg acctctgtta cccagggaac ctcaatgaat atgaagaact gaaacaccta 360 ttgagcagaa taaatcattt cgagaagact ccgatcatcc ccaagagttc ttggcccaat 420 catacatcat caggggtgag cgcagcatgt ccatacctgg gaaagccctc ctttttcaga 480 aatgtggtat ggcttaccaa gaagaacgat gcatacccaa caataaaaat gagctacaat 540 aacaccaata gggaagatct tttgatactg tgggggattc atcattccaa taatgcagaa 600 gagcagacaa atctctataa aaacccaacc acttatgttt ccgttgggac atcaacatta 660 aaccagagag tggtgccaaa aatagctact agatcccaag taaacgggca aagtggaaga 720 atggatttct tctggacaat tttaaaaccg gatgatgcaa tccacttcga gagtaatgga 780 aattttattg ctccagaata tgcatacaaa attgtcaaga aaggggactc aacaattatg 840 aaaagtgaaa tggaatatgg caattgcaac accaaatgtc aaactccaat aggggcgata 900 aactctagta tgccattcca caatatacac cctctcacta tcggggagtg ccccaaatac 960 gtgaaatcaa acaaattagt ccttgcgact gggctcagaa atagtcctct aagagaaaga 1020 agaagaaaaa gaggactatt tggggccata gcagggttta tagagggagg atggcaagga 1080 atggtagatg gttggtacgg gtaccaccat agcaatgaac aaggaagtgg gtatgctgca 1140 gacagagaat ccacccaaaa ggcaatagat ggagttacca ataaggtcaa ctcgataatt 1200 gacaaaatga acactcaatt tgaggccgtt ggaagggaat ttaataactt agaacggaga 1260 atagagaatt taaataagaa aatggaagac ggattcctag atgtctggac ttataatgct 1320 gaacttttag ttctcatgga aaatgagaga actctagatt tccatgactc aaatgtcaag 1380 aacctttatg acaaagtccg actacagctt agggataatg caaaggagct gggtaatggt 1440 tgtttcgagt tctatcacaa atgtgataat gaatgtatgg aaagtgtgag aaatgggacg 1500 tatgactacc cccagtattc agaagaagca agattaaaaa gggaagaaat aagcggagtg 1560 aaattggaat caataggaac ttaccaaata ctgtcaattt attcaacagt ggcgggttcc 1620 ctagcactgg caatcattgt ggctggtcta tctttatgga tgtgctccaa tgggtcgtta 1680 caatgcagaa tttgcattta g 1701 // ID MG029171; SV 1; linear; viral cRNA; STD; VRL; 1380 BP. XX AC MG029171; XX DT 13-OCT-2017 (Rel. 134, Created) DT 13-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/poultry/China/XY165.6/2016(H5N6)) segment 6 DE neuraminidase (NA) gene, complete cds. XX KW . XX OS Influenza A virus (A/poultry/China/XY165.6/2016(H5N6)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1380 RX DOI; .1038/s41598-017-16139-1. RX PUBMED; 29176564. RA Zhao Z.; RT "Avian Influenza H5N6 Viruses Exhibit Differing Pathogenicities and RT Transmissibilities in Mammals"; RL Sci Rep 7(1):16280-16280(2017). XX RN [2] RP 1-1380 RA Zhao Z.; RT ; RL Submitted (04-OCT-2017) to the INSDC. RL Institute of Military Veterinary, Academy of Military Medical Sciences, 666 RL West Liuying Road, Changchun, Jilin 130122, China XX DR MD5; 29145c58f884f12008db26e89b0995ba. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1380 FT /organism="Influenza A virus FT (A/poultry/China/XY165.6/2016(H5N6))" FT /segment="6" FT /host="poultry" FT /strain="A/poultry/China/XY165.6/2016" FT /isolate="XY165.6" FT /serotype="H5N6" FT /mol_type="viral cRNA" FT /country="China" FT /isolation_source="lung" FT /collection_date="Sep-2016" FT /db_xref="taxon:2042255" FT gene 1..1380 FT /gene="NA" FT CDS 1..1380 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A291IDD1" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A291IDD1" FT /protein_id="ATG88192.1" FT /translation="MNPNQKITCISATGVTLSVVSLLIGITNLGLNIGLHYKVSDSTTM FT NIPNMNETNPTTTNITNIVMNKNEERTFLKLTKPLCEVNSWHILSKDNAIRIGEDAHIL FT VTREPYLSCDPQGCRMFALSQGTTLRGQHANGTIHDRSPFRALISWEMGQAPSPYNTRV FT ECIGWSSTSCHDGISRMSICISGPNNNASAVVWYRGRPVTEIPSWAGNILRTQESECVC FT HKGICPVVMTDGPANSKAATKIIYFKEGKIQKTEELQGNAQHIEECSCYGAARMIKCVC FT RDNWKGANRPIITIDPEMMTHTSKYLCSKILTDTSRPNDPTNGNCDAPITGGSPDPGVK FT GFAFLDGENSWLGRTISKDSRSGYEMLKVPNAEIDTQSGPISYQLIVNNQNWSGYSGAF FT IDYWANKECFNPCFYVELIRGRPKESGVLWTSNSMVALCGSRERLGSWSWHDGAEIIYF FT K" XX SQ Sequence 1380 BP; 480 A; 287 C; 325 G; 288 T; 0 other; atgaatccaa atcaaaagat aacatgcatt tcagcaacag gagtaacact atcagtagta 60 agcctgctaa taggaatcac caatttgggc ctaaatattg gactacacta caaagtgagt 120 gattcaacaa ctatgaacat tccaaacatg aatgagacca acccaacaac aacaaacatc 180 actaacattg taatgaataa gaacgaagaa agaacatttc tcaaattgac caaaccgcta 240 tgtgaagtca actcatggca cattctatcg aaagacaatg caataagaat aggtgaggat 300 gctcatatac tggtcacaag ggaaccttat ctgtcctgtg atccacaagg ctgcaggatg 360 tttgctctga gtcagggcac aacactcaga gggcaacatg cgaatggaac catacatgat 420 aggagcccat ttcgagctct tataagttgg gaaatgggtc aggcacccag tccatacaac 480 actagggtcg aatgcatagg atggtcaagc acgtcatgcc atgatggcat atcaaggatg 540 tcaatatgca tatcagggcc gaataacaat gcatcggcag tggtgtggta ccgagggaga 600 ccagtaacag aaatcccatc atgggcaggg aacattctta ggacacaaga atcagaatgt 660 gtgtgccata aaggaatctg cccagtggtc atgacagatg gtccagcaaa cagcaaggca 720 gcaactaaga taatctactt caaagaggga aagatacaaa aaactgaaga actgcaaggg 780 aacgctcaac acatcgaaga atgttcatgc tacggagcag caaggatgat caaatgtgta 840 tgcagagaca attggaaggg ggcaaataga ccaataatca ctatagatcc cgaaatgatg 900 acccacacaa gcaaatactt gtgttcgaaa atcttaaccg acacaagtcg tcctaatgac 960 cccaccaatg ggaactgtga tgcgccaata acaggaggga gcccagaccc aggggtaaaa 1020 gggtttgcat tcctagacgg ggagaattca tggcttggaa ggacaattag caaagactcc 1080 agatcagggt acgaaatgtt aaaggtccca aatgcagaaa tcgacactca atcagggcca 1140 atctcatacc agctgattgt caacaaccaa aattggtcag gatactcagg ggcattcata 1200 gactactggg caaacaagga gtgcttcaat ccttgttttt atgtggagct aatcaggggg 1260 agacccaaag agagtggtgt actgtggact tccaatagca tggtagctct ctgtggatcc 1320 agggagcgat tgggatcatg gtcctggcat gatggtgcag aaatcatcta ctttaagtag 1380 // ID MG029172; SV 1; linear; viral cRNA; STD; VRL; 1704 BP. XX AC MG029172; XX DT 13-OCT-2017 (Rel. 134, Created) DT 13-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/poultry/China/XY01.4/2016(H5N6)) segment 4 DE hemagglutinin (HA) gene, complete cds. XX KW . XX OS Influenza A virus (A/poultry/China/XY01.4/2016(H5N6)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1704 RX DOI; .1038/s41598-017-16139-1. RX PUBMED; 29176564. RA Zhao Z.; RT "Avian Influenza H5N6 Viruses Exhibit Differing Pathogenicities and RT Transmissibilities in Mammals"; RL Sci Rep 7(1):16280-16280(2017). XX RN [2] RP 1-1704 RA Zhao Z.; RT ; RL Submitted (04-OCT-2017) to the INSDC. RL Institute of Military Veterinary, Academy of Military Medical Sciences, 666 RL West Liuying Road, Changchun, Jilin 130122, China XX DR MD5; 9e7b3bf0bc0cbfbfbf19ee78ec5d4a59. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1704 FT /organism="Influenza A virus FT (A/poultry/China/XY01.4/2016(H5N6))" FT /segment="4" FT /host="poultry" FT /strain="A/poultry/China/XY01.4/2016" FT /isolate="XY01.4" FT /serotype="H5N6" FT /mol_type="viral cRNA" FT /country="China" FT /isolation_source="lung" FT /collection_date="Sep-2016" FT /db_xref="taxon:2042252" FT gene 1..1704 FT /gene="HA" FT CDS 1..1704 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /function="receptor binding and fusion protein" FT /db_xref="GOA:A0A291IDC5" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:A0A291IDC5" FT /protein_id="ATG88193.1" FT /translation="MEKIVLLLAVVSLVKSDQICIGYHANNSTEQVDTIMEKNVTVTHA FT QDILEKTHNGRLCDLNGVKPLILKDCSVAGWLLGNPMCDEFIRVPKWSYIVERTNPAND FT LCYPGNLNDYEELKHLLSRINHFEKTLIIPKSSWPNHETSGVSAACPYQGVPSFFRNVV FT WLTKKNDAYPTIKMSYNNTNGEDLLILWGIHHSNNAAEQTNLYKNPTTYVSVGTSTLNQ FT RLVPKIATRSQVNGQEGRMDFFWTILKPNDAIHFESNGNFIAPEYAYKIVKKGDSTIMK FT SEMEYGHCNTKCQTPIGAINSSMPFHNIHPLTIGECPKYVKSNKLVLATGLRNSPLREK FT RRRKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADRESTQKAIDGVTNKVNS FT IIDKMNTQFEAVGREFNSLERRIENLNKKMEDGFLDVWTYNAELLVLMENERTLDFHDS FT NVKNLYDKVRLQLRDNAKELGNGCFEFYHKCDNECMESVRNGTYDYPQYSEEARLKREE FT ISGVKLESIGTYQILSIYSTVASSLALAIIVAGLSLWMCSNGSLQCRICI" FT sig_peptide 1..48 FT /gene="HA" FT mat_peptide 49..1035 FT /gene="HA" FT /product="HA1" FT mat_peptide 1036..1701 FT /gene="HA" FT /product="HA2" XX SQ Sequence 1704 BP; 603 A; 309 C; 385 G; 407 T; 0 other; atggagaaaa tagtacttct tcttgcagtg gttagccttg ttaaaagtga tcagatttgc 60 attggttacc atgcaaacaa ctcgacagag caggttgaca cgataatgga aaaaaacgtc 120 actgttacac atgcccaaga catactggaa aagacacaca acgggaggct ctgcgatctg 180 aatggagtga aacctctgat tttaaaggat tgtagtgtag ctggatggct tcttggaaac 240 ccaatgtgcg acgaattcat cagagtgccg aaatggtctt acatagtgga gaggactaac 300 ccagccaatg acctctgtta cccagggaac ctcaatgact atgaagaact gaaacaccta 360 ttgagcagaa taaatcattt tgagaagact ctgatcatcc ccaagagttc ttggcccaat 420 catgaaacat caggggtgag cgcagcatgc ccataccagg gagtgccctc ctttttcaga 480 aatgtggtat ggcttaccaa gaagaacgat gcatacccaa caataaagat gagctacaat 540 aataccaatg gggaagatct tttgatactg tgggggattc atcattccaa caatgcagca 600 gagcagacaa atctctataa aaacccaacc acctatgttt ccgttgggac atcaacatta 660 aaccagagat tggtgccaaa aatagctact agatcccaag taaacgggca agaaggaaga 720 atggatttct tctggacaat tttaaaaccg aatgatgcaa tccactttga gagtaatgga 780 aattttattg ctccagaata tgcatacaaa atagtcaaga aaggggactc aacaattatg 840 aaaagtgaaa tggaatatgg ccactgcaac accaaatgtc aaactccaat aggggcgata 900 aactctagta tgccattcca caatatacac cctctcacca tcggggagtg ccccaaatac 960 gtgaaatcaa acaaattagt cctagcgact ggactcagaa atagtccttt aagagaaaaa 1020 agaagaagaa aaagaggact atttggagct atagcagggt tcatagaggg aggatggcaa 1080 ggaatggtag atggttggta tgggtaccac catagcaatg aacaggggag tggatacgct 1140 gcagacagag aatccaccca aaaggcaata gatggagtta ccaataaggt caactcgata 1200 atcgacaaaa tgaacactca atttgaggcc gttggaaggg agtttaatag cttagaacgg 1260 agaatagaga atttaaataa gaaaatggaa gacggattcc tagatgtctg gacttacaat 1320 gctgaacttt tagttctcat ggaaaatgag agaactttag attttcacga ttcaaatgta 1380 aaaaaccttt atgacaaagt ccgattacag cttagggata atgcaaagga gctaggtaat 1440 ggttgtttcg agttctatca taaatgtgat aatgaatgta tggaaagtgt aagaaatggg 1500 acgtatgact atccccagta ttcagaagaa gcaagattaa aaagggaaga aataagcgga 1560 gtgaaattgg aatcaatagg aacttaccaa atactgtcaa tttattcaac agtggcgagt 1620 tccctagcac tggcaatcat tgtggctggt ctatctttat ggatgtgctc caatgggtcg 1680 ttacaatgca gaatttgcat ttaa 1704 // ID MG029173; SV 1; linear; viral cRNA; STD; VRL; 1380 BP. XX AC MG029173; XX DT 13-OCT-2017 (Rel. 134, Created) DT 13-OCT-2017 (Rel. 134, Last updated, Version 1) XX DE Influenza A virus (A/poultry/China/XY01.6/2016(H5N6)) segment 6 DE neuraminidase (NA) gene, complete cds. XX KW . XX OS Influenza A virus (A/poultry/China/XY01.6/2016(H5N6)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1380 RX DOI; .1038/s41598-017-16139-1. RX PUBMED; 29176564. RA Zhao Z.; RT "Avian Influenza H5N6 Viruses Exhibit Differing Pathogenicities and RT Transmissibilities in Mammals"; RL Sci Rep 7(1):16280-16280(2017). XX RN [2] RP 1-1380 RA Zhao Z.; RT ; RL Submitted (04-OCT-2017) to the INSDC. RL Institute of Military Veterinary, Academy of Military Medical Sciences, 666 RL West Liuying Road, Changchun, Jilin 130122, China XX DR MD5; 4e88314cc431a7e28128911f68ffdfe3. DR EuropePMC; PMC5701206; 29176564. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1380 FT /organism="Influenza A virus FT (A/poultry/China/XY01.6/2016(H5N6))" FT /segment="6" FT /host="poultry" FT /strain="A/poultry/China/XY01.6/2016" FT /isolate="XY01.6" FT /serotype="H5N6" FT /mol_type="viral cRNA" FT /country="China" FT /isolation_source="lung" FT /collection_date="Sep-2016" FT /db_xref="taxon:2042253" FT gene 1..1380 FT /gene="NA" FT CDS 1..1380 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:A0A291IDB7" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:A0A291IDB7" FT /protein_id="ATG88194.1" FT /translation="MNPNQKITCISATGVTLSVVSLLIGITNLGLNIGLHYKVSDSTTM FT NIPNMNETNPTTTNITNIVMNKNEERTFLKLTKPLCEVNSWHILSKDNAIRIGEDAHIL FT VTREPYLSCDPQGCRMFALSQGTTLRGQHANGTIHDRSPFRALISWEMGQAPSPYNTRA FT ECIGWSSTSCHDGISRMSICISGPNNNASAVVWYRGRPVTEIPSWAGNILRTQESECVC FT HKGICPVVMTDGPANSKAATKIIYFKEGKIQKTEELQGNAQHIEECSCYGAARMIKCVC FT RDNWKGANRPIITIDPEMMTHTSKYLCSKILTDTSRPNDPTNGNCDAPITGGSPDPGVK FT GFAFLDGENSWLGRTISKDSRSGYEMLKVPNAEIDTQSGPISYQLIVNNQNWSGYSGAF FT IDYWANKECFNPCFYVELIRGRPKESGVLWTSNSMVALCGSRERLGSWSWHDGAEIIYF FT K" XX SQ Sequence 1380 BP; 480 A; 287 C; 325 G; 288 T; 0 other; atgaatccaa atcaaaagat aacatgcatt tcagcaacag gagtaacact atcagtagta 60 agcctgctaa taggaatcac caatttgggc ctaaatattg gactacacta caaagtgagt 120 gattcaacaa ctatgaacat tccaaacatg aatgagacca acccaacaac aacaaacatc 180 actaacattg taatgaataa gaacgaagaa agaacatttc tcaaattgac caaaccgcta 240 tgtgaagtca actcatggca cattctatcg aaagacaatg caataagaat aggtgaggat 300 gctcatatac tggtcacaag ggaaccttat ctgtcctgtg atccacaagg ctgcaggatg 360 tttgctctga gtcagggcac aacactcaga gggcaacatg cgaatggaac catacatgat 420 aggagcccat ttcgagctct tataagttgg gaaatgggtc aggcacccag tccatacaac 480 actagggccg aatgcatagg atggtcaagc acgtcatgcc atgatggcat atcaaggatg 540 tcaatatgca tatcagggcc gaataacaat gcatcggcag tggtgtggta ccgagggaga 600 ccagtaacag aaatcccatc atgggcaggg aacattctta ggacacaaga atcagaatgt 660 gtgtgccata aaggaatctg cccagtggtc atgacagatg gtccagcaaa cagcaaggca 720 gcaactaaga taatctactt caaagaggga aagatacaaa aaactgaaga actgcaaggg 780 aacgctcaac acatcgaaga atgttcatgc tacggagcag caaggatgat caaatgtgta 840 tgcagagaca attggaaggg ggcaaataga ccaataatca ctatagatcc cgaaatgatg 900 acccacacaa gcaaatactt gtgttcgaaa atcttaaccg acacaagtcg tcctaatgac 960 cccaccaatg ggaactgtga tgcgccaata acaggaggga gcccagaccc aggggtaaaa 1020 gggtttgcat tcttagacgg ggagaattca tggcttggaa ggacaattag caaagactcc 1080 agatcagggt acgaaatgtt aaaggtccca aatgcagaaa tcgacactca atcagggcca 1140 atctcatacc agctgattgt caacaaccaa aattggtcag gatactcagg ggcattcata 1200 gactactggg caaacaagga gtgcttcaat ccttgttttt atgtggagct aatcaggggg 1260 agacccaaag agagtggtgt actgtggact tccaatagca tggtagctct ctgtggatcc 1320 agggagcgat tgggatcatg gtcctggcat gatggtgcag aaatcatcta ctttaagtag 1380 // ID MG029269; SV 1; linear; viral cRNA; STD; VRL; 6935 BP. XX AC MG029269; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Apeu virus isolate BeAn848 segment L, complete sequence. XX KW . XX OS Apeu virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-6935 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-6935 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; 5c772b122ac873fd09acc0849d3005db. DR EuropePMC; PMC5967719; 29795585. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6935 FT /organism="Apeu virus" FT /segment="L" FT /host="Sapajus apella" FT /isolate="BeAn848" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="14-Oct-1955" FT /db_xref="taxon:334520" FT CDS 55..6801 FT /codon_start=1 FT /product="RNA-dependent RNA polymerase" FT /db_xref="GOA:A0A2Z3DKX3" FT /db_xref="InterPro:IPR007099" FT /db_xref="InterPro:IPR007322" FT /db_xref="InterPro:IPR029124" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DKX3" FT /protein_id="AVX48946.1" FT /translation="MAILLGDVIRQYTARIRTCTNPEVGRDILAEITMTRHNYFAQQFC FT EAINIEYRNDVPAADIILEMQPALDLTTIKIPNVTPDNYYRDGTKIYIIDFKVSVSDES FT ALHTYKKYNTLLGDVFNLLGVDYEVVIIRMNPSDMHLHISSDNFANLFPNIVLNLDFTW FT YFRLRDDLFQQFRDNEEFMELVAHGEFTPTIPWVIEDTPELYTHPVFLEFIGSMPDGTV FT EDFFYALNHNAFQSDKWNDLLHIMMRKYGTYYDKFIRDQAKNVFLLDENYNKPSKEEIL FT KGWSEMVGRIKEQRDVIDDCSKQKPSVHFIWSPNDKNSSNENNTKLIKLAKKLQSIKET FT DVFSQAFKNIGYLMDFGDDVEKYETFCLKLKAEARSSLKPKSTKVVPITIGKCTILWEQ FT QFKLDTEVIPKEVRIRFLKEFCGIGNHKQFKDRMMDDLDLSKPKILNFENPEIKTQAYI FT MMKNTQCLMSKESGLKKIGNVLEEFEYKIKDANPKTWEIIEEIANSRYWQAINDFSILI FT KNILSVSQYNKHNTFRVVCTANNNFFGILYPSASIKSRRSTVVFSSVCLHENENEVLKC FT GALYRTYKVKGGYLSISKAIRLDKERCQRLVTSPGIFLLTTLLFKSDNDVNLNDVMNFA FT FFTSLSITKSMLSLTEPSRYMIMNSLALSSHVREYIAEKFSPYTKTLFSVYMTDMIKRG FT CMSANEQRQMISIRDVFLNEFEITQKGVSNEKNLQSIWFPGKISLKEYINQIYMPFYFN FT AKGLHNKHHVMIDLAKTVLEIELDQRMNIPEPWSFDMKKQSANLYLLIFSVSKMLNMDT FT SRHNHLRSRVENRNNFKRSLTSISTFTSSKSCIKVGNFKDIKEKTAKHIKKINEKDAKK FT TRIANTEFVDESERDFEITKSTYMDLIKCVPEYTDYISTKVFDRLYEKFKLEEIEDKPA FT IEVIMDTMRNHKDFKFCFFNKGQKTAKDREIFVGEFEAKLCLYGVERIAKERCKLNPEE FT MISEPGDGKLKKLEINAESEIRYLIDATRNQNAEQSIIDDILDVPKGIKLEINADMSKW FT SAQDVFFKYFWLIVLDPILYPAEKQRIIYFFCNYMNKELILPDEMMCSLLDQKAEREND FT LIREMTNGFRSNTVNIRRNWLQGNLNYTSSYIHSCSMMVFKDIMKETASLLEGRCNVSS FT MVHSDDNQTSVIMVQDKLNNDIITNFVCVTFEKCCLSFGNQANMKKTYITNHIKEFVSL FT FNIYGEPFSVFGRFLLPAVGDCAYIGPYEDMASRLSATQTAIKHGCPPSLAWVSIALNH FT WITFNTYNMLPGQINDPTKVFLFDRRELPIELCGILQADLATVALVGLEAGNISFLTNL FT LKKMSPPQLIKESVQSQCNNIGNWDMECLSDSEILKLKLLRYVVLDSEITEDSKMGETS FT EMRSRSLITPRKFTTTSSLEKLVSYKDFQEVIVNSEKTEELLEKILEKPELLVTKGENS FT TEFMTTILFRYNSKKFKESLSIQSPTQLFIEQILFANKPVIDYTGIQDRYLSILDMPKV FT QSGDGIIGRKTIPETFSAIKRDLNQLPLDTGDIKLIYSFCILNDPLNTTACNALLLSQI FT QSLLERTSMSAVTMPEFRNMKLIRYSPALVLRAYIHNDLTVGGANEDAMKRDLFHLNEF FT IVQTKIRERLNQRIVENQEIKGERDRLFEIKELTKFYQACYDYIKSTEHKIKVFILPSK FT AYTAFDFCATIHGNLIRDDGWFSVHYLKQIVSGTAKANISIAPASEMVIVEECFKLLSH FT FCDTFIDVNSRLAFVMNVIENFSYKNIPVKELLNLMRHSFKRQQFIPLLYWLGELNQDD FT LDKYDAFKTSERVSWNDWQINRALNTGTIDLTIKGYQRTLRIVGEDDFLQIAELEVLKG FT DNTSIETHGRKLLNCKHNLRFEKMKRYQIMEPNTYYICWQMRTKFAYTYQMLLSNIIEA FT RNSQTVSVTGGKFNELIPVCPVIVGRIDSIEKINLRQIKYLNMDCSLSKLQLTQKEFVT FT TKRSHFSKMIFFQGPALIIGNLNLTNLIRTPTLLTTNYPSLSQVPMMTLTRIFQCSGDE FT SQTDEFEFLSDEILEDIETTTVNTVPIFNAQYEVKSKRGYTYKQALQDALRRGIEEVED FT TLDFCGDGFYSPKNLAILALLTNLIDRLQTNEWSTILQTAIHMSFFRNGKDRMYHLMKI FT PKAFVKNPIGEILNWEKIRTFMIQLNTRNPGNHWDQMFNHFREKTLVLIDREIKMEGMS FT WGEMLDELDDYKDTEMFHFE" XX SQ Sequence 6935 BP; 2583 A; 1075 C; 1286 G; 1991 T; 0 other; agtagtgtac ccttgattac ttattacatt gcgttcgctt cgaggaaaga cagtatggct 60 atattacttg gcgatgtcat aagacagtac acagcaagaa tccgtacatg taccaacccg 120 gaagttggta gggacatatt agctgaaata acgatgacta gacacaatta ttttgctcag 180 cagttctgtg aggctattaa tattgagtat agaaatgatg ttccagctgc agacattata 240 ctagaaatgc aaccggcact tgatttgact acaattaaga tccctaatgt aacgccagac 300 aattattaca gagatggtac taagatatac ataatcgatt tcaaagtatc agtaagtgat 360 gaatcagctt tacataccta taaaaagtac aacacattgc tgggagatgt atttaattta 420 cttggtgttg attatgaggt tgtaataatc agaatgaatc cgagtgatat gcatcttcat 480 atatctagcg acaactttgc aaatcttttc cccaatatcg tcctcaactt ggatttcact 540 tggtatttca gattgagaga tgacttattc caacaattca gagacaatga agaatttatg 600 gaattggttg cgcatggcga gtttactcca acaataccct gggtcattga agatacccca 660 gaattgtaca cacatccagt cttcttagaa tttattgggt ccatgccaga tggcactgtc 720 gaagatttct tttatgcatt aaatcacaat gcatttcaat ctgacaaatg gaacgatctt 780 ctacatatta tgatgaggaa atatggtacc tattatgata aattcattcg agatcaagca 840 aaaaatgtct tcttactaga cgaaaattat aacaagcctt caaaggaaga aatactcaaa 900 ggttggtcag aaatggtagg aaggatcaaa gaacaacgag atgttattga tgattgttcc 960 aaacagaagc caagtgttca ttttatatgg tcaccaaatg acaaaaattc ttctaatgag 1020 aacaatacta aattgataaa gttggcaaag aagcttcaat caataaaaga aacagatgtc 1080 tttagtcaag cattcaagaa tattggttac ctaatggact ttggggatga tgttgaaaaa 1140 tatgaaacat tctgtctaaa attgaaagct gaagctagat caagcctcaa acctaaaagt 1200 accaaggtgg ttccaatcac aattgggaaa tgcactatct tatgggaaca acaatttaag 1260 cttgatacag aagttatccc taaagaagtt agaattagat tcctaaaaga attttgtgga 1320 attggcaatc ataaacaatt caaagacaga atgatggacg atctagatct aagtaaacct 1380 aagatcctaa attttgaaaa cccagaaata aaaacccaag cctatataat gatgaaaaat 1440 acacaatgcc tcatgagcaa agagagcggt ttgaagaaaa ttggaaatgt cttagaggaa 1500 tttgaatata aaatcaaaga tgccaaccct aaaacatggg agattattga agagatagct 1560 aattctagat actggcaagc tataaatgat ttctctattt taatcaagaa tatactgtca 1620 gtttcacagt ataacaaaca caacacgttc agagttgtgt gtacagccaa taataacttc 1680 tttgggatac tttatccttc tgcaagtata aaatcaagga ggtcaactgt agtcttctca 1740 agtgtatgcc tacatgaaaa tgagaatgag gttttgaagt gtggcgcttt atacagaaca 1800 tataaagtta aggggggata tctttcaata tcgaaagcca ttcgattaga caaggagcgt 1860 tgtcagagat tggtaacatc tcctgggata ttcctactta ctacactact ttttaaaagc 1920 gacaatgatg tcaatttgaa tgatgtcatg aactttgcat tcttcacatc attatcgata 1980 acaaaaagta tgttgtcatt aactgaaccg tcaaggtata tgattatgaa ctctctagca 2040 ctctctagtc atgtaagaga atatattgct gaaaaattct caccatatac aaagactcta 2100 ttttcagttt atatgacaga catgattaaa agaggatgca tgtctgccaa tgaacagagg 2160 caaatgatat ctattagaga tgtcttcctc aatgagtttg aaataaccca gaaaggggtg 2220 tcaaatgaaa aaaacttaca atcaatttgg tttcctggca agattagttt aaaagaatat 2280 atcaatcaga tatacatgcc attctacttc aatgcaaaag gcttgcacaa caaacatcat 2340 gttatgatag acttagctaa aacagttctt gagatagaat tagatcagcg aatgaacatc 2400 cctgaacctt ggagtttcga catgaaaaaa caatcagcaa atttatactt gcttatattc 2460 tcagtatcaa agatgttgaa catggacact tcaagacata atcatctgag aagtagagtg 2520 gaaaacagaa ataattttaa aaggtcatta acgagtatct ctacatttac aagctccaaa 2580 tcatgtatta aagtaggcaa ctttaaagac ataaaagaaa aaacagccaa gcatataaag 2640 aagattaatg aaaaagacgc aaaaaagaca aggatagcaa atactgaatt tgtcgatgaa 2700 tcagaacgtg actttgaaat cacaaaaagt acatacatgg atctaatcaa gtgtgttccg 2760 gaatatactg attatatatc aactaaggta tttgataggc tatatgaaaa attcaaatta 2820 gaagaaattg aggataaacc agccatagaa gtcattatgg acacaatgag gaaccataaa 2880 gacttcaaat tctgtttctt taataaaggt caaaagactg caaaagatcg tgaaatcttt 2940 gtaggagaat ttgaagcaaa gctatgtttg tatggtgtag aaagaattgc taaagaaaga 3000 tgcaagttaa atccggaaga aatgatttca gaacctggag atgggaaatt gaaaaaatta 3060 gaaataaatg ctgagtctga aatcagatat ttaatagatg caacacggaa ccagaatgca 3120 gaacaatcca taatagatga tatcctagat gtcccaaaag gcatcaaact tgaaatcaat 3180 gcggacatgt caaaatggag tgctcaagat gtctttttta aatatttttg gttgatagtg 3240 ttagacccaa tattgtaccc agcagaaaaa caaagaatta tatatttctt ctgtaattat 3300 atgaataaag aattgatctt accggatgag atgatgtgct cattgctaga tcaaaaagct 3360 gaaagagaaa atgatttaat tagagaaatg acaaatggat tcagaagcaa tactgtaaac 3420 attagaagaa attggcttca aggcaattta aattatacat ctagttacat acatagttgt 3480 tctatgatgg tattcaaaga tattatgaaa gaaaccgcat cattgttaga agggaggtgc 3540 aatgtttcta gtatggtgca ctctgacgat aaccagacat ctgtaataat ggttcaggat 3600 aaattgaaca atgacatcat cactaatttt gtctgcgtaa catttgagaa atgctgctta 3660 tcttttggaa atcaagcaaa tatgaaaaag acatacatta caaatcacat taaagaattt 3720 gttagtctat ttaatatata tggtgaacca ttctcggtat ttggcaggtt cttactacca 3780 gcagttggag attgtgcata tataggtcca tatgaagata tggccagtag attatctgca 3840 acacaaactg ctatcaaaca tggttgccct cctagcttag catgggtaag cattgcacta 3900 aatcattgga taacatttaa cacttataat atgctgccgg gccagatcaa tgatcctaca 3960 aaggtatttt tgttcgacag gcgggaattg ccaatagaat tatgcgggat tctacaagca 4020 gacttggcaa ctgttgcact tgtaggatta gaagctggaa atatttcctt cttgacaaat 4080 cttctgaaga agatgtcgcc tccacaatta ataaaggaat cagtacaaag ccaatgcaat 4140 aatattggaa actgggatat ggaatgtttg tctgatagtg aaattttaaa acttaaattg 4200 ctgaggtatg tggtcttaga ttcagagatt actgaggata gtaaaatggg tgagacaagt 4260 gaaatgcgga gtcgttcttt aataactcct agaaagttca ctactacatc atctttagag 4320 aaattggtgt cttataaaga tttccaagaa gtcatagtca attcagagaa gacagaagaa 4380 ttgttggaaa agattttgga gaaaccagaa ttattagtca ctaaaggcga gaactcaaca 4440 gagtttatga caactatatt atttagatat aattcaaaga agtttaaaga atcattatct 4500 atacaaagcc caacacagtt attcatagaa caaatattat ttgcaaacaa acctgttatt 4560 gactataccg gaatccaaga taggtattta agtattctag acatgccaaa ggtgcaatca 4620 ggagatggga ttataggaag gaaaactata cctgaaacat tttctgctat aaaaagagat 4680 ctcaaccaat tacctcttga cactggagac atcaagttaa tatattcatt ctgtatatta 4740 aatgacccct tgaatactac tgcgtgcaat gcattattgc tatcacagat acagtcctta 4800 cttgaaagga caagcatgtc agctgtgaca atgcccgaat ttagaaatat gaagttgata 4860 cgctattccc ctgctctggt cttaagagca tatatccata atgaccttac agtgggcggt 4920 gctaatgaag acgctatgaa aagagattta ttccatttaa atgaattcat agttcaaaca 4980 aaaattagag agcgcctaaa tcaaaggatt gtagagaatc aagaaataaa aggagagaga 5040 gacagactat ttgaaattaa agagctcaca aaattttatc aagcttgtta tgattatatt 5100 aaatctacag agcataaaat taaggttttt atcttgccat caaaggcata tacagcattt 5160 gatttctgtg cgacaataca tggcaatttg ataagagatg acggttggtt ctctgtacat 5220 tatttgaaac agatagtatc aggaacagca aaggcaaata ttagcattgc tcctgccagc 5280 gaaatggtta tagtcgaaga atgcttcaaa ttgttatctc atttctgtga tacatttatc 5340 gacgtaaatt cgagattggc atttgttatg aatgttattg agaatttctc atataagaat 5400 atcccagtga aagagctttt aaatttaatg agacattcat ttaagcgaca gcagttcata 5460 ccactactgt actggcttgg cgaactaaat caggatgact tagataaata tgatgcattt 5520 aaaacaagtg aaagagtttc ttggaatgat tggcaaataa atagggcact aaatactggc 5580 actatcgatc taacaattaa aggatatcaa aggactttac gcatcgtagg agaagacgat 5640 ttcttgcaaa tagctgaatt agaagttcta aagggagata atacttctat cgagacccac 5700 ggcaggaagt tattgaattg caagcacaac ttaagatttg aaaaaatgaa gaggtatcaa 5760 ataatggaac cgaacacata ttatatatgc tggcagatga gaactaaatt cgcatatacg 5820 tatcagatgt tattatccaa tatcatagaa gcgaggaatt cacaaacagt gtctgttaca 5880 ggaggcaagt ttaatgaact gattcctgtc tgtccagtaa tagtagggag aatcgattct 5940 attgaaaaaa ttaatttgag gcaaataaaa tatttaaata tggattgttc attatccaaa 6000 ttacaattga cccaaaaaga atttgtgaca acaaagagat cccatttttc taaaatgata 6060 tttttccaag ggccggctct cattataggc aacttaaatt taaccaattt aatccggaca 6120 cctacgctat taactacaaa ttacccatcc ctatcgcaag ttcccatgat gacattaact 6180 aggatattcc aatgttctgg ggacgaaagt caaacagatg aatttgagtt cctttcagac 6240 gagatactag aagatattga aacaacaaca gtcaatactg tccctatatt taatgctcaa 6300 tatgaagtga aatcaaagag aggttataca tataaacagg cattgcaaga tgctttgaga 6360 agagggattg aagaagttga ggatacatta gatttctgtg gagatgggtt ctactctcca 6420 aaaaacttgg caattctggc actattgact aatctgattg ataggttgca aacaaatgaa 6480 tggtcaacta tattgcagac agcaatacat atgtctttct tccgcaatgg gaaagatagg 6540 atgtatcatt tgatgaagat accaaaagct tttgtcaaaa atcctattgg agaaatccta 6600 aactgggaga aaattaggac tttcatgata caattaaata caaggaatcc agggaaccat 6660 tgggaccaaa tgtttaatca tttcagggag aagacattgg tactgataga tagagagata 6720 aaaatggaag gaatgtcttg gggagaaatg ctagatgaac tggacgatta taaagatact 6780 gaaatgttcc attttgaata aagataaaat aataacatta attgatcttt aaaataaaag 6840 aacattgtct tttattttaa agatcaaaca gatgactgcc agatgaatga taagaactct 6900 cacagacgta ataacataat caagggaaca ctact 6935 // ID MG029270; SV 1; linear; viral cRNA; STD; VRL; 4534 BP. XX AC MG029270; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Apeu virus isolate BeAn848 segment M, complete sequence. XX KW . XX OS Apeu virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-4534 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-4534 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; cd638025cf32ff22d0792fc8de4792ad. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..4534 FT /organism="Apeu virus" FT /segment="M" FT /host="Sapajus apella" FT /isolate="BeAn848" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="14-Oct-1955" FT /db_xref="taxon:334520" FT CDS 56..4348 FT /codon_start=1 FT /product="glycoprotein" FT /db_xref="GOA:A0A2Z3DG33" FT /db_xref="InterPro:IPR005167" FT /db_xref="InterPro:IPR005168" FT /db_xref="InterPro:IPR026400" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DG33" FT /protein_id="AVX48947.1" FT /translation="MEIIWLLAFVALTAQVPLSNRCFEGGVLVEERNMDHGIAELCVKD FT DISMIKTTSLQKRNESVFTNIIMRKMLIPNYQECNPIEVSNGPIMIFKPDRDLMLIPKT FT YACRVDCSISLDRDEASIILHSDKLNNFEVMGTTTATRWFQGSTTYSLEHTCEHIQVTC FT GSKSLSFHACFKYHMACIRLLNRSYMPAFMIQSVCQNKEIILMTCLVLIIFGLLYIMTM FT SYICYILMPIFIPIAYFWGWLYNKSCKKCSNCGLAYHPFTKCGKNCVCGSMFENSDRMK FT MHRTSGLCKGYKSLRAARILCKSRGSALVLAVILATLLLSFVQPLEAVQLSYNNSIIEI FT TELSHELDIIFQGLKTTQSIIVSQIAICITIILILILYLALHKKIEDKLIQRILYFCPE FT CQMTHPKNGLKKYFAGEFTNMCNSCMCGCTYNQEELNDGYAIPMTHQLTVGCYAPGRYY FT THRKMSNNLIYIVLSILIILASLSIAAASTEDNCIKSSAYKTQDPISCSAWVKVKTCSS FT ESGIQGVMKHLKLPKQETDILGTFKGSLDSILKKSEESIVPLQSYILESLALSLHCTYI FT ASAATETGNINSMIISQYTDKPLEICTANKASKLCGCLMRKSTCDYTSSDDVAAYYKQH FT PEIYKVDFARMIQTLTKYFPGVFSKELLLSVRNNSHAKIKTIIKALDNKITNARGIKAI FT FKIIDTSLSDTTIAGVSPVVPSTKDIKAFDSKWLTDSIFKDIVTSTALKVCSNGKIYKC FT FYPMSLRFTYYYSCSEANKFYQTGEYPISKQHSNNANLCVADPYCERDFTIVDAANKEM FT LLTLRCEEITININDMQSALPVNKCRVVSLQHCTVSDTANKTVAECSNGYFYEYTGDLH FT QSPKDDVGIYCFDKACKTTKFPHHPSNLQGCTSHNAEMLNRKLKEINYTNLEQLKHSLQ FT ESIKTDLIEHNYILTKNLPKLNPTFKAISIQGVETDSGIQSSYIETNLMVKTGLSLGLH FT LTTKAGDPLFDIVVFVKTAHYEAIYDEIYQTGPTVGINVQHNEKCTGHCPESLMKTGWL FT SFSREHTSQWGCEEFGCLAINTGCIFGHCKDIIKPEMTILRKNQEETPVIRICISLPHE FT TFCQPINAFTAIITDKIETQFVSNEAGKIPKLLGYKSNRIYTGMINDLGTFSKMCGSVQ FT SVAGNVLGAGNARFDYICHAAYRKDITVSRCFDNFYDSCQRLEVSDNIVYDNNVKKVSL FT LNKNMGELRLKIKLGDINYKLFEKMPSFDFKGSCVGCIKCIKGVDCEFDIHATGESVCL FT LTSNCNFYHNNLKIDPNVQKYGMKGKCSEEKIWIDLCGNKIEIQISIVQTHETIEVGNS FT DQTYFVKEKDNRCGTWLCKVSEQGISSIFAPFFAIFGDYAKIAFYCVLGIICIALLIYL FT LLPVCGKMRDVLKKNEIEYMKEFRGKRI" XX SQ Sequence 4534 BP; 1622 A; 762 C; 851 G; 1299 T; 0 other; agtagtgaac cgctgtgtac ttatatttta gtagagatca cttattcgcc tagagatgga 60 gatcatttgg ctgttagctt ttgtagcact cactgctcaa gtaccattgt cgaacaggtg 120 ctttgaaggt ggtgtgttag ttgaagaaag gaatatggat cacgggattg cagagttgtg 180 tgtaaaagat gacatcagca tgattaagac aacatctctt caaaaaagaa atgaatctgt 240 cttcactaat ataataatga ggaaaatgct tatcccaaac taccaagagt gtaaccctat 300 tgaagtttcg aatggaccaa taatgatttt caagcccgac cgagacctta tgcttatccc 360 aaagacgtat gcttgcagag ttgactgttc catttcacta gatcgagatg aagcatcaat 420 cattttacat tcagacaagc tcaataattt tgaagtgatg ggaactacta cagcaactag 480 gtggtttcaa ggtagtacta catattcctt agaacacaca tgtgaacata tacaagtgac 540 atgcggttcc aagagtctta gtttccatgc atgctttaaa tatcatatgg catgtattag 600 gctattaaat agaagttaca tgcctgcatt catgattcaa tcagtttgcc agaacaaaga 660 aataatactg atgacatgct tggtccttat aatattcggg ttattatata ttatgactat 720 gtcatacata tgttatatct taatgcctat tttcatacct atcgcatact tttggggctg 780 gttatacaac aaatcatgca aaaaatgttc taactgtgga cttgcatatc atcctttcac 840 gaaatgtggg aaaaactgtg tgtgtgggtc tatgtttgaa aattctgaca gaatgaaaat 900 gcatagaacg tctggactat gcaaaggtta caaatcacta agggcagcta gaatattgtg 960 caagagccga ggttctgcat tagttttggc agtgattcta gccacattat tgctctcatt 1020 tgttcaacca cttgaagcag tccaactgag ttataataat agtatcatcg aaataactga 1080 gttgtcgcat gaattagaca ttatatttca agggttaaaa acaactcaat ccattattgt 1140 gtcacaaatt gccatatgca taactataat attaatctta attttatatc tagcattgca 1200 taagaagatt gaagacaaac tgatacaaag gattttatac ttttgtccag aatgccagat 1260 gacacaccct aagaatgggc tgaagaaata ttttgcagga gaattcacaa atatgtgcaa 1320 cagctgtatg tgcggttgta catataatca agaagaatta aatgatggat atgcaatacc 1380 tatgacacat cagcttacag ttggatgcta tgcacctgga cgatattaca cacacaggaa 1440 aatgtccaat aatttaatct acattgtatt gtcaatcttg atcatacttg cttctttatc 1500 aatagctgct gcctcaacag aagacaactg cattaaaagc tcggcgtata aaactcaaga 1560 tccaatctca tgttctgctt gggtgaaagt aaaaacatgt tcaagcgaat cagggatcca 1620 aggggtcatg aaacacctga aactcccaaa acaggaaact gacatcctgg gaacatttaa 1680 agggagtctt gactcaatcc ttaaaaaatc ggaagaaagc atagtgccat tgcagtctta 1740 catcttggaa tcactagccc taagcctgca ttgcacttac attgctagtg ctgcaacaga 1800 gacaggaaac ataaattcca tgatcatatc acaatacaca gataagccct tagagatatg 1860 cactgcaaat aaggcatcaa aattgtgcgg ctgtttaatg aggaaatcaa catgtgacta 1920 caccagttct gatgatgttg cagcatatta taaacagcat ccagaaattt acaaagttga 1980 ttttgcaagg atgatacaaa ctctgactaa gtacttccct ggggtatttt ctaaggaatt 2040 actattatcc gtaagaaata atagtcatgc aaaaattaag accattataa aagcactcga 2100 taacaagata accaatgcca gagggatcaa ggcaatattc aagattatag acacatcatt 2160 gtcagataca actattgctg gagtttctcc tgtagtgcct tcaactaagg acataaaagc 2220 atttgattca aaatggttga cagacagtat ctttaaagat atagtgacat ctactgcatt 2280 gaaagtatgt tcaaatggga agatttataa atgcttttat ccaatgagct tgagatttac 2340 atattattac agctgcagtg aagctaataa gttttaccag acaggggaat atccaatatc 2400 taagcagcat agtaataatg caaacctctg tgtagcagac ccctattgtg agagagactt 2460 cacaattgta gacgctgcaa acaaagaaat gctccttaca ttgcgttgtg aggaaatcac 2520 aataaatata aatgacatgc aaagtgccct gcctgtcaac aaatgtaggg ttgtctcttt 2580 acagcattgt acagtttctg atactgcaaa caaaacagtt gcagaatgtt caaatggata 2640 cttctatgaa tacactggag atctacatca aagtccaaaa gatgatgtgg gaatttactg 2700 ttttgataaa gcatgtaaaa caactaagtt cccccatcac ccttcaaacc tacaggggtg 2760 cacttctcac aatgcagaga tgttgaatcg caaattaaag gagataaatt acaccaacct 2820 ggaacaactc aaacacagct tgcaagaatc aattaaaaca gatttgatag aacacaacta 2880 tatactgaca aagaatttac ctaaattgaa tccaacattt aaagccattt caatccaagg 2940 tgtggagaca gatagtggaa tccaaagctc atatattgag actaatttga tggttaaaac 3000 aggtttatct cttggattac atctaacaac aaaagcaggg gaccctctct ttgatattgt 3060 agtgtttgta aaaacagccc attatgaagc aatctatgac gaaatctatc aaacagggcc 3120 gacagttggg ataaatgtcc aacataatga aaaatgtact ggccattgtc cagaaagtct 3180 tatgaaaact ggatggctat ctttctctag ggaacataca agtcaatggg gatgtgaaga 3240 atttgggtgt ttagctataa acactggctg catctttggt cattgcaagg atataataaa 3300 gcctgaaatg accattctta gaaaaaacca ggaagagaca ccagtgataa gaatatgtat 3360 ttcactgcct cacgagacat tctgtcaacc aataaatgca ttcactgcaa ttatcactga 3420 caaaattgag actcagtttg tttctaatga agctgggaaa ataccaaaat tgttaggata 3480 taaatcaaat aggatatata caggaatgat aaatgaccta ggaacattct caaaaatgtg 3540 tggaagtgtc caatctgttg caggcaatgt cctaggtgct ggtaatgcca gatttgacta 3600 tatatgtcat gcagcatata ggaaagatat aacagtaagc agatgttttg acaattttta 3660 tgattcatgt cagaggcttg aagtatctga caatatagtc tatgataata atgttaaaaa 3720 ggtatcacta ctgaataaaa acatgggtga gcttaggttg aaaattaaat taggagacat 3780 aaattacaag ctgtttgaaa aaatgccatc ctttgacttc aaaggcagtt gcgtcggctg 3840 cataaaatgt atcaaaggtg tggattgcga atttgatatt catgccaccg gagaatcagt 3900 ctgtttatta acatccaatt gcaatttcta tcataataat ttgaaaattg acccgaatgt 3960 tcaaaaatat ggcatgaaag gaaaatgctc tgaagaaaag atctggatag atttatgtgg 4020 taataaaata gaaattcaga tatcaatagt ccaaacgcat gaaacaatag aagtaggtaa 4080 tagcgatcag acctatttcg tcaaagaaaa agacaatagg tgcggaacat ggctctgtaa 4140 ggttagtgaa caagggatat catcaatatt tgccccgttt tttgcaatat tcggagatta 4200 tgctaaaatt gcattttact gtgtattagg aataatctgt attgcattgc ttatatacct 4260 tttgttacct gtgtgtggga aaatgagaga tgttctgaag aaaaacgaaa tagaatacat 4320 gaaagagttt agaggtaaaa ggatataagt caaaactaga atataaatag agatgaatgg 4380 aatataacaa aattaattaa atgattcaaa taaaaataaa aaggattacc tgcaagccta 4440 ttagtggtag taagtagatt cctaagggta attactcaaa taattataag aatttttcca 4500 aacactaaaa tacaagacac agcggttcac tact 4534 // ID MG029271; SV 1; linear; viral cRNA; STD; VRL; 1082 BP. XX AC MG029271; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Apeu virus isolate BeAn848 segment S, complete sequence. XX KW . XX OS Apeu virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-1082 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-1082 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; 895ad2c900740e2a61cf26aaf5037f24. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1082 FT /organism="Apeu virus" FT /segment="S" FT /host="Sapajus apella" FT /isolate="BeAn848" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="14-Oct-1955" FT /db_xref="taxon:334520" FT CDS 29..346 FT /codon_start=1 FT /product="NSs protein" FT /db_xref="GOA:A0A2Z3DH48" FT /db_xref="InterPro:IPR000797" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DH48" FT /protein_id="AVX48949.1" FT /translation="MSLTVLLEIDTESWQLLCLSFRLRRGVRIRLLLTLNKHTKVLSMN FT TGRSSLLRILECSSSVRMRLNRSSVRVRQSSLTLNLALGRSLLLITITLPTQQIRSLMV FT S" FT CDS 67..774 FT /codon_start=1 FT /product="N protein" FT /db_xref="GOA:A0A2Z3DIS8" FT /db_xref="InterPro:IPR001784" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DIS8" FT /protein_id="AVX48948.1" FT /translation="MATPLFEFSVEERGQNSSTFDPKQAYKSFVDEHREELTLENIRVF FT FLRANEAKQKLRKSSAKLANLKFGTWKVTVVNNHYPANTANTVADGELTLHRISGFLAK FT FVLDLYADTEHRPEIEEKIINPIAESKGVTWAQSAKIYLAFFPGTEMFLHEFEMLPLAI FT YIYRAQKGEIDVSLLKKPLRQQYKNDTPDKWMKEKKVMIQSAVSRISKLPWGTSGLSSQ FT AKEFLKEFGITMK" XX SQ Sequence 1082 BP; 358 A; 181 C; 236 G; 307 T; 0 other; agtagtgaac ttcttaggaa gttcatttat gtcacttaca gttctattgg agattgatac 60 agagtcatgg caactccttt gtttgagttt tcggttgagg agaggggtca gaattcgtct 120 acttttgacc ctaaacaagc atacaaaagt tttgtcgatg aacacaggga ggagctcact 180 cttgagaata ttagagtgtt cttcctccgt gcgaatgagg ctaaacagaa gctccgtaag 240 agttcggcaa agctcgctaa ccttaaattt ggcacttgga aggtcactgt tgttaataac 300 cattaccctg ccaacacagc aaatacggtc gctgatggtg agctaactct gcatagaatc 360 tctggattcc ttgcaaaatt tgttctggat ctctatgcag atacagagca tagacctgaa 420 attgaggaga aaatcatcaa tccaattgca gagtctaaag gagtgacatg ggcacagtca 480 gcaaaaatat acttagcctt cttccctgga acagaaatgt tcttacatga gtttgaaatg 540 ctaccactgg ccatttacat ctatcgagcc cagaaagggg agattgatgt ttcactgctg 600 aaaaagcctc tcaggcaaca gtataagaat gacacgccag acaaatggat gaaagagaaa 660 aaagtcatga ttcaaagtgc cgtatctaga atctcaaagc ttccatgggg aaccagtggc 720 ctgtcttcac aagccaagga attcctcaag gaatttggga tcactatgaa ataatcacat 780 agtgtaaact gatggtttta tatattactt aagttaagtt tagtgtaagt aggtattgaa 840 tagtatatta attaataatg ataagtaatg ataagtacaa atcaattggt tcaaaattgg 900 ggtttgattc aaaattgggg tttaaattgg ggaaatgagc agcctaagaa atgttcaagg 960 acattgaata ttagggttgg gtggttgggg aaacaataaa ggctgcattg cttactaact 1020 aaagttggaa gaaattaact atgttcatta acaaattgat actttctaag aagaacacta 1080 ct 1082 // ID MG029272; SV 1; linear; viral cRNA; STD; VRL; 6907 BP. XX AC MG029272; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Caraparu virus isolate BeAn3994 segment L, complete sequence. XX KW . XX OS Caraparu virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-6907 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-6907 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; d79b2d714bf4d2256625e21dcdbb5d48. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6907 FT /organism="Caraparu virus" FT /segment="L" FT /host="Sapajus apella" FT /isolate="BeAn3994" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="15-Feb-1956" FT /db_xref="taxon:192196" FT CDS 61..6807 FT /codon_start=1 FT /product="RNA-dependent RNA polymerase" FT /db_xref="GOA:W8CZ90" FT /db_xref="InterPro:IPR007099" FT /db_xref="InterPro:IPR007322" FT /db_xref="InterPro:IPR029124" FT /db_xref="UniProtKB/TrEMBL:W8CZ90" FT /protein_id="AVX48950.1" FT /translation="MAILLEDVIRQYSARIRNCNNPEIGRDILAEITMTRHNYFAQKFC FT EAIGIEYRNDVPAADIVLEMMPGLDLTRIRIPNITPDNYYRDGTKIYIIDFKVSVSDES FT AQHTYKKYDTLFGDVFNQLNIEYEVVIIRMNPSDMHLHISSDNFAALFPNITLNVTFDW FT YFRLRDDLFHQFRDNEEFMELIAHGEFTPTIPWVNENTPELFDHEVFQDFIASMPLENR FT EDFYYALNHNAFQSDKWNDLLHVIMRKYGDRYHDFVKANARRIFLTDDKYNRPTKEAIL FT QGWKEMIERVQEQREIIDDLSKQKPSIHFIWGPNSKNESNENNTKIIRLSKNLQSIKEK FT DSFSVAFKNIGLLMDFSEDIDKYEAFCAKLKADARSSLKPKSKKIDPIKIGPSTILWEQ FT QFKMDTDIIPKEIRLKFLKEFCGIGNHKQFKDRMMEDLDLDKPKILNFENQDIKNQAYA FT MMQNTSYFMSKPSNLVKVGNVLEEFKDKIVNANEETWATIEEIAKTRYWQSINDFSVLI FT KNILAVSQYNKHNTFRIVCTANNNFFGLVYPSTSIQSKRSTIVFSTIVLHDNESDVLKC FT GALYKTYRIKNGYLSISKAIRLDKERCQRLVISPGLFMLTSLLFKGDCDVSLNDVMNFA FT FFTSLSITKSMLSLTEPSRYMIMNSLALSSHVREYIAEKFSPYTKTLFAVYMTDLIKRG FT CMAANDQRQQISIKDVFLNEFEITQKGVTNDRNLQSIWFPGKVNLKEYINQIYMPFYFN FT AKGLHNKHHVMIDLAKTVLEIELDQRVNLPTPWGVDLKKQSVNLDVLIFSIAKMLNLDT FT SRHNHLRSRVENRNNFKRSLASISTFTSSKSCIKVGDFKEFKERNALHMKKLQGKEAKK FT TRIANTEFVSDNDRDLEIVHSTYLDLRKSVPNYTDYMSTKVFDRLYERFKNHDFEDKPA FT IEIIMDVMRCHTDFKFCFFNKGQKTAKDREIFVGELEAKLCLYGVERIAKERCKLNPDE FT MISEPGDGKLKKLEINGESEIRYLIDATRNQTREQSKVDDLLDTPKGIKLEINADMSKW FT SAQDVFYKYFWLIALDPILYPFEKQRIIFFFCNYMNKELILPDEMMCSLLDQKAQREND FT LIRQMTNGFTTNTVNIRRNWLQGNLNYTSSYIHSCSMMVFKDILKEVSALLEGRCNVNS FT MVHSDDNQTSVIMVQDKLHNDIITNFVCNTFERCCLTFGNQANMKKTYITNHIKEFVSL FT FNIYGEPYSVYGRFLLPAVGDCAYIGPYEDMASRLSATQTAIKHGCPPSLAWVSIALNH FT WITFNTYNMLPGQINDPTKVFHFERRELPIELCGLLQADLSTIALVGLESGNISFLTSL FT LKRMSPPQLVKESVQAQCQNIESWDLNSLSESEIMKLKILRYVVLDSEVREDSTMGETS FT DMRSRSLITPRKFTTPASLERLVSYKDFQEILADPTRTEDLLETMINNPELLVTKGENS FT EEFMTTILYRYNSKKFKESLSIQSPTQLFIEQILFANKPVIDYTGIQDRYLSILDVPRV FT QENEGIIGRKTIPETFVAIKRDLSLMNLDHKDIKLVYSFCILNDPLNTTACNAILLSQV FT QSLMDRSSLSAVTMPEFRNMKLIRYSPALVLRAYIHNNLTIAGANENAMRRDLFHLQEF FT INQTKIKERLDKRIQDNEEIKGERDRMFEIKETTKFYQACYDYIKSTEHRVKVFILPMK FT AYTAFDFCATIHGNLMKDKGWFAVHYLKQIVSGTAKASVSQAPASEMIIVGECFRLIAH FT FCDTFIDVGSRLQFLYNIIDNFTYKNIPVKNLLEIMMRTNKRQHFLPILHWLEELTQHD FT IDKYDAYKANERVVWNDWQVNRDMNTGNIDLTIKGYQRTLRVIGEDDTLKIGELEILKG FT DTTPIETHGRKLLNSKHGLKFEKMQRYKIIEPNTYYICWQNRTRFSYTYQLLLSNIIEA FT RNSQTVSVTGGKFNELVPVCPVIVSRIDSDDKMNLRQIKYLNMDCSLTRLQLNQNEFAV FT VKRCHFSKMVFFNGPEMVVGNINITNLIQTPSLLTTNYPSLSQMPMMTLTRIFNCNGEE FT KEVDEFEFLSDEILEETETAVINAQPMFNIQYETKSKKGYTYKRALQEALSRGIQEIEH FT NFDFCKDGFYSPKNIAIIALLVNVIDRLHTNEWSSIIRKSFHMCFFNNGKDKLFHMMNI FT PKTFIKNPIGEVPNWEKIRTFIIQLETAYPGNNWHQMFEHFKEKCILLIDREIKMEGMS FT WGEMLDELDDYKDTEMFHFN" XX SQ Sequence 6907 BP; 2581 A; 1048 C; 1301 G; 1977 T; 0 other; agtagtgtac ccttgcttca gttatacaat tactaaagtg caaaaaaaga acataaagga 60 atggctattc tcttagaaga cgttatccgg cagtattctg ctaggattag aaattgcaac 120 aatccagaga tcggaagaga tatcctagca gaaataacta tgacaagaca taattatttt 180 gctcaaaagt tttgtgaagc tatagggatt gagtacagga atgatgtccc agctgcagat 240 attgttctgg aaatgatgcc aggattagac ttgacacgaa ttaggattcc taatatcacg 300 ccagataact attatcgcga tgggacaaag atctatatta tagatttcaa ggtatcagtt 360 agtgatgaat cagcccaaca tacttacaaa aagtatgata ctttgtttgg agatgttttc 420 aatcaattaa acattgaata tgaagtagtt ataatacgga tgaacccaag cgatatgcac 480 ctacatattt ctagtgacaa ttttgcagca ttgtttccca acattactct caatgtaaca 540 tttgattggt acttcagatt acgagatgac ctgttccatc agtttaggga caatgaggaa 600 ttcatggaat taattgctca tggagaattt actccaacta taccttgggt aaatgaaaat 660 actcctgaat tgtttgacca tgaagttttt caagacttta ttgcatcaat gccgttagaa 720 aatagagaag atttttacta tgctttgaat cacaatgcgt ttcaatcaga caaatggaat 780 gacttattac atgtgattat gaggaaatat ggtgatagat atcatgattt tgtcaaagcc 840 aatgcaagga ggatattttt aactgatgat aaatacaatc ggcccacaaa agaagccata 900 ttgcagggtt ggaaagagat gattgaaaga gttcaagagc aacgtgagat catagatgat 960 ttgtccaaac agaagcccag tatccatttt atctggggac ctaattcaaa gaatgaatca 1020 aatgaaaaca ataccaaaat aatccgcttg tcaaaaaatc tacaatctat aaaagaaaag 1080 gattctttca gtgttgcttt caagaatata ggattattaa tggactttag tgaagatatt 1140 gataaatatg aagcattttg tgcaaaatta aaagcagatg ctaggtctag tttgaaaccg 1200 aaaagcaaaa aaatagatcc aattaaaata ggaccatcga ctatattgtg ggaacagcag 1260 tttaaaatgg atacagatat aataccaaaa gaaattaggc taaaatttct caaagaattc 1320 tgcggaattg gtaatcataa gcagttcaaa gaccgaatga tggaagatct tgaccttgat 1380 aaacctaaga ttctaaattt tgaaaatcaa gacataaaga atcaagcata tgcaatgatg 1440 caaaacacat cttacttcat gtctaaacct agcaatttgg ttaaagtggg gaatgtgtta 1500 gaagaattta aagacaaaat tgttaatgct aatgaagaaa catgggcaac aatagaagag 1560 attgctaaaa ccaggtactg gcagagtata aatgactttt ctgttttaat taaaaacata 1620 ctggctgtat ctcaatacaa caagcataat acattcagaa tagtatgcac tgcaaacaat 1680 aatttcttcg gcttagtata tccttcaaca agtatacaat cgaaaagatc aactatagtc 1740 ttctcaacca ttgtactgca cgacaatgag tcagatgttt taaaatgtgg ggctttatac 1800 aaaacatata gaataaagaa tggctactta tcaatatcta aagcaattag gctggacaaa 1860 gaacgctgcc aaaggctagt gatatcccct gggttattta tgttaacctc attacttttt 1920 aaaggagatt gcgatgtgag tttaaatgat gtcatgaatt ttgcattttt cacctctctg 1980 tcaataacaa aaagtatgtt atcgctgaca gaaccatcta gatatatgat tatgaattcg 2040 ctagcactat ctagccatgt aagagaatat atagctgaaa aattttcacc atacacaaag 2100 acactttttg ctgtgtatat gacagaccta atcaaacgag ggtgtatggc tgctaacgat 2160 cagagacaac agatatcaat aaaagatgtt tttttgaatg agttcgaaat cactcaaaaa 2220 ggagtgacaa atgacagaaa cttacaatca atatggtttc caggaaaagt taatctgaag 2280 gaatacataa accagatcta tatgccattc tatttcaatg ctaaaggatt acataataaa 2340 catcatgtga tgattgatct tgctaaaaca gtactagaaa tagaattgga tcaaagagta 2400 aacctaccga caccttgggg cgttgatcta aaaaagcagt cagtcaactt agatgtctta 2460 atattctcta tagccaaaat gttaaattta gacacgtcta ggcataacca tttaagaagc 2520 agagttgaaa ataggaataa ttttaaaagg tcgttagcaa gcatctccac ttttactagt 2580 tcaaagtcat gtataaaagt aggagatttc aaggaattca aagagagaaa tgccttacat 2640 atgaaaaaat tacaagggaa agaggcaaaa aaaaccagaa tagcaaatac agaatttgtc 2700 tcagacaatg acagagattt ggaaattgtc cacagcactt acttagacct acggaaatct 2760 gtaccaaatt atacagatta tatgtcaaca aaagtatttg ataggttgta tgaaagattc 2820 aagaatcatg attttgaaga taaacctgca atagaaataa taatggacgt aatgagatgc 2880 catactgatt tcaaattttg cttctttaac aaagggcaga aaactgctaa agaccgagag 2940 atatttgttg gagaactaga ggcaaagctg tgcctatatg gtgttgagag aatagctaag 3000 gaaagatgta agttaaatcc agatgaaatg atttctgaac ctggcgacgg taaactaaaa 3060 aagctggaga taaacggaga atcggaaatt agatatttaa ttgatgctac taggaatcaa 3120 actagagaac aatcgaaagt agatgatctc ttggacacac caaagggaat caaattagaa 3180 ataaatgcag atatgtcgaa atggagtgct caagatgttt tctacaaata tttttggctt 3240 atagcattgg accctatact gtatccgttt gaaaaacaga ggatcatatt tttcttttgc 3300 aattacatga ataaagaact gatcctaccg gatgagatga tgtgctcatt attagatcag 3360 aaagctcagc gcgaaaatga cctgattagg caaatgacaa atggattcac tacaaacact 3420 gtcaatataa gaagaaattg gctccaaggt aacctgaact acacttctag ttatatacat 3480 agttgttcta tgatggtctt caaggatatc ttgaaagaag tgtctgcctt actagaagga 3540 agatgtaatg taaatagtat ggttcattct gatgacaacc aaacctctgt tataatggtc 3600 caggataagt tgcacaatga tataattaca aattttgtat gcaatacatt tgaaagatgt 3660 tgcttaactt ttggcaacca ggccaatatg aagaaaacat atattacaaa ccacataaaa 3720 gaatttgtga gtcttttcaa tatttatggc gaaccttact ctgtgtatgg ccgtttctta 3780 cttccggcag ttggggattg cgcatatatt ggaccttatg aagatatggc aagcaggtta 3840 tcagcaactc agacagcaat caagcatggg tgccctccaa gcttagcatg ggtaagcatt 3900 gcactgaatc actggattac attcaacaca tataatatgt taccagggca aatcaatgat 3960 cctactaaag tattccattt cgaaagaaga gaacttccta tcgaattatg tggtttattg 4020 caggcagatc tatcaactat tgctttagtc ggtttggagt cagggaatat ctcatttctg 4080 acatcgctat taaaaaggat gtctccgcct caattggtga aagaatcagt tcaggcgcaa 4140 tgtcaaaata ttgaatcatg ggatttaaac tcgttatctg aaagtgagat aatgaagtta 4200 aagattttaa ggtatgtagt tcttgactcg gaggtgaggg aagacagtac tatgggggag 4260 actagtgata tgcgtagtag atctctaatt acaccgcgga aatttacaac cccagcatct 4320 ttagaacgac ttgtttctta caaagatttc caagagatat tggctgaccc aactagaaca 4380 gaagatctac tggaaactat gataaataac cctgaattat tagtcacaaa aggagagaac 4440 tctgaggaat ttatgacgac tatattatat agatataatt caaagaagtt taaagaatct 4500 ttgtccatac agagtcctac ccagttattt atagaacaga tactgtttgc aaacaagcct 4560 gttatagatt atacaggtat tcaagacaga tacttaagta ttctcgatgt tccaagagtt 4620 caggaaaatg agggaataat aggaaggaaa acaattcccg aaacttttgt tgctatcaag 4680 agagatttat ctttaatgaa cttagatcat aaagacataa agttagtata ttctttttgt 4740 attttgaatg atcctctaaa tacaacagca tgcaatgcca tattattgtc acaagttcaa 4800 tcgcttatgg atagatcaag tctatcagca gttacaatgc ctgaatttag aaatatgaaa 4860 ttaatacggt attcaccagc cttagttctg cgagcttata tacacaataa tttaaccatt 4920 gctggagcta atgaaaatgc gatgagacga gatttatttc atttgcaaga atttatcaat 4980 caaactaaga taaaagaaag gctagataag agaatccaag acaacgaaga gatcaaaggt 5040 gaaagagata ggatgtttga aattaaagaa accactaaat tttaccaggc ttgttatgat 5100 tatattaaat ctactgagca tagagtgaaa gtctttattt tgcccatgaa agcatataca 5160 gcatttgatt tctgtgccac aatacatggc aatcttatga aagataaagg gtggtttgct 5220 gtccattatc taaaacaaat agtttctggg actgctaaag caagtgtcag tcaagctcct 5280 gcaagtgaaa tgattatcgt gggtgaatgt ttcagattga tagcacactt ctgcgacaca 5340 tttattgatg ttggatctag actacaattc ttatacaata taatagataa tttcacatat 5400 aaaaatatac ctgtgaaaaa tttactagag atcatgatga gaactaataa aaggcaacac 5460 ttcttgccaa tacttcattg gttagaagag ctaacacaac atgatataga caaatatgat 5520 gcctataagg caaatgaaag ggttgtatgg aatgattggc aagttaatag agatatgaat 5580 actgggaaca ttgacttaac aattaagggt taccagcgca cattgagagt aattggtgaa 5640 gatgatactc taaaaatagg ggaattagaa attttaaaag gagatacaac cccaatagaa 5700 actcacggga ggaagttact caactcaaag catggtttga agttcgagaa aatgcagaga 5760 tataaaatta tagagcctaa tacgtactat atctgctggc agaacagaac tagattctct 5820 tacacttatc aacttttgct atcgaatata atagaagcac gtaattctca aactgtatct 5880 gttacaggtg gtaagtttaa tgaattagtt ccagtatgcc cagtgatagt aagtagaatt 5940 gactctgatg ataaaatgaa tttaaggcaa attaagtatc ttaatatgga ctgctcatta 6000 actcgacttc agctaaatca gaatgagttt gctgtagtca aaagatgtca tttttctaaa 6060 atggtattct tcaatggacc agaaatggta gtcggcaata taaatataac aaatttgatt 6120 caaactccaa gtttgttaac aacgaactac ccatcattat ctcaaatgcc aatgatgaca 6180 ttgacaagaa tatttaactg caatggtgaa gaaaaagaag tagacgaatt tgaattttta 6240 tcggatgaaa tactagaaga aacagagact gcagtgataa atgcacagcc aatgttcaat 6300 attcaatatg aaacaaaatc aaagaaaggc tatacatata aaagagcttt gcaagaagcc 6360 ttatctagag ggatacaaga aattgagcat aatttcgact tctgcaaaga tggattttat 6420 tcaccaaaaa atatcgcgat aatagcattg ctagttaatg ttatagatag attgcataca 6480 aacgaatggt caagcatcat acggaaatct tttcatatgt gttttttcaa caatgggaaa 6540 gataaattgt ttcatatgat gaatatacct aaaacattta tcaaaaaccc gataggggaa 6600 gtcccaaatt gggagaagat aagaacattc ataatccaat tagaaacggc ttatcctggc 6660 aataattggc atcagatgtt tgagcatttc aaggagaaat gcatattatt gatagataga 6720 gaaattaaaa tggagggcat gagctggggt gaaatgttag atgaacttga tgattataaa 6780 gacactgaaa tgtttcattt caattagaca gatctagaga gtctttagca gtagggaagg 6840 caaaggaaag cagtaacata tgatgaggat aggtaaacaa aatgaaaaca aacaagggaa 6900 cactact 6907 // ID MG029273; SV 1; linear; viral cRNA; STD; VRL; 4551 BP. XX AC MG029273; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Caraparu virus isolate BeAn3994 segment M, complete sequence. XX KW . XX OS Caraparu virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-4551 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-4551 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; 03785f643e1262ae5fe2edbc32185488. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..4551 FT /organism="Caraparu virus" FT /segment="M" FT /host="Sapajus apella" FT /isolate="BeAn3994" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="15-Feb-1956" FT /db_xref="taxon:192196" FT CDS 57..4349 FT /codon_start=1 FT /product="glycoprotein" FT /db_xref="GOA:A0A2Z3DCG9" FT /db_xref="InterPro:IPR005167" FT /db_xref="InterPro:IPR005168" FT /db_xref="InterPro:IPR026400" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DCG9" FT /protein_id="AVX48951.1" FT /translation="MAIVILLALVALTAQAPLSNRCFEGGVLVEERNMDHGIAELCVKD FT DISMIKTTSLQKRNESIFTNIIMRKMLIPNYQECNPVEVANGPIMIFKPDRDLMLIPKT FT YACRVDCSISLDRDEASIILHSDKLNNFEVMGTTTATRWFQGSTTYSLEHTCEHIQVTC FT GSKSLSFHACFKYHMACIRLLNRSYMPAFMIQSVCQNKEIILMTCLVLIIFGLLYIMTM FT SYICYILMPIFIPIAYLWGWLYNKSCKKCAYCGLAYHPFTKCGKNCVCGSMFENSERMK FT MHRVSGLCKGYKSLRAARILCKSRGSAFILAVILATLLLSFIQPLEAVQLNYNNEVIEI FT TELPQELDIIFQGLKTTQSIIIAQITICTCIILILALYLILHKRIEDRLINRILYFCPE FT CQMTHPKNGLKKYFAGEFTNMCNSCMCGCTYNQEELNDGYTIPMTHQLTVGCYAPGRYY FT THRRMSNGLIYIVLAVLMILASLSIAAASTDDNCVKSSTYKTQEPVSCSAWVKVTTCSS FT SSGIQGVFNHLKLPKQESDILTTMKGSLDSILKKSEDSTVPLQAYILESLAVSLHCSYI FT ASAATEGGNINTMIVSQYTDKPLEICTANKASKLCGCLMRKTTCDTTSTEDVATYYKQH FT PEIYKVDYARMIQTLTKYFPGVFSKELLIAVKNANHTRVKSIIKLLDAKITNARGIKSI FT FKIIDAALSDTTVAGAAPPSSAVKDVKAFDTNWLAESIFKDIVTSSAMKICTSGKIYRC FT FYPMSLRFTFYYSCNEANKFYQTGEYPISKHHSNNANLCVADAYCEKDFTPVDASNKDM FT LLTLRCEEITININELPSALPVNKCRVISIQHCTVSGNSNKTVAECANGFFYEYNGDLH FT QSPKDDVGIYCFDKTCKTTRFPHHPSNLQGCTSHNAEMLNRKLKEINYTNLEQLKHSLQ FT ESIKTDLIEHNYILTKNLPKLNPTFKAISIQGVETDSGIQSSYIETNLMVKTGLSLGLH FT LTTKSGDPLFDIIIFVKTAHYEAVYDEIYQTGPTVGINVQHDEKCTGRCPENLMKTGWL FT SFAREHTSQWGCEEFGCLAINEGCLFGHCKDIIKPEMTVLRKNQEETPVIRICISLPQE FT TFCQPINAFTAIITDKIETQFISNEAGRIPKLLGYKSNRIYTGMINDLGTFSKMCGSVQ FT SVTGNVLGAGSPRFDYICHAAQRKDVTVSRCFDNFYDSCQRLEPTDNIVYDNNVKKVSL FT LNKNMGELRLKIKLGDINYKLYEKMPSFDFKGSCVGCIKCIKGVDCEFDIHATGESVCL FT LTSNCNFYHNNLKIDPNVQKYGMKGKCTDEKIWIDLCGNKIEIQISLVQSHETIEVGNS FT DQTYFVKEKDNRCGTWLCKVSEQGISSIFAPFFAVFGDYAKIAFYCVLGIICLALLIYL FT LLPVCGKMRDVLKKNEIEYMKELRGKRI" XX SQ Sequence 4551 BP; 1657 A; 725 C; 872 G; 1297 T; 0 other; agtagtgaac cgctgtgtat ttatactata gtagagttca ctttttagtt ggagcaatgg 60 caatcgttat tcttctagcc ctggtggcat taacagccca agcaccattg tcaaataggt 120 gctttgaagg cggtgtcttg gtcgaagaaa gaaatatgga tcatggaata gctgaattat 180 gcgttaaaga tgacataagt atgatcaaaa cgacttccct ccaaaagcgg aacgagagca 240 tttttacaaa tataattatg agaaaaatgt tgattcccaa ttatcaagag tgcaacccag 300 tcgaagttgc caatggcccc atcatgatct tcaaaccgga tagagattta atgttgattc 360 ctaaaacata cgcatgcaga gtagattgct ccatttctct agatagggat gaagcatcta 420 ttatactaca ctcagacaag ctaaacaatt ttgaggttat gggaactaca acagccacta 480 gatggtttca aggtagcact acatattcat tagagcatac ttgtgaacac atccaagtta 540 cttgtggatc aaagagtctt agtttccatg catgttttaa gtaccatatg gcatgtatca 600 ggttattgaa tagaagctat atgccagctt tcatgattca atctgtctgc caaaataaag 660 agatcatctt gatgacatgt ctagttttaa taatatttgg tttgttgtat ataatgacga 720 tgtcatacat atgctatatc ttaatgccaa tatttatacc tattgcttat ctatgggggt 780 ggctgtataa caaatcttgc aaaaaatgtg cttactgcgg ccttgcatat catcctttta 840 caaaatgtgg aaaaaactgt gtctgcggat caatgtttga aaattcggaa agaatgaaaa 900 tgcacagagt atcaggtcta tgtaagggat acaaatctct aagagctgct agaatcttat 960 gtaagagcag agggtcagct tttatactag ctgtaatact tgcaacattg ctgttatctt 1020 tcattcaacc attagaggct gtccagttga attacaataa tgaagtgata gaaatcactg 1080 aattgcccca agaactggat ataatattcc aagggttaaa gactacacaa tcaattatta 1140 tagcacaaat tacaatctgc acttgcataa tactaatact agcattgtat ttgatattgc 1200 ataagagaat agaagataga cttattaata gaattctata tttttgccct gaatgtcaaa 1260 tgacacaccc gaaaaatggt ctaaagaaat acttcgctgg agaattcaca aacatgtgta 1320 atagttgtat gtgtggatgc acttacaatc aagaagaatt aaatgacggt tatacaattc 1380 ctatgactca tcaacttaca gttggatgtt atgctccagg aagatattat acgcatagaa 1440 ggatgtcaaa tgggctaatc tatattgtac ttgcagtttt aatgatttta gcgtcattat 1500 caatagcagc tgcatcaacc gatgataatt gtgttaagag ttcaacatat aagactcaag 1560 agccagtatc atgctccgct tgggtcaaag taacaacatg ctcaagcagc tccggtattc 1620 agggtgtctt taaccactta aaattaccga aacaggaaag tgacattcta actaccatga 1680 agggaagctt ggattctatc ttgaaaaaat ctgaagatag tactgtgcca ttgcaggcat 1740 acattttgga atctttagca gtaagtttac attgctcgta tatagcaagt gcagccacag 1800 aaggggggaa tataaataca atgattgttt cgcaatacac ggataagccc ttagaaatat 1860 gcacagccaa caaagcatca aaactatgtg gatgtttaat gaggaaaaca acatgtgata 1920 caacaagtac tgaagatgtg gctacctatt ataagcaaca tccagaaata tacaaagttg 1980 attatgctag gatgattcaa actttaacta agtattttcc tggagtattt tctaaagaac 2040 tgttaattgc agttaagaat gctaatcaca caagggttaa atctataatt aaattactag 2100 atgctaaaat aacaaatgct cgaggcataa aatcaatatt taaaataatc gatgcagcat 2160 tatcagatac aacagtggca ggtgctgcgc ccccttcttc tgctgttaaa gatgttaaag 2220 catttgacac aaactggtta gctgagagca tattcaaaga tatagtaacc tcaagtgcaa 2280 tgaaaatttg cacaagtggc aagatatata gatgctttta ccctatgagt ctaagattca 2340 ctttctatta tagctgcaat gaggctaaca agttctacca aacaggtgaa tatccaattt 2400 cgaaacatca cagcaataat gctaatcttt gtgtagcaga tgcttactgt gaaaaggatt 2460 tcacaccggt cgatgcttca aataaggata tgttattaac tctgagatgt gaagagatca 2520 caataaacat taatgagctc cccagcgcgc ttcctgtaaa caaatgcaga gtgatatcaa 2580 tccaacattg cacggtatct gggaatagca acaagacagt agcagaatgt gcaaatgggt 2640 ttttctatga gtacaatggc gaccttcatc aaagtccaaa agatgatgtg gggatctatt 2700 gttttgataa aacctgcaag acaactaggt tcccacatca tccttcgaac ttacagggtt 2760 gcacatcgca taacgcagaa atgttgaata ggaaacttaa agagataaat tatacaaatt 2820 tagaacaact taagcatagc ttacaagaat cgattaagac agatttaata gagcataatt 2880 acatactcac aaagaattta ccaaaattaa atccaacatt caaagcaata tccatccaag 2940 gtgtagagac agatagtgga atacagagct catatataga aaccaatcta atggtcaaga 3000 ctggtctatc attaggattg catttgacta caaagtctgg tgatccttta tttgacataa 3060 taattttcgt caagacagca cattatgagg cagtttatga tgaaatttac caaactgggc 3120 caactgtggg aataaatgtt caacatgatg aaaaatgcac aggtagatgc cctgaaaatt 3180 taatgaagac aggttggcta tcttttgcta gggaacacac aagtcagtgg ggatgtgaag 3240 aatttggatg cctggctata aacgaaggtt gcctcttcgg tcattgcaaa gatataatta 3300 aaccagaaat gacagtcctc agaaagaatc aagaggaaac tccagtcata agaatatgta 3360 tctctctgcc tcaggagaca ttctgtcaac caattaatgc attcactgct attataactg 3420 ataaaattga aactcaattt atctcaaatg aggctggaag aatccctaaa ttactagggt 3480 ataagtcaaa tcgcatatat acaggaatga tcaatgactt gggaacattt tctaaaatgt 3540 gtggtagtgt tcagtcagta actgggaacg ttctaggagc tggtagcccg agatttgatt 3600 atatatgtca tgctgcgcaa agaaaagatg tcacagttag cagatgtttt gataattttt 3660 acgactcatg tcaaagattg gaacctactg ataatattgt ttatgataat aatgtgaaaa 3720 aagtgtcact gttaaataaa aacatgggcg aactcaggtt gaagattaaa ttaggagata 3780 taaattacaa attatatgag aaaatgccat ctttcgattt caaagggagc tgcgtaggat 3840 gcattaaatg tatcaaagga gtagattgcg aatttgacat acatgccaca ggggaatctg 3900 tttgtctttt aacatcaaac tgtaactttt accataataa cctaaagata gatccaaatg 3960 ttcaaaaata tggcatgaaa ggcaagtgta ctgatgaaaa gatatggata gacttgtgtg 4020 gtaataagat agagatacaa atatcgctgg ttcaatccca tgaaactatt gaagtaggaa 4080 atagtgatca gacatatttt gtgaaggaaa aagacaatag atgcggaaca tggttatgca 4140 aggttagcga gcagggtatc tcatctattt tcgccccatt ttttgctgtc tttggtgatt 4200 atgcaaagat tgcattctat tgtgtgctag ggataatatg tcttgcctta ttaatatatc 4260 tgctacttcc tgtatgtggg aaaatgaggg atgttctaaa gaagaatgaa atagaatata 4320 tgaaagagct taggggcaag agaatataaa tcagtgaatg gttaagatag agaggtagaa 4380 agtaatagac aaataaaaat atgtaggaat agagataaaa taaaataaaa acaaaataga 4440 aataaaaata aacaaaaata aaacaagaaa ataaaacaaa aataaaaata aaaataaaaa 4500 taaaaagaat aaaataagaa ataaaaatac aagacacagc ggttcactac t 4551 // ID MG029274; SV 1; linear; viral cRNA; STD; VRL; 1109 BP. XX AC MG029274; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Caraparu virus isolate BeAn3994 segment S, complete sequence. XX KW . XX OS Caraparu virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-1109 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-1109 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; c30b234730cbe83baee258119eb67f36. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1109 FT /organism="Caraparu virus" FT /segment="S" FT /host="Sapajus apella" FT /isolate="BeAn3994" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="15-Feb-1956" FT /db_xref="taxon:192196" FT CDS 29..346 FT /codon_start=1 FT /product="NSs protein" FT /db_xref="GOA:A0A2Z3DE64" FT /db_xref="InterPro:IPR000797" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DE64" FT /protein_id="AVX48953.1" FT /translation="MSIELISEVNNQRCQCLLLNLRMTTGAQLPLLLILNKPTMNLLAT FT TGRTSASIMLEFSSSALMRLNRNCVRALRRSLCLNLAVGRSRLLIIITPEMHQTRLQII FT V" FT CDS 67..774 FT /codon_start=1 FT /product="N protein" FT /db_xref="GOA:W8CZV1" FT /db_xref="InterPro:IPR001784" FT /db_xref="UniProtKB/TrEMBL:W8CZV1" FT /protein_id="AVX48952.1" FT /translation="MSVPTFEFTDDDRGPTASSFDPQQAYNEFIGNHGENLSVDNVRIF FT FLRANEAKQKLRKSSAKIAMLKFGSWKVEVVNNHYPGNASNPVADNSLTLYRISGFLAK FT YTLELHNDSEHRAEIEEKIVNPIAESKGVTWQAGAKIYLAFFPGTVMFLYEFEMLSLAI FT YLYRAQKDEIDPSLLKKPLRQKYKKDNPEKWMREKKVMIQGALGRIAKLPWGTTGLSAQ FT ARDFLKEFGITMK" XX SQ Sequence 1109 BP; 370 A; 184 C; 243 G; 312 T; 0 other; agtagtgaac ttcttaggaa gttcactaat gtcaattgag ttaatatcgg aggtaaataa 60 ccagagatgt cagtgcctac ttttgaattt acggatgacg acaggggccc aactgcctct 120 tcttttgatc ctcaacaagc ctacaatgaa tttattggca accacgggga gaacctcagc 180 gtcgataatg ttagaatttt cttcctccgc gctaatgagg ctaaacagaa actgcgtaag 240 agctctgcga agatcgctat gcttaaattt ggcagttgga aggtcgaggt tgttaataat 300 cattaccccg gaaatgcatc aaacccggtt gcagataata gtctaactct ctacagaatt 360 tcaggctttc tggctaaata cactctagaa ctgcacaacg actcagagca tagagcagaa 420 atagaagaaa agattgtcaa tccaattgct gagtcaaaag gagtgacatg gcaagctgga 480 gctaaaatct acttggcttt cttcccagga acagtaatgt tcctctacga gtttgagatg 540 ctttctctgg caatctatct atacagagca cagaaagatg aaattgatcc aagtctcttg 600 aaaaagcctc tcagacaaaa gtataagaaa gataacccag aaaaatggat gagagaaaag 660 aaagtgatga tccaaggagc cttgggaaga attgcaaagc ttccttgggg aaccactggg 720 ctctctgctc aggctagaga cttccttaag gaatttggca tcacaatgaa gtgagcctta 780 attgtacaaa ttaattaata tatagtataa accttaggtt aagttttaaa tagatttgca 840 aatggtagga taatggtaag gttaatggta agttttaaat ggtaagttta tgtaaatatt 900 ttaattttaa ataattattt aattgtaaat tggggtgggg ggaaataata gcagctgtcg 960 aaacgggtaa gggaaaacaa tctaatgggc ttttacatta tacaaaaatg ggttgggtgg 1020 ttggggaaag aagcagggct acatattttc tgcagtgtta tatatattca gtcatcactt 1080 ttgtcgaact ttctaagaag aacactact 1109 // ID MG029275; SV 1; linear; viral cRNA; STD; VRL; 6907 BP. XX AC MG029275; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Itaqui virus isolate BeAn12797 segment L, complete sequence. XX KW . XX OS Itaqui virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-6907 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-6907 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; 03b6698575df75c20ad23aa80c30cc22. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6907 FT /organism="Itaqui virus" FT /segment="L" FT /host="sentinel mouse" FT /isolate="BeAn12797" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="02-Sep-1959" FT /db_xref="taxon:348026" FT CDS 61..6807 FT /codon_start=1 FT /product="RNA-dependent RNA polymerase" FT /db_xref="GOA:A0A2Z3DE16" FT /db_xref="InterPro:IPR007099" FT /db_xref="InterPro:IPR007322" FT /db_xref="InterPro:IPR029124" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DE16" FT /protein_id="AVX48954.1" FT /translation="MAILLEDVIRQYSARIRNCNNPEIGRDILAEITMTRHNYFAQKFC FT EAIGIEYRNDVPAADIVLEMMPGLDLTRIRIPNITPDNYYRDGTKIYIIDFKVSVSDES FT AQHTYKKYDTLFGDVFNQLNIEYEVVIIRMNPSDMHLHISSDNFAALFPNIALNVTFDW FT YFRLRDDLFHQFRDNEEFMELIAHGEFTPTIPWVNENTPELFDHEVFQDFIASMPLENR FT EDFYYALNHNAFQSDKWNDLLHVIMRKYGDKYHDFIKANARRIFLTDDKYNRPTKVAIL FT QGWKEMVERVQEGRETVDDLSKQKPSIHFIWGPNSKNESNENNTKIIRLSKTLQSIKEK FT DSFSVAFKSIGSLMDFSEDIDKYEAFCAKLKADARSSLKPKSKKIDPIKIGPNLIKVGQ FT QFKMDTDIIPKEIRLKFLKEFCGIGNHKQFKDRMMEDLDLDKPKILNFENQDIKNQAYA FT MVQNTSYFMSKPSNLIKVGNVLEEFKDKIVNANEETWATIEEIAKTRYWQSINDFSVLI FT KNILAVSQYNKHNTFRIVCTANNNFFGLVYPSTSIQSKRSTIVFSTIVLHDNESDVLKC FT GALYKTYRVKNGYLSISKAIRLDKERCQRLVISPGLFMLTSLLFKGDCDVSLNDVMNFA FT FFTSLSITKSMLSLTEPSRYMIMNSLALSSHVREYIAEKFSPYTKTLFAVYMTDLIKRG FT CMAANDQRQQISIKDVFLNEFEITQKGVTNDRNLQSIWFPGKVNLKEYINQIYMPFYFN FT AKGLHNKHHVMIDLAKTVLEIELDQRVNLPTPWGVDLKKQSVNLDVLIFSIAKMLNLDT FT SRHNHLRSRVENRNNFKRSLASISTFTSSKSCIKVGDFKEFKERNALHMKKLQGKEAKK FT TRIANTEFVSDNDRDLEIVHSTYLDLRKSVPNYTDYMSTKVFDRLYERFKNHDFEDKPA FT IEIIMDVMRCHTDFKFCFFNKGQKTAKDREIFVGELEAKLCLYGVERIAKERCKLNPDE FT MISEPGDGKLKKLEINGESEIRYLIDATRNQTREQSKVDDLLDTPKGIKLEINADMSKW FT SAQDVFYKYFWLIALDPILYPFEKQRIIFFFCNYMNKELILPDEMMCSLLDQKAQREND FT LIRQMTNGFTTNTVNIRRNWLQGNLNYTSSYIHSCSMMVFKDILKEVSALLEGRCNVNS FT MVHSDDNQTSVIMVQDKLHNDIITNFVCNTFERCCLTFGNQANMKKTYITNHIKEFVSL FT FNIYGEPYSVYGRFLLPAVGDCAYIGPYEDMASRLSATQTAIKHGCPPSLAWVSIALNH FT WITFNTYNMLPGQINDPTKVFHFERRELPIELCGLLQADLSTIALVGLESGNISFLTSL FT LKKMSPPQLVKESGQAQCQNIESWDLDLLSESEIMKLKILRYVVLDSEVREDSTMGETS FT DMRSRSLITPRKFTTPASLERLISYKDFQEILADPTRTEDLLETMINNPELLVTKGENS FT EEFMTTILYRYNSKKFKESLSIQSPTQLFIEQILFANKPVIDYTGIQDRYLSILDVPRV FT QENEGIIGRKTIPETFVAIKRDLSLMNLDHKDIKLVYSFCILNDPLNTTACNAILLSQV FT QSLMDRSSLSAVTMPEFRNMKLIRYSPALVLRAYIHNNLTIAGANENAMRRDLFHLQEF FT INQTKIKERLDKRIQDNEEIRGERDRMFEIKETTKFYQACYDYIKSTEHRVKVFILPMK FT AYTAFDFCATIHGNLMKDKGWFAVHYLKQIVSGTAKASVSQAPASEMIIVGECFRLIAH FT FCDTFIDVGSRLQFLYNIIDNFTYKNMPVKNLLVLIIISSKRQHFLPILHWLEELTQHD FT IDKYDAYKANERVVWNDWQVNRDMNTGNIDLTIKGYQRTLRVIGEDDTLKIGELEILKG FT DTTPIETHGRKLLNSKHGLKFEKMQKYKIIEPNTYYICWQNRTRFSYTYQLLLSNIIEA FT RNSQTVSVTGGKFNELVPVCPVIVSRIDSDDKMNLRQIKYLNMDCSLTRLQLNQNEFAV FT VKRCHFSKMVFFNGPEMIVGNINITNLIQTPSLLTTNYPSLSQMPMMTLTRIFNCNGEE FT KEVDEFEFLSDEILEETETAVINAQPMFNIQYETKSKKGYTYKRALQEALSRGIQEIEH FT NFDFCKDGFYSPKNIAIIALLVNVIDRLHTNEWSSIIRKSFHMCFFNNGKDKLFHMMNI FT PKTFIKNPIGEVPNWEKIRTFIIQIDTAYPGNNWHQMFEHFKEKCILLIDREIKMEGMS FT WGEMLDELDDYKDTEMFHFN" XX SQ Sequence 6907 BP; 2569 A; 1052 C; 1304 G; 1982 T; 0 other; agtagtgtac ccttgcttca gttatacaat tactaaagtg caagaaaaga acataaagga 60 atggctattc tcttagaaga tgttatccgg cagtattctg ctaggattag aaattgcaac 120 aatccagaga tcggaagaga tatcttagca gaaataacta tgacaaggca taattatttt 180 gctcaaaaat tttgtgaagc tataggaatc gaatatagga atgacgtccc agctgcagat 240 attgttctgg agatgatgcc aggattagac ttgacacgaa ttaggattcc taatatcacg 300 ccagataact actatcgtga tgggacaaag atctatatta tagatttcaa ggtatcagtt 360 agtgacgaat cagctcaaca tacttacaaa aagtatgata ctttgtttgg agatgttttc 420 aatcaattga atattgaata tgaagtagtt ataatacgaa tgaacccaag cgatatgcac 480 ctacatattt ctagcgacaa ttttgcagca ttgtttccca acattgctct caatgtaaca 540 ttcgattggt actttagatt acgagatgat ttgttccatc aattcaggga caatgaagaa 600 ttcatggaat taatcgctca tggagaattc actccgacta taccttgggt aaatgaaaat 660 actcccgaat tatttgacca tgaagttttt caagacttta ttgcatcgat gccattagaa 720 aatagggagg atttttacta tgctttaaat cacaatgcgt ttcaatcgga taaatggaat 780 gatttattac atgtgattat gaggaaatat ggtgacaaat atcatgattt tattaaggcc 840 aatgcaagga ggatattttt aactgatgat aaatacaatc ggcccacaaa agtagctata 900 ttgcaaggtt ggaaagagat ggttgagaga gttcaagagg ggcgtgaaac cgtagatgat 960 ttgtctaaac agaagcctag tatccatttt atctggggac ctaactcaaa gaatgaatca 1020 aatgaaaaca ataccaaaat aatccgcttg tcgaaaactt tacaatctat aaaagaaaag 1080 gattctttca gtgttgcttt caaaagtata ggatcattga tggactttag tgaagatatc 1140 gataaatatg aagcattttg tgcaaaattg aaagcagatg ctaggtccag tttgaagccg 1200 aaaagcaaaa agatagatcc aattaaaata ggaccgaatt tgattaaagt ggggcagcag 1260 ttcaaaatgg atacagatat aataccaaaa gaaattaggc taaaatttct caaagaattc 1320 tgtggaattg gtaaccacaa gcagttcaaa gatcggatga tggaagatct tgatcttgac 1380 aagcctaaga ttctgaattt tgagaaccaa gacataaaga atcaggctta tgcaatggtg 1440 caaaacacat cttatttcat gtccaaacct agcaatttga ttaaagtggg gaatgtatta 1500 gaagaattca aagacaaaat tgttaatgct aatgaagaaa catgggcaac aatagaagaa 1560 attgccaaaa ctaggtactg gcagagtata aatgactttt ctgttttaat caaaaacata 1620 ttggctgtat ctcaatacaa caagcataat acatttagaa tagtatgcac tgcaaacaat 1680 aatttcttcg gcttagtata tccttcaaca agtatacaat ccaaaagatc aactatagtc 1740 ttctcaacta ttgtactgca tgacaatgag tcggatgttt tgaaatgtgg ggctttatac 1800 aaaacatata gagtgaagaa tggctactta tcaatatcta aagcgattag actggataaa 1860 gaacgctgcc agaggctggt aatatcccct gggctattca tgttaacttc attgcttttt 1920 aaaggagatt gcgatgtgag tttaaatgat gttatgaatt ttgcattttt cacctctctg 1980 tcaataacaa aaagtatgtt atcactgaca gaaccatcta gatatatgat catgaattcg 2040 ttagcactat ctagccatgt aagagaatat atagctgaaa aattctcgcc atacacaaaa 2100 acactttttg ctgtgtatat gacagaccta atcaaacgag ggtgtatggc tgccaacgat 2160 caaaggcaac aaatatcaat aaaagatgtt tttttgaatg aattcgaaat cactcaaaaa 2220 ggggtgacaa atgacagaaa cttacaatcg atatggtttc caggaaaagt taatctgaag 2280 gaatacataa accagatcta tatgccattc tatttcaatg ctaaaggatt acataacaaa 2340 catcatgtga tgatcgacct tgctaaaaca gtactagaaa tagaattaga tcaaagagta 2400 aacttgccaa caccttgggg tgttgatcta aaaaagcagt cagtcaattt agatgtctta 2460 atattttcta tagctaaaat gttaaattta gacacgtcta gacataacca tttaagaagt 2520 agggttgaaa ataggaataa tttcaaaagg tcattagcaa gtatctccac tttcactagt 2580 tctaaatcat gtataaaagt gggagatttc aaggaattca aagaaagaaa tgctttacat 2640 atgaaaaaat tacaagggaa agaggcaaaa aagaccagaa tagcaaatac agaatttgtc 2700 tcagacaatg acagagattt agaaatcgtc cacagcactt acttagacct acgaaaatct 2760 gtaccaaact atacagatta tatgtcaaca aaggtatttg atagattgta tgaaagattt 2820 aaaaatcatg attttgaaga caaacctgca atagaaataa taatggacgt aatgaggtgt 2880 catactgatt ttaaattttg cttttttaac aaagggcaaa aaactgccaa agaccgagag 2940 atattcgttg gagaactaga agcaaagctg tgtctatacg gtgttgagag aatagctaag 3000 gagagatgta agttgaatcc agatgaaatg atttctgaac ctggcgatgg caagctaaaa 3060 aagctagaga taaacggaga atcggaaatt agatatttaa ttgatgctac taggaatcaa 3120 actagagaac agtcaaaagt ggatgatctc ttggatacac caaaagggat taaattggaa 3180 ataaatgcag atatgtcgaa atggagtgct caagatgttt tctacaaata tttttggctt 3240 atagcattag accctatatt atacccattt gaaaaacaga ggatcatttt tttcttttgc 3300 aattacatga acaaagaact gatcctacca gatgagatga tgtgctcatt gttagatcag 3360 aaagcccagc gcgaaaatga cttgattagg caaatgacaa atggattcac tacaaacact 3420 gtcaatataa gaaggaactg gctccaaggt aatctgaact acacttctag ttatatacat 3480 agttgttcta tgatggtctt caaggatatc ttgaaagaag tgtctgcttt gctagaaggg 3540 agatgtaatg tcaatagtat ggttcattct gatgacaacc aaacctctgt tataatggta 3600 caggataagt tgcataatga tataattaca aattttgtat gtaatacatt tgaaagatgt 3660 tgcttaactt ttggcaatca agccaatatg aagaaaacat atattacaaa ccacataaaa 3720 gaatttgtga gtcttttcaa tatttatgga gaaccttatt ctgtatatgg ccgtttccta 3780 cttccggcag ttggggattg cgcatatatt ggaccttacg aagatatggc gagcagatta 3840 tcagcaactc agacagcaat taaacatggg tgccctccaa gcttagcatg ggtaagcatt 3900 gcactgaatc attggattac attcaacaca tacaacatgt taccagggca aatcaatgat 3960 cctaccaaag tattccattt tgaaaggagg gaacttccta ttgaattatg tggtttatta 4020 caagcagacc tatcgactat tgctttagtc ggtttggagt cggggaatat ctcgtttcta 4080 acatcgctac tgaaaaagat gtctccaccc caattggtga aagaatcagg tcaggcacag 4140 tgtcaaaata ttgaatcatg ggatttagac ttgttatctg aaagtgagat aatgaagtta 4200 aagattttaa ggtatgtagt tcttgactcg gaggtgaggg aagatagtac tatgggagag 4260 actagtgata tgcgtagtag atctctaatt acaccacgga aattcacaac cccagcatct 4320 ttagaacgac ttatttctta caaagatttc caagagatat tggctgaccc aactagaaca 4380 gaagatctac tggaaactat gataaataac cctgaattgt tagttacaaa aggggaaaac 4440 tctgaagaat ttatgacgac tatattgtat agatataact caaagaaatt caaagaatct 4500 ttatccatac agagtcctac ccagctattt atagaacaaa tactgtttgc gaacaagcct 4560 gtcatagatt atacaggtat tcaagataga tatttaagta tcctcgatgt ccctagagtt 4620 caggaaaatg agggaataat aggaaggaaa acaattcctg aaacttttgt tgccatcaag 4680 agagatttat ctttaatgaa cttagatcat aaagacataa agttagtata ttctttttgt 4740 attttgaatg atcctctaaa tacaacagca tgcaatgcta tattattatc acaagtccaa 4800 tcacttatgg atagatcaag tctatcagca gttacaatgc ctgaatttag aaatatgaaa 4860 ttaatacgat attcaccagc cttagttctg cgagcttata tacacaataa tttaaccatt 4920 gctggagcta atgaaaatgc gatgagacga gatttattcc atctacaaga gtttatcaat 4980 caaactaaga taaaagaaag gttagataaa agaatccaag acaacgaaga gatcagaggt 5040 gaaagagata ggatgtttga aattaaagaa actaccaaat tttaccaggc ttgttatgat 5100 tatattaaat ccactgaaca tagagtgaaa gtctttattt tgcccatgaa ggcatataca 5160 gcatttgatt tctgtgccac aatacatggc aatcttatga aagataaagg gtggtttgct 5220 gtccattatc taaaacaaat agtttctgga accgctaaag caagtgtcag ccaagcacct 5280 gctagtgaaa tgattattgt gggcgagtgt ttcagattaa tagcacactt ctgcgacaca 5340 tttattgatg ttggatctag attacaattc ttatataata taatagataa cttcacatat 5400 aaaaatatgc ctgtgaaaaa tttgttagtt ctcatcataa tctctagcaa aaggcaacac 5460 ttcttgccaa tacttcattg gttggaagag ctaacacaac atgatataga caaatatgat 5520 gcatataagg caaatgaaag ggttgtatgg aatgattggc aagttaatag agatatgaat 5580 actgggaaca ttgacttgac aattaagggt taccagcgca cattgagagt aattggtgaa 5640 gacgatactc taaaaatagg ggaattagaa attttaaaag gagatacaac cccaatagaa 5700 actcacggaa ggaagttact caactcaaag catggtttga aattcgagaa aatgcagaaa 5760 tataagatta tagaacccaa tacgtattat atctgctggc agaatagaac tagattctct 5820 tacacttatc aacttttgct atcaaatata atagaagcac gtaattctca aactgtatct 5880 gtcacaggtg gtaagtttaa tgaactagtt ccagtatgcc cagtgatagt aagcagaatc 5940 gactctgatg ataaaatgaa tttaaggcaa attaagtatc ttaatatgga ctgctcatta 6000 actcgacttc agttaaatca gaatgagttt gctgtagtga aaagatgcca tttttccaaa 6060 atggtatttt tcaatggacc agaaatgata gtcggcaata taaatataac aaatttgatt 6120 caaactccaa gtttattaac aacaaactac ccatcattat ctcaaatgcc aatgatgaca 6180 ttgacaagaa tatttaactg caatggtgaa gaaaaagagg tagatgaatt tgaatttttg 6240 tcggatgaaa tactagaaga aacagaaact gcagtgataa atgcgcagcc aatgttcaat 6300 atccaatatg aaacaaaatc aaagaaaggc tatacatata aaagagcttt acaagaagct 6360 ttatctagag ggatacaaga aattgagcac aactttgact tctgcaaaga tggattttat 6420 tcaccgaaga atattgcaat aatagcattg ctagttaatg ttatagatag attgcataca 6480 aacgaatggt caagcatcat acggaaatct tttcacatgt gttttttcaa caatgggaaa 6540 gataaattgt ttcatatgat gaatatacct aaaacattta ttaaaaatcc gattggggaa 6600 gttccaaatt gggagaagat tagaacattc ataatccaaa tagatacggc ttatcctggc 6660 aataattggc atcaaatgtt tgagcatttc aaggaaaaat gcatattatt gatagataga 6720 gaaattaaaa tggaaggcat gagctggggc gaaatgctag atgagctcga tgattataaa 6780 gataccgaaa tgtttcactt caattaaact gatctagagg gtatcttagt agtaggagag 6840 gcaaaggaaa gagtaacata tgatgaggat aggtaaacaa aatgaaaaca aacaagggaa 6900 cactact 6907 // ID MG029276; SV 1; linear; viral cRNA; STD; VRL; 4573 BP. XX AC MG029276; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Itaqui virus isolate BeAn12797 segment M, complete sequence. XX KW . XX OS Itaqui virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-4573 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-4573 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; 01e1040bb71cefa464351a62651956e8. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..4573 FT /organism="Itaqui virus" FT /segment="M" FT /host="sentinel mouse" FT /isolate="BeAn12797" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="02-Sep-1959" FT /db_xref="taxon:348026" FT CDS 47..4333 FT /codon_start=1 FT /product="glycoprotein" FT /db_xref="GOA:A0A2Z3DE59" FT /db_xref="InterPro:IPR005167" FT /db_xref="InterPro:IPR005168" FT /db_xref="InterPro:IPR017896" FT /db_xref="InterPro:IPR026400" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DE59" FT /protein_id="AVX48955.1" FT /translation="MLLLLVAFISLVGGIPLNNRCFDGGVIVEEKSMSHGIAELCIKDD FT ISMIKTSSTQIRNTTKFSNRIMRKMLVQNYQDCNPIETSNGPIMIFQPNRELTLVPRTY FT ACRMDCSISLDKEEATIILHSDKLNHYEVMGTTTATRWFQGSTSYSLEHTCEHIQVTCG FT SKTLNFHACFKYHMACIRLLNKSYMPSFMIQSVCRNKEIILMTCLILIIFALLYILTLS FT YICYILLPIFIPIAYLWGWLYNKSCKKCQYCGLAYHPFTKCGKNCVCGSMFENSERMKM FT HRTSGLCKGYKSLRAARILCKSRGSAFVLAVLLATLLLSFIQPLEAIQLKYNDSEINLP FT ELSYELDLIFHRMEVKKYNSVNTNHNLEYYHASNHSESNFYEKIEDRILDKYLYYCPEC FT QMTHPQSGLKHYFNGEFTNMCNSCMCGCVYNQEELNSPDGYMVPMTHKLTVGCYAPARY FT YTLRKITGFSGKVIIFLLTILISMSIAAAGKCDKANYYQIDEPVECSAWIKVSECNQAS FT SLETLMAKLKIAQIDKDLIKSTKTSLLELLIKSQDSTSPIGSYVMEDLAVTLYCKEIAK FT MNQENGEYNHLMKTLFVGKELEVCSNGKISKACNCIVGKQNCDYNSLGEALTFYKSHVE FT IYKSDVVRMVQAIAKVFPGIMAKELLLASKESNYTRILTILKILEPKLVNAKAIITVVK FT ILGKVLADTTVAAIKIPENSVKGVKPFDAEWIKKSIFDNMQTSTDRKTCTGATLYRCYY FT PVSLRFTFIVKCDEENKFYTSGEYMIATHYSTPDNLCVGDPFCELDFKGMDSNRKEELQ FT TFRCTEEKMTQQSQENSRPIPKCKVVSTQSCTVGSTGNRSVAECANGYFYEYVGELHQS FT SKDNIGIYCFEKGCKQNRFPHHVDNLRGCTSHNSEMINRRLREINYSSLEQLKHSMQEA FT IKTDLVEHNYILTKNLPKVTPAFRALSIQGVETDRGIESSYIETNLLVRTGMASGLTLT FT TKNGDPLFDIILFVKSAHYDALYEEIYKTGPTVGINMQHNERCTGSCPQNMTKIGWLSF FT SKEHTSQWGCEEFGCLAINEGCVYGHCRDLIKPDISILRKAQEEEPVIKICFTLPHETL FT CQNINSFNAIITEKFEIQFLSNEAGKIPKLLGYKSHKILSGMINDLGTFSKMCGSVQQT FT IHGVFGAGIPRFDYVCHAAQRKDVTISRCFDNFYESCSLLEERQDIVFDSSTKKISLLN FT RNMGELRVKIKLGDINYKLFEKTPSFDLKGSCVGCINCIRGVDCELEIIASSESVCMLN FT SDCTFYHNNLKIDPNTQKYGLKAKCQSEKIWVELCGNKIEIQVSISKVSETIEVGNSDQ FT TYFVKEHDMRCGTWLCKVSEQGISSIFAPFFAVFGEYGKIAFYTILGVLVAALIIFLSL FT PIMGKVRDMLKKNEYEYLKETLGKRR" XX SQ Sequence 4573 BP; 1608 A; 757 C; 892 G; 1316 T; 0 other; agtagtgaac cgctgcgtgt ttataatata gttgttcact tacaagatgt tgttgttgct 60 agtagcattc atatcgctcg ttggcggcat accgctaaac aatcgttgtt ttgacggagg 120 agttattgtg gaagaaaagt caatgtctca tggcatagca gaactctgca taaaagatga 180 tataagtatg attaaaacct catctacaca aattcggaat actacaaaat tcagtaatag 240 gatcatgagg aagatgctgg tacaaaatta tcaggactgt aatcctattg agacatcaaa 300 tgggcctatt atgatttttc aaccaaaccg agagctcact ctagtgccta gaacttatgc 360 ttgtaggatg gactgctcaa tttcattgga caaagaagag gcaacaatta ttctccactc 420 agacaaacta aatcattatg aagtgatggg gacaaccaca gctacacgtt ggtttcaagg 480 gagcacaagc tattcgctgg agcatacatg tgaacatata caagtcactt gcggttcaaa 540 aactctcaat ttccacgcat gtttcaaata tcacatggca tgtattagac tcctgaacaa 600 aagctatatg ccttcattca tgattcaatc tgtttgtaga aataaagaaa taattttaat 660 gacttgcttg atcttaatta tctttgcatt gctgtatata ttaacattat catacatctg 720 ttatatttta ttgccaattt ttatacctat agcatacctc tggggttggc tctacaacaa 780 atcatgcaaa aaatgtcaat actgcggtct agcataccat ccattcacca aatgcgggaa 840 aaattgtgtt tgtggttcaa tgtttgagaa ttctgaaaga atgaagatgc accgcacctc 900 tggattgtgc aaaggttata aatctttgag agctgcaaga atactatgca aaagccgagg 960 gtcagcattt gtgcttgcag tattgcttgc taccttactt ctatctttca ttcagcctct 1020 tgaagcaatt cagctgaaat ataacgacag tgaaatcaat ctcccagaac tttcatacga 1080 actagattta atatttcata gaatggaagt aaaaaaatat aatagtgtta acacaaatca 1140 taatcttgag tactatcatg cttctaatca ttctgagtct aatttttatg aaaaaataga 1200 agacagaata cttgacaagt atttgtatta ctgtcctgaa tgtcaaatga cccatcctca 1260 aagtgggctt aaacactact ttaatgggga attcaccaat atgtgcaatt catgtatgtg 1320 tggatgtgta tacaatcaag aagaattgaa ctcaccggat ggatatatgg ttccgatgac 1380 tcataaactg acagttggct gttatgctcc agctagatat tacactttga ggaaaataac 1440 aggatttagc ggcaaagtta taatatttct tttgactata ttgatatcta tgtcaatagc 1500 agcagctggg aaatgcgata aagctaatta ttatcaaatt gacgagcctg tagaatgctc 1560 tgcttggata aaagtttctg aatgcaacca ggcatcatct ttggagacat taatggcaaa 1620 attaaaaata gcccaaattg acaaagatct tattaaatcc acaaaaacat cattgctcga 1680 gttgctaatc aagtctcaag attcaaccag cccaataggt tcatatgtca tggaggatct 1740 agcagtaact ttatattgta aggaaatagc aaaaatgaac caagagaatg gggaatataa 1800 tcacttaatg aaaactctct tcgtagggaa agaattagag gtgtgttcta atgggaaaat 1860 tagcaaagca tgtaattgta ttgtagggaa gcagaactgc gattataact cattgggtga 1920 agctcttact ttctacaaaa gtcatgtaga aatttacaaa agcgatgtgg tcagaatggt 1980 ccaggcaatt gccaaagtct ttcccggcat aatggctaaa gaattactat tggcatcaaa 2040 ggaatctaat tatacaagaa tcttaacaat tttgaaaatc ttggaaccta aattagtcaa 2100 tgccaaggct ataatcactg ttgtcaagat attagggaaa gtattggcag acacaactgt 2160 tgctgctatt aaaatcccgg aaaattctgt caaaggagta aaacccttcg atgcagagtg 2220 gatcaaaaaa agcatatttg acaatatgca gacatcaaca gaccgtaaaa catgcacagg 2280 agcaacatta tatagatgtt attacccagt cagtcttaga ttcaccttta ttgttaaatg 2340 tgatgaagaa aacaaatttt atacatctgg agagtatatg attgctacac actattcaac 2400 cccagacaat ttgtgtgtag gagacccctt ttgcgaatta gacttcaaag gtatggattc 2460 caacagaaaa gaggagctac aaacctttag atgtacagaa gagaagatga cacagcaaag 2520 tcaagaaaat tcacgaccaa ttccaaaatg caaagttgtc tcaacacaaa gctgtactgt 2580 tggttcaact gggaatagaa gtgttgctga atgtgctaat ggatattttt atgagtatgt 2640 tggagagctc catcaaagtt ctaaagataa tatcggcatc tactgctttg aaaaaggttg 2700 caaacaaaat agattcccac atcatgtaga caacctcaga ggctgcacat cgcataattc 2760 agagatgatt aataggaggt tgagagaaat taactattct agcttagaac agctaaagca 2820 tagtatgcaa gaagcaatta aaacagattt ggttgagcat aattatattc tgacaaaaaa 2880 cctcccaaag gtaacacctg ccttcagagc tttgtctatt caaggtgttg agacagacag 2940 aggaatagaa agttcataca ttgagacgaa cctactggtt cgaactggga tggcatctgg 3000 cttgacattg acaacaaaaa atggcgatcc cctctttgat attatattat ttgttaaaag 3060 tgctcattat gatgccctat atgaagaaat atacaaaacg ggaccaacag taggaataaa 3120 tatgcaacat aatgaaagat gcactggaag ctgtcctcaa aatatgacta aaatcggttg 3180 gttgtccttc tcaaaagagc atacaagcca atggggctgc gaagagtttg ggtgcttagc 3240 aattaatgaa ggatgtgtgt atggacattg tcgagatcta ataaaaccag atatatccat 3300 attgaggaaa gctcaagaag aagaacctgt gatcaaaata tgctttacat tgcctcatga 3360 gactctttgt caaaacataa attcattcaa tgcaataatc actgagaaat ttgagatcca 3420 atttctatca aatgaggcag gaaagatccc aaaattatta gggtataagt cccataaaat 3480 cctgagtgga atgataaatg atttaggaac attctccaaa atgtgtggaa gtgttcagca 3540 gacaatccat ggtgtcttcg gggcgggtat cccgagattt gattatgttt gtcatgctgc 3600 ccagaggaaa gatgttacaa ttagccgctg ttttgacaat ttttatgaat cttgctcctt 3660 actagaagaa agacaagata ttgtttttga ttcatcaacg aagaagatct cacttttaaa 3720 tagaaatatg ggggaactcc gcgtcaagat aaaattaggt gacataaatt acaaattgtt 3780 tgaaaagaca ccctcatttg atttgaaagg atcttgtgta gggtgtataa attgcatcag 3840 gggggttgac tgtgaactgg agataatagc tagttcagag tcagtctgta tgctaaattc 3900 agactgtaca ttttaccata ataacctcaa aatagatcca aacactcaaa aatatggctt 3960 aaaagcaaaa tgccaatcag aaaaaatatg ggttgagttg tgtgggaata aaattgaaat 4020 ccaagtaagc atttctaaag tctcagaaac catagaggtt ggcaacagtg atcagacata 4080 ttttgtcaag gaacatgaca tgagatgtgg aacctggcta tgtaaagtta gtgagcaagg 4140 aataagctct atatttgcgc cgttctttgc tgtattcgga gaatatggaa agatagcctt 4200 ttacactatc cttggtgtcc ttgttgctgc actgataatc tttttatctt tgccgatcat 4260 ggggaaagtt agggatatgc taaagaagaa tgaatatgag tatcttaaag agactctcgg 4320 gaagaggaga tagaaacaaa agaccaaaat agagagaggt aaatacaaga tgaatggaaa 4380 agtaaaaaga taagaaatat caaaacaaaa ataaaacttc ttggcttttg gcagactcaa 4440 acctattttg gaactttcaa aatgcattaa tgtgtatagg atttttattt taaaaatgct 4500 acttatgctc tgttcattct acatgctatg aaattatggt gaaataaaat gtcagacaca 4560 gcggttcact act 4573 // ID MG029277; SV 1; linear; viral cRNA; STD; VRL; 1109 BP. XX AC MG029277; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Itaqui virus isolate BeAn12797 segment S, complete sequence. XX KW . XX OS Itaqui virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-1109 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-1109 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; 09bfda4dde0f72fa4c5448dc1edcb908. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1109 FT /organism="Itaqui virus" FT /segment="S" FT /host="sentinel mouse" FT /isolate="BeAn12797" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="02-Sep-1959" FT /db_xref="taxon:348026" FT CDS 29..346 FT /codon_start=1 FT /product="NSs protein" FT /db_xref="GOA:A0A2Z3DG35" FT /db_xref="InterPro:IPR000797" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DG35" FT /protein_id="AVX48957.1" FT /translation="MSIELISEVNNQRCQCLLLNLRMTTGAQLPLLLILNKPTMNLLAT FT TGRTSASIMLEFSSSALMRLNRNCVRALRRSLCLNLAVGRSRLLIIITPEMHQTRLQII FT V" FT CDS 67..774 FT /codon_start=1 FT /product="N protein" FT /db_xref="GOA:A0A2Z3DKY2" FT /db_xref="InterPro:IPR001784" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DKY2" FT /protein_id="AVX48956.1" FT /translation="MSVPTFEFTDDDRGPTASSFDPQQAYNEFIGNHGENLSVDNVRIF FT FLRANEAKQKLRKSSAKVAMLKFGSWKVEVVNNHYPGNASNPVADNSLTLYRISGFLAK FT YTLELHNDSEHRAEIEEKIVNPIAESKGVTWQAGAKIYLAFFPGTVMFLYEFEMLSLAI FT YLYRAQKDEIDPSLLKKPLRQKYKKDNPEKWMREKKVMIQGALGRIAKLPWGTTGLSAQ FT ARDFLKEFGITMK" XX SQ Sequence 1109 BP; 366 A; 182 C; 249 G; 312 T; 0 other; agtagtgaac ttcttaggaa gttcactaat gtcaattgag ttaatatcgg aggtaaataa 60 ccagagatgt cagtgcctac ttttgaattt acggatgacg acaggggccc aactgcctct 120 tcttttgatc ctcaacaagc ctacaatgaa tttattggca accacgggga gaacctcagc 180 gtcgataatg ttagaatttt cttcctccgc gctaatgagg ctaaacagaa actgcgtaag 240 agctctgcga aggtcgctat gcttaaattt ggcagttgga aggtcgaggt tgttaataat 300 cattaccccg gaaatgcatc aaacccggtt gcagataata gtctgactct ctacagaata 360 tcaggctttc tggctaaata cactctagaa ctgcacaacg actcagagca cagagcagaa 420 atagaagaga agattgtcaa tccaattgct gagtcaaaag gagtgacatg gcaagctgga 480 gctaaaatct acttggcttt cttcccaggg acagtaatgt tcctctacga gtttgagatg 540 ctttctctgg caatctatct atacagagca cagaaagatg agattgatcc aagtctcttg 600 aaaaagcctc tcagacaaaa gtataagaaa gacaacccag aaaaatggat gagagaaaag 660 aaagtgatga tccagggagc tttgggaaga attgcaaagc ttccttgggg aaccactggg 720 ctctctgctc aagctagaga cttccttaag gaatttggca tcacaatgaa gtgagcttaa 780 ttgtacaaat tgattaatat atagtataaa ccttaggtta agttttaaat agatttgtaa 840 atggtaggat aatggtaagg tcaatggtaa gttttaaatg gtaagtttat gtaaatattt 900 taattttaaa taattattta attgtaaatt ggggtggagg gaaataatag cagctgttgg 960 gatgggtaag ggaaaacaat ctaatgggct tttacattat acaaaaaatg ggttgggtgg 1020 ttggggaaag aaacagggct acatattttc tgcagtgtta tatatattca gtcatcactt 1080 ttgtcgaact ttctaagaag aacactact 1109 // ID MG029278; SV 1; linear; viral cRNA; STD; VRL; 6931 BP. XX AC MG029278; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Madrid virus isolate BT4075 segment L, complete sequence. XX KW . XX OS Madrid virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-6931 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-6931 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; c39d0efca9e53be8c181703187ca617f. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6931 FT /organism="Madrid virus" FT /segment="L" FT /host="Homo sapiens" FT /isolate="BT4075" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="03-Mar-1961" FT /db_xref="taxon:348013" FT CDS 55..6801 FT /codon_start=1 FT /product="RNA-dependent RNA polymerase" FT /db_xref="GOA:A0A2Z3DIT3" FT /db_xref="InterPro:IPR007099" FT /db_xref="InterPro:IPR007322" FT /db_xref="InterPro:IPR029124" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DIT3" FT /protein_id="AVX48958.1" FT /translation="MAILLGDVIRQYTARIRACTNPEVGRDILAEITMTRHNYFAQQFC FT EAINIEYRNDVPAADIILEMLPALDLTTIKVPNITPDNYYRDGTKIYIIDFKVSVSDES FT ALHTYKKYDTLLGDVFNQLNVDYEVVIIRMNPSDMHLHISSDNFANLFPNIVLNLDFTW FT YFRLRDDLFQQFRDNEEFMELVAHGEFTPTIPWVIDDTPELYTHPVFLEFVGSMPDDTT FT EDFFYALNHNAFQADKWNDLLHIMMRKYGTYYDKFIRDQAKNVFLLDNNYNKPSKEEIL FT KGWSEMVERIRDQRNVIDDCSKQKPSIHFIWSPNDKNSSNENNTKLIKLAKKLRSIKDT FT DTFSLAFKNIGHLMDFSEDVEKYETFCLKLKAEARSSLKPKSSKVTPITIGKCTVLWEQ FT QFKLDTEIIPKEVRIRFLKEFCGIGNHKQFKDRMMDDLDLGKPKILNFENPEIKNQAHI FT MMKNTQCFMSKESGLKKIGNVLEEFEYKIKDANPKTWETIGEVANSRYWQAINDFSVLI FT KNILSVSQYNKHNTFRVVCTANNNFFGILYPSASIKSRRSTVVFSSVCLHENENEILKC FT GALYQTYKVKGGYLSISKAIRLDKERCQRLVTSPGIFLLTTLLFKSDNDVNLNDVMNFA FT FFTSLSITKSMLSLTEPSRYMIMNSLALSSHVREYIAEKFSPYTKTLFSVYMTDMIKKG FT CMSANEQRQMISIRDVFLNEFEITQKGVSNEKNQQSIWFPGKISLKEYINQIYMPFYFN FT AKGLHNKHHVMIDLAKTVLEIELDQRMNIPEPWSLDMKKQSANLYLLIFSISKMLNMDT FT SRHNHLRSRVENRNNFKRSLTSISTFTSSKSCIKVGNFKDLKEKTAKHIKKINEKDARK FT TRIANTEFVDESERDFEITKSTYMDLIRCVPEYTDYISTKVFDRLYEKYKLEEIEDKPA FT IEIIMDTMKNHKDFKFCFFNKGQKTAKDREIFVGEFEAKLCLYGVERIAKERCKLNPEE FT MISEPGDGKLKKLEINAESEIRYLIDATRNQNAEQSIIDDILDTPKGIKLEINADMSKW FT SAQDVFFKYFWLIVLDPILYPAEKKRIIYFFCNYMNKELILPDEMMCSLLDQKAEREND FT LIKEMTNGFRRNTVNIRRNWLQGNLNYTSSYIHSCSMMVFKDIMKEVASLLEGRCNVSS FT MVHSDDNQTSVIMVQDKIDNDIITNFVCTAFEQCCLSFGNQANMKKTYITNHIKEFVSL FT FNIYGEPFSVFGRFLLPAVGDCAYIGPYEDMASRLSATQTAIKHGCPPSLAWVSIALNH FT WITFNTYNMLPGQINDPTKVFLFDRRELPIELCGILQADLATIALVGLEAGNISFLTNL FT LRKMSPPQLVKESVQSQCSNIENWDMDCLSDSEILKLKLLRYVVLDSEISEDSKMGETS FT EMRSRSLITPRKFTTTSSLEKLISYKDFQEIIVNSEKTEELLERILGKPELLVTKGENS FT TEFMTTILFRYNSKKFKESLSIQSPTQLFIEQILFANKPVIDYTGIQDRYLSVLDMPKV FT QSGEGIIGRKTIPETFSAIKKDLSQLPLEPADVKLIYSFCILNDPLNTTACNALLLSQI FT QSLLERTSMSAVTMPEFRNMKLIRYSPALVLRAYIHGDLSVGGANEDAMRRDIFHLNEF FT IIQTRIRERLDQRIIENQEIKGERDRLFEIKELTKFYQACYDYIKSTEHKIKVFILPSK FT AYTAFDFCATIHGNLMRDDGWFSVHYLKQIVSGTAKANISIAPASEMVIVEECFKLLSH FT FCDTFIDTSSRLTFALNVIENFSYKNIPVKELLNLMKHSFRRQQFIPLLYWIGELSQED FT LDKYDAFKTSERVSWNDWQINRTLNTGTVDLTIKGYQRTLRIVGEDDFLQIAELEILKG FT DNTSIETHGRKLLNCKHNLRFEKMRKYQIMEPNTYYICWQMRTRFAYTYQMLLSNIIEA FT RNSQTVSVTGGKFNELIPVCPVIVGRIDSMERINLRQVKYLNMNCSLSRLQLTQKEFVT FT VKRSHFSKMIFFQGPNLIVGNMNLTNLIRTPTLLTTNYPSLSQVPMMTLTRIFHCIGDE FT DQTDEFEFLSDELLEDIETTTVNTVPIFNAQYEVKSKKGYTYKQALQDALRRGIEEIEN FT TLDFCGDGFYSPKNLAIIALLTNLIDRLQTNEWSTILQTAIHMSFFHNGKDRMYHLMKI FT PKAFVKNPIGEILNWEKIRTFVIQLNTRNPGNHWDQMFNHFREKTLILIDREIKMEGMS FT WGEMLDELDDYKDTEMFHFE" XX SQ Sequence 6931 BP; 2599 A; 1065 C; 1275 G; 1992 T; 0 other; agtagtgtac ccttgattac ttattacatt gcgttcactt tcaggaaaga caatatggct 60 atattacttg gtgatgtcat aagacaatac acagctagaa ttcgagcatg cactaatcca 120 gaagtgggtc gagacatact agctgaaata actatgacaa gacataacta ttttgcacaa 180 caattctgtg aggcaataaa catcgagtac agaaatgatg taccggcagc agatattata 240 ctagagatgc tccctgcgct tgacttgaca actatcaaag ttcccaatat aacaccagat 300 aactattaca gggatggcac taagatttac ataatagatt ttaaagtttc tgtaagtgat 360 gagtcagcct tacatactta caaaaagtat gacacgttgt tgggggatgt gttcaatcaa 420 ctgaatgttg attatgaagt tgttattatt cggatgaatc caagtgatat gcaccttcat 480 atatcaagtg ataatttcgc aaatcttttc cccaatatcg tgctcaattt agatttcact 540 tggtatttcc ggctaaggga cgacttgttc cagcaattta gagacaatga agaattcatg 600 gaactagttg cgcatggaga atttaccccc acaatacctt gggttattga cgacacccca 660 gaactataca cacatccagt attccttgaa tttgttgggt ctatgccaga tgacaccact 720 gaagatttct tttatgcatt aaaccacaat gcattccagg cagacaagtg gaatgacctc 780 ttacatataa tgatgaggaa atatggcact tattacgaca aatttattcg agatcaggcg 840 aaaaatgttt ttttgcttga caataattat aacaagccct caaaagaaga aatactcaaa 900 ggctggtcag agatggttga aagaatcaga gatcagagga atgttattga tgactgttcc 960 aaacaaaagc caagtatcca cttcatctgg tcacctaatg acaaaaattc ctctaatgaa 1020 aacaatacca aattaattaa attagcaaag aagctccggt caataaaaga tacagataca 1080 tttagtttgg cctttaagaa tatcggacat ttaatggact ttagtgagga tgtcgaaaaa 1140 tatgagacgt tctgtttaaa attaaaagca gaagctaggt caagtttaaa gcctaagagt 1200 tcaaaagtaa ctccaataac aattgggaaa tgtactgtat tgtgggaaca acaatttaaa 1260 cttgatactg aaattatccc taaagaagtc aggataagat ttttgaaaga attttgtgga 1320 ataggcaatc ataaacaatt taaagatagg atgatggatg atttagattt agggaagcct 1380 aaaatattga atttcgaaaa tccagaaata aagaatcagg cccatataat gatgaaaaat 1440 acacagtgct tcatgagtaa agaaagtggc ttgaagaaaa taggcaatgt cttggaagaa 1500 tttgagtaca aaataaaaga tgccaatccc aaaacatggg aaactattgg agaagtagcc 1560 aactccagat attggcaagc tataaatgat ttctccgttt tgatcaaaaa tatattatca 1620 gtttcgcaat acaataaaca caatacattc agggttgtat gtacagcaaa taataatttc 1680 tttggaatac tatatccatc tgccagtatc aaatcaagac gttcaacagt ggtgttttca 1740 agtgtatgtt tgcacgaaaa tgaaaatgaa atactgaagt gtggtgcatt gtatcaaaca 1800 tataaagtga aaggggggta cctttcgata tcaaaagcca tacgattaga taaagaacgt 1860 tgtcaaagac tggtgacatc tcctggaatt ttcttactca caaccttact tttcaaaagc 1920 gataatgatg tcaatttaaa tgatgttatg aattttgcat tcttcacatc gttatcaata 1980 actaagagta tgttatcact aactgaaccc tcaagatata tgataatgaa ctcgttagca 2040 ctgtccagtc atgtcagaga gtacatagca gaaaaattct caccatatac aaagacttta 2100 ttttcagttt atatgactga tatgattaag aaaggatgta tgtctgctaa tgaacaaagg 2160 cagatgatat caattaggga tgttttcctc aacgagtttg aaataaccca gaaaggagta 2220 tccaatgaaa aaaatcaaca gtctatctgg tttccaggga aaataagtct aaaagaatat 2280 atcaatcaaa tatatatgcc attttatttt aatgcaaaag ggttgcataa taaacatcat 2340 gttatgattg atttggctaa aacagttctc gagatagaac tagatcagag gatgaatatc 2400 ccagagcctt ggagcctcga tatgaagaag caatctgcaa atctgtatct actcatattc 2460 tccatatcaa agatgctgaa tatggataca tcaaggcata atcatttaag gagtagagta 2520 gagaacagga acaacttcaa gagatctttg acaagcatat caacattcac cagctccaaa 2580 tcatgcatca aagtaggcaa ttttaaagat ctaaaagaga aaacagccaa acacattaag 2640 aagataaatg aaaaagatgc aaggaaaacc cgtattgcaa atactgaatt tgttgacgag 2700 tcagagagag attttgagat tacaaaaagt acatatatgg atttaataag atgtgtccca 2760 gaatatactg attatatttc caccaaagta tttgatcgtt tatatgaaaa atacaaatta 2820 gaggaaattg aagataaacc agcaatagaa attataatgg acacaatgaa gaatcataag 2880 gattttaaat tctgtttttt taacaaaggt caaaaaacag caaaagaccg tgaaattttt 2940 gttggggaat ttgaagcaaa attatgcctg tatggtgttg agagaattgc taaagagaga 3000 tgcaaactaa acccagagga aatgatttca gagcctggtg atgggaaatt aaagaaatta 3060 gaaataaatg ctgaatctga gattaggtat ttaatagatg caacaagaaa tcaaaacgct 3120 gaacaatcta ttatagatga tatcttggac acacctaagg gcattaagct tgaaatcaat 3180 gcagacatgt caaaatggag tgctcaagat gttttcttca aatacttttg gttgatagta 3240 ttagatccaa tattgtatcc agctgaaaaa aagaggataa tttacttctt ttgtaattat 3300 atgaacaagg aattaatttt gccagatgaa atgatgtgct cattactaga tcagaaagct 3360 gagagagaaa acgatttaat taaggagatg acaaatggct ttagaagaaa tactgttaat 3420 attagaagaa attggcttca aggtaacttg aactacacat ctagttatat acatagctgc 3480 tctatgatgg tttttaaaga tataatgaaa gaggttgcct ctctattaga aggaaggtgt 3540 aatgtttcta gcatggtaca ttcagatgac aaccaaacct ctgtaattat ggttcaagat 3600 aagatagata atgatatcat aacaaacttt gtttgcacag cattcgaaca gtgctgtcta 3660 tcatttggta atcaagcaaa tatgaagaaa acgtatatta caaaccacat taaagaattt 3720 gttagtctat tcaacatata tggcgagccg ttttcagtat ttggtcgttt tttgttacct 3780 gcagtaggag actgtgcata cattggtcca tatgaagata tggctagcag gctgtcggct 3840 actcaaaccg ccattaaaca tggatgtccg cctagcctag catgggttag tatcgcattg 3900 aaccattgga taacattcaa cacatacaat atgctgcctg gtcaaatcaa tgaccctact 3960 aaggttttct tatttgatag gcgagaatta ccaatagaat tgtgtggaat tcttcaagct 4020 gatttagcaa ctatagcact tgtagggctg gaagctggga atatttcatt tttaacaaac 4080 ctcttaagga agatgtcacc accacaactt gtgaaagagt cggtgcagag tcaatgtagt 4140 aatatagaaa attgggacat ggattgcctt tctgatagtg aaatcttaaa actcaagtta 4200 ttaagatatg ttgttttaga ctcggaaatt tcagaagata gtaaaatggg tgaaactagt 4260 gaaatgagga gccgatcatt gataactcct aggaaattta ccactacatc atctttagag 4320 aagctaattt catacaaaga ctttcaggag attatagtca attctgaaaa aactgaagaa 4380 ttattggaga ggattctagg aaaaccagaa ctattggtta ctaaaggtga aaattcaaca 4440 gagtttatga caactatatt gtttagatat aattcaaaaa aattcaaaga atccttgtct 4500 atacaaagtc caacacagct atttatagaa cagatactat ttgcaaacaa gccagtcatt 4560 gactacacag gaatccaaga taggtactta agtgtcctgg atatgcctaa agtgcagtca 4620 ggtgaaggaa ttattggtcg gaaaactatc cctgagacat tttctgctat aaagaaagat 4680 ttaagccaac tacctcttga gccagctgat gtaaagttaa tatactcctt ttgcatccta 4740 aatgatccct taaataccac tgcatgcaat gcattgttat tatcacaaat acaatctcta 4800 ctagaaagga caagcatgtc agctgtaaca atgccagaat ttagaaatat gaaactgata 4860 agatattctc ctgctctggt tttaagagca tacattcatg gtgatctttc tgtaggggga 4920 gcaaatgagg atgcaatgag gagagacata tttcatttaa atgagtttat aattcagaca 4980 agaattagag agcgtctgga tcaacgaatt atagaaaatc aagaaataaa aggggaaaga 5040 gatagattat ttgaaattaa agaattgaca aaattctatc aggcttgcta tgactatatt 5100 aaatctacag aacacaaaat caaagtattt atcttgccat caaaggcata cacagcattt 5160 gacttctgtg ccactataca tggtaactta atgagagatg atggttggtt ttctgtacat 5220 tatttgaaac aaatagtctc tggaacagct aaggcaaata ttagtatagc ccctgcaagc 5280 gagatggtta tagttgaaga atgcttcaaa cttttatcac atttctgtga tacatttatt 5340 gataccagtt caagattgac atttgctcta aatgtgatcg agaatttctc atataagaat 5400 attccagtca aggagctctt aaatttgatg aaacattcat ttaggagaca acaatttata 5460 cctttactat actggatagg ggaattaagt caagaagatc tagataagta tgatgcattt 5520 aagactagtg aaagggtttc ctggaatgat tggcaaataa acagaacatt aaacactggt 5580 actgtagact taactattaa agggtaccaa agaactttgc gtattgtagg tgaagatgat 5640 ttcctacaaa tagccgaatt ggaaatctta aagggagaca atacttcgat agaaactcat 5700 ggaagaaaac tgctgaactg taaacacaat cttagatttg aaaaaatgag aaaatatcag 5760 ataatggaac ctaacactta ctatatatgt tggcaaatga ggacaagatt tgcatacaca 5820 tatcagatgt tgctctcaaa cattatagaa gcaagaaatt ctcaaacagt ttcagttaca 5880 ggtggaaaat tcaatgaact aattccagta tgtccagtca tagtagggag gattgattct 5940 atggaaagaa tcaatttaag gcaagtaaaa tatctaaata tgaattgctc attatctaga 6000 ctacaattga ctcaaaaaga atttgtgact gtaaaaagat ctcatttttc caaaatgata 6060 ttcttccaag ggcctaactt gatagtagga aatatgaact tgacaaatct aattagaacg 6120 ccaacattat tgactacaaa ttacccatct ttatcgcaag tccccatgat gacattaact 6180 aggatattcc actgcatagg agatgaagat caaactgatg aattcgaatt tttatctgat 6240 gaattattgg aagatattga aacaacaaca gtcaacactg ttcctatatt caatgcccaa 6300 tatgaggtta agtcaaaaaa aggttacaca tataaacaag cattacaaga tgcactgaga 6360 agaggaatag aagaaattga aaacacattg gatttctgtg gggacggatt ttattctcca 6420 aaaaacttag caattatagc attactgact aacttaattg acagactgca gacaaatgaa 6480 tggtcaacta tactacaaac agcaatacat atgtcttttt ttcacaatgg gaaagataga 6540 atgtaccatt tgatgaaaat accaaaggca tttgttaaaa atcctattgg ggagatccta 6600 aattgggaaa aaattagaac ttttgtgata cagttaaata caagaaaccc tgggaatcat 6660 tgggaccaga tgttcaatca tttcagagag aaaacattaa tattgataga ccgtgagatt 6720 aaaatggaag ggatgtcttg gggagaaatg ctagatgaat tagatgatta caaggacaca 6780 gaaatgttcc attttgagtg aaaagagagg aaaagtaatt gatctttaaa ataaaagaca 6840 ttgtctttta ttttaaagat caaactaata actgctaaac aaatagtaag aactcacaca 6900 aacgtaataa tataaacaag ggaacactac t 6931 // ID MG029279; SV 1; linear; viral cRNA; STD; VRL; 4602 BP. XX AC MG029279; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Madrid virus isolate BT4075 segment M, complete sequence. XX KW . XX OS Madrid virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-4602 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-4602 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; 22b6a27e3fed6c451911d2ba9a35a3e7. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..4602 FT /organism="Madrid virus" FT /segment="M" FT /host="Homo sapiens" FT /isolate="BT4075" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="03-Mar-1961" FT /db_xref="taxon:348013" FT CDS 55..4362 FT /codon_start=1 FT /product="glycoprotein" FT /db_xref="GOA:A0A2Z3DH52" FT /db_xref="InterPro:IPR005167" FT /db_xref="InterPro:IPR005168" FT /db_xref="InterPro:IPR017896" FT /db_xref="InterPro:IPR026400" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DH52" FT /protein_id="AVX48959.1" FT /translation="MMFKMLAKLLLFLWTTLQAWSIPLSNRCFEGGVLVEERSMDHGIA FT EICVKDDVSMIKTSSLQKRNETRFTNIIMRKMLIPNYQDCNPIEMPNGPIMIFKPNSDL FT MLIPKTYACRVDCTISLDRDEATIILHSDKLNNFEVMGTTTATRWFQGSTTYSLEHTCE FT HIQVTCGSKSLSFHACFKYHMACIRLLNKSYMPAFMIQSVCQNKEIIIMTCLILIIFAL FT LYILTMSYICYILLPLFIPIAYLYGWVYNKSCKKCSYCGLAYHPFSKCGRNCVCGAMFE FT NSDRMRMHRQTGLCKGYKSLRAARILCKSRGSAFCLAVILATLLLSFIQPLEAVKLTYN FT NEVIELPELSRELEVIFSGIDYVKSIVVIQLTICGFLILLLLSYLILSKRIEDKLIYNV FT LYYCPECEMTHPKRGLKKYFSGEFTNMCNSCMCGCTYNQEDLNDGYTIPMMHKLTVRCY FT APGRYYTNKKMNSVSMYITLACLMVLASMTIVAASEDNCLKTSLYKTQEPVSCSAWIKA FT TTCTASSGIEGMMARIKLPKQDTDLISGIRGSLDSILRKSQESSVPIQSYIFEALAVTM FT YCSELVGASLENGNINSNIKMQYTERKLEVCTAKKAVSACACLTGITGCNPSNADDVKD FT YYKTHHEIFKSDVSRMIQTIAKMFPGVFAKELLLASKTSNYTRIQAILKMLEPKLTNAR FT ATKALLKILMTAISESSIANVPAPVPSSKSINPFDPKWSQDSIFKNIVAATAIKTCPSS FT KAYKCFFPVSLRFTFYFSCGEENKFYHTGDDLIAIHYNTGTTLCVADPYCEKDFTPVDV FT ANKDSLLTMKCDQITLNVSEMPNSMPINKCRVISMQQCTVTGTANRSVAECSNGYFYEY FT TGELHQSPKDDVGVYCFEKACKPNRFPHHPSNLQGCVSHNNEMLTRKLKEINYSNLEQL FT KHSLQETIKTDLVEHHYILTKNLPKINPIFKALSIQGVETDSGIQSSYIETNLMVKTGL FT SVGLHLTTKNGDSLFDIIVFVKTAHYEAVYDEIYQTGPTIGINVQHNEKCTGKCPESLM FT KTGWLSFSKEHTSQWGCEEFGCLAINEGCLYGHCKDIIKPEMTVLRKAQDETPVIKICI FT SLPHETFCQPINAFNAIITDKVETQFISNEAGKLPKLLAYKSNKIHTGMINDLGTFSKM FT CGSVQSINNNVLGAGTARFDYICHAAQRKDVTVSRCFDNFYDSCLRLEASDNIVYDNNQ FT KRVSLLNKNMGELRLKIKMGDLNYKLFEKMPSFDFKGSCVGCLKCIKGVDCEFDIHSTS FT EAVCVLTSNCAFYHNNLKIDPNIQKYGMKAKCSEEKIWIDLCGNKIEIQLSIVQTHETI FT EVGNSDQTYFVKEKDDRCGTWLCKVSEQGISSIFAPFFAIFGDYAKIAFYTVLAILAIA FT LMVYLMLPMIGKLKDILKKNEIEYIKEFRGKKI" XX SQ Sequence 4602 BP; 1639 A; 792 C; 854 G; 1317 T; 0 other; agtagtgaac cgctgtgtat ttatactata gtagagttca ctatttattt ggatatgatg 60 ttcaagatgc ttgcaaagct cctcttgttt ttatggacca cacttcaggc atggtctata 120 ccgctgtcaa atagatgctt tgaaggagga gtcctggtag aagaaagatc aatggaccat 180 ggaattgctg aaatctgtgt caaagatgat gttagtatga tcaaaacatc atcactgcaa 240 aagagaaatg aaactagatt cacaaacata attatgagga aaatgttgat cccaaattac 300 caagactgca atcccattga aatgccaaat ggtcccatta tgatatttaa accaaatagc 360 gacctgatgt tgatcccaaa gacatatgca tgtagagtag actgcactat atcattagat 420 agagatgagg caacaataat tctccactca gacaaactta ataatttcga agttatgggt 480 actacaacag caacaagatg gttccaagga agtacaacat attcgctaga gcacacatgt 540 gagcatatac aagttacctg cggctcaaaa agtctcagtt tccatgcttg ttttaaatat 600 catatggcat gcatccgtct gctaaataag agctacatgc cagcattcat gattcagtct 660 gtctgccaaa ataaggaaat tattataatg acctgtttaa ttttgataat ctttgcacta 720 ttatatattc taactatgtc atacatctgc tacatattat tgccactgtt tataccgatt 780 gcatatctct atggttgggt atacaataaa tcttgcaaaa aatgcagtta ctgtgggctg 840 gcataccacc cgttcagcaa atgtggacgc aactgtgtgt gtggagcaat gtttgagaac 900 tcagacagaa tgagaatgca tagacaaaca ggattatgta aaggatacaa atcactcaga 960 gctgcaagga tactttgcaa aagcagagga tctgctttct gcttggcagt aattttagct 1020 accctgttgc tttcatttat ccaaccactt gaagcagtaa aattaactta caataatgaa 1080 gtaatagaat tacctgaatt atcacgggaa ttagaagtta tattcagtgg tatagattat 1140 gttaaaagta ttgtggtcat tcaacttact atttgtggat ttttaatttt actgcttttg 1200 agttatttaa tcctttctaa aagaattgaa gacaagttaa tatataatgt cttatattac 1260 tgtccagaat gcgagatgac acacccaaaa agaggattga aaaaatattt ctctggtgaa 1320 ttcacaaata tgtgtaacag ctgtatgtgc gggtgtactt ataatcagga ggatttaaat 1380 gatggatata caataccgat gatgcacaaa ctaactgtca gatgctatgc accagggaga 1440 tattacacta acaaaaaaat gaacagtgtc agtatgtata taacattagc atgcctgatg 1500 gtgctagctt ctatgaccat agtggctgca tctgaagata attgcctaaa aacatcactt 1560 tataagacac aagaaccagt tagctgttca gcatggatca aggccactac atgtactgca 1620 tcatccggca ttgaaggaat gatggcccgc atcaagctcc caaagcaaga cacagattta 1680 atttccggga taagaggtag cttggactct atcctccgga agtctcaaga gagttcagta 1740 ccaatccaat catatatctt tgaagccttg gctgtaacga tgtactgttc tgaacttgtg 1800 ggtgcatcat tagaaaatgg aaatattaat tcaaacataa aaatgcaata tacagaaagg 1860 aaattagaag tgtgcactgc aaaaaaagct gtctcagcat gcgcatgctt aactggcata 1920 actggctgta acccatcaaa tgcagatgat gtcaaagatt actacaaaac ccatcatgag 1980 atttttaaat ctgatgtttc tagaatgatc caaacaatag ctaaaatgtt tccaggagtg 2040 ttcgctaaag agctattact tgcatcaaaa acatccaact acactagaat acaagcaatt 2100 ttgaagatgc tggagccaaa attgacaaat gccagggcaa cgaaagcact attgaaaatc 2160 ttaatgactg caatctcaga atcttctatt gcaaatgtgc ctgctcctgt cccatcctcc 2220 aaaagtatca atccatttga tccaaagtgg tctcaagata gtattttcaa aaacattgtt 2280 gcagccactg caataaagac atgtccttcg agtaaagcat ataaatgctt cttccctgtc 2340 agccttagat ttacgttcta tttttcatgt ggagaggaaa acaaatttta tcacactggt 2400 gacgatttga tagccataca ttataatact ggaaccactc tctgtgtagc agatccttac 2460 tgtgaaaagg acttcacacc agtagatgtt gcaaataagg attcgctgtt gacaatgaaa 2520 tgtgatcaga ttacactgaa tgtgtctgaa atgcctaatt ctatgccaat caataaatgc 2580 cgagttattt caatgcagca atgtacagtc acaggaactg caaacagaag tgttgcagaa 2640 tgttcgaatg gatactttta tgaatatact ggggaattac atcaaagccc aaaagatgat 2700 gttggtgtat attgctttga gaaggcatgc aaacctaaca gatttccgca tcatcctagc 2760 aaccttcaag gttgtgtttc ccataataat gaaatgttga ccaggaaact taaagaaata 2820 aactattcaa acttggaaca attgaaacac agtctgcaag agacaattaa aacagatttg 2880 gttgagcatc attacatcct gacaaaaaac ttaccaaaaa tcaaccctat ttttaaagca 2940 ctctctattc aaggagtgga aactgatagt ggtattcaga gctcatatat agaaaccaat 3000 ttaatggtaa aaactggatt gtctgtgggt ttgcatctaa caacaaagaa tggggactcc 3060 ctttttgata ttatagtgtt tgtcaaaact gctcattatg aagctgtcta tgacgaaata 3120 tatcaaactg ggcctacaat tgggataaat gtccagcaca atgaaaaatg taccgggaag 3180 tgtccagaat cgctaatgaa aactggctgg ctgtcattct caaaagaaca cacaagtcaa 3240 tgggggtgcg aggaatttgg atgcttagca atcaatgaag ggtgcctcta cgggcattgt 3300 aaagacatca ttaagccaga aatgacagta ttgagaaagg ctcaagatga aactcctgtt 3360 attaaaatat gtatctcatt acctcatgaa acattttgtc aacccataaa tgcatttaat 3420 gcgatcatca cagataaagt agaaactcaa tttatatcaa atgaagcagg gaaactacct 3480 aagttgttag catataaatc aaataagatt cacacaggca tgataaatga tcttgggacc 3540 ttttcgaaga tgtgtgggag tgttcaatca atcaacaaca atgttctagg tgctggaact 3600 gcacgttttg actatatctg ccatgctgcg cagagaaaag atgtgactgt tagcagatgc 3660 tttgataatt tttatgactc atgcctgaga ctagaagctt cagataacat tgtttatgat 3720 aacaatcaaa aacgtgtctc actattaaat aaaaacatgg gtgagctaag gctaaaaata 3780 aaaatgggcg atctaaatta taaactattt gaaaaaatgc cttcttttga cttcaagggg 3840 agctgtgttg gctgcctaaa atgtattaaa ggagtagatt gtgagtttga tatacattcc 3900 acttctgaag cagtttgtgt tttgacatca aactgtgctt tctaccataa caacttaaag 3960 attgacccga atatccagaa atatggcatg aaagcgaaat gctcagaaga gaaaatatgg 4020 atagatctct gcgggaataa aatagaaatc caactgtcta tagtgcagac acatgaaact 4080 atcgaagtcg gcaacagtga ccaaacatat tttgtaaagg agaaagatga cagatgtggg 4140 acatggttat gtaaagtaag tgaacaaggg atttcctcaa tatttgctcc attctttgcc 4200 atatttggag attatgccaa gattgctttc tatactgttt tagcaatact tgctatagca 4260 ctgatggtat accttatgtt acctatgatt ggcaaattaa aagatatatt aaagaaaaat 4320 gaaatagaat atattaaaga attcaggggg aagaaaatct agaaagataa agtaatataa 4380 aagagtattt taatcaaaaa tataagatat aattagatga tgaactaaaa aataaaataa 4440 ataaaataaa aataaagcag aatacatata aaaactccca aaccaataac agctcaacaa 4500 ctataggata cttgtaatca tattgtaaca cttgtagggt tgttcaatca tacataatat 4560 ctgcatgtaa atttaaaata caagacacag cggttcacta ct 4602 // ID MG029280; SV 1; linear; viral cRNA; STD; VRL; 1107 BP. XX AC MG029280; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Madrid virus isolate BT4075 segment S, complete sequence. XX KW . XX OS Madrid virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-1107 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-1107 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; acf2feaa448a75ea9494a774e7e662ab. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1107 FT /organism="Madrid virus" FT /segment="S" FT /host="Homo sapiens" FT /isolate="BT4075" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="03-Mar-1961" FT /db_xref="taxon:348013" FT CDS 29..346 FT /codon_start=1 FT /product="NSs protein" FT /db_xref="GOA:A0A2Z3DCN7" FT /db_xref="InterPro:IPR000797" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DCN7" FT /protein_id="AVX48961.1" FT /translation="MSLTVLLEIDTESWQLLCLSFRLRSGARIRLLLTLNKHTKVLSMN FT TGRSSLLRILECSSSVRMRLNRSSVRVRQSSLTLNLALGRSLLLTTITLPTQQIRSLMV FT N" FT CDS 67..774 FT /codon_start=1 FT /product="N protein" FT /db_xref="GOA:A0A2Z3DG15" FT /db_xref="InterPro:IPR001784" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DG15" FT /protein_id="AVX48960.1" FT /translation="MATPLFEFSVEERGQNSSTFDPKQAYKSFVDEHREELTLENIRVF FT FLRANEAKQKLRKSSAKLANLKFGTWKVTVVNNHYPANTANTVADGELTLHRISGFLAK FT FILDLYADTEHRPEIEEKIINPIAESKGVTWAQSAKIYLAFFPGTEMFLHEFEMLPLAI FT YIYRAQKGEIDVSLLKKPLRQQYKNDTPDKWMKEKKVMIQGAVSRISKLPWGTSGLSSQ FT AKDFLKEFGITMK" XX SQ Sequence 1107 BP; 361 A; 183 C; 241 G; 322 T; 0 other; agtagtgaac ttcttaggaa gttcatttat gtcacttaca gttctattgg agattgatac 60 agagtcatgg caactccttt gtttgagttt tcggttgagg agcggggcca gaattcgtct 120 acttttgacc ctaaacaagc atacaaaagt tttgtcgatg aacacaggga ggagctcact 180 cttgagaata ttagagtgtt cttcctccgt gcgaatgagg ctaaacagaa gctccgtaag 240 agttcggcaa agctcgctaa ccttaaattt ggcacttgga aggtcactgt tgttaacaac 300 cattaccctg ccaacacagc aaatacggtc gctgatggtg aattaacttt gcacagaatc 360 tctggattcc tcgcaaagtt tatcctggac ctctatgcag atacggaaca tagacctgag 420 atcgaggaaa aaattataaa tccaattgca gagtctaaag gggtcacatg ggcacaatct 480 gcaaaaatat atcttgcttt cttccctgga acagagatgt ttctacacga atttgagatg 540 ctgccgttgg ccatctacat ctaccgtgcc caaaaggggg aaatagatgt ctcattgctg 600 aagaaacctc tcagacagca gtacaaaaat gacactccag acaagtggat gaaggagaag 660 aaagttatga ttcagggagc cgtttctagg atttcaaagc tcccatgggg caccagtggc 720 ttgtcatcac aggccaaaga tttcttgaag gagtttggga taactatgaa ataatctata 780 aattgtttgc tttcatatgt attctatagg ttaaatttta ggtagttaag tctttagtag 840 tatacaaatt aaaaatgata agtgatggta ggtgaaaata aatgtatata tttaaaataa 900 ataaattcaa aattggggtt taatctgggt gattgggttt caaattaggg ggggataaaa 960 aaactgcttt taaaatttat ttgacatatt aaatataagg gttgggtggt tggggaaaca 1020 acaaaggctg cattgcatac aaatcaaatt tagaagagat taattgtatt cattaacaag 1080 ttgatacttt ctaagaagaa cactact 1107 // ID MG029281; SV 1; linear; viral cRNA; STD; VRL; 6910 BP. XX AC MG029281; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Marituba virus isolate BeAn15 segment L, complete sequence. XX KW . XX OS Marituba virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-6910 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-6910 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; cb4341d637fdfdd3a5d14ad3b5a951ab. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6910 FT /organism="Marituba virus" FT /segment="L" FT /host="Sapajus apella" FT /isolate="BeAn15" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="27-Dec-1954" FT /db_xref="taxon:292278" FT CDS 33..6779 FT /codon_start=1 FT /product="RNA-dependent RNA polymerase" FT /db_xref="GOA:A0A343W6Z6" FT /db_xref="InterPro:IPR007099" FT /db_xref="InterPro:IPR007322" FT /db_xref="InterPro:IPR029124" FT /db_xref="UniProtKB/TrEMBL:A0A343W6Z6" FT /protein_id="AVX48962.1" FT /translation="MAILLGDVIRQYTARIRACTNPEVGRDILAEITMTRHNYFAQQFC FT EAINIEYRNDVPAADIILEMLPALDLTTIKVPNITPDNYYRDGTKIYIIDFKVSVSDES FT ALHTYKKYDTLLGDVFNQLNVDYEVVIIRMNPSDMHLHISSDNFANLFPNIVLNLDFTW FT YFRLRDDLFQQFRDNEEFMELVAHGEFTPTIPWVIDDTPELYTHPVFLEFVGSMPDDTT FT EDFFYALNHNAFQADKWNDLLHIMMRKYGTYYDKFIRDQAKNVFLLDNNYNKPSKEEIL FT KGWSEMVERIRDQRNVIDDCSKQKPSIHFIWSPNDKNSSNENNTKLIKLAKKLRSIKDT FT DTFSLAFKNIGHLMDFSEDVEKYETFCLKLKAEARSSLKPKSSKVTPITIGKCTVLWEQ FT QFKLDTEIIPKEVRIRFLKEFCGIGNHKQFKDRMMDDLDLGKPKILNFENPEIKNQAHI FT MMKNTQCFMSKESGLKKIGNVLEEFEYKIKDANPKTWETIGEVANSRYWQAINDFSVLI FT KNILSVSQYNKHNTFRVVCTANNNFFGILYPSASIKSRRSTVVFSSVCLHENENEILKC FT GALYQTYKVKGGYLSISKAIRLDKERCQRLVTSPGIFLLTTLLFKSDNDVNLNDVMNFA FT FFTSLSITKSMLSLTEPSRYMIMNSLALSSHVREYIAEKFSPYTKTLFSVYMTDMIKKG FT CMSANEQRQMISIRDVFLNEFEITQKGVSNEKNQQSIWFPGKISLKEYINQIYMPFYFN FT AKGLHNKHHVMIDLAKTVLEIELDQRMNIPEPWSLDMKKQSANLYLLIFSISKMLNMDT FT SRHNHLRSRVENRNNFKRSLTSISTFTSSKSCIKVGNFKDLKEKTAKHIKKINEKDARK FT TRIANTEFVDESERDFEITKSTYMDLIRCVPEYTDYISTKVFDRLYEKYKLEEIEDKPA FT IEIIMDTMKNHKDFKFCFFNKGQKTAKDREIFVGEFEAKLCLYGVERIAKERCKLNPEE FT MISEPGDGKLKKLEINAESEIRYLIDATRNQNAEQSIIDDILDTPKGIKLEINADMSKW FT SAQDVFFKYFWLIVLDPILYPAEKKRIIYFFCNYMNKELILPDEMMCSLLDQKAEREND FT LIKEMTNGFRRNTVNIRRNWLQGNLNYTSSYIHSCSMMVFKDIMKEVASLLEGRCNVSS FT MVHSDDNQTSVIMVQDKIDNDIITNFVCTAFEQCCLSFGNQANMKKTYITNHIKEFVSL FT FNIYGEPFSVFGRFLLPAVGDCAYIGPYEDMASRLSATQTAIKHGCPPSLAWVSIALNH FT WITFNTYNMLPGQINDPTKVFLFDRRELPIELCGILQADLATIALVGLEAGNISFLTNL FT LRKMSPPQLVKESVQSQCSNIENWDMDCLSDSEILKLKLLRYVVLDSEISEDSKMGETS FT EMRSRSLITPRKFTTTSSLEKLISYKDFQEIIVNSEKTEELLERILGKPELLVTKGENS FT TEFMTTILFRYNSKKFKESLSIQSPTQLFIEQILFANKPVIDYTGIQDRYLSVLDMPKV FT QSGEGIIGRKTIPETFSAIKKDLSQLPLEPADVKLIYSFCILNDPLNTTACNALLLSQI FT QSLLERTSMSAVTMPEFRNMKLIRYSPALVLRAYIHGDLSVGGANEDAMRRDIFHLNEF FT IIQTRIRERLDQRIIENQEIKGERDRLFEIKELTKFYQACYDYIKSTEHKIKVFILPSK FT AYTAFDFCATIHGNLMRDDGWFSVHYLKQIVSGTAKANISIAPASEMVIVEECFKLLSH FT FCDTFIDTSSRLTFALNVIENFSYKNIPVKELLNLMKHSFRRQQFIPLLYWIGELSQED FT LDKYDAFKTSERVSWNDWQINRTLNTGTVDLTIKGYQRTLRIVGEDDFLQIAELEILKG FT DNTSIETHGRKLLNCKHNLRFEKMRKYQIMEPNTYYICWQMRTRFAYTYQMLLSNIIEA FT RNSQTVSVTGGKFNELIPVCPVIVGRIDSMERINLRQVKYLNMNCSLSRLQLTQKEFVT FT VKRSHFSKMIFFQGPNLIVGNMNLTNLIRTPTLLTTNYPSLSQVPMMTLTRIFHCIGDE FT DQTDEFEFLSDELLEDIETTTVNTVPIFNAQYEVKSKKGYTYKQALQDALRRGIEEIEN FT TLDFCGDGFYSPKNLAIIALLTNLIDRLQTNEWSTILQTAIHMSFFHNGKDRMYHLMKI FT PKAFVKNPIGEILNWEKIRTFVIQLNTRNPGNHWDQMFNHFREKTLILIDREIKMEGMS FT WGEMLDELDDYKDTEMFHFE" XX SQ Sequence 6910 BP; 2593 A; 1061 C; 1272 G; 1984 T; 0 other; attacattgc gttcactttc aggaaagaca atatggctat attacttggt gatgtcataa 60 gacaatacac agctagaatt cgagcatgca ctaatccaga agtgggtcga gacatactag 120 ctgaaataac tatgacaaga cataactatt ttgcacaaca attctgtgag gcaataaaca 180 tcgagtacag aaatgatgta ccggcagcag atattatact agagatgctc cctgcgcttg 240 acttgacaac tatcaaagtt cccaatataa caccagataa ctattacagg gatggcacta 300 agatttacat aatagatttt aaagtttctg taagtgatga gtcagcctta catacttaca 360 aaaagtatga cacgttgttg ggggatgtgt tcaatcaact gaatgttgat tatgaagttg 420 ttattattcg gatgaatcca agtgatatgc accttcatat atcaagtgat aatttcgcaa 480 atcttttccc caatatcgtg ctcaatttag atttcacttg gtatttccgg ctaagggacg 540 acttgttcca gcaatttaga gacaatgaag aattcatgga actagttgcg catggagaat 600 ttacccccac aataccttgg gttattgacg acaccccaga actatacaca catccagtat 660 tccttgaatt tgttgggtct atgccagatg acaccactga agatttcttt tatgcattaa 720 accacaatgc attccaggca gacaagtgga atgacctctt acatataatg atgaggaaat 780 atggcactta ttacgacaaa tttattcgag atcaggcgaa aaatgttttt ttgcttgaca 840 ataattataa caagccctca aaagaagaaa tactcaaagg ctggtcagag atggttgaaa 900 gaatcagaga tcagaggaat gttattgatg actgttccaa acaaaagcca agtatccact 960 tcatctggtc acctaatgac aaaaattcct ctaatgaaaa caataccaaa ttaattaaat 1020 tagcaaagaa gctccggtca ataaaagata cagatacatt tagtttggcc tttaagaata 1080 tcggacattt aatggacttt agtgaggatg tcgaaaaata tgagacgttc tgtttaaaat 1140 taaaagcaga agctaggtca agtttaaagc ctaagagttc aaaagtaact ccaataacaa 1200 ttgggaaatg tactgtattg tgggaacaac aatttaaact tgatactgaa attatcccta 1260 aagaagtcag gataagattt ttgaaagaat tttgtggaat aggcaatcat aaacaattta 1320 aagataggat gatggatgat ttagatttag ggaagcctaa aatattgaat ttcgaaaatc 1380 cagaaataaa gaatcaggcc catataatga tgaaaaatac acagtgcttc atgagtaaag 1440 aaagtggctt gaagaaaata ggcaatgtct tggaagaatt tgagtacaaa ataaaagatg 1500 ccaatcccaa aacatgggaa actattggag aagtagccaa ctccagatat tggcaagcta 1560 taaatgattt ctccgttttg atcaaaaata tattatcagt ttcgcaatac aataaacaca 1620 atacattcag ggttgtatgt acagcaaata ataatttctt tggaatacta tatccatctg 1680 ccagtatcaa atcaagacgt tcaacagtgg tgttttcaag tgtatgtttg cacgaaaatg 1740 aaaatgaaat actgaagtgt ggtgcattgt atcaaacata taaagtgaaa ggggggtacc 1800 tttcgatatc aaaagccata cgattagata aagaacgttg tcaaagactg gtgacatctc 1860 ctggaatttt cttactcaca accttacttt tcaaaagcga taatgatgtc aatttaaatg 1920 atgttatgaa ttttgcattc ttcacatcgt tatcaataac taagagtatg ttatcactaa 1980 ctgaaccctc aagatatatg ataatgaact cgttagcact gtccagtcat gtcagagagt 2040 acatagcaga aaaattctca ccatatacaa agactttatt ttcagtttat atgactgata 2100 tgattaagaa aggatgtatg tctgctaatg aacaaaggca gatgatatca attagggatg 2160 ttttcctcaa cgagtttgaa ataacccaga aaggagtatc caatgaaaaa aatcaacagt 2220 ctatctggtt tccagggaaa ataagtctaa aagaatatat caatcaaata tatatgccat 2280 tttattttaa tgcaaaaggg ttgcataata aacatcatgt tatgattgat ttggctaaaa 2340 cagttctcga gatagaacta gatcagagga tgaatatccc agagccttgg agcctcgata 2400 tgaagaagca atctgcaaat ctgtatctac tcatattctc catatcaaag atgctgaata 2460 tggatacatc aaggcataat catttaagga gtagagtaga gaacaggaac aacttcaaga 2520 gatctttgac aagcatatca acattcacca gctccaaatc atgcatcaaa gtaggcaatt 2580 ttaaagatct aaaagagaaa acagccaaac acattaagaa gataaatgaa aaagatgcaa 2640 ggaaaacccg tattgcaaat actgaatttg ttgacgagtc agagagagat tttgagatta 2700 caaaaagtac atatatggat ttaataagat gtgtcccaga atatactgat tatatttcca 2760 ccaaagtatt tgatcgttta tatgaaaaat acaaattaga ggaaattgaa gataaaccag 2820 caatagaaat tataatggac acaatgaaga atcataagga ttttaaattc tgttttttta 2880 acaaaggtca aaaaacagca aaagaccgtg aaatttttgt tggggaattt gaagcaaaat 2940 tatgcctgta tggtgttgag agaattgcta aagagagatg caaactaaac ccagaggaaa 3000 tgatttcaga gcctggtgat gggaaattaa agaaattaga aataaatgct gaatctgaga 3060 ttaggtattt aatagatgca acaagaaatc aaaacgctga acaatctatt atagatgata 3120 tcttggacac acctaagggc attaagcttg aaatcaatgc agacatgtca aaatggagtg 3180 ctcaagatgt tttcttcaaa tacttttggt tgatagtatt agatccaata ttgtatccag 3240 ctgaaaaaaa gaggataatt tacttctttt gtaattatat gaacaaggaa ttaattttgc 3300 cagatgaaat gatgtgctca ttactagatc agaaagctga gagagaaaac gatttaatta 3360 aggagatgac aaatggcttt agaagaaata ctgttaatat tagaagaaat tggcttcaag 3420 gtaacttgaa ctacacatct agttatatac atagctgctc tatgatggtt tttaaagata 3480 taatgaaaga ggttgcctct ctattagaag gaaggtgtaa tgtttctagc atggtacatt 3540 cagatgacaa ccaaacctct gtaattatgg ttcaagataa gatagataat gatatcataa 3600 caaactttgt ttgcacagca ttcgaacagt gctgtctatc atttggtaat caagcaaata 3660 tgaagaaaac gtatattaca aaccacatta aagaatttgt tagtctattc aacatatatg 3720 gcgagccgtt ttcagtattt ggtcgttttt tgttacctgc agtaggagac tgtgcataca 3780 ttggtccata tgaagatatg gctagcaggc tgtcggctac tcaaaccgcc attaaacatg 3840 gatgtccgcc tagcctagca tgggttagta tcgcattgaa ccattggata acattcaaca 3900 catacaatat gctgcctggt caaatcaatg accctactaa ggttttctta tttgataggc 3960 gagaattacc aatagaattg tgtggaattc ttcaagctga tttagcaact atagcacttg 4020 tagggctgga agctgggaat atttcatttt taacaaacct cttaaggaag atgtcaccac 4080 cacaacttgt gaaagagtcg gtgcagagtc aatgtagtaa tatagaaaat tgggacatgg 4140 attgcctttc tgatagtgaa atcttaaaac tcaagttatt aagatatgtt gttttagact 4200 cggaaatttc agaagatagt aaaatgggtg aaactagtga aatgaggagc cgatcattga 4260 taactcctag gaaatttacc actacatcat ctttagagaa gctaatttca tacaaagact 4320 ttcaggagat tatagtcaat tctgaaaaaa ctgaagaatt attggagagg attctaggaa 4380 aaccagaact attggttact aaaggtgaaa attcaacaga gtttatgaca actatattgt 4440 ttagatataa ttcaaaaaaa ttcaaagaat ccttgtctat acaaagtcca acacagctat 4500 ttatagaaca gatactattt gcaaacaagc cagtcattga ctacacagga atccaagata 4560 ggtacttaag tgtcctggat atgcctaaag tgcagtcagg tgaaggaatt attggtcgga 4620 aaactatccc tgagacattt tctgctataa agaaagattt aagccaacta cctcttgagc 4680 cagctgatgt aaagttaata tactcctttt gcatcctaaa tgatccctta aataccactg 4740 catgcaatgc attgttatta tcacaaatac aatctctact agaaaggaca agcatgtcag 4800 ctgtaacaat gccagaattt agaaatatga aactgataag atattctcct gctctggttt 4860 taagagcata cattcatggt gatctttctg tagggggagc aaatgaggat gcaatgagga 4920 gagacatatt tcatttaaat gagtttataa ttcagacaag aattagagag cgtctggatc 4980 aacgaattat agaaaatcaa gaaataaaag gggaaagaga tagattattt gaaattaaag 5040 aattgacaaa attctatcag gcttgctatg actatattaa atctacagaa cacaaaatca 5100 aagtatttat cttgccatca aaggcataca cagcatttga cttctgtgcc actatacatg 5160 gtaacttaat gagagatgat ggttggtttt ctgtacatta tttgaaacaa atagtctctg 5220 gaacagctaa ggcaaatatt agtatagccc ctgcaagcga gatggttata gttgaagaat 5280 gcttcaaact tttatcacat ttctgtgata catttattga taccagttca agattgacat 5340 ttgctctaaa tgtgatcgag aatttctcat ataagaatat tccagtcaag gagctcttaa 5400 atttgatgaa acattcattt aggagacaac aatttatacc tttactatac tggatagggg 5460 aattaagtca agaagatcta gataagtatg atgcatttaa gactagtgaa agggtttcct 5520 ggaatgattg gcaaataaac agaacattaa acactggtac tgtagactta actattaaag 5580 ggtaccaaag aactttgcgt attgtaggtg aagatgattt cctacaaata gccgaattgg 5640 aaatcttaaa gggagacaat acttcgatag aaactcatgg aagaaaactg ctgaactgta 5700 aacacaatct tagatttgaa aaaatgagaa aatatcagat aatggaacct aacacttact 5760 atatatgttg gcaaatgagg acaagatttg catacacata tcagatgttg ctctcaaaca 5820 ttatagaagc aagaaattct caaacagttt cagttacagg tggaaaattc aatgaactaa 5880 ttccagtatg tccagtcata gtagggagga ttgattctat ggaaagaatc aatttaaggc 5940 aagtaaaata tctaaatatg aattgctcat tatctagact acaattgact caaaaagaat 6000 ttgtgactgt aaaaagatct catttttcca aaatgatatt cttccaaggg cctaacttga 6060 tagtaggaaa tatgaacttg acaaatctaa ttagaacgcc aacattattg actacaaatt 6120 acccatcttt atcgcaagtc cccatgatga cattaactag gatattccac tgcataggag 6180 atgaagatca aactgatgaa ttcgaatttt tatctgatga attattggaa gatattgaaa 6240 caacaacagt caacactgtt cctatattca atgcccaata tgaggttaag tcaaaaaaag 6300 gttacacata taaacaagca ttacaagatg cactgagaag aggaatagaa gaaattgaaa 6360 acacattgga tttctgtggg gacggatttt attctccaaa aaacttagca attatagcat 6420 tactgactaa cttaattgac agactgcaga caaatgaatg gtcaactata ctacaaacag 6480 caatacatat gtcttttttt cacaatggga aagatagaat gtaccatttg atgaaaatac 6540 caaaggcatt tgttaaaaat cctattgggg agatcctaaa ttgggaaaaa attagaactt 6600 ttgtgataca gttaaataca agaaaccctg ggaatcattg ggaccagatg ttcaatcatt 6660 tcagagagaa aacattaata ttgatagacc gtgagattaa aatggaaggg atgtcttggg 6720 gagaaatgct agatgaatta gatgattaca aggacacaga aatgttccat tttgagtgaa 6780 aagagaggaa aagtaattga tctttaaaat aaaagacatt gtcttttatt ttaaagatca 6840 aactaataac tgctaaacaa atagtaagaa ctcacacaaa cgtaataata taatcaaggg 6900 aacactactg 6910 // ID MG029282; SV 1; linear; viral cRNA; STD; VRL; 4578 BP. XX AC MG029282; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Marituba virus isolate BeAn15 segment M, complete sequence. XX KW . XX OS Marituba virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-4578 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-4578 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; 431244e443cbdf4ff559917306e221c0. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..4578 FT /organism="Marituba virus" FT /segment="M" FT /host="Sapajus apella" FT /isolate="BeAn15" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="27-Dec-1954" FT /db_xref="taxon:292278" FT CDS 55..4362 FT /codon_start=1 FT /product="glycoprotein" FT /db_xref="GOA:A0A343W6Z7" FT /db_xref="InterPro:IPR005167" FT /db_xref="InterPro:IPR005168" FT /db_xref="InterPro:IPR017896" FT /db_xref="InterPro:IPR026400" FT /db_xref="UniProtKB/TrEMBL:A0A343W6Z7" FT /protein_id="AVX48963.1" FT /translation="MMFKMLAKLLLFLWTTLQAWSIPLSNRCFEGGVLVEERSMDHGIA FT EICVKDDVSMIKTSSLQKRNETRFTNIIMRKMLIPNYQDCNPIEMPNGPIMIFKPNSDL FT MLIPKTYACRVDCTISLDRDEATIILHSDKLNNFEVMGTTTATRWFQGSTTYSLEHTCE FT HIQVTCGSKSLSFHACFKYHMACIRLLNKSYMPAFMIQSVCQNKEIIIMTCLILIIFAL FT LYILTMSYICYILLPLFIPIAYLYGWVYNKSCKKCSYCGLAYHPFSKCGRNCVCGAMFE FT NSDRMRMHRQTGLCKGYKSLRAARILCKSRGSAFCLAVILATLLLSFIQPLEAVKLTYN FT NEVIELPELSRELEVIFSGIDYVKSIVVIQLTICGFLILLLLSYLILSKRIEDKLIYNV FT LYYCPECEMTHPKRGLKKYFSGEFTNMCNSCMCGCTYNQEDLNDGYTIPMMHKLTVRCY FT APGRYYTNKKMNSVSMYITLACLMVLASMTIVAASEDNCLKTSLYKTQEPVSCSAWIKA FT TTCTASSGIEGMMARIKLPKQDTDLISGIRGSLDSILRKSQESSVPIQSYIFEALAVTM FT YCSELVGASLENGNINSNIKMQYTERKLEVCTAKKAVSACACLTGITGCNPSNADDVKD FT YYKTHHEIFKSDVSRMIQTIAKMFPGVFAKELLLASKTSNYTRIQAILKMLEPKLTNAR FT ATKALLKILMTAISESSIANVPAPVPSSKSINPFDPKWSQDSIFKNIVAATAIKTCPSS FT KAYKCFFPVSLRFTFYFSCGEENKFYHTGDDLIAIHYNTGTTLCVADPYCEKDFTPVDV FT ANKDSLLTMKCDQITLNVSEMPNSMPINKCRVISMQQCTVTGTANRSVAECSNGYFYEY FT TGELHQSPKDDVGVYCFEKACKPNRFPHHPSNLQGCVSHNNEMLTRKLKEINYSNLEQL FT KHSLQETIKTDLVEHHYILTKNLPKINPIFKALSIQGVETDSGIQSSYIETNLMVKTGL FT SVGLHLTTKNGDSLFDIIVFVKTAHYEAVYDEIYQTGPTIGINVQHNEKCTGKCPESLM FT KTGWLSFSKEHTSQWGCEEFGCLAINEGCLYGHCKDIIKPEMTVLRKAQDETPVIKICI FT SLPHETFCQPINAFNAIITDKVETQFISNEAGKLPKLLAYKSNKIHTGMINDLGTFSKM FT CGSVQSINNNVLGAGTARFDYICHAAQRKDVTVSRCFDNFYDSCLRLEASDNIVYDNNQ FT KRVSLLNKNMGELRLKIKMGDLNYKLFEKMPSFDFKGSCVGCLKCIKGVDCEFDIHSTS FT EAVCVLTSNCAFYHNNLKIDPNIQKYGMKAKCSEEKIWIDLCGNKIEIQLSIVQTHETI FT EVGNSDQTYFVKEKDDRCGTWLCKVSEQGISSIFAPFFAIFGDYAKIAFYTVLAILAIA FT LMVYLMLPMIGKLKDILKKNEIEYIKEFRGKKI" XX SQ Sequence 4578 BP; 1631 A; 785 C; 850 G; 1312 T; 0 other; agtagtgaac cgctgtgtat ttatactata gtagagttca ctatttattt ggatatgatg 60 ttcaagatgc ttgcaaagct cctcttgttt ttatggacca cacttcaggc atggtctata 120 ccgctgtcaa atagatgctt tgaaggagga gtcctggtag aagaaagatc aatggaccat 180 ggaattgctg aaatctgtgt caaagatgat gttagtatga tcaaaacatc atcactgcaa 240 aagagaaatg aaactagatt cacaaacata attatgagga aaatgttgat cccaaattac 300 caagactgca atcccattga aatgccaaat ggtcccatta tgatatttaa accaaatagc 360 gacctgatgt tgatcccaaa gacatatgca tgtagagtag actgcactat atcattagat 420 agagatgagg caacaataat tctccactca gacaaactta ataatttcga agttatgggt 480 actacaacag caacaagatg gttccaagga agtacaacat attcgctaga gcacacatgt 540 gagcatatac aagttacctg cggctcaaaa agtctcagtt tccatgcttg ttttaaatat 600 catatggcat gcatccgtct gctaaataag agctacatgc cagcattcat gattcagtct 660 gtctgccaaa ataaggaaat tattataatg acctgtttaa ttttgataat ctttgcacta 720 ttatatattc taactatgtc atacatctgc tacatattat tgccactgtt tataccgatt 780 gcatatctct atggttgggt atacaataaa tcttgcaaaa aatgcagtta ctgtgggctg 840 gcataccacc cgttcagcaa atgtggacgc aactgtgtgt gtggagcaat gtttgagaac 900 tcagacagaa tgagaatgca tagacaaaca ggattatgta aaggatacaa atcactcaga 960 gctgcaagga tactttgcaa aagcagagga tctgctttct gcttggcagt aattttagct 1020 accctgttgc tttcatttat ccaaccactt gaagcagtaa aattaactta caataatgaa 1080 gtaatagaat tacctgaatt atcacgggaa ttagaagtta tattcagtgg tatagattat 1140 gttaaaagta ttgtggtcat tcaacttact atttgtggat ttttaatttt actgcttttg 1200 agttatttaa tcctttctaa aagaattgaa gacaagttaa tatataatgt cttatattac 1260 tgtccagaat gcgagatgac acacccaaaa agaggattga aaaaatattt ctctggtgaa 1320 ttcacaaata tgtgtaacag ctgtatgtgc gggtgtactt ataatcagga ggatttaaat 1380 gatggatata caataccgat gatgcacaaa ctaactgtca gatgctatgc accagggaga 1440 tattacacta acaaaaaaat gaacagtgtc agtatgtata taacattagc atgcctgatg 1500 gtgctagctt ctatgaccat agtggctgca tctgaagata attgcctaaa aacatcactt 1560 tataagacac aagaaccagt tagctgttca gcatggatca aggccactac atgtactgca 1620 tcatccggca ttgaaggaat gatggcccgc atcaagctcc caaagcaaga cacagattta 1680 atttccggga taagaggtag cttggactct atcctccgga agtctcaaga gagttcagta 1740 ccaatccaat catatatctt tgaagccttg gctgtaacga tgtactgttc tgaacttgtg 1800 ggtgcatcat tagaaaatgg aaatattaat tcaaacataa aaatgcaata tacagaaagg 1860 aaattagaag tgtgcactgc aaaaaaagct gtctcagcat gcgcatgctt aactggcata 1920 actggctgta acccatcaaa tgcagatgat gtcaaagatt actacaaaac ccatcatgag 1980 atttttaaat ctgatgtttc tagaatgatc caaacaatag ctaaaatgtt tccaggagtg 2040 ttcgctaaag agctattact tgcatcaaaa acatccaact acactagaat acaagcaatt 2100 ttgaagatgc tggagccaaa attgacaaat gccagggcaa cgaaagcact attgaaaatc 2160 ttaatgactg caatctcaga atcttctatt gcaaatgtgc ctgctcctgt cccatcctcc 2220 aaaagtatca atccatttga tccaaagtgg tctcaagata gtattttcaa aaacattgtt 2280 gcagccactg caataaagac atgtccttcg agtaaagcat ataaatgctt cttccctgtc 2340 agccttagat ttacgttcta tttttcatgt ggagaggaaa acaaatttta tcacactggt 2400 gacgatttga tagccataca ttataatact ggaaccactc tctgtgtagc agatccttac 2460 tgtgaaaagg acttcacacc agtagatgtt gcaaataagg attcgctgtt gacaatgaaa 2520 tgtgatcaga ttacactgaa tgtgtctgaa atgcctaatt ctatgccaat caataaatgc 2580 cgagttattt caatgcagca atgtacagtc acaggaactg caaacagaag tgttgcagaa 2640 tgttcgaatg gatactttta tgaatatact ggggaattac atcaaagccc aaaagatgat 2700 gttggtgtat attgctttga gaaggcatgc aaacctaaca gatttccgca tcatcctagc 2760 aaccttcaag gttgtgtttc ccataataat gaaatgttga ccaggaaact taaagaaata 2820 aactattcaa acttggaaca attgaaacac agtctgcaag agacaattaa aacagatttg 2880 gttgagcatc attacatcct gacaaaaaac ttaccaaaaa tcaaccctat ttttaaagca 2940 ctctctattc aaggagtgga aactgatagt ggtattcaga gctcatatat agaaaccaat 3000 ttaatggtaa aaactggatt gtctgtgggt ttgcatctaa caacaaagaa tggggactcc 3060 ctttttgata ttatagtgtt tgtcaaaact gctcattatg aagctgtcta tgacgaaata 3120 tatcaaactg ggcctacaat tgggataaat gtccagcaca atgaaaaatg taccgggaag 3180 tgtccagaat cgctaatgaa aactggctgg ctgtcattct caaaagaaca cacaagtcaa 3240 tgggggtgcg aggaatttgg atgcttagca atcaatgaag ggtgcctcta cgggcattgt 3300 aaagacatca ttaagccaga aatgacagta ttgagaaagg ctcaagatga aactcctgtt 3360 attaaaatat gtatctcatt acctcatgaa acattttgtc aacccataaa tgcatttaat 3420 gcgatcatca cagataaagt agaaactcaa tttatatcaa atgaagcagg gaaactacct 3480 aagttgttag catataaatc aaataagatt cacacaggca tgataaatga tcttgggacc 3540 ttttcgaaga tgtgtgggag tgttcaatca atcaacaaca atgttctagg tgctggaact 3600 gcacgttttg actatatctg ccatgctgcg cagagaaaag atgtgactgt tagcagatgc 3660 tttgataatt tttatgactc atgcctgaga ctagaagctt cagataacat tgtttatgat 3720 aacaatcaaa aacgtgtctc actattaaat aaaaacatgg gtgagctaag gctaaaaata 3780 aaaatgggcg atctaaatta taaactattt gaaaaaatgc cttcttttga cttcaagggg 3840 agctgtgttg gctgcctaaa atgtattaaa ggagtagatt gtgagtttga tatacattcc 3900 acttctgaag cagtttgtgt tttgacatca aactgtgctt tctaccataa caacttaaag 3960 attgacccga atatccagaa atatggcatg aaagcgaaat gctcagaaga gaaaatatgg 4020 atagatctct gcgggaataa aatagaaatc caactgtcta tagtgcagac acatgaaact 4080 atcgaagtcg gcaacagtga ccaaacatat tttgtaaagg agaaagatga cagatgtggg 4140 acatggttat gtaaagtaag tgaacaaggg atttcctcaa tatttgctcc attctttgcc 4200 atatttggag attatgccaa gattgctttc tatactgttt tagcaatact tgctatagca 4260 ctgatggtat accttatgtt acctatgatt ggcaaattaa aagatatatt aaagaaaaat 4320 gaaatagaat atattaaaga attcaggggg aagaaaatct agaaagataa agtaatataa 4380 aagagtattt taatcaaaaa tataagatat aattagatga tgaactaaaa aataaaataa 4440 ataaaataaa aataaagcag aatacatata aaaactccca aaccaataac agctcaacaa 4500 ctataggata cttgtaatca tattgtaaca cttgtagggt tgttcaatca tacataatat 4560 ctgcatgtaa atttaaaa 4578 // ID MG029283; SV 1; linear; viral cRNA; STD; VRL; 1081 BP. XX AC MG029283; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Marituba virus isolate BeAn15 segment S, complete sequence. XX KW . XX OS Marituba virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-1081 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-1081 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; ff5fcf3462a459496480e4997a4bed41. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1081 FT /organism="Marituba virus" FT /segment="S" FT /host="Sapajus apella" FT /isolate="BeAn15" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="27-Dec-1954" FT /db_xref="taxon:292278" FT CDS 29..346 FT /codon_start=1 FT /product="NSs protein" FT /db_xref="GOA:A0A343W6Z9" FT /db_xref="InterPro:IPR000797" FT /db_xref="UniProtKB/TrEMBL:A0A343W6Z9" FT /protein_id="AVX48965.1" FT /translation="MSLTVLLEIDTESWQLLCLSFRLRSGARIRLLLTLNKHTKVLSMN FT TGRSSLLRILECSSSVRMRLNRSSVRVRQSSLTLNLALGRSLLLTTITLPTQQIRSLMV FT N" FT CDS 67..774 FT /codon_start=1 FT /product="N protein" FT /db_xref="GOA:W8CZH1" FT /db_xref="InterPro:IPR001784" FT /db_xref="UniProtKB/TrEMBL:W8CZH1" FT /protein_id="AVX48964.1" FT /translation="MATPLFEFSVEERGQNSSTFDPKQAYKSFVDEHREELTLENIRVF FT FLRANEAKQKLRKSSAKLANLKFGTWKVTVVNNHYPANTANTVADGELTLHRISGFLAK FT FILDLYADTEHRPEIEEKIINPIAESKGVTWAQSAKIYLAFFPGTEMFLHEFEMLPLAI FT YIYRAQKGEIDVSLLKKPLRQQYKNDTPDKWMKEKKVMIQGAVSRISKLPWGTSGLSSQ FT AKDFLKEFGITMK" XX SQ Sequence 1081 BP; 351 A; 178 C; 238 G; 314 T; 0 other; agtagtgaac ttcttaggaa gttcatttat gtcacttaca gttctattgg agattgatac 60 agagtcatgg caactccttt gtttgagttt tcggttgagg agcggggcca gaattcgtct 120 acttttgacc ctaaacaagc atacaaaagt tttgtcgatg aacacaggga ggagctcact 180 cttgagaata ttagagtgtt cttcctccgt gcgaatgagg ctaaacagaa gctccgtaag 240 agttcggcaa agctcgctaa ccttaaattt ggcacttgga aggtcactgt tgttaacaac 300 cattaccctg ccaacacagc aaatacggtc gctgatggtg aattaacttt gcacagaatc 360 tctggattcc tcgcaaagtt tatcctggac ctctatgcag atacggaaca tagacctgag 420 atcgaggaaa aaattataaa tccaattgca gagtctaaag gggtcacatg ggcacaatct 480 gcaaaaatat atcttgcttt cttccctgga acagagatgt ttctacacga atttgagatg 540 ctgccgttgg ccatctacat ctaccgtgcc caaaaggggg aaatagatgt ctcattgctg 600 aagaaacctc tcagacagca gtacaaaaat gacactccag acaagtggat gaaggagaag 660 aaagttatga ttcagggagc cgtttctagg atttcaaagc tcccatgggg caccagtggc 720 ttgtcatcac aggccaaaga tttcttgaag gagtttggga taactatgaa ataatctata 780 aattgtttgc tttcatatgt attctatagg ttaaatttta ggtagttaag tctttagtag 840 tatacaaatt aaaaatgata agtgatggta ggtgaaaata aatgtatata tttaaaataa 900 ataaattcaa aattggggtt taatctgggt gattgggttt caaattaggg ggggataaaa 960 aaactgcttt taaaatttat ttgacatatt aaatataagg gttgggtggt tggggaaaca 1020 acaaaggctg cattgcatac aaatcaaatt tagaagagat taattgtatt cattaacaag 1080 t 1081 // ID MG029284; SV 1; linear; viral cRNA; STD; VRL; 6973 BP. XX AC MG029284; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Murutucu virus isolate BeAn974 segment L, complete sequence. XX KW . XX OS Murutucu virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-6973 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-6973 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; 738789851219f97ecde09586dd66431c. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6973 FT /organism="Murutucu virus" FT /segment="L" FT /host="Sapajus apella" FT /isolate="BeAn974" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="1955" FT /db_xref="taxon:348008" FT CDS 55..6801 FT /codon_start=1 FT /product="RNA-dependent RNA polymerase" FT /db_xref="GOA:A0A2Z3DE85" FT /db_xref="InterPro:IPR007099" FT /db_xref="InterPro:IPR007322" FT /db_xref="InterPro:IPR029124" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DE85" FT /protein_id="AVX48966.1" FT /translation="MAILLGDVIRQYTARIRACASPEIGRDILAEITMTRHNYFAQQFC FT EAINIEYRNDVPAADIILEMNPALDLRTIKIPNVTPDNYYRDGAKIYIIDFKVSVSDES FT AMHTYKKYDTLLGDVFNQLGIEYEIVIIRMNPSDMHLHISSDNFSNLFPNIVLNLDFTW FT YFRLRDELFQRYRDNEEFMELVAHGEFTPTIPWLVEDTPELYTHPVFLEFIGSMPDGTI FT DDFYYALNHNAFQSDKWNDLLHIMMKKYGEYYTHFVKEQARNIFITDENYNKPSKDEIK FT KGWAEMVERIKNQRDVTDDCSKQKPSIHFIWSPNDTNASNENNTKLIKLAKKLQSIKES FT DTFSLAFKNIGYLMDFSEDVDKYENFCLKLKAEARSNIKPKSNKIIPITIGKCTVLWEQ FT QFKFDTEVIPKEVRIKFLKEFCGIGNHKQFKDRMLDDLDLNKPKILNFENPEIKDQAYV FT MMRNTQCFMSKESGLQKVGNILEEFEYKINDANPKTWETILEIAKSRYWQGINDFSVLI FT KNILSVSQYNKHNTFRVVCTANNNFFGILYPSASIKSKKSTVVFSTISLHESENDVLKC FT GALYRTYKVKGGYLSVSKAIRLDKERCQRLVTSPGIFLLTSLLFKSDNDVNLNDVMNFA FT FFTSLSITKSMLSLTEPSRYMIMNSLALSSHVREYIAEKFSPYTKTLFSVYMTDMIKRG FT CMSANDQRQLISIRDVFLNEFEITQKGVSSEKNLQSIWFPGKVSLKEYINQIYMPFYFN FT AKGLHNKHHVMIDLAKTVLEIELDQRMNVPEPWSFDMKKQSANLYLLIFSVAKMLNMDT FT SRHNHLRSRVENRNNFKRSLTSISTFTSSKSCIKVGDFRSLKEKTAKHIKKINEKDAKK FT TRIANTEFVEESDRDFEVTKSTYLDLVKCVPGYTDYISTKVFDRLYEKYKLEEIEDKPA FT IEVIMDTMKSHKDFKFCFSNKGQKTAKDREIFVGEFEAKLCLYGVERIAKERCKLNPEE FT MISEPGDGKLKKLEINAESEIRYLIDATRSKNAEQSIIDDILETPKGIKLEINADMSKW FT SAQDVFFKYFWLIVLDPILYPAEKQRILYFFCNYMNKELILPDEMMCSLLDQKAEREND FT LIREMTNGFRKNTVNIRRNWLQGNLNYTSSYIHSCSMMVFKDIMKEVASLLEGRCNVCS FT MVHSDDNQTSVIMVQDKLNNDVIVNFVCSTFERCCLSFGNQANMKKTYITNHIKEFVSL FT FNIYGEPFSVFGRFLLPAVGDCAYIGPYEDMASRLSATQTAIKHGCPPSLAWVSIALNH FT WITFNTYNMLPGQINDPTKVFLFDRRELPIELCGILQADLATIALVGLEAGNISFLTNL FT LKKMSPPQLVKESVQTQCNNIEQWDLDCLSDSEILKLKLLRYVVLDSEITEDSKMGETS FT EMRSRSLITPRKFTTTSSLEKLISYKDFQEIIVNTERTEELLESILAQPELLVTKGENS FT REFMTTILFRYNSKKFKESLSIQSPTQLFIEQILFANKPVIDYTGIQDRYLSVLDMPRV FT QNSDGIIGRKTIPETFAAVKKDLNQMPLDQTDIKLIYSFCILNDPLNTTACNALLLSQV FT QSLLERTSMSAVTMPEFRNMKLIRYSPALVLRAYIHNNLSVGGANEDAMRRDLYHLNEF FT IVQTKIKERLDQRIAENQEIKGERDRLFEIKEITKFYQACYDYIKSTEHRIKVFILPSK FT AYTAFDFCATIHGNLIRDDGWYSVHYLKQIVSGTAKANVSLAPASEMVIVEECFKLLSH FT FCDTFIDVNSRLSFVINVIENFSYKNMPVKELLNLMKHSFKRQQFIPLLYWLGELQQDD FT LDKYDAFKTSERVSWNDWQINRTLNTGTIDLTIKGYQRTLRIMGEDDFLQIAELEILKG FT DNTSIETHGRKLLNCKHNLRFEKMRKYPIMEPNTYYICWQMRSRFAVTYQMLLSNIIEA FT RNSQTVSVTGGKFNELIPVCPVIIGRIESIERINMRQVKYLNMECSLSRLQLNQKEFVT FT TKRSHFSKMVFFQGPNLIVGNLNLTNLIKTPTLLTTNYPSLSQVPMMTLTRIFHCIGDE FT DQTDEFEFLSDEILEDIETTAVNTVPIFNAQYEVKSRKGYTYKKALQDALRRGIEEVED FT TLDFCGDGFYSPKNLAIIALLTSLIDRLQTNEWSTILQTAIHMSFFHNGKDRMYHLMKV FT PKAFVKNPIGEVLNWEKIRTFIIQLSTRHPGNHWDQMFNHFKEKILILIDREIKMEGMS FT WGEMLDELDDYKDTEMFHFE" XX SQ Sequence 6973 BP; 2616 A; 1049 C; 1279 G; 2029 T; 0 other; agtagtgtac ccttgtttac ttattacaat tagttcacaa ccaggaaaca cagtatggca 60 atattacttg gcgatgtcat tagacagtat acggccagaa tacgtgcttg tgccagtcct 120 gagattggca gagacatttt ggcagaaata acaatgacac ggcataatta ttttgcccaa 180 caattctgtg aggcaatcaa tattgaatat aggaatgacg ttccagctgc tgacatcata 240 ctagagatga atccagcatt ggatctgaga acaattaaaa tcccaaatgt aacacctgat 300 aattattaca gagatggagc aaagatttac ataatagact tcaaggtctc tgtaagcgat 360 gaatcagcaa tgcatacata caagaaatat gacactttgc ttggtgatgt tttcaatcaa 420 ttggggatag aatacgagat agttataatc agaatgaatc caagtgatat gcatcttcac 480 atctcaagtg ataatttcag caatctattt ccaaatatag ttttgaattt agacttcaca 540 tggtatttcc gtttacgaga tgaattattt caaagatatc gagataatga ggagtttatg 600 gagcttgttg cgcatggcga attcacgcct accatacctt ggttggtgga agatacacct 660 gaactgtaca ctcatcctgt tttcctcgaa tttataggct ctatgccaga tgggactata 720 gatgattttt attatgcatt aaatcataat gctttccagt cagataaatg gaatgattta 780 ttacatatta tgatgaaaaa atatggcgaa tattatactc actttgtcaa agagcaagca 840 cgtaatatct tcatcacaga tgagaattat aataaaccat ctaaagatga aataaagaaa 900 ggatgggctg agatggttga acgaattaaa aatcaacgtg atgtaactga tgattgctct 960 aaacaaaaac caagcataca ttttatatgg tcaccaaatg ataccaatgc tagcaatgag 1020 aacaatacaa aattaattaa attggcaaag aaactccagt caatcaagga atcagataca 1080 ttcagtttag cattcaagaa tattggttac ctaatggatt tcagtgagga tgtagataaa 1140 tatgaaaatt tttgcttaaa gctcaaagca gaagctagat caaacatcaa gccgaaaagt 1200 aataaaatca ttccaatcac aataggaaaa tgtacagtat tgtgggaaca gcaatttaag 1260 tttgatacag aagtaatccc aaaagaagtt agaataaaat tcctaaaaga attttgtggg 1320 attggaaatc acaaacagtt caaagataga atgcttgatg atttagattt gaacaaacca 1380 aagatactaa attttgaaaa tcctgaaatt aaagatcaag catatgtaat gatgaggaac 1440 actcaatgct ttatgagcaa agagagtggc ttgcaaaaag tggggaatat tttagaagag 1500 ttcgaataca agattaatga tgcaaatcct aagacatggg agactatcct agagatagcc 1560 aaatcgagat actggcaagg gataaatgat ttttctgttt tgataaaaaa catattatct 1620 gtctctcaat acaacaaaca taatacgttc agagtggtat gtacagcaaa taataatttc 1680 tttggtatat tgtatccctc tgctagcata aaatccaaga aatcaactgt tgttttctca 1740 actatcagcc tacacgagag tgagaatgat gttttaaagt gtggagcttt atacagaaca 1800 tacaaagtga aagggggata tttatctgtc tccaaggcaa tacgtttaga taaagagagg 1860 tgtcaaagac tggtgacatc accagggata tttctcttga cttctcttct attcaagagt 1920 gataatgatg ttaacttgaa tgatgttatg aactttgcat tctttacatc attgtctata 1980 accaagagta tgctctcact aacagaacct tcaagatata tgataatgaa ttcccttgca 2040 ctctcaagcc atgtgagaga gtacattgca gagaaatttt caccttatac taaaacatta 2100 ttttcagtat acatgacaga catgataaag cgtggatgca tgtctgcaaa tgatcaaaga 2160 caacttatat ctattaggga tgtattctta aatgagtttg aaatcactca aaagggtgtt 2220 tccagtgaaa agaacctaca gtctatttgg ttccctggca aagtcagctt aaaagagtac 2280 atcaatcaaa tttacatgcc tttctatttc aatgctaaag gtcttcataa taaacatcac 2340 gttatgatag atctagcaaa aacagtatta gaaatagagt tagaccaaag aatgaatgtt 2400 ccagaaccat ggagctttga tatgaaaaaa caatcagcca atttgtatct acttatattt 2460 tcagtagcaa agatgctgaa tatggacaca tcaagacata accatcttag aagtagagta 2520 gagaatagga acaatttcaa aaggtcatta actagtatct caacattcac cagttcaaaa 2580 tcttgtatta aagttggaga ttttagaagc ctcaaagaaa agacagcaaa gcacattaag 2640 aaaattaatg aaaaagatgc aaagaagact aggattgcaa acacggaatt tgttgaagaa 2700 tctgacagag attttgaagt aacaaagagc acataccttg atttggtaaa gtgtgttcca 2760 gggtatactg attacatttc tacaaaagtc tttgatagac tatatgagaa atataaacta 2820 gaagaaattg aagacaagcc agcaattgaa gtaataatgg atactatgaa atcccacaaa 2880 gatttcaaat tctgtttttc taacaaaggc caaaaaacag caaaagatag ggaaatattt 2940 gtcggtgagt ttgaagcaaa gctatgcttg tacggagttg agagaatagc aaaggaaaga 3000 tgtaaattaa atcctgaaga aatgatatct gaacctggag atggcaaatt gaaaaagctt 3060 gagataaatg cagaatctga aataagatac ttaattgatg caacaagaag taagaatgct 3120 gagcaatcta ttatagacga tatattagaa acacctaaag gtataaaatt agagattaat 3180 gctgatatgt caaagtggag tgcgcaggat gtcttcttca agtacttctg gttgattgtt 3240 ttggacccaa ttctgtaccc tgcagaaaag cagcgaatac tatacttctt ttgcaattac 3300 atgaataagg agttaatatt gcctgatgag atgatgtgct cattattaga tcaaaaagca 3360 gagagagaaa atgatctcat aagagaaatg acaaatggat ttcgaaagaa tacagtgaat 3420 ataaggcgga attggttgca gggtaatttg aattatacat caagttacat tcatagttgt 3480 tccatgatgg ttttcaaaga tataatgaaa gaagttgcgt cacttttaga agggagatgc 3540 aatgtttgta gcatggttca ctcagatgat aatcagacct cagttataat ggtccaagat 3600 aaactgaata atgatgtaat cgtgaacttt gtatgttcaa cttttgagag atgctgttta 3660 tcatttggaa atcaagctaa tatgaagaaa acatatataa caaatcacat aaaagaattt 3720 gtcagtttgt ttaatattta tggggagcca ttctctgtat ttggaagatt cttactacca 3780 gcagttggtg attgtgcata tattggacca tatgaagata tggcaagtag attatcagca 3840 acccaaacag ctattaaaca tggatgtcct cccagtctag cttgggtgag cattgcattg 3900 aatcattgga ttacattcaa tacttacaat atgctacctg ggcagataaa tgatcctaca 3960 aaagtattct tatttgatag aagagaattg cctatagaat tatgtggcat attacaagct 4020 gatttggcaa caattgcatt agtcggatta gaagctggta atatatcgtt cttaacaaat 4080 ttgctaaaga aaatgtcacc tccacaatta gtaaaagaat cagtacaaac tcagtgtaat 4140 aatatagagc aatgggactt agactgccta tcagatagtg aaatattaaa attaaaactg 4200 ttaagatacg ttgtcttaga ttcagagatt acagaagaca gcaaaatggg agagacaagt 4260 gagatgagaa gtaggtccct tattactcct aggaaattca ctacaacctc atctctggaa 4320 aaacttatat catacaaaga tttccaagag attattgtaa acacagaaag gacagaagaa 4380 ctcttagagt ctatattagc acaacctgaa ctcttagtca caaaaggaga aaattctaga 4440 gaattcatga caaccatcct ctttaggtat aactctaaaa agttcaagga atctctgtca 4500 atacagagcc ctactcagct gttcatagag caaatattat ttgctaataa gccagtaata 4560 gattacactg gtatacaaga tagatattta agtgtgttgg atatgcccag agtgcaaaat 4620 agtgatggga ttattgggag gaaaacaata cccgagacat ttgctgctgt taaaaaggac 4680 ttaaatcaaa tgcctttaga tcaaacagat attaaactga tctattcatt ttgcattctg 4740 aatgatccac tcaacacaac agcatgtaat gccctcttac tatctcaagt acaatcttta 4800 ttagagagaa caagtatgtc tgctgtaact atgccagaat tcagaaatat gaagttaatt 4860 agatattcac ccgcattggt tttaagagca tatatccaca ataatctgtc tgtgggaggt 4920 gcaaatgaag atgctatgag gagagatttg taccatctaa atgaatttat tgtgcagacc 4980 aagattaaag aaagattaga tcaaagaatt gcagaaaatc aagaaataaa aggggaaaga 5040 gataggttat ttgagataaa agaaattact aaattctacc aggcatgcta tgattatatt 5100 aaatcaacag agcatagaat aaaagtattc attctaccat ctaaagcata tactgcattt 5160 gatttttgtg ctactattca tggtaattta attcgagatg atggatggta ttctgttcat 5220 tatctaaagc aaatagtttc tggaacagca aaggccaacg taagtttagc tcctgcgagt 5280 gaaatggtta ttgttgagga atgcttcaaa ctactatctc atttctgtga tacatttatt 5340 gatgtgaatt caagattatc atttgtaata aatgtgattg agaatttctc ttataagaat 5400 atgcctgtaa aagagttact aaatttaatg aagcattcct tcaaacgcca acaattcata 5460 cctttattat attggcttgg ggaattgcag caagatgatc tagacaaata tgatgcattt 5520 aagacaagtg aaagagtatc ttggaatgat tggcaaataa atagaacatt aaatacaggt 5580 acaatcgatc taacaatcaa aggttatcaa agaactttaa gaatcatggg ggaagatgac 5640 tttcttcaaa tagcagaatt agaaattctg aaaggtgaca atacatcaat agaaactcat 5700 ggtaggaaat tactgaattg taaacataac cttaggttcg aaaagatgag gaaataccct 5760 ataatggaac ctaatacata ttatatatgc tggcagatga gatctagatt tgcagtaaca 5820 tatcagatgc tgctgtctaa tattattgaa gctcgcaatt ctcagacagt atcagtgaca 5880 ggaggaaaat ttaatgaatt aatacctgtt tgccctgtta ttattggtag aattgaatca 5940 atcgaaagga taaatatgag gcaagtaaaa tatttaaata tggaatgctc cttatctaga 6000 ttacaactca atcaaaagga atttgtcact acaaagaggt ctcacttctc aaaaatggtc 6060 ttcttccaag gtcctaactt aatagtagga aatctgaatt taacaaattt aataaagaca 6120 cccacattgc taactaccaa ctatccttct ttatctcagg ttccaatgat gacattaact 6180 agaatattcc actgtatagg agacgaagat caaacagatg aatttgaatt cttatcggat 6240 gaaatattag aggatataga aacaactgct gttaacacag tccctatatt taatgctcaa 6300 tatgaagtca aatctaggaa aggttacaca tacaaaaagg ctttacaaga tgcattgagg 6360 agaggtatag aggaggttga agatacatta gatttctgtg gagatggctt ctattcccca 6420 aaaaatcttg caattatagc actgctgaca agtcttatag atcgattaca aacaaacgaa 6480 tggtctacca ttttacaaac agcaatacat atgtcttttt tccataatgg aaaagatagg 6540 atgtaccact tgatgaaggt tccaaaagca tttgttaaga atccaattgg agaagtgtta 6600 aattgggaga aaatcaggac atttataatc caattaagca caagacaccc tgggaaccat 6660 tgggatcaaa tgttcaacca cttcaaagaa aagatactga tcctaattga tagggaaatt 6720 aagatggaag gtatgtcctg gggagaaatg ttagatgaat tagatgatta taaagacact 6780 gaaatgttcc atttcgaata gagacaggat ctccatattt aatctttaaa ttaaagacat 6840 tgtctttaat ttaaaagatt aaaattgatc tttaaagtta aagacatggt ctttaatttt 6900 aaagatcaaa aatcaacaga ggcactaata taaactcaca cagacgtaat aatataatca 6960 agggaacact act 6973 // ID MG029285; SV 1; linear; viral cRNA; STD; VRL; 4613 BP. XX AC MG029285; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Murutucu virus isolate BeAn974 segment M, complete sequence. XX KW . XX OS Murutucu virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-4613 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-4613 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; 0dac81160afa47ccc7d79e466b6244df. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..4613 FT /organism="Murutucu virus" FT /segment="M" FT /host="Sapajus apella" FT /isolate="BeAn974" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="1955" FT /db_xref="taxon:348008" FT CDS 49..4335 FT /codon_start=1 FT /product="glycoprotein" FT /db_xref="GOA:A0A2Z3DE72" FT /db_xref="InterPro:IPR005167" FT /db_xref="InterPro:IPR005168" FT /db_xref="InterPro:IPR017896" FT /db_xref="InterPro:IPR026400" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DE72" FT /protein_id="AVX48967.1" FT /translation="MLLLMLLNVVAVVSGLPLSNRCFEGGVLVEERSMDHGIAEICVKD FT DVSMIKTLSVQKKNESRFTNIIMRKMLIPNYQDCNPTETPNGPIMIFKPNTDLMLIPKT FT YACRVDCTISLDRDEAAIILHSDKLNNFEVMGTTTATRWFQGSTTYSLEHTCEHIQVTC FT GSKSLSFHACFKYHMACIRLLNKSYMPAFMIQSVCQNKEIIIMTCLVLIIFALLYILTM FT SYICYILLPIFIPIAYFYGWLYNKSCKKCTYCGLAYHPFTKCGKNCVCGSMFENSDRMR FT MHRQAGLCKGYKSLRAARILCKSRGSAFCLAVILATLLLSFIQPLEAMQLSYNNEIIEL FT TELSNELDLIFEGLGNVRNIIIGQLVLCAAIIIMLISYLVMHKKIEDKLLMRILYYCPE FT CEMTHPKRGLRKYFSGEFTNMCNSCMCGCTYNQEDLNDGYAIPMTHKLTVGCYAPGRYY FT TSRRMNNIGMYLILAFLIAITSMSIAAADENCLKTSLYKSQEPVSCSAWIKVSSCTSIT FT GIETMMSKVKLPKHDTDLISGLRGSLDSILKKSQDSQVPIQSYIFEALAVNMYCPELAS FT AALDNGNINTNIKTQYSERQLEVCSAKRAVAVCSCFTGSSTCNPANADDVKDYYKSHHE FT IFKSDIARMINTLAKMFPGVFAKELLLASKTSNYTRIQAILKMLEPKLTNAKATQALVK FT ILSAGISETSVANVPAPVPSAKGVSPFDAKWSQDSIFKNMASATAIKTCASGKAYKCFF FT PVSLRFTFYYSCTDPNKFYHTGDDLISIHYNNAGTLCVADPYCEKDFTPVDVANKDSLM FT TMKCDEVTLTASDMPSAAPINKCRVISMQQCTVVGVANRSVAECSNGYFYEYTGELHQS FT SKDDVGVYCFERACKPNRFPHHPSNLKGCVSHNSEMLTRKLKEINYSNLEQLKHSLQET FT IKTDLVEHHYILTKNLPKINPIFKALSIQGVETDSGIQSPYIETNLMVKTGMSLGLHLT FT TKNGDPLFDIIVFVKTAHYESVYDEIYQTGPTVGINVQHDEQCTGKCPASLMKTGWLSF FT SKEHTSQWGCEEFGCLAVNEGCVFGHCKDIIKPDMTVLRKAQDETPVIKICISLPQETY FT CQPINAFTAIVTDKIETQFISNEAGKLPKLLGYRSHKIYTGMINDLGTFSKMCGSVQSV FT NNNVLGAGTARFDYICHAAQRKDITVSRCFDNFYDSCLQLEASESIVYDNTVKKVSLLN FT KNMGELRLKIKMGDINYKLFEKMPSFDFKGSCVGCLKCVKGVDCEFDIHSTSESVCVLT FT SNCAFYHNNLRIDPNIQKYGMKAKCNEEKIWIDLCGNKIEIQLSIVQSHETIEVGNSDQ FT TYFVKEKDDRCGTWLCKVSEQGISSIFAPFFAVFGDYAKIAFYTVLGILAAALIIYLML FT PMIGKIKDILKKNEIEYLKEFRGKRT" XX SQ Sequence 4613 BP; 1637 A; 752 C; 890 G; 1334 T; 0 other; agtagtgaac cgctgtgtac ttatatttta gttgagttca ttacaaggat gttgttgcta 60 atgctattaa atgtggttgc tgttgttagc ggattgcccc tatcaaacag atgtttcgaa 120 gggggtgtcc ttgttgaaga gcggtccatg gaccatggaa tcgcagagat ttgtgtgaaa 180 gatgatgtca gcatgataaa gacattatca gtccaaaaga aaaatgaaag cagatttaca 240 aatataataa tgaggaagat gttgattcca aattatcaag actgtaaccc tacagaaact 300 cctaatggtc caataatgat ctttaagcca aatacagact tgatgttaat ccctaagaca 360 tatgcttgca gagttgattg cacaatatca ttggatagag atgaggctgc aattatcctc 420 cactctgata aactcaataa ttttgaagtt atggggacaa caacagctac aagatggttt 480 caggggagca ccacttactc tcttgagcat acatgtgaac atatacaagt tacatgtggg 540 tcaaaaagct taagtttcca tgcttgtttt aaatatcata tggcttgtat taggttgtta 600 aacaaaagct acatgcctgc attcatgata caatctgtat gccagaataa ggaaattatt 660 ataatgacat gtttggtatt aatcatattt gcattgctat acattctaac aatgtcatac 720 atatgctata tcttgctccc tatctttata ccaattgcat acttttatgg ctggttgtac 780 aacaaatcat gcaagaaatg cacatattgt ggccttgcat atcatccatt tactaaatgt 840 ggtaaaaatt gtgtctgtgg gtcaatgttt gaaaactcag atagaatgag gatgcataga 900 caagcaggat tatgcaaagg atataagtct ttgagagctg ctaggattct ctgcaagagt 960 agaggttctg cattttgcct cgctgtgata ttagctacat tacttctatc gtttattcag 1020 ccattagaag caatgcagct gtcatacaac aatgagatca ttgaactgac agaattatct 1080 aatgagctgg atttaatctt tgaaggtcta ggcaatgtta ggaatatcat aattggtcaa 1140 ttagttttat gtgctgccat aataatcatg ttaataagct atttggtcat gcataagaaa 1200 attgaagata agctgctcat gagaatcctg tactattgtc ctgaatgcga aatgacccat 1260 ccgaaaagag ggctaaggaa gtatttctca ggagagttta caaacatgtg caatagctgc 1320 atgtgtggat gcacatacaa tcaagaagac ttaaatgatg ggtatgctat tccaatgacc 1380 cacaagctaa ctgttggctg ttatgccccg ggaagatact atactagcag gagaatgaac 1440 aatataggga tgtacttgat attagcattt ctgatagcta taacatctat gtcaatagct 1500 gcagctgatg aaaattgttt aaagacatct ttatataagt cgcaggaacc agtaagctgt 1560 tctgcttgga ttaaagtatc gagttgtact agcataactg ggatagaaac aatgatgtca 1620 aaggttaaat taccaaagca tgatacagat ctaatatccg gtttaagagg gagtctagat 1680 tctatcttaa agaagtcaca ggacagccaa gtccctatac agtcatacat ttttgaagct 1740 ctagcagtaa atatgtactg cccagagtta gcaagtgctg ctttagataa tggaaatata 1800 aatactaata tcaaaactca gtattcagaa agacaattgg aagtatgcag tgctaagaga 1860 gcagttgcag tatgttcctg ctttacaggg agttccacat gtaatccagc aaatgcagat 1920 gatgttaaag attattacaa aagccatcat gaaatattca aaagtgacat agctaggatg 1980 ataaataccc ttgctaaaat gtttccaggt gtatttgcta aggaactatt attggcatcc 2040 aaaacctcta actatactag aatacaggct atattgaaaa tgttagagcc taaacttact 2100 aatgcaaagg ccacacaagc cttggtcaaa attttatctg cagggatctc tgaaacatca 2160 gtagcaaatg tgccagcccc agtgccctct gcgaaaggtg tgagtccgtt tgatgcaaaa 2220 tggtctcagg acagtatctt caaaaatatg gcatctgcaa ctgctataaa aacatgtgca 2280 tctggcaagg cttataaatg cttcttccca gttagtctga gattcacatt ctactactca 2340 tgcactgatc ctaacaaatt ttatcataca ggagatgact tgatttcaat acattataat 2400 aatgctggta ctttatgtgt tgcagatccg tactgtgaga aagactttac tccagtggat 2460 gtggctaaca aagactctct aatgactatg aagtgtgacg aagttactct tacagcatcg 2520 gacatgccaa gtgctgctcc tataaacaaa tgtagagtca tctccatgca acaatgtaca 2580 gttgtaggag ttgctaatcg cagtgtagct gaatgctcca atgggtattt ctatgaatac 2640 acaggcgaat tgcatcaaag ttctaaagat gatgtaggtg tttactgctt tgaaagagct 2700 tgcaagccta acagattccc acaccacccg tcgaatctaa aaggttgtgt ttcacacaat 2760 agtgagatgt tgactagaaa acttaaagag ataaattact ccaatttaga acagttgaaa 2820 catagtctcc aagaaacaat aaaaacagac ttggtcgagc accattatat actgacaaag 2880 aatcttccaa aaataaatcc tatcttcaaa gctctgtcta tccaaggagt tgaaacagat 2940 agtggaattc aaagcccata cattgaaaca aatttaatgg taaagactgg catgtccctg 3000 ggtttacatc taactacaaa gaacggtgat ccactcttcg atataattgt atttgtcaaa 3060 actgcacatt atgagtcggt atatgatgaa atttatcaaa caggcccaac agttggtata 3120 aatgtccaac acgatgagca atgcactggg aaatgccctg catctcttat gaaaactgga 3180 tggctttcat tttctaaaga gcacacaagt caatgggggt gtgaggaatt tggatgctta 3240 gcagttaatg aggggtgtgt ttttggacac tgtaaggata tcataaaacc tgatatgaca 3300 gtattacgta aggctcaaga tgaaacacct gtcatcaaaa tttgcatatc attgccacaa 3360 gaaacatatt gtcagccaat aaatgcattc acagctattg tgactgacaa aattgagaca 3420 caattcatct caaatgaggc aggaaaactc ccaaagttgt tagggtatag atcgcataaa 3480 atctatactg gcatgataaa tgacctcgga acattttcaa aaatgtgtgg gagtgttcaa 3540 tctgtaaata ataatgtctt aggagctggc actgcacgat ttgattatat ttgtcatgca 3600 gcacaaagga aggacataac agtaagcaga tgtttcgata acttctatga ttcatgttta 3660 cagttagaag catctgaatc tatagtatat gataatacag tgaagaaagt ttcgttattg 3720 aacaaaaata tgggtgaatt gagactaaag attaaaatgg gggacattaa ttacaagctt 3780 ttcgaaaaaa tgccctcttt cgactttaaa ggtagttgcg taggctgcct aaaatgtgtc 3840 aaaggtgttg actgtgaatt tgacatacat tccacatctg aatctgtatg tgttctaaca 3900 tcaaattgtg ctttctatca taacaaccta agaattgatc caaacattca aaaatatggg 3960 atgaaagcaa aatgtaatga agaaaaaata tggatagatt tatgcggcaa taaaatagag 4020 atacaattat ctattgttca atcacatgaa actattgagg ttggaaatag tgatcaaaca 4080 tattttgtaa aagagaaaga tgacagatgc ggaacatggc tatgcaaagt cagcgaacaa 4140 ggaatatctt caatttttgc cccgttcttt gctgtatttg gagattatgc aaaaattgca 4200 ttctatactg tgttgggtat tttagctgca gccttgataa tatatttgat gttgcctatg 4260 atcggcaaga ttaaagatat attgaagaaa aatgagattg aatacttaaa agaatttaga 4320 ggcaagagga cataagcagc agcaaagtcc gcaagaatat gcatagaaca tagataaaat 4380 ttaaaatata aaaaaaaata aagaaaaata aaaataaaaa caatataaac aaaaaacaaa 4440 aaacaaaaaa aaaaaaaaaa aaaaaataaa aaaaaataaa aaatcagata gcaaattatg 4500 ggataattgg aataactggg ttaagattag gggttttaca acagctagat caaattccaa 4560 tcttatctat atttaattat tgataaaaat acaagacaca gcggttcact act 4613 // ID MG029286; SV 1; linear; viral cRNA; STD; VRL; 1111 BP. XX AC MG029286; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Murutucu virus isolate BeAn974 segment S, complete sequence. XX KW . XX OS Murutucu virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-1111 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-1111 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; b39d7a1cbef9084e57d54882b3b3e05d. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1111 FT /organism="Murutucu virus" FT /segment="S" FT /host="Sapajus apella" FT /isolate="BeAn974" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="1955" FT /db_xref="taxon:348008" FT CDS 29..346 FT /codon_start=1 FT /product="NSs protein" FT /db_xref="GOA:A0A0H3VFK2" FT /db_xref="InterPro:IPR000797" FT /db_xref="UniProtKB/TrEMBL:A0A0H3VFK2" FT /protein_id="AVX48969.1" FT /translation="MSLAGLLEIDTESWQLLFLSFRLRNGDRIRLRLTLNKRTSALSMS FT TEMSSHLRILGSSSCARMRLSRSSVRVRQSSLTLNLALGRSLLLITITLPTLQIQSLMV FT N" FT CDS 67..774 FT /codon_start=1 FT /product="N protein" FT /db_xref="GOA:A0A0H3VFS8" FT /db_xref="InterPro:IPR001784" FT /db_xref="UniProtKB/TrEMBL:A0A0H3VFS8" FT /protein_id="AVX48968.1" FT /translation="MATPLFEFSVEERGQNSSTFDPKQAYQRFIDEHRDELTLENIRVF FT FLRANEAKQKLRKSSAKLANLKFGTWKVPVVNNHYPANTANTVADGELTLHRISGFLAK FT FILELYADTEHRPEIEEKIINPIAESKGVTWAQSAKVYLSFFPGTEMFLHEFEMLPLAI FT YIYRAQKGEIDVALLKKPLRQQYKNDTPDKWMKEKKVMIQGAVSRISKLPWGTSGLSSQ FT AKDFLKEFGITMK" XX SQ Sequence 1111 BP; 336 A; 204 C; 252 G; 319 T; 0 other; agtagtgaac ttcttaggaa gttcacttat gtcacttgca ggtctattgg agattgatac 60 agaatcatgg caactcctct ttttgagttt tcggttgagg aacggggaca gaattcgtct 120 acgtttgacc ctaaacaagc gtaccagcgc tttatcgatg agcacagaga tgagctcaca 180 cttgagaata ttagggtctt cttcctgcgc gcgaatgagg ctaagcagaa gctccgtaag 240 agttcggcaa agctcgctaa ccttaaattt ggcacttgga aggtccctgt tgttaataac 300 cattaccctg ccaacactgc aaatacagtc gctgatggtg aactgactct tcatcggatt 360 tctgggttct tggctaagtt cattctggaa ctatatgctg acacagaaca ccgcccagag 420 attgaggaga aaatcatcaa ccccattgca gaatcaaagg gggtcacatg ggctcagtct 480 gccaaggtgt acctttcatt cttccctgga acagagatgt tcctacatga gtttgaaatg 540 ctcccattgg caatctatat ctacagagct caaaagggag agattgatgt ggcactcttg 600 aagaagccgc ttagacaaca gtacaagaac gacacaccag acaagtggat gaaagagaag 660 aaggtgatga ttcagggggc tgtctccaga atctcaaaac tcccatgggg aaccagtggc 720 ttatcatctc aggccaagga cttcctcaag gagttcggaa taactatgaa gtaatctcat 780 taggattaat aagattttta tagttctata agtttagtta agttttaggt taagttttag 840 tttagagtaa tcaataatgg taagtaatgg taagttcaaa taagtgtaca ctataatttt 900 caaaattcta aattggggtt aattaattta attgggtttt aattagggga aatatagccg 960 ctgagggaat tgttttgagc agctatattt cacatagaca aagggttggg tggttgggga 1020 aacaagagag gctgcctcat ttcacaattc tatttgtaat atgctaactg tgttcattga 1080 caaattgata ctttctaaga agaacactac t 1111 // ID MG029287; SV 1; linear; viral cRNA; STD; VRL; 6960 BP. XX AC MG029287; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Nepuyo virus isolate BeAn10709 segment L, complete sequence. XX KW . XX OS Nepuyo virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-6960 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-6960 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; c2ad0f6a8f86d554774390ff3f36f241. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6960 FT /organism="Nepuyo virus" FT /segment="L" FT /host="sentinel mouse" FT /isolate="BeAn10709" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="1959" FT /db_xref="taxon:348009" FT CDS 59..6808 FT /codon_start=1 FT /product="RNA-dependent RNA polymerase" FT /db_xref="GOA:A0A2Z3DKZ0" FT /db_xref="InterPro:IPR007099" FT /db_xref="InterPro:IPR007322" FT /db_xref="InterPro:IPR029124" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DKZ0" FT /protein_id="AVX48970.1" FT /translation="MAILLGDVIRQYNARIRACTNPELGRDILSEITVIRHNYFAQQFC FT DAINIEYRNDVPAIEIIQEMVPNLDPLTIKIPNITPDNYYRDGTKIYIIDFKVSVSDES FT SIYTYRRYNALMGDVLDKLNIEYEIVIVRMNPSDMHLHISSDNFANLFPNIALNVDFAW FT YFRLRDDLFERFRDNEEFMELIAHGDFTPTIPWVNEDTPELMTDPIFLEFLESMPDDTV FT NDFFYCLNNNAFETEKWNDLLHIIMRKYGDYYNKFIKEQAKNIFLLDENFEKPTKSEIE FT KGWAEMIGRVSNEREVIDDVTKQKPSIHFIWAKHNEGTSNENNTKIINLAKKMQSINSN FT DSLSNAFRAIGKLMDFSEDIPRYEQFCLRLKAEARSSLKPKSRKIEPIKIGDCTILWEQ FT QFKLDTDIIPKEIRIKFLKEFCGIGNHKQFKDRMLDDIDLDKPKILNFQNPDIINQSYI FT MMKNTQVLMEKESGLKKAGNILEEFKDKIIGANEQTWNLIEEITKTRYWQAINDFSVLI FT KNMLAVSQYNKHNTFRVVCTANNSVFGLLYPAASIKSKRSTIVFSTIVLHENEKEILSC FT GSLYKTYKVKGGYLSISKAIRLDKERCQRLVTSPGLFLLTTLLFKGETDIKLGDIMNFA FT FFTSLSITKSMLSLTEPSRYMIMNSLALSSHVREYIAEKFSPYTKTLFSVYMTDLIRRG FT CMSANDQRQQISIKDVFLNEFEITQKGVSNDRNLQSIWFPGKVSLKEYINQVYMPFYFN FT AKGLHNKHHVMIDLAKTVLEIELDQRLNVPEPWSFDLRKQSANLYILIFSIAKMLNMDT FT SKNNHLRNRIENRNNFKRSITSISTFTSSKSCIKIGNFQSIKEKRSNHIKKIQEKEIKK FT TRIANTEFVDELDRDYEIAKSTYIDLIKCVPEYTDYISTKVFDRLYEMFRNDEIEDKPA FT IVIIMETMRKHKDFKFCFFNKGQKTAKDREIFVGEFEAKLCLYGVERIAKERCKLNPEE FT MISEPGDGKLKKLEINAESEIRYLIDMTRSKNTELSKVDDILETPKGIKLEINADMSKW FT SAQDVFFKYFWLIVLDPILYPNEKKRILYFFCNYMKKELILPDEMMCSLLDQKAEREND FT IIRQMTNGFRKNTVNIKRNWLQGNLNYTSSYIHSCSMMVFKDIIKESSFLLEGKCTVNS FT LVHSDDNQTSIIYIQDKISNDIIINFICSTFEKCCLTFGNQANMKKTYVTNHIKEFVSL FT FNIYGEPFSIYGRFLLPSVGDCAYIGPYEDMASRLSATQIAIKHGCPPSIAWVSIALNH FT WITFSTYNMLPGQINDPCKVFTFDRKELPIELCGILQADLSTIALVGLESGNISFLTNL FT LKKMSQPQFVKESVQAQCTDIINWDLTRLTESEMIRLKILRYIVLDSEITEDSKMGETS FT EMRSRSLITPRKFTTTSSLERLVSYKDFQEIIVDEEKTNLLLDNILEKPELLVTKGEDS FT EEFMTTILFRYNSKKFKESLSIQSPTQLFIEQILFSNKPIIDYSGIQDRFLSILDIPKV FT QDNDTIIGRKTIPDTFAAIKKDLSLLKIDHQDIKLVYSFCILNDPLNTTACNALLLSQV FT QSFIERTSLSAVTMPEFRNMKLIKHSPALVLRAYIHNNMNLPGANEEAMKRDIFHLSEF FT IRETKMREKLDARIAENEEIKGERDRLFEIKEITKFYQACYDYIKSTEHKVKVFILPAK FT AYTAFDFCATIHGNLIKDKGWFTVHYLKQIISGTAKATISNTPTSELIIVDECFKLMSH FT FCDTFIDTGSRLTFFNNILSNFTYKNIPVKDLLNIMLTSFKRQHFLPILYWIGELNQID FT IDKYDALKSGERISWNNWQVNRTLNTGPIDLTIKGYQRTIRIIGENEKLGVAELQVLKD FT DTTSVESHARKLLNSKHNLKFEKMERVEIMEQNTYYICWQFKTRFSYTYQMLLSNIIET FT RNSQTISVTGNKFNELVPVCPVIISRTNSTEQISMKQIKYLNMECSLSRLKLTQYEFAT FT VKRSHFSKMVFFHGPNLIVGNLNITSLIQTPSLLTTNYPALSQIPMMTLTRLFYCIGDT FT DQTDEFEFLSDELLEETETAVINSIPLFNAQYEVKSKKGFTYKKALQEALRRGIEEIEE FT VFDFCNDGFFSPKNMAIVALLTNLIDRLKTNEWSTILQSALHMAFFHNNKDDIYHCMKI FT PKAFIKNPIGEIIDWEKTRKFIIQLETRRQGSHWDQMFEHFKTKCLILIDKEIKMEGLS FT WGEMLDELDDYKDVEMFGFGF" XX SQ Sequence 6960 BP; 2721 A; 949 C; 1194 G; 2096 T; 0 other; agtagtgtac ccttgctttg aacactctaa cttggtttca aagtaaagcg cttcgagaat 60 ggcaatcctc ttgggagatg tgataaggca gtataatgcg cgaatccggg cttgtacgaa 120 ccctgagctt ggaagagata ttctatctga aataacagtt attcgccata actactttgc 180 tcaacaattt tgtgacgcaa taaatataga atatagaaat gatgttccag caatagaaat 240 catacaagaa atggttccga atcttgaccc attaacgata aagataccga atattacacc 300 ggacaattat tatagagacg gcacaaaaat ttacatcatt gattttaagg tatcagtcag 360 tgatgaatca tcaatttata cgtataggag atacaatgct ctaatgggag atgtcctaga 420 taagttaaac attgaatatg agatagtcat agttaggatg aatcctagtg atatgcatct 480 tcatatctct agtgacaatt ttgcaaatct attcccaaac attgcactaa atgtggattt 540 tgcttggtat tttaggctta gagacgatct atttgaaagg tttagagata atgaggaatt 600 tatggaatta attgctcatg gtgattttac acctacaata ccatgggtta atgaagatac 660 acctgaattg atgacagacc cgatatttct agaattctta gaatcaatgc cagatgatac 720 tgttaatgat ttcttttatt gtctaaataa taatgctttt gaaactgaga aatggaatga 780 tttattacat attattatga ggaaatatgg ggactactac aataagttta taaaagaaca 840 agctaaaaat atatttcttt tagatgagaa ttttgagaaa cctactaagt ctgaaattga 900 aaaaggatgg gcagaaatga ttgggcgggt ttcaaatgaa agagaggtca tagatgatgt 960 tacaaagcag aagccaagta tacattttat ttgggcaaag cataatgagg gcacatctaa 1020 tgaaaacaat actaaaataa tcaacttagc aaagaagatg caaagcatta attcaaatga 1080 tagcttatca aatgcattta gagcaatagg taaactaatg gactttagtg aagatatccc 1140 cagatatgaa caattttgct tgagattaaa agcagaggca cgatctagtc taaaaccaaa 1200 atcgaggaaa atagagccaa tcaaaatagg agactgcaca atattatggg aacagcagtt 1260 taaattagac acagatatta tacctaaaga gatcagaata aagttcttaa aagaattttg 1320 tggtatagga aaccataagc agttcaaaga tagaatgtta gatgatattg acttggataa 1380 gcctaaaatc ttaaattttc aaaatccaga tataattaat caatcataca taatgatgaa 1440 aaatacacaa gttttaatgg aaaaagaaag cggccttaaa aaagccggga atattttaga 1500 agaatttaaa gataaaataa taggtgccaa tgagcaaact tggaatctaa ttgaagaaat 1560 aacaaaaaca agatactggc aggcaataaa cgatttttca gtactaataa agaacatgct 1620 tgcagtttca caatataata aacataacac atttagagtt gtctgtactg ctaataactc 1680 agtttttggt ttattatatc cagcagctag tataaaatct aaaaggtcta caatagtctt 1740 ttctactata gtcttgcatg aaaatgagaa ggagattttg tcttgcggat ctttatacaa 1800 aacatataaa gtaaagggtg gttatttatc aatatcaaaa gcaattagat tagataaaga 1860 aagatgtcag agattagtta catcacctgg tttattccta ttaacaacat tactttttaa 1920 aggggagact gatataaaat taggagatat aatgaacttt gcttttttta catcattatc 1980 tataactaaa agcatgcttt ctctgacaga accatctaga tatatgataa tgaattcatt 2040 agcattatca agccatgtta gagagtatat agctgaaaaa ttttcccctt atactaaaac 2100 cctattttct gtatatatga ctgatcttat aagaagaggc tgcatgtctg ctaatgatca 2160 gaggcaacag atatctataa aagatgtttt tttgaatgaa tttgaaataa cccaaaaggg 2220 agtatctaat gataggaatt tacagtctat atggtttcct ggtaaagtaa gcttaaagga 2280 gtatataaat caggtttata tgccctttta tttcaatgca aaagggctgc acaataagca 2340 tcatgttatg atagatcttg caaaaacagt tctagaaata gaattagatc agaggttgaa 2400 tgttccagaa ccttggagtt ttgacttaag aaaacaatct gctaatcttt acattttgat 2460 cttttctata gcaaagatgc taaatatgga tacgtcaaaa aataatcatc ttagaaatag 2520 aatagaaaat agaaataatt ttaaaagatc tataaccagt atctcaactt tcactagctc 2580 caaatcttgt attaaaatag ggaactttca gtcaattaaa gaaaagaggt caaaccatat 2640 aaagaaaatt caggaaaaag agattaagaa aactagaata gcaaatacag aatttgttga 2700 tgaattagat agggattatg agatcgctaa aagtacttat atagacttga ttaaatgcgt 2760 accagaatat actgactata tatcaacaaa agtctttgat aggttgtatg aaatgtttag 2820 aaacgatgaa atcgaggaca aacctgcaat agttattata atggaaacta tgaggaagca 2880 caaagatttc aagttttgtt tttttaacaa aggacaaaaa acagcaaaag atagggagat 2940 atttgttggg gaatttgaag ctaaactatg tttatacggt gtagaaagaa ttgctaaaga 3000 aagatgcaaa ttgaatccgg aagaaatgat atcagagcca ggtgatggta aattgaaaaa 3060 actagaaata aatgcagagt cagagattag atatctaata gatatgacca gatcaaagaa 3120 tacagagcta tctaaagttg atgatatttt agaaacacct aaaggtatta aattagaaat 3180 aaatgctgat atgtcaaaat ggagcgctca agatgtcttc ttcaaatatt tctggctaat 3240 tgttttagat cctatcttat accctaatga gaagaagagg atattatatt tcttttgcaa 3300 ctatatgaaa aaagaattaa ttctaccaga tgagatgatg tgctcactat tagatcagaa 3360 agcagaaaga gaaaatgata taattagaca aatgacaaac gggttcagga aaaatactgt 3420 caatataaaa agaaattggc tacaggggaa tttaaattac acctctagct atatacatag 3480 ttgttctatg atggtcttta aagatataat aaaagaatca tcatttttac ttgaaggtaa 3540 atgcactgta aatagtctag tacattcaga tgataaccag acgtctatca tatatattca 3600 agataaaata agtaacgaca ttattattaa ttttatttgc agcacatttg agaaatgctg 3660 tttaacattt ggaaaccaag ctaacatgaa aaaaacatat gtaacaaatc acataaaaga 3720 gttcgtttca ttattcaata tctatgggga gcctttctca atatatggtc gattcctgtt 3780 accttccgtt ggtgattgtg cttatattgg accgtatgaa gatatggcaa gcaggctatc 3840 agctactcaa attgcaatca aacatggttg tccaccaagt attgcatggg tcagtatagc 3900 attaaatcat tggattacat ttagcactta taatatgttg cctgggcaaa taaatgatcc 3960 ttgtaaagtg ttcacctttg atagaaaaga gctacctatt gaactctgtg gcatcttaca 4020 agcagatctg tcaactatag ccttagttgg tttagaatct ggaaatatat cattcttaac 4080 caatttatta aaaaagatgt cccaaccgca atttgtaaaa gaatctgtcc aagcacaatg 4140 cacagatatt ataaattggg atctaacaag attaacagaa agtgaaatga ttaggctaaa 4200 gattttaaga tacattgtat tggattcaga aataacagaa gatagtaaaa tgggagaaac 4260 aagcgaaatg agaagtagat ccctaataac accaaggaaa ttcacaacaa cctcttcatt 4320 agagagatta gtttcataca aggattttca agagatcatt gtagatgaag agaagacaaa 4380 tttattatta gacaatatat tggaaaagcc tgagctctta gtaacaaaag gtgaagactc 4440 agaagaattt atgactacaa ttttatttag atataattca aagaaattta aagaatcatt 4500 gtctatacaa agccctaccc agttatttat tgaacaaata ttgttttcta acaagcctat 4560 aatagattat agtgggatac aagatagatt tttaagtata ttagatattc ccaaagttca 4620 agacaatgac actattattg gcaggaaaac tataccagat acatttgctg caataaaaaa 4680 ggatttaagc ttactaaaga tagatcatca agacattaaa ttagtgtatt cattctgtat 4740 tttgaacgat cctttaaata ctacagcatg caatgcatta ttattatctc aagttcaatc 4800 ttttatagaa aggacaagct tgtctgcagt gacaatgcct gaatttagaa acatgaaact 4860 gataaagcac tcacctgcac tagtattaag agcatatata cacaataata tgaatttacc 4920 tggtgcaaat gaggaagcta tgaagagaga tattttccac ttgtcagaat ttataaggga 4980 gaccaaaatg agagaaaagt tagatgctag aattgctgaa aatgaagaaa ttaaaggtga 5040 acgtgataga ttatttgaaa tcaaagaaat aacaaaattt taccaggcat gttacgatta 5100 tataaaatct actgagcata aagtaaaagt atttatattg ccagctaaag catacacagc 5160 ttttgacttc tgtgccacta ttcatgggaa tctaattaaa gacaaaggtt ggtttactgt 5220 ccattattta aaacagatca tatcaggcac tgcaaaagca accatcagta atactccaac 5280 aagtgaattg attatagtag atgagtgttt caagttgatg agtcattttt gtgatacttt 5340 tatagataca ggatctagat tgacattttt caacaatata ttgagtaatt ttacatataa 5400 aaatatacca gtcaaagatt tattaaatat aatgttaaca tctttcaaaa gacagcattt 5460 cttaccgata ttatactgga ttggggagct aaaccaaata gatatcgaca aatatgatgc 5520 actaaagagt ggtgagagaa tatcatggaa taactggcag gttaatagga cattaaatac 5580 tggccctata gatcttacta taaaaggtta tcagaggaca ataagaataa taggggaaaa 5640 tgagaagtta ggtgttgccg aactgcaagt tctaaaagat gacactacat cagtagaaag 5700 ccatgccagg aaattactta attctaaaca taatcttaag tttgaaaaaa tggagagagt 5760 agaaattatg gaacaaaata cttactatat atgttggcaa tttaaaacaa gattttctta 5820 tacttatcaa atgctcttat ccaatattat agaaactaga aatagccaaa caatttcagt 5880 tacaggtaat aagttcaatg aattagtacc tgtttgtcct gttataatta gtcgtaccaa 5940 ttcaactgaa caaattagta tgaaacaaat taaatattta aatatggaat gttctttatc 6000 tagattaaaa ttgactcaat acgaatttgc tactgttaaa agatcacatt tttctaaaat 6060 ggtattcttt catggaccta atctaatagt cggaaactta aacataacga gcttaataca 6120 gacacctagt ttactgacta ctaattatcc tgcactatcg caaatcccca tgatgacact 6180 aaccagatta ttttattgta taggcgatac agatcaaaca gatgaatttg aattcctttc 6240 tgacgaatta ttagaagaaa cagaaactgc agtgataaat tctataccat tgttcaatgc 6300 acaatatgaa gtaaaatcaa aaaaaggatt cacatataaa aaggcattac aagaagcatt 6360 aagaagagga attgaagaga tcgaagaagt ttttgatttt tgtaatgacg gatttttctc 6420 acctaaaaat atggccattg tagccttatt gacaaattta atagacagat taaaaacaaa 6480 tgaatggtca actatattac agtcagcatt acacatggca ttctttcaca ataataagga 6540 tgatatttac cattgcatga aaatcccaaa agcatttata aagaatccta ttggggaaat 6600 aatagattgg gagaagacta gaaaattcat aattcaactt gaaacaaggc ggcagggatc 6660 tcattgggat caaatgtttg aacattttaa gacaaaatgc ttgattttaa tagataagga 6720 gataaagatg gaaggcctct cttggggtga aatgttagat gaattagatg attataaaga 6780 tgttgaaatg tttggatttg gcttttaatc cgaattaaat gcaattagga taaaataata 6840 atcttaattg catttaataa gcagagataa ttgtatttca ttttatagat agaatacata 6900 actgccaaat gaatgataac tatttaacta acgcaagtat ataaacaagg gaacactact 6960 // ID MG029288; SV 1; linear; viral cRNA; STD; VRL; 4556 BP. XX AC MG029288; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Nepuyo virus isolate BeAn10709 segment M, complete sequence. XX KW . XX OS Nepuyo virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-4556 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-4556 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; 41184d77cd46e26db15c6a8e1b50b73b. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..4556 FT /organism="Nepuyo virus" FT /segment="M" FT /host="sentinel mouse" FT /isolate="BeAn10709" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="1959" FT /db_xref="taxon:348009" FT CDS 50..4351 FT /codon_start=1 FT /product="glycoprotein" FT /db_xref="GOA:A0A2Z3DG40" FT /db_xref="InterPro:IPR005167" FT /db_xref="InterPro:IPR005168" FT /db_xref="InterPro:IPR026400" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DG40" FT /protein_id="AVX48971.1" FT /translation="MIMHIVLAFISMLTAYAVPLSNRCFEGGVLVEEKFMDHGIAEICI FT KDDISMIKTSSLQKKNDSKFTNTIIRKMLIPNYQDCNPMETPNGPIMIFKPDPSLVLIP FT KTYACRVDCTISLDKDEATIILHSDKLNNFEVMGTTTATRWFQGSTTYSLEHTCEHIQV FT TCGSKSLSFHACFKYHMACIRLLNKSYMPGFMIQSVCQNKEIILMTCLVLIIFSLLYIL FT TMSYICYILLPIFIPVAYLYGWLYNKSCKKCSYCGLAYHPFTKCGKNCVCGSMFENSER FT MRMHRQAGLCRGYKSLRAARILCKSRGSAFCLAVILATLLLSFVQPLEAVKLSINNEII FT ELTELSNEIDLIFKGIDAMKQINIAQMVVILIIIVCLILYLMFNRHLEDKLLNRVVFYC FT PECEMTHSKKGLHKYFNGDFTNQCNSCMCGCVYSQEELNDGYTMPMTHKHKLTIGCYAP FT GRYYTTRKISISTTTIILAMSIILLSLSIAAASDDNCLKITTFQSQEPVSCSAWIKATT FT CSGSRGIEGLMARVRIPKPDTDLISGMKGTLETILLKSQESAVLMQSYILESLAVNLYC FT NELISAGLESGNVNTGIKTLYMANKLEVCFAKKAVKVCACLSGEQNCDPANADDIKDFY FT KTHTEIFKIDVARMVQTLSKMFPGVLAKELLSAASNSNYTRMQAILKLLDQKLTNAKAT FT KALVKILTTAISETTLSSVQAPIPIAKNFVPFDNRWKSTSIFKDITAATPIKTCQNGKA FT YKCFFPVSLRFTHYFSCTDANKFYHTGNDLVAVSYTNPSTLCVSDPYCEKDFTPVEVSQ FT KEVLMSMRCDQETITASNLPNSAPINKCRVVSMQHCTVGGIANKTVAECSNGYFYEYTG FT ELHQSSKDDVGVYCFERACKPNRFPHHPSNLKGCTSHNIEILNRKLKEINYSNLEQLKH FT SLQETIKTDLIEHHYVLTKNLPKLNPTFKAISIQGVETDSGVQSSYIETNIMVKTGISM FT GLHLTTKAGDPLFDIIVFVRAAHYEASYDEIYQTGPTVGINVQHDEKCTGKCPEKLSQT FT GWLSFYKEHTSQWGCEEFGCLAINEGCVFGHCKDIIKPEMTILRKAQEELPSIKICISL FT PQETFCQPINAFTAIITDKIETQFISNEAGRIPKLLAYKSNKIYTGMINDLGTYSKMCG FT SVQSVNNNVVGAGNARFDYICHAAQRKEITVTRCFDNFYDSCLHLEQSDNIIYDNNIKK FT VSLLNRKYGGLRLKIKLGDLNYKIFEKMPSFDFKGSCVGCLKCVKGVDCEFTIHSTSES FT VCVLNSNCNFYHNNLKIDPSVQKYGMKAKCNEEKIWVDLCGNKIEIQISIVQSHDTIEV FT GNSDQTYFVKEKDDRCGTWLCKVSEQGISSIFAPFFAVFGDYARIAFYTLLGVLGAALL FT IYLMLPMIGKLKDILKKNEIEYIKEFRGKKI" XX SQ Sequence 4556 BP; 1689 A; 687 C; 811 G; 1369 T; 0 other; agtagtgaac cgctgagtgt caattttata gtaaagatct atattcataa tgataatgca 60 catagtgctt gcatttattt caatgcttac tgcatatgca gtaccattgt caaatagatg 120 cttcgaaggt ggggttttag ttgaagaaaa gttcatggat catgggatag cagaaatctg 180 tattaaagat gatattagca tgataaagac ttcatcactg caaaaaaaga atgattcaaa 240 attcacaaac acaattatta ggaaaatgct aattccaaac tatcaagatt gtaatcctat 300 ggaaacacct aatggaccta taatgatatt taagccagat ccttctttgg tgcttatacc 360 gaagacttat gcttgtagag ttgattgtac tatctccttg gataaagatg aagcaactat 420 tatacttcac tcagataaat taaataactt tgaggtaatg ggaactacaa ctgcaaccag 480 atggtttcaa ggtagtacca cttattcatt agaacatact tgcgagcata tacaagttac 540 atgtggatca aagagcttaa gtttccatgc ctgctttaag tatcatatgg cttgtatcag 600 actcttaaat aaaagttata tgccagggtt tatgatacaa tcagtttgcc aaaataaaga 660 aataatactt atgacatgct tagttcttat tatctttagt ttgttataca tattaacaat 720 gtcctatatc tgctatatcc tactgcctat ctttatccct gttgcatatc tgtatggttg 780 gttatataat aaatcttgta aaaaatgtag ttattgtggt ctggcatacc acccattcac 840 aaaatgcggt aaaaactgtg tttgtggatc gatgtttgaa aattctgaaa gaatgagaat 900 gcatagacaa gctggattgt gtagaggtta taagtctctt agagctgcaa gaatactatg 960 caaaagtaga ggttcagcct tttgtttggc agtcattctg gcaacattac tactatcatt 1020 tgttcagcca ttggaagcag tgaagcttag cataaataat gaaatcattg aactaactga 1080 attatctaat gaaatagatt taatattcaa aggaatagat gccatgaagc aaataaacat 1140 agcccaaatg gttgtaatat tgataattat agtgtgttta atcttgtatc tcatgtttaa 1200 tagacatttg gaggataagc tattaaatag agttgtattt tattgtccgg aatgtgaaat 1260 gacccattca aaaaaagggc tgcataaata tttcaatggt gatttcacca atcaatgcaa 1320 tagttgcatg tgtggctgcg tatatagtca agaagaactt aatgatgggt atacaatgcc 1380 aatgacacat aagcataagc ttacaatagg ttgttatgct ccaggcaggt attatacaac 1440 taggaagata tctatatcaa ctacaactat aatacttgca atgtctataa tcttgctctc 1500 gctctctata gcagcagctt cagatgacaa ttgtttaaaa atcacaacat ttcaaagtca 1560 agaaccagta tcttgttcag catggataaa agcgacaaca tgttcagggt ctagagggat 1620 agaaggactt atggcaagag tcagaatacc taaaccagat acagacttaa taagtggcat 1680 gaaaggtaca ttagaaacta tactactaaa atctcaagaa agtgctgtct taatgcaatc 1740 ttatatatta gagtcattag cagtcaatct gtattgtaat gaattgatta gtgcaggatt 1800 agaatctgga aatgtcaata ctggaataaa aactttatac atggcaaata aattagaagt 1860 atgctttgca aaaaaagcag tcaaagtatg tgcatgttta tctggagaac aaaattgcga 1920 cccagcaaat gcggatgata taaaggattt ttataaaaca catactgaaa tatttaagat 1980 tgatgttgct agaatggttc aaacactttc aaagatgttc ccaggtgttt tggccaaaga 2040 attactctca gcagcatcta attcaaatta tacaagaatg caagctatcc taaaactttt 2100 agatcaaaaa ttaacaaatg ctaaggcaac taaagcatta gttaaaattt taaccacagc 2160 aatctcagaa acaactctta gttcggttca agctccaatt ccaattgcaa aaaattttgt 2220 cccatttgac aataggtgga agagcacaag catctttaag gatatcacag ctgcaacacc 2280 aatcaaaact tgccaaaatg ggaaagcata caaatgtttt ttcccagtta gtttaaggtt 2340 tacacattat ttctcatgca cagatgcaaa caagttttac cacactggta acgatcttgt 2400 tgcagttagt tacactaatc catcaacact ttgtgtttct gacccatact gtgaaaaaga 2460 ctttacacct gttgaagtaa gtcaaaaaga agtgttgatg tcaatgaggt gtgatcaaga 2520 gacaatcact gcaagtaatc tgccaaattc agccccaatt aataagtgta gagttgtttc 2580 tatgcagcat tgtacagttg gtggaatagc aaataagaca gttgcagaat gctctaatgg 2640 gtatttctat gagtacactg gagaattaca ccaaagttca aaggatgatg taggagttta 2700 ctgctttgaa agagcatgca agccaaatag atttccacat cacccttcga atttgaaagg 2760 atgtacttca cataatattg aaatattaaa tagaaaactt aaggagatta attattccaa 2820 cttggagcag ttaaaacata gcttacaaga gactataaaa acagacttaa ttgagcatca 2880 ttatgtcttg acaaagaatt taccaaagtt aaacccaact ttcaaagcaa tatcaataca 2940 aggagttgaa actgatagtg gagtgcaaag ctcttacatc gaaacaaata ttatggttaa 3000 aactggaata tcaatgggat tgcacctaac aacaaaggca ggagatccac tatttgatat 3060 aattgtattt gttagagcag ctcattatga agcatcttat gatgaaatat accaaacagg 3120 tcccactgta ggaataaatg ttcagcatga tgaaaagtgt acagggaaat gtccagaaaa 3180 actttctcaa acaggatggt tgtcttttta taaagaacac actagtcaat ggggttgtga 3240 agaattcgga tgcttagcca taaacgaggg ctgcgtgttt ggtcattgca aagatattat 3300 taaaccggaa atgacaatcc taaggaaggc tcaagaggaa ctaccatcaa ttaagatatg 3360 catttcacta ccacaagaaa cattttgtca gcctatcaat gcattcactg ctataattac 3420 tgacaaaata gaaacacaat ttatatcaaa tgaagctggt aggatcccaa aactgcttgc 3480 ttataaatct aataaaatct acactggaat gattaatgac ttaggaacat attctaagat 3540 gtgtggcagt gttcaatctg tgaataacaa tgttgtagga gctggtaatg cgagatttga 3600 ttatatttgt catgctgccc aaaggaaaga aataactgta acaagatgtt ttgacaattt 3660 ttatgattca tgcttacatt tagaacaatc agataatata atttatgata acaatataaa 3720 gaaagtttca ttattaaata ggaaatatgg gggactaagg ttgaaaataa aattaggaga 3780 tcttaattac aaaatttttg aaaaaatgcc ttcctttgat tttaaagggt catgtgtggg 3840 atgtttgaaa tgtgtcaaag gagtagattg tgaatttaca attcattcca catctgaatc 3900 agtttgtgtt ttaaattcta actgcaattt ctaccataat aatctaaaga ttgatccatc 3960 tgtacagaaa tatggtatga aagcaaaatg caatgaagaa aaaatatggg ttgatttgtg 4020 cggcaacaag attgagatac aaatatctat agttcaatct catgacacaa tagaagtagg 4080 aaatagtgac caaacctatt ttgtcaaaga aaaagatgac aggtgtggca cttggctttg 4140 taaggtgagt gagcaaggga tatcttcgat atttgcacca ttttttgcag tttttggtga 4200 ctatgcaaga attgcttttt acactctgct aggagtctta ggagctgcat tgttaatata 4260 tcttatgttg ccaatgatag ggaaattaaa ggatatactt aagaaaaatg aaattgaata 4320 tattaaagaa tttagaggga agaaaattta agtaattcag aaaaaccaat tctagggtat 4380 atataaaagc aaaaataaaa gcaaaaaaat aaaaatataa aataaaaaat aaacaaaaaa 4440 caaaaacaac acatatcagc agtttaaata tctacttgag ataaatatct acttgagata 4500 aatagattac tataacacta caaaatttaa aatacaagac acagcggttc actact 4556 // ID MG029289; SV 1; linear; viral cRNA; STD; VRL; 1003 BP. XX AC MG029289; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Nepuyo virus isolate BeAn10709 segment S, complete sequence. XX KW . XX OS Nepuyo virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-1003 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-1003 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; d869659a715d6e3d60322e9aedd8dbc7. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1003 FT /organism="Nepuyo virus" FT /segment="S" FT /host="sentinel mouse" FT /isolate="BeAn10709" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="1959" FT /db_xref="taxon:348009" FT CDS 29..346 FT /codon_start=1 FT /product="NSs protein" FT /db_xref="GOA:A0A2Z3DH59" FT /db_xref="InterPro:IPR000797" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DH59" FT /protein_id="AVX48973.1" FT /translation="MSITVLSEVNTETCQLLCLSFQWRSGDRVHLLLTLNRRTRVLSIT FT TGMNSHWRTLESSCCVRMRLNRSFVRVRQRSLILSLALGRSLLLITITRQTHQIQSLMV FT N" FT CDS 67..774 FT /codon_start=1 FT /product="N protein" FT /db_xref="GOA:A0A2Z3DIU2" FT /db_xref="InterPro:IPR001784" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DIU2" FT /protein_id="AVX48972.1" FT /translation="MSTPLFEFSVEERGQSTSTFDPKQAYQSFIDNHRDELTLENIRVF FT LLRANEAKQKLRKSTAKVANLKFGTWKVLVVNNHYPANSSNTVADGELTLHRISGFIAK FT YLLELYADPENRPGIEETIVNPIAESRGVSWNASAKVYLSFLPGTEMFLHEFEMLPLAI FT YIYRAQKGEIDASLLKKPLRQQYKNDTPDKWMKERKVMIQSAVARVSKLAWGSAGLSAQ FT AKEFLKEFGISMK" XX SQ Sequence 1003 BP; 328 A; 179 C; 229 G; 267 T; 0 other; agtagtgaac ttcttagaga gttcatttat gtcaattaca gtgctttcag aggttaatac 60 agagacatgt caactccttt gtttgagttt tcagtggagg agcggggaca gagtacatct 120 acttttgacc ctaaacaggc gtaccagagt tttatcgata accacaggga tgaactcaca 180 ttggagaaca ttagagtctt cctgctgcgt gcgaatgagg ctaaacagaa gcttcgtaag 240 agtacggcaa aggtcgctaa tcttaagttt ggcacttgga aggtccttgt tgttaataac 300 cattacccgg caaactcatc aaatacagtc gctgatggtg aactaacact gcacaggatt 360 tctggattca tcgccaaata ccttctggag ctctatgcag atcctgaaaa cagacctgga 420 attgaagaaa caattgtcaa cccgatcgca gaatcccgtg gtgtatcctg gaacgcatca 480 gccaaagtgt atctctcatt cttgcctgga acagaaatgt tcttgcatga atttgagatg 540 cttcctttag ctatttacat ttacagagca caaaaaggag aaattgatgc ctcattacta 600 aaaaagccac tgagacagca atacaaaaat gacacaccag ataaatggat gaaagagagg 660 aaagtcatga ttcaaagtgc agtggctaga gtttctaaat tagcatgggg atctgcagga 720 ctatctgctc aggccaagga gttcctaaaa gaatttggga tttccatgaa ataagcagat 780 atccaaatct gtaaatgtgt atatagtaat aaatctaaat cagggttagg gagggaataa 840 ttagggtaat agggttaaat taggttaata agggtggtgg tgggaaacaa aacagcatgg 900 gttgggtggt tggggtacac attaaattca gccaggtaat ggcaattctt gcaattctac 960 agctgacatt gaaagtatca acatttctaa gaagaacact act 1003 // ID MG029290; SV 1; linear; viral cRNA; STD; VRL; 6979 BP. XX AC MG029290; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Oriboca virus isolate BeAn17 segment L, complete sequence. XX KW . XX OS Oriboca virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-6979 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-6979 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; edadb46bef208b254b983ccc4f47589a. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..6979 FT /organism="Oriboca virus" FT /segment="L" FT /host="Sapajus apella" FT /isolate="BeAn17" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="1954" FT /db_xref="taxon:192199" FT CDS 55..6801 FT /codon_start=1 FT /product="RNA-dependent RNA polymerase" FT /db_xref="GOA:W8D0A5" FT /db_xref="InterPro:IPR007099" FT /db_xref="InterPro:IPR007322" FT /db_xref="InterPro:IPR029124" FT /db_xref="UniProtKB/TrEMBL:W8D0A5" FT /protein_id="AVX48974.1" FT /translation="MAILLGDVIRQYTARIRACASPEIGRDILAEITMTRHNYFAQQFC FT EAINIEYRNDVPAADIILEMNPALDLRTIKIPNVTPDNYYRDGAKIYIIDFKVSVSDES FT AMHTYKKYDTLLGDVFNQLGIEYEVVIIRMNPSDMHLHISSDNFSNLFPNIVLNLDFTW FT YFRLRDELFQRYRDNEEFMELVAHGEFTPTIPWLVEDTPELYTHPVFLEFIGSMPDGTI FT DDFYYALNHNAFQSDKWNDLLHIMMKKYGEYYTHFVKEQARNIFITDENYNKPSKDEIK FT KGWAEMVERIKNQRDVTDDCSKQKPSIHFIWSPNDANASNENNTKLIKLAKKLQSIKES FT DTFSLAFKNIGYLMDFSEDVDKYENFCLKLKAEARSNIKPKSNKIIPITIGKCTVLWEQ FT QFKFDTEVIPKEVRIKFLKEFCGIGNHKQFKDRMLDDLDLNKPKILNFENPEIKDQAYV FT MMRNTQCFMSKESGLQKVGNILEEFEYKINDANPKTWETILEIAKSRYWQGINDFSVLI FT KNILSVSQYNKHNTFRVVCTANNNFFGILYPSASIKSKKSTVVFSTVSLHESENDVLKC FT GALYRTYKVKGGYLSVSKAIRLDKERCQRLVTSPGIFLLTSLLFKSDNDVSLNDVMNFA FT FFTSLSITKSMLSLTEPSRYMIMNSLALSSHVREYIAEKFSPYTKTLFSVYMTDMIKRG FT CMSANDQRQLISIRDVFLNEFEITQKGVSSEKNLQSIWFPGKVSLKEYINQIYMPFYFN FT AKGLHNKHHVMIDLAKTVLEIELDQRMNVPEPWSFDMKKQSANLYLLIFSVAKMLNMDT FT SRHNHLRSRVENRNNFKRSLTSISTFTSSKSCIKVGDFRSLKEKTAKHIKKINEKDAKK FT TRIANTEFVEESDRDFEVTKSTYLDLVKCVPRYTDYISTKVFDRLYEKYKLEEIEDKPA FT IEVIMDTMKSHKDFKFCFFNKGQKTAKDREIFVGEFEAKLCLYGVERIAKERCKLNPEE FT MISEPGDGKLKKLEINAESEIRYLIDATRSKNAEQSIIDDILETPKGIKLEINADMSKW FT SAQDVFFKYFWLIVLDPILYPAEKQRILYFFCNYMNKELILPDEMMCSLLDQKAEREND FT LIREMTNGFRKNTVNIRRNWLQGNLNYTSSYIHSCSMMVFKDIMKEVASLLEGRCNVCS FT MVHSDDNQTSVIMVQDKLNNDIIVNFVCSTFERCCLSFGNQANMKKTYITNHIKEFVSL FT FNIYGEPFSVFGRFLLPAVGDCAYIGPYEDMASRLSATQTAIKHGCPPSLAWVSIALNH FT WITFNTYNMLPGQINDPTKIFLFDRRELPIELCGILQADLATIALVGLEAGNISFLTNL FT LKKMSPPQLVKESVQTQCNNIEQWDLDCLSDSEILKLKLLRYVVLDSEITEDSKMGETS FT EMRSRSLITPRKFTTTSSLEKLISYKDFQEIIVNTERTEELLESILAQPELLVTKGENS FT REFMTTILFRYNSKKFKESLSIQSPTQLFIEQILFANKPVIDYTGIQDRYLSVLDMPRV FT QNSDGIIGRKTIPETFAAVKKDLNQMPLDQTDIKLIYSFCILNDPLNTTACNALLLSQV FT QSLLERTSMSAVTMPEFRNMKLIRYSPALVLRAYIHNNLSVGGANEDAMRRDLYHLNEF FT IVQTRIKERLDQRIAENQEIKGERDRLFEIKEITKFYQACYDYIKSTEHRIKVFILPSK FT AYTAFDFCATIHGNLIRDDGWYSVHYLKQIVSGTAKANVSLAPASEMVIVEECFKLLSH FT FCDTFIDVNSRLSFVINVIENFSYKNMPVKELLNLMKHSFKRQQFIPLLYWLGELQQDD FT LDKYDAFKTSERVSWNDWQINRTLNTGTIDLTIKGYQRTLRIMGEDDFLQIAELEILKG FT DNTSIETHGRKLLNCKHNLRFEKMRKYPIMEPNTYYICWQMRSRFAVTYQMLLSNIIEA FT RNSQTVSVTGGKFNELIPVCPVIIGRIESIERINMRQVKYLNMECSLSRLQLNQKEFVT FT TKRSHFSKMVFFQGPNLIVGNLNLTNLIKTPTLLTTNYPSLSQVPMMTLTRIFHCIGDE FT DQTDEFEFLSDEILEDIETTAVNTVPIFNAQYEVKSRKGYTYKKALQDALRRGIEEVED FT TLDFCGDGFYSPKNLAIIALLTSLIDRLQTNEWSTILQTAIHMSFFHNGKDRMYHLMKV FT PKAFIKNPIGEVLNWEKIRTFIIQLSTRHPGNHWDQMFNHFKEKILILIDREIKMEGMS FT WGEMLDELDDYKDTEMFHFE" XX SQ Sequence 6979 BP; 2596 A; 1078 C; 1299 G; 2006 T; 0 other; agtagtgtac ccttgtttac ttattacaaa tagttcacga ccaggaaacg cagtatggca 60 atattacttg gcgatgtcat tagacagtat actgccagaa tacgtgcttg tgccagtcct 120 gaaattggca gagacatttt ggcagaaata acaatgacac gacataatta ttttgctcaa 180 caattctgtg aggcaatcaa tattgaatat aggaatgacg ttccagctgc tgacattata 240 ctagagatga atccagcact ggatttaaga accattaaga tcccaaatgt gacgcctgat 300 aattattata gagatggagc aaagatttac ataatagact tcaaagtctc tgtaagtgat 360 gaatcagcaa tgcatacata caagaaatat gacactctgc ttggtgatgt cttcaatcaa 420 ttggggatag aatatgaggt agttataatc agaatgaatc caagtgatat gcatcttcat 480 atctcaagtg ataatttcag caatctattt ccgaatatag ttctgaattt agacttcaca 540 tggtatttcc gcttgcgaga tgaattattt cagagatatc gagataacga ggaatttatg 600 gagcttgttg cgcatggtga attcacacct accatacctt ggttggtgga agatacacct 660 gaattgtaca ctcatcctgt tttcctcgaa tttataggct ctatgccaga tgggactata 720 gatgattttt attatgcatt aaatcataat gcgttccaat cagataaatg gaatgattta 780 ttacatatta tgatgaaaaa atatggcgaa tattatactc actttgtcaa ggagcaggca 840 cgtaatatct tcattacaga tgaaaactat aataaaccat ccaaggatga aataaagaaa 900 ggatgggctg agatggttga aagaattaaa aatcaacgtg atgtgaccga tgattgctct 960 aaacaaaaac caagtataca ttttatatgg tcgccaaatg atgctaatgc cagcaatgag 1020 aacaatacaa aattaatcaa gttggcaaag aaactccaat caattaagga atcagataca 1080 ttcagtttgg cattcaaaaa tattggttac ttaatggatt tcagtgaaga tgtagacaaa 1140 tatgaaaact tttgtctaaa gctcaaggca gaagccagat caaatatcaa gccaaaaagt 1200 aataaaatta ttccaatcac aataggaaaa tgcacagtat tgtgggaaca gcaatttaaa 1260 tttgacacag aagtgatccc aaaggaagtt agaataaaat tcctgaagga attttgtgga 1320 attggaaatc acaagcagtt caaagataga atgcttgatg atttagattt gaacaagcca 1380 aagatactaa atttcgaaaa ccctgaaatt aaagatcaag cgtatgtaat gatgaggaac 1440 actcagtgct ttatgagcaa agagagtggc ttgcaaaaag tgggaaatat cttagaggag 1500 tttgaatata agattaatga tgcaaaccct aagacatggg agactattct agagatagct 1560 aaatcgagat actggcaagg gataaatgat ttttcggttt tgataaagaa catattatct 1620 gtctctcaat acaataaaca taacacattc agagttgtat gtacagcaaa taataatttt 1680 tttggtatat tgtatccatc tgctagcatc aaatccaaga agtcaactgt tgttttctca 1740 actgtcagct tacatgagag tgagaatgat gttttaaagt gtggagcttt atacagaaca 1800 tacaaagtaa aaggaggata cttatctgtc tccaaggcaa tacgtttaga taaagagaga 1860 tgtcaaagat tagtgacatc accagggata ttcctcttga cttctcttct atttaagagt 1920 gataatgatg ttagcttgaa tgatgttatg aactttgcat tctttacatc actgtctata 1980 actaagagta tgctctcact cacagagcct tcaaggtaca tgataatgaa ttcccttgcg 2040 ctctcaagcc atgtgagaga gtacattgca gagaagtttt caccttatac taaaacatta 2100 ttttcggtat acatgacaga catgataaaa cgtggatgca tgtctgcaaa tgatcagaga 2160 caacttatat ctatcagaga tgtattctta aatgagtttg aaattactca aaagggtgtt 2220 tccagtgaaa agaacttaca atctatttgg ttccctggta aagtcagttt gaaagagtat 2280 atcaatcaaa tttacatgcc tttctatttc aatgccaaag gtcttcataa taaacatcac 2340 gttatgatag atctagcaaa gacagtatta gaaatagagt tggatcaaag aatgaatgtc 2400 ccagaaccat ggagctttga tatgaaaaaa caatcagcaa atttatatct actcatattt 2460 tcagtagcaa agatgctaaa tatggacaca tcaaggcata accatcttag gagtagagta 2520 gagaatagga acaatttcaa aagatcatta accagcatct caacatttac cagttcaaaa 2580 tcttgtatta aagttggaga ttttaggagc ctcaaggaga agacagcaaa gcacatcaag 2640 aagattaatg agaaagatgc aaagaagact aggattgcaa acacagaatt tgttgaagaa 2700 tctgacagag actttgaagt aacaaagagc acatatcttg atttggtgaa gtgtgttcct 2760 agatatactg actacatttc tacaaaagtc tttgatagat tatatgaaaa atataaacta 2820 gaagaaatcg aagataagcc agcaattgaa gtaataatgg acaccatgaa atcccacaaa 2880 gattttaaat tttgtttttt taacaaaggc caaaagacag caaaagatag agaaatattt 2940 gttggtgaat ttgaagcaaa gctatgcttg tatggggttg agaggatagc aaaagagaga 3000 tgtaaattaa atcctgaaga aatgatatct gaacccggag atggcaaact gaagaaactt 3060 gagataaatg cagaatctga aataagatac ttaattgatg caacaagaag taagaatgct 3120 gagcaatcta tcatagacga catattagaa acacccaaag gcataaaatt agagattaac 3180 gcagatatgt caaagtggag tgcccaagat gtcttcttca agtatttctg gttgattgtt 3240 ctagacccaa ttctataccc tgcagaaaag caacgaatac tatacttctt ctgcaattac 3300 atgaataaag aattaatact gcctgatgag atgatgtgct cattattaga tcaaaaagcc 3360 gagagagaaa acgatctcat aagagaaatg acaaatgggt ttcgaaagaa cacagtgaat 3420 ataaggcgga attggttgca aggtaatttg aattacacat caagttacat tcatagctgt 3480 tccatgatgg tttttaaaga tatcatgaaa gaagttgcgt cacttttaga aggaagatgt 3540 aacgtttgta gcatggttca ctcagatgac aatcaaacct cagttataat ggttcaagat 3600 aaattgaata acgatataat cgtaaatttt gtatgttcaa cttttgagag atgctgtttg 3660 tcttttggaa atcaagcaaa tatgaagaaa acgtatataa caaaccacat aaaagaattt 3720 gtcagtttgt tcaatattta tggggaacca ttctctgtat ttgggagatt cttactacca 3780 gcagttggtg actgtgcata cattggacca tatgaggata tggcaagtag attgtcagca 3840 actcaaacag ctattaaaca tggatgtcct cctagcctgg catgggtgag cattgcattg 3900 aaccattgga ttacattcaa tacttataat atgttgcctg ggcagataaa cgatcctaca 3960 aaaatattct tatttgatag aagagagcta cctatagaac tatgtggtat attgcaagct 4020 gatttagcaa caattgcatt ggttggatta gaagctggca acatctcgtt cttaacaaat 4080 ttgttaaaga aaatgtcacc tccacagtta gtaaaagaat cagtacaaac gcaatgcaat 4140 aatatagagc aatgggatct agactgccta tcagatagtg aaatactgaa attgaaactg 4200 ttaagatatg ttgttttaga ttcagagatt acagaggaca gcaaaatggg agaaacaagt 4260 gagatgagga gtaggtccct tatcactccc aggaaattca ctacaacctc atctctggaa 4320 aaacttattt cgtataaaga ttttcaagag attattgtaa atacagaaag gacagaagag 4380 ctcttagagt ctatattggc acaacctgaa ctcttagtca caaaaggaga aaattctaga 4440 gagtttatga caactatcct ctttaggtat aactctaaaa aattcaaaga atctctatca 4500 atacagagcc cgactcaact attcatagag cagatattat ttgctaataa gccagtaata 4560 gattacactg gtatacaaga tagatattta agtgtgttgg atatgcctag agtacaaaat 4620 agtgatggga ttatcggaag gaaaacaata cctgagacat ttgctgccgt taaaaaggac 4680 ttaaatcaaa tgcctttaga tcaaacagat attaagttga tctattcatt ttgcattctg 4740 aatgatccac tcaacacaac agcatgtaat gccctcttac tatcccaagt acaatcctta 4800 ttagagagaa caagcatgtc tgctgtaact atgccagaat ttagaaatat gaagttaatt 4860 agatattcac ctgcattggt tttaagagca tatatccaca ataatctgtc tgtgggaggc 4920 gcaaatgaag atgctatgag gagagacctg taccatttaa atgaatttat tgtgcaaacc 4980 aggattaaag aaagattgga tcaaagaatc gcagaaaatc aagagataaa aggggaaaga 5040 gataggttat ttgagataaa ggaaattact aaattctacc aggcatgcta tgattatatt 5100 aaatcaacag agcatagaat aaaagtattc attctaccat ctaaagcata tactgcattt 5160 gatttctgtg ctactattca tggtaatcta attcgagatg atggatggta ttccgtccat 5220 tatctaaagc aaatagtttc tgggacagca aaggccaatg taagtctagc ccctgcgagt 5280 gagatggtca ttgttgaaga atgctttaaa ctactatctc atttctgtga tacatttatc 5340 gatgtgaatt caagattatc ttttgtaata aatgtgattg agaatttctc ctataagaat 5400 atgcctgtaa aagagttact gaatttaatg aaacactcct tcaaacgcca acaattcata 5460 cccttactat attggcttgg ggaattacag caagatgatt tggacaaata tgatgcattt 5520 aaaacaagcg aaagggtatc ttggaatgat tggcaaataa atagaacatt aaatacaggt 5580 acaatcgatt taacaattaa aggctatcaa agaactttaa gaatcatggg agaagatgat 5640 ttccttcaaa tagcagaatt agagattttg aaaggtgaca acacatcaat agaaactcat 5700 ggtaggaagt tactgaactg taaacacaat cttaggtttg aaaagatgag gaaataccct 5760 ataatggaac ctaatacata ttacatatgc tggcaaatga ggtctagatt tgcggtaaca 5820 taccagatgt tgctgtctaa tattattgaa gctcgcaatt ctcagacagt atcagtgaca 5880 ggagggaaat ttaatgaact aatacctgtc tgccctgtta tcattggtag aatcgaatca 5940 attgaaagga taaacatgag acaagtaaaa tacttaaata tggaatgctc tttatccaga 6000 ttacaactca atcaaaaaga atttgttacc acaaaaaggt ctcatttctc aaaaatggtc 6060 ttcttccagg gtcctaactt aatagtagga aatttgaatt taacaaattt aataaagaca 6120 ccaacattgc taactaccaa ctacccctct ttgtctcaag tcccaatgat gacattaacc 6180 agaatattcc attgcatagg agatgaagat caaacagatg aatttgaatt tttatcggat 6240 gaaatattag aggatataga aacaactgct gttaacacag tccctatatt taatgctcaa 6300 tatgaagtca aatctaggaa aggttacaca tataaaaagg ctttacaaga tgcattgagg 6360 agaggtatag aggaggttga agatacatta gatttttgtg gagatggatt ctattcccca 6420 aaaaatcttg caattatagc attgctgaca agtcttatag accgattaca aacaaacgaa 6480 tggtccacca ttttgcaaac agcaatacat atgtccttct tccataatgg gaaagatagg 6540 atgtatcact taatgaaagt tccaaaagca tttattaaga atccaattgg agaagtttta 6600 aactgggaga aaatcaggac atttataatc cagttaagca cgagacaccc tgggaatcat 6660 tgggatcaga tgttcaacca tttcaaggaa aagatactga tcctaattga tagagaaatt 6720 aaaatggaag gcatgtcctg gggagaaatg ctagatgaat tggatgatta taaagacact 6780 gaaatgtttc atttcgaata gggacatcta gaacttccat atttaatctt ttaaattaaa 6840 gacattgtct ttaatttaaa agattaaaat tgatctttaa aattaaagac atagtcttta 6900 attttaaaca tcaaaaaatc aacagagaca ctaatataaa ctcacacaga cgtaataata 6960 taatcaaggg aacactact 6979 // ID MG029291; SV 1; linear; viral cRNA; STD; VRL; 4570 BP. XX AC MG029291; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Oriboca virus isolate BeAn17 segment M, complete sequence. XX KW . XX OS Oriboca virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-4570 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-4570 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; df67704834c37cbc185b61d582cd865c. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..4570 FT /organism="Oriboca virus" FT /segment="M" FT /host="Sapajus apella" FT /isolate="BeAn17" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="1954" FT /db_xref="taxon:192199" FT CDS 47..4336 FT /codon_start=1 FT /product="glycoprotein" FT /db_xref="GOA:W8CZV0" FT /db_xref="InterPro:IPR005167" FT /db_xref="InterPro:IPR005168" FT /db_xref="InterPro:IPR026400" FT /db_xref="UniProtKB/TrEMBL:W8CZV0" FT /protein_id="AVX48975.1" FT /translation="MLPLIVITLLNAVGAIPLNSKCFEGGVIVEEKTMTHGIAELCIKD FT DISMLKTTSTQIQNTTKFSNRIMRKMLIQNYQDCNPVETANGPIMIFQPNKELILTPKT FT YACRMDCSISLDKEEATIILHSDKLNHYEVMGTTTATRWFQGSTSYSLEHTCEHVQVTC FT GSKTLNFHACFKYHMACIRLLNKSYMPAFVIQSVCRNKEIILMTCLILIIFGLLYVLTL FT SYICYILLPLFIPIAYFYGWIYNRSCKKCQYCGLAFHPFSKCGKNCVCGSMFENSERMK FT MHRTAGLCKGYKSLRAARILCKSKGSAFVLAVLLATLLLSFIQPLEAVKLNYNGTAIEL FT PELSHELDLIFQNMETKTVILIVQISFLVATLLSLLIMYVFSKKIEDVLISRYLYYCKE FT CEMTHPKSGLKYYFNGEFTNMCNSCMCGCVYDQTELNSQDGYMVPMTHRLTIGCYAPAR FT YYTLRKMTNISYNLVVFLLVIFISLSIAAAETCSKDNYYKVSEPVSCSVWLKSSECTQS FT STLDALITRVKLPQADKDQIKSIKTSFLELMIRSQESTSPIGSYVMEDLALTLYCKDIV FT KMNQESGEYNHQMRVLFTGKELEICLNGKIAKACNCIVGRQNCDYSSLDEVTAFYGSHK FT EMFKTDTARMIQALSKIFPGILAKELLIASRASNYSRVVVILKLLEPKLVNAKSIIAIV FT KILEKALSDSTISAIKMPESSVKEVKPFDAEWGKTSIFDNMQTSTDKKTCTNAKLYKCF FT YPVSLKFTFFVKCEEENKFYTSGEYLVATHYSTPTSFCVADPYCELDFKGISVDRKEEL FT QTYRCIQETLNQQENDNSRPINKCKVVSTQSCTVGNTANRSVAECDNGYFYEYVGEIHQ FT SPKDNVGVYCFEKGCKQNRFPHHLDNVKGCTSHNLAMISRKLKEINYSNLEQLKHSMQE FT AIKTDLVEHNYIPTKNLPKITPSFKALSIHGVETEQGIESSYIETNLLVRTGVSSGLHL FT TTKNGEPLFDLILFVKTAHYEATYEEIYKTGPTVGINMQHNEKCTGSCPQNMSKIGWLS FT FSKEHTSQWGCEEFGCLAVNEGCVYGHCRDLIKPDITVIRKAQEEEPVIKICFTLPQES FT LCQNINSFTPIVTEKFEVQFLSNEAGKIPKLLAYKSHKIFAGMINDLGTFSKMCGSVQQ FT TPRGVLGAGIPRFDYLCHAAKRKDITISRCFDNFYESCSLLETRDDIVYDSSTNRVSLL FT NKNMGELRVKIKLGDINYKLFEKTPSFDLKGSCVGCINCIRGVDCELEIIASGESVCIL FT SSNCAFYHNNLRIDPNVQKYGLKAKCTQEHIWIELCGNKIEVQISITKTSETIEVGNSD FT QTYFVKEHDIRCGTWLCKVSEQGISSIFAPFFAVFGTYGKIAFYSVLGVLLAALIIYLS FT LPIIGKIKDTLKKNEYEYLKETIGKRR" XX SQ Sequence 4570 BP; 1648 A; 747 C; 880 G; 1295 T; 0 other; agtagtgaac cgctgtgtgt ttatactata gttgttcact gacaagatgt tgcctttgat 60 cgtaataacg ttgttgaacg ctgttggtgc tataccatta aacagcaaat gctttgaagg 120 aggtgtcata gttgaggaaa agacaatgac acatgggata gcagaattat gcatcaaaga 180 tgacataagc atgctaaaaa caacatctac ccaaattcaa aatacaacaa aattcagtaa 240 tcgaatcatg cgcaagatgt tgatacagaa ttaccaagac tgtaatcctg tagaaactgc 300 taatggtccg attatgattt tccaaccaaa taaagagttg atcttaactc ccaaaacata 360 tgcttgtaga atggattgtt caatctcttt ggataaagaa gaagcaacaa taattctgca 420 ttcagataaa ctgaaccatt atgaagtcat gggaacaaca actgcaacaa ggtggtttca 480 agggagtaca agctattctc ttgaacatac atgtgagcat gttcaagtga cttgtggatc 540 gaaaacatta aactttcatg catgtttcaa atatcatatg gcatgcatcc ggcttttgaa 600 caagagctat atgcctgcat ttgtgattca atctgtttgc agaaataaag aaataatact 660 tatgacatgc ttaatattga tcatatttgg attattgtat gtcttgacat tgtcatacat 720 ttgctacatc ctattgccac tattcatacc tattgcatac ttttatggtt ggatctacaa 780 tagatcgtgt aaaaaatgcc agtattgtgg gttagcattc catccattta gtaaatgtgg 840 gaagaactgt gtctgtggct caatgtttga aaattcagaa cggatgaaaa tgcacagaac 900 tgcagggctt tgcaaaggtt ataaatcttt gagagcggct agaattttat gcaaaagcaa 960 ggggtctgct tttgttctag cagtattgtt ggctacattg ctactctcat tcatacagcc 1020 cttggaggca gtaaaattaa actacaatgg gacagcaatt gaattgccag aactctcaca 1080 cgaacttgat ttaatctttc agaatatgga aacaaaaaca gttatcctta tagtacagat 1140 ttcttttcta gtagccacat tgctatcact tttgattatg tatgtattct caaagaagat 1200 tgaagatgtc ctgatatcta gatacttgta ctactgtaaa gaatgcgaga tgacacatcc 1260 taagagcggc ttaaaatatt acttcaatgg agaattcaca aatatgtgta actcatgcat 1320 gtgtggatgt gtctatgatc aaacagaact taattctcaa gatggatata tggttccaat 1380 gacacacaga ttaacaattg gatgttatgc ccctgcaaga tattatacat tacggaaaat 1440 gacaaacata agctacaatc tagttgtgtt tctgttggta atatttatat cactttctat 1500 tgcagcagct gaaacatgct ccaaagacaa ctattataaa gtttcagaac cagtctcatg 1560 ctcagtttgg ttaaaatcaa gtgaatgtac tcaatctagc actttagatg ctttgataac 1620 tagagtcaaa ttaccccagg cagataagga tcaaattaaa tctattaaaa catctttcct 1680 ggagctaatg atcagatctc aggaatcgac aagcccaatt ggatcttatg taatggaaga 1740 tcttgcctta acattatatt gtaaagacat tgttaagatg aatcaagaaa gtggggaata 1800 taatcatcag atgagagtac tattcactgg taaagaatta gagatttgcc tgaatgggaa 1860 aatagccaaa gcatgcaatt gtatagttgg aaggcaaaac tgtgattact catctttaga 1920 tgaagttaca gcattttatg gatcacacaa agaaatgttt aaaacagata cagctagaat 1980 gattcaggca ttatccaaaa tattccctgg catactcgca aaggaactgt tgatagcatc 2040 aagagcatca aattactcta gagtggttgt catcctgaag cttttagaac ccaaattggt 2100 gaatgctaaa tcgataattg ctattgtgaa aattctagaa aaggcactat cagacagcac 2160 gatcagtgca ataaaaatgc cagaatcctc agttaaagaa gttaaaccat ttgatgctga 2220 gtggggtaaa acaagtatct ttgacaatat gcagacatct acagacaaaa agacatgtac 2280 aaatgcaaag ttgtataaat gtttttaccc ggtaagtttg aaatttacat ttttcgtgaa 2340 atgtgaagaa gagaataaat tttacacttc tggagaatat ttagttgcaa cacattactc 2400 aacgccaacc agtttctgtg ttgctgatcc atattgtgag ctagatttta aaggaatcag 2460 tgtagataga aaagaagaac tgcagactta cagatgcatc caagagacat tgaaccaaca 2520 ggagaacgat aattcaagac ccattaataa atgtaaagtg gtatctacac aatcctgcac 2580 tgttggcaac actgcaaaca gaagtgtagc agaatgtgac aatggatact tttatgagta 2640 tgttggagag atccatcaaa gcccaaaaga taatgttggt gtttactgct ttgaaaaagg 2700 ctgcaaacaa aataggttcc cacatcactt ggataatgta aaaggctgca catcacataa 2760 tctagcaatg atcagtagaa aactaaaaga gatcaattat tctaacttgg aacagttgaa 2820 gcatagtatg caagaggcaa taaaaacaga cctagtggag cacaattata tccccacaaa 2880 gaacttacct aagataacac cttcctttaa ggctttatcc atacatggtg ttgaaacaga 2940 gcagggcata gaaagttctt acatagagac aaatctacta gtaagaacag gagtttcgtc 3000 aggtctacat ttgacgacaa aaaatggcga accattattt gacttgatac tctttgtcaa 3060 aacagcacac tatgaagcta catatgaaga aatttataaa acagggccaa ctgtaggaat 3120 aaatatgcaa cacaatgaaa agtgcacagg gagttgtccg caaaacatgt ccaagatagg 3180 ctggttatcg ttctctaaag agcatactag ccaatggggt tgcgaagaat ttggatgtct 3240 tgcagtcaat gagggttgtg tttatggtca ttgcagagat ttaatcaaac cagacataac 3300 agttataaga aaggcacaag aagaagaacc tgtcattaaa atatgtttta cattacccca 3360 agaatcatta tgccaaaata ttaattcatt tacgccgata gtgacagaga aatttgaagt 3420 ccaatttttg tctaatgaag ctgggaagat tccaaagctt ttagcatata aatcgcataa 3480 gatatttgct ggaatgataa atgatcttgg gactttctca aaaatgtgtg gaagcgtgca 3540 acagacaccc cgtggagtcc taggtgcagg cattcctcga tttgactact tatgccatgc 3600 tgccaagagg aaggatataa ctataagcag gtgttttgac aatttctatg aatcgtgctc 3660 actgttagaa acaagagatg acatagttta cgattcatcg actaatagag tgtcactgtt 3720 aaataaaaac atgggtgaac taagagttaa aataaaattg ggagatatca attataagct 3780 atttgaaaaa accccatcat ttgacttgaa aggatcctgt gttggctgca ttaattgtat 3840 tcggggagtg gattgtgaat tggagatcat tgctagtgga gagtcagtat gtatattgag 3900 ctctaattgt gctttttatc acaacaatct ccgcatagat ccgaatgttc aaaaatatgg 3960 actgaaagca aaatgtacac aggaacatat ctggatagag ctctgtggga ataaaataga 4020 ggttcaaatt agcatcacaa aaacatcaga aactattgaa gtggggaaca gcgatcagac 4080 ttactttgtt aaggagcatg acattcgatg tgggacatgg ctttgcaaag tgagtgagca 4140 agggataagt tccatttttg caccattttt tgcagtgttt gggacatatg ggaaaatagc 4200 attctattct gtcttgggag tattgcttgc agcactaatc atatatctat ctctccccat 4260 aattgggaag ataaaagata cattgaagaa aaatgaatac gaatacctga aggagacaat 4320 cgggaagaga agataatcag ggcttcattg ggaaagaatc cacagaacta aaagaagtct 4380 aaagaattaa aaactataaa atcaaaaata aataaaataa aataaaaaca taaaaataaa 4440 caaaataaaa taaaaaacat ataaaaccct acataagtgg ggatcaagca acagtctaaa 4500 atgttttgaa ctgatgttga tcccagaact tttaaatcac tataatataa aaacacagcg 4560 gttcactact 4570 // ID MG029292; SV 1; linear; viral cRNA; STD; VRL; 1111 BP. XX AC MG029292; XX DT 06-JUL-2018 (Rel. 137, Created) DT 06-JUL-2018 (Rel. 137, Last updated, Version 1) XX DE Oriboca virus isolate BeAn17 segment S, complete sequence. XX KW . XX OS Oriboca virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Peribunyaviridae; Orthobunyavirus. XX RN [1] RC Publication Status: Online-Only RP 1-1111 RX DOI; .1371/journal.pone.0197294. RX PUBMED; 29795585. RA Nunes M.R., de Souza W.M., Acrani G.O., Cardoso J.F., da Silva S.P., RA Badra S.J., Figueiredo L.T., Vasconcelos P.F.; RT "Revalidation and genetic characterization of new members of Group C RT (Orthobunyavirus genus, Peribunyaviridae family) isolated in the Americas"; RL PLoS One 13(5):e0197294-e0197294(2018). XX RN [2] RP 1-1111 RA Nunes M.R., Souza W.M., Acrani G.O., Silva S.P., Vasconcelos P.F.C.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL Center for Technological Innovation, Evandro Chagas Institute, Rodovia RL BR-316 km 7 s/n, Ananindeua, Para State 67030-000, Brazil XX DR MD5; badc5689649a099b76a011f84f436815. DR EuropePMC; PMC5967719; 29795585. XX CC ##Assembly-Data-START## CC Assembly Method :: Newbler v. 3.0 CC Sequencing Technology :: 454 CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1111 FT /organism="Oriboca virus" FT /segment="S" FT /host="Sapajus apella" FT /isolate="BeAn17" FT /mol_type="viral cRNA" FT /country="Brazil" FT /collection_date="1954" FT /db_xref="taxon:192199" FT CDS 29..346 FT /codon_start=1 FT /product="NSs protein" FT /db_xref="GOA:A0A2Z3DE79" FT /db_xref="InterPro:IPR000797" FT /db_xref="UniProtKB/TrEMBL:A0A2Z3DE79" FT /protein_id="AVX48977.1" FT /translation="MSLAGLLEIDTESWQLLFLSFRLRNGDKIRLRLTLNKRTSALSMS FT TETSSHLRILGSSSCARMRLSRSSVRVRQSSLTLNLALGRSLLLITITLPTLQIRSLMV FT N" FT CDS 67..774 FT /codon_start=1 FT /product="N protein" FT /db_xref="GOA:W8CZS0" FT /db_xref="InterPro:IPR001784" FT /db_xref="UniProtKB/TrEMBL:W8CZS0" FT /protein_id="AVX48976.1" FT /translation="MATPLFEFSVEERGQNSSTFDPKQAYQRFIDEHRDELTLENIRVF FT FLRANEAKQKLRKSSAKLANLKFGTWKVPVVNNHYPANTANTVADGELTLHRISGFLAK FT FILELYADTEHRPEIEEKIINPIAESKGVTWAQSAKVYLSFFPGTEMFLHEFEMLPLAI FT YIYRAQKGEIDVALLKKPLRQQYKNDTPDKWMKEKKVMIQGAVSRISKLPWGTSGLSSQ FT AKDFLKEFGITMK" XX SQ Sequence 1111 BP; 345 A; 206 C; 245 G; 315 T; 0 other; agtagtgaac ttcttaggaa gttcactaat gtcacttgca ggtctattgg agattgatac 60 agaatcatgg caactcctct ttttgagttt tcggttgagg aacggggaca aaattcgtct 120 acgtttgacc ctaaacaagc gtaccagcgc tttatcgatg agcacagaga cgagctcaca 180 cttgagaata ttagggtctt cttcctgcgc gcgaatgagg ctaagcagaa gctccgtaag 240 agttcggcaa agctcgctaa ccttaaattt ggcacttgga aggtccctgt tgttaataac 300 cattaccctg ccaacactgc aaatacggtc gctgatggtg aactaactct tcatcggatt 360 tctgggttct tggctaagtt cattctagaa ctgtatgctg acacagaaca ccgcccagag 420 attgaagaga aaatcatcaa tcctattgca gaatcaaaag gggtcacatg ggctcagtct 480 gccaaggtgt acctttcatt cttccctgga acagagatgt tcctacatga gtttgaaatg 540 ctcccattgg caatctatat ctacagagct caaaagggag agattgatgt ggcactcttg 600 aagaagcctc ttagacaaca gtacaagaac gacacaccag acaagtggat gaaagagaag 660 aaagtaatga ttcagggagc tgtctccaga atctcaaaac tcccatgggg aaccagtggc 720 ttgtcatctc aggccaagga cttcctcaaa gagttcggaa taactatgaa gtaatctcat 780 cagaattaat aagattttta tagttctata agtttagtta agttttaggt taagttttgg 840 tttagagtaa tcaataatgg taagtaatgg taagttcgaa taagtgtaca ctataatttc 900 caaaattcta aattggggtt aattaattta attgggttta aattggggga aatatagctg 960 ctgaggaaat tgttttgagc agccatattt cacatgaaca aagggttggg tggttgggga 1020 aacaagagag gctgcctcat ttcacaattc taattgcaat atactaactg tgttcattga 1080 caaattgata ctttctaaga agaacactac t 1111 // ID MG029472; SV 1; linear; genomic RNA; STD; VRL; 293 BP. XX AC MG029472; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Hepacivirus C isolate 1303 core protein gene, partial cds. XX KW . XX OS Hepacivirus C OC Viruses; Riboviria; Flaviviridae; Hepacivirus. XX RN [1] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT "Hcv infected thalassemic population in eastern India"; RL Unpublished. XX RN [2] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL VIROLOGY, ICMR VIRUS UNIT, GB 4, 1st Floor, ID & BG Hospital Campus 57, Dr. RL S.C. Banerjee Road Beliaghata Kolkata 700010 West Bengal, KOLKATA, WEST RL BENGAL 700010, India XX DR MD5; 2a07eb0641bbea90e666568a482fe67e. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..293 FT /organism="Hepacivirus C" FT /host="Homo sapiens; thalassemia patient" FT /isolate="1303" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="serum" FT /collected_by="ICMR VIRUS UNIT,KOLKATA" FT /collection_date="14-Jun-2015" FT /note="subtype: a; genotype: 3" FT /db_xref="taxon:11103" FT CDS 1..>293 FT /codon_start=1 FT /product="core protein" FT /db_xref="GOA:A0A345ITA6" FT /db_xref="InterPro:IPR002522" FT /db_xref="UniProtKB/TrEMBL:A0A345ITA6" FT /protein_id="AXH06275.1" FT /translation="MSTLPKPQRKTKRNTIRRPQDVKFPGGGQIVGGVYVLPRRGPRLG FT VRATRKTSERSQPRGRRQPIPKARQSEGRYWAQPGYPWPLYGNEGCGWAGWLL" XX SQ Sequence 293 BP; 67 A; 88 C; 91 G; 47 T; 0 other; atgagcacac ttcctaaacc tcaaagaaaa accaaaagaa acaccatccg tcgcccacag 60 gacgtcaagt tcccgggtgg cggacagatc gttggtggag tatacgtgtt gccgcgcagg 120 ggcccacgat tgggtgtgcg cgcgacgcgt aaaacttctg aacggtcaca gcctcgcgga 180 cgacgacagc ctatccccaa ggcacgtcag agcgaaggcc ggtactgggc ccagcccggg 240 tacccttggc ccctctatgg taacgagggc tgcgggtggg cagggtggct cct 293 // ID MG029473; SV 1; linear; genomic RNA; STD; VRL; 293 BP. XX AC MG029473; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Hepacivirus C isolate 1324 core protein gene, partial cds. XX KW . XX OS Hepacivirus C OC Viruses; Riboviria; Flaviviridae; Hepacivirus. XX RN [1] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT "Hcv infected thalassemic population in eastern India"; RL Unpublished. XX RN [2] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL VIROLOGY, ICMR VIRUS UNIT, GB 4, 1st Floor, ID & BG Hospital Campus 57, Dr. RL S.C. Banerjee Road Beliaghata Kolkata 700010 West Bengal, KOLKATA, WEST RL BENGAL 700010, India XX DR MD5; b1fd0564c475d531eb518e30101a2e4f. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..293 FT /organism="Hepacivirus C" FT /host="Homo sapiens; thalassemia patient" FT /isolate="1324" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="serum" FT /collected_by="ICMR VIRUS UNIT,KOLKATA" FT /collection_date="14-Jun-2015" FT /note="subtype: a; genotype: 3" FT /db_xref="taxon:11103" FT CDS 1..>293 FT /codon_start=1 FT /product="core protein" FT /db_xref="GOA:A0A345ITA7" FT /db_xref="InterPro:IPR002522" FT /db_xref="UniProtKB/TrEMBL:A0A345ITA7" FT /protein_id="AXH06276.1" FT /translation="MSTLPKPQRKTKRNTIRRPQDVKFPGGGQIVGGVYVLPRRGPRLG FT VRATRKTSERSQPRGRRQPIPKARQSEGRYWAQPGYPWPLYGNEGCGWAGWLL" XX SQ Sequence 293 BP; 66 A; 88 C; 93 G; 46 T; 0 other; atgagcacgc ttccgaaacc tcaaagaaaa accaaaagaa acaccatccg tcgcccacag 60 gacgtcaagt tcccgggtgg cggacagatc gttggtggag tatacgtgtt gccgcgcagg 120 ggcccacgat tgggtgtgcg cgcgacgcgt aaaacttctg aacggtcaca gcctcgcgga 180 cgacgacagc ctatccccaa ggcacgtcag agcgaaggcc ggtactgggc ccagcccggg 240 tacccttggc ccctctatgg taacgagggc tgcgggtggg cagggtggct cct 293 // ID MG029474; SV 1; linear; genomic RNA; STD; VRL; 293 BP. XX AC MG029474; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Hepacivirus C isolate 1327 core protein gene, partial cds. XX KW . XX OS Hepacivirus C OC Viruses; Riboviria; Flaviviridae; Hepacivirus. XX RN [1] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT "Hcv infected thalassemic population in eastern India"; RL Unpublished. XX RN [2] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL VIROLOGY, ICMR VIRUS UNIT, GB 4, 1st Floor, ID & BG Hospital Campus 57, Dr. RL S.C. Banerjee Road Beliaghata Kolkata 700010 West Bengal, KOLKATA, WEST RL BENGAL 700010, India XX DR MD5; c0a0ae4d692fb4a0fdae98190c2212ed. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..293 FT /organism="Hepacivirus C" FT /host="Homo sapiens; thalassemia patient" FT /isolate="1327" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="serum" FT /collected_by="ICMR VIRUS UNIT,KOLKATA" FT /collection_date="14-Jun-2015" FT /note="subtype: a; genotype: 3" FT /db_xref="taxon:11103" FT CDS 1..>293 FT /codon_start=1 FT /product="core protein" FT /db_xref="GOA:A0A345ITA8" FT /db_xref="InterPro:IPR002522" FT /db_xref="UniProtKB/TrEMBL:A0A345ITA8" FT /protein_id="AXH06277.1" FT /translation="MSTLPNPQRKTKRNTIRRPQDVKFPGGGQIVGGVYVLPRRGPRLG FT VRATRKTSERSQPRGRRQPIPKARQSEGRYWAQPGYPWPLYGNEGCGWAGWLL" XX SQ Sequence 293 BP; 64 A; 90 C; 94 G; 45 T; 0 other; atgagcacgc ttccgaaccc gcaacgaaaa accaaaagaa acaccatccg tcgcccacag 60 gacgtcaagt tcccgggtgg cggacagatc gttggtggag tatacgtgtt gccgcgcagg 120 ggcccacgat tgggtgtgcg cgcgacgcgt aaaacttctg aacggtcaca gcctcgcgga 180 cgacgacagc ctatccccaa ggcacgtcag agcgaaggcc ggtactgggc ccagcccggg 240 tacccttggc ccctctatgg taacgagggc tgcgggtggg cagggtggct cct 293 // ID MG029475; SV 1; linear; genomic RNA; STD; VRL; 293 BP. XX AC MG029475; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Hepacivirus C isolate 1334 core protein gene, partial cds. XX KW . XX OS Hepacivirus C OC Viruses; Riboviria; Flaviviridae; Hepacivirus. XX RN [1] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT "Hcv infected thalassemic population in eastern India"; RL Unpublished. XX RN [2] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL VIROLOGY, ICMR VIRUS UNIT, GB 4, 1st Floor, ID & BG Hospital Campus 57, Dr. RL S.C. Banerjee Road Beliaghata Kolkata 700010 West Bengal, KOLKATA, WEST RL BENGAL 700010, India XX DR MD5; d9f0fbffc2110a06620459e1a640f89f. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..293 FT /organism="Hepacivirus C" FT /host="Homo sapiens; thalassemia patient" FT /isolate="1334" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="serum" FT /collected_by="ICMR VIRUS UNIT,KOLKATA" FT /collection_date="14-Jun-2015" FT /note="subtype: a; genotype: 3" FT /db_xref="taxon:11103" FT CDS 1..>293 FT /codon_start=1 FT /product="core protein" FT /db_xref="GOA:A0A345ITA9" FT /db_xref="InterPro:IPR002522" FT /db_xref="UniProtKB/TrEMBL:A0A345ITA9" FT /protein_id="AXH06278.1" FT /translation="MSTLPKPQRRTKRNTIRRPQHVKYPGGGQIVGGVYVLPRRGPRLG FT VRATRKTSERSQPRGRRQPIPTARQSEGRYWAQPGYPWPLYGNEGCGWAGWLL" XX SQ Sequence 293 BP; 66 A; 90 C; 91 G; 46 T; 0 other; atgagcacac ttcctaaacc tcaaagaaga accaaaagaa acaccatccg tcgcccacag 60 cacgtcaagt acccgggtgg cggacagatc gttggtggag tatacgtgtt gccgcgcagg 120 ggcccacgat tgggtgtgcg cgcgacgcgt aaaacttctg aacggtcaca gcctcgcgga 180 cgacgacagc ctatccccac ggcacgtcag agcgaaggcc ggtactgggc ccagcccggg 240 tacccttggc ccctctatgg taacgagggc tgcgggtggg cagggtggct cct 293 // ID MG029476; SV 1; linear; genomic RNA; STD; VRL; 293 BP. XX AC MG029476; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Hepacivirus C isolate 1351 core protein gene, partial cds. XX KW . XX OS Hepacivirus C OC Viruses; Riboviria; Flaviviridae; Hepacivirus. XX RN [1] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT "Hcv infected thalassemic population in eastern India"; RL Unpublished. XX RN [2] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL VIROLOGY, ICMR VIRUS UNIT, GB 4, 1st Floor, ID & BG Hospital Campus 57, Dr. RL S.C. Banerjee Road Beliaghata Kolkata 700010 West Bengal, KOLKATA, WEST RL BENGAL 700010, India XX DR MD5; dac4595dae77de707f360975edf30ad7. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..293 FT /organism="Hepacivirus C" FT /host="Homo sapiens; thalassemia patient" FT /isolate="1351" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="serum" FT /collected_by="ICMR VIRUS UNIT,KOLKATA" FT /collection_date="14-Jun-2015" FT /note="subtype: a; genotype: 3" FT /db_xref="taxon:11103" FT CDS 1..>293 FT /codon_start=1 FT /product="core protein" FT /db_xref="GOA:A0A345ITB0" FT /db_xref="InterPro:IPR002522" FT /db_xref="UniProtKB/TrEMBL:A0A345ITB0" FT /protein_id="AXH06279.1" FT /translation="MSTLPKPQRKTKRNTIRRPQDVKFPGGGQIVGGVYVLPRRGPRLG FT VRATRKTSERSQPRGRRQPIPKARQSEGRYWAQPGYPWPLYGNEGCGWAGWLL" XX SQ Sequence 293 BP; 66 A; 89 C; 92 G; 46 T; 0 other; atgagcacac tgcctaaacc tcaacgaaaa accaaaagaa acaccatccg tcgcccacag 60 gacgtcaagt tcccgggtgg cggacagatc gttggtggag tatacgtgtt gccgcgcagg 120 ggcccacgat tgggtgtgcg cgcgacgcgt aaaacttctg aacggtcaca gcctcgcgga 180 cgacgacagc ctatccccaa ggcacgtcag agcgaaggcc ggtactgggc ccagcccggg 240 tacccttggc ccctctatgg taacgagggc tgcgggtggg cagggtggct cct 293 // ID MG029477; SV 1; linear; genomic RNA; STD; VRL; 293 BP. XX AC MG029477; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Hepacivirus C isolate 1360 core protein gene, partial cds. XX KW . XX OS Hepacivirus C OC Viruses; Riboviria; Flaviviridae; Hepacivirus. XX RN [1] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT "Hcv infected thalassemic population in eastern India"; RL Unpublished. XX RN [2] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL VIROLOGY, ICMR VIRUS UNIT, GB 4, 1st Floor, ID & BG Hospital Campus 57, Dr. RL S.C. Banerjee Road Beliaghata Kolkata 700010 West Bengal, KOLKATA, WEST RL BENGAL 700010, India XX DR MD5; ca1639827df1c52969f7d38e1b29e986. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..293 FT /organism="Hepacivirus C" FT /host="Homo sapiens; thalassemia patient" FT /isolate="1360" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="serum" FT /collected_by="ICMR VIRUS UNIT,KOLKATA" FT /collection_date="14-Jun-2015" FT /note="subtype: a; genotype: 3" FT /db_xref="taxon:11103" FT CDS 1..>293 FT /codon_start=1 FT /product="core protein" FT /db_xref="GOA:A0A345ITB1" FT /db_xref="InterPro:IPR002522" FT /db_xref="UniProtKB/TrEMBL:A0A345ITB1" FT /protein_id="AXH06280.1" FT /translation="MSTRPKPQRKTKRNTIRRPQDVKFPGGGQIVGGVYVLPRRGPRLG FT VRATRKTSERSQPRGRRQPIPKARQSEGRYWAQPGYPWPLYGNEGCGWAGWLL" XX SQ Sequence 293 BP; 66 A; 88 C; 93 G; 46 T; 0 other; atgagcacac gtcctaaacc tcaaagaaaa accaaaagaa acaccatccg tcgcccacag 60 gacgtcaagt tcccgggtgg cggacagatc gttggtggag tatacgtgtt gccgcgcagg 120 ggcccacgat tgggtgtgcg cgcgacgcgt aagacttctg aacggtcaca gcctcgcgga 180 cgacgacagc ctatccccaa ggcacgtcag agcgaaggcc ggtactgggc ccagcccggg 240 tacccttggc ccctctatgg taacgagggc tgcgggtggg cagggtggct cct 293 // ID MG029478; SV 1; linear; genomic RNA; STD; VRL; 293 BP. XX AC MG029478; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Hepacivirus C isolate 1369 core protein gene, partial cds. XX KW . XX OS Hepacivirus C OC Viruses; Riboviria; Flaviviridae; Hepacivirus. XX RN [1] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT "Hcv infected thalassemic population in eastern India"; RL Unpublished. XX RN [2] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL VIROLOGY, ICMR VIRUS UNIT, GB 4, 1st Floor, ID & BG Hospital Campus 57, Dr. RL S.C. Banerjee Road Beliaghata Kolkata 700010 West Bengal, KOLKATA, WEST RL BENGAL 700010, India XX DR MD5; 9943c653a98a534c677bb660b98fa79f. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..293 FT /organism="Hepacivirus C" FT /host="Homo sapiens; thalassemia patient" FT /isolate="1369" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="serum" FT /collected_by="ICMR VIRUS UNIT,KOLKATA" FT /collection_date="14-Jun-2015" FT /note="subtype: a; genotype: 3" FT /db_xref="taxon:11103" FT CDS 1..>293 FT /codon_start=1 FT /product="core protein" FT /db_xref="GOA:A0A345ITB2" FT /db_xref="InterPro:IPR002522" FT /db_xref="UniProtKB/TrEMBL:A0A345ITB2" FT /protein_id="AXH06281.1" FT /translation="MSTLRTPHRKTKRNTIRRPQDVKFPGGGQIVGGVYVLPRRGPRLG FT VRATRKTSERSQPRGRRQPIPKARQSEGRYWAQPGYPWPLYGNEGCGWAGWLL" XX SQ Sequence 293 BP; 65 A; 88 C; 93 G; 47 T; 0 other; atgagcacac ttcgtacacc tcacagaaaa accaaaagaa acacgatccg tcgcccacag 60 gacgtcaagt tcccgggtgg cggacagatc gttggtggag tatacgtgtt gccgcgcagg 120 ggcccacgat tgggtgtgcg cgcgacgcgt aaaacttctg aacggtcaca gcctcgcgga 180 cgacgacagc ctatccccaa ggcacgtcag agcgaaggcc ggtactgggc ccagcccggg 240 tacccttggc ccctctatgg taacgagggc tgcgggtggg cagggtggct cct 293 // ID MG029479; SV 1; linear; genomic RNA; STD; VRL; 293 BP. XX AC MG029479; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Hepacivirus C isolate 1371 core protein gene, partial cds. XX KW . XX OS Hepacivirus C OC Viruses; Riboviria; Flaviviridae; Hepacivirus. XX RN [1] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT "Hcv infected thalassemic population in eastern India"; RL Unpublished. XX RN [2] RP 1-293 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL VIROLOGY, ICMR VIRUS UNIT, GB 4, 1st Floor, ID & BG Hospital Campus 57, Dr. RL S.C. Banerjee Road Beliaghata Kolkata 700010 West Bengal, KOLKATA, WEST RL BENGAL 700010, India XX DR MD5; 15d1151e883cd9b53055f4fcc4dec7ce. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..293 FT /organism="Hepacivirus C" FT /host="Homo sapiens; thalassemia patient" FT /isolate="1371" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="serum" FT /collected_by="ICMR VIRUS UNIT,KOLKATA" FT /collection_date="14-Jun-2015" FT /note="subtype: a; genotype: 3" FT /db_xref="taxon:11103" FT CDS 1..>293 FT /codon_start=1 FT /product="core protein" FT /db_xref="GOA:A0A345ITB3" FT /db_xref="InterPro:IPR002522" FT /db_xref="UniProtKB/TrEMBL:A0A345ITB3" FT /protein_id="AXH06282.1" FT /translation="MSTRPTPHRKTKRNTIRRPQDVKFPGGGQIVGGVYVLPRRGPRLG FT VRATRKTSERSQPRGRRQPIPKARQSEGRYWAQPGYPWPLYGNEGCGWAGWLL" XX SQ Sequence 293 BP; 65 A; 90 C; 92 G; 46 T; 0 other; atgagcacac gtcctacacc tcacagaaaa accaaaagaa acaccatccg tcgcccacag 60 gacgtcaagt tcccgggtgg cggacagatc gttggtggag tatacgtgtt gccgcgcagg 120 ggcccacgat tgggtgtgcg cgcgacgcgt aaaacttctg aacggtcaca gcctcgcgga 180 cgacgacagc ctatccccaa ggcacgtcag agcgaaggcc ggtactgggc ccagcccggg 240 tacccttggc ccctctatgg taacgagggc tgcgggtggg cagggtggct cct 293 // ID MG029480; SV 1; linear; genomic RNA; STD; VRL; 283 BP. XX AC MG029480; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Hepacivirus C isolate 1387 core protein gene, partial cds. XX KW . XX OS Hepacivirus C OC Viruses; Riboviria; Flaviviridae; Hepacivirus. XX RN [1] RP 1-283 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT "Hcv infected thalassemic population in eastern India"; RL Unpublished. XX RN [2] RP 1-283 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL VIROLOGY, ICMR VIRUS UNIT, GB 4, 1st Floor, ID & BG Hospital Campus 57, Dr. RL S.C. Banerjee Road Beliaghata Kolkata 700010 West Bengal, KOLKATA, WEST RL BENGAL 700010, India XX DR MD5; e594c52c0ab1a8a947e57f56403732be. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..283 FT /organism="Hepacivirus C" FT /host="Homo sapiens; thalassemia patient" FT /isolate="1387" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="serum" FT /collected_by="ICMR VIRUS UNIT,KOLKATA" FT /collection_date="14-Jun-2015" FT /note="subtype: a; genotype: 3" FT /db_xref="taxon:11103" FT CDS 1..>283 FT /codon_start=1 FT /product="core protein" FT /db_xref="GOA:A0A345ITB4" FT /db_xref="InterPro:IPR002522" FT /db_xref="UniProtKB/TrEMBL:A0A345ITB4" FT /protein_id="AXH06283.1" FT /translation="MSTRPTPHRKTKRNTIRRPQDVKFPGGGQIVGGVYVLPRRGPRLG FT VRATRKTSERSQPRGRRQPIPKARQSEGRYWAQPGYPWPLYGNEGCGWA" XX SQ Sequence 283 BP; 65 A; 87 C; 89 G; 42 T; 0 other; atgagcacac gtccgacacc tcacagaaaa accaaaagaa acaccatccg tcgcccacag 60 gacgtcaagt tcccgggtgg cggacagatc gttggtggag tatacgtgtt gccgcgcagg 120 ggcccacgat tgggtgtgcg cgcgacgcgt aaaacttctg aacggtcaca gcctcgcgga 180 cgacgacagc ctatccccaa ggcacgtcag agcgaaggcc ggtactgggc ccagcccggg 240 tacccttggc ccctctatgg taacgagggc tgcgggtggg cag 283 // ID MG029481; SV 1; linear; genomic RNA; STD; VRL; 285 BP. XX AC MG029481; XX DT 06-AUG-2018 (Rel. 137, Created) DT 06-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Hepacivirus C isolate 1395 core protein gene, partial cds. XX KW . XX OS Hepacivirus C OC Viruses; Riboviria; Flaviviridae; Hepacivirus. XX RN [1] RP 1-285 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT "Hcv infected thalassemic population in eastern India"; RL Unpublished. XX RN [2] RP 1-285 RA Biswas A., Gupta D., Firdaus R., Saha K., Sadhukhan P.C.H.; RT ; RL Submitted (03-OCT-2017) to the INSDC. RL VIROLOGY, ICMR VIRUS UNIT, GB 4, 1st Floor, ID & BG Hospital Campus 57, Dr. RL S.C. Banerjee Road Beliaghata Kolkata 700010 West Bengal, KOLKATA, WEST RL BENGAL 700010, India XX DR MD5; 8e2ce4b0dc160d9c9fd5f1c822bf0510. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..285 FT /organism="Hepacivirus C" FT /host="Homo sapiens; thalassemia patient" FT /isolate="1395" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="serum" FT /collected_by="ICMR VIRUS UNIT,KOLKATA" FT /collection_date="14-Jun-2015" FT /note="subtype: a; genotype: 3" FT /db_xref="taxon:11103" FT CDS 1..>285 FT /codon_start=1 FT /product="core protein" FT /db_xref="GOA:A0A345ITB5" FT /db_xref="InterPro:IPR002522" FT /db_xref="UniProtKB/TrEMBL:A0A345ITB5" FT /protein_id="AXH06284.1" FT /translation="MSTLPKPQRKTKRNTVRRPQDVKFPGGGQIVGGVYVLPRRGPRLG FT VRATRKTSERSQPRGRRQPIPKARRSEGRSWAQPGYPWPLYGNEGCGWAG" XX SQ Sequence 285 BP; 64 A; 84 C; 91 G; 46 T; 0 other; atgagcacac ttcctaaacc tcaaagaaaa accaaaagaa acaccgtccg tcgcccacag 60 gacgtcaagt tcccgggtgg cggacagatc gttggtggag tatacgtgtt gccgcgcagg 120 ggcccacgat tgggtgtgcg cgcgacgcgt aagacttctg aacggtcaca gcctcgcgga 180 cgacgacagc ctatccccaa ggcgcgtcga agcgaaggcc ggtcctgggc tcagcccggg 240 tacccttggc ccctttacgg taacgagggc tgtgggtggg cagga 285 // ID MG029625; SV 1; linear; genomic RNA; STD; VRL; 562 BP. XX AC MG029625; XX DT 26-APR-2018 (Rel. 136, Created) DT 26-APR-2018 (Rel. 136, Last updated, Version 1) XX DE Groundnut ringspot orthotospovirus isolate S30 nucleocapsid protein gene, DE partial cds. XX KW . XX OS Groundnut ringspot tospovirus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes; OC Bunyavirales; Tospoviridae; Orthotospovirus. XX RN [1] RP 1-562 RA Fontes M.G., Fonseca M.E.N., Boiteux L.S., Lima M.F., Silva Filho J.G.; RT "First report of Groundnut ringspot orthotospovirus infecting soybean RT (Glycine max) in Brazil"; RL Unpublished. XX RN [2] RP 1-562 RA Fontes M.G., Fonseca M.E.N., Boiteux L.S., Lima M.F., Silva Filho J.G.; RT ; RL Submitted (04-OCT-2017) to the INSDC. RL Plant Pathology, UNB, Campus Universitario Darcy Ribeiro, Asa Norte, RL Brasilia, Df 70910-900, Brazil XX DR MD5; eee3e6779d198a094671ddf124581dff. XX FH Key Location/Qualifiers FH FT source 1..562 FT /organism="Groundnut ringspot tospovirus" FT /host="Glycine max" FT /isolate="S30" FT /mol_type="genomic RNA" FT /country="Brazil" FT /lat_lon="15.93 S 48.14 W" FT /collection_date="Feb-2017" FT /db_xref="taxon:1933292" FT /altitude="900m" FT CDS 1..>562 FT /codon_start=1 FT /product="nucleocapsid protein" FT /db_xref="GOA:A0A2S0RPW6" FT /db_xref="InterPro:IPR002517" FT /db_xref="UniProtKB/TrEMBL:A0A2S0RPW6" FT /protein_id="AWA45293.1" FT /translation="MSKVKLTKENIVSLLTQSADVEFEEDQNQVAFNFKTFCQENLDLI FT KKMSITSCLTFLKNRQSIMKVVNQSDFTFGKVTIKKNSERVGAKDMTFRRLDSMIRVKL FT IEETANNENLAIIKAKIASHPLVQAYGLPLADAKSVRLAIMLGGSIPLIASVDSFEMIS FT VVLAIYQDAKYKELGIEPTKYNTK" XX SQ Sequence 562 BP; 184 A; 99 C; 118 G; 161 T; 0 other; atgtctaagg tcaagctcac aaaagaaaac attgtctctc ttttaactca atctgcagat 60 gttgagtttg aagaagacca aaaccaggtt gcatttaact ttaagacttt ctgtcaggaa 120 aatcttgacc tgattaagaa aatgagtatc acttcatgtt tgactttctt gaagaatcgt 180 caaagcatta tgaaagttgt gaaccaaagt gactttactt ttggtaaggt cacgataaag 240 aaaaattctg agagagttgg agctaaagat atgactttca ggaggcttga tagcatgata 300 agagtgaagc tcatagaaga gactgcaaac aatgagaatc ttgctattat caaggcaaaa 360 attgcctccc accctttggt ccaagcttac gggctgcctc tggcagatgc aaaatctgtg 420 agacttgcta taatgcttgg aggtagtatc cctctcattg cttctgttga cagcttcgaa 480 atgatcagtg ttgttcttgc catatatcaa gatgcaaagt acaaggagtt agggattgaa 540 ccaactaagt acaacactaa gg 562 // ID MG030372; SV 1; linear; genomic RNA; STD; VRL; 256 BP. XX AC MG030372; XX DT 20-DEC-2017 (Rel. 135, Created) DT 20-DEC-2017 (Rel. 135, Last updated, Version 4) XX DE Parechovirus A isolate Brisbane/ORChID-13/2010-2014 polyprotein gene, DE partial cds. XX KW . XX OS Parechovirus A OC Viruses; Riboviria; Picornavirales; Picornaviridae; Parechovirus. XX RN [1] RP 1-256 RA Wang C.Y.T., Mhango L.P., Ware R., Tozer S., Grimwood K., Lambert S., RA Bialasiewicz S.; RT "Human parechovirus infections in Australian infants during the first RT 2-years of life: a community-based birth cohort study"; RL Unpublished. XX RN [2] RP 1-256 RA Wang C.Y.T., Mhango L.P., Ware R., Tozer S., Grimwood K., Lambert S., RA Bialasiewicz S.; RT ; RL Submitted (04-OCT-2017) to the INSDC. RL Malaria and Molecular Diagnostic, QPID Laboratory, Centre for Children's RL Health Research, Level 8, 62 Graham St, South Brisbane, QLD 4101, Australia XX DR MD5; 72650072c8eafd675fb5792ca355ddda. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..256 FT /organism="Parechovirus A" FT /host="Homo sapiens" FT /isolate="Brisbane/ORChID-13/2010-2014" FT /mol_type="genomic RNA" FT /country="Australia" FT /isolation_source="stool" FT /collection_date="2010/2014" FT /db_xref="taxon:1803956" FT CDS <1..>256 FT /codon_start=2 FT /product="polyprotein" FT /note="VP3-1" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A2H4WAP5" FT /protein_id="AUC64103.1" FT /translation="NDHPIGLFQIEVLNRLTYNSSSPSEVYCIVQGKMGQDARFFCPTG FT SVVTFQNSWGSQMDLTDPLCIEDDTENCKQTMSPNELGLT" XX SQ Sequence 256 BP; 79 A; 52 C; 53 G; 72 T; 0 other; aaatgatcat ccaattggat tgttccagat tgaggttcta aacaggctca catacaatag 60 ctccagccct tctgaagtgt actgcatagt acaaggtaaa atgggacaag atgccagatt 120 cttttgtcca acaggttctg tggtgacttt ccagaactca tggggttccc agatggattt 180 aactgatcca ctttgtattg aggatgacac cgaaaattgt aaacagacaa tgtcccctaa 240 tgaattagga cttaca 256 // ID MG030373; SV 1; linear; genomic RNA; STD; VRL; 256 BP. XX AC MG030373; XX DT 20-DEC-2017 (Rel. 135, Created) DT 20-DEC-2017 (Rel. 135, Last updated, Version 4) XX DE Parechovirus A isolate Brisbane/ORChID-32/2010-2014 polyprotein gene, DE partial cds. XX KW . XX OS Parechovirus A OC Viruses; Riboviria; Picornavirales; Picornaviridae; Parechovirus. XX RN [1] RP 1-256 RA Wang C.Y.T., Mhango L.P., Ware R., Tozer S., Grimwood K., Lambert S., RA Bialasiewicz S.; RT "Human parechovirus infections in Australian infants during the first RT 2-years of life: a community-based birth cohort study"; RL Unpublished. XX RN [2] RP 1-256 RA Wang C.Y.T., Mhango L.P., Ware R., Tozer S., Grimwood K., Lambert S., RA Bialasiewicz S.; RT ; RL Submitted (04-OCT-2017) to the INSDC. RL Malaria and Molecular Diagnostic, QPID Laboratory, Centre for Children's RL Health Research, Level 8, 62 Graham St, South Brisbane, QLD 4101, Australia XX DR MD5; f22dfc766842237ede531de53b127156. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..256 FT /organism="Parechovirus A" FT /host="Homo sapiens" FT /isolate="Brisbane/ORChID-32/2010-2014" FT /mol_type="genomic RNA" FT /country="Australia" FT /isolation_source="stool" FT /collection_date="2010/2014" FT /db_xref="taxon:1803956" FT CDS <1..>256 FT /codon_start=2 FT /product="polyprotein" FT /note="VP3-1" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A2H4WAP7" FT /protein_id="AUC64104.1" FT /translation="HGNKLGLLQVEVLNRLTFNSSSPNKVHCIIQGRLGEDARFFCPAG FT SLVAFQNSWGSQMDLTDPLCIEDSTEDCKQDISPNELGLT" XX SQ Sequence 256 BP; 78 A; 52 C; 58 G; 68 T; 0 other; acatgggaac aagctgggtt tattacaagt tgaagtgttg aacaggttaa cattcaatag 60 ttccagccca aataaagtac attgtataat tcaaggcaga ctaggagagg atgccaggtt 120 tttttgtcca gctggttccc tagtagcctt ccaaaattcc tggggatccc aaatggatct 180 gactgatccc ttatgcattg aagatagcac agaggattgc aaacaggaca tttctcctaa 240 cgaactgggc ctaacg 256 // ID MG030374; SV 1; linear; genomic RNA; STD; VRL; 259 BP. XX AC MG030374; XX DT 20-DEC-2017 (Rel. 135, Created) DT 20-DEC-2017 (Rel. 135, Last updated, Version 4) XX DE Parechovirus A isolate Brisbane/ORChID-56/2010-2014 polyprotein gene, DE partial cds. XX KW . XX OS Parechovirus A OC Viruses; Riboviria; Picornavirales; Picornaviridae; Parechovirus. XX RN [1] RP 1-259 RA Wang C.Y.T., Mhango L.P., Ware R., Tozer S., Grimwood K., Lambert S., RA Bialasiewicz S.; RT "Human parechovirus infections in Australian infants during the first RT 2-years of life: a community-based birth cohort study"; RL Unpublished. XX RN [2] RP 1-259 RA Wang C.Y.T., Mhango L.P., Ware R., Tozer S., Grimwood K., Lambert S., RA Bialasiewicz S.; RT ; RL Submitted (04-OCT-2017) to the INSDC. RL Malaria and Molecular Diagnostic, QPID Laboratory, Centre for Children's RL Health Research, Level 8, 62 Graham St, South Brisbane, QLD 4101, Australia XX DR MD5; b06a346a22a83048268650da5c53f315. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..259 FT /organism="Parechovirus A" FT /host="Homo sapiens" FT /isolate="Brisbane/ORChID-56/2010-2014" FT /mol_type="genomic RNA" FT /country="Australia" FT /isolation_source="stool" FT /collection_date="2010/2014" FT /db_xref="taxon:1803956" FT CDS <1..>259 FT /codon_start=2 FT /product="polyprotein" FT /note="VP3-1" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A0A2H4WAP6" FT /protein_id="AUC64105.1" FT /translation="NGRPLGLFQVEVLNRLTYNSSCPNKVHCIVQGRLGNDARFYCPTG FT SLVEFQNSWGSQMDLSDPLCVEDDETEDCKQTISPDELGLT" XX SQ Sequence 259 BP; 82 A; 53 C; 57 G; 67 T; 0 other; aaacggaaga ccattgggac ttttccaagt tgaagtgttg aatagactaa cctacaacag 60 ttcatgtcct aataaagttc actgcatagt tcaaggtaga ttggggaatg atgctaggtt 120 ctattgccca acaggatccc ttgtcgagtt ccaaaactcc tggggttctc aaatggatct 180 tagcgaccca ttgtgtgtag aagatgatga aacagaggac tgcaaacaaa ccatttcacc 240 agatgaatta ggcctcaca 259 // ID MG030482; SV 1; linear; genomic RNA; STD; VRL; 419 BP. XX AC MG030482; XX DT 29-AUG-2018 (Rel. 137, Created) DT 29-AUG-2018 (Rel. 137, Last updated, Version 1) XX DE Bovine viral diarrhea virus 1 isolate 1604 polyprotein-like gene, partial DE sequence. XX KW . XX OS Bovine viral diarrhea virus 1 OC Viruses; Riboviria; Flaviviridae; Pestivirus. XX RN [1] RC Publication Status: Online-Only RP 1-419 RX DOI; .1186/s12917-018-1555-4. RX PUBMED; 30086756. RA Han D.G., Ryu J.H., Park J., Choi K.S.; RT "Identification of a new bovine viral diarrhea virus subtype in the RT Republic of Korea"; RL BMC Vet. Res. 14(1):233-233(2018). XX RN [2] RP 1-419 RA Han D.-G., Choi K.-S.; RT ; RL Submitted (02-OCT-2017) to the INSDC. RL Department of Animal Science and Biotechnology, Kyungpook National RL University, 386 Gajang-Dong, Sangju 37224, Republic of Korea XX DR MD5; c004602ab69a84bea24d6f636396e505. DR EuropePMC; PMC6081834; 30086756. XX FH Key Location/Qualifiers FH FT source 1..419 FT /organism="Bovine viral diarrhea virus 1" FT /host="bovine" FT /isolate="1604" FT /mol_type="genomic RNA" FT /country="South Korea" FT /isolation_source="diarrhea sample from Korean native calf" FT /collection_date="07-Apr-2017" FT /note="subtype: BVDV1o" FT /db_xref="taxon:11099" FT misc_feature 20..>419 FT /note="similar to polyprotein" XX SQ Sequence 419 BP; 136 A; 90 C; 104 G; 89 T; 0 other; ttctctgctg gtcatggcac atggaggcaa ggtaaaatga acttttatac aaaacataca 60 aacaaaaacc cttaggagtg gaggaaccag tctatgataa ggcaggcaat ccgttgtttg 120 gtgagaaagg agaagtccat cccctgtcaa cattaaaact cccacacagg agaggggaac 180 gcgatattcc taccaacttg gcttcattac caaaaagagg cgactgcaga tcaggcaaca 240 gcaaaggacc cgttagtgga atctacttaa agccagggcc tctttactac caggactaca 300 aaggaccagt ctatcacaga gccccattgg agctttttga ggaagggttc atgtgtgaaa 360 caaccaaaag gattgggagg gttacaggta ggatcggcaa gctggaccac atttatgtg 419 // ID MG030483; SV 1; linear; genomic DNA; STD; VRL; 35937 BP. XX AC MG030483; XX DT 09-NOV-2018 (Rel. 138, Created) DT 09-NOV-2018 (Rel. 138, Last updated, Version 1) XX DE Human adenovirus type 4 isolate NVI1727, complete genome. XX KW . XX OS Human adenovirus E4 OC Viruses; Adenoviridae; Mastadenovirus; Human mastadenovirus E. XX RN [1] RP 1-35937 RA Rogers A.E., Lu X., Killerby M., Campbell E., Gallus L., Kamau E., Froh I., RA Nowak G., Viers C., Erdman D.D., Sakthivel S., Gerber S.I., Schneider E., RA Watson J.T., Johnson L.A.; RT "Outbreak of Respiratory Illness associated with Adenovirus Type 4 at the RT United States Naval Academy"; RL Unpublished. XX RN [2] RP 1-35937 RA Lu X., Erdman D.D.; RT ; RL Submitted (04-OCT-2017) to the INSDC. RL Division of Viral Diseases, Centers for Disease Control and Prevention, RL 1600 Clifton Road, N.E., Atlanta, GA 30329, USA XX DR MD5; 70c6b21451bed4097622785f0a428c2b. XX CC ##Assembly-Data-START## CC Assembly Method :: CLC Genomics Workbench V. 8.5.1 CC Sequencing Technology :: Illumina CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..35937 FT /organism="Human adenovirus E4" FT /host="Homo sapiens" FT /isolate="NVI1727" FT /serotype="4" FT /mol_type="genomic DNA" FT /country="USA" FT /isolation_source="nasopharyngeal swab" FT /collection_date="08-Sep-2016" FT /note="genotype: 4a1" FT /db_xref="taxon:28280" FT repeat_region 1..193 FT /rpt_type=INVERTED FT /note="inverted terminal repeat" FT gene 561..1454 FT /gene="E1A" FT CDS join(561..1094,1187..1393) FT /codon_start=1 FT /gene="E1A" FT /product="27 kDa protein" FT /db_xref="GOA:Q2KSH1" FT /db_xref="InterPro:IPR014410" FT /db_xref="UniProtKB/TrEMBL:Q2KSH1" FT /protein_id="AXN73174.1" FT /translation="MRHLRDLPDEKIIIASGSEILELVVNAIMGDDHPEPPTPFETPSL FT HDLYDLEVDVPEDDPNEEAVNDLFSDAALLAAEEALSPRHGRGDKKIPWLKGEEMDLHC FT YEECLPPSDDEYEQAIQNAASQGVQAASESFALDCPPLPGHGCKSCEFHRMNTGDKAVL FT CALCYMRAYNHCVYSPVSDADDETPTTESTLLPPEIGTSPPKNIIRPVPVKATGRRAAV FT ECLDDLLQGGDEPLDLCTRKRPRH" FT CDS join(561..1001,1187..1393) FT /codon_start=1 FT /gene="E1A" FT /product="23.5 kDa protein" FT /db_xref="GOA:Q2KSH0" FT /db_xref="InterPro:IPR014410" FT /db_xref="UniProtKB/TrEMBL:Q2KSH0" FT /protein_id="AXN73173.1" FT /translation="MRHLRDLPDEKIIIASGSEILELVVNAIMGDDHPEPPTPFETPSL FT HDLYDLEVDVPEDDPNEEAVNDLFSDAALLAAEEALSPRHGRGDKKIPWLKGEEMDLHC FT YEECLPPSDDEYEQAIQNAASQGVQAASESFALDCPPLPGHGCPVSDADDETPTTESTL FT LPPEIGTSPPKNIIRPVPVKATGRRAAVECLDDLLQGGDEPLDLCTRKRPRH" FT CDS join(561..635,1188..1292) FT /codon_start=1 FT /gene="E1A" FT /product="6.8 kDa protein" FT /db_xref="UniProtKB/TrEMBL:Q2KSG9" FT /protein_id="AXN73175.1" FT /translation="MRHLRDLPDEKIIIASGSEILELVVPSLMQMMKPPLQSPLCYPLK FT LARPHLRILLDQFL" FT CDS 561..1106 FT /codon_start=1 FT /gene="E1A" FT /product="hypothetical protein" FT /db_xref="GOA:Q2KSG8" FT /db_xref="InterPro:IPR014410" FT /db_xref="UniProtKB/TrEMBL:Q2KSG8" FT /protein_id="AXN73176.1" FT /translation="MRHLRDLPDEKIIIASGSEILELVVNAIMGDDHPEPPTPFETPSL FT HDLYDLEVDVPEDDPNEEAVNDLFSDAALLAAEEALSPRHGRGDKKIPWLKGEEMDLHC FT YEECLPPSDDEYEQAIQNAASQGVQAASESFALDCPPLPGHGCKSCEFHRMNTGDKAVL FT CALCYMRAYNHCVYSKCD" FT regulatory 1449..1454 FT /gene="E1A" FT /regulatory_class="polyA_signal_sequence" FT gene 1503..3869 FT /gene="E1B" FT regulatory 1503..1542 FT /gene="E1B" FT /regulatory_class="promoter" FT CDS 1550..2101 FT /codon_start=1 FT /gene="E1B" FT /product="21.5 kDa protein" FT /db_xref="GOA:Q2KSG7" FT /db_xref="InterPro:IPR002475" FT /db_xref="InterPro:IPR002924" FT /db_xref="UniProtKB/TrEMBL:Q2KSG7" FT /protein_id="AXN73177.1" FT /translation="MEIWTVLEDFYKTRQLLENASNGVSYLWRFCFGGDLAKLVYRTKQ FT DYKEQFDDILKECPGLFDALNLGHQSHFNQRISRALDFTTPGRTTAAVAFFAFVLDKWS FT QETHFSRDYQLDFLAVALWRAWKCQRLNAISGYLPVQPLDTLRILSLQQISQERQRRQQ FT QQEDQEENPRAGLDPPAEEE" FT CDS join(1550..1951,1962..1979) FT /codon_start=1 FT /gene="E1B" FT /product="small T antigen" FT /note="16.5 kDa protein" FT /db_xref="GOA:Q2KSG6" FT /db_xref="InterPro:IPR002475" FT /db_xref="InterPro:IPR002924" FT /db_xref="UniProtKB/TrEMBL:Q2KSG6" FT /protein_id="AXN73178.1" FT /translation="MEIWTVLEDFYKTRQLLENASNGVSYLWRFCFGGDLAKLVYRTKQ FT DYKEQFDDILKECPGLFDALNLGHQSHFNQRISRALDFTTPGRTTAAVAFFAFVLDKWS FT QETHFSRDYQLDFLAVALWRAWKCQRLNAICRYSR" FT CDS 1855..3342 FT /codon_start=1 FT /gene="E1B" FT /product="large T antigen" FT /note="55 kDa protein" FT /db_xref="InterPro:IPR002612" FT /db_xref="InterPro:IPR006717" FT /db_xref="InterPro:IPR011050" FT /db_xref="UniProtKB/TrEMBL:A0A3G1S5W5" FT /protein_id="AXN73179.1" FT /translation="MESRNPFQQGLPVGFLSSSFVESMEVPAPECNLRLLAGTAARHSE FT DPKSPANFPGTPTPPAAAGGSRREPESRPGPSGGGGVADLFPELRRVLTRSLSGRERGI FT KRERHDETNHRTELTVGLMSRKRPETVWWHEVQLTGTDEVSVMHEKFSLEQVKTCWLEP FT EDDWEVAIRNYAKLALRPDRKYKITKLINIRNACYISGNGAEVEICPQDRVAFRCCMMN FT MYPGVVYMDGVTFMNIRFRGDGYNGTVFMANTKLTVHGCSFFGFNNTCIEAWGHVGVRG FT CSFSANWMGVVGRTKSMLSVKKCLFERCHLGVMSEGEARIRHCASTETGCFVLCKGNAK FT IKHNMICGASDERGYQMLTCAGGNSHMLATVHVASHPRKPWPEFEHNVMTRCNMHLGAR FT RGMLMLYQCNLNYVKVLLEPDAMSRVSLTGVFDMNVEVWKILRYDEYKTRCRACECGGK FT HARFQPVCVDVTEDLRPDHLVLSCTGTEFGSSGEESD" FT CDS join(1855..2139,3127..3342) FT /codon_start=1 FT /gene="E1B" FT /product="17.9 kDa protein" FT /db_xref="InterPro:IPR002612" FT /db_xref="InterPro:IPR006717" FT /db_xref="UniProtKB/TrEMBL:T1SR21" FT /protein_id="AXN73180.1" FT /translation="MESRNPFQQGLPVGFLSSSFVESMEVPAPECNLRLLAGTAARHSE FT DPKSPANFPGTPTPPAAAGGSRREPESRPGPSGGGGVADLFPELRRVLTRVSLTGVFDM FT NVEVWKILRYDEYKTRCRACECGGKHARFQPVCVDVTEDLRPDHLVLSCTGTEFGSSGE FT ESD" FT CDS join(1855..2108,3244..3262) FT /codon_start=1 FT /gene="E1B" FT /product="9 kDa protein" FT /db_xref="InterPro:IPR006717" FT /db_xref="UniProtKB/TrEMBL:T1SQR4" FT /protein_id="AXN73181.1" FT /translation="MESRNPFQQGLPVGFLSSSFVESMEVPAPECNLRLLAGTAARHSE FT DPKSPANFPGTPTPPAAAGGSRREPESRPGPSGGGGVADLPCVWM" FT gene 3426..3869 FT /gene="IX" FT CDS 3426..3854 FT /codon_start=1 FT /gene="IX" FT /product="hexon-associated protein IX" FT /db_xref="GOA:Q2KSG2" FT /db_xref="InterPro:IPR005641" FT /db_xref="UniProtKB/TrEMBL:Q2KSG2" FT /protein_id="AXN73218.1" FT /translation="MSGSGSFEGGIFSPYLTGRLPSWAGVRQNVMGSTVDGRPVQPANS FT STLTYATLSSSSVDAAAAAAAASAASVVRGMAMGAGYYGTLVANSSSTNNPASLNEEKL FT LLLMAQLEALSQRLGELTQQVAQLQEQTRAAVATVKSK" FT regulatory 3864..3869 FT /gene="IX" FT /note="E1B/IX" FT /regulatory_class="polyA_signal_sequence" FT gene complement(3886..5541) FT /gene="IVa2" FT regulatory complement(3886..3891) FT /gene="IVa2" FT /note="IVa2/E2B" FT /regulatory_class="polyA_signal_sequence" FT CDS complement(join(3917..5250,5529..5541)) FT /codon_start=1 FT /gene="IVa2" FT /product="maturation protein IVa2" FT /db_xref="GOA:Q2KSG1" FT /db_xref="InterPro:IPR003389" FT /db_xref="InterPro:IPR003593" FT /db_xref="InterPro:IPR027417" FT /db_xref="UniProtKB/TrEMBL:Q2KSG1" FT /protein_id="AXN73182.1" FT /translation="METRGCRSGAVLDQPDEPEAHPRKRPTRRAPLHRDGDHPEADAST FT LEGPDPGHAGRPSSGALLPQSPQPAKRGGLLDRDAVEHITELWDRLELLQQTLSKMPMA FT DGLKPLKNFASLQELLSLGGERLLAELVRENMHVRQMINEVAPLLREDGSCLSLNYHLQ FT PVIGVIYGPTGCGKSQLLRNLLSAQLISPSPETVFFIAPQVDMIPPSELKAWEMQICEG FT NYAPGIEGTFVPQSGTLRPKFIKMAYDDLTQDHNYDVSDPRNVFAQAAAHGPIAIIMDE FT CMENLGGHKGVSKFFHAFPSKLHDKFPKCTGYTVLVVLHNMNPRRDLGGNIANLKIQAK FT MHLISPRMHPSQLNRFVNTYTKGLPVAISLLLKDIVQHHALRPCYDWVIYNTTPEHEAL FT QWSYLHPRDGLMPMYLNIQSHLYRVLEKIHRVLNDRGRWSRAYRARKIK" FT gene complement(5020..12140) FT /gene="E2B" FT CDS complement(join(5020..8592,12132..12140)) FT /codon_start=1 FT /gene="E2B" FT /product="DNA polymerase" FT /db_xref="GOA:Q2KSG0" FT /db_xref="InterPro:IPR004868" FT /db_xref="InterPro:IPR006172" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR014382" FT /db_xref="InterPro:IPR017964" FT /db_xref="UniProtKB/TrEMBL:Q2KSG0" FT /protein_id="AXN73183.1" FT /translation="MALVQTHGSRGLHPEATDPGRQPSRRRSRQSSPDAVPEPARARSR FT RAPAAPACRSRAAPAARRASSPPLLTMEAKPPPPTKKKQGTVVTPQGHGTLQAIDVATN FT GAVEIKYHLDLPRALEKLMQVNRAPPLPTDLTPQRLRSLDSSGLRALVLALRPVRAEVW FT TCLPRGLVSMTTIEAEEGQANHHNVVQHQMQAPQLHFPLKFLVKGTQVQLVQHVHPVQR FT CEHCGRLYKHKHECSARRRHFYFHHINSHSSNWWKEIQFFPIGSHPRTERLFLTYDVET FT YTWMGSFGKQLVPFMLVMKLSGDDRLVKLALDLALQLKWDRWHGDPRTFYCVTPEKMAV FT GQQFRQYRDRLQTALAVDLWTSFLRANPHLADWALKQHGLSDPDELTYEELKKLPHVKG FT RPRFVELYIVGHNINGFDEIVLAAQVINNRAEVPQPFRITRNFMPRAGKILFNDVTFAL FT PNPAYKKRTDFQLWEHGGCDDIDFKYQFLKVMVRDTFALTHTSLRKAAQAYALPVEKGC FT CAYKAVNQFYMLGSYRADQDGFPLEEYWKDREEFLLNRQLWKQKGKLKYDIIQETLDYC FT ALDVLVTAELVAKLQDSYAHFIRDSVGLPHAHFNIFQRPTISSNSHAIFRQIVYRTEKP FT NRSNLGAGLLAPSHELYDYVRASIRGGRCYPTYIGILEEPLYVYDICGMYASALTHPMP FT WGTPLSPYERALAVREWQASLDDLGTCISYFDPDLLPGIFTIDADPPEELMLDPLPPFC FT SRKGGRLCWTNEPLRGEVATSVDLITLHNRGWQVRIVPDKLTTVFPEWKCVAREYVQLN FT IAAKERADKEKNQTMRSIAKLLSNALYGSFATKLDNKKIVFSDQMDEGLLKGISAGTVN FT IKSSSFLETDNLSAEVMPAFEREYLPQQLTLLNSDSEDSEDEQGPAPFYTPPAGTPGHV FT AYTYKPITFLDVDEGDMCLHTLEKVDPLVDNDRYPSHVASFVLAWTRAFVSEWTGFLYD FT EDRGVPLEDRPIKSVYGDTDSLFVTQRGHELMETRGKKRIKKNGGKLVFDPDQPDLTWL FT VECETVCAHCGADAYAPESVFLAPKLYALKSLLCPACGQTSKGKLRAKGHAAEALNYEL FT MVNCYLADAQGADRERFSTSRMSLKRTLASAQPGAHPFTVTETTLKRTLRPWKDRTLAT FT LDAHRLAPYSRSRPNPRNEEVCWIEMP" FT CDS 5092..5661 FT /codon_start=1 FT /product="19.4 kDa protein" FT /db_xref="InterPro:IPR035249" FT /db_xref="UniProtKB/TrEMBL:Q2KSF9" FT /protein_id="AXN73184.1" FT /translation="MGVQRGQGPVLPGSKRPLQGGLRHGEGVRAGLGACEGALQAHPAG FT REPLPIGTLCVGQVAIDHKFVVKRLGRVAFGAELTFGSLPASGTEQGLQGVELGGKEDG FT LGGVGVRAAVGADGFALHEPGEVGLVGVKNQFSTVLFDAFLTSGLHELVSPLGNKEAVR FT VPVDRLYGPVLEWDAAVLVVEEPRPL" FT CDS 6113..6433 FT /codon_start=1 FT /product="11.5 kDa protein" FT /db_xref="InterPro:IPR035153" FT /db_xref="UniProtKB/TrEMBL:Q2KSF8" FT /protein_id="AXN73185.1" FT /translation="MERMVWFFSLSARSLAAMLSCTYSRATHFHSGKTVVSLSGTILTC FT QPRLCRVIRSTLVATSPRRGSLVQQRRPPLREQKGGRGSSISSSGGSASMVKMPGRRSG FT SK" FT gene 7801..13681 FT /gene="L1" FT CDS 7801..8394 FT /codon_start=1 FT /gene="L1" FT /product="DNA-binding protein" FT /db_xref="GOA:Q2KSF7" FT /db_xref="InterPro:IPR004292" FT /db_xref="UniProtKB/TrEMBL:Q2KSF7" FT /protein_id="AXN73186.1" FT /translation="MRADGEELDLLPPVGGMAVDVMKVEMPTARRTFVLVFIQAATVLA FT TLHGMHVLHKLYLGSFDEEFQWEVELWCLHLVLYYVVVVGLALFCLDGGHAYKPAREAG FT PDLGADGSKSKDEGAQAGAVQGPKTLRSQVSGQRRRAVNLHKFFQGAREVQMVLDLHRA FT VSGDVDGLQGPVPLGSDHRPLFLLGGRGRFGFHG" FT CDS complement(join(8391..10310,12132..12140)) FT /codon_start=1 FT /gene="E2B" FT /product="terminal protein precursor" FT /db_xref="GOA:Q2KSF6" FT /db_xref="InterPro:IPR003391" FT /db_xref="UniProtKB/TrEMBL:Q2KSF6" FT /protein_id="AXN73187.1" FT /translation="MALSINDCARLTGQTVPTMNYFLPLRNIWNRVREFPRASTTAAGI FT TWMSRYIYGYHRTMLENLAPGAPATERWPLYRQPPPHFLIGYQYLVRTCNDYIFDSRAY FT SRLKYHELARPGHQTVNWSVMANCSYTINTGAYHRFVDFDDFQTTLTQIQQAILAERVV FT ADLALVQPQRGFGLTRMHGRAGEEEVPVERLMQDYYKDLARCQDHAWGMANRLRIQQAG FT PKDLVLLATIRRLRTAYFNFITSSIAPAHTPPPPEETVLSLPCDCDWLETFVQRFSDPV FT DLETLRSLRGVPTGQLIRCIVSALSLPNGDPPGHLEMHGGVFTLRPREDGRAVTETMRR FT RRGETIERFIDRLPVRRRRRRPPPPPPPPEVEVQEMLVDEEEEVEELPGAFEREVRATI FT AELIRLLEEELTVSARNSQFFNFAVDFYEAMERLEALGDVSEMPLRRWIMYFFVTEHIA FT TTLNYLYQRLCNYAVFARHVELNLAQVVMRARDPEGTVVYSRVWNEAGMNAFSQLMGRI FT SNDLAATVERAGRGDLQEEEIEQFMTEIAYQDNSGDVQEILRQAAVNDTEIDSVELSFR FT FKLTGPVAFTQRRQIQDVNRRVVAHASLLRTQYQNLPARGADVPLPPLPAGPEPPLPPG FT ARPRRRF" FT misc_RNA 10343..10501 FT /gene="L1" FT /product="VA RNA I" FT /note="virus associated RNA I" FT misc_RNA 10564..10664 FT /gene="L1" FT /product="VA RNA II" FT /note="virus associated RNA II" FT CDS 10685..11857 FT /codon_start=1 FT /gene="L1" FT /product="52 kDa protein" FT /db_xref="GOA:Q2KSF5" FT /db_xref="InterPro:IPR004292" FT /db_xref="InterPro:IPR037536" FT /db_xref="UniProtKB/TrEMBL:Q2KSF5" FT /protein_id="AXN73188.1" FT /translation="MHPVLRQMRPHPPPQQQPPPPQQPALLPPPQQQQLPVTTATAAVS FT GAGQSQYDLALEEGEGLARLGASSPERHPRVQMKRDAHEAYVPKQNLFRDRSGEEPEEM FT RAARFHAGRELRCGLDRKRVLRDKDFEADELTGISPARAHVAAANLVTAYEQTVKEESN FT FQKSFNNHVRTLIAREEVTLGLMHLWDLLEAIVQNPTSKPLTAQLFLVVQHSRDNETFR FT EALLNITEPEGRWLLDLVNILQSIVVQERGLPLSEKLAAINFSVLSLGKYYARKIYKTP FT YVPIDKEVKIDGFYMRMTLKVLTLSDDLGVYRNDRMHRAVSASRRRELSDQELMHSLQR FT ALTGAGTDGESYFDMGADLHWQPSRRVLEAAAVPYVEEVDDEDEGEYLED" FT regulatory 11862..11867 FT /gene="L1" FT /note="52 kDa protein" FT /regulatory_class="polyA_signal_sequence" FT CDS 11881..13662 FT /codon_start=1 FT /gene="L1" FT /product="protein IIIa precursor" FT /db_xref="GOA:Q2KSF4" FT /db_xref="InterPro:IPR003479" FT /db_xref="UniProtKB/TrEMBL:Q2KSF4" FT /protein_id="AXN73189.1" FT /translation="MQQQSPPDPAMRAALQSQPSGINSSDDWTQAMQRIMALTTRNPEA FT FRQQPQANRLSAILEAVVPSRSNPTHEKVLAIVNALVENKAIRGDEAGLVYNALLERVA FT RYNSTNVQTNLDRMVTDVREAVAQRERFHRESNLGSMVALNAFLSTQPANVPRGQEDYT FT NFISALRLMVTEVPQSEVYQSGPDYFFQTSRQGLQTVNLSQAFKNLQGLWGVQAPVGDR FT ATVSSLLTPNSRLLLLLVAPFTDSGSINRNSYLGYLINLYREAIGQAHVDEQTYQEITH FT VSRALGQDDPGNLEATLNFLLTNRSQKIPPQYALSAEEERILRYVQQSVGLFLMQEGAT FT PSAALDMTAHNMEPSMYASNRPFINKLMDYLHRAAAMNSDYFTNAILNPHWLPPPGFYT FT GEYDMPDPNDGFLWDDVDSSVFSPRPGANERPLWKKEGSDRRPSSTLSGRTGAAAAVPE FT AASPFPSLPFSLNSVRSSELGRITRPRLLGEEEYLNNSLLRPEREKNFPNNGIESLVDK FT MNRWKTYAQEHRDDPRATQGTASRGSAARKRRWHDRQRGLMWDDEDSADDSSVLDLGGS FT GGGNPFAHLRPRVGRLM" FT regulatory 13676..13681 FT /gene="L1" FT /note="protein IIIa" FT /regulatory_class="polyA_signal_sequence" FT gene 13742..17263 FT /gene="L2" FT CDS 13742..15349 FT /codon_start=1 FT /gene="L2" FT /product="penton protein" FT /note="protein III" FT /db_xref="GOA:Q2KSF3" FT /db_xref="InterPro:IPR002605" FT /db_xref="UniProtKB/TrEMBL:Q2KSF3" FT /protein_id="AXN73190.1" FT /translation="MMRRAYPEGPPPSYESVMQQAMAAAAAIQPPLEAPYVPPRYLAPT FT EGRNSIRYSELTPLYDTTRLYLVDNKSADIASLNYQNDHSNFLTTVVQNNDFTPTEAST FT QTINFDERSRWGGQLKTIMHTNMPNVNQFMYSNKFKARVMVSRKTPNGVTVGDNYDGSQ FT DELKYEWVEFELPEGNFSVTMTIDLMNNAIIDNYLAVGRQNGVLESDIGVKFDTRNFRL FT GWDPVTELVMPGVYTNEAFHPDIVLLPGCGVDFTESRLSNLLGIRKRQPFQEGFQIMYE FT DLDGGNIPALLDVEAYEKSKEESVAAATTAVATASTEVRDDNFASAAAVAAVKADETKS FT KIVIQPVEKDSKERSYNVLSDKKNTAYRSWYLAYNYGDRDKGVRSWTLLTTSDVTCGVE FT QVYWSLPDMMQDPVTFRSTHQVSNYPVVGAELLPVYSKSFFNEQAVYSQQLRAFTSLTH FT VFNRFPENQILVRPPAPTITTVSENVPALTDHGTLPLRSSIRGVQRVTVTDARRRTCPY FT VYKALGIVAPRVLSSRTF" FT regulatory 15352..15357 FT /gene="L2" FT /regulatory_class="polyA_signal_sequence" FT CDS 15353..15934 FT /codon_start=1 FT /gene="L2" FT /product="protein VII precursor" FT /note="major core protein" FT /db_xref="GOA:Q2KSF2" FT /db_xref="InterPro:IPR004912" FT /db_xref="UniProtKB/TrEMBL:Q2KSF2" FT /protein_id="AXN73191.1" FT /translation="MSILISPSNNTGWGLHAPSKMYGGARQRSTQHPVRVRGHFRAPWG FT ALKGRVRSRTTVDDVIDQVVADARNYTPAAAPVSTVDAVIDSVVADARRYARAKSRRRR FT IARRHRSTTAMRAAQALLRRARRTGRRAMLRAARRAASGASAGRTRRRAATAAAAAIAS FT MSRPRRGNVYWVRDAATGVRVPVRTRPPRT" FT CDS 15982..17007 FT /codon_start=1 FT /gene="L2" FT /product="protein V precursor" FT /note="minor core protein" FT /db_xref="GOA:A0A3G1S5W7" FT /db_xref="InterPro:IPR005608" FT /db_xref="UniProtKB/TrEMBL:A0A3G1S5W7" FT /protein_id="AXN73192.1" FT /translation="MSKRRFKEEMLQVIAPEIYGPAAAVKDERNPRKIKRVKKDKKEED FT NVDDMVKFVREFAPRRRVQWRGRKVRPVLRPGTTVVFAPGERSGTTSKRSYDEVYGDED FT ILEQAAERLGEFAYGKRNRLAPLKEEVVSIPLDHGNPTPSLKPVTLQQVLPNAAPRQGL FT KRQGEDVYPTMQLMVPKRQKLEDVLETMKVDPDAQPEVKVRPIKQVAPGLGVQTVDIKI FT PTEPMETQTEVVKPITSTMEVQTDPWMPVVPRKPRRKYGATSLLMPNYALHPSIIPTPG FT YRGTRFYHGYTSSRRRKTTTRRRRRRTATTPADALVRRVYRRGRAPLTLPRARYHPSIA FT I" FT CDS 17030..17263 FT /codon_start=1 FT /gene="L2" FT /product="protein X" FT /note="protein mu" FT /db_xref="GOA:Q5GFB0" FT /db_xref="InterPro:IPR008393" FT /db_xref="UniProtKB/TrEMBL:Q5GFB0" FT /protein_id="AXN73193.1" FT /translation="MALTCRIRVPITGYRGRKPRRRRLAGSGMRRHPHRRRRAISKRLG FT GGFLPALIPIIAAAIGAIPGIASVAVQASQRH" FT gene 17295..21645 FT /gene="L3" FT CDS 17295..18053 FT /codon_start=1 FT /gene="L3" FT /product="protein VI precursor" FT /note="hexon-associated protein" FT /db_xref="GOA:A0A3G1S5T5" FT /db_xref="InterPro:IPR004243" FT /db_xref="UniProtKB/TrEMBL:A0A3G1S5T5" FT /protein_id="AXN73194.1" FT /translation="MDSDAPGPVMCFRRQMEDINFSSLAPRHGTRPFMGTWSDIGTSQL FT NGGAFNWSSLWSGLKNFGSTLKTYGSKAWNSTTGQALRDKLKEQNFQQKVVDGLASGIN FT GVVDLANQAVQRQINSRLDPVPPVGSVEEELPPLDKRGDKRPRPDAEETLLTHTDEPPP FT YEEAVKLGLPTTRPIAPVATGVLNPQSSKPATLELPPPPTPRPSTVAKPLPPVAVARAG FT SGARPQANWQSTLNSIVGLGVQSVKRRRCY" FT CDS 18160..20970 FT /codon_start=1 FT /gene="L3" FT /product="hexon" FT /note="protein II" FT /db_xref="GOA:Q2KSE8" FT /db_xref="InterPro:IPR016107" FT /db_xref="InterPro:IPR016108" FT /db_xref="InterPro:IPR016111" FT /db_xref="InterPro:IPR016112" FT /db_xref="InterPro:IPR037542" FT /db_xref="UniProtKB/TrEMBL:Q2KSE8" FT /protein_id="AXN73195.1" FT /translation="MATPSMLPQWAYMHIAGQDASEYLSPGLVQFARATDTYFSLGNKF FT RNPTVAPTHDVTTDRSQRLTLRFVPVDREDNTYSYKVRYTLAVGDNRVLDMASTYFDIR FT GVLDRGPSFKPYSGTAYNSLAPKGAPNTCQWKDANSKMHTFGVAAMPGVTGKKIEADGL FT PIRIDSTSGTDTVIYADKTFQPEPQVGNDSWVDTNDAEEKYGGRALKDTTNMKPCYGSF FT AKPTNKEGGQANLKDSETAATTPNYDIDLAFFDGKNIVANYDPDIVMYTENVDLQTPDT FT HIVYKPGKEDTSSESNLGQQAMPNRPNYIGFRDNFIGLMYYNSTGNMGVLAGQASQLNA FT VVDLQDRNTELSYQLLLDSLGDRTRYFSMWNQAVDSYDPDVRIIENHGVEDELPNYCFP FT LNGVGLTDTYQGVKVKTDAGSEKWDKDDTTVSTANEIHVGNPFAMEINIQANLWRNFLY FT ANVALYLPDKYKYTPANITLPTNTNTYEYMNGRVVAPSLVDAYINIGARWSLDPMDNVN FT PFNHHRNAGLRYRSMLLGNGRYVPFHIQVPQKFFAIKNLLLLPGSYTYEWNFRKDVNMI FT LQSSLGNDLRTDGASITFTSINLYATFFPMAHNTASTLEAMLRNDTNDQSFNDYLSAAN FT MLYPIPANATNVPISIPSRNWAAFRGWSFTRLKTKETPSLGSGFDPYFVYSGSIPYLDG FT TFYLNHTFKKVSITFDSSVSWPGNDRLLTPNEFEIKRTVDGEGYNVAQCNMTKDWFLVQ FT MLAHYNIGYQGFYVPEGYKDRMYSFFRNFQPMSRQVVDEVNYKDYQAVTLAYQHNNSGF FT VGYLAPTMRRGQPYPANYPYPLIGKSAVTSVTQKKFICDRVMWRIPFSSNFMSMGALTD FT LGQNMLYANSAHALDMNFEVDPMDESTLLYVVFEVFDVVRVHQPHRGVIEAVYLRTPFS FT AGNATT" FT CDS 20994..21614 FT /codon_start=1 FT /gene="L3" FT /product="23 kDa protease" FT /db_xref="GOA:Q2KSE7" FT /db_xref="InterPro:IPR000855" FT /db_xref="InterPro:IPR038765" FT /db_xref="UniProtKB/TrEMBL:Q2KSE7" FT /protein_id="AXN73196.1" FT /translation="MAAGSGEQELRAIIRDLGCGPYFLGTFDKRFPGFMAPHKLACAIV FT NTAGRETGGEHWLAFAWNPRSNTCYLFDPFGFSDERLKQIYQFEYEGLLRRSALATEDR FT CVTLEKSTQTVQGPRSAACGLFCCMFLHAFVHWPDRPMDKNPTMNLLTGVPNGMLQSPQ FT VESTLLRNQEALYRFLNSHSAYFRCHRARIQKATAFDRMNQDM" FT regulatory 21640..21645 FT /gene="L3" FT /regulatory_class="polyA_signal_sequence" FT gene complement(21682..23221) FT /gene="E2A" FT regulatory complement(21682..21687) FT /gene="E2A" FT /regulatory_class="polyA_signal_sequence" FT CDS complement(21689..23221) FT /codon_start=1 FT /gene="E2A" FT /product="DNA binding protein" FT /db_xref="GOA:Q2KSE6" FT /db_xref="InterPro:IPR003176" FT /db_xref="InterPro:IPR005376" FT /db_xref="InterPro:IPR036362" FT /db_xref="InterPro:IPR036367" FT /db_xref="InterPro:IPR036368" FT /db_xref="InterPro:IPR037540" FT /db_xref="UniProtKB/TrEMBL:Q2KSE6" FT /protein_id="AXN73197.1" FT /translation="MAGRGGSQSERRRERTPDRGRGSASHPPDRESPSPPPLPLKRHTY FT RRMASDQEEEIVVVSENSRSPSPQEQPPPSPPKKKPRKTKHVPLQDVSQDSENEREAEE FT ELAAVGFSYPPVRITEKDGKRSFETLNDNDPLVKAASAKMAVMNPLSLPIVSAWEKGME FT IMNMLMERYRVESDLKSNFQLMPEQGEVYRRICHLYINEEHRGIPLTFTSNKTLTTMMG FT RFLQGFVHAHSQIAHKNWESTGCALWLHGCTEVEGKLRCLHGTAMIHKEHMIEMDVASE FT NGQRALKENPDRAKITQNRWGRSVVQLANNDARCCVHDAGCATNQFSSKSCGVFFTEGA FT KAQQAFKQLEAFMKAMYPGMNAAQAQMMLIPLHCDCNHKPGCVPTMGRQTCKMTPFGMA FT NAEDLDVDGITDATVLASVKHPALMVFQCCNPVYRNSRAQNAGPNCDFKISAPDLLGAL FT QLTRKLWTDSFPDTLLPKLLIPEFKWLAKYQFRNVSLPAGHAETSRQNPFDF" FT regulatory complement(23149..23188) FT /gene="E2A" FT /regulatory_class="promoter" FT gene 23190..26888 FT /gene="L4" FT regulatory 23190..23229 FT /gene="L4" FT /regulatory_class="promoter" FT CDS 23250..25613 FT /codon_start=1 FT /gene="L4" FT /product="100 kDa hexon-assembly associated protein" FT /db_xref="GOA:Q2KSE5" FT /db_xref="InterPro:IPR003381" FT /db_xref="UniProtKB/TrEMBL:Q2KSE5" FT /protein_id="AXN73198.1" FT /translation="METQPSSPTLPSAPTADNKQQQNESLTAPPPSPAISDAAPDMQEM FT EESIEIDLGYVTPAEHEEELAVRFSTQEEIHQEQPEQEAESERDYLHLSGGEDALIKHL FT ARQAIIVKDALLDRTEVPLSVEELSRAYELNLFSPRVPPKRQPNGTCEPNPRLNFYPVF FT AVPEALATYHIFFKNQRIPVSCRANRTRADALFDLGPGARIPDIASLEEVPKIFEGLGS FT DETRAANALQGEGGEHEHHSALVELEGDNARLAVLKRTIELTHFAYPALNLPPKVMSTV FT MDQVLIKRASPISKEMQDPESSEEGKPVVSDEQLARWLGPQASPQSLEERRKLIMAVVL FT VTAELECLRRFFADAETLRKVEENLHYIFRHGFVRQACKISNVELTNLVSYMGILHENR FT LGQNVLHTTLRGEARRDYIRDCVYLYLCHTWQTAMGVWQQCLEEQNLKELCKLLQKNLK FT ALWTGFDERTTASDLANLIFPERLRLTLRNGLPDFMSQSMLQNFRSFILERSGILPATC FT SALPSDFVPLTFRECPPPLWSHCYLLRLANYLAYHSDVIQDVSGEGLLECHCRCNLCTP FT HRSLVCNPQLLSETQIIGTFELQGPGDEGSAAKGGLKLTPGLWTSAYLRKFVPQDYHPF FT EIRFYEDQSQPPKAELTACVITQGAILAQLQAIQKSRQEFLLKKGRGVYLDPQTGEELN FT PGFPQDAQRKQQEAESGAAAREGFGGKLGKQSGRGGGDGRLGQHSGRGGQPARQSGRRG FT GGRSSRRQTVVLGGESKQHGYHLRSGSGSRSTPQ" FT CDS join(25348..25653,25823..26137) FT /codon_start=1 FT /gene="L4" FT /product="33 kDa protein" FT /db_xref="GOA:A0A3G1S5Y1" FT /db_xref="InterPro:IPR021304" FT /db_xref="UniProtKB/TrEMBL:A0A3G1S5Y1" FT /protein_id="AXN73199.1" FT /translation="MPRGSSKKLKVELPPVKDLEENWESSQAEEEEMEDWDSTQAEEDS FT LQDSLEDEEVEEAVAARPSSLAEKASSTDTISAPGRGPARPHSRWDETGRFPNPTTQTA FT PTTLKKTKPAARKFTAAAAAGGLRIAANEPVQTRELRNRIFPTLYAIFQQSRGQEQELK FT VKNRSLRSLTRSCLYHKSEDQLQRTLEDAEALFNKYCALTLKE" FT CDS 25348..25863 FT /codon_start=1 FT /gene="L4" FT /product="22 kDa encapsidation protein" FT /db_xref="GOA:A0A3G1S5X8" FT /db_xref="InterPro:IPR021304" FT /db_xref="UniProtKB/TrEMBL:A0A3G1S5X8" FT /protein_id="AXN73200.1" FT /translation="MPRGSSKKLKVELPPVKDLEENWESSQAEEEEMEDWDSTQAEEDS FT LQDSLEDEEVEEAVAARPSSLAEKASSTDTISAPGRGPARPHSRWDETGRFPNPTTQTG FT KKERQGYKSWRGHKNAIVSCLQACGGNISFTRRYLLFHRGVNFPRNVLHYYRHLHSPYY FT FEEDKTSS" FT CDS 26205..26888 FT /codon_start=1 FT /gene="L4" FT /product="protein VIII" FT /db_xref="GOA:Q2KSE2" FT /db_xref="InterPro:IPR000646" FT /db_xref="UniProtKB/TrEMBL:Q2KSE2" FT /protein_id="AXN73201.1" FT /translation="MSKEIPTPYMWSYQPQMGLAAGAAQDYSTRMNWLSAGPGMISRVN FT DIRAHRNQILLKQSALTATPRNHLNPRNWPATLVYQEIPQPTTVLLPRDAQAEVQLTNS FT GVQLAGGATLCRHHPPQGIKRLVIRGRGTQLNDEVVSSSLGLRPDGVFQIAGSGRSSFT FT PRQAVLTLESSSSQPRSGGIGTLQFVEEFTPSVYFNPFSGSPGHYPDEFIPNFDAISES FT VDGYD" FT gene 26889..31264 FT /gene="E3" FT CDS 26889..27209 FT /codon_start=1 FT /gene="E3" FT /product="12.1 kDa protein" FT /db_xref="InterPro:IPR007912" FT /db_xref="UniProtKB/TrEMBL:Q2KSE1" FT /protein_id="AXN73202.1" FT /translation="MSHAGAADLARLRHLDHCRRFRCFARDLVEFTYFELPEEHPQGPA FT HGVRIVIEGGLDSHLLRIFSQRPILVERQQGNTLLTLYCICNHPGLHESLCCLLCTEYN FT KS" FT CDS 27163..27795 FT /codon_start=1 FT /gene="E3" FT /product="23.3 kDa protein" FT /db_xref="GOA:Q2KSP7" FT /db_xref="InterPro:IPR009266" FT /db_xref="InterPro:IPR026472" FT /db_xref="UniProtKB/TrEMBL:Q2KSP7" FT /protein_id="AXN73203.1" FT /translation="MKVFVVCCVLSIIKAEISDYSGLHCIPASFNQSLTFTGNETELHL FT QCKPHKKYLTWLYQGFPFAVVNHCYNDGVLLNGPANLTFSTRRSKLLLFRPFLPGIYQC FT VSGPCHHTFHLIPNTTYSPAPLPTNNQTNHHQRYRRDLVESNTTHTGGELRGPQSSGIY FT YGPWEVVGLIALGLVAGGLLALCYLYLPCFSYLVVLCCWFKKWGRSP" FT CDS 27777..28301 FT /codon_start=1 FT /gene="E3" FT /product="19 kDa protein" FT /db_xref="GOA:A0A3G1S5U4" FT /db_xref="InterPro:IPR006965" FT /db_xref="InterPro:IPR038710" FT /db_xref="UniProtKB/TrEMBL:A0A3G1S5U4" FT /protein_id="AXN73204.1" FT /translation="MGKITLVCGVLVAVLLILGLGSTAVVTEKADPCLTFNPDKCQLSF FT QPDGNRCAVLIKCGWECNSVVIHYKNKTRNNTLASTWQPGDPEWYTVSVPGADGSLRTV FT NNTFIFKHMCNTAMFMSRQYDMWPPRKENIVVFSIAYSLCTVLITAIVCLSIHMLIAIR FT PRNNAEKEKQP" FT CDS 28331..28966 FT /codon_start=1 FT /gene="E3" FT /product="24.8 kDa protein" FT /db_xref="GOA:Q2KSD8" FT /db_xref="InterPro:IPR003470" FT /db_xref="InterPro:IPR003471" FT /db_xref="UniProtKB/TrEMBL:Q2KSD8" FT /protein_id="AXN73205.1" FT /translation="MASVTALIYFLGLLGSISSFDHKNITAYVGSNCVLTGYQSHQRVS FT WYWFDKKNTAYTLCKGYHQPTQRGGLYYSCTNNNITLLQVTKQYSGTYYGTNFNTKQDT FT YYSVEVLDPTTPRTKTTKLTTSTTLAMTTHTKLTSQATTENELVALTQNGENSSSNPLP FT TTPSEKIPRSMIGIIAAVVVCMVIIILCMMYYACYYRKHRLNNKLDPY" FT CDS 29276..30085 FT /codon_start=1 FT /gene="E3" FT /product="29.7 kDa protein" FT /db_xref="GOA:Q2KSD7" FT /db_xref="InterPro:IPR003471" FT /db_xref="UniProtKB/TrEMBL:Q2KSD7" FT /protein_id="AXN73206.1" FT /translation="MNALSTLVFLTLIGFVFSNPIPRVSFIKLVNVTEGGNVILVGVEG FT AKNTTWTKYHLNGWKNICNWSVTVYTCEGVNLIIANATSAQNGRIQGQSVSDSNGYYTQ FT HTFIYDIKVIPLPTPSPPSTTLTEPTTATTAEASSSSRIQMAFLLPPSSSPTASTNKQS FT TKFLSITKSHTTATSRAFSSTANLTSLSSTPASFPTPLKQTQGGLQWQITLLIVIGVVL FT LAVLLYFIFCRRIPNAKPVYKPIVIGQPEPLQVDGGLRNLLFSFTVW" FT CDS 30094..30369 FT /codon_start=1 FT /gene="E3" FT /product="10.4 kDa protein" FT /db_xref="GOA:Q8BEL1" FT /db_xref="InterPro:IPR005041" FT /db_xref="UniProtKB/TrEMBL:Q8BEL1" FT /protein_id="AXN73207.1" FT /translation="MIPRQFFIIGLLCALQVCATLALVANASPDCIGPFASYVLFAFIT FT CICCCSIVCLLITFFQFVDWVFVRIAYLRHHPQYRDQRVAQLLRLI" FT CDS 30375..30815 FT /codon_start=1 FT /gene="E3" FT /product="14.5 kDa protein" FT /db_xref="GOA:Q2KSD5" FT /db_xref="InterPro:IPR008131" FT /db_xref="UniProtKB/TrEMBL:Q2KSD5" FT /protein_id="AXN73208.1" FT /translation="MRALLLLTLLLLLAPLVAPFPLKSPTQSPKEVRKCKFQEPWKFLK FT CYQLKSDMHPSWIIIMGIVNILACTLFSFVIYPRFDFGWNAPKALWLPPAPDTPPQQQQ FT NQAHAPPPQPRPQYMPILDYEAEPQQAMLPAISYFNLTGEDD" FT CDS 30808..31209 FT /codon_start=1 FT /gene="E3" FT /product="14.7 kDa protein" FT /db_xref="GOA:Q2KSD4" FT /db_xref="InterPro:IPR004985" FT /db_xref="UniProtKB/TrEMBL:Q2KSD4" FT /protein_id="AXN73209.1" FT /translation="MTDPLNNTVNDLLDMDGRGSEQRLAQLRIRQQQERAVKELQDAVA FT IHQCKKGIFCLVKQAKISFEVTSTDHRLSYELLQQRQKFTCLVGVNPIVITQQSGDTKG FT CIHCSCNSSECVYTLIKTLCGLRDLLPIN" FT regulatory 31259..31264 FT /gene="E3" FT /regulatory_class="polyA_signal_sequence" FT gene 31444..32721 FT /gene="L5" FT CDS 31444..32721 FT /codon_start=1 FT /gene="L5" FT /product="fiber protein" FT /db_xref="GOA:A0A3G1S679" FT /db_xref="InterPro:IPR000931" FT /db_xref="InterPro:IPR000939" FT /db_xref="InterPro:IPR000978" FT /db_xref="InterPro:IPR008982" FT /db_xref="InterPro:IPR009013" FT /db_xref="UniProtKB/TrEMBL:A0A3G1S679" FT /protein_id="AXN73210.1" FT /translation="MSKKRARVDDGFDPVYPYDADNAPTMPFINPPFVSSNGFQEKPLG FT VLSLRLADPVTTKNGEITLNLGEGVDLDDSGKLIANTVNKAIAPLSFSNNTISLNMDTP FT LYTKDGKLSLQVSPPLSILRSTILNTLALAFGSGLGLRGSALAVQLASPLTFDDKGNIK FT ITLNRGLHVTTGNAIESNISWAKGIKFEDGAIATNIGKGLEFGTSSTETGVNNAYPIQV FT KLGSGLSFDSTGAIMAGNKDYDKLTLWTTPDPSPNCQILAENDAKLTLCLTKCDSQILA FT TVSVLVVRSGNLNPITGTVSSAQVFLRFDANGVLLTEHSTLKKYWGYRQGDSIDGTPYT FT NAVGFMPNSTAYPKTQSSTTKNNIVGQVYMNGDVSKPMLLTITLNGTDDTTSAYSISFS FT YTWTNGSYIGATFGANSYTFSYIAQE" FT regulatory 32781..32786 FT /gene="L5" FT /regulatory_class="polyA_signal_sequence" FT regulatory complement(32797..32802) FT /gene="E4" FT /regulatory_class="polyA_signal_sequence" FT gene complement(32817..35405) FT /gene="E4" FT CDS complement(join(32817..33065,33791..33964)) FT /codon_start=1 FT /gene="E4" FT /product="15.9 kDa protein" FT /db_xref="UniProtKB/TrEMBL:Q2KSD2" FT /protein_id="AXN73211.1" FT /translation="MSGNSSIMTRSRTRLALSCHHPYQP