ID KC307877; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307877; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV13 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; b0b7c5361444a1098de13d2545664c66. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV13" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYN3" FT /protein_id="AGL33883.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSINNETPGXRYQYNVLPQGWKGSPAIFQHSMTRILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRXKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 257 A; 105 C; 139 G; 150 T; 6 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgagg agatggagaa ggaagggaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagyacta agtggagaaa actagtagat 180 ttycgggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaag tatactgcat tcaccatacc tagtataaay 360 aatgaaacac cagggrttag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccagcatag catgacaaga attttagagc cctttagagc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agarataggg 540 caacatagar caaaaataga ggaattaaga gaacatctgt tgaggtgggg atttaccaca 600 ccagacaaga aacatcaaaa ggaacctccg tttctttgga tggggtatga actccat 657 // ID KC307878; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307878; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV14 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 11e39280f45ff1da080ab0175a0a2a19. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV14" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQ44" FT /protein_id="AGL33884.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEREGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAXNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLXWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 262 A; 106 C; 132 G; 149 T; 8 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggaaag agaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tcaataaaag aactcargat ttytgggaag tccaattagg aataccacac 240 ccagcagggt taaaraaraa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcagaaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac caggcattag atatcaatat aacgtgcttc cacaaggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttrgagc cctttagagc amaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 cagcatagag caaaaataga ggaattaaga garcatctgt taargtgggg gtttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307879; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307879; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV15 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 49f59ab5d748430e4a588a64b6c55710. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV15" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MS44" FT /protein_id="AGL33885.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT XEXFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLXWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 259 A; 106 C; 135 G; 148 T; 9 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgagg aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggarc tcaataaaag aactcaagat ttttgggaag tycaattagg aataccacac 240 ccagcagggt taaaaaagaa aaartcagtg acagtactgg atgtggggga tgcatatttt 300 tcagtmcctt takatgaagr cttcagraag tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac caggaattag atatcaatat aatgtgcttc cacagggatg gaaaggatcc 420 ccagcaatat tccagagtag catgacaaaa atcttagagc cctttagggc acagaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgattt agaaataggg 540 caacatagag caaaaataga ggarttaaga gaacatctgt taasgtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307880; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307880; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV16 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 1dc3e83bcbb8e5467588c5a3230fe9db. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV16" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0M3" FT /protein_id="AGL33886.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAVFQASMTKILEPFRAXNPEIV FT IYQYMDDLYVGSDLEIGQXRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 107 C; 138 G; 147 T; 2 other; gatggcccaa aggttaaaca atggccatta acagaagaga aaataaaagc attaacagaa 60 atttgtgatg agatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaaa gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccggcagggt taaaaaagaa aaaatcagtg acagtactag atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaag tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacaaggatg gaaaggatcg 420 ccagcagtat tccaggctag catgacaaaa atcttagagc cctttagagc amaaaatcca 480 gagatagtca tctatcaata tatggatgac ttgtatgtag ggtctgactt agaaataggg 540 caacamagag caaaaataga ggaattaaga gaacatctgt taaagtgggg attcaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctgtgga tggggtatga actccat 657 // ID KC307881; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307881; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV17 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; d6d4420fc5937599145c36d2caec0dd5. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV17" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2D4" FT /protein_id="AGL33887.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKXKKSVTVLDVGDAYFSVPL FT HEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILEPFRARNPEJV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 255 A; 109 C; 138 G; 149 T; 6 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgygatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga raatccatat 120 aacactccaa tatttgccat aaaaaagaag gatagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttctgggarg ttcaattagg aataccacac 240 ccagcagggt taaaamagaa aaaatcagtg acagtrctgg atgtggggga tgcatatttt 300 tcagtccctt tacatgaaga cttcaggaaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag gtatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcagtgtag catgacaaaa atcttagagc cctttagggc acgaaatcca 480 gaamtagtca tctatcaata tatggatgac ttgtatgtag gctctgactt agaaataggg 540 caacacagag caaaaataga ggagttaaga gaacatctgt taaagtgggg attcaccaca 600 ccagacaaga aacatcagaa agaacctcca tttctttgga tggggtatga actccat 657 // ID KC307882; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307882; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV18 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 292167c65ea7de9adae7058b7d7b6d18. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV18" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYP0" FT /protein_id="AGL33888.1" FT /translation="DGPKVKQWPLTEEKITALKAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMIKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLXWGLTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 262 A; 103 C; 133 G; 149 T; 10 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataacagc attaaaagca 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aatactccaa tatttgccat aaaaaagaag gayagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aacycaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt traaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaar tatactgcat tcaccatacc yagtacaaac 360 aatgaaacac cagggattag atatcaatay aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcaggctag catgataaaa atcttagagc cctttagrgc acaaaatcca 480 gaaatagtca tytatcaata tatggatgac ttgtatgtag gatctgactt agaaatagga 540 caacatagag caaaaataga ggaattaaga gaacatctgt taakgtgggg actyaccaca 600 ccagataaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307883; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307883; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV19 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; e611d51d4cc4eb5000ae958de8b4f1ef. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV19" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQ53" FT /protein_id="AGL33889.1" FT /translation="DGPKVKQWPLTAEKIEALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT XEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQYSMTKILEPFRXQNPEMV FT IYQYMDDLYVGXDLEIGQHRXKIEELREHLLKWGFTTPDKKHQKKPPFLWMGYELH" XX SQ Sequence 657 BP; 261 A; 104 C; 131 G; 150 T; 11 other; gatggcccaa aggttaaaca atggccattg acagcagaga aaatagaagc attaacagca 60 atttgtgagg aaatggaaaa ggaaggraaa attacaaaaa ttgggcctga aaatccrtat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tycagttagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactrg atgtggggga tgcatatttt 300 tcagttcctt tayatgaaga cttcagraaa tatactgcat tcactatacc tagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcagtatag catgacaaaa atcttagagc cctttaggrc acaaaatcca 480 gaaatggtca tctatcaata catggatgac ttgtatgtag gatytgactt agaaataggg 540 caacatagar caaaaataga rgaattaaga gaacatctgt taaartgggg atttaccaca 600 ccagacaaga aacatcaaaa gaaaccccca tttctttgga tggggtatga actccat 657 // ID KC307884; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307884; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV20 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 39cf0a6b317b375ffa33744c37c83c41. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV20" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MS51" FT /protein_id="AGL33890.1" FT /translation="DGPKVKQWPLTKEKIEALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGVRYQYNVLPQGWKGSPAIFQHSMVKILEPFRAQNPEII FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 109 C; 135 G; 149 T; 1 other; gatggcccaa aggttaaaca gtggccattg acaaaggaaa aaatagaagc attaacagca 60 atttgtgatg aaatggagaa agaaggaaaa attacaaaaa ttgggcctga aaatccctat 120 aayactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tcaataaaag aactcaagac ttttgggaag tccagttagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag acgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaag tatactgcat tcaccatacc tagtacaaat 360 aatgaaacac caggggttag atatcaatat aatgtgcttc cacagggatg gaagggatca 420 ccagcaatat tccagcatag catggtaaaa atcttagagc cctttagagc acaaaatcca 480 gaaataatca tctatcaata catggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga agaactaaga gaacatctgt taagatgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307885; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307885; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV21 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; a2386dce8e843c2b16bc7e06bf2c8461. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV21" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0M7" FT /protein_id="AGL33891.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT HEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQSSMTRILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHREKXEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 262 A; 102 C; 135 G; 149 T; 9 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aratggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gayagtacta artggagaaa attagtagat 180 ttyagggaac tyaataaaag aactcaagat ttttgggaag tccaattagg aataccacay 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tacatgaaga cttcaggaag tatactgcat tcaccatacc yagtaggaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggrtg gaaaggatca 420 ccagcaatat tccagagtag catgacaaga atcttagagc cctttagagc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag aaaaarttga ggaattaaga caacatctat tgaggtgggg atttaccaca 600 ccagacaaga aacatcaaaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307886; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307886; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV23 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 0e6c64508d41242b8889033229596283. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV23" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2E0" FT /protein_id="AGL33892.1" FT /translation="DGPKVKQWPLTEEKIKALTEICNEMEKEGKISKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKQKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLNIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 104 C; 131 G; 151 T; 8 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagaa 60 atttgtaatg aaatggagaa ggaaggaaaa atttcaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgctat aaaaaagaaa gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aatwccacac 240 ccagcagggt taaagcagaa raaatcagtg acagtactag atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga yttcaggaag tatactgcat tcaccatacc tagtataaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tycaggcyag catgacaaaa atyttagagc cctttagagc acaaaatcca 480 gaaatagtca tctatcaata yatggatgac ttgtatgtrg gatctgactt aaacataggg 540 caacatagag caaaaataga ggaattaaga gaacatctgt taaagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307887; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307887; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV25 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; d19072fe398606978638ecbdd15b082d. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV25" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYP7" FT /protein_id="AGL33893.1" FT /translation="DGPRVKQWPLTEEKIKALTXICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT DESFRKYTAFTIPSXNNETPGIRYQYNVLPQGWKGSPAIFQASMXKILEPFRAENPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 257 A; 101 C; 140 G; 147 T; 12 other; gatggcccaa gggttaaaca atggccattg acagaagaga aaataaaagc attaacagsa 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tcaataaaag aactcaagat ttttgggagg tycaattagg aataccacac 240 ccggcagggt taaaaaagaa aaagtcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttccwt tagatgaaag cttcaggaaa tatactgcat tcaccatacc tagtrbaaac 360 aatgaaacac cagggattag atatcaatay aatgtgctyc crcagggatg gaaaggatca 420 ccagcaatat tccaggctag tatgayaaaa attttagagc cctttagagc agaaaatcca 480 gaaatagtya tctaccaata tatggatgac ttgtatgtag gatctgacct rgaaataggr 540 caacatagag caaaaataga ggaattaaga gaacatctgt taaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307888; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307888; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV26 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 074e0d941aedfd2132e4da1d3d46cc68. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV26" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQ59" FT /protein_id="AGL33894.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEXEGKISKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEXFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGZHRAKIEXLRKHLLGWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 265 A; 99 C; 131 G; 153 T; 9 other; gatggcccaa aggttaaaca atggccattg acagaagara aaataaaagc attaacagaa 60 atttgtgatg aaatggaaar agaaggaaar atttcaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tcaataaaag aactcaagat ttttgggaag tycaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga mttcagaaag tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacaaggatg gaaaggatca 420 ccagcaatat tccagtgtag catgacaaaa atcttagagc cttttagggc acaaaatcca 480 gaaatagtca tytatcaata tatggatgac ttgtatgtag gatcagaytt agaaataggg 540 saacatagag caaaaataga agamttaaga aaacatctgc tagggtgggg atttaccacc 600 ccagataaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307889; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307889; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV27 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 58871dc03e9cac5a23914111a5f4831e. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV27" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MS58" FT /protein_id="AGL33895.1" FT /translation="DGPKVKQWPLTEEKIKALTEICNEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT HEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQHSMTKILEPFRAQNPDIV FT IYQYMDDLYVGSDLEIGQHRAKIEELRKHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 265 A; 106 C; 131 G; 146 T; 9 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtaatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tcaacaaaag aactcaagat ttytgggaag tmcaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtgytag atgtgggaga tgcatatttt 300 tcagttcctt tacatgaaga cttcaggaag tatactgcat tcaccatacc tagcataaac 360 aatgaaacac cagggattag atatcaatat aatgtgctmc cacagggatg gaaaggatca 420 ccagcaatat tccagcatag yatgacaaaa atcttagarc cctttagrgc acaaaatcca 480 gacatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agagataggg 540 caacatagag caaaaataga ggarytaaga aaacatctgt taaagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307890; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307890; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV28 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; ebc20a2196e75633fee25337b223de46. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV28" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0N2" FT /protein_id="AGL33896.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSCPL FT YEDFRKYTAFTIPSVNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIKQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 105 C; 135 G; 152 T; 2 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggaaaa ggaaggaaag attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta artggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag ttcagttagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcatgtcctt tatatgagga cttcaggaag tatactgcat tcaccatacc tagtgtaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagagc cctttagrgc acagaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataaag 540 caacatagag caaaaataga agaattaaga gaacatctgt taaagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307891; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307891; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV29 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; f7b423b9c1956e5452edeeb1033f3dfe. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV29" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2E8" FT /protein_id="AGL33897.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKRSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGXHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 258 A; 106 C; 139 G; 147 T; 7 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgagg aaatggagaa agaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaar gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaggat ttttgggagg tccaattagg aataccacac 240 ccagcagggt taaagaaaaa gagrtcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcagraaa tatactgcat tcaccatacc cagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgctgc cacagggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagagc cctttagrgc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgacyt agaaataggg 540 maacatagag caaaaataga rgaattaaga gaacatctgt tgaggtgggg atttaccaca 600 ccagataaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307892; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307892; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10HDRCV14 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 3e8911dac750e75a9bb84ed9b6b09c4d. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10HDRCV14" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYQ5" FT /protein_id="AGL33898.1" FT /translation="DGPKVKQWHLKEEKNKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGXRYQYNVLPQGWKGSPAIFQSSMTKILEPFKTKNPEMV FT IYQYMDDLYVGSDLEIGXHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 267 A; 102 C; 134 G; 150 T; 4 other; gatggcccaa aggttaaaca atggcatttg aaggaagaga aaaataaagc attaacagca 60 atttgtgaag aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatac 120 aatactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagagaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tgtatgaaga ytttaggaaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac caggarttag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccagagtag catgacaaaa atyttagagc cctttaagac aaaaaatcca 480 gaaatggtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 cwacatagag caaaaataga agaattaaga gaacatctgt taaagtgggg atttaccaca 600 ccagacaaga agcatcagaa agaaccccca tttctctgga tggggtatga actccat 657 // ID KC307893; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307893; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRCV19 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 080e6718b97aa7eb486d85750e17cf87. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRCV19" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQ66" FT /protein_id="AGL33899.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAKNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLQWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 262 A; 108 C; 136 G; 147 T; 4 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacya agtggagaaa attagtagay 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt tgaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcytatttc 300 tcagttcctt tatatgarga cttcaggaaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacaaggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagagc cctttagagc aaaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtgg gatctgactt ggaaataggg 540 caacatagag caaaaataga agagttaaga gaacatctgt tacagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307894; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307894; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRCV26 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 087a0d9c238dd2015f2e8b47005cbbc0. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRCV26" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MS64" FT /protein_id="AGL33900.1" FT /translation="DGPKVKQWPLTEEKIKALTEICNEMEKEGKISKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSIPL FT CEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPEIX FT IYQYMDDLYVGSDLEIGQHREKIEELREHLLKWGLTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 268 A; 105 C; 128 G; 148 T; 8 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagaa 60 atttgtaatg aaatggaaaa ggaaggaaaa atttcaaaaa ttggrcctga aaatccatat 120 aacactccaa tatttgcyat aaaaaagaaa gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggarg tccaattagg aataccacac 240 ccagcagggt tgaaaaagaa aaaatcagtg acagtactag atgtggggga tgcatatttt 300 tcaattcctt tatgtgaaga cttcaggaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggataag rtatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tycaggctag catgacaaaa atcttagaac cctttagggc acaaaaccca 480 gaaatartca tctatcaata tatggatgac ttgtatgtag gatctgactt agaratagga 540 carcatagag aaaaaataga agaattaaga gaacatctgt tgaaatgggg acttaccaca 600 ccagacaaga agcatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307895; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307895; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRCV28 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 767424f0d09b73b60e15bb6ecdb69152. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRCV28" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0N7" FT /protein_id="AGL33901.1" FT /translation="DGPRVKQWPLTEEKIKALTAICEEMEKEGKITXIGPENPYNTPXF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMIKILEPFRTQNPEIV FT IYQYMDDLYVGSDLEIGQHRKKIEDLREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 261 A; 104 C; 134 G; 151 T; 7 other; gatggcccaa gggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgagg aaatggagaa ggaaggaaaa attacaaraa ttgggcctga aaatccatat 120 aacactccar tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagay ttttgggaag ttcaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactrg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcagaaar tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac caggrattag gtatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatmt tccaggctag catgataaaa atcttagagc cctttaggac acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatcagactt agaaataggg 540 caacatagaa aaaagataga ggatttaaga gaacatctgt taaagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaacctcca tttctttgga tggggtatga actccat 657 // ID KC307896; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307896; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRCV29 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 9f3ea80dca16dfdbb8eebc88aa3a45ca. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRCV29" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2F4" FT /protein_id="AGL33902.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT BKXFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRVKNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 268 A; 101 C; 131 G; 153 T; 4 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attgacagaa 60 atttgtgatg aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gatagtacta agtggagaaa attagtagay 180 tttagggaac tcaataaaag aactcaagac ttttgggaag ttcaattagg aataccacat 240 ccagcaggat taaaaaaraa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tarataaaga mttcaggaaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac caggaattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcaggctag tatgacaaaa attttagagc cctttagagt gaaaaaccca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag ggtctgacct agaaataggg 540 caacatagag caaaaataga agagttaaga gaacacttgt taagatgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca ttcctttgga tgggttatga actccat 657 // ID KC307897; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307897; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRCV31 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; e0b966726d7d7c79063a18f608ccfe2a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRCV31" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYR1" FT /protein_id="AGL33903.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPDNPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRTQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 104 C; 132 G; 154 T; 4 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgaag aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga taatccatat 120 aacactccaa tatttgccat aaaaaagaag gatagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagay ttttgggaag ttcaattagg aataccacat 240 ccagcagggt tgaaaaagaa aaaatcagtg acagtactag atgtggggga tgcmtatttt 300 tcagttcctt tatatgaaga cttcagraaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccargctag catgacaaaa atcttagagc cctttaggac acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga ggagttaaga gaacatttat taaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaacctccc tttctttgga tggggtatga actccat 657 // ID KC307898; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307898; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRCV32 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 9ca426e24181fe13ce8ba4aa34edc745. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRCV32" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQ73" FT /protein_id="AGL33904.1" FT /translation="DGPKVKQWPLTEEKIKALTEICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKXKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMXKILEPFRXQNPEIV FT IYQYMDDLYVGSDLEIGQHRXKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 259 A; 102 C; 130 G; 146 T; 20 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagaa 60 atttgtgagg aaatggagaa agaaggaaaa attacaaaaa ttgggcctga aaatccatac 120 aacactccaa tatttgccat aaaaaagaag gacagcacta agtggagaaa attagtagac 180 ttcagggaac tcaataaaag aactcaagay ttttgggaag tycaattagg rataccacac 240 ccagcaggtt taaaamagaa aaaatcagtg acagtaytgg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga yttcaggaar tatactgcat tcaccatacc yagtacaaat 360 aatgaracac cagggattag rtatcartat aatgtgctyc cacagggwtg gaaaggatca 420 ccagcaatat tycargctag yatgayaaaa atcttagagc cctttagrrc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgattt agaaataggg 540 caacatagak caaaaataga ggaattaaga gaacatctgt taaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307899; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307899; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRCV34 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; a8befae34325fef71f0fd4b9dbd89acc. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRCV34" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MS69" FT /protein_id="AGL33905.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPXIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 264 A; 104 C; 128 G; 149 T; 12 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagca 60 atttgtgatg aaatggaraa ggaaggaaaa attacaaara ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gayagtacta artggagaaa attagtagat 180 ttcagggaac tyaataaaag aactcaagat ttttgggaag ttcaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaar tatacagcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag atatcartat aatgtgcttc cacagggatg gaarggatca 420 ccagcaatat tccaagctag catgacaaaa atyttagagc cctttagagc acaaaatcca 480 gamatagtca tctatcaata tatggatgac ttgtatgtag gatctgaytt agaaataggr 540 caacatagag caaaaataga ggaattaaga gaacatctgt taaaatgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca ttcctttgga tggggtatga actccat 657 // ID KC307900; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307900; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV1037 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; cc294aea92d69cd56744989aba30899d. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV1037" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0P1" FT /protein_id="AGL33906.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT HEXFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPSIFQASMTKILEPFRTQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 264 A; 103 C; 134 G; 150 T; 6 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagca 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttggacctga raatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacay 240 ccagcagggt taaaaaagaa aaaatcagtg acagtgttgg atgtggggga tgcatatttt 300 tcagttcctt tacatgagga mttcaggaaa tatactgcat tcaccatacc tagtagaaat 360 aatgaaacac cagggataag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccatcaatat tycaggctag yatgacaaaa atcttagagc cctttaggac acaaaatccr 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga ggaattaagg gaacatctgt taaaatgggg attcaccaca 600 ccagacaaaa agcatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307901; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307901; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV1043 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 575c769e9d7f280b71d7be6e4ac4ffff. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV1043" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2F9" FT /protein_id="AGL33907.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT DEDFRKYTAFTIPSRNNETPGVRYQYNVLPQGWKGSPAIFQCSMTKILEPFRKQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKRPPFLWMGYELH" XX SQ Sequence 657 BP; 269 A; 106 C; 133 G; 148 T; 1 other; gatggcccaa aagttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgagg aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagac 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag acgtgggaga tgcatatttt 300 tcagttcctt tagatgaaga cttcaggaaa tatactgcat tcaccatacc tagtagaaay 360 aatgaaacac caggggttag atatcaatat aatgtgcttc cccagggatg gaaaggatca 420 ccagcaatat tccagtgtag tatgacaaaa atcttagagc cctttagaaa acaaaatccg 480 gaaatagtca tctatcaata tatggatgac ttatatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga ggaattaaga gaacatctgt taaggtgggg gtttaccaca 600 ccagacaaga aacatcaaaa aagaccccca tttctttgga tgggatatga actccat 657 // ID KC307902; SV 1; linear; genomic RNA; STD; VRL; 657 BP. XX AC KC307902; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJCV1045 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; e8ba5b40e68794ba066dc5f4d21adc59. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJCV1045" FT /mol_type="genomic RNA" FT /country="India" FT /isolation_source="plasma" FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYR8" FT /protein_id="AGL33908.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT HEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRXXNPEIV FT IYQYMDDLYVGSDLEIGQHREKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 262 A; 103 C; 138 G; 150 T; 4 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgatg aaatggaaaa ggaagggaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tycaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tacatgaaga cttcaggaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaagggatca 420 ccagcaatat tccagagtag catgacaaaa atcttagagc cctttaggay aargaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag aaaaaataga agaattaaga gaacatctgt traggtgggg gtttaccaca 600 cctgacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307903; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307903; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV01 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; c874c3106f80894a76a84289b7bad94e. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV01" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQ81" FT /protein_id="AGL33909.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKISKIGPENPYNTPIF FT AIKKKDSTKWXKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSXNNETPGIRYQYNVLPQGWKGSPAXFQSSMTXILEPFRAXNPEIX FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 262 A; 104 C; 132 G; 152 T; 7 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaggc attaacagca 60 atttgtgatg aaatggaaaa ggaaggaaaa atttcaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgctat aaaaaagaag gacagtacta agtggaraaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaaa tacactgcat tcaccatacc tagtayraac 360 aatgaaacac caggaattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcartat ttcagagtag catgacaara attctagagc cctttagagc amaaaatcca 480 gaaatartca tctatcaata tatggatgac ttgtatgtag ggtctgactt agaaataggg 540 cagcatagag caaaaataga ggaattaaga gaacatctgt taaagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307904; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307904; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV02 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 8717fc47cb587322cdbca3212bf64c99. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV02" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MS75" FT /protein_id="AGL33910.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKISKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFXAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 106 C; 134 G; 149 T; 5 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggagaa agagggaaaa atttcaaaaa ttgggcctga aaacccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac ttaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtr acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcagaaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac caggrattag atatcartat aatgtgctyc cacagggatg gaaaggatca 420 ccagcaatat tccagagtag tatgacaaaa atcttagagc cctttrgagc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacacagag caaaaataga ggaattaaga gaacatctgt taaggtgggg atttaccaca 600 ccagacaaga aacatcagaa ggaaccccca ttcctttgga tggggtatga actccat 657 // ID KC307905; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307905; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV03 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 384b0b1fbf26f345a39cf434af6cf0f6. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV03" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0P9" FT /protein_id="AGL33911.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT HEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRTQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKVEELRVHLLKWGLTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 105 C; 136 G; 152 T; 1 other; gatggcccaa aggttaaaca gtggccattg acagaagaaa aaataaaagc attaacagca 60 atttgtgaag aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aatactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagac 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag ttcaattagg aataccacat 240 ccagcagggt taaagaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttc 300 tcagttcctt tacatgagga cttcaggaag tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac caggaattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccagagtag catgacaaaa attttagagc cctttaggac acaaaatcca 480 gaaatagtca tctatcaata tatggatgat ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaagtaga ggagttaaga gtacatctat traagtgggg acttaccaca 600 ccagataaga aacatcagaa agaacctcca ttcctttgga tggggtatga actccat 657 // ID KC307906; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307906; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV04 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; be9487a3d83693334aae6accca6ea430. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV04" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2G7" FT /protein_id="AGL33912.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLSWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 264 A; 106 C; 132 G; 153 T; 2 other; gatggcccaa aggttaaaca atggccatta acagaagaaa aaataaaagc attaacagaa 60 atttgtgatg aaatggagaa ggagggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagrgaac tcaataaaag aactcaagat ttttgggaag ttcaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtgggaga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaag tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatcc 420 ccagcaatat ttcaggctag catgacaaag atcttagagc cctttagrgc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 cagcatagag caaaaataga agagttaaga gaacatctgt taagctgggg atttaccaca 600 ccagacaaga aacatcaaaa agaacctcca tttctttgga tgggatatga actccat 657 // ID KC307907; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307907; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV05 from India nonfunctional pol protein (pol) gene, DE partial sequence. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; e0ac68e00e089ea2548485def5a4ed28. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV05" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT misc_feature <1..>657 FT /gene="pol" FT /note="nonfunctional pol protein due to APOBEC mediated FT hypermutation" XX SQ Sequence 657 BP; 258 A; 109 C; 136 G; 151 T; 3 other; gatggcccaa aggttaaaca atggccattg acagaggaga aaataaaagc attaacagca 60 atctgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttrggaag tccaattagg aataccacac 240 ccagcarggt tgaaaaagaa aaaatcagtg acagtactag atgtrggaga tgcatatttt 300 tcagttcctt tatatgaaga ctttaggaaa tatactgcat tcaccatacc tagcagaaac 360 aatgcaacgc cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagagc cttttagggc acaaaatcca 480 gatatagtca tctatcaata tatggatgat ttgtatgtag gatctgactt agaaataggg 540 cagcatagag caaaaataga ggaattaaag gaccatctgt tgaagtgagg atttaccaca 600 ccagacaaga aacaccagaa ggagccccca tttctttgga tagggtatga actccat 657 // ID KC307908; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307908; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV06 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 9ba0a13b813d16514f895c66ca24ee55. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV06" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYS7" FT /protein_id="AGL33913.1" FT /translation="DGPKVKQWPLTEEKIKALTEICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAKNPEIV FT IYQYMDDLYVGSDLEIEQHRAKIEELRGHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 267 A; 106 C; 135 G; 149 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagaa 60 atttgtgaag aaatggaaaa agaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaaa gacagtacta agtggagaaa attagtagat 180 ttcagggagc tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa gaaatcagta acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaag tatactgcat tcaccatacc cagtacaaac 360 aatgaaacac cagggattag ataccaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcaggctag catgacaaaa atcttagaac cctttagggc aaagaatcca 480 gagatagtca tctatcaata tatggatgac ttgtatgtag gatctgattt agaaatagag 540 caacatagag caaaaataga ggaattaaga ggacatctgt taaagtgggg atttaccaca 600 ccagacaaga agcatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307909; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307909; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV08 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 6a52b2422ba62a47ccb7e1110dce1665. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV08" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQ89" FT /protein_id="AGL33914.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSXNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRVQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 259 A; 104 C; 139 G; 153 T; 2 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagcg 60 atctgtgatg aaatggaaaa ggaagggaaa attacaaaga ttgggcccga aaacccatat 120 aacactccaa tatttgctat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac ttaataaaag ractcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga ctttaggaaa tatactgcat tcactatacc tagtayaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatcg 420 ccagcaatat ttcaggctag catgacaaaa atcttagagc cctttagggt acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacataggg caaaaataga ggaattaaga gaacatctgt taaggtgggg atttaccaca 600 ccagacaaga aacatcaaaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307910; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307910; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV09 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 62c5fcf9650b553add1c9335965c0072. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV09" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MS81" FT /protein_id="AGL33915.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMTRILAPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEKLREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 261 A; 102 C; 137 G; 157 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgatg agatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gatagtacta agtggagaaa attagtagat 180 tttagagaac ttaataaaag aactcaagat ttttgggaag ttcaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga ctttaggaaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag gtatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcaggctag catgacaaga attttagcgc cctttagggc acaaaaccca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga gaagttaaga gaacatctgt taaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaacctcca tttctttgga tggggtatga actccat 657 // ID KC307911; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307911; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV10 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; da59565c94a818f93a8264c199924e40. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV10" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0Q5" FT /protein_id="AGL33916.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKEKKSVTVLDVGDAYFSVPL FT DKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPELI FT IYQYMDDLYVGSDLELGQHRERIEKLRDHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 106 C; 136 G; 152 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tcaataaaag aactcaagat ttttgggaag ttcaattagg aataccacac 240 ccagcagggt tgaaagagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tagataagga ctttaggaaa tatactgcat tcaccatacc tagtataaac 360 aatgaaacac cagggattag gtaccaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcaggctag catgacaaaa atcttagaac cctttagggc acaaaaccca 480 gaattgatca tctatcaata tatggatgac ttgtatgtag gatctgactt agaactaggg 540 caacacagag aaagaataga aaagttaaga gaccatctgt taaaatgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307912; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307912; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV12 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; f73be3938151297dac1cf6a41c20ca39. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV12" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2J5" FT /protein_id="AGL33917.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKISKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGVRYQYNVLPQGWKGSPAIFQASMTKILEPFRTQNPEMV FT IYQYMDDLYVGSDLEIGQHRAKIEKLREHLARWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 266 A; 110 C; 136 G; 145 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggaaaa ggaaggaaaa atttcaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaaa gacagtacta agtggagaaa attagtagac 180 tttagagaac tcaataaaag aactcaagat ttttgggaag tacaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag atgtggggga tgcatatttt 300 tcagttccct tgtatgaaga cttcaggaaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac caggagttag atatcagtat aatgtgctgc cacagggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagagc cctttaggac acaaaatcca 480 gaaatggtca tctatcaata tatggatgac ttgtacgtag gatctgactt agaaataggg 540 caacatagag caaaaataga gaagttaaga gaacatctgg caaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca ttcctttgga tggggtatga actccat 657 // ID KC307913; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307913; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV13 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 5e6808fc9486b17578cc84e401a152d8. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV13" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYV1" FT /protein_id="AGL33918.1" FT /translation="DGPKVKQWPLTEEKIKALTEICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQSSMTRILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 267 A; 111 C; 135 G; 144 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagaa 60 atttgtgaag aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaacccatat 120 aacactccaa tatttgccat caaaaagaag gacagtacta agtggagaaa attagtagac 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcccc tatatgaaga tttcaggaaa tacactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccagagtag catgacaaga atcttagaac cctttagggc acagaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga ggaactgaga gaacatctgt taaaatgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307914; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307914; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV15 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 6ccfa7c0daa52efa0504fd9200b989de. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV15" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQB8" FT /protein_id="AGL33919.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKXVTVLDVGDAYFSVPL FT HEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRAKNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 262 A; 105 C; 138 G; 146 T; 6 other; gatggcccaa aggttaaaca atggcccttg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gayagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccagttagg aataccacac 240 ccagcagggt taaaaaagaa aaaaycagtg acagtactrg atgtggggga tgcatatttt 300 tcagttcctt trcatgaaga tttcaggaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggattag gtatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagccatat ttcagagtag catgacaaaa atcttagagc cctttagagc aaaaaatcca 480 gaaatagtca tctatcaata yatggatgac ctgtatgtag gatcwgactt agaaataggg 540 caacataggg caaaaataga ggagttaaga gaacatctgt taaagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctgtgga tggggtatga actccat 657 // ID KC307915; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307915; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV16 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; cd75c1caa1d7f236692006dcaee82842. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV16" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MSA2" FT /protein_id="AGL33920.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKIEKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKRKKSVTVXDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNEAPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 264 A; 108 C; 137 G; 147 T; 1 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc gttaacagca 60 atttgtgatg aaatggagaa ggaaggaaaa attgaaaaaa ttggacctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaaa gacagtacta agtggagaaa attagtagac 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaggaa aaaatcagtg acagtacygg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaaa tatactgcat tcaccatacc tagtacaaac 360 aatgaagcac cagggataag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccaagctag catgacaaag atcttagagc cctttagggc acagaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga agaattaaga gaacatctgt taaaatgggg atttaccaca 600 ccagacaaga agcatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307916; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307916; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV17 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 4ed02f02564e60d2366b47e5fb0bb636. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV17" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0S0" FT /protein_id="AGL33921.1" FT /translation="DGPKVKQWPLSEEKIKALTAICDEMEKEGKISKIGPENPYNTPIF FT AIKKKBSTXWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT HEDXRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILEPFRAKNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELRXHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 257 A; 101 C; 136 G; 156 T; 7 other; gatggcccaa aggttaaaca atggccattg tcagaggaga aaataaaagc attaacagca 60 atttgtgatg aaatggagaa ggagggaaaa atttcaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag racagtactr agtggagaaa attagtggat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag ttcaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag atgtagggga tgcatatttt 300 tcagtccctt tacatgaaga tyttaggaag tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc crcagggatg gaaaggatca 420 ccagcaatat ttcagtgtag catgacaaaa atyttagagc cttttagrgc aaaaaatcca 480 gaaatagtca tctatcaata tatggatgat ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga ggagttaaga gracatctgt taaaatgggg atttaccaca 600 ccagacaaga agcatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307917; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307917; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV18 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; b762d0cd3c0e928e0b3a4a142a878eb2. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV18" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2K5" FT /protein_id="AGL33922.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILEPFRTKNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 273 A; 106 C; 130 G; 148 T; 0 other; gatggcccaa aagttaaaca atggccatta acagaagaga aaataaaagc attaacagca 60 atttgtgaag aaatggaaaa agaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttccat tatatgaaga cttcaggaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacaaggatg gaaaggatca 420 ccagcaatat tccagtgtag catgacaaaa atcttagaac cctttaggac aaaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacacagag caaaaataga agaattaaga gaacatctgt tgaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307918; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307918; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV19 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; e848fd4a9231875d175bcfeb70d6f295. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV19" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYW6" FT /protein_id="AGL33923.1" FT /translation="DGARVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSINNETPGIRYQYNXLPQGWKGSPAIFQASMTXILEPFRAXNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 101 C; 132 G; 150 T; 11 other; gatggcgcaa gggttaaaca atggccattg acagaagaaa aaataaaagc attaacagca 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaaa gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag ttcaattagg aataccacac 240 ccagcagggt taaagaagaa aaaatcagtg acagtactgg atgtggggga tgcatacttt 300 tcagttcctt trtatgaaga ttttagraaa tatactgcat tcaccatacc yagtataaac 360 aatgaaacac cagggattag atatcaatat aatgygcttc cacagggrtg gaaaggatca 420 ccagcaatat tycaggctag catgacaara atcttagarc cctttagrgc amaaaatcca 480 gaaatagtca tctatcaata yatggatgac ttgtatgtag gatctgattt agaaataggg 540 caacatagag caaaaataga agaattaaga gaacatctgt taaggtgggg atttaccaca 600 ccagacaaga aacatcaaaa agaaccccca tttctatgga tggggtatga actccat 657 // ID KC307919; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307919; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV20 from India nonfunctional pol protein (pol) gene, DE partial sequence. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 82ab12c574fff226702ad92989786fd4. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV20" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT misc_feature <1..>657 FT /gene="pol" FT /note="nonfunctional pol protein due to APOBEC mediated FT hypermutation" XX SQ Sequence 657 BP; 293 A; 108 C; 108 G; 148 T; 0 other; gatggcccaa aagttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atctgtgatg aaataaaaaa agaaggaaaa attacaagaa ttaggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta aatggagaaa attagtagat 180 ttcagagaac tcaataaaag gactcaagat ttttagaaag tccaattaag aataccacac 240 ccagcaaagt tgaaaaagaa aaaatcagtg acagtactag atgtaaaaga tgcatatttt 300 tcagttcctt tatatgaaga attcaggaag tatactgcat tcaccatacc tagcagaaac 360 aatgaaacac caaggattag atatcaatat aatgtgcttc cacagagata gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagagc cctttaagac acaaaatcca 480 gaactaatca tctatcaata tatagatgac ttgtatgtaa gatctgactt agaaataagg 540 caacatagag aaaaaataga gaaattaaga caacatctgt taaggtaagg atttaccaca 600 ccagacaaaa aacatcagaa ggagccccca tttctttaga taaggtatga actccat 657 // ID KC307920; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307920; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV26 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 4cb542a1f0d2da50718c7e737eb3f746. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV26" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQD2" FT /protein_id="AGL33924.1" FT /translation="DGAKVKPWPLTEEKIKALTAICDEMEKEGKISKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT DKDFRKYTAFTIPSLNNETPGVRYQYNVLPQGWKGSPAIFQASMTKILEPFRAXNPEMV FT IYQYMDDLYVGSDLEIGQHRAKIEELRGHLLTWGFTTPDKKHQKDPPFHWMGYELH" XX SQ Sequence 657 BP; 258 A; 105 C; 134 G; 152 T; 8 other; gatggcgcaa aggttaaacc gtggccatta acagaggaga aaataaaagc attaacagca 60 atttgtgatg aaatggagaa ggaaggaaaa atttcaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac ttaataaaag aactcaagat ttttgggaag ttcaattagg aatacchcac 240 ccagcagggt taaaaaaraa aaaatcagtg acagtactgg atgtagggga tgcatatttt 300 tcagttccyt trgacaaaga cttcaggaaa tatactgcat tcactatacc tagtctaaac 360 aatgaaacac caggggttag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tycaggccag tatgacaaaa atcttagagc cctttagagc amaaaatcca 480 gaaatggtca tctatcaata tatggatgac ttgtatgtag grtctgactt agaaataggg 540 caacatagag caaaaataga rgaattaaga gggcatctat taacatgggg atttaccaca 600 ccagataaaa aacatcagaa agacccccca tttcattgga tggggtatga actccat 657 // ID KC307921; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307921; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 09SJAV28 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; c25f234e3eeadf792084af32e5d257c3. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="09SJAV28" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MSA5" FT /protein_id="AGL33925.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVXVLDVGDAYFSVPL FT YEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRTRNPEMV FT IYQYMDDLYVGSDLEIGQHRAKIEDLRKHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 261 A; 102 C; 137 G; 151 T; 6 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgaag aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag ttcaattagg aataccacac 240 ccagcagggt taaaaaagaa raaatcagtg rcagtactgg atgtggggga tgcatatttt 300 tcagtwcctt tatatgarga cttcaggaag tatactgcat tcaccatacc tagtataaac 360 aatgaaacac cagggattag gtatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccagagtag catgacaaaa atcttagagc cctttaggac aagaaatcca 480 gaaatggtca tctatcaata tatggatgac ttgtatgtag gatctgaytt agaaataggg 540 caacayagag caaaaataga ggatttaaga aaacatctgt taaagtgggg gtttaccaca 600 ccagacaaga aacatcagaa agaacctcca tttctttgga tggggtatga actccat 657 // ID KC307922; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307922; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV01 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 36f5a4d953d7369a83cc227be636314d. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV01" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0S5" FT /protein_id="AGL33926.1" FT /translation="DGPRVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEEFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAKNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 261 A; 105 C; 140 G; 149 T; 2 other; gatggcccaa gggttaaaca atggccattg acagaagaaa aaataaaagc attaacagca 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaacccatat 120 aacactccca tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggagg tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtattag atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga gttcaggaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggattag atatcaatac aatgtgcttc cacagggatg gaagggatca 420 ccagcaatat tccaggctag catgacaaaa atyttagagc cctttagagc aaaaaatcca 480 gagatagtca tytatcaata tatggatgac ttgtatgtag gatctgactt agagataggg 540 caacatagag caaaaataga agaattgaga gaacatctgt taaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307923; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307923; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV02 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 491f45c1140275652de48c34ed1d1b4e. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV02" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2K8" FT /protein_id="AGL33927.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWXKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEBFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQCSMTXILEPFRAQNPXIV FT IYQYMDDLYVGSDLEIGQHRAKIEKLREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 102 C; 128 G; 152 T; 12 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagca 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgctat aaaaaagaag gacagtacta agtggaraaa attagtagat 180 ttyagggaac tcaataaaag aactcaagat ttttgggaag ttcaattagg aataccacac 240 ccagcagggt taaaaaaraa aaaatcagtg acagtactrg atgtgggaga tgcatayttt 300 tcagttcctt tatatgaara cttyaggaar tatactgcat ttaccatacc tagtataaac 360 aatgaaacac cagggattag rtatcaatat aatgtgctyc cacagggatg gaaaggatca 420 ccagcaatat tccagtgtag catgacaara atcttagagc cctttagagc acaaaatcca 480 raaatagtca tctatcaata tatggatgac ttgtatgtag gatctgattt agaaataggg 540 caacacagag caaagataga aaaattaaga gaacatctgt taaggtgggg atttaccaca 600 ccagataaga agcatcagaa agaaccccca ttcctctgga tggggtatga actccat 657 // ID KC307924; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307924; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV03 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; d2607b87ada1d18331930a85a79e1710. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV03" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYW9" FT /protein_id="AGL33928.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQHSMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLGWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 269 A; 104 C; 131 G; 152 T; 1 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgatg aaatggaaaa agaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag atgtggggga tgcatatttt 300 tcagttcctt tatatgagga cttcagaaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggattag atatcaatat aaygtgcttc cacaaggatg gaaaggatca 420 ccagcaatat tccagcatag catgacaaaa atcttagagc cctttagggc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatcagactt agaaataggg 540 caacatagag caaaaataga agaattaaga gaacatttat tagggtgggg atttaccaca 600 cctgataaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307925; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307925; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV04 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 10aa1b35cb3f5533169808c68895bea9. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV04" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQD7" FT /protein_id="AGL33929.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKXKSVTVLDVGDAYFSVPL FT XEEFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGLSTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 259 A; 103 C; 131 G; 148 T; 16 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgagg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaayccatat 120 aayactccca tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtrgat 180 ttcagggaac ttaataarag aacycaagat ttttgggaag tmcaattagg aataccacac 240 ccagcagggt taaaaaagam aaaatcagtg acagtactmg atgtgggaga tgcatatttt 300 tcagttcctt takatgaaga attcagraag tatactgcat ttaccatacc tagtacmaac 360 aatgaaacac cagggattag atatcaatat aatgtrcttc cacagggatg gaaaggatca 420 ccagcaatat tccagtctag catgacaaaa atcttagagc cwtttagggc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag grtctgacyt agaaataggr 540 caacatagag caaaaataga ggaattaaga gaacatctgt taaagtgggg actatctaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307926; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307926; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV05 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 92559b1b2d2f91abdc78fb90a6968a43. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV05" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MSA8" FT /protein_id="AGL33930.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSRNNETPGIRYQYNVXPQGWKGSPAIFQASMTKILEPFRXHNPXIV FT IYQYMDDLYVGSDLEIGQHREKIEELRXHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 103 C; 132 G; 150 T; 9 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gatagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag ttcaattagg rataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactrg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga ctttaggaaa tayactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggataag gtatcaatat aatgtgytkc cacagggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagarc cctttcggrc acacaatcca 480 graatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag aaaaaataga agaattaaga gracatctgt taaagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307927; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307927; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV06 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 50da847cbe6e26fb5d0e3c42ded3fb2a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV06" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0S7" FT /protein_id="AGL33931.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITRIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT DEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILDPFRAKNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 262 A; 104 C; 138 G; 151 T; 2 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacggca 60 atttgtgatg aaatggaaaa ggaaggaaaa attacaagaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag gactcaggac ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaagaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tagatgaaga tttcaggaaa tatactgcat tcaccatacc tagtataaac 360 aatgaaacac cagggatcag atatcaatat aatgtgctyc cwcagggatg gaaaggatca 420 ccagcaatat tccagagtag catgacaaaa attttagatc cctttagggc aaaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agagataggg 540 caacatagag caaaaataga agaattaaga gaacatctat taaggtgggg atttaccaca 600 ccagataaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307928; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307928; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV08 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 1046943efdc23b7a8e8bd48fa1ca6a5e. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV08" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2L4" FT /protein_id="AGL33932.1" FT /translation="DGPKVKQWPLTEEKIKALTEICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT HEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRVKNPDIV FT IYQYMDDLYVGSDLEIGQHRTKVEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 265 A; 107 C; 136 G; 146 T; 3 other; gatggcccaa aggttaagca atggccatta acagaagaga aaataaaagc attaacagaa 60 atttgtgagg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacaccccaa tatttgccat aaaaaagaar gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagac ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttccgt tacatgaaga yttcaggaag tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcagagcag catgacaaaa atcttagaac cctttagggt aaaaaatcca 480 gacatagtca tctatcarta tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagaa caaaagtaga agaattaaga gaacatctgt taaagtgggg gtttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307929; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307929; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV09 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; e4d4abfa2d9879b544a478a6cc2dce74. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV09" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYX5" FT /protein_id="AGL33933.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPVF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPXPAGLKKKKSVTVLDVGDAYFSVPL FT YEBFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAVFQASMTKILEPFREQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELRAHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 255 A; 108 C; 139 G; 150 T; 5 other; gatggcccaa aggttaaaca gtggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccag tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccmcmc 240 ccagcagggt taaaaaaraa aaaatcagtg acagtactgg atgtagggga tgcatattty 300 tcagttcctt tatatgaara tttcaggaag tatactgcat tcactatacc tagtacaaac 360 aatgagacac cggggattag atatcaatac aatgtgcttc cacagggatg gaaaggatca 420 ccagcagtat tccaggctag catgacaaaa atcttagagc ccttcaggga acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgattt agaaatagga 540 caacatagag caaaaataga ggaactaagg gcacatctgt taaggtgggg atttaccaca 600 cctgacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307930; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307930; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV10 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 70431eeaf1ca5a2bd31ddb194ddc499b. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV10" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQE2" FT /protein_id="AGL33934.1" FT /translation="DGPKVKQWPLTKEKIEALTAICDEMEKEGKISKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT XEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRTQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 261 A; 107 C; 131 G; 151 T; 7 other; gatggcccaa aggttaaaca atggccattg acaaaagaga aaatagaagc attaacagca 60 atttgtgatg aaatggaaaa ggaaggaaaa atttcaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagagaac tcaataaaag aactcaagat ttttgggaag tccagttagg aataccacac 240 ccggcagggt taaaaaagaa aaaatcagtg acagtactdg atgtggggga tgcatatttt 300 tcagttcctt tayatgarga tttcaggaaa tatactgcat tcaccatacc gagtataaac 360 aatgaaacrc cagggattag atatcaatat aatgtgcttc cacaaggatg gaaaggatca 420 ccagcaatat ttcaggctag catgacaaaa atcttagagc cctttaggac acaaaatcca 480 gaaatagtca tctatcaata catggatgac ttgtatgtag gatctgactt agaaataggr 540 caacatagag caaarataga agagttaaga caacatctgt taagrtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307931; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307931; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV11 from India nonfunctional pol protein (pol) gene, DE partial sequence. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; d3f206d690a791c5e78b6e981d312cdc. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV11" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT misc_feature <1..>657 FT /gene="pol" FT /note="nonfunctional pol protein due to APOBEC mediated FT hypermutation" XX SQ Sequence 657 BP; 259 A; 105 C; 138 G; 149 T; 6 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 accacyccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtrgat 180 ttcagggaac ttaataaaag aactcaagat ttttgggarg ttcaattagg aataccacac 240 ccagcagggt taaaaaaraa aaaatcagtg acagtattgg atgtgrrgga tgcatatttt 300 tcagttcctc tatatgagga cttcaggaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggattag gtatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccagtgtag catgacaaga atcttagagc cctttagagc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatcagactt agaaataggg 540 caacacaggg caaaaataga ggagttaaga ggacatctgt taaagtgagg gattaccaca 600 ccagacaaga aacatcagaa agagccccca tttctttgga tggggtatga actccat 657 // ID KC307932; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307932; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV12 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; e1d1678a01f1492eaaac641f07989ee1. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV12" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MSB0" FT /protein_id="AGL33935.1" FT /translation="DGPKVKQWPLTEEKIKALTEICNEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT DKEFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRXKNPEJV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 266 A; 103 C; 134 G; 147 T; 7 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtaatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaggac ttttgggaag tccaattagg aataccacac 240 ccagcaggat taaaaaagaa aaagtcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tagataagga attcaggaar tatactgcat tcaccatacc yagtataaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccagagtag catgacaaaa atyttagagc cctttagrrc aaaaaaycca 480 gaawtagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga agagttaaga gaacatctgt taaagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307933; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307933; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV13 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 0df09612107db500570e582f6d8d8a03. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV13" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0T2" FT /protein_id="AGL33936.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQHSMTRILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRXKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 258 A; 104 C; 137 G; 150 T; 8 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgagg agatggagaa ggaaggraaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagyacta agtggagaaa actagtagat 180 ttymgggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaag tatactgcrt tcaccatacc tagtataaay 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccagcatag catgacaaga attttagagc cctttagagc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agarataggg 540 caacatagar caaaaataga ggaattaaga gaacatctgt tgaggtgggg atttaccaca 600 ccagacaaaa aacatcaaaa ggaacctccg tttctttgga tggggtatga actccat 657 // ID KC307934; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307934; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV14 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; bec5fed581c1c933b3d44c5d9e14b127. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV14" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2M1" FT /protein_id="AGL33937.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEXEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRXKNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 266 A; 103 C; 129 G; 149 T; 10 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggaaar agaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aayactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tcaataaaag aactcargat ttttgggaag tycaattagg aataccacac 240 ccagcagggt taaaraagaa aaaatcagtg acagtactrg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcagaaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac caggcattag atatcaatat aacgtgctkc cacaaggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagagc cytttagarc aaaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttrtatgtag gatctgactt agaaataggg 540 cagcatagag caaaaataga ggaattaaga gaacatctgt taaagtgggg gtttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307935; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307935; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV15 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 5c6ef6ef391da1b67d9f9e348ad7c872. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV15" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYY0" FT /protein_id="AGL33938.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT XEGFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHXLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 259 A; 104 C; 138 G; 146 T; 10 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgagg aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tycaattagg aataccacac 240 ccagcagggt taaaaaagaa aaagtcagtg acagtactgg atgtggggga tgcatatttt 300 tcrgtacctt takatgaagg cttcagaaag tatactgcat tcaccatacc tagtacaaay 360 aatgaaacrc caggaattag atatcaatat aatgtgcttc cacagggatg gaaaggatcc 420 ccagcaatat tccagagtag catgacaaaa atcttagagc cctttagggc acagaatcca 480 gaaatagtca tytatcaata yatggatgac ttgtatgtag gatctgattt agaaataggg 540 carcatagag caaaaataga ggarttaaga gaacatckgt taaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307936; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307936; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV16 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; b1a2847ccf71eceec782dc08f445baa6. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV16" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQE7" FT /protein_id="AGL33939.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSXNNETPGIRYQYNVLPQGWKGSPAXFQASMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 261 A; 106 C; 135 G; 150 T; 5 other; gatggcccaa aggttaaaca atggccatta acagaagaga aaataaaagc tttaacagca 60 atttgtgatg agatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaaa gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcargat ttttgggaag tccaattagg aataccacac 240 ccggcagggt taaaaaagaa aaaatcagtg acagtactag atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaag tatactgcat tcaccatacc tagtayaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacaaggatg gaaaggatcg 420 ccagcartat tccaggctag catgacaaaa atcttagagc cctttagagc acaaaatcca 480 gagatagtca tctatcaata yatggatgac ttgtatgtag ggtctgactt agaaataggg 540 caacatagag caaaaataga ggaattaaga gaacatctgt taaartgggg atttactaca 600 ccagacaaga aacatcaaaa agaaccccca tttctgtgga tggggtatga actccat 657 // ID KC307937; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307937; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV17 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 22629d6bc909da12661000cba018e3bd. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV17" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MSB4" FT /protein_id="AGL33940.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTPRRKLVDFRELNKRTQDFWEVQLGIPHPAGLKXKKSVTVLDVGDAYFSVPL FT XEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILDPFRARNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFPTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 251 A; 115 C; 134 G; 148 T; 9 other; gatggcccca aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gatagtactc cccgccgaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttctgggarg ttcaattagg aataccacac 240 ccagcagggt taaaamagaa aaaatcagtg acagtrytgg atgtggggga tgcatatttt 300 tcagtccctt tayatgaaga cttcagraaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac caggrattag gtatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tycagtgtag catgacaaaa atyttagacc cctttagggc acgaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gctctgactt agaaataggg 540 caacacaggg caaaaataga ggagttaaga gaacatctgt taaagtgggg attccccaca 600 ccagacaaga aacatcagaa agaacctcca tttctttgga tggggtatga actccat 657 // ID KC307938; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307938; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV18 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; def97d2ed91b902736c0f71d7643f85f. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV18" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0T5" FT /protein_id="AGL33941.1" FT /translation="DGPKVKQWPLTEEKIXALKAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLXIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMIKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLMWGLTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 260 A; 103 C; 132 G; 152 T; 10 other; gatggcccaa aggttaaaca atggccgttg acagaagaga aaataamagc attaaaagca 60 atttgtgatg aaatggagaa ggaaggraaa attacaaaaa ttgggcctga aaatccatat 120 aatactccaa tatttgccat aaaaaagaag gayagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattarg aataccacac 240 ccagcagggt traaaaagaa aaaatcagtg acagtactgg atgtrgggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaar tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag atatcaatay aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcaggctag catgataaaa atcttagagc cctttagrgc acaaaatcca 480 gaaatagtca tytatcaata tatggatgac ttgtatgtag gatctgactt agaaatagga 540 caacatagag caaaaataga ggaattaaga gaacatctgt taatgtgggg actcaccaca 600 ccagataaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307939; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307939; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV19 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 5c94f82f64eabbfd91c250c8bae4c3d6. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV19" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2M5" FT /protein_id="AGL33942.1" FT /translation="DGPKVKQWPLTAEKIEALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT XEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQYSMTKILEPFRTQNPEMV FT IYQYMDDLYVGSDLEIGQHRXKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 262 A; 104 C; 133 G; 150 T; 8 other; gatggcccaa aggttaaaca atggccattg acagcagaga aaatagaagc attaacagca 60 atttgtgagg aaatggaaaa ggaagggaaa attacaaaaa ttgggcctga aaatccrtat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagagaac tcaataaaag aactcaagat ttttgggaag tycagttagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtaytrg atgtggggga tgcatatttt 300 tcagttcctt tayatgaaga cttcagraaa tatactgcat tcactatacc tagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcagtatag catgacaaaa atcttagagc cctttaggac acaaaatcca 480 gaaatggtca tctatcaata catggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagar caaaaataga ggaattaaga gaacatctgt taaartgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307940; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307940; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV20 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 2b5959d538f41bba4f0acd4ccdc0e152. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV20" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYY5" FT /protein_id="AGL33943.1" FT /translation="DGPKVKQWPLTKEKIEALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGXRYQYNVLPQGWKGSPAIFQHSMXKILEPFXAQNPEII FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 261 A; 109 C; 131 G; 149 T; 7 other; gatggcccaa aggttaaaca gtggccattg acaaaggaaa aaatagaagc attaacagca 60 atttgtgatg aaatggagaa agaaggaaar attacaaaaa ttgggcctga aaatccctat 120 aayactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tcaataaaag aactcargac ttttgggaag tccagttagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag acgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaag tatactgcat tcaccatacc tagtacaaat 360 aatgaaacac cagggrttag atatcaatat aatgtgcttc cacagggatg gaagggatca 420 ccagcaatat tccagcatag catgrtaaaa atcttagagc cctttaragc acaaaatcca 480 gaaataatca tctatcaata catggatgac ttrtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga agaactaaga gaacatctgt taagatgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307941; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307941; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV21 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 939907ce10e2bd7d6eb3ae123c1d4595. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV21" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQF1" FT /protein_id="AGL33944.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHLAGLKKKKSVTVLDVGDAYFSVPL FT HEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQSSMTRILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHREKVEELRQHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 266 A; 105 C; 136 G; 150 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa actagtagat 180 ttcagggaac ttaataaaag aactcaagat ttttgggaag tccaattagg aataccacat 240 ctagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tacatgaaga cttcaggaag tatactgcat tcaccatacc cagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgctac cacaggggtg gaaaggatca 420 ccagcaatat tccagagtag catgacaaga atcttagagc cctttagagc acaaaatcca 480 gaaatagtca tctatcagta catggatgac ttgtatgtag gatctgattt agaaatagga 540 caacataggg aaaaagttga ggaattaaga caacatctat taaagtgggg atttaccaca 600 ccagacaaga aacatcaaaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307942; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307942; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV23 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 4f020f438d2638eb74e8f98566e3a69f. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV23" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MSB8" FT /protein_id="AGL33945.1" FT /translation="DGPKVKQWPLTEEKIKALTEICNEMEKEGKISKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHQAGLKQKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLNIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 264 A; 105 C; 134 G; 154 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagaa 60 atttgtaatg aaatggagaa ggaaggaaaa atttcaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgctat aaaaaagaaa gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 caagcagggt taaagcagaa gaaatcagtg acagtactag atgtggggga tgcatatttt 300 tcagtacctt tatatgaaga cttcaggaag tatactgcat tcaccatacc tagtataaac 360 aatgaaacac cagggattag gtatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcaggccag catgacaaaa atcttagagc cttttagggc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgattt aaacataggg 540 caacatagag caaaaataga ggaattaaga gaacatctgt taaagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaacccccc tttctttgga tggggtatga actccat 657 // ID KC307943; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307943; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV25 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; f79ec6293182e3b9fdb94905a4a7d520. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV25" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0T9" FT /protein_id="AGL33946.1" FT /translation="DGPRVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT DESFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQASMXKILEPFRAENPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 258 A; 101 C; 140 G; 148 T; 10 other; gatggcccaa gggttaaaca atggccattg acagaagara aaataaaagc attaacagaa 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaayccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tyaataaaag aactcaagat ttttgggagg tycaattagg aataccacac 240 ccggcagggt taaaaaagaa aaagtcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tagatgaaag cttcaggaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgctyc crcagggatg gaaaggatca 420 ccagcaatat tccaggctag tatgayaaaa attttagagc cctttagagc agaaaatcca 480 gaaatagtca tctaccaata tatggatgac ttgtatgtag gatctgacct rgaaataggr 540 caacatagag cmaaaataga ggaattaaga gaacatctgt taaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307944; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307944; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV26 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 06ca5ba8da4617b5cdb215f90353d4b7. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV26" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2N0" FT /protein_id="AGL33947.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEEFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLXIGQHRAKIEELRKHLLGWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 266 A; 102 C; 134 G; 151 T; 4 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggaaaa agaaggaaag attacaaaga ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tyaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga attcagaaag tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacaaggatg gaaaggatca 420 ccagcaatat tccagtgtag catgacaaaa atcttagagc cttttagggc acaaaatcca 480 gaaatagtca tctatcagta tatggatgac ytgtatgtag gatcmgactt araaataggg 540 caacatagag caaaaataga agaattaaga aaacatctgc tagggtgggg atttaccacc 600 ccagataaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307945; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307945; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV27 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 9b32d9bd7daad420da7c1650f193a633. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV27" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYY9" FT /protein_id="AGL33948.1" FT /translation="DGPKVKQWPLTEEKIKALTEICNEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT HEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQHSMTKILEPFRAQNPDIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 264 A; 107 C; 132 G; 146 T; 8 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtaatg aaatggagaa ggaaggaaaa attacaaaaa ttgggccwga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 tttagggaac tcaacaaaag aactcaagat ttytgggaag tmcaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtgytag atgtgggaga tgcatatttt 300 tcagttcctt tacatgaaga cttcaggaag tatactgcat tcaccatacc tagcataaac 360 aatgaaacac cagggattag atatcaatat aatgtgctcc cacagggatg gaaaggatca 420 ccagcaatat tccagcatag yatgacaaaa atcttagarc cctttagrgc acaaaatcca 480 gacatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agagataggg 540 caacatagag caaaaataga ggarttaaga gaacatctgt taaagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307946; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307946; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV28 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 4727faf6c00b5370ff1ddcab7f8c36c4. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV28" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQF6" FT /protein_id="AGL33949.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKISKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSXPL FT YEDFRKYTAFTIPSXNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIKQHRXKIEELREHLLXWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 259 A; 102 C; 126 G; 151 T; 19 other; gatggcccaa aggttaaaca atggccattr acagaagaga aaataaaagc attaacagaa 60 atttgtgatg agatggaaaa rgaaggaaar atttcaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta artggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag ttcagttagg aataccacac 240 ccagcagggt taaaaaagaa raaatcagtg acagtactrg atgtggggga tgcatatttt 300 tcakktcctt tatatgagga cttcagraag tatactgcat tcaccatacc tagtrkaaac 360 aatgaaacac caggrattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagarc cctttagrgc acaraatcca 480 gaaatagtya tctatcaata tatggatgac ttgtatgtag gatctgaytt agaaataaag 540 caacatagag maaaaataga agaattaaga gaacatctgt taargtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307947; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307947; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV29 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 192374c0d67758e463d6ac392a1e432a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV29" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MSB9" FT /protein_id="AGL33950.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKRSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 260 A; 108 C; 140 G; 147 T; 2 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgagg aaatggagaa agaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaar gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaggat ttttgggagg tccaattagg aataccacac 240 ccagcagggt taaagaaaaa gagatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcaggaaa tatactgcat tcaccatacc cagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgctgc cacagggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagagc cctttagrgc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga agaattaaga gaacatctgt tgaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307948; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307948; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10HDRAV14 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; de4fa3d21d9b3a333cb02a8255c23fa4. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10HDRAV14" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0U2" FT /protein_id="AGL33951.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFKTKNPEMV FT IYQYMDDLYVGSDLKIGLHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 271 A; 105 C; 130 G; 151 T; 0 other; gatggcccaa aggttaaaca atggccatta acagaagaaa aaataaaagc attaacagca 60 atttgtgaag aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatac 120 aatactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagagaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tgtatgaaga ttttaggaaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac caggaattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccagagtag catgacaaaa atcttagagc cctttaagac aaaaaatcca 480 gaaatggtca tctatcaata tatggatgac ttgtatgtag gatctgactt aaaaataggg 540 ctacatagag caaaaataga agaattaaga gaacatctgt taaggtgggg atttaccaca 600 ccagacaaga agcatcagaa agaaccccca tttctctgga tggggtatga actccat 657 // ID KC307949; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307949; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRAV19 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 5574112c21fd2215e96883725e760618. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRAV19" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2N4" FT /protein_id="AGL33952.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAKNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLQWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 263 A; 107 C; 136 G; 151 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagaa 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt tgaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcttatttc 300 tcagttcctt tatatgaaga cttcaggaaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacaaggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagagc cctttagagc aaaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtgg gatctgactt ggaaataggg 540 caacatagag caaaaataga ggagttaaga gaacatctgt tacagtgggg atttaccaca 600 ccagataaga aacatcaaaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307950; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307950; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRAV26 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; c7c6252cf37ffe1a9dbff2a60199d04f. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRAV26" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYZ5" FT /protein_id="AGL33953.1" FT /translation="DGPKVKQWPLTEEKIKALTEICNEMEKEGKISKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSIPL FT CEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAQNPEII FT IYQYMDDLYVGSDLEIGQHREKIEKLRGHLLKWGLTTPDKKHQKKPPFLWMGYELH" XX SQ Sequence 657 BP; 272 A; 105 C; 131 G; 149 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagaa 60 atttgtaatg aaatggaaaa ggaaggaaaa atttcaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgctat aaaaaagaaa gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcaattcctt tatgtgaaga cttcaggaaa tatactgcat tcaccatacc tagtagaaac 360 aatgaaacac cagggataag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagagc cctttagggc acaaaaccca 480 gaaataatca tctatcaata tatggatgac ttgtatgtag ggtctgattt agaaatagga 540 caacatagag aaaaaataga aaaattaaga ggacatctgt tgaaatgggg acttaccaca 600 ccagacaaga agcatcagaa aaaaccccca tttctgtgga tggggtatga actccat 657 // ID KC307951; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307951; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRAV28 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 234c62212e576e30187c328d02fd0e05. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRAV28" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQG0" FT /protein_id="AGL33954.1" FT /translation="DGPRVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMIKILEPFRTQNPEIV FT IYQYMDDLYVGSDLEIGQHREKIEELREHLLKWGFTTPTKKHQKKPPFLWLGYELH" XX SQ Sequence 657 BP; 264 A; 106 C; 136 G; 151 T; 0 other; gatggcccaa gggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgagg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag ttcaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tatatgaaga cttcagaaaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag gtatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccaggctag catgataaaa atcttagagc cctttaggac acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatcagactt agaaataggg 540 caacatagag aaaagataga ggagttaaga gaacatctgt taaagtgggg atttaccaca 600 cccacaaaga aacatcagaa aaaaccccca tttctttggt tggggtatga actccat 657 // ID KC307952; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307952; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRAV29 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 0f4dda04a8012f80c09b818046d9b611. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRAV29" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MSC3" FT /protein_id="AGL33955.1" FT /translation="DGPKVKQWPLTEEKIKALTEICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT DKEFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRXKNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIKELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 266 A; 104 C; 132 G; 148 T; 7 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attgacagaa 60 atttgtgatg aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gatagtacta agtggagaaa attagtagac 180 tttagggaac tcaataaaag aactcaagac ttttgggaag ttcaattagg aataccacat 240 ccagcaggrt taaaaaagaa aaaatcagtg acagtactgg aygtggggga tgcatatttt 300 tcagttcctt tagataarga attcagraag tatactgcat tcaccatacc yagtacaaac 360 aatgaaacac caggaattag atatcartat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat ttcaggctag catgacaaaa atcttagagc cctttagart gaaaaaccca 480 gaaatagtca tctatcaata tatggatgac ttatatgtag ggtctgacct agaaataggg 540 caacatagag caaaaataaa agagttaaga gaacacttgt taaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca ttcctgtgga tgggttatga actccat 657 // ID KC307953; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307953; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRAV31 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; fbf95ec7e46f5be908405f2c2bc3d0c1. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRAV31" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0U7" FT /protein_id="AGL33956.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPDNPYNTPIF FT AIKKKDSTKWRKLVDFREINKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT YEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRTQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 265 A; 104 C; 133 G; 155 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgaag aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga taatccatat 120 aacactccaa tatttgccat aaaaaagaag gatagtacta agtggagaaa attagtagat 180 ttcagggaaa tcaataaaag aactcaagat ttttgggaag ttcaattagg aataccacat 240 ccagcagggt tgaaaaagaa aaaatcagtg acagtactag atgtggggga tgcctatttt 300 tcagttcctt tatatgaaga cttcaggaaa tatactgcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccaagctag catgacaaaa atcttagagc cctttaggac acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga ggagttaaga gaacatttat taaggtgggg atttaccaca 600 ccagacaaga aacatcagaa agaacctccc tttctttgga tggggtatga actccat 657 // ID KC307954; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307954; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRAV32 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 4342401aae069411f5ef132eca54c7a6. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRAV32" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N2N8" FT /protein_id="AGL33957.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKRKSVTVLDVGDAYFSVPL FT DKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRAQNPEIV FT IYQYMDDLYVGSDLNIEQHRAKIEELREHLLRWGFTTPDKNNQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 261 A; 104 C; 137 G; 155 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aatactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag ttcaattagg aataccacac 240 ccagcagggt taaagaagag aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tagataaaga tttcaggaag tatactgcat tcaccatacc tagtataaat 360 aatgaaacac cagggattag gtatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccagtctag catgacaaaa atcttagagc cctttagggc acaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt aaacatagag 540 caacatagag caaaaataga ggagttaaga gaacatctgt taaggtgggg atttaccaca 600 ccagacaaaa acaatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC307955; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307955; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 11HDRAV34 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; fca794762b7cc2a31a0956c8a631d313. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="11HDRAV34" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MYZ9" FT /protein_id="AGL33958.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT HEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQASMTKILEPFRAKNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 267 A; 106 C; 133 G; 151 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagca 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag ttcaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tacatgaaga tttcaggaaa tatacagcat tcaccatacc tagtacaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccagcaatat tccaggctag catgacaaaa atcttagagc cctttagggc caaaaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgattt agaaataggg 540 caacatagag caaaaataga ggaattaaga gaacatctgt taaagtgggg atttaccaca 600 ccagacaaga aacatcagaa agaaccccca tttctttgga tgggatatga actccat 657 // ID KC307956; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307956; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV1037 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; de069aad2bc220715c3715b6add5bb5e. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV1037" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MQG4" FT /protein_id="AGL33959.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT HEXFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPSIFQASMTKILEPFRTQNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWVGYELH" XX SQ Sequence 657 BP; 263 A; 103 C; 135 G; 150 T; 6 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagca 60 atttgtgatg aaatggagaa ggaaggaaaa attacaaaaa ttggacctga raatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacay 240 ccagcagggt taaaaaagaa aaaatcagtg acagtgttgg atgtggggga tgcatatttt 300 tcagttcctt tacatgagga mttcaggaaa tatactgcat tcaccatacc tagtagaaat 360 aatgaaacac cagggataag atatcaatat aatgtgcttc cacagggatg gaaaggatca 420 ccatcaatat tycaggctag yatgacaaaa atcttagagc cctttaggac acaaaatccr 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag caaaaataga ggaattaagg gaacatctgt taaaatgggg attcaccaca 600 ccagacaaaa agcatcagaa agaaccccca tttctttggg tggggtatga actccat 657 // ID KC307957; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307957; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV1043 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; 8842a9563fda07a8ffc566da69f123c4. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV1043" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4MSC8" FT /protein_id="AGL33960.1" FT /translation="DGPKVKQWPLTEEKIKALTAICEEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT DEDFRKYTAFTIPSRNNKTPGVRYQYNVLPQGWKGSPAIFQCSMTKILEPFRKRNPEIV FT IYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGNELH" XX SQ Sequence 657 BP; 270 A; 108 C; 133 G; 146 T; 0 other; gatggcccaa aagttaaaca atggccattg acagaagaga aaataaaagc attaacagca 60 atttgtgagg aaatggaaaa ggaaggaaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaaa gacagtacta agtggagaaa attagtagac 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactag acgtgggaga tgcatatttt 300 tcagttcctt tagatgaaga cttcaggaaa tatactgcat tcaccatacc tagtagaaat 360 aataaaacac caggggttag atatcaatac aatgtgcttc cccagggatg gaaaggatca 420 ccagcaatat tccagtgtag tatgacaaaa atcttagagc cctttagaaa acgaaatccg 480 gaaatagtca tctatcaata tatggatgac ttatatgtag gatctgacct agaaataggg 540 caacatagag caaaaataga ggaattaaga gaacatctgt taaggtgggg gtttaccaca 600 ccagacaaga aacatcaaaa agaaccccca tttctttgga tggggaatga actccat 657 // ID KC307958; SV 1; linear; genomic DNA; STD; VRL; 657 BP. XX AC KC307958; XX DT 13-MAY-2013 (Rel. 116, Created) DT 13-MAY-2013 (Rel. 116, Last updated, Version 1) XX DE HIV-1 isolate 10SJAV1045 from India pol protein (pol) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RC Publication Status: Online-Only RP 1-657 RX PUBMED; 23443042. RA Neogi U., Shet A., Sahoo P.N., Bontell I., Ekstrand M.L., Banerjea A.C., RA Sonnerborg A.; RT "Human APOBEC3G-mediated hypermutation is associated with antiretroviral RT therapy failure in HIV-1 subtype C-infected individuals"; RL J Int AIDS Soc 16:18472-18472(2013). XX RN [2] RP 1-657 RA Neogi U., Sonnerborg A., Shet A.; RT ; RL Submitted (10-DEC-2012) to the INSDC. RL Microbiology, St. John's Medical College, Sarjapur Road, Bangalore, RL Karnataka 560034, India XX DR MD5; f3784072270f4de8521a7d148d4df247. DR EuropePMC; PMC3582697; 23443042. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..657 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens; therapy-naive patient" FT /isolate="10SJAV1045" FT /mol_type="genomic DNA" FT /country="India" FT /proviral FT /note="subtype: C" FT /db_xref="taxon:11676" FT gene <1..>657 FT /gene="pol" FT CDS <1..>657 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="InterPro:IPR000477" FT /db_xref="UniProtKB/TrEMBL:R4N0V2" FT /protein_id="AGL33961.1" FT /translation="DGPKVKQWPLTEEKIKALTAICDEMEKEGKITKIGPENPYNTPIF FT AIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPL FT HEDFRKYTAFTIPSRNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRIKNPEIV FT IYQYMDDLYVGSDLEIGQHREKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELH" XX SQ Sequence 657 BP; 264 A; 105 C; 138 G; 150 T; 0 other; gatggcccaa aggttaaaca atggccattg acagaagaaa aaataaaagc attaacagca 60 atttgtgatg aaatggaaaa ggaagggaaa attacaaaaa ttgggcctga aaatccatat 120 aacactccaa tatttgccat aaaaaagaag gacagtacta agtggagaaa attagtagat 180 ttcagggaac tcaataaaag aactcaagat ttttgggaag tccaattagg aataccacac 240 ccagcagggt taaaaaagaa aaaatcagtg acagtactgg atgtggggga tgcatatttt 300 tcagttcctt tacatgaaga cttcaggaaa tatactgcat tcaccatacc cagtagaaac 360 aatgaaacac cagggattag atatcaatat aatgtgcttc cacagggatg gaagggatca 420 ccagcaatat tccagagtag catgacaaaa atcttagagc cctttaggat aaagaatcca 480 gaaatagtca tctatcaata tatggatgac ttgtatgtag gatctgactt agaaataggg 540 caacatagag aaaaaataga agaattaaga gaacatctgt tgaggtgggg gtttaccaca 600 cctgacaaga aacatcagaa agaaccccca tttctttgga tggggtatga actccat 657 // ID KC309415; SV 1; linear; genomic RNA; STD; VRL; 3111 BP. XX AC KC309415; XX DT 19-MAR-2013 (Rel. 116, Created) DT 06-JUN-2016 (Rel. 129, Last updated, Version 3) XX DE Sapovirus swine/WG180B/2009/USA polyprotein gene, partial cds; and small DE basic protein VP2 gene, complete cds. XX KW . XX OS Sapovirus swine/WG180B/2009/USA OC Viruses; Riboviria; Caliciviridae; Sapovirus. XX RN [1] RP 1-3111 RX DOI; 10.1128/JCM.00865-13. RX PUBMED; 23678065. RA Scheuer K.A., Oka T., Hoet A.E., Gebreyes W.A., Molla B.Z., Saif L.J., RA Wang Q.; RT "Prevalence of porcine noroviruses, molecular characterization of emerging RT porcine sapoviruses from finisher Swine in the United States, and unified RT classification scheme for sapoviruses"; RL J. Clin. Microbiol. 51(7):2344-2353(2013). XX RN [2] RP 1-3111 RA Scheuer K.A., Wang Q., Saif L.J.; RT ; RL Submitted (06-DEC-2012) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX DR MD5; 04316d38ed26e75fe80a4934e1e5cdd2. DR EuropePMC; PMC3697660; 23678065. DR EuropePMC; PMC5356244; 28302145. XX FH Key Location/Qualifiers FH FT source 1..3111 FT /organism="Sapovirus swine/WG180B/2009/USA" FT /host="swine" FT /strain="WG180B/09/USA" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="2009" FT /db_xref="taxon:1304598" FT CDS <1..2488 FT /codon_start=2 FT /product="polyprotein" FT /note="contains RNA-dependent RNA polymerase and capsid FT protein" FT /db_xref="GOA:M4Q0F4" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:M4Q0F4" FT /protein_id="AGH15832.1" FT /translation="QHPAVSSSSIEILRSFCVDCPIVSAAAEVLKSPARGMFEDVTFTT FT TSGLPSGMPFTSVINSINHMTYFAAALLKAYQDQGVPYTGNVFQLETVHTYGDDSIYGF FT LPASASIFPQFLANLKSFGLNPTNPDKGDVITPVDRPVFLKRTLAITPFGLRALLDVTS FT LERQCYWVKGSRTSDVYSPTTIDTQARSMQLEVMLAYASQHGPEVHERLSHIAQKTADG FT EGLTLVNTNYAQAAATYNAWYIGGVEPQLGSLANEGSAQVVFEMEGNGSKPAAGQSAST FT TMEAPPPGAVGPTEAALVVTNPDQPIATAQRVEMAIATGAQNSNIPEPIRQCFALFRTV FT PWNDRQPMGTFLGAVVLSPNVNPYTRHLSAMFAGWGGGMEVRISVSGSGMFAGRIICAV FT LPPGLNPATVADPGVLPHVLLDARVPDPAVFQVPDVRAVDYHRTDGDEATSSLGLWVLQ FT PLINPFSTTAISTAWLSIETRPTFDFDFCLLKPPTAQMDNGVPPDRLLPKRLGKSKGNR FT LGGLIVGMVVVAQHKQVNRHFMADSTTWGWSTAPTAPLACKITGHVDPISTDPKCGVQL FT AIGASARGPIFPHIPDHWPDSAAGTSQDTIAVPWERSRAIPQKSIVGSVMFFADNGDVD FT EDRVYYAAAADCLVNTTSRPALRGDFNAGTMTLIGFPVTARSPTNGTNIYWNPLFCDGS FT DASVNDRVTNMTGSNYVFSSSGMNNIILWKERIFSDYPNHTILYSSQLDYTAEVFQNSQ FT INIPKGMMAVYNVTSNGGEFQIGIRPDGYMVTTSQIGVNVDLDADTEFTYVGVFPITSS FT LNGPGGNASGARRIYQ" FT CDS 2485..2991 FT /codon_start=1 FT /product="small basic protein VP2" FT /note="ORF2" FT /db_xref="InterPro:IPR008437" FT /db_xref="UniProtKB/TrEMBL:M4Q0J6" FT /protein_id="AGH15833.1" FT /translation="MSWIVGALQGLAGLSDVANTVSGIAYQHRQLDLLRQQNELQAQWM FT VKNEALQREAMAMTRDLAVNAPAQRMRAALDAGFNSVDARRLAGSSERVIRGYIERPVM FT LRTNLEGIRQTNNLLTMNSALETFKKGTSFGMSAPPRVQQGPIGFSNPNYGKVTLGPRP FT PESSV" XX SQ Sequence 3111 BP; 689 A; 845 C; 759 G; 818 T; 0 other; ccaacaccca gctgtttctt cttcttcaat tgaaatcctt cggtcatttt gtgtggactg 60 ccctattgtg tcagctgcag cagaggtgtt gaagtctcct gctcgtggta tgtttgagga 120 tgtcacgttc actaccacca gtggtctccc gtcgggcatg ccattcacca gtgtcataaa 180 ctcaatcaat catatgacct acttcgccgc cgcgttgctc aaggcatacc aagaccaggg 240 cgtaccctac acgggcaatg tgttccagct tgagactgtg catacctatg gtgacgattc 300 catctatggg tttcttcccg cttctgcctc tattttccct cagttcctag caaatttgaa 360 gtcttttggt cttaatccca ccaacccgga caagggtgat gtcatcacac cagtagatcg 420 ccctgtcttt ctcaagcgca cgcttgcaat taccccgttt ggccttaggg ctctcctgga 480 tgtcacctcc cttgaaaggc agtgctattg ggtgaagggc tcgcgcacct cggacgtcta 540 ttcgccaacc acaattgaca cccaggcgcg cagcatgcag cttgaagtca tgttggccta 600 cgcatcacaa catgggccag aagtccatga aaggttgtcc cacatcgccc agaaaacggc 660 ggacggtgag ggtctaacct tagttaacac caattatgct caagctgcag ccacttacaa 720 cgcatggtat attggtggcg tagaaccaca attggggagc ctcgccaatg aaggttcggc 780 tcaagtagtg tttgagatgg agggcaacgg ctccaagcct gcggccggcc aatcggcttc 840 tactaccatg gaggccccac ccccgggtgc agttgggcct acagaagctg cccttgtggt 900 caccaatcct gaccaaccga ttgctaccgc acaacgtgtg gaaatggcca ttgcaacggg 960 ggcccaaaat tctaacatcc ccgagccaat cagacagtgt tttgccctat ttcgtactgt 1020 tccttggaac gaccgacagc ccatgggcac cttccttggt gccgttgttt tgtcaccaaa 1080 tgtcaaccct tacactagac acctttctgc aatgtttgca gggtgggggg ggggaatgga 1140 ggttaggatt tctgtttctg gctcaggcat gtttgccgga aggatcatat gtgcagtctt 1200 gcccccgggc ctcaaccctg caactgtggc tgaccctggt gtgttgcccc acgtactcct 1260 tgatgctcgt gtccctgatc ctgctgtttt tcaggtccct gatgtccgtg cagttgatta 1320 ccatcgcacg gacggtgacg aggccacttc atctcttggc ttgtgggtgc tccagcctct 1380 cattaacccc ttctccacca cggcaatttc cactgcgtgg ctctcaattg aaacaaggcc 1440 aacctttgat tttgactttt gcctcctaaa accaccgacc gcgcagatgg acaatggggt 1500 tccacctgac aggttgctgc ctaagagact tgggaagtca aaaggaaatc gactcggtgg 1560 cttgattgtt ggtatggttg ttgtggccca acataaacag gtgaaccgcc acttcatggc 1620 cgactccacc acctggggtt ggtccactgc accaactgca cccttggcct gtaagatcac 1680 aggccatgtg gaccccatat caactgaccc aaagtgcggt gtgcaacttg ccattggagc 1740 gtctgcaaga ggtcccattt tcccccacat tccagatcac tggcctgatt ctgctgcggg 1800 tacttctcaa gacaccattg cagttccctg ggagagatca agggccattc ctcagaaatc 1860 catagttggg agtgtcatgt tctttgctga caacggtgat gtagacgaag accgtgtcta 1920 ctatgcggct gctgccgatt gtctggtcaa cacaacatca agacccgcct tgcgtggtga 1980 ctttaatgca ggtacaatga ccttgatcgg ctttcccgtt acggcacggt cgccaactaa 2040 tggtaccaac atttactgga atcccttgtt ttgtgatggc tctgatgcct cggtcaatga 2100 ccgggtcacc aacatgacag gttcaaacta tgtgttctct tcatcaggga tgaacaacat 2160 cattttatgg aaagagagga tcttttctga ttaccccaac cacactatcc tttactcatc 2220 acagcttgac tacacagctg aggtttttca aaactcccaa attaacattc cgaaagggat 2280 gatggctgtg tacaacgtca cttcaaacgg cggtgagttt caaattggga tccgccccga 2340 tgggtacatg gtgactacct cccaaattgg agttaatgtt gacttggatg ctgatactga 2400 gttcacatat gttggtgtct ttcctattac ctcttctctt aatggcccag gtgggaatgc 2460 aagtggggcc cgaaggattt accaatgagt tggatagttg gggctcttca gggtcttgcc 2520 ggtttgtcag atgtcgctaa cacggtgtct ggcattgctt accaacaccg gcagctggac 2580 ttgttacgcc agcaaaatga attgcaggcc caatggatgg tcaaaaacga agctctgcag 2640 cgcgaagcca tggccatgac acgggacctt gcggtcaacg ctcctgcgca acgcatgcgg 2700 gctgcgcttg atgctgggtt caacagtgtt gacgcacgtc gactggctgg atccagtgag 2760 cgcgtcatac gcggttacat cgagaggcct gttatgcttc gcacaaactt ggagggcatt 2820 cgtcaaacca acaacttgtt gaccatgaat tctgctcttg agacctttaa aaaggggacc 2880 tctttcggga tgtcagcccc cccacgggtc caacaaggac ccattgggtt ttcaaacccc 2940 aactacggaa aggtcacttt gggtcccaga cctcctgagt cttctgtttg atctttcttt 3000 ttcttttatt ttgtctgtct cttttatttt ctttccagca gttccacacg cgttcgggtg 3060 gataatgcca attaagcgat tggcgctgtc acttcaacag gaaaaggaag g 3111 // ID KC309416; SV 2; linear; genomic RNA; STD; VRL; 6654 BP. XX AC KC309416; XX DT 19-MAR-2013 (Rel. 116, Created) DT 06-JUN-2016 (Rel. 129, Last updated, Version 3) XX DE Sapovirus swine/WG194D/2009/USA polyprotein gene, partial cds; and small DE basic protein VP2 gene, complete cds. XX KW . XX OS Sapovirus swine/WG194D/2009/USA OC Viruses; Riboviria; Caliciviridae; Sapovirus. XX RN [1] RP 1-6654 RX DOI; 10.1128/JCM.00865-13. RX PUBMED; 23678065. RA Scheuer K.A., Oka T., Hoet A.E., Gebreyes W.A., Molla B.Z., Saif L.J., RA Wang Q.; RT "Prevalence of porcine noroviruses, molecular characterization of emerging RT porcine sapoviruses from finisher Swine in the United States, and unified RT classification scheme for sapoviruses"; RL J. Clin. Microbiol. 51(7):2344-2353(2013). XX RN [2] RC Publication Status: Online-Only RP 1-6654 RX PUBMED; 27228126. RA Oka T., Lu Z., Phan T., Delwart E.L., Saif L.J., Wang Q.; RT "Genetic Characterization and Classification of Human and Animal RT Sapoviruses"; RL PLoS One 11(5):E0156373-E0156373(2016). XX RN [3] RP 1-6654 RA Scheuer K.A., Wang Q., Saif L.J.; RT ; RL Submitted (06-DEC-2012) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX RN [4] RC Sequence update by submitter RP 1-6654 RA Oka T., Wang Q.; RT ; RL Submitted (03-JUN-2016) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX DR MD5; a2c052f60451618e008c78b763ba002c. DR EuropePMC; PMC4881899; 27228126. DR EuropePMC; PMC5356244; 28302145. XX CC On Jun 3, 2016 this sequence version replaced gi:461180473. XX FH Key Location/Qualifiers FH FT source 1..6654 FT /organism="Sapovirus swine/WG194D/2009/USA" FT /host="swine" FT /strain="WG194D/09/USA" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="2009" FT /db_xref="taxon:1304599" FT CDS <1..6047 FT /codon_start=3 FT /product="polyprotein" FT /note="contains RNA-dependent RNA polymerase and capsid FT protein" FT /db_xref="GOA:M4QBE6" FT /db_xref="InterPro:IPR000317" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001665" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:M4QBE6" FT /protein_id="AGH15834.2" FT /translation="ALVELHGSMISNLGNTTATIVAALTGSFELIKDFIGELIEKFSST FT QAPEGPTDLAWTIMFAGIIAIIMRLGGCKDILTSWPSLVKAAASFTTLTAAYKSFEWIR FT GKFAEVTLTRKIKMFMARCAALVELTHSRDVHGIDEIKELLKCYSVLEEEGNDLVAEAG FT NGTQAGLIRGYLTDLSTQASQLRTTLALDTPRKVPVCVILTGPPGIGKTRLAQEIGKGF FT GKLSSFTLLNDHHDSYTGNRVAIWDEFDVDAKGAFVETMISLVNSSPCPLNCDRPENKG FT KMFTSDYIICTSNFETSVIPDNPRAQAFYRRVITVDVSSPSVQNWMTKNPGRQPPKDLF FT KPDFSHLELSLRPFLGYNSQGDTLSGKRGRVSQITVDGLTRLMEQKFEEQAKDPPQNVW FT ITVPKDLVADALVVVKKYVMLHRGVCYVTGQPTPAECGNRHVSTVVVSDGLPTGNTFHH FT VKATALSLDNPTISSSLLSLFITDSRVSSSTQRDWLYKVWQPSILVQESAVDSQSIPIV FT RRVVVVNSAADFIVNMRHHLGFVSIPGLFTAFRGWRNSTSIIDFIEKHFRDFRFPSNPE FT CTIFRCANGDVMLYTYGSYFVFGTPRRLPILCDVDSCVMSNVPRCMTWWETIKLCLEYF FT YNFLKVVLPHAISLANLVYLFTRGDRQEEAKGKTKLGRGRRHGAARGVALSDDEYDEWR FT DMMRDWRQEMTVNEFLELRERAYAGLQGANEDRYRAWLNLRAMRMGTGNYQHATIIGKG FT GVRDEIIRTSVMRAPRHREDPYDEREAPFIPEANSAVVEFTNEEEHVGWGVHIGNGRVV FT TVTHVATMSNLVDGVPFTIKESNGETCHVLTPLKNLPHLQIGDGVPTYYSQRFHPVMVI FT NEGSYDTPKTTVNGWHVRIINDFPTKKGDCGTPYFDDCRRVVGLHAAGSIGGSTKLVQN FT IKSPKTNIEMFSWKGLSVSRSPPVGGMPTGTRYHRSPAWPNVSPTETHEPAPFGSGDSR FT YHFSQVEMLVNNLKPYSQAPPGIPPELLQRAATHVRTYLASMLGPGKSQPLTYHEAVAT FT LEKSTSCGPHVPGIKGDYWNETTEQFEGKLGDYLQHAWNQANLGHPLSHDYKLALKDEL FT RPLEKNAQGKRRLLWGADAGVTLVCAAAFKPVAIRLAELVPMTPVSVGINMDSAQIEVL FT NESLKGRIVYCFDYSKWDSTQHPAVSSSSIEILRSFCVDCPIVSAAAEVLKSPARGMFE FT DVTFTTTSGLPSGMPFTSVINSINHMTYFAAALLKAYQDQGVPYTGNVFQLETVHTYGD FT DSIYGFLPASASIFPQFLTNLKSFGLNPTNPDKGDVITPVDRPVFLKRTLAITPFGLRA FT LLDVTSLERQCYWVKGSRTSDVYSPTTIDTQARSMQLEVMLAYASQHGPEVHERLSHIA FT QKTADGEGLTLVNTNYAQAAATYNAWYIGGVEPQLGSLASEGSTQVVFEMEGNGSKSVV FT GSPSAPPTEAPPPGAVGPMEGGLVVVNPDQPVATAQRAELAIATGARSSNIPEPIRQCF FT ALFRTLPWNDRQPMGTFLGAVVLSPNVNPYTRHMSAMFAGWGGGMEVRVSVSGSGMYAG FT RIICAVLPPGLNPATVVDPGVLPHVLLDARVPDPAVFQVPDVRAVDYHRTDGEEATSSL FT GLWVLQPLINPFSTTAISTAWLSIETRPTFDFDFCLLKPPTAQMDNGTPPDRLLPKRLG FT RSKGNRLGGIITGMVVVASHKQVNRHFMADSTTWGWSTAPVSPLAAKITGHAERVTTGN FT RCGVQIAIGAAGKGPLFPFIPDHWPDSAAGTAQGTITIPWEQSRGIPQKSILGSAMFFA FT DNGDVDEGRTFYAVAADCMIAPTARPDIRGDFNAGTITLIGYQDGATSPDNGTNVYWNP FT LFCDGANPSVEGRVTNMTGTNYVFTSSGMNNIILWKERIFSDHPTDTILYSSQLDYTAE FT TFQNSQVNIPTGMMAVYNVASSGGEFQVGIRADGYMVTTSQIGVSVDLDPDTEFTYVGV FT FPVSSSLNGPDGNSSGARRIYQ" FT CDS 6044..6550 FT /codon_start=1 FT /product="small basic protein VP2" FT /note="ORF2" FT /db_xref="InterPro:IPR008437" FT /db_xref="UniProtKB/TrEMBL:M4Q2Y2" FT /protein_id="AGH15835.1" FT /translation="MSWIVGALQGLAGLTDVANTVSGIAYQHRQLDLLRQQNELQAQWM FT VKNEALQREAMSMTRDLAVNAPAQRVQAALDAGFNAVDARRLAGSSERVIRGYLERPIM FT PRVDLQGIRQTNNLLTMNSALETFKKGTPFGMSAPPRVQQGPIGFSNPNYGKITLGPRP FT SESSV" XX SQ Sequence 6654 BP; 1553 A; 1773 C; 1642 G; 1686 T; 0 other; cagcccttgt ggagcttcac ggctccatga tctcaaacct tggtaacacc actgcaacca 60 ttgttgcggc gctcactggc tcgtttgagc tcatcaagga ctttattggg gaactgatag 120 aaaaattctc atccacacag gcaccagagg gacccacaga ccttgcatgg accatcatgt 180 ttgcaggcat cattgctatc ataatgcggt tgggagggtg caaagacatc ctcacctcct 240 ggccttcctt ggttaaagcg gccgcatcat tcacaaccct cacggctgct tataagtcat 300 ttgagtggat acgtggcaag tttgctgagg tcaccctcac ccgtaagatc aagatgttca 360 tggctaggtg cgctgccttg gtcgagctga cacactcccg cgatgtccac ggcatcgacg 420 aaatcaagga gttactcaag tgttactcag tccttgagga ggagggcaac gatcttgtgg 480 ccgaggcggg caatggcact caggccggtt taattcgtgg atacctcaca gatctttcca 540 ctcaggcatc ccagttgaga accacccttg ccttggacac tccacgcaag gtacctgttt 600 gtgttatctt aacaggaccc ccgggcatcg ggaaaactag gttagcccag gaaattggta 660 agggttttgg taaactctca tctttcaccc tcctaaacga ccatcatgac tcatacactg 720 gcaaccgggt ggccatctgg gacgagtttg atgttgacgc gaagggagca tttgtggaaa 780 caatgatctc gcttgtcaat agctcccctt gtccactaaa ctgtgacagg cctgaaaaca 840 agggcaaaat gtttacatct gattacatca tttgcacgtc aaactttgag acgtctgtca 900 tcccggacaa tccgcgagca caggcctttt atcggagagt gatcaccgtt gacgtctcct 960 cccccagtgt gcagaactgg atgactaaga accctgggcg tcaaccccca aaagacttgt 1020 ttaaacctga tttttcccat ttggagttgt ccttacgacc attccttggc tacaactccc 1080 agggcgacac gctctcaggg aagcgtggtc gggtgtctca aattactgtt gatggattga 1140 cgcgcctcat ggagcagaag tttgaggaac aggcaaaaga ccccccacaa aacgtgtgga 1200 ttaccgttcc aaaggacctt gttgcggatg cccttgttgt ggttaagaaa tatgtgatgc 1260 tccaccgcgg ggtgtgttat gtcactggcc agcccacccc ggcggaatgc gggaaccgcc 1320 acgtttcaac tgttgtggtg tcagacggtc tccctactgg aaacacattt catcatgtta 1380 aggccactgc attatccctt gacaacccca ccatctcctc atctttgttg tctcttttca 1440 tcactgactc acgtgtttct tcttccactc aacgtgattg gctctataag gtgtggcagc 1500 catccatctt ggtgcaagag tctgcagtgg actctcaatc aatcccaatt gtccgccgcg 1560 ttgtcgtggt gaacagtgca gcagacttca tagtcaacat gagacaccac cttggttttg 1620 tttcaatccc aggacttttc acggcatttc gtggttggcg caattccact tccatcatag 1680 acttcattga gaaacatttt cgagatttta gattcccctc caatccggag tgcaccatat 1740 tccgatgtgc gaatggggac gtcatgcttt acacctacgg gtcctatttt gtcttcggta 1800 ccccaagaag actccccatt ctctgtgacg tggattcctg tgtcatgtcc aatgtgccac 1860 gctgtatgac atggtgggaa acaattaaac tgtgtcttga atacttctac aacttcttga 1920 aagtggtttt accccatgcc atttcactag caaatcttgt ctaccttttc acccgggggg 1980 atcgtcagga agaggccaag ggcaaaacaa agcttgggcg tggtcgccgc catggtgcag 2040 cccgtggagt ggctttgagc gacgatgagt atgatgagtg gagagacatg atgcgggact 2100 ggcgccaaga aatgacagtt aatgagttcc tcgaattgag agaaagggcc tatgcgggtt 2160 tgcagggcgc aaatgaagat cgctacaggg cgtggttgaa ccttagggca atgagaatgg 2220 gaacaggtaa ttaccaacac gctactatca ttggcaaggg tggtgtgcgt gatgagatca 2280 tacgtacatc agtcatgcgt gccccgcgcc accgagagga cccatacgac gagcgcgagg 2340 cacccttcat ccctgaggct aactctgcgg ttgttgagtt cacaaatgaa gaagaacatg 2400 ttggttgggg agtgcatatt ggaaatggga gggtcgtcac cgtcacccac gttgcaacta 2460 tgtccaatct cgtggacggt gtccccttca ccatcaaaga gtcaaatggt gaaacttgtc 2520 acgttctcac acctctcaag aatctccccc acctgcaaat tggagatgga gtgccaactt 2580 actactcaca aaggtttcat ccggtcatgg tgatcaatga gggatcgtac gataccccca 2640 agaccactgt gaacgggtgg catgtccgca tcatcaatga ttttccaacc aagaaaggag 2700 attgtggcac accctacttt gatgactgcc gcagagtggt tggcctccat gctgctggtt 2760 ctattggtgg ttctactaaa ctagttcaaa acattaaatc acctaagacc aacatcgaga 2820 tgttctcatg gaaggggttg agtgtctccc ggtcaccccc ggtaggtggt atgccgacgg 2880 gcaccagata ccacagatcg ccagcttggc ctaacgtgtc ccccactgaa acacacgagc 2940 ctgccccttt cggctctggt gattcacgtt accacttttc tcaagtggaa atgttggtta 3000 acaatctcaa accttatagt caggcccccc ctggcattcc ccctgagctc ttacaacgag 3060 ctgccaccca tgttaggacc tacttggctt caatgcttgg cccagggaaa agtcaaccac 3120 tcacatacca tgaagctgta gcaacattgg agaagagcac gtcttgcgga ccccatgttc 3180 ctgggatcaa gggtgattac tggaatgaaa caactgagca gtttgagggt aagttgggtg 3240 actacttgca gcacgcctgg aaccaagcta acctaggcca cccactttca catgattaca 3300 aactcgccct taaagatgag ttgcgtccac ttgagaaaaa cgcccagggc aagcgccgct 3360 tgttgtgggg tgcggatgca ggcgtgactc tcgtttgtgc cgctgcattc aagccggtgg 3420 ccatacggct tgctgagttg gttccaatga cacctgtctc agttgggatc aacatggact 3480 ctgcacaaat cgaggtcctc aatgaatcgc ttaagggcag gatcgtttac tgctttgatt 3540 actccaaatg ggattccacc caacacccag ctgtttcttc ttcctcaatt gaaatccttc 3600 ggtctttttg cgtggactgc cctatagtgt cagctgcagc ggaggtgttg aagtctcctg 3660 ctcgtggtat gtttgaggat gtcacgttca ctaccaccag tggcctcccg tcaggcatgc 3720 cattcaccag tgtcataaac tcaattaatc atatgaccta cttcgctgcc gcgctgctca 3780 aggcatacca agaccagggc gtaccctaca ccggcaatgt gtttcagctt gagactgtgc 3840 atacctatgg tgatgattcc atctatggat ttctccccgc ttctgcctct atcttccccc 3900 agtttttaac aaatttgaag tcttttggcc tcaaccccac caacccggac aagggtgatg 3960 tcattacacc agtagatcgc cctgtctttc tcaagcgtac gcttgcaatt accccatttg 4020 gccttagggc actcctggac gtcacctctc ttgaaaggca gtgctattgg gtgaagggct 4080 cgcgcacctc ggacgtctat tcgccaacca caattgacac ccaggcgcgt agtatgcagc 4140 ttgaagttat gctggcctac gcatcacaac atgggccaga agttcatgaa aggttatccc 4200 acatcgccca gaaaacggcg gacggtgagg gtctaacctt agttaacacc aattatgctc 4260 aagctgcggc cacctacaac gcatggtata ttggtggcgt agaaccacaa ttggggagtc 4320 tcgccagtga aggttcgact caagtagtgt ttgagatgga gggcaacggc tccaaatccg 4380 tggttggttc gcccagtgct cccccaactg aggcaccacc accaggtgca gttggtccca 4440 tggagggggg tttggttgtt gtcaatccag accaacctgt tgcaacagcc cagcgcgcag 4500 aactagcaat tgcaactggt gcacggagct ccaatatccc cgagccaata cgccagtgct 4560 ttgctctctt ccgcacgctc ccttggaatg accgacagcc catgggcacc tttcttgggg 4620 ccgttgtctt gtcaccaaat gtcaatcctt acacaaggca catgtcagcc atgttcgctg 4680 gctggggagg aggaatggag gttagagtgt ctgtgtctgg atctggcatg tacgccggga 4740 gaatcatatg tgctgtgttg ccacctgggc tcaaccctgc gaccgttgtt gatcctggtg 4800 ttctccccca cgttcttctt gatgcccgtg ttccagaccc tgctgttttc caggtccccg 4860 atgttcgtgc cgttgattac catcgtacgg acggggagga ggccacctct tcacttggtt 4920 tgtgggttct tcagccatta atcaatccct tctccactac tgcaatttcc accgcatggt 4980 tgtctattga gacaaggcca acctttgatt ttgacttttg ccttctcaag ccccctactg 5040 cccaaatgga caatggcaca ccacctgaca gactgctccc caagagactt ggcaggtcca 5100 aaggcaaccg tctcggcggt attatcactg ggatggttgt ggtggctagc cataaacagg 5160 ttaaccgcca cttcatggca gattccacta cgtgggggtg gtctacagct cccgtttctc 5220 cattggccgc caagattact ggccatgcag aacgcgtcac cactgggaat aggtgtggtg 5280 ttcaaatagc tatcggggct gcaggcaagg gccccctctt tccattcatc ccagaccact 5340 ggcctgactc agctgcgggc actgctcaag gcaccatcac gatcccatgg gagcaatcac 5400 gaggcattcc gcaaaaatcc attctgggca gtgccatgtt ctttgccgac aatggggacg 5460 tggatgaggg tcgcacgttc tatgctgtag ccgcagattg catgatcgca ccaacagcca 5520 ggcctgacat tcgtggggac ttcaatgcgg gaacaatcac cttaattggt taccaagatg 5580 gtgcaacctc accagataat ggcactaatg tatattggaa tccactcttt tgtgacggtg 5640 ctaatccatc tgttgaaggc agagttacta acatgactgg taccaattat gtttttacat 5700 catctggcat gaacaacatc atcttgtgga aggagaggat tttctcagat caccccacag 5760 acacaatcct ctattcatcc caattagatt atacagctga aacctttcaa aattctcagg 5820 tcaacatccc cacgggaatg atggcagttt acaatgtggc ttcaagtggt ggtgagtttc 5880 aagttggcat tcgtgctgat gggtacatgg tgacaacctc gcaaattggt gttagcgttg 5940 atcttgaccc tgatactgag tttacctatg ttggtgtttt tcctgtatct tcttccttaa 6000 atggcccaga tgggaattca agtggggccc gtagaattta ccaatgagct ggattgttgg 6060 ggctcttcaa ggacttgctg ggctaacgga cgtggcaaat acagtgtctg ggatagccta 6120 ccaacaccga caattggatc ttttgcgcca acagaacgag ctgcaagccc agtggatggt 6180 caaaaatgag gcccttcagc gtgaagccat gtccatgacg cgtgatcttg cggtcaacgc 6240 ccctgcgcaa cgagtgcagg cagcgttgga tgcagggttt aacgccgttg acgcacgtcg 6300 actagcaggt tctagtgagc gcgtcattcg tgggtatctt gagcggccca tcatgccgcg 6360 agtcgacctg cagggcattc ggcagactaa caacctgttg accatgaaca gcgcacttga 6420 aacttttaaa aagggcaccc cctttggcat gtcagcgcct ccacgggtgc agcagggccc 6480 cattgggttc tccaacccca actatggcaa gatcacactt ggccctcgac cctctgagtc 6540 atctgtgtaa ttttctttcc tctgttttat tttcctgctc cggcagttcc acacgcgttc 6600 gggtggatga tgtcaattaa gcgattgacg ctgccccttt ttagaaaggg aggg 6654 // ID KC309417; SV 2; linear; genomic RNA; STD; VRL; 6497 BP. XX AC KC309417; XX DT 19-MAR-2013 (Rel. 116, Created) DT 06-JUN-2016 (Rel. 129, Last updated, Version 3) XX DE Sapovirus swine/WG197C/2009/USA polyprotein gene, partial cds; and small DE basic protein VP2 gene, complete cds. XX KW . XX OS Sapovirus swine/WG197C/2009/USA OC Viruses; Riboviria; Caliciviridae; Sapovirus. XX RN [1] RP 1-6497 RX DOI; 10.1128/JCM.00865-13. RX PUBMED; 23678065. RA Scheuer K.A., Oka T., Hoet A.E., Gebreyes W.A., Molla B.Z., Saif L.J., RA Wang Q.; RT "Prevalence of porcine noroviruses, molecular characterization of emerging RT porcine sapoviruses from finisher Swine in the United States, and unified RT classification scheme for sapoviruses"; RL J. Clin. Microbiol. 51(7):2344-2353(2013). XX RN [2] RC Publication Status: Online-Only RP 1-6497 RX PUBMED; 27228126. RA Oka T., Lu Z., Phan T., Delwart E.L., Saif L.J., Wang Q.; RT "Genetic Characterization and Classification of Human and Animal RT Sapoviruses"; RL PLoS One 11(5):E0156373-E0156373(2016). XX RN [3] RP 1-6497 RA Scheuer K.A., Wang Q., Saif L.J.; RT ; RL Submitted (06-DEC-2012) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX RN [4] RC Sequence update by submitter RP 1-6497 RA Oka T., Wang Q.; RT ; RL Submitted (03-JUN-2016) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX DR MD5; 97bee9e654d369b1ff8f88033b8d3d17. DR EuropePMC; PMC4881899; 27228126. DR EuropePMC; PMC5356244; 28302145. XX CC On Jun 3, 2016 this sequence version replaced gi:461180477. XX FH Key Location/Qualifiers FH FT source 1..6497 FT /organism="Sapovirus swine/WG197C/2009/USA" FT /host="swine" FT /strain="WG197C/09/USA" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="2009" FT /db_xref="taxon:1304600" FT CDS <1..5883 FT /codon_start=1 FT /product="polyprotein" FT /note="contains RNA-dependent RNA polymerase and capsid FT protein" FT /db_xref="GOA:M4Q473" FT /db_xref="InterPro:IPR000317" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001665" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:M4Q473" FT /protein_id="AGH15836.2" FT /translation="AWTIMFAGIIAIIMRLGGCKDILTSWPSLVKAAASFTTLTAAYKS FT FEWIRGKFAEVTLTRKIKMFMARCAALVELTHSRDVHGIDEIKELLKCYSVLEEEGNDL FT VAEAGNGTQAGLIRGYLTDLSTQAXQLRTTLALDTPRKVPVCVILTGPPGIGKTRLAQE FT IGKGFGKLSSFTLLNDHHDSYTGNRVAIWDEFDVDAKGAFVETMISLVNSSPCPLNCDR FT PENKGKMFTSDYIICTSNFETSVIPDNPRAQAFYRRVITVDVSSPSVQNWMTKNPGRQP FT PKDLFKPDFSHLELSLRPFLGYNSQGDTLSGKRGRVSQITVDGLTRLMEQKFEEQAKDP FT PQNVWITVPKNLVADALVVVKKYVMLHRGVCYVTGQPTPAECGNRHVSTVVVSDGPPTG FT NTFHHVKATALSLDNPTISSSLLSLFTTDSRVSSSTQRDWLYKVWQPSILVQESAVDSQ FT SIPIVRRVVVVNSAADFIVNMRHHLGFVSIPGLFTAFRGWRNSTSIIDFIEKHFRDFKF FT PSNPECTIFRCANGDVMLYTYGSYFVFGTPRRLPILCDVDPCVMSNVPRCMTWWETIKL FT CLEYFYNFLKVVLPHAISLANLVYLFTRGDRQEEAKGKTKLGRGRRHGAARGVALSDDE FT YDEWRDMMRDWRQEMTVNEFLELRERAYAGLQGANEDRYRAWLNLRAMRMGTGNYQHAT FT IIGKGGVRDEIIRTSVMRAPRHREDPYDEREAPFIPEANSAVVEFTNEEEHVGWGVHIG FT NGRVVTVTHVATMSNLVDGVPFTIKESNGETCHVLTPLKNLPHLQIGDGVPTYYSQRFH FT PVMVINEGSYDTPKTTVNGWHVRIINDFPTKKGDCGTPYFDDCRRVVGLHAAGSIGGST FT KLVQNIKPPKTNIEMFSWKGLSVSRSPPVGGMPTGTRYHRSPAWPNVSPTETHEPAPFG FT SGDSRYHFSQVEMLVSNLKPYSQAPPGIPPELLQRAATHVRTYLASMLGPGKSQPLTYH FT EAVATLEKSTSCGPHVPGIKGDYWNETTEQFEGKLGDYLQHAWNQANLGHPLSHDYKLA FT LKDELRPLEKNAQGKRRLLWGADAGVTLICAAAFKPVAIRLAELVPMTPVSVGINMDSA FT QIEVLNESLKGRIVYCFDYSKWDSTQHPAVSSSSIEILRSFCVDCPIVSAAAEVLKSPA FT RGMFEDVTFTTTSGLPSGMPFTSVINSINHMTYFAAALLKAYQDQGVPYTGNVFQLETV FT HTYGDDSIYGFLPASASIFPQFLANLKSFGLNPTNPDKGDVITPVDRPVFLKRTLAITP FT FGLRALLDITSLERQCYWVKGSRTSDVYSPTTIDTQARSMQLEVMLAYASQHGPEVHER FT LSHVAQKTADGEGLTLVNTNYAQAAATYNAWYIGGVEPQLGSLASEGSAQVVFEMEGNG FT SKPAAGQSASTTTEAPPPGAVGPTEAALVVTNPDQPIATAQRVEMAIATGAQNSNIPEP FT IRQCFALFRTVPWNDRQPMGTFLGAVVLSPNVNPYTRHLSAMFAGWGGGMEVRISVSGS FT GMYAGRIICAVLPPGLNPATVADPGVLPHVLLDARVPDPAVFQVPDVRAVDYHRTDGDE FT ATSSLGLWVLQPLINPFSTTAISTAWLSIETRPTFDFDFCLLKPPTAQMDNGVPPDRLL FT PKRLGKSKGNRLGGLIVGMVVVAQHKQVNRHFMADSTTWGWSTAPTAPLACKITGHVDP FT ISTDPKCGVQLAVGASARGPIFPHIPDHWPDSAAGTSQDTIAVPWERSRAIPQKSIVGS FT VMFFADNGDVDEDRIYYAAAADCLVNTTSRPALRGDFNAGTMTLIGFPVTAQSPNNGTN FT IYWNPLFCDGSDSSVNDRVTNMTGSNYVFSSSGMNNIILWKERIFSDYPNHTILYSSQL FT DYTAEVFQNSQINIPKGMMAVYNVTSNGGEFQLGIRPDGYMVTTSPIGVNVDLDGDTEF FT SYVGVFPITSSLNGPGGNASGARRIYQ" FT CDS 5880..6386 FT /codon_start=1 FT /product="small basic protein VP2" FT /note="ORF2" FT /db_xref="InterPro:IPR008437" FT /db_xref="UniProtKB/TrEMBL:M4Q0F7" FT /protein_id="AGH15837.1" FT /translation="MSWIVGALQGLAGLSDVANTVSGIAYQHRQLDLLRQQNELQAQWM FT VKNEALQREAMAMTRDLAVNAPAQRMRAALDAGFNSVDARRLAGSSERVIRGYIERPVM FT LRTNLEGIRQTNNLLTMNSALETFKKGTPFGMSAPPRVQQGPVGFSNPNYGKVTLGPRP FT PESSV" XX SQ Sequence 6497 BP; 1500 A; 1760 C; 1594 G; 1642 T; 1 other; gcatggacca tcatgtttgc aggcatcatt gctatcataa tgcggttggg agggtgcaag 60 gacatcctta cctcctggcc ctccttggtc aaagcagccg catcattcac aactcttacg 120 gctgcttaca agtcattcga gtggattcgt ggcaagttcg ctgaagtcac cctcacccgt 180 aagatcaaga tgttcatggc taggtgtgct gccttggtcg agctgacaca ctcccgcgac 240 gtccacggca tcgatgaaat caaggagtta ctcaagtgct actcagtcct tgaggaggag 300 ggtaacgatc ttgtggccga ggcaggcaat ggcactcagg ccggtttgat tcgtggatac 360 ctcactgatc tttctactca ggcgycccag ttgagaacca cccttgcctt ggacactcca 420 cgcaaggtgc ctgtttgtgt tattttaaca ggacccccgg gcatcgggaa aactcggcta 480 gcccaggaaa ttggtaaggg ttttggcaaa ctgtcatctt tcaccctcct aaacgaccat 540 catgactcat atactggtaa ccgggtggcc atctgggacg agtttgatgt tgacgcaaaa 600 ggggcatttg tggaaacaat gatctcgctt gttaatagct ccccttgccc actaaactgt 660 gacaggcctg aaaacaaggg caaaatgttt acatctgatt acatcatttg cacgtcaaac 720 tttgagacgt ctgtcattcc ggacaatccg cgagcacagg ccttttatcg cagagtgatc 780 accgttgacg tctcctcccc cagtgtccag aactggatga ccaagaatcc tgggcgtcaa 840 cctccaaaag acttatttaa acctgatttc tcccacttgg agctgtcctt acgaccattc 900 cttggttaca actcccaggg cgacacgctc tcagggaagc gtggtcgggt gtctcaaatt 960 actgttgatg ggttgacgcg ccttatggag cagaagtttg aggaacaggc gaaagacccc 1020 ccgcaaaacg tgtggattac tgttccaaag aaccttgttg cagatgccct tgttgtggtt 1080 aagaaatatg tgatgctcca tcgcggcgtg tgttatgtca ctggccagcc cactccggca 1140 gaatgcggga accgccacgt ctcaactgtt gtagtctcag acggtccccc taccggaaac 1200 acatttcatc atgttaaggc cactgcatta tcccttgata accccaccat ctcctcttct 1260 ttgttgtctc ttttcaccac tgactcacgt gtttcttctt ccactcaacg tgattggctt 1320 tataaggtgt ggcagccatc catcttggtg caggagtctg cagtggactc tcaatcaatc 1380 ccaattgtcc gccgcgttgt cgtggtgaac agtgcagcag acttcatagt caatatgaga 1440 caccaccttg gttttgtttc aatcccagga cttttcacgg ctttccgtgg ttggcgcaat 1500 tccacgtcca tcatagactt cattgagaaa cattttcgag acttcaaatt cccctccaac 1560 ccagagtgca ctatattccg atgtgcgaat ggggatgtca tgctttacac ctacggttcc 1620 tatttcgtct tcggcacccc aagaagactc cccattctct gcgacgtgga tccctgtgtc 1680 atgtccaatg tgccacgctg tatgacatgg tgggaaacaa tcaaactgtg tcttgaatac 1740 ttctacaact tcttgaaagt ggttttgccc catgccattt cactggcaaa tcttgtttac 1800 cttttcaccc ggggggaccg tcaggaagag gctaagggca agacaaagct tggccgtggt 1860 cgccgtcatg gtgcagcccg tggagtggcc ttgagcgacg atgagtatga tgagtggaga 1920 gacatgatgc gggactggcg ccaagaaatg acagttaatg agttccttga attgagagaa 1980 agggcctatg cgggcttgca gggagcaaac gaagaccgct acagggcgtg gttaaacctt 2040 agggcaatga gaatgggaac aggtaattac caacatgcca ctatcattgg caagggtggt 2100 gtgcgtgatg agatcatacg cacatcagtc atgcgtgctc ctcgccaccg agaggaccca 2160 tatgacgagc gcgaggcacc cttcatccct gaggccaact ctgcggttgt cgagttcaca 2220 aatgaagaag aacatgttgg ttggggcgtg cacattggaa atggaagggt cgtcaccgtc 2280 acccacgttg caactatgtc taatctcgta gacggtgtcc ccttcaccat caaagagtcg 2340 aatggtgaaa cttgtcacgt cctcacacct ctcaagaacc tcccccacct gcaaatcgga 2400 gatggagtgc caacttacta ctcacaaaga tttcatccgg tcatggtaat caatgaggga 2460 tcgtatgaca cccccaagac cactgtgaac gggtggcatg tccgtatcat caatgatttt 2520 ccaaccaaga aaggagattg tggcacaccc tactttgatg actgccgcag agtggttggc 2580 ctccatgctg ctggttccat tggtggttcc accaaactag tccaaaacat taaaccacct 2640 aagaccaaca tcgagatgtt ctcatggaag gggttgagtg tctcccggtc acccccggta 2700 ggtggtatgc cgacgggtac cagatatcac agatcgccag cctggcccaa cgtgtcccct 2760 actgaaacac acgaacccgc ccctttcggc tctggtgact cacgttacca cttttctcaa 2820 gtggagatgt tggttagcaa tctcaaacct tatagtcagg cccctcctgg cattccccct 2880 gagctcttac aacgagctgc cacccatgtt aggacctact tggcttcaat gcttggccca 2940 gggaaaagtc aaccactcac ataccatgag gctgtggcaa cattggagaa gagcacgtct 3000 tgcgggcccc atgttcctgg gatcaagggt gactactgga atgaaacaac tgagcagttt 3060 gagggtaagc taggtgatta cttgcagcac gcctggaacc aagctaatct aggccaccca 3120 ctttcacatg actacaaact cgcccttaaa gatgagttgc gtccacttga gaaaaacgcc 3180 cagggcaagc gccgcttgtt gtggggtgca gatgcaggcg tgactctcat ttgtgccgct 3240 gcgttcaagc cggtggctat acggcttgct gagttggtcc caatgacgcc tgtctcagtt 3300 gggatcaaca tggactccgc acagatcgaa gtcctcaatg aatcacttaa gggcagaatc 3360 gtttactgct ttgattactc caaatgggat tccacccaac acccagctgt ttcttcttcc 3420 tcaattgaaa tccttcggtc gttttgtgtg gactgcccta ttgtgtcggc tgcagcggag 3480 gtgttgaagt ctcctgctcg tggtatgttt gaggatgtca cgtttactac caccagtggc 3540 ctcccgtcag gcatgccatt caccagtgtc ataaactcaa ttaatcacat gacctacttc 3600 gctgccgcgc tgctcaaggc ataccaagac cagggcgtgc cctacaccgg taatgtgttt 3660 cagcttgaga ctgtgcatac ctatggtgac gattccatct atgggtttct ccccgcttct 3720 gcctctattt tccctcagtt cctagcaaac ttgaagtctt ttggtctcaa ccccaccaac 3780 ccagacaagg gcgatgtcat tacaccagta gatcgcccag tctttcttaa gcgtacgctt 3840 gcaattaccc catttggcct tagggctctc ctggacatca cctcccttga aaggcagtgc 3900 tattgggtga agggctcgcg tacctcagac gtctattcgc caaccacaat tgacacccag 3960 gcgcgtagta tgcagcttga agttatgttg gcctacgcat cacaacatgg gccagaagtc 4020 catgaaaggt tatcccacgt cgcccagaaa acagcagacg gtgagggtct aaccttagtt 4080 aacaccaatt atgctcaagc tgcagccacc tacaatgcgt ggtatattgg tggcgtggaa 4140 ccacaattgg ggagcctcgc cagtgaaggt tcagctcaag tagtgtttga gatggagggc 4200 aacggctcca agcctgcggc cggccaatcg gcttctacca ccacggaggc tccgcccccg 4260 ggcgcagttg ggcctacaga agctgccctt gtggtcacca accctgatca accaattgcc 4320 accgcacaac gtgtggaaat ggccattgca acgggggccc aaaattccaa catccccgaa 4380 ccaattagac aatgttttgc cctatttcgg actgttcctt ggaacgaccg acagcccatg 4440 ggcacctttc ttggtgccgt tgttttgtca ccaaatgtca acccttacac caggcacctt 4500 tctgcaatgt ttgcaggttg ggggggggga atggaggtta ggatttctgt ttctggatcg 4560 ggcatgtatg ccggaaggat catatgtgca gtcttgcccc ccggcctcaa ccctgcaact 4620 gtggctgacc ctggcgtgtt gccccatgtg ctccttgacg ctcgtgtccc tgaccctgct 4680 gtatttcagg tccctgatgt ccgtgcggtt gattaccatc gcacggacgg tgacgaggcc 4740 acttcatctc ttggcttgtg ggtgctccag cctctcatta accccttctc caccacggca 4800 atttccactg cgtggctctc aattgaaaca aggccaactt tcgacttcga cttttgcctc 4860 ctgaaaccac cgaccgcgca gatggataat ggggtcccac ctgacaggtt gctgcctaag 4920 agacttggga agtcaaaagg aaatcgactt ggtggcttga ttgttggtat ggttgttgtg 4980 gcccaacaca aacaggtgaa ccgtcacttc atggccgact ccaccacctg gggttggtct 5040 actgcaccaa ctgcgccctt ggcctgtaag atcacaggcc atgtggaccc catatcaact 5100 gacccaaagt gcggtgtgca acttgccgtt ggagcgtctg cgagaggtcc cattttcccc 5160 cacattccag atcactggcc tgactctgct gcaggtactt ctcaagacac cattgcagtt 5220 ccctgggaga gatcaagggc catccctcag aaatccatag ttggaagtgt catgttcttt 5280 gccgacaacg gtgatgtgga cgaagaccgt atctactatg cggctgctgc cgattgtctg 5340 gtcaatacaa catcaagacc cgccttgcgt ggcgacttta acgcaggcac aatgaccttg 5400 atcggcttcc ccgttacggc acagtcgcca aataatggta ctaacattta ctggaatccc 5460 ttgttttgtg atggctctga ttcttcggtc aatgaccggg tcaccaacat gacaggttca 5520 aattacgtgt tctcttcatc agggatgaac aacatcattt tgtggaaaga gaggatcttt 5580 tctgattacc ccaaccacac catcctttac tcatcacagc ttgactacac agctgaggtt 5640 tttcaaaact cccaaatcaa catcccgaaa gggatgatgg ctgtgtataa cgtcacttca 5700 aacggcggtg agtttcaact cggaatccgc cctgatgggt acatggtgac cacctcccca 5760 attggagtta atgttgactt ggacggtgat actgagttct catatgttgg tgtttttcct 5820 atcacctctt ctcttaatgg cccaggtggg aatgcaagtg gggcccggag gatttaccaa 5880 tgagttggat agttggggct cttcagggtc ttgccggctt gtcagatgtc gctaacacgg 5940 tgtctggcat cgcttaccaa caccggcagc tggatttgtt acgccagcaa aatgagttgc 6000 aggcccaatg gatggtcaaa aacgaagcct tgcagcgcga agccatggcc atgacacggg 6060 acctcgcggt caacgctccc gcgcaacgca tgcgggctgc gcttgatgct gggtttaaca 6120 gtgttgacgc acgtcgactg gctggatcca gtgagcgcgt catacgcggc tacatcgaga 6180 ggcctgttat gcttcgcact aacttggagg gcattcgtca aaccaacaac ttgttgacca 6240 tgaactctgc tcttgagacc tttaaaaagg gtaccccttt cgggatgtca gcccccccgc 6300 gggtccaaca agggcccgtt gggttttcaa acccaaatta tggaaaggtc actttgggcc 6360 ccagacctcc tgagtcttcc gtttagtttt ccctttattt tgtctgtttc ttttattttc 6420 tttccagcag ttccacacgc gttcgggtgg ataatgccaa ttaagcgatt ggcgctgtca 6480 ctttaggata aggaagg 6497 // ID KC309418; SV 3; linear; genomic RNA; STD; VRL; 3695 BP. XX AC KC309418; XX DT 19-MAR-2013 (Rel. 116, Created) DT 06-JUN-2016 (Rel. 129, Last updated, Version 4) XX DE Sapovirus swine/WG214C/2009/USA polyprotein gene, partial cds; and small DE basic protein VP2 gene, complete cds. XX KW . XX OS Sapovirus swine/WG214C/2009/USA OC Viruses; Riboviria; Caliciviridae; Sapovirus. XX RN [1] RP 1-3695 RX DOI; 10.1128/JCM.00865-13. RX PUBMED; 23678065. RA Scheuer K.A., Oka T., Hoet A.E., Gebreyes W.A., Molla B.Z., Saif L.J., RA Wang Q.; RT "Prevalence of porcine noroviruses, molecular characterization of emerging RT porcine sapoviruses from finisher Swine in the United States, and unified RT classification scheme for sapoviruses"; RL J. Clin. Microbiol. 51(7):2344-2353(2013). XX RN [2] RC Publication Status: Online-Only RP 1-3695 RX PUBMED; 27228126. RA Oka T., Lu Z., Phan T., Delwart E.L., Saif L.J., Wang Q.; RT "Genetic Characterization and Classification of Human and Animal RT Sapoviruses"; RL PLoS One 11(5):E0156373-E0156373(2016). XX RN [3] RP 1-3695 RA Scheuer K.A., Wang Q., Saif L.J.; RT ; RL Submitted (06-DEC-2012) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX RN [4] RC Sequence update by submitter RP 1-3695 RA Scheuer K.A., Wang Q., Saif L.J.; RT ; RL Submitted (25-APR-2013) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX RN [5] RC Sequence update by submitter RP 1-3695 RA Oka T., Wang Q.; RT ; RL Submitted (03-JUN-2016) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX DR MD5; 7a431947a3417ee81f274ff7d6ae52fd. DR EuropePMC; PMC4881899; 27228126. DR EuropePMC; PMC5356244; 28302145. XX CC On Jun 3, 2016 this sequence version replaced gi:482680045. XX FH Key Location/Qualifiers FH FT source 1..3695 FT /organism="Sapovirus swine/WG214C/2009/USA" FT /host="swine" FT /strain="WG214C/09/USA" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="2009" FT /db_xref="taxon:1304601" FT CDS <1..3150 FT /codon_start=1 FT /product="polyprotein" FT /note="contains RNA-dependent RNA polymerase and capsid FT protein" FT /db_xref="GOA:M4Q0J9" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4Q0J9" FT /protein_id="AGH15838.2" FT /translation="NTKSKPKPIETWKGLEVDRGEQSMGPLPCSTKYHRSLVHFDTGYE FT PANFGPTDQRCPRPLTDVIAEQLEPYQEEPVPVDQVLLQRGQKHLLAFLRHVLGTNRRE FT HIGMLEAFKSLNHKSSNGPWFPGTKRDYMDAEGNPNALLESYISTKLKDIKLGQFKHHY FT RLSLKDELRPREKVLAGKRRLLWGCDVGLATACAMVYKNLFDEISAAAPYTGCAVGIDM FT DNLDTVRDLNEMFTGSHLVCADYSKWDSTLHPDVIKLAIDTLAQFVTLDDLAVGVNNVL FT CSRPCGLVYDIVVPTKKGLPSGMPGTSIINSVAHLILFAASVLSIYQKCGVPYPGNVFQ FT HERVVVYGDDCVYGFSTATASKAQLFWDQMRAFGMKPTNADKTGDPAFTDTIHFLKRKV FT ILNEGRLMAALDQSSLLRQLAWIKGPKTTKMEPIYPPDPVGRLDQIYNAVWRSAAWGKE FT FFEDFEAKAATLAKHELLPYTGVEYSEAINVLTSISSRPPEGEAIVYVMEGPNGPKGAQ FT PLEQPMEIADGAQSSTAGPPIMVNPAPPNVAIQAASAISASGGAATTLGEDVMSTFCVA FT ANYTWNSRAAPNTLLGAMPLGPQCNPYTRHVAKMYGGWSGSMEIRITVSGSGMYAGKVM FT AVVLPPGVKAEDVTNPGAYPHALIDAKTSISFAVTMYDVRNTDFHFTGDQRVSVFGLWV FT FQPLINPFAGTADSSALITVETRPSIDFRLCLLKSPEDVVEVQAPDDLLPRRFVGSTEN FT RFNTHPQGLEIVNIAHQVNHHYRPNGTTFGWSTVPVAPPQLVVGNSATLGGRTAYLVTP FT AANTDLIPGIPNHWPDACASTSLNGGSNSIATYGAAGVVMTQQGLNVDETKPLSVMYAI FT FESSSSPIGPSVDANKLMIVLNSSITGPAPGDNAICTVNYIQQSNDNQVNVGAAVTPLF FT NNRPSYGPMGGNNLALWRTQLPSSGNNGCVVYSSQLNESAINCMNVRGVPDGAMAVYTV FT HTLGDVFQLGFCFDGYLRTGAALGTVLLDDDATFEFNGIFSVNTPLQGPNGNQGAARRL FT R" FT CDS 3147..3653 FT /codon_start=1 FT /product="small basic protein VP2" FT /note="ORF2" FT /db_xref="InterPro:IPR008437" FT /db_xref="UniProtKB/TrEMBL:M4QBF0" FT /protein_id="AGH15839.2" FT /translation="MSWFAGALGTAGLLGDIANTIGNIVAQQQVVANQRRQLELYQQAI FT DKNLKLQDKMLEMNYELATFGPSLQYSSARKLGFNHIEATQMLGSHRVTYGGVDVEPRA FT LTSLPYYMQNPQLQGQAHSVVSQFVQGAPGFTKPAPAGFSNPNYGVKLTAYRQNIQHQP FT GDSNA" XX SQ Sequence 3695 BP; 906 A; 963 C; 957 G; 869 T; 0 other; aacaccaagt caaagcccaa gcccatagaa acatggaagg gcctagaggt ggaccgtgga 60 gagcagagca tgggcccact accatgctcc acgaagtacc acaggtcgct ggtgcatttc 120 gacaccggtt acgagccagc aaactttggg cccacagatc agcgctgtcc acgaccactc 180 acggatgtca tagctgagca acttgaaccc taccaggaag agcccgttcc tgttgaccag 240 gtgctccttc aacgtgggca gaagcacctg ctggcgtttc tccgtcacgt gctaggcacg 300 aatcgacgtg agcacatagg catgcttgag gccttcaagt cactgaacca caagagctca 360 aatgggccct ggttccccgg cactaagagg gactacatgg atgccgaggg gaatccaaac 420 gccctgttgg aaagttacat cagtaccaaa ctgaaagata taaaactcgg gcagttcaaa 480 caccactata ggttgtcact gaaagacgaa ctcagaccac gggagaaggt gctagccggc 540 aaacgccgct tgctttgggg ctgcgacgtt gggttggcca ctgcgtgtgc tatggtctat 600 aaaaatctgt ttgatgagat cagcgctgct gcgccctaca ctgggtgcgc cgttggtatt 660 gacatggaca accttgacac ggtgagagac ctgaacgaga tgttcactgg gtcccacctt 720 gtgtgtgcag attacagcaa gtgggactcc acactgcacc ctgatgtcat caaactggcc 780 attgacactc ttgcacagtt cgttacgctg gacgacttgg ctgttggcgt caacaacgtg 840 ctctgcagcc gtccttgtgg tctggtgtat gacatagtgg tcccaaccaa gaaggggctc 900 ccgtcaggga tgccaggcac cagtatcatc aattctgtgg cccatttgat tctgtttgcc 960 gccagcgtac tttccatcta tcagaagtgt ggcgtgcctt acccagggaa cgttttccag 1020 cacgaacgtg tggttgtcta tggtgatgac tgtgtctacg gattttcaac agcaactgcc 1080 tctaaggcac agctattttg ggaccagatg agagcctttg gcatgaaacc cacgaacgcc 1140 gacaagaccg gtgacccagc attcactgac accatccact tcctcaagcg gaaggtgatt 1200 ctaaatgagg gtcgactcat ggctgcactt gaccaatcat ccttgctgcg gcagcttgcc 1260 tggattaagg gacccaagac tacgaagatg gaaccaatct accccccaga cccagttggg 1320 aggcttgacc aaatttacaa tgccgtatgg aggtctgctg cctggggcaa agagttcttt 1380 gaggactttg aagccaaggc tgctacactg gcaaaacacg aattgttacc ctacactggt 1440 gtggaatact cagaggcaat taacgtgcta acatctattt catccagacc gcctgagggt 1500 gaggcaatag tgtatgtgat ggagggtcca aacggcccta agggcgctca gcccctggaa 1560 caacccatgg agatagccga tggtgcccag tcgtcaacgg ctggcccacc tatcatggtt 1620 aacccagctc ccccgaatgt ggcaattcag gctgccagtg ctatatcagc tagcggagga 1680 gctgcaacca cgctcggcga agacgtaatg tcgacgtttt gcgtggcggc aaactacacc 1740 tggaactcac gtgcagcacc gaacacactg cttggtgcca tgccgttggg cccacaatgt 1800 aatccttaca cccgtcacgt cgcgaagatg tacggcggat ggtctggttc catggaaatt 1860 cgaatcacgg tgtcagggtc aggaatgtac gccgggaaag tcatggccgt tgtgcttcca 1920 cctggggtca aagctgagga cgtgacaaac cctggtgctt acccacatgc gcttattgac 1980 gcaaagacat ccatctcatt tgccgtcact atgtatgatg tgcgcaacac agacttccac 2040 ttcactgggg atcagcgagt cagtgtgttt ggcttgtggg tattccaacc acttatcaac 2100 ccttttgctg gtacagctga ctcgagtgca cttatcacgg ttgagaccag gccctcaatt 2160 gacttcagac tgtgcttgct gaaatcccca gaggacgttg ttgaagttca ggcgcctgat 2220 gatttgttgc cacgcaggtt cgtgggtagc actgaaaata ggtttaacac tcacccgcaa 2280 ggccttgaga ttgttaacat tgcacaccaa gtcaaccacc actaccgacc caatggcacc 2340 acgtttggtt ggtcgacggt gcctgtcgct ccaccacaat tggtggtcgg taactctgca 2400 acccttggtg gaagaacagc atacttggtc actccagccg ccaacaccga cttgattcca 2460 ggtataccca accactggcc agacgcttgc gccagcacct cactcaatgg cggaagcaac 2520 tcaattgcaa cttatggtgc tgctggggtt gtgatgacac agcagggctt gaatgtggat 2580 gagacgaaac cacttagtgt tatgtatgcc atctttgagt ctagcagctc gccaattggg 2640 ccaagtgttg atgccaacaa gttaatgatt gtactcaaca gctcaatcac tggtcctgcc 2700 cctggtgaca atgcgatatg cacagtcaac tacatccagc agagcaatga caaccaagtg 2760 aacgttggtg cggctgtgac cccactcttt aacaatcggc ccagttatgg gccaatgggt 2820 ggcaacaatt tagccctgtg gcgcacccaa ctaccatcat caggtaacaa tggttgtgtg 2880 gtctacagct cgcagctcaa cgaaagtgcc atcaattgca tgaacgttcg cggcgtgcca 2940 gatggggcaa tggctgtgta tactgtgcac acattggggg atgtgtttca actggggttt 3000 tgctttgacg ggtacctgag aactggtgcc gcgcttggca ctgtgctcct tgatgatgat 3060 gccacgtttg agtttaatgg gattttctct gtgaacactc cactacaagg gcccaatggg 3120 aaccaggggg ctgcaaggcg tttaagatga gttggttcgc aggggccttg ggtacagctg 3180 gactgctggg cgacatcgcc aacaccatcg gcaatattgt tgcccagcaa caggtggttg 3240 ctaatcagcg taggcagctt gagctttacc aacaagcaat agataaaaat ttgaaactgc 3300 aggacaaaat gttggaaatg aattatgagt tggccacgtt tggtccctct cttcaatatt 3360 cctctgccag aaagttgggt tttaatcaca ttgaagctac ccagatgttg ggctcacata 3420 gggtcacata tggtggtgtt gacgtggaac caagggcact cacttccttg ccatattaca 3480 tgcagaaccc acagttacag ggtcaggccc actcagtggt ctcgcaattt gtgcagggtg 3540 cgcctggttt cacgaaacct gccccggctg gtttttccaa cccaaattat ggtgttaaat 3600 tgactgcata taggcaaaac atacaacacc aacctggaga ctctaatgct tgattgaatt 3660 agttcttttt cttttctttc cctttgcaac agtgg 3695 // ID KC309419; SV 2; linear; genomic RNA; STD; VRL; 7497 BP. XX AC KC309419; XX DT 19-MAR-2013 (Rel. 116, Created) DT 06-JUN-2016 (Rel. 129, Last updated, Version 3) XX DE Sapovirus swine/WG214D/2009/USA polyprotein gene, partial cds; and small DE basic protein VP2 gene, complete cds. XX KW . XX OS Sapovirus swine/WG214D/2009/USA OC Viruses; Riboviria; Caliciviridae; Sapovirus. XX RN [1] RP 1-7497 RX DOI; 10.1128/JCM.00865-13. RX PUBMED; 23678065. RA Scheuer K.A., Oka T., Hoet A.E., Gebreyes W.A., Molla B.Z., Saif L.J., RA Wang Q.; RT "Prevalence of porcine noroviruses, molecular characterization of emerging RT porcine sapoviruses from finisher Swine in the United States, and unified RT classification scheme for sapoviruses"; RL J. Clin. Microbiol. 51(7):2344-2353(2013). XX RN [2] RC Publication Status: Online-Only RP 1-7497 RX PUBMED; 27228126. RA Oka T., Lu Z., Phan T., Delwart E.L., Saif L.J., Wang Q.; RT "Genetic Characterization and Classification of Human and Animal RT Sapoviruses"; RL PLoS One 11(5):E0156373-E0156373(2016). XX RN [3] RP 1-7497 RA Scheuer K.A., Wang Q., Saif L.J.; RT ; RL Submitted (06-DEC-2012) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX RN [4] RC Sequence update by submitter RP 1-7497 RA Oka T., Wang Q.; RT ; RL Submitted (03-JUN-2016) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX DR MD5; 7758590861ba5afd83b7aef9f3120e2a. DR EuropePMC; PMC4881899; 27228126. DR EuropePMC; PMC5356244; 28302145. XX CC On Jun 3, 2016 this sequence version replaced gi:461180486. XX FH Key Location/Qualifiers FH FT source 1..7497 FT /organism="Sapovirus swine/WG214D/2009/USA" FT /host="swine" FT /strain="WG214D/09/USA" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="2009" FT /db_xref="taxon:1304602" FT CDS <1..6897 FT /codon_start=1 FT /product="polyprotein" FT /note="contains RNA-dependent RNA polymerase and capsid FT protein" FT /db_xref="GOA:M4Q2Y6" FT /db_xref="InterPro:IPR000317" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/TrEMBL:M4Q2Y6" FT /protein_id="AGH15840.2" FT /translation="VIALMASRPFKAVSVSFRSELFVLRSAYLRVADRDTFLPDSSLTT FT LTKYLWPSWSRHLSCQRQLAHTPSPQLPATPGVLFQEEGLGDWFRSAVARDVDPQEVYK FT KLLGIDMRQACPLSMAEMAKLQGETALALDTPGHALNKVYTRGELVKLFATLSRFVPQE FT TTVDEQRRRNELDRENADAFANLPGEGVINANSWKTYFYTMWRRVVKGCRRSYHSLANC FT SSWLGSLAQRAEPIREALANAAVAAGELSKCSADYLLATLVNKLKPTTMVMIYQQHRNT FT FRGWMATLTAFFELHGDLLSKLGCTAATVVAAVTGCFELLTGFIEELIQKFATTQNPQG FT PTDVAWVSICAGILAIIMRLGGCKDVLQSWPHLLKAAATVTTLTAAAKSFQWVRDQFAQ FT AHLNRKVKMFMARCAALVELTHSREVCGVDELKELLKCYNVLEEEGNDLVQEAGNGTQA FT SIIRGYMQDLATQATNLRSTIALDTPRKVPVAVILTGPPGIGKTWLAQEIGKGFGKLSN FT FTVLQDHHDSYTGNPVAIWDEFDVDPKGVFVETMISLVNSAPCPLNCDRPENKGKMFTS FT QYIICTSNFPTSVIPDNPRAQAFYRRVITVDVSSPSISKWMATNPGRRPPKDLFKSDFS FT HLELSLRPYMGYDPEGNVLGGKKGRVTQITVDGLVQLMERRFEEQARDPPRNVWITVPK FT PLVADALAAVKKYVQANRGLCHVTSQPSPTECGDRHVTQIVVSDAGPVGTSSFLHVKTR FT GMSLEGPSVAHSLLSMFDTDIRVPGTQQREWLYRIYDPTLVVQETSLCSQAIPVVRRVV FT MVENVFDFITNVRHHLGFCSIPGMFTAFRGWRDSTSIVDFISRHFKDFKFPHNPECTIF FT RCANGDVLFYTFGSYLMFASPARIPVACDQDVPSLGNVPAKMTWFETIKMCCEYYYQFL FT STVLPYLVTMSNVMYLFSRGDREPEAKGKTKHGRGLRHTHGKGVSLRDDEYDEWRDLMR FT DWRLEMTADQFLELRERAYAGMQGPEEDRYRTWLSLRAMRLGTGNYQHATIIGRGGVRD FT ELIRTSVLRAPRRKGLDDIIGDSYEAEANTPMVQFTSQGDHVGWGVHLGNGRIVTITHV FT AMGANEVEGQQFTIQRTEGETCYVNAPLKGHPHAQIGSGEPAFFSYRFHPVIVIGEGQY FT DTPKTTVHGWHVRITNDYPTKKGDCGTPYLDECRRVVGLHAAGAINGSTKLAQRVIEDT FT DNVTKFSWKGLSVERIPSVGGMPTGTRYHRSPAWPSMMPTETHAPAPFGAGDTRYGFSQ FT VEMLVNNLKPYATPTPGIPPALLQRAAVHVRSYLQSVIGRERSEPLSFQMAEQILERST FT SCGPHVPGLKGDYWDEATQQYTGVLREHLERAWNNAHIGQPLPHDYKLALKDELRPLAK FT NAEGKRRLLWGADAGVVLIAAAAFKPVAIRLAATVPMHPVSVGVNMDSGQINIINESLV FT GRVVYCLDYSKWDSTQHPAVSSSSIDILKSFCVDCPLVSSAAEVLRSPARGCFEDVCFT FT TVSGLPSGMPFTSVINSLNHMTYVAAAILKAYQDVGAPYTGNVFQLETLHTYGDDSIYG FT FTPATASLFPQILDNLRSFGLKPTDASKGTDIRPVDRPVFLKREFVNTPDGLRAVLDVT FT SLERQCYWIKGSRTSDINSPTTFDVQGRAMQLEVMLAYASQHGVKEHERLAHLAETTSK FT AEGYTLVNLNFEQARATYNSWFVGGSAPELTTIASEGSGQVVFEMEGVGSNPQQPQTPA FT MTSNPQGVVGPMEAPLVAVNPETPVAPAQRMELAVATGARTSCVPDPIRQCFALYRTFP FT WNDRQPQGTFIGAVVLSPGANPYTAHLSAMFAGWGGGMEVRCNVSGSGMYAGRLIISIL FT PPGLNPATVGDPGALPHVLLDARTTDPAVFVVPDVRAVDYHRTDGDEATSSLGIWVLQP FT LINPFATGAVSTAWLTLETRPSFDFDFCLLKPPTVQMENGTPPDRLLPKRLNRARGNRV FT GGNVKGMVVVAAHKQVNRHFMADGTTWGWSTAPVAPMAAAVYGNVGPATADPKCGSQIG FT VGSDNKGPLFPNIPDHWPDTCATVVCQWDNGAYGPKTAITGTLMLFDDNGDVNENIATY FT CTAISAIMPGTPQRPALRESFNAGSMTLVGIGANSITQQGNMNLYFSPQFVRGNVGQIE FT GRVCNLQGMNYTFSSSGPNNVVLWQEQLFSDHPGAQYVWSSQLDTTAEAFQSGPVNIPA FT NSMAVYNVTSNAAEFQVAIRPDGYMVTTAQVGTTIPLDPETTFQYVGTFPLNSVLNGPN FT GNTVGSRRVQL" FT CDS 6894..7421 FT /codon_start=1 FT /product="small basic protein VP2" FT /note="ORF2" FT /db_xref="InterPro:IPR008437" FT /db_xref="UniProtKB/TrEMBL:M4Q477" FT /protein_id="AGH15841.1" FT /translation="MSWMIGALQTLGGLTDVANTISGIVYQQRQLDLLQQQTQLQAEWM FT RRNEQLQRDSLALSRELSTEGPTLRFQSALAAGFNPVDARRLAGSGERVVRGYLDQPVF FT HQGDLVGLRATSHLNTMNAALTTFKQGHPLGTGSPPKPTTGTPQPQGFANPNYQPRPPG FT LTLQGTSSSSKV" XX SQ Sequence 7497 BP; 1657 A; 2114 C; 1936 G; 1790 T; 0 other; gtgatagctt tgatggcctc ccggcctttc aaagccgttt cggtttcgtt ccgaagcgaa 60 ctctttgtcc ttcgttccgc ctacctcagg gtagcggaca gggacacttt tctccctgat 120 tcttcattga ccacattgac taaatatttg tggccttcat ggtcacgtca cctctcttgc 180 cagcgtcagc tggcacacac accctcacct cagcttccag ccacgccggg tgtcctcttc 240 caagaagagg gcctgggcga ttggttccgg tcggctgttg cacgtgacgt tgacccccag 300 gaggtgtata aaaaacttct cggaattgac atgcgccagg cttgccctct ttccatggct 360 gaaatggcca agttgcaagg ggagacagca ctggcccttg acacacccgg acatgccctt 420 aacaaagttt acactcgtgg cgagctggtt aagctgttcg ccaccctttc gcgattcgtt 480 ccgcaggaaa caactgtaga cgaacaacgg cggcgaaatg agctggaccg tgaaaatgct 540 gacgcctttg ccaacctccc gggtgagggc gtcattaatg ccaacagctg gaaaacctat 600 ttctacacca tgtggcggag ggttgtgaaa ggttgccgcc gttcttacca ctccctagcc 660 aattgctcgt cctggcttgg ctctcttgct cagcgggctg agccgattcg agaagccctt 720 gccaatgcgg ctgttgctgc tggggaattg tcaaaatgtt cggctgacta tttgttggcc 780 acacttgtca acaagttaaa acccaccaca atggtaatga tctaccaaca acataggaac 840 acgtttcgcg gttggatggc cacgctaact gcgttctttg agctccacgg tgatctttta 900 tctaagctgg gatgcacggc tgccactgtt gttgcagcag taaccggctg ttttgagctc 960 ctcacaggtt ttatcgagga acttatccag aaatttgcca ccacacaaaa cccacaaggt 1020 cccacagatg tcgcatgggt gagtatatgt gcgggcattc ttgcaatcat aatgcgcctt 1080 ggggggtgca aggatgtcct acagtcatgg ccccacctgc tcaaagcagc cgcgactgtt 1140 acaactctca ctgccgccgc caagtcgttc cagtgggtgc gcgaccagtt tgcccaggcc 1200 catcttaacc gaaaggtaaa aatgttcatg gcacgctgtg ctgccttggt ggaactcacc 1260 cactcacgcg aggtgtgtgg tgtcgacgag ctcaaggagc ttctcaagtg ctacaatgtg 1320 cttgaggagg agggcaatga cctggttcag gaggccggca acggcactca ggctagtatc 1380 atccggggtt acatgcaaga cctagccaca caagcaacta atctcagatc caccattgca 1440 ctcgacaccc cccgcaaagt gcccgttgcg gtgatattga ctgggcctcc aggcattggg 1500 aaaacttggc tcgcccaaga gattggtaag ggtttcggga aactctcaaa tttcaccgtt 1560 cttcaggacc accatgattc ttacacgggg aaccctgttg caatttggga tgagtttgac 1620 gtcgacccga aaggggtttt tgtggaaacc atgatatcac tggtcaattc tgccccatgc 1680 cccctgaatt gcgatcgacc tgaaaataag ggcaagatgt tcacgtcgca gtacatcatt 1740 tgcacctcta atttcccaac atctgtgatt cccgacaacc cgcgggcgca agccttctac 1800 cggcgggtca tcactgttga cgtgtcgtca ccttccatct caaaatggat ggccactaac 1860 cccggccgtc ggcccccaaa agatctgttc aaatcagact tttcccacct tgagttatct 1920 ttaagacctt acatgggtta cgaccccgaa gggaatgtcc ttggaggtaa gaagggccgc 1980 gttacccaaa ttacggtcga cggtctcgtc caattgatgg agcgcagatt cgaggagcag 2040 gctcgcgacc cacctaggaa tgtgtggata actgtcccaa aaccactcgt tgcggatgcc 2100 cttgctgcag tgaagaagta cgttcaggcc aaccgtgggt tgtgccatgt tacctcgcaa 2160 cccagcccca ctgagtgtgg cgaccgtcat gtcacccaaa tagttgtgtc tgacgcagga 2220 ccagttggaa catcctcgtt tttacatgtg aaaacccgtg gaatgtcttt agaggggccc 2280 tccgtagccc attccctcct gtccatgttt gacactgaca tccgtgttcc aggcacccag 2340 caacgtgagt ggctttatcg catttatgac cccacccttg tcgttcaaga aacatcacta 2400 tgctcgcagg ctattcctgt tgtgcgcagg gtggttatgg ttgagaacgt gtttgacttt 2460 atcaccaatg ttcgtcacca tctgggtttt tgttccatcc ctggaatgtt caccgcgttc 2520 cgcgggtggc gcgactcgac ctcaatcgtg gatttcattt ccagacattt taaggacttc 2580 aaattcccac acaatcctga gtgcacaatc ttccgttgtg ccaatggtga cgtgttgttt 2640 tacacatttg gctcctatct gatgtttgca tcccccgccc gcatcccggt ggcttgtgac 2700 caagacgtcc catcgctcgg aaatgtccca gcaaagatga cctggtttga aacaatcaag 2760 atgtgttgtg agtattacta ccagttttta tctactgtgt taccctatct agtgacaatg 2820 agcaacgtca tgtatctgtt ctcgcgcggt gaccgtgagc cagaagcaaa agggaaaaca 2880 aaacacgggc gtggcctgcg ccacactcac ggcaaaggtg tgtcgttgcg tgatgacgag 2940 tacgacgaat ggcgcgacct gatgcgtgac tggcgcctgg agatgacagc tgaccaattc 3000 ttggagcttc gggagcgcgc ctacgcaggg atgcagggcc cggaagaaga tcgttatcgt 3060 acttggcttt cgttgcgggc aatgcgactc ggcactggca attaccaaca cgccactatc 3120 attgggcgcg gtggtgtgcg cgatgaactc atacggacga gcgttttgcg cgcaccacgt 3180 aggaagggcc ttgatgatat cattggtgac agttacgaag ccgaggcgaa cacacccatg 3240 gtccagttca cgtcccaggg agatcacgtc gggtggggcg ttcaccttgg taacggccga 3300 attgtcacca tcacacatgt ggctatgggt gcgaatgagg ttgagggcca acagttcacc 3360 atccagcgca cagaagggga gacctgttac gtgaatgccc cacttaaggg acacccccat 3420 gcccagatcg gcagtgggga acccgcgttt ttctcctacc ggttccaccc tgtgattgtc 3480 attggtgaag ggcagtacga cacacccaag accacagttc atggctggca cgtgagaatt 3540 acaaatgatt acccaacaaa gaaaggtgat tgcggcacac cgtacctgga tgaatgccgc 3600 cgtgtcgttg ggcttcacgc cgctggggcg attaacggat ccaccaagct cgcccagcgt 3660 gttatcgagg acacagacaa cgtcaccaag ttctcgtgga agggactgag tgtggagcgt 3720 atcccttcgg tcggtggcat gccaaccggg actcgttacc accgctcccc tgcctggcca 3780 agcatgatgc ccactgagac acatgcacca gcgccatttg gagcagggga caccaggtac 3840 ggtttttctc aagtggaaat gttggtcaac aacctaaagc cttatgccac tcccacacct 3900 ggaatccccc ctgcccttct tcagcgagct gctgtgcatg tgcggtctta cctccagtcc 3960 gtcattggcc gtgaacgatc tgagcccttg tcattccaga tggcggaaca gatccttgag 4020 aggtccacgt cgtgcggtcc ccacgtccct ggtctcaagg gcgattattg ggacgaagca 4080 actcagcaat acacaggtgt gttgcgggag catcttgaga gagcatggaa caacgcgcac 4140 attggccagc cattgcccca tgattataaa ttggccctga aggatgaact gagaccccta 4200 gccaaaaacg cggaggggaa gcgacgtttg ttgtggggtg ccgatgcggg tgttgtgtta 4260 attgcggctg ctgccttcaa gcccgttgcc atccgattgg ctgcaactgt tccaatgcac 4320 cccgtttcag ttggggtcaa tatggactcc ggccaaatca acatcatcaa tgaatcactt 4380 gtggggcgtg tggtatactg cctggactac tcgaagtggg actctaccca acaccccgct 4440 gtctcatctt catccattga catattaaag tccttctgtg tggactgccc tcttgtgtct 4500 tcagcggcag aggtcctccg ctccccggca cgtgggtgtt ttgaggacgt gtgtttcacc 4560 actgtgtctg gtttgccctc agggatgccc ttcaccagtg taattaattc tctcaatcac 4620 atgacttatg ttgccgctgc catccttaag gcctaccaag atgtgggggc cccctacact 4680 gggaatgttt tccaactgga gacgctgcac acctatggcg atgattccat ttacggtttc 4740 acaccagcga ccgcttcgct ttttcctcaa atccttgaca acttacggtc gtttggtctc 4800 aaacccactg atgcctcgaa agggaccgac atccgtccag tggatcgccc ggtatttctg 4860 aagcgggagt ttgtcaacac accggacgga ttgcgcgctg tgctggacgt gacatcactt 4920 gagcgccagt gctactggat caagggttct cgcacctcag acatcaattc cccaaccacc 4980 tttgatgtcc aagggcgagc catgcaactt gaggtgatgc tggcatatgc ctctcagcac 5040 ggggttaaag aacatgaacg cctcgcccat ttggcagaga caacgtccaa agctgagggc 5100 tacactctag tcaatttgaa ttttgagcag gctcgggcca cctacaactc gtggttcgta 5160 ggtggcagcg cgccggagct cacaaccatc gccagtgaag gctcaggtca ggtagtgttt 5220 gagatggagg gcgttgggtc caaccctcaa cagccccaaa cgcccgcgat gacaagcaac 5280 cctcaggggg tcgtgggccc aatggaggca ccgctggtgg cggtcaaccc tgaaacccct 5340 gtggcgcctg cccaacgaat ggagcttgct gttgccactg gcgcgcgaac gtcgtgcgtt 5400 ccggacccaa tccgtcagtg ttttgcactc tacaggactt tcccatggaa tgaccgccag 5460 ccccaaggca ccttcattgg tgcagtcgtg ttgtcaccag gggcaaatcc ctacaccgca 5520 catttgtcag ccatgtttgc gggatggggt ggtggcatgg aggtgcgctg caacgtttcc 5580 ggttcaggta tgtacgctgg gcgcctcatc atttccattc taccgcctgg tctcaaccct 5640 gcaactgttg gcgacccggg ggctctccct catgttttgt tggatgcccg cacaactgac 5700 ccagcagtgt tcgttgtccc tgatgttcgt gcagttgatt accatcgcac tgacggggat 5760 gaagccacat cctcccttgg tatatgggtg ttgcagccac tgatcaaccc ttttgccact 5820 ggcgccgtat caacagcgtg gctgactttg gagacacgcc caagctttga ttttgacttt 5880 tgcctgctca aaccgcccac tgtccaaatg gagaatggca ccccaccgga ccgtctcctt 5940 ccaaagcgcc tcaaccgagc gcgtggaaat cgggtgggcg gcaatgtcaa gggtatggtg 6000 gttgttgccg cccacaagca agtgaatcgc cattttatgg ctgatgggac aacgtggggt 6060 tggtcaactg ccccagtggc tccaatggct gctgctgttt acgggaatgt tggcccagcc 6120 acagcagacc caaaatgcgg ctcccaaatt ggggttgggt cagataacaa ggggcccctc 6180 tttcccaaca tccccgacca ctggcctgac acgtgcgcca ccgtcgtttg tcagtgggac 6240 aatggtgctt acggcccaaa gacagcaatc actggcacac tcatgttgtt tgatgataac 6300 ggggatgtca atgagaacat tgccacttac tgcacggcca tctcggccat catgcctggc 6360 accccacaac gccctgccct tagggagtcg tttaacgctg gcagtatgac cctagttgga 6420 attggtgcaa attcaatcac tcagcaagga aacatgaacc tttacttttc cccccagttt 6480 gtgcgtggca atgtgggcca gattgagggc cgcgtttgta acttgcaggg tatgaactac 6540 accttttcgt caagtggccc aaacaacgta gtgctatggc aagagcaact tttcagcgac 6600 cacccagggg ctcagtatgt gtggtcatca cagttggaca caactgctga agcgttccaa 6660 tccgggcctg tgaacattcc agcaaattct atggcagtgt ataatgtcac atctaacgct 6720 gccgagtttc aggttgcaat tcgccctgat ggttacatgg tcactactgc ccaggttggc 6780 acgactattc cccttgaccc agagaccacc tttcagtatg ttggcacctt tcccttaaat 6840 tctgttttga atggcccaaa tgggaacaca gtgggctcac gtagggtgca gttatgagtt 6900 ggatgatcgg tgcacttcag acattgggtg ggctgacaga tgttgccaac acaatatcag 6960 gcattgtcta ccagcagcgc caacttgatt tgctccaaca acaaacccag ctccaggctg 7020 agtggatgcg gcgcaatgaa caacttcagc gcgattcatt agccctttct cgtgagcttt 7080 ctactgaggg acccaccttg agatttcaat ctgctcttgc agccgggttt aatcccgttg 7140 acgcccgacg gctggcgggg tctggtgagc gcgtagtgcg tggctacctg gaccaacctg 7200 tcttccatca gggagacctc gtgggtctaa gagccacatc acacctcaac actatgaacg 7260 ctgcactaac cacgttcaaa caaggtcatc cacttggcac cggttcacca ccaaagccga 7320 ccacgggcac cccacaaccc caaggctttg ctaatccaaa ttatcaacca cgcccacctg 7380 gtttaacctt gcaaggcacc tcttcttcct caaaagtttg aattttcttt tctttcccct 7440 ggttccacac gcgttcgggt ggacaatgca ggttaagcga ccagcgccag tttccgg 7497 // ID KC309420; SV 1; linear; genomic RNA; STD; VRL; 2933 BP. XX AC KC309420; XX DT 19-MAR-2013 (Rel. 116, Created) DT 06-JUN-2016 (Rel. 129, Last updated, Version 3) XX DE Sapovirus swine/WGP3/2009/USA polyprotein gene, partial cds; and small DE basic protein VP2 gene, complete cds. XX KW . XX OS Sapovirus swine/WGP3/2009/USA OC Viruses; Riboviria; Caliciviridae; Sapovirus. XX RN [1] RP 1-2933 RX DOI; 10.1128/JCM.00865-13. RX PUBMED; 23678065. RA Scheuer K.A., Oka T., Hoet A.E., Gebreyes W.A., Molla B.Z., Saif L.J., RA Wang Q.; RT "Prevalence of porcine noroviruses, molecular characterization of emerging RT porcine sapoviruses from finisher Swine in the United States, and unified RT classification scheme for sapoviruses"; RL J. Clin. Microbiol. 51(7):2344-2353(2013). XX RN [2] RP 1-2933 RA Scheuer K.A., Wang Q., Saif L.J.; RT ; RL Submitted (06-DEC-2012) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX DR MD5; 58c16bbc48d7fc891d4edd5459231b51. DR EuropePMC; PMC5356244; 28302145. XX FH Key Location/Qualifiers FH FT source 1..2933 FT /organism="Sapovirus swine/WGP3/2009/USA" FT /host="swine" FT /strain="WGP3/09/USA" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="2009" FT /db_xref="taxon:1304604" FT CDS <1..2398 FT /codon_start=2 FT /product="polyprotein" FT /note="contains RNA-dependent RNA polymerase and capsid FT protein" FT /db_xref="GOA:M4Q0G1" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4Q0G1" FT /protein_id="AGH15842.1" FT /translation="LHPVVIQHAVDVLMDFVVEDDLALGVANVLKSRPSGLVYDLIIPT FT LKGLPSGMPGTSVINSVCHLILFASAVLGVYQRFNAPYGGNVFQHEKVVTYGDDCIYGF FT CTATASKVSTFWDLMRAFGMHPTNADKSGDPTFAHTIQFLKRTIVLRDGHLLGALEPSS FT LWRQLNWIKGSKTTTMEPIYPPDPIGRLDQIYNAVWRSAAWGEEFFRTFEQHARELCKA FT ERLPYTEVNYSEAIQVLTSISSTTPEGKAVVYVMEGPTAPRQDQAIEMADGVQSSSAAP FT PIMVNPAPPNQVAMAVASNDAAGGAVQTVADDIKSTYCVHKNFTWNARAAQGTLIGFVT FT LGPDCNPYTDHISRMYRGWSGSMKVRISISGSGIYAGKIMAAILPPGLNPEDAGNPGSY FT PHALIDAKTNLAFSVDVYDIRNTEYHYQGDRNVSSLGIWVYQPLINPFAQGDASAMITI FT ETMPGPDFNFCMLKSPDSVTNVNAPSELLPITLLGSRDNRFGQTPVGFVGANIATQANH FT HFDCSGVTFGWSTVPFAAPNIVLETARVDLGGGMAGYRARPAEGEDVVVAGVPNHWPDS FT CASTAINTGSNSAAGKGAGGVVLTHQNGDIGENNPYTVLYMVFNQTGAPLAGDIADGNL FT MVRTLNGTTGSAPANNGSLTTVSYIQTNADNNGNIQSATTPLRGRAVSYGPLGGNNILL FT WQVELPSSHRGGGVVYSSQLDHTAVACANIVNVPPGGMAVYTVKSAGDVFQVGVCHDGY FT LRTGIGNGVVALLDPRTTFEYNGVYSITTPLNGPDGTGRGFRMAP" FT CDS 2395..2901 FT /codon_start=1 FT /product="small basic protein VP2" FT /note="ORF2" FT /db_xref="InterPro:IPR008437" FT /db_xref="UniProtKB/TrEMBL:M4Q0K3" FT /protein_id="AGH15843.1" FT /translation="MSWTAGALSGLGLLSDIAGNIGNIVAQQQIVKNQKKQLEIQQQAL FT TQQVRLAERAQDLTMFLNTNGPQLQYSAARSLGYNHLEAKQIVGGSRVSYGGVDVEPRP FT LVTMPFYNAGANSHAKAQMVVSQFKQGTTGFTLPQPQGFSNPNYQARLTRFRQNLEHAP FT GESVV" XX SQ Sequence 2933 BP; 713 A; 763 C; 765 G; 692 T; 0 other; actgcatccc gtggtcattc aacacgctgt ggacgtgctc atggattttg tggttgagga 60 tgatcttgcg ttgggtgtgg caaacgttct aaaatcacgc ccaagtggcc ttgtgtacga 120 tttgataatc ccaaccctca aagggcttcc ctctggtatg cctgggacta gtgtcatcaa 180 ttctgtgtgt cacctgatct tgttcgcttc ggccgtccta ggagtgtatc agagatttaa 240 cgcgccctat ggtgggaacg tgtttcagca cgagaaggtc gtcacgtatg gtgatgattg 300 catctatgga ttttgcacag cgacggcctc aaaagtgagc actttctggg atttgatgag 360 agcctttggc atgcacccta ccaacgctga taaatcaggt gacccaacct tcgcgcacac 420 tatccagttc ttgaagcgca caattgtcct cagagatggt cacctcctcg gtgcgcttga 480 accatcttcc ttgtggcgac aactcaattg gataaagggt tccaagacaa cgacaatgga 540 accaatttac cccccagacc caattggtag actcgaccaa atttacaatg cagtgtggag 600 gtcagccgcc tggggggagg aattctttag aaccttcgag caacatgcac gggagttatg 660 caaagcagaa aggttaccct acactgaggt gaattactct gaggcaatac aagtactaac 720 cagcatttca tccaccacgc ccgagggcaa ggcagtagtg tatgtcatgg aggggccaac 780 ggctccacgc caggaccagg ccattgagat ggcagatggg gtccaatcct ccagtgccgc 840 accaccaatc atggtgaacc ctgctccacc aaaccaggtg gcaatggcgg tcgcctcaaa 900 cgatgctgct ggaggtgcgg tccaaacagt tgcagatgac ataaaaagta catattgcgt 960 ccacaagaat ttcacctgga acgcaagggc agcccaaggc acactcatag ggtttgtcac 1020 acttggacct gactgtaacc catacacaga tcacatctcc cgcatgtacc gtgggtggtc 1080 tggctcgatg aaggtgcgca tctcaatatc tgggtcgggc atctatgcag ggaagatcat 1140 ggccgcgata ttaccaccgg gcctcaaccc tgaagacgct ggtaaccctg gcagttaccc 1200 gcacgcgctg atagacgcaa aaaccaatct tgccttttca gttgacgtgt atgacatcag 1260 aaacactgag tatcactacc agggtgatcg caacgtttcc agcctcggca tatgggtgta 1320 tcaacccttg atcaatccct ttgcccaagg ggatgccagc gcaatgatca cgattgagac 1380 catgcctggg ccagatttta atttctgcat gctcaaatcc cctgatagtg tcaccaatgt 1440 caacgcccct agcgagctcc tacccatcac actccttggg tcacgtgaca acaggtttgg 1500 gcagacacca gtggggtttg tgggtgccaa catcgccacg caagccaatc atcattttga 1560 ctgctctggc gtgacatttg gatggtcgac tgtcccgttc gctgcaccta atatagtttt 1620 ggaaaccgcc cgtgtcgatc tcgggggtgg tatggctggg tacagagccc ggccagctga 1680 aggagaggac gtcgtggttg ctggtgtgcc aaatcattgg ccagactcat gtgctagtac 1740 tgccatcaac actggtagta actcagctgc aggaaagggt gcgggtgggg tcgtcctcac 1800 acaccagaat ggggacattg gcgagaacaa cccctacact gtgttgtaca tggtgttcaa 1860 tcaaactgga gcaccacttg ctggcgacat tgccgatggt aacctcatgg tgcgcacgtt 1920 gaacggtaca acaggttctg caccagcaaa caatgggagc ttgaccaccg tttcgtacat 1980 ccaaaccaat gctgataata acggcaacat acaatctgcc accacaccac tcaggggccg 2040 tgcagtttcg tacggccccc tcggtggcaa taacatcctt ctgtggcagg ttgaattgcc 2100 aagctcccat aggggtgggg gcgtggtgta cagctcccaa cttgaccaca ctgccgtcgc 2160 ttgtgcgaac attgtcaacg tcccgcctgg tgggatggct gtgtatacgg tcaagtccgc 2220 tggtgatgtc ttccaagttg gggtctgcca tgatgggtat cttcgcactg gcatcggcaa 2280 tggcgttgtg gcactgttgg acccacgaac aacctttgag tataacggtg tttactcaat 2340 aacaacacca ctgaatggcc ccgatgggac tgggcggggt ttcaggatgg caccatgagt 2400 tggacagctg gtgcgttatc ggggctgggc ctcctatccg atattgcagg caatattggc 2460 aacatcgtgg cccagcaaca aattgtgaag aatcagaaga aacagctgga aattcagcaa 2520 caggcactca cccaacaagt ccgtctagct gagagggcgc aggatttgac aatgtttttg 2580 aacaccaacg ggccccagtt gcagtattct gctgctcggt ccctaggtta caaccacctg 2640 gaggccaaac aaatcgttgg gggctcacga gtcagctatg gtggtgttga tgttgaaccg 2700 cgtccattgg tcaccatgcc cttttacaac gctggtgcga acagtcatgc taaggcccag 2760 atggttgtta gccagtttaa acagggcacc acggggttca ccttacctca accgcaaggc 2820 ttttcaaacc caaattacca ggctagattg actagattta ggcaaaatct tgaacatgcc 2880 cctggggaaa gtgttgttta aatttgattg atgtttttct tttttctgca agg 2933 // ID KC309421; SV 2; linear; genomic RNA; STD; VRL; 6052 BP. XX AC KC309421; XX DT 19-MAR-2013 (Rel. 116, Created) DT 06-JUN-2016 (Rel. 129, Last updated, Version 3) XX DE Sapovirus swine/WGP247/2009/USA polyprotein gene, partial cds; and small DE basic protein VP2 gene, complete cds. XX KW . XX OS Sapovirus swine/WGP247/2009/USA OC Viruses; Riboviria; Caliciviridae; Sapovirus. XX RN [1] RP 1-6052 RX DOI; 10.1128/JCM.00865-13. RX PUBMED; 23678065. RA Scheuer K.A., Oka T., Hoet A.E., Gebreyes W.A., Molla B.Z., Saif L.J., RA Wang Q.; RT "Prevalence of porcine noroviruses, molecular characterization of emerging RT porcine sapoviruses from finisher Swine in the United States, and unified RT classification scheme for sapoviruses"; RL J. Clin. Microbiol. 51(7):2344-2353(2013). XX RN [2] RC Publication Status: Online-Only RP 1-6052 RX PUBMED; 27228126. RA Oka T., Lu Z., Phan T., Delwart E.L., Saif L.J., Wang Q.; RT "Genetic Characterization and Classification of Human and Animal RT Sapoviruses"; RL PLoS One 11(5):E0156373-E0156373(2016). XX RN [3] RP 1-6052 RA Scheuer K.A., Wang Q., Saif L.J.; RT ; RL Submitted (06-DEC-2012) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX RN [4] RC Sequence update by submitter RP 1-6052 RA Oka T., Wang Q.; RT ; RL Submitted (03-JUN-2016) to the INSDC. RL Food Animal Health Research Program, The Ohio State University, 1680 RL Madison Ave, Wooster, OH 44691, USA XX DR MD5; 50982e89e23e03674bfb348218b4a1d9. DR EuropePMC; PMC3697660; 23678065. DR EuropePMC; PMC4881899; 27228126. DR EuropePMC; PMC5356244; 28302145. XX CC On Jun 3, 2016 this sequence version replaced gi:461180495. XX FH Key Location/Qualifiers FH FT source 1..6052 FT /organism="Sapovirus swine/WGP247/2009/USA" FT /host="swine" FT /strain="WGP247/09/USA" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="2009" FT /db_xref="taxon:1304603" FT CDS <1..5517 FT /codon_start=1 FT /product="polyprotein" FT /note="contains RNA-dependent RNA polymerase and capsid FT protein" FT /db_xref="GOA:M4QBF4" FT /db_xref="InterPro:IPR000317" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR003593" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QBF4" FT /protein_id="AGH15844.2" FT /translation="KKKVLKFTIRTGALLDATKVAGLKASEVDSVLAFIEDLTAEGNEL FT LITPGIGALASVVQTCVRELAQHKTYVTNLKDAALRHEPPKLYVFSGPPGVGKTTLIKK FT LISDLGLPHSNFTLDLDHHDYYTGEEVCVWDEFDTDAKGQYISTVITLVNTVPFPLNCD FT RIENKGRNFTSKIILATTNNETPVQPNDPRFEAFMRRVTYYDVRCPPVTRCYEDGRKPG FT ANLYKDDFSHLTITRRCFMAYNAEGCLANGKKFSGTPVTYSSILANIKGDLASFKLQSP FT AWEGVWVKCLKPHHVPQVIQFFNGVFAHLGLPNRVTTERTQSVSGFYDFIVSTDSHPPG FT AQYHEIVIDGFKTVPDCLQDAYERPLQVFDFVGTPSQTLLNICLTQARGHTILTSNSPI FT NVSSLPRPKEIVYVENWYGLVRAAFRHCSIFTPVALFKMIRNGMTLEQSNVEEFFRKLT FT HEVKFSVAPQCTLIRMPLFDILFFTSVGSMTWILPGRMPFATPGDIGSLVVPAQPVYRG FT SLWGALRMALTSFMNFIKPYLGLVATSVAISHVWGDSLQKKKGKNKGARGMRALNDDEY FT TEWRDMRRDWRTEFTIEQYLDIVSNPDSDYAERYKAWSQLRSLRMANNAYDHAVTIGKG FT GVKWEAQGPGATVKLRCGPTDVGWANRIGEGLYVTATHLMAMADNVDGVTYDVEYSKDD FT CTVIRQHTPTHGPCYKVSTSSKPTCFTEDRIPVTCLSVVETTVSGNKVVGWKVSCDQKT FT VGGDCGKPYLDADGRIVAIHSAAANYGPTKLASRVVLKPSTPAQGETWKGLNVERNTIT FT MGALPGSTKYHKSMLHKDDSYGPANFGPCDTRCPRPLPDVIAEQIKPFQDNPVSIDENL FT IARGAKHVRAFMRHILGAHRRPHIDQLSAFRSLNMKSSNGPWFPGTKKDHIDDEGRPNA FT MLESYISARISDVKTGKFKHHYKLSLKDELRPKEKILAGKRRLLWGCDVGFATACAMVF FT KTLFDDICEAAPYTGCAVGIDMDNITVVKELNDMFTGTHLVCADYSRWDSTLHPVVIQH FT AVDVLMDFVVEDDLALGVANVLKSRPSGLVYDLIIPTLKGLPSGMPGTSVINSVCHLIL FT FASAVLGVYQRFNAPYGGNVFQHEKVVTYGDDCIYGFCTATASKVSTFWDLMRAFGMHP FT TNADKSGDPTFAHTIQFLKRTIVLRDGHLLGALEPSSLWRQLNWIKGSKTTTMEPIYPP FT DPVGRLDQIYNAVWRSAAWGEEFFRTFEQHARELCKAERLPYTEVNYSEAIQVLTSISS FT TTPEGKAVVYVMEGPTAPRQDQAIEMADGVQSSSAAPPIMVNPAPPNQVAMAVASNDAA FT GGAVQTVADDIKSTYCVHKNFTWNARAAQGTLIGFVTLGPDCNPYTDHISRMYRGWSGS FT MKVRISISGSGIYAGKIMAAILPPGLNPEDAGNPGSYPHALIDAKTNLAFSVDVYDIRN FT TEYHYQGDRNVSSLGIWVYQPLINPFAQGDASAMITIETMPGPDFNFCMLKSPDSVTNV FT NAPSELLPITLLGSRDNRFGQTPVGFVGANIATQANHHFDCSGVTFGWSTVPFAAPNIV FT LETARVDLGGGMAGYRARPAEGEDVVVAGVPNHWPDSCASTAINTGSNSAAGKGAGGVV FT LTHLNGDIGESNPYTVLYMVFNQTGAPLAGDIADGNLMVRTLNGTTGSAPANNGSLTTV FT SYIQTNADNNGNIQSATTPLRGRAVSYGPLGGNNILLWQVELPSSHRGGGVVYSSQLDH FT TAVACANIVNVPPGGMAVYTVKSAGDVFQVGVCHDGYLRTGIGNGVVAPLDPRTTFEYN FT GVYSITTPLNGPDGTGRGFRMAP" FT CDS 5514..6020 FT /codon_start=1 FT /product="small basic protein VP2" FT /note="ORF2" FT /db_xref="InterPro:IPR008437" FT /db_xref="UniProtKB/TrEMBL:M4Q2Y9" FT /protein_id="AGH15845.1" FT /translation="MSWTAGALSGLGLLSDIAGNIGNIVAQQQIVKNQKKQLEIQQQAL FT TQQVRLAERAQDLTMFLNTNGPQLQYSAARSLGYNHLEAKQIVGGSRVSYGGVDVEPRP FT LVTMPFYNAGANSHAKAQMVVSQFKQGTTGFTLPQPQGFSNPNYRARLTGFRQNLEHVP FT GESVV" XX SQ Sequence 6052 BP; 1533 A; 1563 C; 1546 G; 1410 T; 0 other; aagaagaagg ttctcaagtt caccatacgc acaggtgcac tgctagacgc aacgaaagtt 60 gcgggtctta aagcctctga agtcgacagt gtgcttgcat ttatagaaga cctcacagca 120 gagggtaatg agttgttgat aacgccaggc attggagccc tagcatcagt cgttcagacc 180 tgcgttagag agcttgccca gcacaaaacg tatgtcacaa acttaaagga tgctgctttg 240 cgacatgagc cgccaaaact gtatgtgttt tctggccctc ctggtgtggg aaaaaccaca 300 cttataaaga aactaatatc tgaccttggg ctcccacact ctaacttcac gttggacctc 360 gatcaccacg actactacac tggtgaggaa gtgtgtgtgt gggatgagtt tgacactgat 420 gcaaagggac agtacatatc aactgtcata acactggtaa acacggttcc ttttccatta 480 aactgtgacc gcattgagaa caagggtcgc aacttcacct cgaagatcat attggccacc 540 acaaacaatg agactcctgt tcaaccaaac gacccacgtt ttgaagcatt catgcgccgc 600 gtcacatact atgatgtgcg gtgcccgcca gttacgcggt gctatgaaga cgggaggaag 660 ccaggtgcta acctctacaa ggatgatttt tcacacctca caataacaag gaggtgtttc 720 atggcctaca acgcggaggg atgcctggcg aatggaaaga agtttagcgg aacgccagtg 780 acttatagct ccatcttagc caacatcaag ggtgacctgg caagttttaa actccagagt 840 cctgcatggg agggcgtttg ggtcaagtgc ttgaaacccc accatgtacc acaagtcatc 900 cagttcttta acggtgtgtt cgcgcacctt ggcctcccca atcgcgttac gaccgagcgc 960 acacaatcgg tcagtgggtt ttatgacttt attgtgtcaa ctgatagcca cccaccaggc 1020 gcccagtacc acgagatagt cattgatggg tttaaaactg tccctgattg cctccaagat 1080 gcctatgaga ggcccttaca ggtgtttgat tttgtaggca ccccatccca gaccctgctc 1140 aacatttgct tgacccaagc gcgtggtcac actatactca catctaactc acccattaac 1200 gtgagttccc tcccacgccc caaagaaata gtgtatgttg aaaattggta cgggttggtt 1260 agagcagcct ttcgccactg ttccatattc acaccagttg cgctgttcaa aatgatccgc 1320 aacgggatga cattagagca atccaacgtt gaagagtttt ttcgcaaact cacccacgag 1380 gtgaagttca gcgtggcccc acagtgcacc ttgatccgca tgccattgtt tgacatccta 1440 tttttcactt ctgttggttc aatgacatgg attttgccag ggcgcatgcc atttgccaca 1500 ccgggtgata tcggctccct tgtcgtgccc gcacaacctg tctaccgtgg gtctttgtgg 1560 ggtgcccttc gtatggcact cacatctttc atgaatttca taaaacccta cctgggcctc 1620 gttgccacct cagttgcaat ctcacacgtg tggggtgact ccctccaaaa gaagaaaggt 1680 aagaacaagg gtgccagagg catgcgcgcc ttaaacgatg acgagtacac cgagtggcgt 1740 gacatgaggc gtgattggcg tacggagttc acaattgagc agtacctcga catagtgtcc 1800 aaccccgaca gcgactacgc tgagcgctac aaagcgtggt cacaactgcg ctcactacgc 1860 atggccaaca acgcatacga ccatgccgtt accataggga aaggtggggt taagtgggag 1920 gcgcaggggc ctggcgccac agttaaactc agatgtggtc ccaccgacgt tgggtgggcc 1980 aatcgcatcg gtgagggcct ctacgtgaca gcaacgcacc tgatggccat ggcagacaac 2040 gttgatggtg tgacctatga cgttgagtac tccaaggatg attgcacagt gattagacag 2100 cacacaccaa cacacggtcc ttgttataag gtgtcaacaa gcagcaaacc cacgtgcttc 2160 acagaagatc gcatcccagt cacgtgcctg tcagtcgtgg agaccaccgt gtctggtaac 2220 aaagtagttg gttggaaagt gagttgtgac cagaagactg ttggtggtga ctgtggaaag 2280 ccttacctgg acgctgatgg tcgcattgtc gcaatacaca gtgcggctgc caactatggt 2340 ccaacaaagt tggcctctag ggtggtgcta aaaccaagca cacctgctca aggggaaacc 2400 tggaagggcc tcaacgttga gcgcaacacc atcacaatgg gtgcactccc tggctccaca 2460 aagtaccaca aatcaatgct gcataaagat gactcatacg ggccagcaaa ctttggacct 2520 tgtgacacac gctgcccaag gccactgcct gatgtgattg ctgagcaaat aaaacctttc 2580 caagacaacc cggttagcat tgatgagaac ctgatcgccc gtggtgcgaa gcatgtgcgt 2640 gcctttatga ggcacatact tggtgcacac cgtagaccac acattgacca actcagtgcc 2700 ttcaggtcgc tcaacatgaa gagctcaaac ggtccatggt tcccgggcac aaagaaagac 2760 cacattgatg acgagggcag acccaacgct atgctcgagt cgtacatctc agcaaggatt 2820 agtgatgtga aaacgggcaa atttaagcac cactataaac tatctctcaa agatgagttg 2880 agaccaaaag aaaagatatt ggctggcaaa cgacgcttat tatggggttg tgatgtgggg 2940 ttcgcaacag cgtgcgccat ggtctttaaa acactgtttg acgacatctg tgaggcagca 3000 ccttatactg gttgtgccgt tggcattgat atggacaaca tcacggtggt caaagagctg 3060 aatgacatgt tcactggcac ccacttggtg tgtgctgact attcaaggtg ggactccaca 3120 ctgcatcccg tggtcattca acacgctgtg gacgtgctca tggattttgt ggttgaggat 3180 gatcttgcgt tgggtgtggc aaacgttcta aaatcacgcc caagtggcct tgtgtacgat 3240 ttgataatcc caaccctcaa agggcttccc tctggtatgc ctgggactag tgtcatcaat 3300 tctgtgtgcc acctgatctt gttcgcttcg gccgtcctag gagtgtatca gagatttaac 3360 gcgccctatg gtgggaacgt gtttcagcac gaaaaggtcg tcacgtatgg tgatgattgc 3420 atctatggat tttgtacagc gacggcctca aaagtgagca ctttctggga tttgatgaga 3480 gcctttggca tgcaccctac caacgctgat aaatcaggtg acccaacctt cgcgcacacc 3540 atccagtttt tgaagcgcac aattgtcctc agagatggtc acctcctcgg tgcgcttgaa 3600 ccatcttcct tgtggcgaca actcaattgg ataaagggtt ccaagacaac gacaatggaa 3660 ccaatttacc ccccagaccc agttggtaga ctcgaccaaa tttacaatgc agtgtggagg 3720 tcagccgcct ggggggagga attctttaga accttcgagc aacatgcacg ggagttatgc 3780 aaagcagaaa ggttacccta cactgaggtg aattactctg aggcaataca agtactaacc 3840 agcatttcat ccaccacgcc cgagggcaag gcagtagtgt atgtcatgga ggggccaacg 3900 gctccacgcc aggaccaggc cattgagatg gcagatgggg tccaatcctc cagcgccgca 3960 ccaccaatca tggtgaatcc tgctccacca aaccaggtgg caatggcggt cgcctcaaat 4020 gacgctgctg gaggtgcggt ccaaacagtt gcagatgaca taaaaagcac atattgcgtc 4080 cacaagaatt tcacctggaa cgcaagggca gcccaaggca cactcatagg gtttgtcaca 4140 cttggacctg actgcaaccc atacacagat cacatctctc gcatgtaccg tgggtggtcg 4200 ggctcaatga aggtgcgcat ctcaatatct gggtcgggca tttatgcagg aaagatcatg 4260 gccgcgatat taccgccggg tcttaaccct gaagacgctg gcaaccctgg cagttacccg 4320 cacgcgctga tagacgcaaa aaccaatctt gccttttcag ttgacgtgta tgacatcaga 4380 aacactgagt atcactatca gggtgatcgc aacgtttcca gcctcggcat atgggtgtat 4440 caacccttga tcaatccttt tgcccaaggg gatgccagcg caatgatcac gattgagacc 4500 atgcctgggc cagatttcaa cttctgcatg ctcaaatccc ctgatagtgt caccaatgtc 4560 aacgccccta gcgagctcct acccatcaca ctccttgggt cacgcgacaa caggtttgga 4620 cagacaccag tggggtttgt gggtgccaac atcgccacac aagccaatca tcattttgac 4680 tgctctggcg tgacatttgg atggtcgact gtcccgttcg ctgcacctaa tatagttttg 4740 gaaaccgccc gtgtcgatct cgggggtggt atggctgggt acagggcccg gccagctgaa 4800 ggagaggacg tcgtggttgc tggtgtgcca aatcattggc cagactcatg tgctagcact 4860 gccatcaaca ctggtagtaa ctcagctgca ggaaagggtg cgggcggggt cgtcctcaca 4920 cacctgaatg gggacattgg cgagagcaac ccctacactg tgttgtacat ggtgttcaat 4980 caaactgggg caccacttgc tggcgacatt gccgatggta acctcatggt gcgcacgttg 5040 aacggcacaa caggttctgc accagcaaac aatgggagct tgaccaccgt ttcgtacatc 5100 caaaccaatg ctgacaacaa cggtaacata caatctgcca ccacgccact caggggtcgt 5160 gcagtttcgt acggccccct cggtggcaat aacatccttc tgtggcaggt tgaattgcca 5220 agctcccata ggggtggggg cgtggtgtac agctcccaac ttgaccacac tgccgtcgct 5280 tgtgcgaaca ttgtcaacgt cccgcctggt gggatggctg tgtatacggt caagtccgct 5340 ggtgatgttt tccaagttgg ggtctgccat gatgggtatc ttcgcactgg catcggcaat 5400 ggcgttgtgg caccgttgga cccacgaaca acctttgagt acaacggtgt ttactcaata 5460 acaacaccac tgaatggccc cgatgggact gggcggggtt tcaggatggc accatgagtt 5520 ggacagctgg tgcgttgtcg gggctgggcc ttctatccga tattgcaggc aatattggca 5580 acatcgtggc ccagcaacaa attgtgaaga atcagaagaa acagctggaa attcagcaac 5640 aggcactcac ccagcaagtc cgtctagctg agagggcgca ggatttgaca atgtttttga 5700 acaccaacgg gccccaactg cagtattctg ctgctcggtc cctaggttac aaccacctgg 5760 aggccaaaca aatcgttggg ggctcacgag tcagctatgg tggtgttgat gtcgaaccgc 5820 gtccattggt caccatgccc ttttacaacg ctggtgcgaa cagtcatgct aaggctcaga 5880 tggttgttag tcagtttaaa cagggcacca cggggttcac cttacctcaa ccgcaaggct 5940 tttcaaaccc aaattatcgg gctagattga ctggatttag gcaaaatctt gagcatgtcc 6000 ctggggaaag tgttgtttaa atttaattga tgtttttctt ttttctgtaa gg 6052 // ID KC309427; SV 1; linear; genomic RNA; STD; VRL; 352 BP. XX AC KC309427; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Echovirus E30 strain EVs/Alexandroupolis.GRC/13.12 VP1 gene, partial cds. XX KW . XX OS Echovirus E30 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; OC Enterovirus B. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-352 RX DOI; 10.1097/INF.0b013e31828f875c. RX PUBMED; 23459085. RA Mantadakis E., Pogka V., Voulgari-Kokota A., Tsouvala E., Emmanouil M., RA Kremastinou J., Chatzimichael A., Mentis A.; RT "Echovirus 30 outbreak associated with a high meningitis attack rate in RT thrace, Greece"; RL Pediatr. Infect. Dis. J. 32(8):914-916(2013). XX RN [2] RP 1-352 RA Pogka V., Voulgari-Kokota A., Emmanouil M., Mantadakis E., Mentis A.F.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Influenza Reference Laboratory of Southern Greece, Hellenic RL Pasteur Institute, Vas. Sofias Ave., Athens 11521, Greece XX DR MD5; 12c08257bb680f0f40a7c996d9806276. XX CC ##Assembly-Data-START## CC Assembly Method :: Bioedit Sequence Alignment Editor v. 7.1.7 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..352 FT /organism="Echovirus E30" FT /host="Homo sapiens" FT /strain="EVs/Alexandroupolis.GRC/13.12" FT /mol_type="genomic RNA" FT /country="Greece" FT /isolation_source="stool sample" FT /collection_date="2012" FT /note="genotype: B" FT /db_xref="taxon:41846" FT CDS <1..>352 FT /codon_start=1 FT /product="VP1" FT /note="structural protein" FT /db_xref="GOA:M4QH45" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QH45" FT /protein_id="AGH27726.1" FT /translation="ALTAVEMGHTSQVVPSDTMQTRHVVNYHTRSESSIENFMGRAACV FT YIAQYATEKVNDKLDRYTNWEITTRQVAQLRRKLEMFTYMRFDLEITFVVTSSQRTSTT FT YASDSPPLTHQVM" XX SQ Sequence 352 BP; 112 A; 89 C; 83 G; 68 T; 0 other; gcactgacag cagtggagat ggggcacaca tcacaggtgg tgccgagtga cacaatgcaa 60 acacggcacg tggtcaacta ccacaccaga tcagaatcgt caatagagaa ttttatgggt 120 agagcagctt gtgtgtacat tgctcagtac gccacagaga aggtcaacga caagttagac 180 aggtacacca actgggagat aacaaccagg caagtggcac aattgaggcg aaaactggaa 240 atgttcacat acatgagatt tgacctcgag atcacatttg ttgtcaccag ctcccagcgc 300 acttcaacca catatgcatc ggactcccct ccactaacac accaagtgat gt 352 // ID KC309428; SV 1; linear; genomic RNA; STD; VRL; 352 BP. XX AC KC309428; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Echovirus E30 strain EVs/Alexandroupolis.GRC/02.12 VP1 gene, partial cds. XX KW . XX OS Echovirus E30 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; OC Enterovirus B. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-352 RX DOI; 10.1097/INF.0b013e31828f875c. RX PUBMED; 23459085. RA Mantadakis E., Pogka V., Voulgari-Kokota A., Tsouvala E., Emmanouil M., RA Kremastinou J., Chatzimichael A., Mentis A.; RT "Echovirus 30 outbreak associated with a high meningitis attack rate in RT thrace, Greece"; RL Pediatr. Infect. Dis. J. 32(8):914-916(2013). XX RN [2] RP 1-352 RA Pogka V., Voulgari-Kokota A., Emmanouil M., Mantadakis E., Mentis A.F.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Influenza Reference Laboratory of Southern Greece, Hellenic RL Pasteur Institute, Vas. Sofias Ave., Athens 11521, Greece XX DR MD5; 9578f18181d9567b7854c7e5a6d4fb8d. XX CC ##Assembly-Data-START## CC Assembly Method :: Bioedit Sequence Alignment Editor v. 7.1.7 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..352 FT /organism="Echovirus E30" FT /host="Homo sapiens" FT /strain="EVs/Alexandroupolis.GRC/02.12" FT /mol_type="genomic RNA" FT /country="Greece" FT /isolation_source="stool sample" FT /collection_date="2012" FT /note="genotype: B" FT /db_xref="taxon:41846" FT CDS <1..>352 FT /codon_start=1 FT /product="VP1" FT /note="structural protein" FT /db_xref="GOA:M4QGX2" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QGX2" FT /protein_id="AGH27727.1" FT /translation="ALTAVEMGHTSQVVPSDTMQTRHVVNYHTRSESSIENFMGRAACV FT YIAQYATEKVNDELDRYTNWEITTRQVAQLRRKLEMFTYMRFDLEITFVVTSSQRTSTT FT YASDSPPLTHQVM" XX SQ Sequence 352 BP; 109 A; 89 C; 86 G; 68 T; 0 other; gcactgacag cagtggagat ggggcacaca tcacaggtgg tgccgagtga cacaatgcaa 60 acacggcacg tggtcaacta ccacaccaga tcagaatcgt caatagagaa ttttatgggt 120 agagcagctt gtgtgtacat tgctcagtac gccacagaga aggtcaacga cgagttagac 180 aggtacacca actgggagat aacgaccagg caagtggcac aattgaggcg aaaactggaa 240 atgttcacat acatgagatt tgacctcgag atcacatttg ttgtcaccag ctcccagcgc 300 acttcaacca catatgcgtc ggactcccct ccactaacac accaagtgat gt 352 // ID KC309429; SV 1; linear; genomic RNA; STD; VRL; 352 BP. XX AC KC309429; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Echovirus E30 strain EVs/Alexandroupolis.GRC/17.12 VP1 gene, partial cds. XX KW . XX OS Echovirus E30 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; OC Enterovirus B. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-352 RX DOI; 10.1097/INF.0b013e31828f875c. RX PUBMED; 23459085. RA Mantadakis E., Pogka V., Voulgari-Kokota A., Tsouvala E., Emmanouil M., RA Kremastinou J., Chatzimichael A., Mentis A.; RT "Echovirus 30 outbreak associated with a high meningitis attack rate in RT thrace, Greece"; RL Pediatr. Infect. Dis. J. 32(8):914-916(2013). XX RN [2] RP 1-352 RA Pogka V., Voulgari-Kokota A., Emmanouil M., Mantadakis E., Mentis A.F.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Influenza Reference Laboratory of Southern Greece, Hellenic RL Pasteur Institute, Vas. Sofias Ave., Athens 11521, Greece XX DR MD5; 4a0090ac2bde5bf9d4c803bf1710ba99. XX CC ##Assembly-Data-START## CC Assembly Method :: Bioedit Sequence Alignment Editor v. 7.1.7 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..352 FT /organism="Echovirus E30" FT /host="Homo sapiens" FT /strain="EVs/Alexandroupolis.GRC/17.12" FT /mol_type="genomic RNA" FT /country="Greece" FT /isolation_source="stool sample" FT /collection_date="2012" FT /note="genotype: B" FT /db_xref="taxon:41846" FT CDS <1..>352 FT /codon_start=1 FT /product="VP1" FT /note="structural protein" FT /db_xref="GOA:M4QU45" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QU45" FT /protein_id="AGH27728.1" FT /translation="ALTAVEIGHTSQVVPSDTMQTRHVVNYHTRSESSIENFMGRAACV FT YIAQYATEKVNDELDRYTNWEITTRQVAQLRRKLEMFTYMRFDLEITFVVTSSQRTSTT FT YASDSPPLTHQVM" XX SQ Sequence 352 BP; 110 A; 89 C; 84 G; 69 T; 0 other; gcactgacag cagtggagat tgggcacaca tcacaagtgg tgccgagtga cacaatgcaa 60 acacggcacg tggtcaacta ccacaccaga tcagaatcgt caatagagaa ttttatgggt 120 agagcagctt gtgtgtacat tgctcagtac gccacagaga aggtcaacga cgagttagac 180 aggtacacca actgggagat aacgaccagg caagtggcac aattgaggcg aaaactggaa 240 atgttcacat acatgagatt tgacctcgag atcacatttg ttgtcaccag ctcccagcgc 300 acttcaacca catatgcgtc ggactcccct ccactaacac accaagtgat gt 352 // ID KC309430; SV 1; linear; genomic RNA; STD; VRL; 352 BP. XX AC KC309430; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Echovirus E30 strain EVs/Alexandroupolis.GRC/26.12 VP1 gene, partial cds. XX KW . XX OS Echovirus E30 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; OC Enterovirus B. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-352 RX DOI; 10.1097/INF.0b013e31828f875c. RX PUBMED; 23459085. RA Mantadakis E., Pogka V., Voulgari-Kokota A., Tsouvala E., Emmanouil M., RA Kremastinou J., Chatzimichael A., Mentis A.; RT "Echovirus 30 outbreak associated with a high meningitis attack rate in RT thrace, Greece"; RL Pediatr. Infect. Dis. J. 32(8):914-916(2013). XX RN [2] RP 1-352 RA Pogka V., Voulgari-Kokota A., Emmanouil M., Mantadakis E., Mentis A.F.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Influenza Reference Laboratory of Southern Greece, Hellenic RL Pasteur Institute, Vas. Sofias Ave., Athens 11521, Greece XX DR MD5; c176d4eb567e28963946dd6fc3ee634f. XX CC ##Assembly-Data-START## CC Assembly Method :: Bioedit Sequence Alignment Editor v. 7.1.7 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..352 FT /organism="Echovirus E30" FT /host="Homo sapiens" FT /strain="EVs/Alexandroupolis.GRC/26.12" FT /mol_type="genomic RNA" FT /country="Greece" FT /isolation_source="stool sample" FT /collection_date="2012" FT /note="genotype: B" FT /db_xref="taxon:41846" FT CDS <1..>352 FT /codon_start=1 FT /product="VP1" FT /note="structural protein" FT /db_xref="GOA:M4QJG3" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QJG3" FT /protein_id="AGH27729.1" FT /translation="ALTAVEIGHTSQVVPSDTMQTRHVVNYHTRSESSIENFMGRAACV FT YIAQYATEKVNDELDRYTNWEITTRQVAQLRRKLEMFTYMRFDLEITFVVTSSQRTSTT FT YASDSPPLTHQVM" XX SQ Sequence 352 BP; 110 A; 88 C; 84 G; 70 T; 0 other; gcactgacag cagtggagat tgggcacaca tcacaggtgg tgccgagtga cacaatgcaa 60 acacggcacg tggtcaacta ccacaccaga tcagaatcgt caatagagaa ttttatgggt 120 agagcagctt gtgtgtacat tgctcagtac gccacagaga aggtcaacga cgagttagac 180 aggtacacca actgggagat aacaaccagg caagtggcac aattgaggcg aaaactggaa 240 atgtttacat acatgagatt tgacctcgag atcacatttg ttgtcaccag ctcccagcgc 300 acttcaacca catatgcgtc ggactcccct ccactaacac accaagtgat gt 352 // ID KC309431; SV 1; linear; genomic RNA; STD; VRL; 352 BP. XX AC KC309431; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Echovirus E30 strain EVs/Alexandroupolis.GRC/30.12 VP1 gene, partial cds. XX KW . XX OS Echovirus E30 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; OC Enterovirus B. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-352 RX DOI; 10.1097/INF.0b013e31828f875c. RX PUBMED; 23459085. RA Mantadakis E., Pogka V., Voulgari-Kokota A., Tsouvala E., Emmanouil M., RA Kremastinou J., Chatzimichael A., Mentis A.; RT "Echovirus 30 outbreak associated with a high meningitis attack rate in RT thrace, Greece"; RL Pediatr. Infect. Dis. J. 32(8):914-916(2013). XX RN [2] RP 1-352 RA Pogka V., Voulgari-Kokota A., Emmanouil M., Mantadakis E., Mentis A.F.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Influenza Reference Laboratory of Southern Greece, Hellenic RL Pasteur Institute, Vas. Sofias Ave., Athens 11521, Greece XX DR MD5; ad8854884b1015e37856a0663bc0ce3d. XX CC ##Assembly-Data-START## CC Assembly Method :: Bioedit Sequence Alignment Editor v. 7.1.7 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..352 FT /organism="Echovirus E30" FT /host="Homo sapiens" FT /strain="EVs/Alexandroupolis.GRC/30.12" FT /mol_type="genomic RNA" FT /country="Greece" FT /isolation_source="stool sample" FT /collection_date="2012" FT /note="genotype: B" FT /db_xref="taxon:41846" FT CDS <1..>352 FT /codon_start=1 FT /product="VP1" FT /note="structural protein" FT /db_xref="GOA:M4QKV8" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QKV8" FT /protein_id="AGH27730.1" FT /translation="ALTAVEIGHTSQVVPSDTMQTRHVVNYHTRSESSIENFMGRAACV FT YIAQYATEKVNDELDRYTNWEITTRQVAQLRRKLEMFTYMRFDLEITLVVTSSQRTSTT FT YASDSPPLTHQVM" XX SQ Sequence 352 BP; 112 A; 89 C; 83 G; 68 T; 0 other; gcactgacag cagtggagat tgggcacaca tcacaggtgg tgccgagtga cacaatgcaa 60 acacggcacg tggtcaacta ccacaccaga tcagaatcgt caatagagaa ttttatgggt 120 agagcagctt gtgtgtacat tgctcagtac gccacagaga aggtcaacga cgagttagac 180 aggtacacca actgggagat aacaaccagg caagtggcac aattgaggcg aaaactggaa 240 atgttcacat acatgagatt tgacctcgag atcacattag ttgtcaccag ctcccagcgc 300 acttcaacca catatgcatc ggactcccct ccactaacac accaagtgat gt 352 // ID KC309432; SV 1; linear; genomic RNA; STD; VRL; 352 BP. XX AC KC309432; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Echovirus E30 strain EVs/Alexandroupolis.GRC/04.12 VP1 gene, partial cds. XX KW . XX OS Echovirus E30 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; OC Enterovirus B. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-352 RX DOI; 10.1097/INF.0b013e31828f875c. RX PUBMED; 23459085. RA Mantadakis E., Pogka V., Voulgari-Kokota A., Tsouvala E., Emmanouil M., RA Kremastinou J., Chatzimichael A., Mentis A.; RT "Echovirus 30 outbreak associated with a high meningitis attack rate in RT thrace, Greece"; RL Pediatr. Infect. Dis. J. 32(8):914-916(2013). XX RN [2] RP 1-352 RA Pogka V., Voulgari-Kokota A., Emmanouil M., Mantadakis E., Mentis A.F.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Influenza Reference Laboratory of Southern Greece, Hellenic RL Pasteur Institute, Vas. Sofias Ave., Athens 11521, Greece XX DR MD5; 2bd3a5edfe08b66c8322d941f1daca9f. XX CC ##Assembly-Data-START## CC Assembly Method :: Bioedit Sequence Alignment Editor v. 7.1.7 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..352 FT /organism="Echovirus E30" FT /host="Homo sapiens" FT /strain="EVs/Alexandroupolis.GRC/04.12" FT /mol_type="genomic RNA" FT /country="Greece" FT /isolation_source="stool sample" FT /collection_date="2012" FT /note="genotype: B" FT /db_xref="taxon:41846" FT CDS <1..>352 FT /codon_start=1 FT /product="VP1" FT /note="structural protein" FT /db_xref="GOA:M4QH51" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QH51" FT /protein_id="AGH27731.1" FT /translation="ALTAVEMGHTSQVVPSDTMQTRHVVNYHTRSESSIENFMGRAACV FT YIAQYATEKVNDELDRYTNWEITTRQVAQLRRKLEMFTYMRFDLEITFVVTSSQRTSTT FT YASDSPPLTHQVM" XX SQ Sequence 352 BP; 111 A; 88 C; 84 G; 69 T; 0 other; gcactgacag cagtggagat ggggcacaca tcacaggtgg tgccgagtga cacaatgcaa 60 acacggcacg tggtcaacta ccacaccaga tcagaatcgt caatagagaa ttttatgggt 120 agagcagctt gtgtgtacat tgctcagtac gccacagaga aggtcaacga cgagttagac 180 aggtacacca actgggagat aacaaccagg caagtggcac aattgaggcg aaaactggaa 240 atgttcacat atatgagatt tgacctcgag atcacatttg ttgtcaccag ctcccagcgc 300 acttcaacca catatgcatc ggactcccct ccactaacac accaagtgat gt 352 // ID KC309433; SV 1; linear; genomic RNA; STD; VRL; 352 BP. XX AC KC309433; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Echovirus E30 strain EVs/Alexandroupolis.GRC/11.12 VP1 gene, partial cds. XX KW . XX OS Echovirus E30 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; OC Enterovirus B. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-352 RX DOI; 10.1097/INF.0b013e31828f875c. RX PUBMED; 23459085. RA Mantadakis E., Pogka V., Voulgari-Kokota A., Tsouvala E., Emmanouil M., RA Kremastinou J., Chatzimichael A., Mentis A.; RT "Echovirus 30 outbreak associated with a high meningitis attack rate in RT thrace, Greece"; RL Pediatr. Infect. Dis. J. 32(8):914-916(2013). XX RN [2] RP 1-352 RA Pogka V., Voulgari-Kokota A., Emmanouil M., Mantadakis E., Mentis A.F.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Influenza Reference Laboratory of Southern Greece, Hellenic RL Pasteur Institute, Vas. Sofias Ave., Athens 11521, Greece XX DR MD5; 9ea879d456b35af76db547eba7b2522f. XX CC ##Assembly-Data-START## CC Assembly Method :: Bioedit Sequence Alignment Editor v. 7.1.7 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..352 FT /organism="Echovirus E30" FT /host="Homo sapiens" FT /strain="EVs/Alexandroupolis.GRC/11.12" FT /mol_type="genomic RNA" FT /country="Greece" FT /isolation_source="stool sample" FT /collection_date="2012" FT /note="genotype: B" FT /db_xref="taxon:41846" FT CDS <1..>352 FT /codon_start=1 FT /product="VP1" FT /note="structural protein" FT /db_xref="GOA:M4QGX8" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QGX8" FT /protein_id="AGH27732.1" FT /translation="ALTAVEMGHTSQVVPSDTMQTRHVVNYHTRSESSIENFMGRAACV FT YIAQYATEKVNDELDRYTNWEITTRQVAQLRRKLEMFTYMRFDLEITFVVTSSQRTSTT FT YASDSPPLTHQVM" XX SQ Sequence 352 BP; 110 A; 89 C; 84 G; 69 T; 0 other; gcactgacag cagtggagat ggggcacaca tcacaggtgg tgccgagtga cacaatgcaa 60 acacggcacg tggtcaacta ccacaccaga tcagaatcgt ctatagagaa ttttatgggt 120 agagcagctt gtgtgtacat tgctcagtac gccacagaga aggtcaacga cgagttagac 180 aggtacacca actgggagat aacaaccagg caagtggcac aattgaggcg aaaactggaa 240 atgttcacat acatgagatt tgacctcgag atcacatttg ttgtcaccag ctcccagcgc 300 acttcaacca catatgcatc ggactcccct ccactaacac accaagtgat gt 352 // ID KC309434; SV 1; linear; genomic RNA; STD; VRL; 352 BP. XX AC KC309434; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Echovirus E30 strain EVs/Alexandroupolis.GRC/29.12 VP1 gene, partial cds. XX KW . XX OS Echovirus E30 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; OC Enterovirus B. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-352 RX DOI; 10.1097/INF.0b013e31828f875c. RX PUBMED; 23459085. RA Mantadakis E., Pogka V., Voulgari-Kokota A., Tsouvala E., Emmanouil M., RA Kremastinou J., Chatzimichael A., Mentis A.; RT "Echovirus 30 outbreak associated with a high meningitis attack rate in RT thrace, Greece"; RL Pediatr. Infect. Dis. J. 32(8):914-916(2013). XX RN [2] RP 1-352 RA Pogka V., Voulgari-Kokota A., Emmanouil M., Mantadakis E., Mentis A.F.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Influenza Reference Laboratory of Southern Greece, Hellenic RL Pasteur Institute, Vas. Sofias Ave., Athens 11521, Greece XX DR MD5; 12eff049d247fcd562a971ab93a76190. XX CC ##Assembly-Data-START## CC Assembly Method :: Bioedit Sequence Alignment Editor v. 7.1.7 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..352 FT /organism="Echovirus E30" FT /host="Homo sapiens" FT /strain="EVs/Alexandroupolis.GRC/29.12" FT /mol_type="genomic RNA" FT /country="Greece" FT /isolation_source="stool sample" FT /collection_date="2012" FT /note="genotype: B" FT /db_xref="taxon:41846" FT CDS <1..>352 FT /codon_start=1 FT /product="VP1" FT /note="structural protein" FT /db_xref="GOA:M4QU51" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QU51" FT /protein_id="AGH27733.1" FT /translation="ALTAVEIGHTSQVVPSDTMQTRHVVNYHTRSESSIENFMGRAACV FT YIAQYATEKVNDELDRYTNWEITTRQVAQLRRKLEMFTYMRFDLEITFVVTSSQRTSTT FT YASDSPPLTHQVM" XX SQ Sequence 352 BP; 112 A; 89 C; 82 G; 69 T; 0 other; gcactgacag cagtggagat tgggcacaca tcacaagtgg tgccgagtga cacaatgcaa 60 acacggcacg tggtcaacta ccacaccaga tcagaatcgt caatagagaa ttttatgggt 120 agagcagctt gtgtgtacat tgctcagtac gccacagaga aggtcaacga cgagttagac 180 aggtacacca actgggagat aacaaccagg caagtggcac aattgaggcg aaaactggaa 240 atgttcacat acatgagatt tgacctcgag atcacatttg ttgtcaccag ctcccagcgc 300 acttcaacca catatgcatc ggactcccct ccactaacac accaagtgat gt 352 // ID KC309435; SV 1; linear; genomic RNA; STD; VRL; 352 BP. XX AC KC309435; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Echovirus E30 strain EVs/Alexandroupolis.GRC/16.12 VP1 gene, partial cds. XX KW . XX OS Echovirus E30 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; OC Enterovirus B. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-352 RX DOI; 10.1097/INF.0b013e31828f875c. RX PUBMED; 23459085. RA Mantadakis E., Pogka V., Voulgari-Kokota A., Tsouvala E., Emmanouil M., RA Kremastinou J., Chatzimichael A., Mentis A.; RT "Echovirus 30 outbreak associated with a high meningitis attack rate in RT thrace, Greece"; RL Pediatr. Infect. Dis. J. 32(8):914-916(2013). XX RN [2] RP 1-352 RA Pogka V., Voulgari-Kokota A., Emmanouil M., Mantadakis E., Mentis A.F.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Influenza Reference Laboratory of Southern Greece, Hellenic RL Pasteur Institute, Vas. Sofias Ave., Athens 11521, Greece XX DR MD5; 9f6c05c686bedbc3bd07dbc6228a1a88. XX CC ##Assembly-Data-START## CC Assembly Method :: Bioedit Sequence Alignment Editor v. 7.1.7 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..352 FT /organism="Echovirus E30" FT /host="Homo sapiens" FT /strain="EVs/Alexandroupolis.GRC/16.12" FT /mol_type="genomic RNA" FT /country="Greece" FT /isolation_source="stool sample" FT /collection_date="2012" FT /note="genotype: B" FT /db_xref="taxon:41846" FT CDS <1..>352 FT /codon_start=1 FT /product="VP1" FT /note="structural protein" FT /db_xref="GOA:M4QJG9" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QJG9" FT /protein_id="AGH27734.1" FT /translation="ALTAVEMGHTSQVVPSDTMQTRHVVNYHTRSESSIENFMGRAACV FT YIAQYATEKVNDELDRYTNWEITTRQVAQLRRKLEMFTYMRFDLEITFVVTSSQRTSTT FT YASDSPPLTHQVM" XX SQ Sequence 352 BP; 111 A; 88 C; 84 G; 69 T; 0 other; gcactgacag cagtggagat ggggcacaca tcacaggtgg tgccgagtga cacaatgcaa 60 acacggcacg tggtcaacta ccacaccaga tcagaatcgt caatagagaa ttttatgggt 120 agagcagctt gtgtgtacat tgctcagtac gccacagaga aggtcaacga cgagttagac 180 aggtacacca actgggagat aacaaccagg caagtggcac aattgaggcg aaaactggaa 240 atgttcacat acatgagatt tgacctcgag atcacatttg ttgtcaccag ctcccagcgt 300 acttcaacca catatgcatc ggactcccct ccactaacac accaagtgat gt 352 // ID KC309436; SV 1; linear; genomic RNA; STD; VRL; 352 BP. XX AC KC309436; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Echovirus E30 strain EVs/Alexandroupolis.GRC/01.12 VP1 gene, partial cds. XX KW . XX OS Echovirus E30 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; OC Enterovirus B. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-352 RX DOI; 10.1097/INF.0b013e31828f875c. RX PUBMED; 23459085. RA Mantadakis E., Pogka V., Voulgari-Kokota A., Tsouvala E., Emmanouil M., RA Kremastinou J., Chatzimichael A., Mentis A.; RT "Echovirus 30 outbreak associated with a high meningitis attack rate in RT thrace, Greece"; RL Pediatr. Infect. Dis. J. 32(8):914-916(2013). XX RN [2] RP 1-352 RA Pogka V., Voulgari-Kokota A., Emmanouil M., Mantadakis E., Mentis A.F.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Influenza Reference Laboratory of Southern Greece, Hellenic RL Pasteur Institute, Vas. Sofias Ave., Athens 11521, Greece XX DR MD5; ab83aa268a9c4ebedee78983e88820ed. XX CC ##Assembly-Data-START## CC Assembly Method :: Bioedit Sequence Alignment Editor v. 7.1.7 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..352 FT /organism="Echovirus E30" FT /host="Homo sapiens" FT /strain="EVs/Alexandroupolis.GRC/01.12" FT /mol_type="genomic RNA" FT /country="Greece" FT /isolation_source="stool sample" FT /collection_date="2012" FT /note="genotype: B" FT /db_xref="taxon:41846" FT CDS <1..>352 FT /codon_start=1 FT /product="VP1" FT /note="structural protein" FT /db_xref="GOA:M4QKW5" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QKW5" FT /protein_id="AGH27735.1" FT /translation="ALTAVEMGHTSQVVPSDTMQTRHVVNYHTRSESSIENFMGRAACV FT YIAQYATEKVNDELDRYTNWEITTRQVAQLRRKLEMFTYMRFDLEITFVVTSSQRTSTT FT YASDSPPLTHQVM" XX SQ Sequence 352 BP; 111 A; 89 C; 84 G; 68 T; 0 other; gcactgacag cagtggagat ggggcacaca tcacaggtgg tgccgagtga cacaatgcaa 60 acacggcacg tggtcaacta ccacaccaga tcagaatcgt caatagagaa ttttatgggt 120 agagcagctt gtgtgtacat tgctcagtac gccacagaga aggtcaacga cgagttagac 180 aggtacacca actgggagat aacaaccagg caagtggcac aattgaggcg aaaactggaa 240 atgttcacat acatgagatt tgacctcgag atcacatttg ttgtcaccag ctcccagcgc 300 acttcaacca catatgcatc ggactcccct ccactaacac accaagtgat gt 352 // ID KC309437; SV 1; linear; genomic RNA; STD; VRL; 352 BP. XX AC KC309437; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Echovirus E30 strain EVs/Alexandroupolis.GRC/14.12 VP1 gene, partial cds. XX KW . XX OS Echovirus E30 OC Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; OC Enterovirus B. XX RN [1] RC Publication Status: Available-Online prior to print RP 1-352 RX DOI; 10.1097/INF.0b013e31828f875c. RX PUBMED; 23459085. RA Mantadakis E., Pogka V., Voulgari-Kokota A., Tsouvala E., Emmanouil M., RA Kremastinou J., Chatzimichael A., Mentis A.; RT "Echovirus 30 outbreak associated with a high meningitis attack rate in RT thrace, Greece"; RL Pediatr. Infect. Dis. J. 32(8):914-916(2013). XX RN [2] RP 1-352 RA Pogka V., Voulgari-Kokota A., Emmanouil M., Mantadakis E., Mentis A.F.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Influenza Reference Laboratory of Southern Greece, Hellenic RL Pasteur Institute, Vas. Sofias Ave., Athens 11521, Greece XX DR MD5; 1b7999b9d5e2786ae53d50549e0abcd5. XX CC ##Assembly-Data-START## CC Assembly Method :: Bioedit Sequence Alignment Editor v. 7.1.7 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..352 FT /organism="Echovirus E30" FT /host="Homo sapiens" FT /strain="EVs/Alexandroupolis.GRC/14.12" FT /mol_type="genomic RNA" FT /country="Greece" FT /isolation_source="stool sample" FT /collection_date="2012" FT /note="genotype: B" FT /db_xref="taxon:41846" FT CDS <1..>352 FT /codon_start=1 FT /product="VP1" FT /note="structural protein" FT /db_xref="GOA:M4QH57" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:M4QH57" FT /protein_id="AGH27736.1" FT /translation="ALTAVEIGHTSQVVPSDTMQTRHVVNYHTRSESSIENFMGRAACV FT YIAQYATEKVNDELDRYTNWEITTRQVAQLRRKLEMFTYMRFDLEITFVVTSSQRTSTT FT YASDSPPLTHQVM" XX SQ Sequence 352 BP; 111 A; 89 C; 83 G; 69 T; 0 other; gcactgacag cagtggagat tgggcacaca tcacaggtgg tgccgagtga cacaatgcaa 60 acacggcacg tggtcaacta ccacaccaga tcagaatcgt caatagagaa ttttatgggt 120 agagcagctt gtgtgtacat tgctcagtac gccacagaga aggtcaacga cgagttagac 180 aggtacacca actgggagat aacaaccagg caagtggcac aattgaggcg aaaactggaa 240 atgttcacat acatgagatt tgacctcgag atcacatttg ttgtcaccag ctcccagcgc 300 acttcaacca catatgcatc ggactcccct ccactaacac accaagtgat gt 352 // ID KC309438; SV 1; linear; genomic DNA; STD; VRL; 3411 BP. XX AC KC309438; XX DT 25-MAR-2013 (Rel. 116, Created) DT 05-MAY-2013 (Rel. 116, Last updated, Version 2) XX DE Gull adenovirus isolate LA010815.1 polymerase gene, complete cds. XX KW . XX OS Gull adenovirus OC Viruses; Adenoviridae; unclassified Adenoviridae. XX RN [1] RP 1-3411 RX DOI; 10.1016/j.virol.2013.02.011. RX PUBMED; 23507452. RA Bodewes R., van de Bildt M.W., Schapendonk C.M., van Leeuwen M., RA van Boheemen S., de Jong A.A., Osterhaus A.D., Smits S.L., Kuiken T.; RT "Identification and characterization of a novel adenovirus in the cloacal RT bursa of gulls"; RL Virology 440(1):84-88(2013). XX RN [2] RP 1-3411 RA Bodewes R., van de Bildt M.W.G., Schapendonk C.M.E., van Leeuwen M., RA van Boheemen S., de Jong A.A.W., Osterhaus A.D.M.E., Smits S.L., Kuiken T.; RT ; RL Submitted (06-DEC-2012) to the INSDC. RL Viroscience, Erasmus Medical Centre, Dr Molewaterplein 50, Rotterdam RL 3015GE, The Netherlands XX DR MD5; 65af60bfc26f6f423db08415cf4d6432. XX CC ##Assembly-Data-START## CC Assembly Method :: DNAstar v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..3411 FT /organism="Gull adenovirus" FT /host="Larus argentatus" FT /isolate="LA010815.1" FT /mol_type="genomic DNA" FT /country="Netherlands" FT /isolation_source="cloacal bursa" FT /collection_date="15-Aug-2001" FT /db_xref="taxon:1306817" FT CDS 1..3411 FT /codon_start=1 FT /product="polymerase" FT /db_xref="GOA:M4SSA5" FT /db_xref="InterPro:IPR004868" FT /db_xref="InterPro:IPR006172" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR014382" FT /db_xref="InterPro:IPR017964" FT /db_xref="UniProtKB/TrEMBL:M4SSA5" FT /protein_id="AGH58206.1" FT /translation="MLITRDLTDAWTWISSRAPVRKCLSCGRYWNTGHSCNELRSAFYF FT HAVQKNGRDMWQHVPFRCPAQPANLRQLFVTYDIETYTQHEARGKRMHPFLLCFALSGD FT DALASEARRLALQDETLRETGGGYYWLDFEPGAVARRFRAFRTRLQQHFARDLVRRFAE FT HNGEFCQQTMRDGGYASPYDIPYELFEQPAQALTLPEDFYVVNVVVLGHNITKFDELLL FT ATELVERRDIFPTAGKCTRTFMPRVGRLLFNDVHFAMPNPLYLRRDPTRTQRWARGIVL FT PSDARHVAVRFLVRDTLQLTSGAKLKKAAAAYALELAKGECPYEAVNDMVALGSFESDA FT DGFPVARYWESEAVRREQKALWEQNHPGTPYDLTRACLEYCMLDVKVTEELAVTLYRSY FT DTYYKRELGMRGDYNIFERPTIPSNTHAFWKQLAFTAFAEKNGAEADARGKRKGKRKDV FT PHNYVAMLYAPYRPMFTYIRQALRGGRCYPNVLGPYRERVYVFDICGMYASALTHPMPH FT GMPLDPAHSAKHVDELNAMLARERPLSYFDERVKPSILKIDAYPPPVDMLDPLPPLCSR FT RGGKLVWTNEALHEEVVTVLDVLTLHNRGWTVTVLHDEMNIVFPEWSTICADYVARNIA FT AKEKADKEKNEVMRSISKMLSNALYGAFATNMDTSRVVFEQDLTDRDRKEIYEGTQIVK FT HVTLLNDRSFSGVVVTPQSAPFSAERMRRHFDALGDHEDDEYDRSPEDTDEEEEEGEVD FT ARVADRSPDPYDVIKGERKRTETALTEDEPLPLTEVDLDLEEALRDPALLYPSDDHAHY FT ASANETRFKSMVLLDAAPEALTVLHLEKLDQLVENKRYATQIACFVLGWSRAVRCGWCE FT ILHGPDRGVHQHDRQPQSLYGDTDSMFVTESGYRRMKERGAHRIKGPQTRLIYDEKRPA FT LYWACECDVKCERCKADAYASESVFLAPKLYGLRDAVCTNPECGHVGPGKIRSKGHRQA FT ELIYDTLLKCWQRYEDQLYAGFSDLPELHTSRTIFKTTLLNKVSRYEPFTIHTEPLTRV FT LRPWRDPTLYAHGGFLYPCDTAHPNPRTAHERRRVPDIGHEDPLAPLDIDPLSFLSVEE FT CDCLLDSIGESCGEPDEAKEEPEAGSRRR" XX SQ Sequence 3411 BP; 740 A; 1106 C; 1023 G; 541 T; 1 other; atgcttatca cgcgcgatct cacagacgcc tggacctgga tatcgagtcg agcgccggta 60 cgcaagtgcc tgagctgcgg gcggtactgg aacacgggac actcctgtaa cgagctacgc 120 agcgccttct attttcacgc cgtgcagaaa aacggccgcg acatgtggca gcacgtgccg 180 ttccgctgtc ccgcgcagcc cgccaacctg cgacagcttt tcgtcaccta cgacatcgag 240 acttacacgc agcacgaggc gcgcggcaaa cgcatgcacc ctttcctgct atgcttcgcc 300 ctgagcggcg acgacgccct ggcgtccgaa gcgaggcggt tggccctgca ggacgagacg 360 ttgcgggaga cgggcggcgg ctactactgg ttggatttcg aaccgggggc ggtagcccgg 420 cgttttcgcg cctttcgcac gcggttacag caacatttcg cgcgcgacct ggtacgtcgc 480 ttcgccgagc acaacggcga attttgccaa caaacgatgc gcgacggcgg atacgcgtcg 540 ccctacgaca taccgtacga actcttcgag caacccgccc aggctcttac cctgcccgaa 600 gacttttacg tggtgaacgt ggtcgtgctc ggacacaaca tcaccaagtt cgacgagttg 660 ttgctggcca cggagctggt ggagcggcgg gacatttttc cgaccgccgg caagtgcacc 720 cgcaccttca tgccccgcgt gggacgtctg ctgtttaacg acgtgcactt cgccatgcct 780 aaccccctat acttgcgccg agaccctacg cgcacgcagc gctgggcgcg cgggatcgtg 840 ttgccgagcg acgcgcgaca cgtcgcggtg cgctttctgg tgcgcgacac gctgcagcta 900 accagcgggg ccaaactcaa aaaggcagct gccgcctacg ccctggagct ggccaaaggg 960 gaatgtcctt acgaagcggt gaacgacatg gtggccctgg gcagcttcga aagcgacgcc 1020 gacggttttc cggtggcgcg ctactgggag agcgaagccg tgcgccgcga gcaaaaggcg 1080 ctgtgggaac agaaccaccc cggaacgccg tacgatctga cccgcgcttg cctrgagtac 1140 tgcatgttgg acgtcaaggt gacggaagaa ctggcagtca ctctctaccg cagctacgac 1200 acctactaca agcgggaact gggcatgcgc ggggactaca acatcttcga gcgacccacc 1260 atccccagca acacgcacgc gttctggaag cagctggcct tcacggcttt cgcggagaaa 1320 aacggcgccg aggccgacgc ccgcggcaag cgcaaaggca aacgcaaaga cgtgccgcac 1380 aattacgtag ccatgctata cgccccctac cgaccaatgt ttacgtacat acgacaggcg 1440 ctgcgcggcg gacggtgcta tcccaacgtg ttaggaccct accgcgagcg ggtgtacgtc 1500 ttcgacatct gcggcatgta cgcctcggct ctcacgcatc ccatgcccca cggcatgccg 1560 ttggatcccg cgcacagcgc caagcacgtg gacgaactca acgccatgct ggcgcgggaa 1620 cgcccgctaa gctacttcga cgagcgcgtg aagccctcca tcttaaaaat cgacgcctac 1680 ccaccccccg tagacatgct agaccccttg ccgcctctat gctccaggcg cggcggtaag 1740 ctagtatgga ccaacgaggc gctgcacgaa gaggtcgtga cggtgctgga cgtgctgacg 1800 ctacataacc gcggatggac cgtaacagtc ctgcacgacg agatgaacat cgttttcccc 1860 gaatggagca ccatatgcgc cgactacgtg gcgcgcaaca tcgcagccaa ggaaaaagcc 1920 gacaaggaaa agaacgaagt catgcgctcc atctccaaaa tgctcagcaa cgcgctgtac 1980 ggcgccttcg ccaccaacat ggacaccagc cgggtggttt tcgagcaaga cttgaccgac 2040 agggaccgaa aggagatata cgaggggacg cagatcgtga agcacgtcac gttgctcaac 2100 gaccgctcct tcagcggcgt cgtcgtcacg ccgcaaagcg ccccgttttc ggcggaacgc 2160 atgcggcgcc acttcgacgc cctcggcgac cacgaagacg acgagtacga ccggagcccc 2220 gaagacactg acgaggaaga ggaggagggg gaagtcgacg cgcgcgtcgc agaccggagt 2280 ccggacccgt atgacgtcat aaagggagaa agaaagcgaa cggaaaccgc actcaccgaa 2340 gacgaaccac ttccgctgac ggaggtcgac ttggatctgg aagaagccct gcgagacccc 2400 gcccttttat acccgagcga tgaccacgcc cactacgcct cggccaatga aacgcgtttc 2460 aagagcatgg tgctgctgga cgcggcgccc gaggccctga cggtgctcca tctggaaaaa 2520 ctggaccagc tggtggaaaa caaacgctac gccacccaaa tcgcgtgttt cgtgctgggc 2580 tggtcgcgtg ctgttcgttg tggatggtgc gagattttgc acgggccgga ccgcggcgtt 2640 caccaacacg accgccagcc gcagagcctc tacggagaca ccgacagtat gttcgtcacc 2700 gagagcggat accggcgcat gaaagagcgc ggggcgcacc gcatcaaggg accccagacg 2760 cgtctgatct acgacgagaa acggcccgcc ctgtactggg cgtgcgagtg cgacgtgaag 2820 tgcgagcgtt gcaaggccga cgcctacgcc agcgagtccg tgtttctggc gcccaagctg 2880 tacggcttgc gagacgccgt ctgcaccaat ccggagtgcg gtcacgtggg tccggggaag 2940 atcaggtcaa aaggtcaccg acaggcggag cttatatacg acacgctcct caagtgctgg 3000 cagcggtacg aggaccagct gtacgcgggc ttttcggatc tgcccgagct gcacaccagc 3060 cgcaccatct ttaaaaccac gctgctgaac aaggtcagcc gctacgaacc cttcaccatc 3120 cacacggaac cgctgacgcg cgtcctgcgt ccctggagag atccgacgct gtacgcgcac 3180 ggaggcttcc tgtatccctg cgacacggcc catcccaacc cgcgcaccgc tcacgaacgg 3240 cgacgggtac ccgacatagg ccacgaagat ccgctggcgc cgctggatat cgacccgctg 3300 tccttcctca gcgtcgaaga gtgcgactgt ctcctcgaca gcatcggcga gagctgcggc 3360 gagcccgacg aggccaagga agaaccggag gccgggtcac ggcgccggta g 3411 // ID KC309439; SV 1; linear; genomic DNA; STD; VRL; 2818 BP. XX AC KC309439; XX DT 25-MAR-2013 (Rel. 116, Created) DT 05-MAY-2013 (Rel. 116, Last updated, Version 2) XX DE Gull adenovirus isolate LA010815.1 hexon protein gene, complete cds. XX KW . XX OS Gull adenovirus OC Viruses; Adenoviridae; unclassified Adenoviridae. XX RN [1] RP 1-2818 RX DOI; 10.1016/j.virol.2013.02.011. RX PUBMED; 23507452. RA Bodewes R., van de Bildt M.W., Schapendonk C.M., van Leeuwen M., RA van Boheemen S., de Jong A.A., Osterhaus A.D., Smits S.L., Kuiken T.; RT "Identification and characterization of a novel adenovirus in the cloacal RT bursa of gulls"; RL Virology 440(1):84-88(2013). XX RN [2] RP 1-2818 RA Bodewes R., van de Bildt M.W.G., Schapendonk C.M.E., van Leeuwen M., RA van Boheemen S., de Jong A.A.W., Osterhaus A.D.M.E., Smits S.L., Kuiken T.; RT ; RL Submitted (06-DEC-2012) to the INSDC. RL Viroscience, Erasmus Medical Centre, Dr Molewaterplein 50, Rotterdam RL 3015GE, The Netherlands XX DR MD5; 240221f2524f2aa6460d24111753fd7a. XX CC ##Assembly-Data-START## CC Assembly Method :: DNAstar v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..2818 FT /organism="Gull adenovirus" FT /host="Larus argentatus" FT /isolate="LA010815.1" FT /mol_type="genomic DNA" FT /country="Netherlands" FT /isolation_source="cloacal bursa" FT /collection_date="15-Aug-2001" FT /db_xref="taxon:1306817" FT CDS 1..2817 FT /codon_start=1 FT /product="hexon protein" FT /db_xref="GOA:M4STS6" FT /db_xref="InterPro:IPR016107" FT /db_xref="InterPro:IPR016108" FT /db_xref="InterPro:IPR016111" FT /db_xref="InterPro:IPR016112" FT /db_xref="UniProtKB/TrEMBL:M4STS6" FT /protein_id="AGH58207.1" FT /translation="MAAFTPDLTTATPRLQYFHIAGAGYTRIPERGPATVYLLHVVLFR FT LKNKFRQTVVAPTRHVTTEKSQRLQIRYYPTQTDDTTTSYRARYSVSVGDGWVLDMGST FT YFDIKGVLDRGPSFKPYGGTAYNSLAPRESTFNFWQERDQTTKLVNAQLSNIYRNDTPT FT DVKATHDYVTLFSNVVPDPNVGPYVFDTNFIDVTRAGAAGKVAKVPAGRSPLNYGAYVR FT PVDQRGGQTVTQTPYYITDQTGAAYRGALSLEDVSATVTYPDTLYIPPAAQQSVDYGVT FT RGLRPNYIGFRDNFINLLYHDTGVCAGTLSSERSGMNVVVELQDRNTELSYQMMLADTM FT SRHHFYALWNQAVDQYDPDVRVFNNDGYEEGVPAYAFRPSGAGVDQPSAPLAASRVYTV FT GQNGQLGDDQGVTLNAVLGAVAAHEINLGATMKRNFIVTNIADYLPDKYKYFIPGFDPQ FT TETVDPRTYRYLNRRVPLVNVVDLFTNLGARWSVDQMDNVNPFNHHRNWGLKYRSQLLG FT NSRFCSFHIQVPQKFFALRSLLLLPGTYTYEWVFRKDPNMVLQSSLGNDLRADGARIVY FT QEINLMASFMPLDHNTSNQLELMMRNAVNDQTFADYLGAKSSLYQVPAGSTALTINVPA FT RTWEGLRGWSFTRLKAEETPQQGAQYDTNFRYSGTIPYSDGTFYLNHTFRSMSILFDTS FT INWPGNDRLLTPNMFEIKRQIATDNEGFTTSQCDITKDWYLVQMATNYNYAFNGYRFWP FT ERHYYHYDFLRNFDPMSMQAPNFGRANVFNLVSTTPPAVEESTADDQNRVRNNSGFVAD FT RSVAVFNKRQGQPWPSNWPYPLIGEHSLSADDILNYRKFLCDNYLWTIPFSSDFMYMGE FT LTDLGQNPMYTNNSHSMVINFEVDAMDEDTFVYMLYGVFDTVRVNQPERNVLAMAYFRT FT PFATGNAV" XX SQ Sequence 2818 BP; 615 A; 916 C; 743 G; 544 T; 0 other; atggccgctt tcacgccgga tctgaccacg gccactcccc ggctgcaata ctttcacatc 60 gccggggccg ggtacacgcg aatacctgag cgaggacctg caacagttta tctcctccac 120 gtcgtcctat ttcgattgaa aaataagttc aggcagacgg tggtggcccc cacgcgacac 180 gtcacgaccg aaaagtcgca gcgtctgcag atccgctact accctacgca gacggatgac 240 accaccacgt cgtaccgcgc gcggtacagc gtgagcgtgg gtgacggatg ggtgctggac 300 atggggtcca catacttcga catcaaagga gtgctggacc gaggaccctc gttcaagccg 360 tacggcggta ccgcatacaa cagcctggcg ccccgcgaat ccacgtttaa cttctggcaa 420 gaacgcgacc agaccacgaa gctggtcaac gcgcagctca gcaacattta tcgcaacgat 480 acgcccaccg acgtcaaagc cactcacgac tacgttacgc tgttttctaa cgtggtgccg 540 gatcctaacg tcggtcctta cgtgttcgat accaacttca tcgacgtgac gcgcgccggc 600 gccgccggca aggtggctaa ggtaccggcc ggacggtcgc cgctcaacta cggggcttac 660 gtgcgacccg tggaccagcg cggcggtcag acggtcacgc agacgcccta ctacatcacg 720 gaccagacgg gcgccgcgta tcggggcgct ctgtccctgg aagacgtgtc cgctaccgtg 780 acctacccgg acacgctcta catccctccc gccgcgcagc agtccgtgga ttacggggtt 840 acccgcggct tgcgtcccaa ctacatcggt tttcgggaca atttcatcaa cctgctctac 900 cacgacacgg gcgtctgcgc cggcaccctc agctcggagc gttcgggcat gaacgtcgtg 960 gtggagctgc aggaccgtaa cacggaactc agctatcaga tgatgctggc ggacaccatg 1020 tcgcgacacc atttttacgc cctctggaac caggcggtcg atcagtacga ccccgacgtg 1080 cgggtgttca acaacgacgg gtacgaggag ggagtccccg catatgcctt tcgtcccagc 1140 ggtgccggcg tcgaccaacc ctccgcgccg ctagcggctt cccgcgtcta caccgtcgga 1200 caaaacggtc agctcggcga tgaccaggga gtcaccctta acgccgtgtt gggagccgtg 1260 gccgctcacg aaatcaactt gggcgctacc atgaagcgaa atttcatcgt caccaacatc 1320 gctgattacc taccggacaa gtacaagtac ttcattcccg gtttcgaccc gcaaaccgaa 1380 actgtggatc ctcgcactta ccgctacctc aaccgacgag tgccgctggt gaatgtggta 1440 gacctcttta cgaacctggg agcccgctgg tccgttgacc agatggacaa cgtgaacccg 1500 ttcaatcacc accgcaactg gggcctcaag taccgctctc agctgctggg caacagccgc 1560 ttctgcagct tccatattca ggtgccccaa aagttcttcg ccttgcgcag cctcttgctc 1620 ctaccgggca cctatactta cgaatgggtg ttccgcaagg accccaacat ggtcctccag 1680 tccagtttag gcaacgatct gcgcgccgac ggcgcccgta tcgtttacca agagatcaac 1740 ctcatggcgt ctttcatgcc gctggaccac aacaccagca accagttgga gctgatgatg 1800 cgcaacgctg tcaacgatca gacattcgcg gactacctgg gcgccaagag ttctctctac 1860 caagtgccag ccggctcaac ggcgctgacc atcaacgtgc cggcgcgcac ttgggaagga 1920 ctgcgggggt ggtcctttac gcgactgaag gccgaagaga ccccgcaaca gggagctcaa 1980 tacgacacta acttccgcta ttcgggcacc atcccgtact ccgacggtac gttctacctg 2040 aaccacacgt tccgcagcat gagcatccta ttcgatacgt ccatcaactg gccgggcaac 2100 gatcggctgt tgacgccgaa catgttcgag atcaagcgcc aaatcgcgac ggacaacgaa 2160 ggcttcacca ccagccagtg cgacatcacc aaggactggt acttggtgca gatggctacc 2220 aattacaact acgccttcaa cgggtaccgt ttctggcccg agcgacacta ctaccactac 2280 gactttttac gcaatttcga ccccatgtcc atgcaagcgc ccaactttgg ccgcgcaaac 2340 gtgttcaacc tggtgtcgac gacgccgccg gccgttgaag agagcacggc cgacgaccag 2400 aatcgcgtgc ggaacaattc cggcttcgtc gccgatcggt cggtggcggt cttcaataag 2460 cggcaggggc agccgtggcc gtcaaactgg ccctatccgc tgatcggcga gcatagcctg 2520 tcggccgacg acatacttaa ctaccgcaaa tttctgtgcg ataactacct gtggactatc 2580 cccttcagct ccgactttat gtacatgggc gagctgaccg atttgggtca gaaccccatg 2640 tacactaaca actcgcacag catggtcatc aactttgagg tcgatgccat ggacgaagac 2700 acttttgtct acatgctcta cggcgtcttc gataccgtgc gcgtcaatca gccggagcgt 2760 aacgtgctgg ccatggctta cttccgcaca cctttcgcta caggcaacgc cgtgtaga 2818 // ID KC309440; SV 1; linear; genomic DNA; STD; VRL; 1578 BP. XX AC KC309440; XX DT 25-MAR-2013 (Rel. 116, Created) DT 05-MAY-2013 (Rel. 116, Last updated, Version 2) XX DE Gull adenovirus isolate LA010815.1 penton base protein gene, complete cds. XX KW . XX OS Gull adenovirus OC Viruses; Adenoviridae; unclassified Adenoviridae. XX RN [1] RP 1-1578 RX DOI; 10.1016/j.virol.2013.02.011. RX PUBMED; 23507452. RA Bodewes R., van de Bildt M.W., Schapendonk C.M., van Leeuwen M., RA van Boheemen S., de Jong A.A., Osterhaus A.D., Smits S.L., Kuiken T.; RT "Identification and characterization of a novel adenovirus in the cloacal RT bursa of gulls"; RL Virology 440(1):84-88(2013). XX RN [2] RP 1-1578 RA Bodewes R., van de Bildt M.W.G., Schapendonk C.M.E., van Leeuwen M., RA van Boheemen S., de Jong A.A.W., Osterhaus A.D.M.E., Smits S.L., Kuiken T.; RT ; RL Submitted (06-DEC-2012) to the INSDC. RL Viroscience, Erasmus Medical Centre, Dr Molewaterplein 50, Rotterdam RL 3015GE, The Netherlands XX DR MD5; 1911c8be27e26698a0fd400f654d939b. XX CC ##Assembly-Data-START## CC Assembly Method :: DNAstar v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1578 FT /organism="Gull adenovirus" FT /host="Larus argentatus" FT /isolate="LA010815.1" FT /mol_type="genomic DNA" FT /country="Netherlands" FT /isolation_source="cloacal bursa" FT /collection_date="15-Aug-2001" FT /db_xref="taxon:1306817" FT CDS 1..1578 FT /codon_start=1 FT /product="penton base protein" FT /db_xref="GOA:M4SPE1" FT /db_xref="InterPro:IPR002605" FT /db_xref="UniProtKB/TrEMBL:M4SPE1" FT /protein_id="AGH58208.1" FT /translation="MYFHAGIGQPPSVPPSAPPPPSTRTPSGYPTMVNGYPAYVPTAAA FT SDDGDGSELYLPPQRVLAPTGGRNSIKYRDILPNQNTTKIFYVDNKLGDIDTYNQEASH FT SNFSTSVIHNQDLDPNTAATESILLDARSHWGGDLNTAVKTNCPNVTGFFQSNVLRVRL FT MSYRDPVSGDQSESGQYQPAGARYKWYDLKIPEGNYALSELIDLLNEGIVQLYLSEGRQ FT NNVLKSDIGVKFDTRYLDLLRDPVTGLVTPGAYVNKGYHPDVILLPGCAVDFTYTRLSL FT LLGISKRQPYAKGFVLTYEDLEGGDVPALLDRSTSHVDDWDGQIADPANAKPLFRDPHG FT VSYNVITDPADKPRLAYRSWLLAYNRDGSRAQAETLLTVPDLSGGLGAMYYSLPDTFVA FT PTGFKDDNRTNNAPVVGMTLFPTSGKVTYMGAANYVQVLENSCLTASSAFNRFPDNEIL FT KQAPPLNVAAVCDNQPAVSRQGTLPIKNSLPGLQRVLITDDRRRPIPYVHKTLATVQPR FT VLSSATLQ" XX SQ Sequence 1578 BP; 326 A; 524 C; 433 G; 295 T; 0 other; atgtatttcc acgccgggat cggacagcct ccgtcggtgc ctccgtcggc tcctccgccg 60 ccttctacgc gaacaccttc cggctatccg acgatggtga acggatatcc ggcctacgtg 120 ccgacggcgg cggcttccga cgatggggac gggtcggaac tgtatctgcc tccgcagcga 180 gtgcttgcgc ctactggagg gcgaaacagc attaagtatc gcgacatttt gcccaaccaa 240 aacaccacaa aaatctttta cgtggacaac aagttaggcg atatagacac gtacaatcag 300 gaggcgagcc acagcaactt tagcaccagc gtcatccaca accaagatct ggatccgaac 360 acggccgcca cggaatccat tctgttggat gcgcgctccc actggggcgg cgacctcaac 420 acggcggtca agaccaactg tcccaacgtg acgggcttct ttcagagcaa cgtgttgcgc 480 gtgcgtctga tgagttaccg tgaccccgtg tccggcgacc agtcggagag cggtcaatat 540 caacccgccg gggcgcgcta taagtggtac gacctgaaaa tcccggaggg taactacgcc 600 ctttccgagc tcatcgacct cttgaacgaa ggcatcgtgc agctctacct gagcgagggc 660 cgacagaaca acgtcctcaa gtcggacatc ggagtcaaat tcgacacccg ttatctggac 720 ctactgcggg acccggtcac gggtctggtc actccgggcg cgtacgtcaa taagggctac 780 cacccggacg tcatactgct gcccggctgc gcggttgact ttacgtacac ccgcctgagc 840 ctcctgctgg gcatcagcaa gcggcagcct tatgccaaag gttttgtgct cacctacgag 900 gatctggaag ggggcgacgt ccccgcgctg ctcgatcgct ccacgtcgca cgtggacgac 960 tgggacggac agatagcgga tccggcgaac gctaagccgc tcttccgcga tccgcacggc 1020 gtttcctaca acgtcatcac tgatcccgcg gacaagcccc gcctggccta ccgttcctgg 1080 ttgttggcgt acaaccgaga tggtagccgc gctcaagccg aaacgctgct gacggtgccc 1140 gacctgtcgg gcggtctggg cgccatgtac tattccctgc ccgatacctt tgtcgcgccc 1200 acggggttca aagacgacaa tcgtaccaac aacgcgccgg tggtgggcat gaccctcttt 1260 cccaccagcg gcaaggtgac ctacatggga gccgccaatt acgtgcaggt cttggaaaac 1320 tcttgcttga ccgcctcttc cgcttttaac cgctttcccg acaacgagat actcaagcag 1380 gcgccgcctc tcaacgtggc cgccgtctgc gacaaccaac cggccgtttc ccgccagggc 1440 acgctgccca taaagaattc gctgccgggc ctgcaacgcg tgctgatcac ggacgaccgc 1500 cgccgaccca ttccctacgt gcacaagaca ctggctaccg ttcaacctcg ggtgctcagc 1560 agcgccaccc tgcagtga 1578 // ID KC310688; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC310688; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260048/2012(H3N2)) segment 4 DE hemagglutinin (HA) gene, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260048/2012(H3N2)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; dc9ee950423f4f46f83ee37ccf3f8e8c. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260048/2012(H3N2))" FT /segment="4" FT /host="swine" FT /strain="A/swine/Indiana/A01260048/2012" FT /serotype="H3N2" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="lung" FT /collection_date="17-Oct-2012" FT /db_xref="taxon:1268642" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /function="receptor binding and fusion protein" FT /db_xref="GOA:L0HPF2" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HPF2" FT /protein_id="AGB08384.1" FT /translation="MKTMIAFSCILCLIFAQKLPGGDNSMATLCLGHHAVPNGTLVKTI FT TDDQIEVTNATELVQSSSKGRICNSPHQILDGKNCTLIDALLGDPHCDDFQNKEWDLFV FT ERSTAYSNCYPYYVPDYATLRSLVASSGNLEFTQESFNWTGVAQDGSSYACRRGSVNSF FT FSRLNWLYNLNYKYPEQNVTMPNNDKFDKLYIWGVHHPGTDKDQTNLYVQASGRVIVST FT KRSQQTVIPNIGSRPWVRGVSSIISIYWTIVKPGDILLINSTGNLIAPRGYFKIQSGKS FT SIMRSDAHIDECNSECITPNGSIPNDKPFQNVNKITYGACPRYVKQNTLKLATGMRNVP FT EKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAINQITGKLNR FT VIKKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDS FT EMSKLFERTRRQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDIYRNEALNNRFQ FT IKGVQLKSGYKDWILWISFAISCFLLCVVLLGFIMWACQKGNIRCNICI" FT sig_peptide 1..48 FT /gene="HA" FT mat_peptide 49..1035 FT /gene="HA" FT /product="HA1" FT mat_peptide 1036..1698 FT /gene="HA" FT /product="HA2" XX SQ Sequence 1701 BP; 576 A; 341 C; 376 G; 408 T; 0 other; atgaagacta tgattgcttt tagctgcatt ctatgtctga ttttcgctca aaaacttccc 60 ggaggtgaca acagcatggc aacgctgtgc ctgggacacc atgcagtgcc aaacggaaca 120 ttagtgaaaa caatcacgga tgaccaaatt gaagtgacta atgctactga gctggtccag 180 agttcctcaa aaggtagaat atgcaacagt cctcaccaaa tccttgatgg gaaaaattgc 240 acactgatag atgctctatt gggggaccct cattgtgatg acttccaaaa caaggaatgg 300 gacctttttg ttgaacgaag cacagcctac agcaactgtt acccttatta tgtgccagat 360 tatgccaccc ttaggtcact agttgcctca tctggcaacc tggaatttac ccaagaaagc 420 ttcaattgga ctggagttgc tcaagacgga tcaagctatg cctgcagaag gggatctgtt 480 aacagtttct ttagtagatt gaattggttg tataacttga attacaagta tccagagcag 540 aacgtaacta tgccaaacaa tgacaaattt gacaaattgt acatttgggg ggttcaccac 600 ccgggtacgg acaaggacca aaccaaccta tatgtccaag catcagggag agttatagtc 660 tctaccaaaa gaagccaaca aactgtaatc ccaaatatcg ggtctagacc ctgggtaagg 720 ggtgtctcca gcataataag catctattgg acgatagtaa aaccgggaga catacttttg 780 attaacagca cagggaatct aattgcccct cggggttact tcaaaataca aagtgggaaa 840 agctcaataa tgagatcaga tgcacacatt gatgaatgca attctgaatg cattactcca 900 aatggaagca ttcccaatga caaacctttt caaaatgtaa acaagatcac atatggagcc 960 tgtcccagat atgttaagca aaacacccta aaattggcaa caggaatgcg gaatgtacca 1020 gagaaacaaa ctagaggcat attcggcgca attgcaggtt tcatagaaaa tggttgggag 1080 ggaatggtag acggttggta cggtttcagg catcagaatt ctgaaggcac aggacaagca 1140 gcagatctta aaagcactca agcagcaatc aaccaaatca ccgggaaact gaatagggta 1200 atcaagaaaa cgaacgagaa attccatcaa atcgaaaaag aattctcaga agtagaaggg 1260 agaattcagg acctagagaa atacgttgaa gacactaaaa tagatctctg gtcttacaac 1320 gctgagcttc ttgttgccct ggagaaccaa catacaattg atttaaccga ctcagagatg 1380 agcaaactgt tcgaaagaac aagaaggcaa ctgagggaaa atgctgagga catgggcaat 1440 ggttgcttca aaatatacca caaatgtgac aatgcctgca taggatcaat cagaaatgga 1500 acttatgacc atgatatata cagaaacgag gcattaaaca atcggttcca gatcaaaggt 1560 gttcagctaa agtcaggata caaagattgg atcctatgga tttcctttgc catatcatgc 1620 tttttgcttt gtgttgttct gctggggttc attatgtggg cctgccaaaa aggcaacatt 1680 aggtgcaaca tttgcatttg a 1701 // ID KC310689; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC KC310689; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260048/2012(H3N2)) segment 6 DE neuraminidase (NA) gene, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260048/2012(H3N2)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; c78cf7f6b15728bd7dc9ceba1733f203. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260048/2012(H3N2))" FT /segment="6" FT /host="swine" FT /strain="A/swine/Indiana/A01260048/2012" FT /serotype="H3N2" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="lung" FT /collection_date="17-Oct-2012" FT /db_xref="taxon:1268642" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:L0HT97" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:L0HT97" FT /protein_id="AGB08385.1" FT /translation="MNPNQKIITIGSVSLIIATICFLMQIAILVTTVTLHFKQHDYNSP FT PNNQVMLCEPTIIERNTTEIVYLTNTTIEKEICPKLAEYRNWSKPQCDITGFAPFSKDN FT SIRLSAGGDIWVTREPYVSCDPDKCYQFALGQGTTLNNGHSNNTVHDRTPYRTLLMNEL FT GVPFHLGTRQVCMAWSSSSCHDGKAWLHVCITGNDNNATASFIYNGRLVDSIGSWSKNI FT LRTQESECVCINGTCTVVMTDGSASGKADTKILFVEEGKIVHISTLSGSAQHVEECSCY FT PRFPGVRCVCRDNWKGSNRPIVDINVKNYSIVSSYVCSGLVGDTPRESDSVSSSYCLDP FT NNEKGGHGVKGWAFDDGNDVWMGRTINETLRLGYETFKVIEGWSKANSKLQTNRQVIVE FT KGDRSGYSGIFSVEGKSCINRCFYVELIRGRKEETKVWWTSNSIVVFCGTSGTYGTGSW FT PDGADINLMPI" XX SQ Sequence 1410 BP; 444 A; 264 C; 330 G; 372 T; 0 other; atgaatccaa atcaaaagat aataacaatt ggctctgttt ctctcatcat tgccacaata 60 tgtttcctta tgcaaattgc tatcctagta actactgtaa cattacattt caagcagcat 120 gactacaatt cccccccaaa caaccaagta atgctgtgtg aaccaacaat aatagaaaga 180 aacacaacag agattgtgta tttgaccaac accaccatag agaaagaaat atgccccaaa 240 ctagcagaat atagaaactg gtcaaagccg caatgtgaca ttacaggatt tgcacctttt 300 tctaaggaca attcaattcg gctttctgct ggtggggaca tctgggtgac aagagaacct 360 tatgtgtcat gcgatcctga caagtgttat caatttgccc ttgggcaggg aacaacatta 420 aacaacggac attcaaataa cactgtacat gataggaccc cttatcgaac cctattgatg 480 aatgaattgg gtgttccatt tcatttggga accaggcaag tgtgcatggc atggtccagc 540 tcaagttgtc acgatggaaa agcatggctg catgtttgta taactgggaa tgataacaat 600 gcaacagcta gcttcattta caatgggagg cttgtagata gtattggttc atggtccaaa 660 aatatactca gaacccagga gtcagaatgc gtctgtatca atggaacctg tacagtagta 720 atgactgatg ggagcgcttc aggaaaagct gatactaaaa tactattcgt tgaggagggg 780 aagatcgttc atattagcac attgtcagga agtgctcagc atgttgagga gtgctcctgt 840 tatcctcgat ttcctggtgt cagatgtgtc tgcagagaca actggaaagg ctccaatagg 900 cccatcgtag atataaatgt aaagaattat agcattgttt ccagttatgt atgctcagga 960 cttgttggag acacacccag agaaagtgac agcgtcagca gtagttattg cctagatcct 1020 aacaatgaga aaggtggtca tggggtgaaa ggctgggcct ttgatgatgg aaatgacgtg 1080 tggatgggaa ggacaatcaa cgagacgtta cgcttaggtt atgaaacctt caaagtcatt 1140 gaaggctggt ccaaagctaa ctccaaatta cagacaaata ggcaagtcat agttgaaaag 1200 ggtgacaggt ccggttattc tggtattttc tccgttgaag gtaaaagctg catcaatcgg 1260 tgcttttatg tggagttgat aaggggaagg aaagaggaaa ctaaagtctg gtggacctca 1320 aacagtattg ttgtgttttg tggcacctca ggtacatatg gaacaggctc atggcctgat 1380 ggagcggata tcaatctcat gcctatataa 1410 // ID KC310690; SV 1; linear; viral cRNA; STD; VRL; 982 BP. XX AC KC310690; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260048/2012(H3N2)) segment 7 matrix DE protein 2 (M2) and matrix protein 1 (M1) genes, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260048/2012(H3N2)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-982 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; 7582f020dbff03dae40c917251be98f1. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..982 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260048/2012(H3N2))" FT /segment="7" FT /host="swine" FT /strain="A/swine/Indiana/A01260048/2012" FT /serotype="H3N2" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="lung" FT /collection_date="17-Oct-2012" FT /db_xref="taxon:1268642" FT gene 1..982 FT /gene="M2" FT CDS join(1..26,715..982) FT /codon_start=1 FT /gene="M2" FT /product="matrix protein 2" FT /db_xref="GOA:L0HRC4" FT /db_xref="InterPro:IPR002089" FT /db_xref="UniProtKB/TrEMBL:L0HRC4" FT /protein_id="AGB08387.1" FT /translation="MSLLTEVETPTRSEWECRCSDSSDPLAIAANIIGILHLILWITDR FT LFFKCIYRRFKYGLKRGPSTEGVPESMREEYQQEQQSAVDVDDGHFVNIELE" FT gene 1..759 FT /gene="M1" FT CDS 1..759 FT /codon_start=1 FT /gene="M1" FT /product="matrix protein 1" FT /db_xref="GOA:L0HU17" FT /db_xref="InterPro:IPR001561" FT /db_xref="InterPro:IPR013188" FT /db_xref="InterPro:IPR015423" FT /db_xref="InterPro:IPR015799" FT /db_xref="InterPro:IPR036039" FT /db_xref="InterPro:IPR037533" FT /db_xref="UniProtKB/TrEMBL:L0HU17" FT /protein_id="AGB08386.1" FT /translation="MSLLTEVETYVLSIIPSGPLKAEIAQRLESVFAGKNTDLEALMEW FT LKTRPILSPLTKGILGFVFTLTVPSERGLQRRRFVQNALNGNGDPNNMDRAVKLYKKLK FT REITFHGAKEVSLSYSTGALASCMGLIYNRMGTVTTEAAFGLVCATCEQIADSQHRSHR FT QMATTTNPLIRHENRMVLASTTAKAMEQMAGSSEQAAEAMEVANQTRQMVHAMRTIGTH FT PSSSTGLKDDLLENLQAYQKRMGVQMQRFK" XX SQ Sequence 982 BP; 285 A; 208 C; 255 G; 234 T; 0 other; atgagtcttc taaccgaggt cgaaacgtac gttctttcta tcataccgtc aggccccctc 60 aaagccgaga tcgcgcagag actggaaagt gtctttgcag gaaagaacac agatcttgag 120 gctctcatgg aatggctaaa gacaagacca atcttgtcac ctctgactaa gggaatttta 180 ggatttgtgt tcacgctcac cgtgcccagt gagcgaggac tgcagcgtag acgctttgtc 240 caaaatgccc tgaatgggaa tggggaccca aacaacatgg atagagcagt taaactatac 300 aagaagctca aaagagaaat aacgttccat ggggccaagg aggtgtcact aagctattca 360 actggtgcac ttgccagttg catgggcctc atatacaaca ggatgggaac agtgaccaca 420 gaagctgctt ttggtctagt gtgtgccact tgtgaacaga ttgctgattc acagcatcgg 480 tctcacagac agatggctac taccaccaat ccactaatca ggcatgagaa cagaatggtg 540 ctggctagca ctacggcaaa ggctatggaa cagatggctg gatcgagtga acaggcagcg 600 gaggccatgg aggttgctaa tcagactagg cagatggtac atgcaatgag aactattggg 660 actcatccta gctccagtac tggtctgaaa gatgaccttc ttgaaaattt gcaggcctac 720 cagaagcgaa tgggagtgca gatgcagcga ttcaagtgat cctctcgcca ttgcagcaaa 780 tatcattggg atcttgcacc tgatattgtg gattactgat cgtctttttt tcaaatgtat 840 ttatcgtcgc tttaaatacg gtttgaaaag agggccttct acggaaggag tgcctgagtc 900 catgagggaa gaatatcaac aggaacagca gagtgctgtg gatgttgacg atggtcattt 960 tgtcaacata gagctagagt aa 982 // ID KC310691; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC310691; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260093/2012(H3N2)) segment 4 DE hemagglutinin (HA) gene, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260093/2012(H3N2)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; cd3d635f5e4706d26b36e8b730b0d136. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260093/2012(H3N2))" FT /segment="4" FT /host="swine" FT /strain="A/swine/Indiana/A01260093/2012" FT /serotype="H3N2" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="lung" FT /collection_date="30-Oct-2012" FT /db_xref="taxon:1268643" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /function="receptor binding and fusion protein" FT /db_xref="GOA:L0HS03" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HS03" FT /protein_id="AGB08388.1" FT /translation="MKTIIAFSCILCLIFAQRLPGSDNSMATLCLGHHAVPNGTLVKTI FT TDDQIEVTNATELVQSSSTGRICNSPHQILDGKNCTLIDALLGDPHCDDFQNKEWDLFV FT ERSTAYSNCYPYYVPDYATLRSLVASSGNLEFTQESFNWTGVAQDGSSYACRRGSVKSF FT FSRLNWLYNLNYKYPEQNVTMPNNDKFDKLYIWGVHHPGTDKDQTNLYVQASGRVIVST FT KRSQQTVIPNIGSRPWVRGVSSIISIYWTIVKPGDILLINSTGNLIAPRGYFKIQSGKS FT SIMRSDAHIDECNSECITPNGSISNDKPFQNVNKITYGACPRYVKQNTLKLATGMRNVP FT EKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAINQITGKLNR FT VIKKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDS FT EMSKLFERTRRQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDIYRNEALNNRFQ FT IKGVQLKSGYKDWILWISFAISCFLLCVVLLGFIMWACQKGNIRCNICI" FT sig_peptide 1..48 FT /gene="HA" FT mat_peptide 49..1035 FT /gene="HA" FT /product="HA1" FT mat_peptide 1036..1698 FT /gene="HA" FT /product="HA2" XX SQ Sequence 1701 BP; 574 A; 343 C; 374 G; 410 T; 0 other; atgaagacta tcattgcttt tagctgcatt ctatgtctga ttttcgctca aagacttccc 60 ggaagtgaca acagcatggc aacgctgtgc ctgggacacc atgcagtgcc aaacggaaca 120 ttagtgaaaa caatcacgga tgaccaaatt gaagtgacta atgctactga gctggtccag 180 agttcctcaa caggtagaat atgcaacagt cctcaccaaa tccttgatgg gaaaaattgc 240 acactgatag atgctctatt gggggaccct cattgtgatg acttccaaaa caaggaatgg 300 gacctttttg ttgaacgaag cacagcctac agcaactgtt acccttatta tgtgccagat 360 tatgccaccc ttaggtcact agttgcctca tctggcaacc tggaatttac ccaagaaagc 420 ttcaattgga ctggagttgc tcaagacgga tcaagctatg cctgcagaag gggatctgtt 480 aaaagtttct ttagtagatt gaattggttg tataacttga attacaagta tccagaacag 540 aacgtaacta tgccaaacaa tgacaaattt gacaaattgt acatttgggg ggttcaccac 600 ccgggtacgg acaaggacca aaccaaccta tatgtccaag catcagggag agttatagtc 660 tctaccaaaa gaagccaaca aactgtaatc ccgaatatcg ggtctagacc ctgggtaagg 720 ggtgtctcca gcataataag catctattgg acgatagtaa aaccgggaga catacttttg 780 attaacagca cagggaatct aattgcccct cggggttact tcaaaataca aagtgggaaa 840 agctcaataa tgagatcaga tgcacacatt gatgaatgca attctgaatg cattactcca 900 aatggaagca tttccaatga caaacctttt caaaatgtaa acaagatcac atatggagcc 960 tgtcccagat atgttaagca aaacaccctg aaattggcta caggaatgcg gaatgtacca 1020 gagaaacaaa ctagaggcat attcggcgca attgcaggtt tcatagaaaa tggttgggag 1080 ggaatggtag acggttggta cggtttcagg catcagaatt ctgaaggcac aggacaagca 1140 gcagatctta aaagcactca agccgcaatc aaccaaatca ccgggaaact aaatagagta 1200 atcaagaaaa cgaacgagaa attccatcaa atcgaaaaag aattctcaga agtagaaggg 1260 agaattcagg acctagagaa atacgttgaa gacactaaaa tagatctctg gtcttacaac 1320 gctgagcttc ttgttgccct ggagaaccaa catacaattg atttaaccga ctcagagatg 1380 agcaaactgt tcgaaagaac aagaaggcaa ctgcgggaaa atgctgagga catgggcaat 1440 ggttgcttca aaatatacca caaatgtgac aatgcctgca taggatcaat cagaaatgga 1500 acttatgacc atgatatata cagaaacgag gcattaaaca atcggttcca gatcaaaggt 1560 gttcagctaa agtcaggata caaagattgg atcctatgga tttcctttgc catatcatgc 1620 tttttgcttt gtgttgttct gctggggttc attatgtggg cctgccaaaa aggcaacatt 1680 aggtgcaaca tttgcatttg a 1701 // ID KC310692; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC KC310692; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260093/2012(H3N2)) segment 6 DE neuraminidase (NA) gene, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260093/2012(H3N2)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; 546520832818797170a5aa2880aa49aa. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260093/2012(H3N2))" FT /segment="6" FT /host="swine" FT /strain="A/swine/Indiana/A01260093/2012" FT /serotype="H3N2" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="lung" FT /collection_date="30-Oct-2012" FT /db_xref="taxon:1268643" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:L0HPG0" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:L0HPG0" FT /protein_id="AGB08389.1" FT /translation="MNPNQKIITIGSVSLIIATICFLMQIAILVTTVTLHFKQHDYNSP FT PNNQAMLCEPTIIERNTTEIVYLTNTTIEKEICPKLAEYRNWSKPQCNITGFAPFSKDN FT SIRLSAGGDIWVTREPYVSCDPDKCYQFALGQGTTLNNGHSNNTVHDRTPYRTLLMNEL FT GVPFHLGTRQVCMAWSSSSCHDGKAWLHVCITGNDNNATASFIYNGRLVDSIGSWSKNI FT LRTQESECVCINGTCTVVMTDGSASGKADTKILFVEEGKIAHISTLSGSAQHVEECSCY FT PRFPGVRCVCRDNWKGSNRPIVDINVKNYSIVSSYVCSGLVGDTPRKSDSVSSSYCLDP FT NNEKGGHGVKGWAFDDGNDVWMGRTINETLRLGYETFKVIEGWSKANSKLQTNRQVIVE FT KGDRSGYSGIFSVEGKSCINRCFYVELIRGRKEETRVWWTSNSIVVFCGTSGTYGTGSW FT PDGADINLMPI" XX SQ Sequence 1410 BP; 447 A; 265 C; 327 G; 371 T; 0 other; atgaatccaa atcaaaagat aataacaatt ggctctgttt ctctcatcat tgccacaata 60 tgtttcctta tgcaaattgc tatcctagta actactgtaa cattacattt caagcaacat 120 gactacaact cccccccaaa caaccaagca atgctgtgtg aaccaacaat aatagaaaga 180 aacacaacag agattgtgta tttgaccaac accaccatag agaaagaaat atgccccaaa 240 ctagcagaat atagaaattg gtcaaagccg caatgtaaca ttacaggatt tgcacctttt 300 tctaaggaca attcaattcg gctttctgct ggtggggaca tctgggtgac aagagaacct 360 tatgtgtcat gcgatcctga caagtgttat caatttgccc ttgggcaggg aacaacatta 420 aacaacggac attcaaataa cactgtacat gataggaccc cttatcgaac cctattaatg 480 aatgaattgg gtgttccatt tcatttagga accaggcaag tgtgcatggc atggtccagc 540 tcaagttgtc acgatggaaa agcatggctg catgtttgta taactgggaa tgataacaat 600 gcaacagcta gcttcattta caatgggagg cttgtagata gtattggttc atggtccaaa 660 aatatactca gaacccagga gtcggaatgc gtctgtatca atggaacctg tacagtagta 720 atgactgatg ggagcgcttc agggaaagct gatactaaaa tactattcgt tgaggagggg 780 aagatcgctc atattagcac attgtcagga agtgctcagc atgttgagga gtgttcctgt 840 tatcctcgat ttcctggtgt cagatgtgtc tgcagagaca actggaaagg ctccaatagg 900 cccatcgtag atataaatgt aaagaattat agcattgttt ccagttatgt atgctcagga 960 cttgttggag acacacccag aaaaagtgac agcgtcagca gtagttattg cctagatcct 1020 aacaatgaga aaggtggtca tggggtgaaa ggctgggcct ttgatgatgg aaatgacgtg 1080 tggatgggaa ggacaatcaa cgagacgtta cgcttaggtt atgaaacctt caaagtcatt 1140 gaaggctggt ccaaagctaa ctccaaatta cagacaaata ggcaagtcat agttgaaaaa 1200 ggtgacaggt ccggttattc tggtattttc tccgttgaag gtaaaagctg catcaatcgg 1260 tgcttttatg tggagttgat aaggggaagg aaagaggaaa ctagagtctg gtggacctca 1320 aacagtattg ttgtgttttg tggcacctca ggtacatatg gaacaggctc atggcctgat 1380 ggagcggata tcaatctcat gcctatataa 1410 // ID KC310693; SV 1; linear; viral cRNA; STD; VRL; 1002 BP. XX AC KC310693; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260093/2012(H3N2)) segment 7 matrix DE protein 2 (M2) and matrix protein 1 (M1) genes, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260093/2012(H3N2)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1002 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; fe13158855625cb470c8396f29c54355. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1002 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260093/2012(H3N2))" FT /segment="7" FT /host="swine" FT /strain="A/swine/Indiana/A01260093/2012" FT /serotype="H3N2" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="lung" FT /collection_date="30-Oct-2012" FT /db_xref="taxon:1268643" FT gene 1..982 FT /gene="M2" FT CDS join(1..26,715..982) FT /codon_start=1 FT /gene="M2" FT /product="matrix protein 2" FT /db_xref="GOA:L0HU23" FT /db_xref="InterPro:IPR002089" FT /db_xref="UniProtKB/TrEMBL:L0HU23" FT /protein_id="AGB08391.1" FT /translation="MSLLTEVETPTRSEWECRCSDSSDPLAIAANIIGILHLILWITDR FT LFFKCIYRRFKYGLKRGPSTEGVPESMREEYQQEQQSAVDVDDGHFVNIELE" FT gene 1..759 FT /gene="M1" FT CDS 1..759 FT /codon_start=1 FT /gene="M1" FT /product="matrix protein 1" FT /db_xref="GOA:L0HTA3" FT /db_xref="InterPro:IPR001561" FT /db_xref="InterPro:IPR013188" FT /db_xref="InterPro:IPR015423" FT /db_xref="InterPro:IPR015799" FT /db_xref="InterPro:IPR036039" FT /db_xref="InterPro:IPR037533" FT /db_xref="UniProtKB/TrEMBL:L0HTA3" FT /protein_id="AGB08390.1" FT /translation="MSLLTEVETYVLSIIPSGPLKAEIAQRLESVFAGKNTDLEALMEW FT LKTRPILSPLTKGILGFVFTLTVPSERGLQRRRFVQNALNGNGDPNNMDRAVKLYKKLK FT REITFHGAKEVSLSYSTGALASCMGLIYNRMGTVTTEAAFGLVCATCEQIADSQHRSHR FT QMATTTNPLIRHENRMVLASTTAKAMEQMAGSSEQAAEAMEVANQTRQMVHAMRTIGTH FT PSSSTGLKDDLLENLQAYQKRMGVQMQRFK" XX SQ Sequence 1002 BP; 294 A; 213 C; 255 G; 240 T; 0 other; atgagtcttc taaccgaggt cgaaacgtac gttctttcta tcataccgtc aggccccctc 60 aaagccgaga tcgcgcagag actggaaagt gtctttgcag gaaaaaacac agatcttgag 120 gctctcatgg aatggctaaa gacaagacca atcttgtcac ctctgactaa gggaatttta 180 ggatttgtgt tcacgctcac cgtgcccagt gagcgaggac tgcagcgtag acgctttgtc 240 caaaatgccc tgaatgggaa tggggaccca aacaacatgg atagagcagt taaactatac 300 aagaagctca aaagagaaat aacgttccat ggggccaagg aggtgtcact aagctattca 360 actggtgcac ttgccagttg catgggcctc atatacaaca ggatgggaac agtgaccaca 420 gaagctgctt ttggtctagt gtgtgccact tgtgaacaga ttgctgattc acagcatcgg 480 tctcacagac agatggctac aaccaccaat ccactaatca ggcatgagaa cagaatggtg 540 ctggctagca ccacggcaaa ggctatggaa cagatggctg gatcgagtga acaggcagcg 600 gaggccatgg aggttgctaa tcagactagg cagatggtac atgcaatgag aactattggg 660 actcatccta gctccagtac tggtctgaaa gatgaccttc ttgaaaattt gcaggcctac 720 cagaagcgaa tgggagtgca gatgcagcga ttcaagtgat cctctcgcca ttgcagcaaa 780 tatcattggg atcttgcacc tgatattgtg gattactgat cgtctttttt tcaaatgtat 840 ttatcgtcgc tttaaatacg gtttgaaaag agggccttct acggaaggag tgcctgagtc 900 catgagggaa gaatatcaac aggaacagca gagtgctgtg gatgttgacg atggtcattt 960 tgtcaacata gagctagagt aaaaaactac cttgtttcta ta 1002 // ID KC310694; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC310694; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260099/2012(H3N2)) segment 4 DE hemagglutinin (HA) gene, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260099/2012(H3N2)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; 91d2a55d0848d158176a6090f77dd05d. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260099/2012(H3N2))" FT /segment="4" FT /host="swine" FT /strain="A/swine/Indiana/A01260099/2012" FT /serotype="H3N2" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="lung" FT /collection_date="02-Nov-2012" FT /db_xref="taxon:1268644" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /function="receptor binding and fusion protein" FT /db_xref="GOA:L0HRC9" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HRC9" FT /protein_id="AGB08392.1" FT /translation="MKTIIAFSCILCLIFAQKLPGSDNSMATLCLGHHAVPNGTLVKTI FT TDDQIEVTNATELVQSSSTGRICNSPHQILDGKNCTLIDALLGDPHCDDFQNKEWDLFV FT ERSTAYSNCYPYYVPDYATLRSLVASSGNLEFTQESFNWTGVAQDGSSYACRRGSVNSF FT FSRLNWLYNLNYKYPEQNVTMPNNDKFDKLYIWGVHHPGTDKDQTNLYVQASGRVIVST FT KRSQQTVIPNIGSRPWVRGVSSIISIYWTIVKPGDILLINSTGNLIAPRGYFKIQSGKS FT SIMRSDAHIDECNSECITPNGSISNDKPFQNVNKITYGACPRYVKQNTLKLATGMRNVP FT EKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAINQITGKLNR FT VIKKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDS FT EMSKLFERTRRQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDIYRNEALNNRFQ FT IKGVQLKSGYKDWILWISFAISCFLLCVVLLGFIMWACQKGNIRCNICI" FT sig_peptide 1..48 FT /gene="HA" FT mat_peptide 49..1035 FT /gene="HA" FT /product="HA1" FT mat_peptide 1036..1698 FT /gene="HA" FT /product="HA2" XX SQ Sequence 1701 BP; 578 A; 342 C; 372 G; 409 T; 0 other; atgaagacta tcattgcttt tagctgcatt ctatgtctga ttttcgctca aaaacttccc 60 ggaagtgaca acagcatggc aacgctgtgc ctgggacacc atgcagtgcc aaacggaaca 120 ttagtgaaaa caatcacgga tgaccaaatt gaagtgacta atgctactga gctggtccag 180 agttcctcaa caggtagaat atgcaacagt cctcaccaaa tccttgatgg gaaaaattgc 240 acactgatag atgctctatt gggggaccct cattgtgatg acttccaaaa caaggaatgg 300 gacctttttg ttgaacgaag cacagcctac agcaactgtt acccttatta tgtgccagat 360 tatgccaccc ttaggtcact agttgcctca tctggcaacc tggaatttac ccaagaaagc 420 ttcaattgga ctggagttgc tcaagacgga tcaagctatg cctgcagaag gggatctgtt 480 aacagtttct ttagtagatt gaattggttg tataacttga attacaagta tccagaacag 540 aacgtaacta tgccaaacaa tgacaaattt gacaaattgt acatttgggg ggttcaccac 600 ccgggtacgg acaaggacca aaccaaccta tatgtccaag catcagggag agttatagtc 660 tctaccaaaa gaagccaaca aactgtaatc ccgaatatcg ggtctagacc ctgggtaagg 720 ggtgtatcca gcataataag catctattgg acgatagtaa aaccgggaga catacttttg 780 attaacagca cagggaatct aattgcccct cggggttact tcaaaataca aagtgggaaa 840 agctcaataa tgagatcaga tgcacacatt gatgaatgca attctgaatg cattactcca 900 aatggaagca tttccaatga caaacctttt caaaatgtaa acaagatcac atatggagcc 960 tgtcccagat atgttaagca aaacaccctg aaattggcaa caggaatgcg gaatgtacca 1020 gagaaacaaa ctagaggcat attcggcgca attgcaggtt tcatagaaaa tggttgggag 1080 ggaatggtag acggttggta cggtttcagg catcagaatt ctgaaggcac aggacaagca 1140 gcagatctta aaagcactca agcagcaatc aaccaaatca ccgggaaact aaatagagta 1200 atcaagaaaa cgaacgagaa attccatcaa atcgaaaaag aattctcaga agtagaaggg 1260 agaattcagg acctagagaa atacgttgaa gacactaaaa tagatctctg gtcttacaac 1320 gctgagcttc ttgttgccct ggagaaccaa catacaattg atttaaccga ctcagagatg 1380 agcaaactgt tcgaaagaac aagaaggcaa ctgcgggaaa atgctgagga catgggcaat 1440 ggttgcttca aaatatacca caaatgtgac aatgcctgca taggatcaat cagaaatgga 1500 acttatgacc atgatatata cagaaacgag gcattaaaca atcggttcca gatcaaaggt 1560 gttcagctaa aatcaggata caaagattgg atcctatgga tttcctttgc catatcatgc 1620 tttttgcttt gtgttgttct gctggggttc attatgtggg cctgccaaaa aggcaacatt 1680 aggtgcaaca tttgcatttg a 1701 // ID KC310695; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC KC310695; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260099/2012(H3N2)) segment 6 DE neuraminidase (NA) gene, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260099/2012(H3N2)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; e33535539535690253b8823b500846c3. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260099/2012(H3N2))" FT /segment="6" FT /host="swine" FT /strain="A/swine/Indiana/A01260099/2012" FT /serotype="H3N2" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="lung" FT /collection_date="02-Nov-2012" FT /db_xref="taxon:1268644" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:L0HS09" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:L0HS09" FT /protein_id="AGB08393.1" FT /translation="MNPNQKIITIGSVSLIIATICFLMQVAILVTTVTLHFKQHDCDSS FT PNNQVMFCEPTIIERNKTEIVYLTNTTVEKEICPKPAKYRNWSKPQCNITGFAPFSKDN FT SIRLSAGGDIWVTREPYVSCDPDKCYQFALGQGTTLNNGHSNDTVHDRTPYRTLLMNEL FT GVPFHLGTRQVCIAWSSSSCHDGKAWLHVCITGDDKNATASFIYNGRLVDSIGSWSKNI FT LRTQESECVCINGTCTVVMTDGSASGKADTKILFIEEGKIIHISTLSGSAQHVEECSCY FT PRYPGVRCVCRDNWKGSNRPIVDINVKDYSIVSSYVCSGLVGDTPRKNDSFSSSHCLDP FT NNEEGGHGVKGWAFDDGNDVWMGRTISEKSRLGYETFKVIKGWSKPNSKLQTNRQVIVG FT RGNRSGYSGIFSVEGKNCINRCFYVELIRGRKEETKVWWTSNSIVVFCGTSGTYGTGSW FT PDGADINLMPI" XX SQ Sequence 1410 BP; 438 A; 267 C; 336 G; 369 T; 0 other; atgaatccaa atcaaaagat aataacaatt ggctctgttt ctctcatcat tgccacaata 60 tgcttcctta tgcaagttgc catcctggtg actactgtaa cactgcattt caagcaacat 120 gattgcgact cctccccaaa caaccaagta atgttttgtg aaccaacaat aatagaaaga 180 aacaaaacgg agattgtgta tctgaccaac accactgtag agaaggaaat atgccccaaa 240 ccagcaaaat acagaaattg gtcaaagcct caatgtaaca ttacaggatt tgcacctttt 300 tctaaggaca attcgattcg gctttctgct ggtggggaca tctgggtgac aagagaacct 360 tatgtgtcat gcgatcccga caagtgttat caatttgccc ttgggcaggg aacaacacta 420 aacaacgggc attcaaatga cactgtacat gataggaccc cttaccgaac cctattgatg 480 aatgaattgg gtgttccatt tcatttggga accaggcaag tgtgcatagc atggtccagt 540 tcaagttgtc acgatgggaa agcatggctg catgtttgta taactgggga tgataaaaat 600 gcaactgcta gcttcattta caatgggagg ctagtagata gtattggttc atggtccaaa 660 aatatactaa gaacccagga gtcggaatgc gtttgtatta atggaacttg tacagtagtc 720 atgactgatg gaagcgcttc cggaaaagct gatactaaaa tattattcat tgaggagggg 780 aaaatcattc atattagcac gttgtcagga agtgcgcagc atgtcgagga gtgctcttgt 840 tatcctcgat atcctggtgt cagatgcgtc tgcagagaca actggaaagg ctccaatagg 900 cccatagttg atataaatgt aaaggattat agcattgttt ccagttatgt atgctctgga 960 cttgttggag acacacccag aaaaaacgac agcttcagca gtagtcattg cctagatcct 1020 aacaatgagg aaggtggtca tggggtgaaa ggctgggcct ttgatgatgg aaatgacgtg 1080 tggatgggaa gaacgatcag cgagaagtca cgcttaggct atgaaacctt caaagtcatc 1140 aaaggatggt ccaaacccaa ctccaaatta cagacaaata ggcaagttat agttggtaga 1200 ggtaacaggt ccggttattc tggtattttc tccgttgaag gcaaaaactg catcaatagg 1260 tgcttttatg tggagttgat aaggggaagg aaagaggaaa ctaaagtctg gtggacctca 1320 aacagtattg ttgtgttttg tggcacctca ggtacgtatg gaacaggctc atggcctgat 1380 ggggcggata tcaatctcat gcctatataa 1410 // ID KC310696; SV 1; linear; viral cRNA; STD; VRL; 983 BP. XX AC KC310696; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260099/2012(H3N2)) segment 7 matrix DE protein 2 (M2) and matrix protein 1 (M1) genes, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260099/2012(H3N2)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-983 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; 5c2c572cd98865672c445764a4429fae. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..983 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260099/2012(H3N2))" FT /segment="7" FT /host="swine" FT /strain="A/swine/Indiana/A01260099/2012" FT /serotype="H3N2" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="lung" FT /collection_date="02-Nov-2012" FT /db_xref="taxon:1268644" FT gene 1..982 FT /gene="M2" FT CDS join(1..26,715..982) FT /codon_start=1 FT /gene="M2" FT /product="matrix protein 2" FT /db_xref="GOA:L0HTA7" FT /db_xref="InterPro:IPR002089" FT /db_xref="UniProtKB/TrEMBL:L0HTA7" FT /protein_id="AGB08395.1" FT /translation="MSLLTEVETPTRSEWECRCSDSSDPLAIAANIIGILHLILWITDR FT LFFKCIYRRFKYGLKRGPSTEGVPESMREEYQQEQQSAVDVDDGHFVNIELE" FT gene 1..759 FT /gene="M1" FT CDS 1..759 FT /codon_start=1 FT /gene="M1" FT /product="matrix protein 1" FT /db_xref="GOA:L0HPG5" FT /db_xref="InterPro:IPR001561" FT /db_xref="InterPro:IPR013188" FT /db_xref="InterPro:IPR015423" FT /db_xref="InterPro:IPR015799" FT /db_xref="InterPro:IPR036039" FT /db_xref="InterPro:IPR037533" FT /db_xref="UniProtKB/TrEMBL:L0HPG5" FT /protein_id="AGB08394.1" FT /translation="MSLLTEVETYVLSIIPSGPLKAEIAQRLESVFAGKNTDLEALMEW FT LKTRPILSPLTKGILGFVFTLTVPSERGLQRRRFVQNALNGNGDPNNMDRAVKLYKKLK FT REITFHGAKEVSLSYSTGALASCMGLIYNRMGTVTTEAAFGLVCATCEQIADSQHRSHR FT QMATTTNPLIRHENRMVLASTTAKAMEQMAGSSEQAAEAMEVANQTRQMVHAMRTIGTH FT PSSSTGLKDDLLENLQAYQKRMGVQMQRFK" XX SQ Sequence 983 BP; 286 A; 208 C; 256 G; 233 T; 0 other; atgagtcttc taaccgaggt cgaaacgtac gttctttcta tcataccgtc aggccccctc 60 aaagccgaga tcgcgcagag actggaaagt gtctttgcag gaaagaacac agatcttgag 120 gctctcatgg aatggctaaa gacaagacca atcttgtcac ctctgactaa gggaatttta 180 ggatttgtgt tcacgctcac cgtgcccagt gagcgaggac tgcagcgtag acgctttgtc 240 caaaatgccc tgaatgggaa tggggaccca aacaacatgg atagagcagt taaactatac 300 aagaagctca aaagagaaat aacgttccat ggggccaagg aggtgtcact aagctattca 360 actggtgcac ttgccagttg catgggcctc atatacaaca ggatgggaac agtgaccaca 420 gaagctgctt ttggtctagt gtgtgccact tgtgaacaga ttgctgattc acagcatcgg 480 tctcacagac agatggctac aaccaccaat ccactaatca ggcatgagaa cagaatggtg 540 ctggctagca ctacggcaaa ggctatggaa cagatggctg gatcgagtga acaggcagcg 600 gaggccatgg aggttgctaa tcagactagg cagatggtac atgcaatgag aactattggg 660 actcatccta gctccagtac tggtctgaaa gatgaccttc ttgaaaattt gcaggcctac 720 cagaagcgaa tgggagtgca gatgcagcga ttcaagtgat cctctcgcca ttgcagcaaa 780 tatcattggg atcttgcacc tgatattgtg gattactgat cgtctttttt tcaaatgtat 840 ttatcgtcgc tttaaatacg gtttgaaaag agggccttct acggaaggag tgcctgagtc 900 catgagggaa gaatatcagc aggaacagca gagtgctgtg gatgttgacg atggtcattt 960 tgtcaacata gagctagagt aaa 983 // ID KC310697; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC310697; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260109/2012(H1N1)) segment 4 DE hemagglutinin (HA) gene, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260109/2012(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; 924aa1f614c728cf3b3bafa8341fa1e9. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260109/2012(H1N1))" FT /segment="4" FT /host="swine" FT /strain="A/swine/Indiana/A01260109/2012" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="nasal swab" FT /collection_date="08-Nov-2012" FT /db_xref="taxon:1268645" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /function="receptor binding and fusion protein" FT /db_xref="GOA:L0HU28" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HU28" FT /protein_id="AGB08396.1" FT /translation="MKAILVVLLYTFTTANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLENRHNGKLCKLRGVAPLHLGKCNIAGWLLGNPECESLSTASSWSYIVETSNSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDTNRGVTAACPHAGTNSFYR FT NLIWLVKKGNSYPKINKSYINNKEKEVLVLWAIHHPSTSADQQSLYQNANAYVFVGSSR FT YSRKFEPEIATRPKVRDQAGRMNYYWTLIEPGDKITFEATGNLVAPRYAFALKRNSGSG FT IIISDTSVHDCDTTCQTPNGAINTSLPFQNIHPVTIGECPKYVKSTKLRMATGLRNIPS FT IQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADLKSTQNAIDGITNKVNSV FT IEKMNTQFTAVGKEFSHLERRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDDMCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" FT sig_peptide 1..51 FT /gene="HA" FT mat_peptide 52..1032 FT /gene="HA" FT /product="HA1" FT mat_peptide 1033..1698 FT /gene="HA" FT /product="HA2" XX SQ Sequence 1701 BP; 598 A; 317 C; 370 G; 416 T; 0 other; atgaaggcaa tactagtagt cctgctatat acatttacaa ccgcaaatgc cgacacatta 60 tgtataggtt atcatgcaaa caattcaact gacaccgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt caaccttcta gaaaataggc ataatgggaa actatgtaaa 180 ctaagagggg tagctccatt gcatttgggt aaatgtaaca ttgctggctg gcttctggga 240 aatccagagt gtgaatcact ctccacagca agttcatggt cctatattgt ggaaacatct 300 aattcagaca atgggacgtg ttacccagga gatttcatca attatgagga gctaagagag 360 cagttgagct cagtgtcatc atttgaaaga tttgagatat tccccaagac aagttcatgg 420 cccaatcatg acacgaacag aggtgtgacg gcagcatgtc ctcatgctgg gacaaacagc 480 ttctacagaa atttaatatg gctagtaaaa aagggaaatt catacccaaa gatcaacaaa 540 tcctacatta acaataaaga gaaggaagtt ctcgtgctat gggccattca ccatccatct 600 accagtgccg accaacaaag tctctaccaa aatgcaaatg cctatgtgtt tgtggggtca 660 tcaagataca gcaggaagtt cgaaccagaa atagcaacaa gacctaaggt gagagaccaa 720 gcagggagaa tgaactatta ctggacacta atagagcctg gagacaagat aacattcgaa 780 gcaactggaa atctagtggc accgagatat gccttcgcat tgaaaagaaa ttctggatct 840 ggtattatca tttcagatac atcagtccac gattgtgata cgacttgtca gacacccaat 900 ggtgctataa acaccagcct cccatttcaa aatatacatc cagtcacaat tggagaatgt 960 ccaaaatatg taaaaagtac taaactgaga atggccacag gtttaaggaa tatcccgtct 1020 attcaatcta gaggcctgtt tggtgccatt gctggcttta tcgaaggggg ttggacagga 1080 atgatagatg gatggtacgg ttatcaccat caaaatgagc agggatcagg atatgcagcc 1140 gacctaaaga gcacacagaa tgccattgac gggatcacta acaaggtaaa ctctgttatt 1200 gaaaagatga acacacaatt cacggcagta ggtaaagagt tcagccactt ggaaagaaga 1260 atagagaatt taaataaaaa ggttgatgat ggttttctag atatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggatt accacgactc aaatgtgaaa 1380 aacttatatg aaaaagtaag aagccaacta aaaaacaatg ccaaggaaat tggaaatggc 1440 tgctttgaat tttaccacaa atgtgatgac atgtgcatgg aaagcgtcaa aaatggaact 1500 tatgattacc ctaaatactc agaggaagca aaactaaaca gagaggaaat agatggagta 1560 aagttggaat caacaaggat ttaccaaatt ttggcgatct attcaacggt cgccagttcg 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtcgcta 1680 cagtgcagaa tatgtattta a 1701 // ID KC310698; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC KC310698; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260109/2012(H1N1)) segment 6 DE neuraminidase (NA) gene, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260109/2012(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; 049da2f1fbbcf85d365f97976a53dc2c. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260109/2012(H1N1))" FT /segment="6" FT /host="swine" FT /strain="A/swine/Indiana/A01260109/2012" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="nasal swab" FT /collection_date="08-Nov-2012" FT /db_xref="taxon:1268645" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:L0HRD5" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:L0HRD5" FT /protein_id="AGB08397.1" FT /translation="MNTNQRIITIGTVCMIVGIISLLLQIGNIVSLWIIHSIQTGWENH FT TEMCNQSVITYVNNTWVNRTYVNISNIKIATIQDVTSIILAGNSSLCPVSGWAVYSKDN FT SIRIGSKGDIFVIREPFISCSQLECRTFFLTQGALLNDKHSNGTVKDRSPYRTLMSCPI FT GEAPSPYNSRFESVAWSASACHDGMGWLTIGISGPDNGAVAVLKYNGIITDTIKSWRNK FT ILRTQESECVCMNGSCFTVLTDGPSNGQASYKIFKVVKGKIIKSIELDAPNYHYEECSC FT YPDTGKVMCVCRDNWHASNRPWVSFNQNLDYQIGYICSGVFGDNPRSNDGKGNCGPVLS FT NGANGVKGFSYRYGNGVWIGRTKSINSRSGFEMIWDPNGWTETDSSFSMKQDIIALNDW FT SGYSGSFVQHPELTGMNCIRPCFWVELIRGQPKESTIWASGSSISFCGVNSETASWSWP FT DGADLPFTIDK" XX SQ Sequence 1410 BP; 453 A; 252 C; 332 G; 373 T; 0 other; atgaatacaa atcaaagaat aataaccatt gggacagttt gcatgatagt tggaataatc 60 agtctattgt tacagatagg aaacatagtc tcgttatgga ttatccattc aattcagacc 120 ggatgggaaa atcacactga gatgtgcaac caaagtgtca ttacatatgt aaataacaca 180 tgggtgaacc gaacttatgt gaacattagc aatatcaaaa ttgctactat acaggatgtg 240 acttcgatta tactagccgg caattcctca ctttgcccag taagtgggtg ggctgtatac 300 agcaaagaca atagcataag gattggttct aaaggggaca tttttgtcat aagagaacca 360 ttcatttcat gctctcaatt ggaatgcaga accttttttc tgacccaggg cgctttgctg 420 aatgacaaac attctaatgg aaccgtcaag gacaggagtc cctatagaac cctgatgagc 480 tgccccatcg gtgaagcccc atctccgtac aactcaaggt tcgaatcagt tgcttggtca 540 gcaagtgcat gccatgatgg gatgggatgg ctaacaatcg ggatctctgg tccagataat 600 ggagcagtag ctgttttaaa atacaacggt ataataacag atacaataaa aagttggaga 660 aacaaaatat taagaacaca agagtcggaa tgtgtttgta tgaacggttc ttgttttact 720 gtattaactg atggcccaag caatgggcaa gcctcgtaca aaatatttaa agtggtaaaa 780 ggaaaaataa ttaagtcaat tgagctggat gcccccaatt accactatga ggaatgctca 840 tgttatcctg atacaggcaa agtaatgtgt gtttgcagag acaattggca tgcctcgaac 900 cggccatggg tctctttcaa tcagaatctt gactatcaaa taggatacat atgcagtgga 960 gttttcggtg ataacccgcg ttccaatgat gggaagggca attgtggccc agtactttcc 1020 aatggagcaa atggagtgaa aggattctca tatagatatg gtaatggtgt ttggatagga 1080 agaactaaga gtatcaactc cagaagtggg tttgaaatga tttgggatcc aaatgggtgg 1140 actgaaactg atagtagttt ctctatgaag caggacatta tagcattgaa tgattggtca 1200 ggatacagtg gaagttttgt ccaacatcct gaattaacag gaatgaattg cataaggcct 1260 tgtttctggg tggaattaat cagagggcaa cccaaggaaa gcacaatctg ggctagcgga 1320 agcagcatct ctttctgtgg cgtaaatagt gaaaccgcaa gctggtcatg gccagacgga 1380 gctgatctgc cattcaccat tgacaagtag 1410 // ID KC310699; SV 1; linear; viral cRNA; STD; VRL; 983 BP. XX AC KC310699; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260109/2012(H1N1)) segment 7 matrix DE protein 2 (M2) and matrix protein 1 (M1) genes, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260109/2012(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-983 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; e89cb738227dccb949aff11f97d734a1. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..983 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260109/2012(H1N1))" FT /segment="7" FT /host="swine" FT /strain="A/swine/Indiana/A01260109/2012" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="nasal swab" FT /collection_date="08-Nov-2012" FT /db_xref="taxon:1268645" FT gene 1..982 FT /gene="M2" FT CDS join(1..26,715..982) FT /codon_start=1 FT /gene="M2" FT /product="matrix protein 2" FT /db_xref="GOA:L0HPH1" FT /db_xref="InterPro:IPR002089" FT /db_xref="UniProtKB/TrEMBL:L0HPH1" FT /protein_id="AGB08399.1" FT /translation="MSLLTEVETPTKSEWECRCSDSSDPLAIAANIVGILHLILWITDR FT LFFKCIYRRFKYGLKRGPSTEGVPESMREEYQQEQQSAVDVDDGHFVNLELG" FT gene 1..759 FT /gene="M1" FT CDS 1..759 FT /codon_start=1 FT /gene="M1" FT /product="matrix protein 1" FT /db_xref="GOA:L0HS14" FT /db_xref="InterPro:IPR001561" FT /db_xref="InterPro:IPR013188" FT /db_xref="InterPro:IPR015423" FT /db_xref="InterPro:IPR015799" FT /db_xref="InterPro:IPR036039" FT /db_xref="InterPro:IPR037533" FT /db_xref="UniProtKB/TrEMBL:L0HS14" FT /protein_id="AGB08398.1" FT /translation="MSLLTEVETYVLSIIPSGPLKAEIAQRLESVFAGKNTDLEALMEW FT LKTRPILSPLTKGILGFVFTLTVPSERGLQRRRFVQNALNGNGDPNNMDRAVKLYKKLK FT REITFHGAKEVSLSYSTGALASCMGLIYNRMGTVTTEAAFGLVCATCEQIADSQHRSHR FT QMATTTNPLIRHENRMVLASTTAKAMEQMAGSSEQAAEAMEVANQTRQMVHAMRTIGTH FT PSSSTGLKDDLLENLQAYQKRMGVQMQRFK" XX SQ Sequence 983 BP; 283 A; 207 C; 258 G; 235 T; 0 other; atgagtcttc taaccgaggt cgaaacgtac gttctttcta tcataccgtc aggccccctc 60 aaagccgaga tcgcgcagag actggaaagt gtctttgcag gaaagaacac agatcttgag 120 gctctcatgg aatggctaaa gacaagacca atcttgtcac ctctgactaa gggaatttta 180 ggatttgtgt tcacgctcac cgtgcccagt gagcgaggac tgcagcgtag acgctttgtc 240 caaaatgccc tgaatgggaa tggggaccca aacaatatgg atagagcagt taaactatac 300 aagaagctca aaagagaaat aacgttccat ggggccaagg aggtgtcact aagctattca 360 actggtgcac ttgccagttg catgggcctc atatacaaca ggatgggaac agtgaccaca 420 gaagctgctt ttggtctagt gtgtgccact tgtgaacaga ttgctgattc acagcatcgg 480 tctcacagac agatggctac aaccaccaat ccactaatca ggcatgagaa cagaatggtg 540 ctggctagca ctacggcaaa ggctatggaa cagatggctg gatcgagtga acaggcagcg 600 gaggccatgg aggttgctaa tcagactagg cagatggtac atgcaatgag aactattggg 660 actcatccta gctccagtac tggtctgaaa gatgaccttc ttgaaaattt gcaggcctac 720 caaaagcgaa tgggagtgca gatgcagcga ttcaagtgat cctctcgcca ttgcagcgaa 780 tatcgttggg atcttgcacc tgatattgtg gattactgat cgtctttttt tcaaatgtat 840 ttatcgtcgc tttaaatacg gtttgaaaag agggccttct acggaaggag tgcctgagtc 900 catgagggaa gaatatcaac aggaacagca gagtgctgtg gatgttgacg atggtcattt 960 tgtcaactta gagctggggt aaa 983 // ID KC310700; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC310700; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260110/2012(H1N1)) segment 4 DE hemagglutinin (HA) gene, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260110/2012(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; 1fa5bf96860c7f3ea075ba989c274cad. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260110/2012(H1N1))" FT /segment="4" FT /host="swine" FT /strain="A/swine/Indiana/A01260110/2012" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="nasal swab" FT /collection_date="08-Nov-2012" FT /db_xref="taxon:1268646" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /function="receptor binding and fusion protein" FT /db_xref="GOA:L0HTB5" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HTB5" FT /protein_id="AGB08400.1" FT /translation="MKAILVVLLYTFTTANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLENRHNGKLCKLRGVAPLHLGKCNIAGWLLGNPECESLSTASSWSYIVETSNSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDTNRGVTAACPHAGTNSFYR FT NLIWLVKKGNSYPKINKSYINNKEKEVLVLWAIHHPSTSADQQSLYQNANAYVFVGSSR FT YSRKFEPEIATRPKVRDQAGRMNYYWTLIEPGDKITFEATGNLVAPRYAFALKRNSGSG FT IIISDTSVHDCDTTCQTPNGAINTSLPFQNIHPVTIGECPKYVKSTKLRMATGLRNIPS FT IQSRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADLKSTQNAIDGITNKVNSV FT IEKMNTQFTAVGKEFSHLERRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDDMCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" FT sig_peptide 1..51 FT /gene="HA" FT mat_peptide 52..1032 FT /gene="HA" FT /product="HA1" FT mat_peptide 1033..1698 FT /gene="HA" FT /product="HA2" XX SQ Sequence 1701 BP; 597 A; 317 C; 371 G; 416 T; 0 other; atgaaggcaa tactagtagt cctgctatat acatttacaa ccgcaaatgc cgacacatta 60 tgtataggtt atcatgcaaa caattcaact gacaccgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt caaccttcta gaaaataggc ataatgggaa actatgtaaa 180 ctaagagggg tagctccatt gcatttgggt aaatgtaaca ttgctggctg gcttctggga 240 aatccagagt gtgaatcact ctccacagca agttcatggt cctatattgt ggaaacatct 300 aattcagaca atgggacgtg ttacccagga gatttcatca attatgagga gctaagagag 360 cagttgagct cagtgtcatc atttgaaaga tttgagatat tccccaagac aagttcatgg 420 cccaatcatg acacgaacag aggtgtgacg gcagcatgtc ctcatgctgg gacaaacagc 480 ttctacagaa atttaatatg gctagtaaaa aagggaaatt catacccaaa gatcaacaaa 540 tcctacatta acaataaaga gaaggaagtt ctcgtgctat gggccattca ccatccatct 600 accagtgccg accaacaaag tctctaccaa aatgcaaatg cctatgtgtt tgtggggtca 660 tcaagataca gcaggaagtt cgaaccagaa atagcaacaa gacctaaggt gagagaccaa 720 gcagggagaa tgaactatta ctggacacta atagagcctg gagacaagat aacattcgaa 780 gcaactggaa atctagtggc accgagatat gccttcgcat tgaaaagaaa ttctggatct 840 ggtattatca tttcagatac atcagtccac gattgtgata cgacttgtca gacacccaat 900 ggtgctataa acaccagcct cccatttcaa aatatacatc cagtcacaat tggagaatgt 960 ccaaaatatg taaaaagtac taaactgaga atggccacag gtttaaggaa tatcccgtct 1020 attcaatcta gaggcctgtt tggtgccatt gctggcttta tcgaaggggg ttggacagga 1080 atgatagatg gatggtacgg ttatcaccat caaaatgagc agggatcagg atatgcagcc 1140 gacctaaaga gcacacagaa tgccattgac gggatcacta acaaggtaaa ctctgttatt 1200 gaaaagatga acacacaatt cacggcagta ggtaaagagt tcagccactt ggaaagaaga 1260 atagagaatt taaataaaaa ggttgatgat ggttttctag atatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggatt accacgactc aaatgtgaaa 1380 aacttatatg aaaaagtaag aagccaacta aaaaacaatg ccaaggaaat tggaaatggc 1440 tgctttgaat tttaccacaa atgtgatgac atgtgcatgg aaagcgtcaa aaatggaact 1500 tatgattacc ctaaatactc agaggaagca aaactaaaca gagaggaaat agatggggta 1560 aagttggaat caacaaggat ttaccaaatt ttggcgatct attcaacggt cgccagttcg 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtcgcta 1680 cagtgcagaa tatgtattta a 1701 // ID KC310701; SV 1; linear; viral cRNA; STD; VRL; 1410 BP. XX AC KC310701; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260110/2012(H1N1)) segment 6 DE neuraminidase (NA) gene, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260110/2012(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1410 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; 049da2f1fbbcf85d365f97976a53dc2c. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1410 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260110/2012(H1N1))" FT /segment="6" FT /host="swine" FT /strain="A/swine/Indiana/A01260110/2012" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="nasal swab" FT /collection_date="08-Nov-2012" FT /db_xref="taxon:1268646" FT gene 1..1410 FT /gene="NA" FT CDS 1..1410 FT /codon_start=1 FT /gene="NA" FT /product="neuraminidase" FT /db_xref="GOA:L0HU36" FT /db_xref="InterPro:IPR001860" FT /db_xref="InterPro:IPR033654" FT /db_xref="InterPro:IPR036278" FT /db_xref="UniProtKB/TrEMBL:L0HU36" FT /protein_id="AGB08401.1" FT /translation="MNTNQRIITIGTVCMIVGIISLLLQIGNIVSLWIIHSIQTGWENH FT TEMCNQSVITYVNNTWVNRTYVNISNIKIATIQDVTSIILAGNSSLCPVSGWAVYSKDN FT SIRIGSKGDIFVIREPFISCSQLECRTFFLTQGALLNDKHSNGTVKDRSPYRTLMSCPI FT GEAPSPYNSRFESVAWSASACHDGMGWLTIGISGPDNGAVAVLKYNGIITDTIKSWRNK FT ILRTQESECVCMNGSCFTVLTDGPSNGQASYKIFKVVKGKIIKSIELDAPNYHYEECSC FT YPDTGKVMCVCRDNWHASNRPWVSFNQNLDYQIGYICSGVFGDNPRSNDGKGNCGPVLS FT NGANGVKGFSYRYGNGVWIGRTKSINSRSGFEMIWDPNGWTETDSSFSMKQDIIALNDW FT SGYSGSFVQHPELTGMNCIRPCFWVELIRGQPKESTIWASGSSISFCGVNSETASWSWP FT DGADLPFTIDK" XX SQ Sequence 1410 BP; 453 A; 252 C; 332 G; 373 T; 0 other; atgaatacaa atcaaagaat aataaccatt gggacagttt gcatgatagt tggaataatc 60 agtctattgt tacagatagg aaacatagtc tcgttatgga ttatccattc aattcagacc 120 ggatgggaaa atcacactga gatgtgcaac caaagtgtca ttacatatgt aaataacaca 180 tgggtgaacc gaacttatgt gaacattagc aatatcaaaa ttgctactat acaggatgtg 240 acttcgatta tactagccgg caattcctca ctttgcccag taagtgggtg ggctgtatac 300 agcaaagaca atagcataag gattggttct aaaggggaca tttttgtcat aagagaacca 360 ttcatttcat gctctcaatt ggaatgcaga accttttttc tgacccaggg cgctttgctg 420 aatgacaaac attctaatgg aaccgtcaag gacaggagtc cctatagaac cctgatgagc 480 tgccccatcg gtgaagcccc atctccgtac aactcaaggt tcgaatcagt tgcttggtca 540 gcaagtgcat gccatgatgg gatgggatgg ctaacaatcg ggatctctgg tccagataat 600 ggagcagtag ctgttttaaa atacaacggt ataataacag atacaataaa aagttggaga 660 aacaaaatat taagaacaca agagtcggaa tgtgtttgta tgaacggttc ttgttttact 720 gtattaactg atggcccaag caatgggcaa gcctcgtaca aaatatttaa agtggtaaaa 780 ggaaaaataa ttaagtcaat tgagctggat gcccccaatt accactatga ggaatgctca 840 tgttatcctg atacaggcaa agtaatgtgt gtttgcagag acaattggca tgcctcgaac 900 cggccatggg tctctttcaa tcagaatctt gactatcaaa taggatacat atgcagtgga 960 gttttcggtg ataacccgcg ttccaatgat gggaagggca attgtggccc agtactttcc 1020 aatggagcaa atggagtgaa aggattctca tatagatatg gtaatggtgt ttggatagga 1080 agaactaaga gtatcaactc cagaagtggg tttgaaatga tttgggatcc aaatgggtgg 1140 actgaaactg atagtagttt ctctatgaag caggacatta tagcattgaa tgattggtca 1200 ggatacagtg gaagttttgt ccaacatcct gaattaacag gaatgaattg cataaggcct 1260 tgtttctggg tggaattaat cagagggcaa cccaaggaaa gcacaatctg ggctagcgga 1320 agcagcatct ctttctgtgg cgtaaatagt gaaaccgcaa gctggtcatg gccagacgga 1380 gctgatctgc cattcaccat tgacaagtag 1410 // ID KC310702; SV 1; linear; viral cRNA; STD; VRL; 985 BP. XX AC KC310702; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/swine/Indiana/A01260110/2012(H1N1)) segment 7 matrix DE protein 2 (M2) and matrix protein 1 (M1) genes, complete cds. XX KW . XX OS Influenza A virus (A/swine/Indiana/A01260110/2012(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-985 RG USDA Swine Surveillance RA ; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL National Vetennary Services Laboratories, USDA, 1920 Dayton Ave, Ames, IA RL 50010, USA XX DR MD5; f15fa5da1cf4840fe63c15d53ca50d95. XX CC ##Assembly-Data-START## CC Assembly Method :: DNA STAR v. 9 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..985 FT /organism="Influenza A virus FT (A/swine/Indiana/A01260110/2012(H1N1))" FT /segment="7" FT /host="swine" FT /strain="A/swine/Indiana/A01260110/2012" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="USA:Indiana" FT /isolation_source="nasal swab" FT /collection_date="08-Nov-2012" FT /db_xref="taxon:1268646" FT gene 1..982 FT /gene="M2" FT CDS join(1..26,715..982) FT /codon_start=1 FT /gene="M2" FT /product="matrix protein 2" FT /db_xref="GOA:L0HS21" FT /db_xref="InterPro:IPR002089" FT /db_xref="UniProtKB/TrEMBL:L0HS21" FT /protein_id="AGB08403.1" FT /translation="MSLLTEVETPTRSEWECRCSDSSDPLAIAANIIGILHLILWITDR FT LFFKCIYRRFKYGLKRGPSTEGVPESMREEYQQEQQSAVDVDDGHFVNIELE" FT gene 1..759 FT /gene="M1" FT CDS 1..759 FT /codon_start=1 FT /gene="M1" FT /product="matrix protein 1" FT /db_xref="GOA:L0HRD7" FT /db_xref="InterPro:IPR001561" FT /db_xref="InterPro:IPR013188" FT /db_xref="InterPro:IPR015423" FT /db_xref="InterPro:IPR015799" FT /db_xref="InterPro:IPR036039" FT /db_xref="InterPro:IPR037533" FT /db_xref="UniProtKB/TrEMBL:L0HRD7" FT /protein_id="AGB08402.1" FT /translation="MSLLTEVETYVLSIIPSGPLKAEIAQRLESVFAGKNTDLEALMEW FT LKTRPILSPLTKGILGFVFTLTVPSERGLQRRRFVQNALNGNGDPNNMDRAVKLYKKLK FT REITFHGAKEVSLSYSTGALASCMGLIYNRMGTVTTEAAFGLVCATCEQIADSQHRSHR FT QMATTTNPLIRHENRMVLASTTAKAMEQMAGSSEQAAEAMEVANQTRQMVHAMRTIGTH FT PSSSTGLKDDLLENLQAYQKRMGVQMQRFK" XX SQ Sequence 985 BP; 289 A; 207 C; 255 G; 234 T; 0 other; atgagtcttc taaccgaggt cgaaacgtac gttctttcta tcataccgtc aggccccctc 60 aaagccgaga tcgcgcagag actggaaagt gtctttgcag gaaagaacac agatcttgag 120 gctctcatgg aatggctaaa gacaagacca atcttgtcac ctctgactaa gggaatttta 180 ggatttgtgt tcacgctcac cgtgcccagt gagcgaggac tgcagcgtag acgctttgtc 240 caaaatgccc tgaatgggaa tggggaccca aacaatatgg atagagcagt taaactatac 300 aagaagctca aaagagaaat aacgttccat ggggccaagg aggtgtcact aagctattca 360 actggtgcac ttgccagttg catgggcctc atatacaaca ggatgggaac agtgaccaca 420 gaagctgctt ttggtctagt gtgtgccact tgtgaacaga ttgctgattc acagcatcgg 480 tctcacagac agatggctac aaccaccaat ccactaatca ggcatgagaa cagaatggtg 540 ctggctagca ctacggcaaa ggctatggaa cagatggctg gatcgagtga acaggcagcg 600 gaggccatgg aggttgctaa tcagactagg cagatggtac atgcaatgag aactattggg 660 actcatccta gctccagtac tggtctgaaa gatgaccttc ttgaaaattt gcaggcctac 720 cagaagcgaa tgggagtgca gatgcagcga ttcaagtgat cctctcgcca ttgcagcaaa 780 tatcattggg atcttgcacc tgatattgtg gattactgat cgtctttttt tcaaatgtat 840 ttatcgtcgc tttaaatacg gtttgaaaag agggccttct acggaaggag tgcctgagtc 900 catgagggaa gaatatcaac aggaacagca gagtgctgtg gatgttgacg atggtcattt 960 tgtcaacata gagctagagt aaaaa 985 // ID KC310737; SV 1; linear; genomic RNA; STD; VRL; 7630 BP. XX AC KC310737; XX DT 03-SEP-2013 (Rel. 118, Created) DT 03-SEP-2013 (Rel. 118, Last updated, Version 1) XX DE Encephalomyocarditis virus isolate Sing-M100-02, partial genome. XX KW . XX OS Encephalomyocarditis virus OC Viruses; Riboviria; Picornavirales; Picornaviridae; Cardiovirus. XX RN [1] RC Publication Status: Online-Only RP 1-7630 RX PUBMED; 23914943. RA Yeo D.S., Lian J.E., Fernandez C.J., Lin Y.N., Liaw J.C., Soh M.L., RA Lim E.A., Chan K.P., Ng M.L., Tan H.C., Oh S., Ooi E.E., Tan B.H.; RT "A highly divergent Encephalomyocarditis virus isolated from nonhuman RT primates in Singapore"; RL Virol J 10(1):248-248(2013). XX RN [2] RP 1-7630 RA Yeo D.S., Lian J.E., Fernandez C.J., Lin Y.-N., Tan B.H.; RT ; RL Submitted (09-DEC-2012) to the INSDC. RL Detection & Diagnostics Laboratory, DMERI @ DSO National Laboratories RL Singapore, 27 Medical Drive 13-00, Singapore 117510, Singapore XX DR MD5; f7247773abf508878e3945d9f7662a4c. DR EuropePMC; PMC3750836; 23914943. DR EuropePMC; PMC5656786; 28786807. XX CC ##Assembly-Data-START## CC Assembly Method :: SeqMan DNASTAR v. Lasergene 8 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..7630 FT /organism="Encephalomyocarditis virus" FT /host="orangutan" FT /isolate="Sing-M100-02" FT /mol_type="genomic RNA" FT /country="Singapore" FT /isolation_source="lung" FT /collection_date="2002" FT /db_xref="taxon:12104" FT 5'UTR <1..573 FT CDS 574..7479 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:T1WNN9" FT /db_xref="InterPro:IPR000199" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR015031" FT /db_xref="InterPro:IPR021573" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="InterPro:IPR037080" FT /db_xref="InterPro:IPR037243" FT /db_xref="UniProtKB/TrEMBL:T1WNN9" FT /protein_id="AGU38151.1" FT /translation="MATIMEQEICAHTMTFEECPKCSALQYRNGFYLLKYDEEWYPEEL FT LIEGEDDVFDPELDMEVVFETQGNSTSSDKNNSSSEGNEGVIINNFYSNQYQNSIDLSA FT NATGNNPPKTYGQFSNLLSGAVNAFTNMLPLLNDQNTEEMENLSDRVAQDTAGNTVTNT FT QSTVGRLLGYGVSHNGEHPASCADTASEKILAVERYYTFKVTDWTSTQKAFEYIRIPLP FT HVLSGESGGVFGAALRRHYLVKTGWRVQIQCNASQFHAGSLLVFMAPEYPTLDAFVMDN FT RWSKDNLPNGTKTQTNKKGPFGMDHQNYWQWTLYPHQFLNLRTNTTVDLEVPYVNIAPT FT SSWTQHASWTLVIAVVAPLTYSTGASTSLDITASIQPVRPVFNGLRHETLQTQSPIPVT FT IREHAGTWYSTLPDTTVPIYGKTPVAPSNYMVGEYTDFLEIAQIPTFIGNKIPNAVPYI FT EATNTVVKTNPLATYQVTLSCTCLANTFLAALSRNFAQYRGSMVYTFVFTGTAMMKGKF FT LIAYTPPGAGKPTTRDQAMQATYAIWDLGLNSTYSFTVPFISPSHFRMVGTDQVNITNV FT DGWVTVWQLTPLTYPPGCPNTAKILTMVSAGKDFTVKMPISPAPWSPQGIENAEKGVTE FT NTDATADFVAQPVYLPENQTKVNFFYDRYSPIGAFSVKNGTMEGAFTPFASDFCPNSVI FT LTPGPQYDPNTPQARPQRLTEIWGNGNEDTSSVFPLKTKQDYSFCLFSPFVYYKCDLEV FT TISPHTSGNHGLAVRWAPTGTPTKPTTQVLHAVSSLSEGRTPKMYSAGPGTSNHISFVV FT PYNSPLSVLPAVWYNGHKKFDNTGSLGIAPNSDFGTLFFAGTKPDVKFTVYLRYKNMKV FT FCPRPTVFFPWPSVGDKVDMTPRAGVLMLESPPFLQRAANPLDIFQTFPVLHILLEFNH FT RGIEARLFRYGQYWACCYAEVVLRSRAKQIAFLTKGSTSDCDSAAEWNPWKRTYHAILR FT AEPHRVTLDIYHKRIKPFKMPLVQKEWRTHEENIFQLWRLFDQHYAGYFSDLLIHDVEL FT NPGPFMFKPKKQVFQTQGAALTTMANTLAPSNIANQALGSAFSALLDANEDAQKAMKIM FT KTLSSLSDAWENVKDTLQNQEFWKQLLTRCVQVIAGMTIAVMHPDPLTLLCLGVMTTAE FT VTSQTNLCEEIVSKFKNIFRTPPPKFPGISLFQQQSPPLKNVNDVFSLAKNLDWAVRTV FT EKIVSWFGDWVLQEEKEQTLDEMLTRFPEHAKRISDLRNGMAAYVECKESFDFFERLYN FT QAVKEKRTGIAAVCEKFRQKHDHATARCEPVVIVLRGDAGQGKSLSSQIIAQAVSKTIF FT GRQSVYSLPPDSDFFDGYENQFAAIMDDLGQNPDGSDFTTFCQMVSTTNFLPNMASLER FT KGTPFTSQLVVATTNLPEFRPVTIAHYPAVERRITFDYSVSAGPMCSRTEAGQKVLDVE FT RAFRPTGEEAPLPCFQSDCLFLNKAGIQFRDNRTKEIISLVEVIERAVAKIERKKKVLT FT TVQTLVAQAPVDEVNFHSVVQQLRARQEATDEQLEELQEAFAKTQERSSIFSDWMKISA FT MVCAATLALSQVVKMARTVKQIFKPDLVRVQVDENEQGPYNERTRLPPKTLQLLDVQGP FT NQTMDFEKYVAKNVTAPIEFIYPTGVRIQTCLLIKNRVLAVNRHMVETDWEAIQVRGVV FT HRREAVKILAIAKTGKDTDVTFLKLNSGPLFKDNVKKFVSAKDVMPQSSSPLIGIMNSE FT IPMMYTGSFLKAGVSVPVETGNTFSHCIHYKANTKKGWCGSAVISDLGGQKKIVGMHSA FT GSMGIAAASMISQEMIGAVLNVFEPQGALEQLPDGPRIHVPRKTALRPTVAKQVFQPDF FT APAVLSKFDPRTEADVDTVAFSKHTSNQETLPPVFRMVAKEYANRVFSLLGKDNGKISV FT KQALEGMEGMDPMDRNTSPGLPYTSLGMRRTDVVDWESGTLIPFASERLENMTKGDFSG FT IVYQTFLKDELRPMEKVRAAKTRIVDVPPFEHCILGRQLLGKFASKFQTQPGLELGSAI FT GCDPDVHWTKFGVAMQSFQRVYDVDYSNFDSTHSVAMFRLLAEEFFTPENGFDPLVSQY FT LDSLAISTHAFEEKRYLITGGLPSGCAATSMLNTIMNNIIIRAGLYLTYKNFEFDDIQV FT LSYGDDLLVATDYQLDFDRVKASLAKTGYKITPANKTSSFPLESTLDDVVFLKRKFKRE FT GPLYRPVMNKEALEAMLSYYRPGSLAEKLTSVTMLAVHSGKQEYDRLFAPFREVGIMVP FT QYESVEYRWRSLFW" FT mat_peptide 574..774 FT /product="leader protein" FT mat_peptide 775..984 FT /product="1A capsid protein" FT mat_peptide 985..1752 FT /product="1B capsid protein" FT mat_peptide 1753..2445 FT /product="1C capsid protein" FT mat_peptide 2446..3276 FT /product="1D capsid protein" FT mat_peptide 3277..3726 FT /product="2A protein" FT mat_peptide 3727..4176 FT /product="2B protein" FT mat_peptide 4177..5157 FT /product="2C protein" FT mat_peptide 5158..5421 FT /product="3A protein" FT mat_peptide 5422..5481 FT /product="3B protein" FT mat_peptide 5482..6096 FT /product="3C protein" FT mat_peptide 6097..7476 FT /product="3D protein" FT 3'UTR 7480..7630 XX SQ Sequence 7630 BP; 2198 A; 1786 C; 1706 G; 1940 T; 0 other; cccccccctc cccctccccc ttattttcct ggtcgaaacc gctcggaata agaccggggt 60 cttgtaatgt ctaaatgtta cttctaccca accattgtct atgatggttg gagggctgta 120 gaacctagcc cttgcttctt gcagagaaat ccaagtggtc tttccactct cgacaatggg 180 tttcatggct cgccaaaagt tgtgaagaaa gcaagtccta tggaagcttt ctgacgaccg 240 atgatgtctg tagcgaccct ttgcaggcag cggaatcccc cacctggtaa caggtgcctc 300 tgcggccaaa agccacgtgt ttaacagaca cctgcaaagg cggcacaacc ccagtgccac 360 atcaagagtc tgatgactgt ggaaatagtc aactggcttt tcttaagcaa atttggtgtc 420 ggggctgaag gatgcccgga aggtaccaca ctggttgtga tctgatccgg ggcccatgtg 480 catgtgctat acacatgtag cctgggttaa aaaacgtcta ggccccccga accacgggga 540 cgtggttttc cttttgaaaa ccacaatgat aatatggcta caattatgga acaagagatt 600 tgcgctcata caatgacttt tgaagaatgt ccaaagtgct ctgccctgca atacagaaat 660 ggattctatc ttttaaaata tgatgaagaa tggtatcctg aggaattgct catagaagga 720 gaagatgatg tatttgatcc agaattggac atggaagtag tctttgaaac tcagggaaat 780 tctacctcat cagacaaaaa caactccagt tctgaaggaa atgaaggtgt aattataaac 840 aatttctatt ccaaccagta ccagaattcc attgatctct cagccaatgc cactggaaat 900 aaccccccta aaacttatgg gcagttttca aatttattgt ctggtgcagt gaatgccttc 960 accaacatgc tacctctcct gaatgaccaa aatacagaag aaatggagaa tctttcagac 1020 agagtggctc aagatacagc cggaaatacg gtcacaaaca cacaatctac agttggccga 1080 cttctaggct acggggtttc acacaatgga gaacatcctg cctcttgtgc cgatactgcc 1140 tcagagaaaa tactggcagt tgaaagatac tacacattta aggtaacaga ctggacttca 1200 acccagaagg cctttgaata tataaggatc cctttgcctc atgtattgtc tggtgaaagt 1260 ggaggagtat ttggtgcagc tttacgcaga cactacctgg tgaagacagg atggagagtt 1320 caaatccagt gtaatgcctc acagtttcac gctggaagcc ttttagtttt catggcacca 1380 gaatacccga ccctggatgc ttttgtgatg gacaaccgtt ggtccaaaga caatcttccg 1440 aatggaacaa aaactcagac caataagaaa ggaccctttg ggatggacca ccaaaattat 1500 tggcaatgga ccttgtaccc acatcaattc ttgaatttgc gaacaaacac cacagttgat 1560 ctggaagtcc cttatgtcaa cattgcccct acttcatctt ggacacaaca tgccagctgg 1620 actctggtta tcgctgtggt ggctcccctg acatactcca ccggagcttc cacatccctc 1680 gatatcaccg cctcgataca acctgtccga ccggtgttca atggattgcg acatgaaact 1740 ctccagacac agtcccccat tccggtgact atccgggagc atgctggcac ctggtactct 1800 acactccctg ataccactgt ccctatatat ggtaaaactc ccgttgcacc ctctaactat 1860 atggtaggag aatatacaga cttcctggag attgcccaga taccaacatt tataggaaac 1920 aaaataccta atgcagtacc atatattgag gctacgaaca cagtagtaaa aacaaaccca 1980 ttggctacct atcaagtaac attgtcatgt acatgcctgg ccaacacttt cctggctgca 2040 ctatccagaa actttgccca gtacagaggc tcaatggttt atacttttgt cttcactggt 2100 accgcaatga tgaaaggaaa attcttgatt gcctatacac cccccggtgc tggaaagccc 2160 accactaggg atcaggctat gcaagcaact tacgctatct gggatttggg tttaaattct 2220 acttactctt ttactgtgcc ttttatatct ccctcacatt ttagaatggt tggaacagat 2280 caagtcaaca tcacaaatgt tgatggatgg gtcacagttt ggcaactgac acctctaacg 2340 tacccacctg gctgtccgaa cactgcaaaa atactcacca tggttagcgc cgggaaagac 2400 ttcactgtca aaatgcctat ttcccctgcc ccatggagtc cacagggaat agaaaatgca 2460 gaaaaaggtg tgactgaaaa tacagatgct actgcagatt ttgtagccca gcctgtttac 2520 ctgcctgaaa accagactaa agtaaacttc ttctacgatc gatacagtcc gattggcgct 2580 ttttccgtaa aaaatggaac catggagggt gcttttacgc cctttgcaag tgatttctgt 2640 ccaaactcag ttattttgac accaggacca cagtatgacc ccaacacccc ccaggcgcga 2700 ccccagcggc tcactgagat ttggggaaat ggcaatgaag acactagtag tgtcttccct 2760 ctcaaaacaa aacaggacta ctcattttgt ctcttttccc cctttgtgta ttataagtgt 2820 gatcttgaag tgacgattag tccccataca tctggcaatc atggcttagc tgtacgttgg 2880 gctccaacag gaacaccaac aaagccgact acccaggtgt tgcatgctgt gagttcactt 2940 tctgagggac gtactcccaa aatgtacagc gctggacccg gaacctcaaa ccatatatca 3000 tttgttgtac catacaactc acctctgtca gtcttgcccg ctgtctggta taatggacac 3060 aaaaaatttg acaatacagg cagcttgggc atagccccaa attcagactt cggtacttta 3120 ttctttgccg ggaccaagcc cgatgtgaag tttacagtgt acctgagata taagaatatg 3180 aaggtatttt gtccgagacc tactgttttc tttccttggc cctctgttgg ggacaaggtg 3240 gacatgaccc cccgagctgg tgttctgatg ctcgagagcc cacctttcct gcagagagct 3300 gcaaacccac ttgacatctt tcagaccttc cctgttctcc acatcctgct tgaatttaac 3360 catagaggga ttgaagcaag gctctttaga tatgggcagt attgggcatg ctgttatgca 3420 gaagttgttc tcagatcaag agcaaaacag atagctttct tgacaaaggg ttctacaagt 3480 gattgtgact ccgcagctga atggaacccg tggaagagaa cctaccatgc catactcagg 3540 gctgaaccac atcgagtcac cttggacatt taccacaaaa gaatcaaacc ctttaagatg 3600 cccctagtgc agaaagaatg gagaactcat gaagaaaaca tcttccagct ttggagactc 3660 ttcgatcagc actacgcagg ctatttctct gacctgctta ttcatgatgt tgagctaaac 3720 ccaggcccat tcatgttcaa gcccaagaaa caggtttttc agacacaagg agcggcgctg 3780 accaccatgg ccaataccct ggcgccgagc aacattgcca atcaagcact aggatcagct 3840 ttttcggctt tgctagacgc caacgaggac gcccaaaaag caatgaagat tatgaagaca 3900 ttaagttctc tgtcggatgc atgggaaaat gtaaaagata ctctgcaaaa tcaggagttt 3960 tggaagcagc ttcttacaag atgtgtgcag gtgattgcag gaatgacaat tgcagtgatg 4020 catccagacc ctctgacact gctgtgtcta ggagtcatga caaccgcgga ggtaaccagc 4080 caaaccaacc tttgcgaaga aatagtctct aaatttaaaa acatttttag aactccaccc 4140 cctaagtttc caggaatctc attgtttcag cagcaatccc ctcctctgaa gaacgtaaat 4200 gatgtatttt ctctggcaaa gaatcttgat tgggctgtga gaacagtgga aaaaattgtg 4260 tcgtggtttg gagactgggt attgcaggaa gaaaaagaac agactttaga tgaaatgctg 4320 acccgctttc cagaacacgc aaagagaatc tctgacctca gaaatggaat ggctgcttat 4380 gttgagtgta aggaaagttt tgatttcttt gagaggcttt ataatcaggc tgttaaggag 4440 aaaagaactg gcattgccgc tgtgtgtgag aagttcagac agaagcatga tcatgctaca 4500 gcgaggtgtg aaccagttgt cattgtcctc cgcggagatg caggacaggg aaagtccctc 4560 tctagtcaga ttattgccca ggctgtttca aagacaatct ttggccgcca gtcagtttat 4620 tctcttcctc cagattctga tttttttgat ggttatgaaa atcagtttgc agctataatg 4680 gatgatctag gtcagaatcc tgatggctcg gatttcacaa ctttctgtca gatggtgtct 4740 actactaact ttcttcctaa tatggccagt cttgagagaa agggaactcc gtttacttct 4800 cagcttgtgg tcgcaactac taaccttccc gagtttagac ctgttactat tgcccattat 4860 cccgctgtag agagaagaat cacttttgac tactcggtct cggctgggcc catgtgttcg 4920 cgtactgagg ctggacagaa agtgctggat gtcgagagag ccttcagacc aacaggggaa 4980 gaagcccctc tcccgtgctt tcaatcagac tgtctctttc ttaacaaagc tggaatccag 5040 ttcagagaca acagaacaaa ggaaatcatt tccctggttg aagtcattga gagagccgtt 5100 gctaaaattg agaggaaaaa gaaagtgctc acgactgtac agacccttgt tgctcaagcc 5160 cctgtagatg aagttaactt ccactctgta gtccagcagc tcagagcccg ccaggaagcc 5220 acagacgaac agcttgagga actccaggag gcctttgcta agacgcaaga gagatcatcg 5280 atcttttcag attggatgaa aatttcagca atggtgtgtg cagcaactct ggccctatcg 5340 caggtagtca agatggccag aacagttaaa caaattttca aaccagacct ggttagggtt 5400 caggtagatg aaaatgaaca gggtccttac aacgagagga ccaggcttcc accaaaaacc 5460 ctgcaactgc tggatgttca gggacctaac cagacgatgg attttgagaa atatgttgca 5520 aaaaatgtga cagcccccat agagttcatc tatcctacag gagtcagaat ccaaacttgt 5580 ctactcataa agaatagagt acttgctgtg aacaggcaca tggtggaaac ggactgggaa 5640 gcgatacagg taagaggagt ggtgcaccgt agagaggcag tgaaaatcct tgccatagct 5700 aagacaggaa aagataccga tgtcaccttc ttgaaactta attcaggacc attattcaaa 5760 gataatgtaa agaagtttgt gtctgctaag gatgttatgc cccaatcttc aagtccattg 5820 attggaatca tgaattcaga aattcctatg atgtacacag gcagcttcct gaaagcaggg 5880 gtctctgtcc cagtggagac ggggaacacc ttcagccact gcattcacta caaggcaaac 5940 acaaagaaag gttggtgtgg atctgctgtc atctctgacc taggtggcca aaagaagatt 6000 gtaggaatgc attcagccgg gtccatgggc attgcggcgg catcaatgat ctcacaagag 6060 atgattggtg ctgtgctcaa tgtatttgaa ccccaaggag ctcttgagca gctaccggat 6120 ggtccccgca ttcatgtgcc aaggaaaaca gccctgcgcc ccaccgtggc caaacaagtt 6180 ttccaaccag attttgcccc agcagtgttg tctaaattcg atcctagaac tgaggctgat 6240 gtagatacag tggccttctc aaagcacact tcaaaccagg aaacactccc tccagtgttt 6300 agaatggtgg caaaagaata tgcaaacaga gttttttcgc ttcttggcaa agacaacggg 6360 aagatatcag tgaagcaagc attggaggga atggaaggaa tggaccccat ggacaggaat 6420 acttccccag gccttccgta cacgtcgcta ggaatgcgac gtacagatgt agttgactgg 6480 gaaagcggaa ccctgattcc gtttgcttct gaaagactag agaacatgac taaaggagac 6540 ttctctggaa ttgtctacca aacattcctc aaagacgagc tcagaccaat ggaaaaagtc 6600 agagcagcca agactagaat agttgatgta ccaccttttg aacattgtat tctgggtaga 6660 cagcttctgg ggaaattcgc atcaaagttc cagacccagc cgggtctgga gcttggatca 6720 gcaattggct gtgacccaga cgtgcactgg actaaatttg gtgtggcaat gcagtccttt 6780 cagagagtct atgatgttga ctactcaaac tttgattcaa cccactcagt tgcaatgttt 6840 cgtctccttg ctgaggagtt ttttacccct gagaatggat ttgacccact ggtatctcaa 6900 taccttgact cacttgccat ctcaacgcat gcatttgagg agaagcgcta tctcataacc 6960 ggaggtcttc cttccggttg tgctgcgacc tcaatgctca ataccattat gaacaatatt 7020 ataattaggg ctggtttgta cctcacttat aagaactttg aatttgatga tatacaggta 7080 ctgtcatatg gagatgatct cctggtggct acagattatc aattggattt tgatagggtg 7140 aaggcaagcc tagcaaagac agggtacaaa attacacccg ctaacaaaac ttctagcttt 7200 cctcttgaat caacactaga tgatgtagtt ttccttaaga gaaaatttaa gagagagggc 7260 cccttgtatc gtcctgtcat gaacaaggag gcgctagagg ctatgttgtc atactaccgt 7320 ccaggttccc tggcggagaa actcacctca gtgaccatgc tcgccgtcca ctccggaaag 7380 caagagtacg accgtctctt tgcccctttc cgtgaagttg gtatcatggt accacaatat 7440 gagagtgtgg agtaccgctg gagaagtctg ttctggtagt agcgcggaca atggcacaac 7500 gctttacccg ggaagccact cgggtgtacg cggtcgctat tccgcagaca gggtagtttc 7560 tactttgcaa gatagactag agtagtaaaa taaatagttt aagaaaaaaa aaaaaaaaaa 7620 aaaaaaaaaa 7630 // ID KC310738; SV 1; linear; genomic RNA; STD; VRL; 7634 BP. XX AC KC310738; XX DT 03-SEP-2013 (Rel. 118, Created) DT 03-SEP-2013 (Rel. 118, Last updated, Version 1) XX DE Encephalomyocarditis virus isolate Sing-M105-02, partial genome. XX KW . XX OS Encephalomyocarditis virus OC Viruses; Riboviria; Picornavirales; Picornaviridae; Cardiovirus. XX RN [1] RC Publication Status: Online-Only RP 1-7634 RX PUBMED; 23914943. RA Yeo D.S., Lian J.E., Fernandez C.J., Lin Y.N., Liaw J.C., Soh M.L., RA Lim E.A., Chan K.P., Ng M.L., Tan H.C., Oh S., Ooi E.E., Tan B.H.; RT "A highly divergent Encephalomyocarditis virus isolated from nonhuman RT primates in Singapore"; RL Virol J 10(1):248-248(2013). XX RN [2] RP 1-7634 RA Yeo D.S., Lian J.E., Fernandez C.J., Lin Y.-N., Tan B.H.; RT ; RL Submitted (09-DEC-2012) to the INSDC. RL Detection & Diagnostics Laboratory, DMERI @ DSO National Laboratories RL Singapore, 27 Medical Drive 13-00, Singapore 117510, Singapore XX DR MD5; f76c0d40ad42d48f940a7525bc148553. DR EuropePMC; PMC3750836; 23914943. XX CC ##Assembly-Data-START## CC Assembly Method :: SeqMan DNASTAR v. Lasergene 8 CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..7634 FT /organism="Encephalomyocarditis virus" FT /host="orangutan" FT /isolate="Sing-M105-02" FT /mol_type="genomic RNA" FT /country="Singapore" FT /isolation_source="heart" FT /collection_date="2002" FT /db_xref="taxon:12104" FT 5'UTR <1..578 FT CDS 579..7484 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:T1WMM5" FT /db_xref="InterPro:IPR000199" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001676" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR015031" FT /db_xref="InterPro:IPR021573" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="InterPro:IPR037080" FT /db_xref="InterPro:IPR037243" FT /db_xref="UniProtKB/TrEMBL:T1WMM5" FT /protein_id="AGU38152.1" FT /translation="MATIMEQEICAHTMTFEECPKCSALQYRNGFYLLKYDEEWYPEEL FT LIEGEDDVFDPELDMEVVFETQGNSTSSDKNNSSSEGNEGVIINNFYSNQYQNSIDLSA FT NATGNNPPKTYGQFSNLLSGAVNAFTNMLPLLNDQNTEEMENLSDRVAQDTAGNTVTNT FT QSTVGRLLGYGVSHNGEHPASCADTASEKILAVERYYTFKVTDWTSTQKAFEYIRIPLP FT HVLSGESGGVFGAALRRHYLVKTGWRVQIQCNASQVHAGSLLVFMAPEYPTLDAFVMDN FT RWSKDNLPNGTKTQTNKKGPFGMDHQNYWQWTLYPHQFLNLRTNTTVDLEVPYVNIAPT FT SSWTQHASWTLVIAVVAPLTYSTGASTSLDITASIQPVRPVFNGLRHETLQTQSPIPVT FT IREHAGTWYSTLPDTTVPIYGKTPVAPSNYMVGEYTDFLEIAQIPTFIGNKIPNAVPYI FT EATNTVVKTNPLATYQVTLSCTCLANTFLAALSRNFAQYRGSMVYTFVFTGTAMMKGKF FT LIAYTPPGAGKPTTRDQAMQATYAIWDLGLNSTYSFTVPFISPSHFRMVGTDQVNITNV FT DGWVTVWQLTPLTYPPGCPNTAKILTMVSAGKDFTVKMPISPAPWSPQGIENAEKGVTE FT NTDATADFVAQPVYLPENQTKVNFFYDRYSPIGAFSVKNGTMEGAFTPFASDFCPNSVI FT LTPGPQYDPNTPQARPQRLTEIWGNGNEDTSSVFPLKTKQDYSFCLFSPFVYYKCDLEV FT TISPHTSGNHGLAVRWSPTGTPTKPTTQVLHAVSSLSEGRTPKMYSAGPGTSNHISFVV FT PYNSPLSVLPAVWYNGHKKFDNTGSLGIAPNSDFGTLFFAGTKPDVKFTVYLRYKNMKV FT FCPRPTVFFPWPSVGDKVDMTPRAGVLMLESPPFLQRAANPLDIFQTFPVLHILLEFNH FT RGIEARLFRYGQYWACCYAEVVLRSRAKQIAFLTKGSTSDCDSAAEWNPWKRTYHAILR FT AEPHRVTLDIYHKRIKPFKMPLVQKEWRTHEENIFQLWRLFDQHYAGYFSDLLIHDVEL FT NPGPFMFKPKKQVFQTQGAALTTMANTLAPSNIANQALGSAFSALLDANEDAQKAMKIM FT KTLSSLSDAWENVKDTLQNQEFWKQLLTRCVQVIAGMTIAVMHPDPLTLLCLGVMTTAE FT VTSQTNLCEEIVSKFKNIFRTPPPKFPGISLFQQQSPPLKNVNDVFSLAKNLDWAVRTV FT EKIVSWFGDWVLQEEKEQTLDEMLTRFPEHAKRISDLRNGMAAYVECKESFDFFERLYN FT QAVKEKRTGIAAVCEKFRQKHDHATARCEPVVIVLRGDAGQGKSLSSQIIAQAVSKTIF FT GRQSVYSLPPDSDFFDGYENQFAAIMDDLGQNPDGSDFTTFCQMVSTTNFLPNMASLER FT KGTPFTSQLVVATTNLPEFRPVTIAHYPAVERRITFDYSVSAGPMCSRTEAGQKVLDVE FT RAFRPTGEEAPLPCFQSDCLFLNKAGIQFRDNRTKEIISLVEVIERAVAKIERKKKVLT FT TVQTLVAQAPVDEVNFHSVVQQLRARQEATDEQLEELQEAFAKTQERSSIFSDWMKISA FT MVCAATLALSQVVKMARTVKQIFKPDLVRVQVDENEQGPYNERTRLPPKTLQLLDVQGP FT NQTMDFEKYVAKNVTAPIEFIYPTGVRIQTCLLIKNRVLAVNRHMVETDWEAIQVRGVV FT HRREAVKILAIAKTGKDTDVTFLKLNSGPLFKDNVKKFVSAKDVMPQSSSPLIGIMNSE FT IPMMYTGSFLKAGVSVPVETGNTFSHCIHYKANTKKGWCGSAVISDLGGQKKIVGMHSA FT GSMGIAAASMISQEMIGAVLNVFEPQGALEQLPDGPRIHVPRKTALRPTVAKQVFQPDF FT APAVLSKFDPRTEADVDTVAFSKHTSNQETLPPVFRMVAKEYANRVFSLLGKDNGKISV FT KQALEGMEGMDPMDRNTSPGLPYTSLGMRRTDVVDWESGTLIPFASERLENMTKGDFSG FT IVYQTFLKDELRPMEKVRAAKTRIVDVPPFEHCILGRQLLGKFASKFQTQPGLELGSAI FT GCDPDVHWTKFGVAMQSFQRVYDVDYSNFDSTHSVAMFRLLAEEFFTPENGFDPLVSQY FT LDSLAISTHAFEEKRYLITGGLPSGCAATSMLNTIMNNIIIRAGLYLTYKNFEFDDIQV FT LSYGDDLLVATDYQLDFDRVKASLAKTGYKITPANKTSSFPLESTLDDVVFLKRKFKRE FT GPLYRPVMNKEALEAMLSYYRPGSLAEKLTSVTMLAVHSGKQEYDRLFAPFREVGIMVP FT QYESVEYRWRSLFW" FT mat_peptide 579..779 FT /product="leader protein" FT mat_peptide 780..989 FT /product="1A capsid protein" FT mat_peptide 990..1757 FT /product="1B capsid protein" FT mat_peptide 1758..2450 FT /product="1C capsid protein" FT mat_peptide 2451..3281 FT /product="1D capsid protein" FT mat_peptide 3282..3731 FT /product="2A protein" FT mat_peptide 3732..4181 FT /product="2B protein" FT mat_peptide 4182..5162 FT /product="2C protein" FT mat_peptide 5163..5426 FT /product="3A protein" FT mat_peptide 5427..5486 FT /product="3B protein" FT mat_peptide 5487..6101 FT /product="3C protein" FT mat_peptide 6102..7481 FT /product="3D protein" FT 3'UTR 7485..7634 XX SQ Sequence 7634 BP; 2196 A; 1792 C; 1707 G; 1939 T; 0 other; cccccccccc ccctccccct cccccttatt ttcctggtcg aaaccgctcg gaataagacc 60 ggggtcttgt aatgtctaaa tgttacttct acccaaccat tgtctatgat ggttggaggg 120 ctgtagaacc tagcccttgc ttcttgcaga gaaatccaag tggtctttcc actctcgaca 180 atgggtttca tggctcgcca aaagttgtga agaaagcaag tcctatggaa gctttctgac 240 gaccgatgat gtctgtagcg accctttgca ggcagcggaa tcccccacct ggtaacaggt 300 gcctctgcgg ccaaaagcca cgtgtttaac agacacctgc aaaggcggca caaccccagt 360 gccacatcaa gagtctgatg actgtggaaa tagtcaactg gcttttctta agcaaatttg 420 gtgtcggggc tgaaggatgc ccggaaggta ccacactggt tgtgatctga tccggggccc 480 atgtgcatgt gctatacaca tgtagcctgg gttaaaaaac gtctaggccc cccgaaccac 540 ggggacgtgg ttttcctttt gaaaaccaca atgataatat ggctacaatt atggaacaag 600 agatttgcgc tcatacaatg acttttgaag aatgtccaaa gtgctctgcc ctgcaataca 660 gaaatggatt ctatctttta aaatatgatg aagaatggta tcctgaggaa ttgctcatag 720 aaggagaaga tgatgtattt gatccagaat tggacatgga agtagtcttt gaaactcagg 780 gaaattctac ctcatcagac aaaaacaact ccagttctga aggaaatgaa ggtgtaatta 840 taaacaattt ctattccaac cagtaccaga attccattga tctctcagcc aatgccactg 900 gaaataaccc ccctaaaact tatgggcagt tttcaaattt attgtctggt gcagtgaatg 960 ccttcaccaa catgctacct ctcctgaatg accaaaatac agaagaaatg gagaatcttt 1020 cagacagagt ggctcaagat acagccggaa atacggtcac aaacacacaa tctacagttg 1080 gccgacttct aggctacggg gtttcacaca atggagaaca tcctgcctct tgtgccgata 1140 ctgcctcaga gaaaatactg gcagttgaaa gatactacac atttaaggta acagactgga 1200 cttcaaccca gaaggccttt gaatatataa ggatcccttt gcctcatgta ttgtctggtg 1260 aaagtggagg agtatttggt gcagctttac gcagacacta cctggtgaag acaggatgga 1320 gagttcaaat ccagtgtaat gcctcacagg ttcacgctgg aagcctttta gttttcatgg 1380 caccagaata cccgaccctg gatgcttttg tgatggacaa ccgttggtcc aaagacaatc 1440 ttccgaatgg aacaaaaact cagaccaata agaaaggacc ctttgggatg gaccaccaaa 1500 attattggca atggaccttg tacccacatc aattcttgaa tttgcgaaca aacaccacag 1560 ttgatctgga agtcccttat gtcaacattg cccctacttc atcttggaca caacatgcca 1620 gctggactct ggttatcgct gtggtggctc ccctgacata ctccaccgga gcttccacat 1680 ccctcgatat caccgcctcg atacaacctg tccgaccggt gttcaatgga ttgcgacatg 1740 aaactctcca gacacagtcc cccattccgg tgactatccg ggagcatgct ggcacctggt 1800 actctacact ccctgatacc actgtcccta tatatggtaa aactcccgtt gcaccctcta 1860 actatatggt aggagaatat acagacttcc tggagattgc ccagatacca acatttatag 1920 gaaacaaaat acctaatgca gtaccatata ttgaggctac gaacacagta gtaaaaacaa 1980 acccattggc tacctatcaa gtaacattgt catgtacatg cctggccaac actttcctgg 2040 ctgcactatc cagaaacttt gcccagtaca gaggctcaat ggtttatact tttgtcttca 2100 ctggtaccgc aatgatgaaa ggaaaattct tgattgccta tacacccccc ggtgctggaa 2160 agcccaccac tagggatcag gctatgcaag caacttacgc tatctgggat ttgggtttaa 2220 attctactta ctcttttact gtgcctttta tatctccctc acattttaga atggttggaa 2280 cagatcaagt caacatcaca aatgttgatg gatgggtcac agtttggcaa ctgacacctc 2340 taacgtaccc acctggctgt ccgaacactg caaaaatact caccatggtt agcgccggga 2400 aagacttcac tgtcaaaatg cctatttccc ctgccccatg gagtccacag ggaatagaaa 2460 atgcagaaaa aggtgtgact gaaaatacag atgctactgc agattttgta gcccagcctg 2520 tttacctgcc tgaaaaccag actaaagtaa acttcttcta cgatcgatac agtccgattg 2580 gcgctttttc cgtaaaaaat ggaaccatgg agggtgcttt tacgcccttt gcaagtgatt 2640 tctgtccaaa ctcagttatt ttgacaccag gaccacagta tgaccccaac accccccagg 2700 cgcgacccca gcggctcact gagatttggg gaaatggcaa tgaagacact agtagtgtct 2760 tccctctcaa aacaaaacag gactactcat tttgtctctt ttcccccttt gtgtattata 2820 agtgtgatct tgaagtgacg attagtcccc atacatctgg caatcatggc ttagctgtac 2880 gttggtctcc aacaggaaca ccaacaaagc cgactaccca ggtgttgcat gctgtgagtt 2940 cactttctga gggacgtact cccaaaatgt acagcgctgg acccggaacc tcaaaccata 3000 tatcatttgt tgtaccatac aactcacctc tgtcagtctt gcccgctgtc tggtataatg 3060 gacacaaaaa atttgacaat acaggcagct tgggcatagc cccaaattca gacttcggta 3120 ctttattctt tgccgggacc aagcccgatg tgaagtttac agtgtacctg agatataaga 3180 atatgaaggt attttgtccg agacctactg ttttctttcc ttggccctct gttggggaca 3240 aggtggacat gaccccccga gctggtgttc tgatgctcga gagcccacct ttcctgcaga 3300 gagctgcaaa cccacttgac atctttcaga ccttccctgt cctccacatc ctgcttgaat 3360 ttaaccatag agggattgaa gcaaggctct ttagatatgg gcagtattgg gcatgctgtt 3420 atgcagaagt tgttctcaga tcaagagcaa aacagatagc tttcttgaca aagggttcta 3480 caagtgattg tgactccgca gctgaatgga acccgtggaa gagaacctac catgccatac 3540 tcagggctga accacatcga gtcaccttgg acatttacca caaaagaatc aaacccttta 3600 agatgcccct agtgcagaaa gaatggagaa ctcatgaaga aaacatcttc cagctttgga 3660 gactcttcga tcagcactac gcaggctatt tctctgacct gcttattcat gatgttgagc 3720 taaacccagg cccattcatg ttcaagccca agaaacaggt ttttcagaca caaggagcgg 3780 cgctgaccac catggccaat accctggcgc cgagcaacat tgccaatcaa gcactaggat 3840 cagctttttc ggctttgcta gacgccaacg aggacgccca aaaagcaatg aagattatga 3900 agacattaag ttctctgtcg gatgcatggg aaaatgtaaa agatactctg caaaatcagg 3960 agttttggaa gcagcttctt acaagatgtg tgcaggtgat tgcaggaatg acaattgcag 4020 tgatgcatcc agaccctctg acactgctgt gtctaggagt catgacaacc gcggaggtaa 4080 ccagccaaac caacctttgc gaagaaatag tctctaaatt taaaaacatt tttagaactc 4140 caccccctaa gtttccagga atctcattgt ttcagcagca atcccctcct ctgaagaacg 4200 taaatgatgt attttctctg gcaaagaatc ttgattgggc tgtgagaaca gtggaaaaaa 4260 ttgtgtcgtg gtttggagac tgggtattgc aggaagaaaa ggaacagact ttagatgaaa 4320 tgctgacccg ctttccagaa cacgcaaaga gaatctctga cctcagaaat ggaatggctg 4380 cttatgttga gtgtaaggaa agttttgatt tctttgagag gctttataat caggctgtta 4440 aggagaaaag aactggcatt gccgctgtgt gtgagaagtt cagacagaag catgatcatg 4500 ctacagcgag gtgtgaacca gttgtcattg tcctccgcgg agatgcagga cagggaaagt 4560 ccctctctag tcagattatt gcccaggctg tttcaaagac aatctttggc cgccagtcag 4620 tttattctct tcctccagat tctgattttt ttgatggtta tgaaaatcag tttgcagcta 4680 taatggatga tctaggtcag aatcctgatg gctcggattt cacaactttc tgtcagatgg 4740 tgtctactac taactttctt cctaatatgg ccagtcttga gagaaaggga actccgttta 4800 cttctcagct tgtggtcgca actactaacc ttcccgagtt tagacctgtt actattgccc 4860 attatcccgc tgtagagaga agaatcactt ttgactactc ggtctcggct gggcccatgt 4920 gttcgcgtac tgaggctgga cagaaagtgc tggatgtcga gagagccttc agaccaacag 4980 gggaagaagc ccctctcccg tgctttcaat cagactgtct ctttcttaac aaagctggaa 5040 tccagttcag agacaacaga acaaaggaaa tcatttccct ggttgaagtc attgagagag 5100 ccgttgctaa aattgagagg aaaaagaaag tgctcacgac tgtacagacc cttgttgctc 5160 aagcccctgt agatgaagtt aacttccact ctgtagtcca gcagctcaga gcccgccagg 5220 aagccacaga cgaacagctt gaggaactcc aggaggcctt tgctaagacg caagagagat 5280 catcgatctt ttcagattgg atgaaaattt cagcaatggt gtgtgcagca actctggccc 5340 tatcgcaggt agtcaagatg gccagaacag ttaaacaaat tttcaaacca gacctggtta 5400 gggttcaggt agatgaaaat gaacagggtc cttacaacga gaggaccagg cttccaccaa 5460 aaaccctgca actgctggat gttcagggac ctaaccagac gatggatttt gagaaatatg 5520 ttgcaaaaaa tgtgacagcc cccatagagt tcatctatcc tacaggagtc agaatccaaa 5580 cttgtctact cataaagaat agagtacttg ctgtgaacag gcacatggtg gaaacggact 5640 gggaagcgat acaggtaaga ggagtggtgc accgtagaga ggcagtgaaa atccttgcca 5700 tagctaagac aggaaaagat accgatgtca ccttcttgaa acttaattca ggaccattat 5760 tcaaagataa tgtaaagaag tttgtgtctg ctaaggatgt tatgccccaa tcttcaagtc 5820 cattgattgg aatcatgaat tcagaaattc ctatgatgta cacaggcagc ttcctgaaag 5880 caggggtctc tgtcccagtg gagacgggga acaccttcag ccactgcatt cactacaagg 5940 caaacacaaa gaaaggttgg tgtggatctg ctgtcatctc tgacctaggt ggccaaaaga 6000 agattgtagg aatgcattca gccgggtcca tgggcattgc ggcggcatca atgatctcac 6060 aagagatgat tggtgctgtg ctcaatgtat ttgaacccca aggagctctt gagcagctac 6120 cggatggtcc ccgcattcat gtgccaagga aaacagccct gcgccccacc gtggccaaac 6180 aagttttcca accagatttt gccccagcag tgttgtctaa attcgatcct agaactgagg 6240 ctgatgtaga tacagtggcc ttctcaaagc acacttcaaa ccaggaaaca ctccctccag 6300 tgtttagaat ggtggcaaaa gaatatgcaa acagagtttt ttcgcttctt ggcaaagaca 6360 acgggaagat atcagtgaag caagcattgg agggaatgga aggaatggac cccatggaca 6420 ggaatacttc cccaggcctt ccgtacacgt cgctaggaat gcgacgtaca gatgtagttg 6480 actgggaaag cggaaccctg attccgtttg cttctgaaag actagagaac atgactaaag 6540 gagacttctc tggaattgtc taccaaacat tcctcaaaga cgagctcaga ccaatggaaa 6600 aagtcagagc agccaagact agaatagttg atgtaccacc ttttgaacat tgtattctgg 6660 gtagacagct tctggggaaa ttcgcatcaa agttccagac ccagccgggt ctggagcttg 6720 gatcagcaat tggctgtgac ccagacgtgc actggactaa atttggtgtg gcaatgcagt 6780 cctttcagag agtctatgat gttgactact caaactttga ttcaacccac tcagttgcaa 6840 tgtttcgtct ccttgctgag gagtttttta cccctgagaa tggatttgac ccactggtat 6900 ctcaatacct tgactcactt gccatctcaa cgcatgcatt tgaggagaag cgctatctca 6960 taaccggagg tcttccttcc ggttgtgctg cgacctcaat gctcaatacc attatgaaca 7020 atattataat tagggctggt ttgtacctca cttataagaa ctttgaattt gatgatatac 7080 aggtactgtc atatggagat gatctcctgg tggctacaga ttatcaattg gattttgata 7140 gggtgaaggc aagcctagca aagacagggt acaaaattac acccgctaac aaaacttcta 7200 gctttcctct tgaatcaaca ctagatgatg tagttttcct taagagaaaa tttaagagag 7260 agggcccctt gtatcgtcct gtcatgaaca aggaggcgct agaggctatg ttgtcatact 7320 accgtccagg ttccctggcg gagaaactca cctcagtgac catgctcgcc gtccactccg 7380 gaaagcaaga gtacgaccgt ctctttgccc ctttccgtga agttggtatc atggtaccac 7440 aatatgagag tgtggagtac cgctggagaa gtctgttctg gtagtagcgc ggacaatggc 7500 acaacgcttt acccgggaag ccactcgggt gtacgcggtc gctattccgc agacagggta 7560 gtttctactt tgcaagatag actagagtag taaaataaat agtttaagaa aaaaaaaaaa 7620 aaaaaaaaaa aaaa 7634 // ID KC311375; SV 1; linear; genomic RNA; STD; VRL; 774 BP. XX AC KC311375; XX DT 21-MAR-2013 (Rel. 116, Created) DT 30-JUL-2013 (Rel. 117, Last updated, Version 2) XX DE Tomato chlorosis virus isolate BJ coat protein (CP) gene, complete cds. XX KW . XX OS Tomato chlorosis virus OC Viruses; Riboviria; Closteroviridae; Crinivirus. XX RN [1] RP 1-774 RX DOI; .1094/PDIS-12-12-1163-PDN. RX PUBMED; 30722472. RA Zhao R.N., Wang R., Wang N., Fan Z.F., Zhou T., Shi Y.C., Chai M.; RT "First Report of Tomato chlorosis virus in China"; RL Plant Dis. 97(8):1123-1123(2013). XX RN [2] RP 1-774 RA Zhao R.N., Wang R., Wang N., Zhou T.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL Department of Plant Pathology, China Agricultural University, No. 2 RL Yuanmingyuan West Road, Beijing, Beijing 100193, China XX DR MD5; e0e0aa33d3a53a77220cc1d43a9033ea. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..774 FT /organism="Tomato chlorosis virus" FT /host="tomato" FT /isolate="BJ" FT /mol_type="genomic RNA" FT /country="China" FT /collection_date="26-Oct-2012" FT /db_xref="taxon:67754" FT gene 1..774 FT /gene="CP" FT CDS 1..774 FT /codon_start=1 FT /gene="CP" FT /product="coat protein" FT /db_xref="GOA:M4Q8W1" FT /db_xref="InterPro:IPR002679" FT /db_xref="UniProtKB/TrEMBL:M4Q8W1" FT /protein_id="AGH20649.1" FT /translation="MENSAVANTGDNGGDRNPLVRPLDDGVDDEVQNLGRRDDSTSLIP FT ANPNRSSSWALLNPDTINYNELRKLKVHSTRGDTLTLTQEEEFEKILESFCRRIIGETP FT MTDKIFAGFYMSMCQAIVNQGTSVKAAGNNSLENYFEVDGARFKWKTPDLINEVRPKMS FT DVPNAIRRYARSHEKIIQDFINSGLIKPDYHLQFKHGVLPSHVFGTGDYINGSLMNISD FT DQLISNLLMKRNALCKGNEGKELYNVNQLASITGC" XX SQ Sequence 774 BP; 229 A; 151 C; 186 G; 208 T; 0 other; atggagaaca gtgctgttgc aaacactggt gataacggtg gtgaccgcaa tcctctggtt 60 agaccgttag atgatggcgt agatgacgag gtgcagaact tgggcaggag ggacgattcg 120 acatctctca ttccggctaa tcctaatcga tcttccagtt gggctttgtt gaacccggat 180 actattaatt ataacgagtt aaggaaattg aaggtacact ccactagggg tgatactctc 240 accttgactc aggaagagga gttcgagaag atactcgaat ccttttgcag gcgaataatc 300 ggtgagaccc cgatgacgga taagattttc gctggtttct acatgtctat gtgtcaggcc 360 attgtaaacc aagggacctc agttaaagca gccggtaata acagtcttga aaactacttt 420 gaggtagatg gtgcgagatt taagtggaaa actccggatt tgataaatga ggttagaccc 480 aaaatgtccg atgttccaaa cgccatacgt cggtacgcca gaagtcatga aaagattatt 540 caggacttta tcaactccgg tcttattaag cctgattatc atttacaatt caaacatggc 600 gtattaccaa gccatgtgtt tggtaccggc gattatataa atggttcgtt gatgaatatc 660 tcagatgatc aacttatctc gaacctgctt atgaaaagaa acgctttgtg caagggtaac 720 gagggcaagg aactgtacaa cgttaaccaa cttgcatcga taactggttg ctaa 774 // ID KC311433; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC311433; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Ankara/03/2010(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Ankara/03/2010(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; 8f26aa1264432352c031ee2a3feee53a. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus (A/Ankara/03/2010(H1N1))" FT /segment="4" FT /host="Homo sapiens; male" FT /strain="A/Ankara/03/2010" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Dec-2010" FT /db_xref="taxon:1268675" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HPH6" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HPH6" FT /protein_id="AGB08404.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSK FT YSKKFKPEIAVRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLLVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLIVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1701 BP; 606 A; 315 C; 372 G; 408 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctgagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttacccagga gatttcatca attatgaaga gctaagagag 360 caattgagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta acgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaaaataca gcaagaagtt caagccggaa atagcagtaa gacccaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctattggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacag gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacagga 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaaa 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aaactggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactga tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta a 1701 // ID KC311434; SV 1; linear; viral cRNA; STD; VRL; 1728 BP. XX AC KC311434; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Ankara/04/2010(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Ankara/04/2010(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1728 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; b22e95a0e79ea28911cc575aa320727d. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1728 FT /organism="Influenza A virus (A/Ankara/04/2010(H1N1))" FT /segment="4" FT /host="Homo sapiens; male" FT /strain="A/Ankara/04/2010" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Dec-2010" FT /db_xref="taxon:1268676" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HTC0" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HTC0" FT /protein_id="AGB08405.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSK FT YSKKFKPEIAVRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLLVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLIVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1728 BP; 617 A; 318 C; 379 G; 414 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctgagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttacccagga gatttcatca attatgaaga gctaagagag 360 caattgagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta acgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaaaataca gcaagaagtt caagccggaa atagcagtaa ggcccaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctattggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacag gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacagga 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaaa 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aaactggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactga tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta acattaggat ttcagaagca tgagaaaa 1728 // ID KC311435; SV 1; linear; viral cRNA; STD; VRL; 1733 BP. XX AC KC311435; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Cankiri/01/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Cankiri/01/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1733 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; eef6056895e0d327c3630a12a7b3aab7. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1733 FT /organism="Influenza A virus (A/Cankiri/01/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; male" FT /strain="A/Cankiri/01/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Jan-2011" FT /db_xref="taxon:1268680" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HU42" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HU42" FT /protein_id="AGB08406.1" FT /translation="MKAILVVLLYTFATANADTLCRGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQWSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSK FT YSKKFKPEIAVRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLLVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1733 BP; 617 A; 319 C; 384 G; 413 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtagagggt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctgagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttacccagga gatttcatca attatgaaga actaagagag 360 caatggagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta acgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaaaataca gcaagaagtt caagccggaa atagcagtaa gacctaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacatttgaa 780 gcaactggaa atctattggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacgg gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacaggg 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaaa 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aagctggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta acattaggat ttcagaagca tgagaaaaac acc 1733 // ID KC311436; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC311436; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Ankara/02/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Ankara/02/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; 82ed5a99d3ec4818184d489f47402ddc. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus (A/Ankara/02/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; male" FT /strain="A/Ankara/02/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Jan-2011" FT /db_xref="taxon:1268674" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HRE2" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HRE2" FT /protein_id="AGB08407.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSK FT YSKKFKPEIAVRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLLVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1701 BP; 601 A; 314 C; 377 G; 409 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctgagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttacccagga gatttcatca attatgaaga gctaagagag 360 caattgagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaagggaatt catacccaaa gctcagcaaa 540 tcctacatta acgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaaaataca gcaagaagtt caagccggaa atagcagtaa gacctaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctattggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacgg gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacaggg 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaaa 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aagctggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta a 1701 // ID KC311437; SV 1; linear; viral cRNA; STD; VRL; 1732 BP. XX AC KC311437; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Tokat/03/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Tokat/03/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1732 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; 489eaca0b776837bb744febff9f48586. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1732 FT /organism="Influenza A virus (A/Tokat/03/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; male" FT /strain="A/Tokat/03/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Jan-2011" FT /db_xref="taxon:1268686" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HS26" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HS26" FT /protein_id="AGB08408.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSK FT YSKKFKPEIAVRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLLVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1732 BP; 616 A; 319 C; 382 G; 415 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctgagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttacccagga gatttcatca attatgaaga gctaagagag 360 caattgagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta acgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaaaataca gcaagaagtt caagccggaa atagcagtaa gacctaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctattggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacgg gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacaggg 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaaa 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aagctggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta acattaggat ttcagaagca tgagaaaaac ac 1732 // ID KC311438; SV 1; linear; viral cRNA; STD; VRL; 1724 BP. XX AC KC311438; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Ankara/04/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Ankara/04/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1724 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; 57b31f187e04a6f88188586ce95f11b4. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1724 FT /organism="Influenza A virus (A/Ankara/04/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; female" FT /strain="A/Ankara/04/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Jan-2011" FT /db_xref="taxon:1268677" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HPI3" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HPI3" FT /protein_id="AGB08409.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSK FT YSKKFKPEIAVRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLLVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1724 BP; 610 A; 317 C; 381 G; 416 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctgagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttaccctgga gatttcatca attatgaaga gttaagagag 360 caattgagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta acgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaaaataca gcaagaagtt caagccggaa atagcagtaa gacccaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctattggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacag gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacaggg 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaaa 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aagctggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta acattaggat ttcagaagca tgag 1724 // ID KC311439; SV 1; linear; viral cRNA; STD; VRL; 1735 BP. XX AC KC311439; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Tekirdag/05/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Tekirdag/05/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1735 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; 52d76f5677ec39e28d9540c964031a1f. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1735 FT /organism="Influenza A virus (A/Tekirdag/05/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; male" FT /strain="A/Tekirdag/05/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Feb-2011" FT /db_xref="taxon:1268685" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HTC7" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HTC7" FT /protein_id="AGB08410.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSK FT YSKKFKPEIAVRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLLVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1735 BP; 618 A; 320 C; 382 G; 415 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctgagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttacccagga gatttcatca attatgaaga gttaagagag 360 caattgagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta acgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaaaataca gcaagaagtt caagccggaa atagcagtaa gacccaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctattggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacag gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacaggg 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaaa 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aagctggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta acaataggat ttcagaagca tgagaaaaac acctg 1735 // ID KC311440; SV 1; linear; viral cRNA; STD; VRL; 1732 BP. XX AC KC311440; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Samsun/06/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Samsun/06/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1732 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; 2cb11abfcd74931ef3dab5e0dc4a99b6. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1732 FT /organism="Influenza A virus (A/Samsun/06/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; male" FT /strain="A/Samsun/06/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Jan-2011" FT /db_xref="taxon:1268684" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HU46" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HU46" FT /protein_id="AGB08411.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSK FT YSKKFKPEIAVRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLLVPRYAFAMERNAGSG FT IIISDTPVHDCNATCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VENLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1732 BP; 615 A; 320 C; 383 G; 414 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctaagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttacccagga gatttcatca attatgaaga gctaagagag 360 caattgagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta acgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaaaataca gcaagaagtt caagccggaa atagcagtaa gacctaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctattggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaatg caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacgg gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacaggg 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtggaa 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aagctggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta acaataggat ttcagaagca tgagaaacac ac 1732 // ID KC311441; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC311441; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Kastamonu/07/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Kastamonu/07/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; 929ea9bc1611b1888e89df3868dfec45. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus (A/Kastamonu/07/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; female" FT /strain="A/Kastamonu/07/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Feb-2011" FT /db_xref="taxon:1268682" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HRE6" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HRE6" FT /protein_id="AGB08412.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTTADQQSLYQNADAYVFVGTSR FT YSKKFKPEIAIRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRNQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1701 BP; 599 A; 316 C; 379 G; 407 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctaagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttacccagga gatttcatca attatgagga gctgagagag 360 caattgagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta atgataaagg gaaagaagtc ctcgtgctgt ggggcattca ccatccatct 600 actactgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaagataca gcaagaagtt caagccggaa atagcaataa gacccaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctagtggt accgagatat gcattcgcaa tggagagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacag gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggcttca ttgaaggggg gtggacaggg 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaag 1380 aacttgtatg aaaaggtaag aaaccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aagctggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta a 1701 // ID KC311442; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC311442; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Trabzon/08/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Trabzon/08/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; b352a10a21f3bcd69e5b6d8edd722c4e. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus (A/Trabzon/08/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; female" FT /strain="A/Trabzon/08/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Jan-2011" FT /db_xref="taxon:1268687" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HS31" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HS31" FT /protein_id="AGB08413.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVDLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFIDYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLNKSYINDKGKEVLVLWGIHHPSTSTDQQSLYQNADAYVFVGTSR FT YSKKFKPEIAIRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFAMERNAGSG FT IIISDTPIHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDEITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1701 BP; 602 A; 315 C; 378 G; 406 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt tgaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctaagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agctcagaca atggaacgtg ttacccagga gatttcatcg attatgagga gctaagagag 360 caattgagct cagtgtcatc atttgaaagg tttgaaatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcaacaaa 540 tcctacatta atgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtactg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaagataca gcaagaagtt taagccggaa atagcaataa gacccaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctagtggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accaatccac gattgcaata caacttgtca gacacccaag 900 ggggctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaagtgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacag gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacagga 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac gagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggcaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaag 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgagt tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aagctggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta a 1701 // ID KC311443; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC311443; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Ankara/09/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Ankara/09/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; 30fc84c3f729744e409f1cf3a740f66f. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus (A/Ankara/09/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; female" FT /strain="A/Ankara/09/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Jan-2011" FT /db_xref="taxon:1268678" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HPI8" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HPI8" FT /protein_id="AGB08414.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSK FT YSKKFKPEIAVRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLLVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1701 BP; 603 A; 314 C; 375 G; 409 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctgagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttacccagga gatttcatca attatgaaga gttaagagag 360 caattgagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta acgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaaaataca gcaagaagtt caagccggaa atagcagtaa gacccaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctattggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacag gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacaggg 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaaa 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aagctggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta a 1701 // ID KC311444; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC311444; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Konya/10/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Konya/10/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; 003956e896af215d61af9b29ce579bca. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus (A/Konya/10/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; female" FT /strain="A/Konya/10/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Feb-2011" FT /db_xref="taxon:1268683" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HTD2" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HTD2" FT /protein_id="AGB08415.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTTADQQSLYQNADAYVFVGTSR FT YSKKFKPEIAIRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRNQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1701 BP; 602 A; 315 C; 376 G; 408 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctaagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttacccagga gatttcatca attatgagga gctaagagag 360 caattgagtt cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta atgataaagg gaaagaagtc ctcgtgctgt ggggcattca ccatccatct 600 actactgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaagataca gcaagaagtt caagccggaa atagcaataa gacccaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctagtggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacag gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggcttca ttgaaggggg gtggacaggg 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaag 1380 aacttgtatg aaaaggtaag aaaccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aagctagaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta a 1701 // ID KC311445; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC311445; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Antalya/11/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Antalya/11/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; a108f26f922d85688ff83f959d1d9c77. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus (A/Antalya/11/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; male" FT /strain="A/Antalya/11/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Feb-2011" FT /db_xref="taxon:1268679" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HU51" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HU51" FT /protein_id="AGB08416.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSK FT YSKKFKPEIAVRPKVRDQEGRMNYYWTLVEPGDKITFEATGNLLVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1701 BP; 601 A; 314 C; 376 G; 410 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctgagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttaccctgga gatttcatca attatgaaga gctaagagag 360 caattgagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta acgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaaaataca gcaagaagtt caagccggaa atagcagtaa gacctaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta gtagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctattggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacgg gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacaggg 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttctgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaaa 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagca aaattaaaca gagaagaaat agatggggta 1560 aagctggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta a 1701 // ID KC311446; SV 1; linear; viral cRNA; STD; VRL; 1701 BP. XX AC KC311446; XX DT 28-DEC-2012 (Rel. 115, Created) DT 28-DEC-2012 (Rel. 115, Last updated, Version 1) XX DE Influenza A virus (A/Izmir/12/2011(H1N1)) segment 4 hemagglutinin (HA) DE gene, complete cds. XX KW . XX OS Influenza A virus (A/Izmir/12/2011(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1701 RA Guldemir D., Durmaz R., Korukluoglu G., Kalaycioglu A.T., Altas B.A.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Molecular Microbiology Research and Application Laboratory, Public Health RL Agency of Turkey, Cemal Gursel Caddesi, Ankara 06100, Turkey XX DR MD5; 41f50b29398b7eb717ffb278e4cd313e. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..1701 FT /organism="Influenza A virus (A/Izmir/12/2011(H1N1))" FT /segment="4" FT /host="Homo sapiens; female" FT /strain="A/Izmir/12/2011" FT /serotype="H1N1" FT /mol_type="viral cRNA" FT /country="Turkey" FT /isolation_source="nasopharyngeal swab" FT /collection_date="Feb-2011" FT /db_xref="taxon:1268681" FT gene 1..1701 FT /gene="HA" FT CDS 1..1701 FT /codon_start=1 FT /gene="HA" FT /product="hemagglutinin" FT /db_xref="GOA:L0HRF0" FT /db_xref="InterPro:IPR000149" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR013828" FT /db_xref="UniProtKB/TrEMBL:L0HRF0" FT /protein_id="AGB08417.1" FT /translation="MKAILVVLLYTFATANADTLCIGYHANNSTDTVDTVLEKNVTVTH FT SVNLLEDKHNGKLCKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDN FT GTCYPGDFINYEELREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYK FT NLIWLVKKGNSYPKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSK FT YSKKFKPEIAVRPKVRDQEGRMNYYWTLLEPGDKITFEATGNLLVPRYAFAMERNAGSG FT IIISDTPVHDCNTTCQTPKGAINTSLPFQNIHPITIGKCPKYVKSTKLRLATGLRNVPS FT IQSRGLFGAIAGFIEGGWTGMVDGWYGYHHQNEQGSGYAADLKSTQNAIDKITNKVNSV FT IEKMNTQFTAVGKEFNHLEKRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDYHDSN FT VKNLYEKVRSQLKNNAKEIGNGCFEFYHKCDNTCMESVKNGTYDYPKYSEEAKLNREEI FT DGVKLESTRIYQILAIYSTVASSLVLVVSLGAISFWMCSNGSLQCRICI" XX SQ Sequence 1701 BP; 602 A; 315 C; 374 G; 410 T; 0 other; atgaaggcaa tactagtagt tctgctatat acatttgcaa ccgcaaatgc agacacatta 60 tgtataggtt atcatgcgaa caattcaaca gacactgtag acacagtact agaaaagaat 120 gtaacagtaa cacactctgt taaccttcta gaagacaagc ataacgggaa actatgcaaa 180 ctgagagggg tagccccatt gcatttgggt aaatgtaaca ttgctggctg gatcctggga 240 aatccagagt gtgaatcact ctccacagca agctcatggt cctacattgt ggaaacatct 300 agttcagaca atggaacgtg ttacccagga gatttcatca attatgaaga gttaagagag 360 caattgagct cagtgtcatc atttgaaagg tttgagatat tccccaagac aagttcatgg 420 cccaatcatg actcgaacaa aggtgtaacg gcagcatgtc ctcatgctgg agcaaaaagc 480 ttctacaaaa atttaatatg gctagttaaa aaaggaaatt catacccaaa gctcagcaaa 540 tcctacatta acgataaagg gaaagaagtc ctcgtgctat ggggcattca ccatccatct 600 actagtgctg accaacaaag tctctatcag aatgcagatg catatgtttt tgtggggaca 660 tcaaaataca gcaagaagtt caagccggaa atagcagtaa gacccaaagt gagggatcaa 720 gaagggagaa tgaactatta ctggacacta ttagagccgg gagacaaaat aacattcgaa 780 gcaactggaa atctattggt accgagatat gcattcgcaa tggaaagaaa tgctggatct 840 ggtattatca tttcagatac accagtccac gattgcaata caacttgtca gacacccaag 900 ggtgctataa acaccagcct cccatttcag aatatacatc cgatcacaat tggaaaatgt 960 ccaaaatatg taaaaagcac aaaattgaga ctggccacag gattgaggaa tgtcccgtct 1020 attcaatcta gaggcctatt tggggccatt gccggtttca ttgaaggggg gtggacaggg 1080 atggtagatg gatggtacgg ttatcaccat caaaatgagc aggggtcagg atatgcagcc 1140 gacctgaaga gcacacagaa tgccattgac aagattacta acaaagtaaa ttccgttatt 1200 gaaaagatga atacacagtt cacagcagta ggtaaagagt tcaaccacct ggaaaaaaga 1260 atagagaatt taaataaaaa agttgatgat ggtttcctgg acatttggac ttacaatgcc 1320 gaactgttgg ttctattgga aaatgaaaga actttggact accacgattc aaatgtgaaa 1380 aacttatatg aaaaggtaag aagccagtta aaaaacaatg ccaaggaaat tggaaacggc 1440 tgctttgaat tttaccacaa atgcgataac acgtgcatgg aaagtgtcaa aaatgggact 1500 tatgactacc caaaatactc agaggaagct aaattaaaca gagaagaaat agatggggta 1560 aagctggaat caacaaggat ttaccagatt ttggcgatct attcaactgt cgccagttca 1620 ttggtactgg tagtctccct gggggcaatc agtttctgga tgtgctctaa tgggtctcta 1680 cagtgtagaa tatgtattta a 1701 // ID KC311731; SV 2; circular; genomic DNA; STD; VRL; 7247 BP. XX AC KC311731; XX DT 03-FEB-2013 (Rel. 115, Created) DT 04-SEP-2013 (Rel. 118, Last updated, Version 3) XX DE Human papillomavirus strain Fi864, complete genome. XX KW . XX OS Human papillomavirus OC Viruses; Papillomaviridae; unclassified Papillomaviridae. XX RN [1] RC Publication Status: Online-Only RP 1-7247 RX PUBMED; 23516180. RA Phan T.G., Vo N.P., Aronen M., Jartti L., Jartti T., Delwart E.; RT "Novel human gammapapillomavirus species in a nasal swab"; RL Genome Announc 1(2):E0002213-E0002213(2013). XX RN [2] RP 1-7247 RA Delwart E.; RT ; RL Submitted (11-DEC-2012) to the INSDC. RL Virology, Blood Systems Research Institute, 270 Masonic Avenue, San RL Francisco, CA 94118, USA XX RN [3] RC Sequence update by submitter RP 1-7247 RA Delwart E.; RT ; RL Submitted (03-SEP-2013) to the INSDC. RL Virology, Blood Systems Research Institute, 270 Masonic Avenue, San RL Francisco, CA 94118, USA XX DR MD5; 34cce98777fae29915aae5025e727c9d. DR EuropePMC; PMC3593334; 23516180. DR EuropePMC; PMC3923884; 24551244. XX CC On Sep 3, 2013 this sequence version replaced gi:443909500. XX FH Key Location/Qualifiers FH FT source 1..7247 FT /organism="Human papillomavirus" FT /host="Homo sapiens" FT /strain="Fi864" FT /mol_type="genomic DNA" FT /country="Finland" FT /isolation_source="nasal swab" FT /collection_date="2008" FT /db_xref="taxon:10566" FT CDS 277..777 FT /codon_start=1 FT /product="E6" FT /db_xref="GOA:L7XGG1" FT /db_xref="InterPro:IPR001334" FT /db_xref="InterPro:IPR038575" FT /db_xref="UniProtKB/TrEMBL:L7XGG1" FT /protein_id="AGD80374.1" FT /translation="MIVFQPASVPIKDSIFLSVSLFLMAELCPTRLDEYCKVLGISFFD FT VSLKCVFCNCKLSLQDLASFVSKCLSLIWKNNECFASCTLCLRLSARYEREKYTQCIVK FT GCMLETLTATPLCELIIRCKYCYRKLDYVEKIDCCVGDLPFSLVRSQWRNCCRLCRYEN FT ERA" FT CDS 764..1060 FT /codon_start=1 FT /product="E7" FT /db_xref="GOA:L7XE66" FT /db_xref="InterPro:IPR000148" FT /db_xref="UniProtKB/TrEMBL:L7XE66" FT /protein_id="AGD80377.2" FT /translation="MRGPEIDVQDIELHLESLVLPQNLLSNESLSPDTEGQPEEVEQAP FT YRVDTCCWSCGTGVRICVFASRLAILTLQQLLTAELNLLCPSCSRIHFRHGRH" FT CDS 1047..2849 FT /codon_start=1 FT /product="E1" FT /db_xref="GOA:L7XGG5" FT /db_xref="InterPro:IPR001177" FT /db_xref="InterPro:IPR014000" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR016393" FT /db_xref="InterPro:IPR027417" FT /db_xref="UniProtKB/TrEMBL:L7XGG5" FT /protein_id="AGD80379.1" FT /translation="MADINKGTEHIKENNAWYITEAECIDSLDTLEDLFEGSTDGSDIS FT NLIDDAEYDQGNSLALLNAQLTDDCNTAVLELKRKFTTSPEQSVADLSPRLQAINISPQ FT RTSKRRLFYDSGIAEDEAENTYEKVDELQSSVTVVDDTCSTNLNLLNNHNHKIILFTKC FT KEMFGVSFTEVTRCFKSDKTCSEQWIVLAYCIRQELLEACKVQLQQHCEYFQMIENDFS FT ALLCVMFKSGKNRETVCKLMCTILACNENQLLLEPPRTRSPPVAMFFYQKCFGNACYKY FT GEFPGWIKKQVLLSHESAATAETFDLSQMIQFCYDNDLTDEPSIAYRYALQADCDPNAA FT AFLKHNNQAKFVKDACCMVKYYKRQEMREMSISQWIWKCCDDCDEIGEWKVIAQLFKYQ FT GVNFVAFLTALRYFLKGVPKRQCLVFYGPSDTGKSYFCNSLIQFFKGKVVSIMNRCSSF FT WLQPLLDSKMGFIDDCTYPGWQYLDTNMRGALDGNTVCIDAKYRAQTQLKLPPMLITTN FT VDLEKEDCFKYIRSRLQIFNFPHKFPLKEDGSVVYEITNKTWKCFFSKLGNQIDLSPKE FT DTQDESGRSDRTFRCTAGQTNDSL" FT CDS 2785..3981 FT /codon_start=1 FT /product="E2" FT /db_xref="GOA:L7XDX8" FT /db_xref="InterPro:IPR000427" FT /db_xref="InterPro:IPR001866" FT /db_xref="InterPro:IPR012677" FT /db_xref="InterPro:IPR033668" FT /db_xref="InterPro:IPR035975" FT /db_xref="InterPro:IPR036050" FT /db_xref="InterPro:IPR042503" FT /db_xref="InterPro:IPR042504" FT /db_xref="UniProtKB/TrEMBL:L7XDX8" FT /protein_id="AGD80375.1" FT /translation="MNQADLTERFDALQDRLMTLYESAPNTIDDQIEIWEIIRKEYVYY FT YYARKEGYKHFGLQPIPALSVSEYKAKEAIQQVLLLKSLKQSPYGREEWTLTNTSAELT FT HTQPKNAFKKNPYIVDVHFDHKADNSFPYTNWDNLYIQDDDDEWYKTPGLVDINGLYFE FT DKYGVKNYFVIFATDAQTYGTTGEWTVYYKNQTISTSSASTSQASLFGSLQGSSRGVVS FT SSRDAVPIPQTPRGQKSEEGRASSTTETPPPALRRRRRRPADQQGEPSTTRQKRRRLEK FT DTAPVSPGQVGSGHLTVPRRNLSRLERLEAEAKDPPIILVSGAANQLKCWRWRCKKAKV FT PCQCISTVFSWAGNSSDNCISNHKMLIAFQSREQRELFRATVKFPKDTTFSYGNLNAL" FT CDS 3317..3739 FT /codon_start=1 FT /product="E4" FT /db_xref="UniProtKB/TrEMBL:L7XD04" FT /protein_id="AGD80378.1" FT /translation="MHRLMVPQENGLYIIKIKLFLLLLLALLKPRSSDLFKGPPGELSA FT PPGTPYPSRKHPEDKKAKREELAQPPRHHHLRYDEDDDDPQTNKENLPPRDKSDADWRK FT TLLQSLLDKLEADILQCQEEIFQDLSDLKQRLRIPQ" FT CDS 3987..5546 FT /codon_start=1 FT /product="L2" FT /db_xref="GOA:L7XDY2" FT /db_xref="InterPro:IPR000784" FT /db_xref="UniProtKB/TrEMBL:L7XDY2" FT /protein_id="AGD80380.1" FT /translation="MLSAKRVKRDSPENIYRHCKVTGTCPPDVENKIENKTWADVLLQA FT FSSIIFLGNLGIGTGKGSLSGGVRPLPGGRVVPESIAPTPTPARPSLTRPSVTRPTRPF FT SVPIDTIGIGSRPIDPIGRRPIDVIDPSSPAVVTLSEASPDTVITIGEGTVPELEVITD FT TSSIASHPTVFQSPDNGVAILNVTPGNPPPTKVYFSVEIQNPVFETSIGHVEPTYDVHV FT NPFITTETITLGEEIPLEPINPRSEFEIEDTPKTSTPVESIQKAFSKVKTFYKKTVQQV FT PTRNQNLLGDVSRAIEFGFENPAFDPEVSLQFQEDVNEVRAAPDPDFTGIQKISRPFLT FT ATEEGKVRVSRLGSRAGIRTRSGTVIGQDVHLYYDISTIEEIELPTISSATTTSMAEPS FT TTETFIGASQSLPATVSDNELLDTFAESFTNAQLVLPVVEEEEDIALHPFILSNTFARP FT VVVDIGSGYFYSPENNSKPNVNPAIPTIPLTPGISINVYSTDFILHPSLLKKRKRKRSD FT SF" FT CDS 5557..7110 FT /codon_start=1 FT /product="L1" FT /db_xref="GOA:L7X8U7" FT /db_xref="InterPro:IPR002210" FT /db_xref="InterPro:IPR011222" FT /db_xref="InterPro:IPR036973" FT /db_xref="UniProtKB/TrEMBL:L7X8U7" FT /protein_id="AGD80376.1" FT /translation="MSWTQSGTLYLPPQKPVAKIYNTDDYVEGTGYYFHAGTDRLLLVG FT HPYFDITDSNDPTKIVVPKVSANQYRVLRLDFPDPNKFAIADTCVYNPETERLVWKLVG FT FQMDRGGPLGIGATGHPYFNKYTDVENPTGYPAKQDANADYRVDMAFDPKQVQICIVGC FT TPPVGQYWDTTKFCADHRKNNGDCPPIELMHTIIQDGDMCEIGFGNANFENISQDRAGV FT PLELSNEISIWPDFVKMSKDKYGDQMFFCAKKEQLYARHYLAKAGIDGDDLPTNSYWNP FT QNNTVLQKDLASYSYYTTPSGSLVSSDSSIFNRAYWLHKALGANNGILWGNECFITVVD FT NTRNVNLNISVYKEAETMPDDNSYRYKAQDFKNYIRHPEEYELEVIVELCKVPLTADII FT AHLNVMNPKILENWELSYVPPPPEGIQDTYRYIQSLASKCPDDVPPKEKPDPYATYTFW FT KINLHEKLTAELSQTALGKRFLYQTGQTENTKLKNCSQTIACKRCLPSNCKRTVKRRKR FT " XX SQ Sequence 7247 BP; 2402 A; 1239 C; 1441 G; 2165 T; 0 other; ccttataaat acgtccacag atggacattc acagtttggc agtgaccttt gtgtgacctt 60 ctactacgac cgttttaggt ttttaaatag cctcaggaaa tatctggcaa ccggtaccgg 120 tgttttccac agaagtcagt gagtacaggt aggtgcaagt tcgcacaaag ctatgctttt 180 cgcgccaacc gaaaacggtt actccctgca aaaatatgta ccagaagcgg tggtgatttt 240 atctcgtata tcattgttgg caactatgat ttcctgatga ttgtttttca accagcatcg 300 gtgcctataa aagactcgat ttttctatct gtctctctct ttctgatggc tgagttgtgc 360 cctactagac ttgatgagta ttgtaaagta ttgggaataa gcttttttga tgtatcactg 420 aaatgtgtat tttgtaattg taaattgtct ttgcaagatt tagcaagctt tgtttctaaa 480 tgtttaagtt tgatatggaa gaataatgaa tgttttgctt cttgtacttt atgtttaaga 540 ctgtctgcaa gatatgaaag ggagaaatat actcaatgta ttgtaaaagg gtgtatgctt 600 gaaactttga ctgcaacacc tttgtgtgag ctaataataa ggtgtaaata ttgttataga 660 aaactagatt atgttgaaaa gattgattgc tgtgtaggtg accttccttt ttcattagta 720 cgctctcagt ggagaaactg ttgtagactt tgcagatacg aaaatgagag ggcctgaaat 780 tgatgttcaa gacatagaat tacatttaga aagtttagta ctgccacaaa atttattaag 840 caacgaatcg ttgtcgccag atactgaggg acaaccagaa gaggtggagc aagcacctta 900 tagagtagac acttgttgct ggtcttgtgg aacaggtgtt cgtatttgtg tgtttgcttc 960 tcgtcttgct attcttacac ttcaacaact attgactgca gagttgaatt tgctttgccc 1020 ttcgtgctca aggattcact ttcgccatgg cagacattaa taaaggtact gaacatataa 1080 aagaaaataa tgcttggtat ataacagagg cagaatgtat tgatagtttg gatactttgg 1140 aggacttgtt tgaaggtagt acagatggat cagacatttc aaatcttata gatgatgcag 1200 agtatgatca gggaaattcc ctggcactgc tcaatgcaca gctaacggac gattgtaata 1260 ctgctgttct agagctaaaa cgaaagttta caacttcacc agaacagtca gtagctgact 1320 taagtccgag attgcaggct ataaatattt ctcctcaaag aacaagcaaa aggcgactat 1380 tttatgacag tggtatagct gaggatgaag ctgaaaatac ttatgaaaag gtagacgaac 1440 tacaaagttc tgttactgtt gttgatgata catgtagtac taatctgaat ttgttaaaca 1500 atcataatca taaaattata ctatttacaa aatgcaaaga aatgtttggg gtatcgttta 1560 cagaagttac tagatgtttt aaaagtgata aaacatgtag cgaacagtgg attgtgctag 1620 catactgcat tagacaagaa ctgctagaag cttgtaaggt tcagctacag caacactgcg 1680 aatattttca aatgattgag aatgatttta gtgcattact atgtgttatg tttaaatcag 1740 gtaaaaatag agaaacagta tgtaagttaa tgtgtactat attagcttgt aatgaaaatc 1800 aattgttatt agaacctcct cgtactcgaa gccctccagt tgctatgttt ttctatcaaa 1860 aatgttttgg taatgcttgt tataaatatg gagagtttcc aggttggata aaaaagcaag 1920 tgttattatc tcatgagtca gcagctactg ctgaaacatt tgacttgagt caaatgattc 1980 aattttgtta tgacaacgat ctaacagatg aaccaagtat agcatacaga tatgctctgc 2040 aagctgattg tgatccaaat gcagcagcat ttttaaagca caataatcaa gcaaaatttg 2100 taaaagatgc atgttgtatg gtaaaatatt ataaaaggca ggaaatgaga gaaatgtcta 2160 tttcccaatg gatctggaaa tgttgtgatg attgcgatga aataggggaa tggaaggtta 2220 tagcacagct atttaagtat caaggtgtaa attttgtagc attcttaact gcattacgtt 2280 actttttaaa aggagttcct aaaagacaat gcttagtatt ttatggacct tcagatactg 2340 gcaaatcata tttttgtaat tccttgattc agtttttcaa aggtaaagtt gtatctatta 2400 tgaatagatg tagctcattt tggttgcaac ctttattaga ttccaaaatg ggatttatag 2460 atgattgtac atatcctgga tggcagtatc tagatacaaa tatgagaggt gcattggatg 2520 gtaatactgt ttgtattgat gctaaatata gagctcaaac acagttaaaa cttcctccaa 2580 tgctaataac tacaaatgtt gatcttgaaa aagaggattg ttttaaatac attagaagta 2640 gattgcaaat atttaacttt cctcataaat ttcctcttaa ggaggacggt agtgttgtgt 2700 atgaaataac taataaaaca tggaaatgtt tttttagcaa acttggaaat caaattgatt 2760 taagtccaaa agaagacaca caagatgaat caggccgatc tgacagaacg tttcgatgca 2820 ctgcaggaca gactaatgac tctttatgaa tctgcaccaa atacaattga tgatcaaatt 2880 gaaatttggg aaataataag aaaggaatat gtttattatt attatgctag gaaagaaggt 2940 tataaacatt ttggactgca acctattcct gcattaagtg tatctgagta taaagctaaa 3000 gaagctattc agcaagtact gttattgaaa tccctaaaac aatctccgta tggcagggaa 3060 gaatggacac ttacaaacac cagtgctgaa ttaacacata cacagcccaa aaatgcattc 3120 aaaaagaatc catatattgt agatgtgcac tttgatcata aagctgataa ttcctttcca 3180 tatactaatt gggacaattt atatattcaa gatgatgatg atgaatggta taaaacacct 3240 ggcttagttg atattaatgg cctctatttt gaagataagt atggagtcaa aaattacttt 3300 gtaatttttg caactgatgc acagacttat ggtaccacag gagaatggac tgtatattat 3360 aaaaatcaaa ctatttctac ttcttctgct agcacttctc aagcctcgct cttcggatct 3420 cttcaagggt cctccagggg agttgtcagc tcctccaggg acgccgtacc catcccgcaa 3480 acacccagag gacaaaaaag cgaagaggga agagctagct caaccaccga gacaccacca 3540 cctgcgttac gacgaagacg acgacgaccc gcagaccaac aaggagaacc ttccaccacg 3600 agacaaaagc gacgcagatt ggagaaagac actgctccag tctctcctgg acaagttgga 3660 agcggacatc ttacagtgcc aagaagaaat ctttcaagac ttgagcgact tgaagcagag 3720 gctaaggatc ccccaataat cttagtatca ggtgctgcaa atcagttaaa atgttggaga 3780 tggagatgta aaaaagccaa agtgccatgc caatgtatta gcacagtttt tagttgggct 3840 ggaaacagtt ctgataattg tataagtaat cataaaatgc ttatagcttt ccaaagtagg 3900 gaacaaagag aattgtttag agctactgta aagtttccta aagatactac attttcttat 3960 gggaacttaa atgctttgta actaatatgc tttctgccaa aagagtaaag cgtgattccc 4020 ctgaaaatat atataggcat tgtaaggtta ctggtacatg tcctcctgat gttgaaaata 4080 aaattgaaaa caaaacatgg gcagatgttc tcttgcaagc gtttagtagt atcatatttt 4140 taggcaattt aggaattggt actgggaagg gctctctatc tgggggtgta aggcctttgc 4200 ctggtgggag ggttgtgcct gagagcatag caccaacacc tactccagct agacctagtt 4260 taacaagacc ttctgttacc agacctacac gaccgttttc ggtgcccata gataccatag 4320 gcattggttc acgtcctata gatccaatag gacgtaggcc tatagatgtc atagatccat 4380 ctagtccagc tgttgtcaca ttatctgaag ctagtccaga cacagtaatc actattgggg 4440 agggcacagt gcctgaatta gaagtaataa cagacacatc atcaatagca agtcatccta 4500 cagtatttca gtcaccagat aatggagtag ctattttaaa tgtaactcct ggtaatcctc 4560 cacctactaa agtatacttt agtgtggaaa tacaaaaccc tgtttttgaa accagtattg 4620 gtcatgttga acccacttat gatgtacatg tgaatccttt catcaccaca gaaactatta 4680 cattagggga agaaatacct ttagagccaa ttaatcctag aagtgagttt gaaatagaag 4740 atacacctaa aaccagtact ccagtggaaa gtatacaaaa ggcgtttagt aaagttaaaa 4800 cattctataa aaaaactgtg caacaggtgc ccacacgtaa ccaaaatcta ttgggtgatg 4860 tttcccgcgc aatagagttt ggatttgaaa atcccgcctt tgaccctgag gtgtcgttac 4920 agtttcagga agatgttaat gaggtaagag cagcgcctga tccagacttc acaggcattc 4980 aaaaaattag ccgccctttt ttaactgcta cagaagaagg aaaagtgcgt gtcagcagat 5040 tagggagcag agcaggaata cgcaccagaa gtgggaccgt tatcggtcag gatgttcatt 5100 tatattatga tattagtacc atagaagaaa ttgagttacc cactatatca tctgcaacta 5160 ctacatctat ggcagaaccg tctactactg aaacctttat aggggcttca caaagcctgc 5220 ctgcaacagt gtctgataat gaattacttg atacatttgc agagagtttt actaatgcac 5280 aattagtatt accagtagta gaagaagagg aggacatagc tttacatcct tttattttat 5340 ctaatacctt tgctaggcca gtagttgtgg atataggaag tggttacttt tattccccag 5400 aaaataattc aaagcctaat gtaaatcctg ctattccaac tattccatta actccaggaa 5460 tatctattaa tgtttactct actgatttta ttttacatcc tagtttatta aagaaaagaa 5520 agagaaaacg atcagattct ttctaatttt tttcagatgt cttggacaca atcaggaacg 5580 ctctacctac cgcctcaaaa acctgttgca aagatttaca acacggatga ttatgtagaa 5640 ggcacaggtt attattttca tgcaggaact gacagactgc tacttgtagg acatccgtat 5700 tttgatataa cagactccaa tgatcctact aaaatagtgg tacctaaggt ttcagctaat 5760 caatacagag ttttgcgact agactttcca gatccaaata aatttgcaat agctgatacc 5820 tgtgtttata atccagaaac agagagatta gtttggaagc tagtaggatt tcaaatggat 5880 agaggaggcc ctttaggtat tggggctaca gggcatccgt atttcaataa gtatacagat 5940 gttgaaaatc ctacaggcta tcctgcaaag caagatgcta atgcagatta tagggttgat 6000 atggcatttg atcctaagca ggttcaaata tgtatagtag gctgtacacc tccagtaggt 6060 cagtactggg atactacaaa gttttgtgca gatcatagaa aaaacaatgg tgactgccca 6120 cctattgaac taatgcacac tattatacaa gatggtgata tgtgtgaaat agggttcggg 6180 aacgcaaact ttgaaaatat cagtcaggat cgtgcaggtg ttcctttaga acttagtaat 6240 gaaatcagca tttggcctga ttttgtcaaa atgagtaagg ataagtatgg agatcagatg 6300 tttttttgtg ctaaaaagga acagttatat gcccgtcatt atttagcaaa ggcaggtata 6360 gacggggatg atttacctac caattcttat tggaatcccc agaataacac cgtcctgcaa 6420 aaggatttag cgtcctactc atattataca acacctagtg gttctttggt ttccagtgat 6480 tctagcattt ttaacagagc atattggcta cacaaagcct taggtgcaaa taatggcata 6540 ttgtggggta atgaatgttt cattactgtg gttgataata ctagaaatgt gaacttaaat 6600 atatcagtat ataaagaagc agaaacaatg ccagatgata acagctatag atacaaggct 6660 caggatttta aaaattatat ccgacaccca gaggaatacg aattggaagt tattgttgaa 6720 ctttgtaaag ttccattaac agcagatatt attgctcatt taaatgttat gaaccctaaa 6780 atattagaaa attgggaatt gtcttatgtt cctccacctc cagagggtat acaagatacc 6840 tatagatata tacaaagttt agcttccaaa tgtcctgatg atgttcctcc aaaggagaaa 6900 ccagatccgt atgcaacata cacattttgg aaaattaacc tacacgaaaa actaacagct 6960 gagctgtctc aaacagcttt ggggaaacgt ttcttgtatc agactggaca aacagaaaat 7020 actaaactaa aaaactgttc acaaacgatt gcttgtaaac gatgtttacc ttccaattgt 7080 aaacgcactg taaagagacg gaagagataa ataaataaat gcttgatatg aaaaaaaatg 7140 tgaatacctt aaatgctgct gtaatgacac tacctcaata aatgtttaaa ctgtggaatg 7200 tgatttgagt catttcaatt attgtctcca cccaatacat gtgtcca 7247 // ID KC311940; SV 1; linear; genomic RNA; STD; VRL; 564 BP. XX AC KC311940; XX DT 24-MAR-2013 (Rel. 116, Created) DT 24-MAR-2013 (Rel. 116, Last updated, Version 1) XX DE Muscovy duck reovirus strain MW9710 lamda B gene, partial cds. XX KW . XX OS Muscovy duck reovirus OC Viruses; Riboviria; Reoviridae; Spinareovirinae; Orthoreovirus. XX RN [1] RP 1-564 RA Hu Q.L., Chen S.Y., Lin F.Q., Cheng X.X., Lin T.L., Jiang B., Chen S.L., RA Cheng Y.Q., Li Y.Y., Zhu X.L.; RT "The identification of muscovy duck reovirus"; RL Ping Tu Hsueh Pao 20(3):242-248(2004). XX RN [2] RP 1-564 RA Hu Q.L., Chen S.Y., Lin F.Q., Cheng X.X., Lin T.L., Jiang B., Chen S.L., RA Cheng Y.Q., Li Y.Y., Zhu X.L.; RT ; RL Submitted (12-DEC-2012) to the INSDC. RL Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of RL Agriculture Sciences, Fujian Animal Diseases Control Technology Development RL Center, 247 Wu Si Road, Fuzhou, Fujian 350003, China XX DR MD5; 0af3d5a6655d3b2c46bd64d2f99a5e92. XX CC ##Assembly-Data-START## CC Sequencing Technology :: Sanger dideoxy sequencing CC ##Assembly-Data-END## XX FH Key Location/Qualifiers FH FT source 1..564 FT /organism="Muscovy duck reovirus" FT /segment="L2" FT /host="muscovy duck" FT /strain="MW9710" FT /mol_type="genomic RNA" FT /country="China" FT /collection_date="1997" FT /db_xref="taxon:77153" FT CDS <1..>564 FT /codon_start=1 FT /product="lamda B" FT /db_xref="GOA:M4QC67" FT /db_xref="InterPro:IPR007097" FT /db_xref="InterPro:IPR012915" FT /db_xref="UniProtKB/TrEMBL:M4QC67" FT /protein_id="AGH25588.1" FT /translation="VSAAHTLSADYINYHMNLSTTSGSAVIEKVVPLGMYASCPPAQAV FT NIDIKACDASITYQYFLSVIVGAIHEGAAGRRVSSSFMGVPPSVLSVVDSSGVTSSMPI FT SGFQVMCQWLAKLYQRGFEYQVTDTFSPGNTFTHHTTTFPSGSTATSTEHTANNSTMMD FT GFLRSWIPSSGASDVLKKFCRSISI" XX SQ Sequence 564 BP; 114 A; 150 C; 126 G; 174 T; 0 other; gtctcagcag ctcacacctt atccgctgat tacattaatt accatatgaa cttatccacc 60 acatcgggta gtgctgtcat tgagaaggtt gttccgctgg gtatgtacgc ctcctgcccg 120 cctgctcaag cagtcaatat cgacattaaa gcgtgtgatg cgtctattac gtatcagtac 180 tttctctccg ttatagttgg tgccattcat gagggtgcgg cagggcgtcg tgtatcctcc 240 tcattcatgg gtgttcctcc cagcgtattg tccgttgttg attccagtgg cgtcacgtcc 300 tcaatgccca tttccggttt ccaggtcatg tgccaatggt tagcgaaact ttatcagcgt 360 ggttttgagt atcaggttac ggacacattc tcacccggca atacctttac gcatcatacg 420 acgacttttc catctgggtc taccgccacg tccactgaac atactgctaa taatagcacg 480 atgatggatg gctttttgag atcctggatc ccctcctctg gtgcgtcgga cgttctgaag 540 aagttttgtc gttccatttc catc 564 // ID KC312330; SV 1; linear; genomic RNA; STD; VRL; 5215 BP. XX AC KC312330; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_A2 from USA nonfunctional gag protein (gag) gene, DE complete sequence; nonfunctional pol protein (pol) gene, partial sequence; DE vif protein (vif) gene, complete cds; and vpr protein (vpr) gene, partial DE cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5215 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5215 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 0194d4e7f3aea5d0a5a5abd8d79b9629. DR EuropePMC; PMC3637789; 23542380. XX FH Key Location/Qualifiers FH FT source 1..5215 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_A2" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1710 FT /gene="gag" FT misc_feature 209..1710 FT /gene="gag" FT /note="nonfunctional gag protein due to mutation" FT gene <1504..4514 FT /gene="pol" FT misc_feature <1504..4514 FT /gene="pol" FT /note="nonfunctional pol protein due to mutation" FT gene 4459..5037 FT /gene="vif" FT CDS 4459..5037 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MYG0" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MYG0" FT /protein_id="AGG76588.1" FT /translation="MEDRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKIKGHRGSHTLNGH" FT gene 4977..>5215 FT /gene="vpr" FT CDS 4977..>5215 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYE0" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYE0" FT /protein_id="AGG76589.1" FT /translation="MEQAPEDQGPQREPYTEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5215 BP; 1957 A; 924 C; 1239 G; 1095 T; 0 other; ctggtatcta gagatccctc agaccctgtt attcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacggttc gccgtcaatc ctggcctgtt 360 agaaacagca gagggctgta gacaaatact gacacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaaac agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaagggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagca ttatcagaag gagccacccc acaagattta aataccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccccaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa aattggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacattagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggca gaagcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgtagggccc ctaggaaagg 1440 aggctgttgg aaatgtggaa aagaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt tccttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 tccccctcag aagcaggagc caatagacaa ggaaatgtat cccatagctt ccctcaatca 1680 ctctttggca acgacccctt gtcacaataa aaataggggg gcaactaaag gaagctctat 1740 tagatacagg agcagatgat acagtattag aagaaatgac cctgccagga aaatggaaac 1800 caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat cagataccca 1860 tagaaatctg tggacataga gctataggta cggtattagt aggacctaca cctgtcaaca 1920 taattggaag aaatctgttg actcagattg gttgcacttt aaattttccc attagtccta 1980 ttgaaacggt accagtaaaa ttaaagccag gaatggatgg cccaaaagtt aaacaatggc 2040 cattgacaga agaaaaaata aaagcattag tagaaatttg cacagaaatg gaaaaggaag 2100 ggaaaatttc aagaattgga cctgaaaatc catacaatac tccaatattt gccataaaga 2160 aaaaagacag tactaaatgg aggaaattag tagatttcag agaacttaat aagaaaactc 2220 aagatttctg ggaagttcaa ttaggaatac cacatcccgc agggttaaaa aagaaaaagt 2280 cagtaacagt gctggatgtg ggggatgcat atttttcagt tcccttagat aaagatttca 2340 ggaagtatac tgcatttacc atacctagta caaacaatga gacaccaggg attagatatc 2400 agtacaatgt gcttccacag ggatggaaag gatcaccagc aatattccaa agtagcatga 2460 caaaaatctt agagcctttc agacaacaaa atccagacat agtcatctat caatacatgg 2520 atgatttata tgtaggatct gacttagaaa tagggcagca tagaacaaaa atagaggaac 2580 taagacaaca tctgttgagg tggggattta ccacaccaga caaaaaacat cagaaagaac 2640 ctccattcct ctggatgggc tatgaactcc atcctgataa atggactgtg cagcctatag 2700 tgctgccaga aaaagatagt tggactgtca atgacataca gaagttagtg ggaaaattga 2760 attgggcaag tcagatttat gcagggatta aagtaaggca attatgtaaa ctccttaggg 2820 gaaccaaggc actaacagaa gtaataccac taacagaaga agcagagcta gaactggcag 2880 aaaacaggga aattctaaaa gaaccagtac atggagtgta ctatgaccca tcaaaagact 2940 tattagcaga aatacagaag caggggcaag gccaatggac atatcaaatt tatcaagagc 3000 catttaaaaa tctaaaaaca ggaaaatatg caagaatgag gggtgcccac actaatgatg 3060 taaaacaatt aacagaggca gtgcaaaaaa tagccacaga gagcatagtg atatggggaa 3120 agattcctaa atttagacta cccatacaaa aagagacatg ggaatcatgg tggacagact 3180 attggcaagc aacctggatt cctgagtggg agtttgttaa tacccctccc ttagtaaaat 3240 tatggtacca gttagagaaa gaacccatag taggagtaga aactttctat gtagatgggg 3300 cagctaacag ggagactaaa ttaggaaaag caggatatgt tactgataga ggaagacaaa 3360 aagttgtctc cctaactgac acaacaaatc agaagactga gttacaagca attcagatgg 3420 ccttgcagga ctcgggatta gaagtaaaca tagtaacaga ctcacaatat gcattaggaa 3480 tcattcaagc acaaccagat aaaagtgaat cagaaatagt caatcaaata atagaacagt 3540 taataaaaaa ggaacgggtc tacctgacat gggtaccagc acacaaagga attggaggaa 3600 atgaacaagt agataagtta gtcagtgctg gaatcaggaa agtactattt ttagatggaa 3660 tagataaggc ccaagaagaa catgaaaaat atcacagtaa ttggagagct atggctagtg 3720 attttaacct gccacctgtg gtagcaaaag aaatagtagc ctgctgtgat aaatgtcaac 3780 aaaaaggaga agccatgcat ggacaagtag actgtagtcc aggaatatgg caattagatt 3840 gtacacatct agaaggaaaa gttatcctgg tagcagtgca tgtagccagt ggatacatag 3900 aagcagaagt tattccagca gagacagggc aggaaacagc atacttcctc ttaaaattag 3960 caggaagatg gccagtaaaa acaatacata cagacaatgg cagcaatttc accagtacta 4020 cggttaaggc tgcctgttgg tgggcgggga tcaagcagga atttggcatc ccctacaatc 4080 cccaaagtca aggagtagta gaatctatga ataaagagtt aaagaaaatt ataggacagg 4140 taagagatca ggccgaacat ctcaagacag cagtacaaat ggcagtattc attcacaatt 4200 ttaaaagaaa aggggggatt gggggataca gtgcagggga aagaatagta gacatgatag 4260 caacagacat acaaactaaa gaattacaaa aacaaattac aaaaattcaa aattttcggg 4320 tttattacag ggacagcaga gatccacttt ggaaaggacc agcaaagctt ctctggaaag 4380 gtgaaggggc agtagtaata caagataata gtgacataaa agtagtgcca agaagaaaag 4440 caaagataat tagggattat ggaagacaga tggcaggtga tgattgtgtg gcaagtagac 4500 aggatgagga ttagagcatg gaaaagccta gtaaaacacc atatgtatgt ttcaaaaaag 4560 gctcagggat ggttttatag acatcactat gacagtcgtc atccaagaat aagttcagaa 4620 gtacacatcc cactagggga agccacattg gtcgtaacaa catattgggg tctgaataca 4680 ggagaaagag actggcattt gggtcaggga gtctccatag aatggaggaa aaggagatat 4740 agcacacaag tagaccctaa cttagcagac caactaattc atctgtatta ctttgattgt 4800 ttttcagaat ccgctataag aaatgcctta ttaggacata tagttagacc taagtgtgca 4860 tatcaagcag gacataacaa ggtaggatct ctacagtact tggcactagt agcattaaca 4920 acaccaaaaa agataaagcc acctttgcct agtgtcgcaa aattgacaga ggatagatgg 4980 aacaagcccc agaagatcaa gggccacaga gggagccata cactgaatgg acactagagc 5040 ttttagagga gcttaagaat gaagctgcta gacactttcc taggctgtgg ctccatggtt 5100 taggacaaca tatctatgaa acatatgggg atacttgggc aggagtggaa gccctaataa 5160 gaattctgca acaactgctg tttattcatt tcagaattgg gtgtcaacat agcag 5215 // ID KC312331; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312331; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_A3 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 5fa54a6197e57d389002f8c08cc745af. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_A3" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N768" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N768" FT /protein_id="AGG76590.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKSVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPVASLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N0A2" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N0A2" FT /protein_id="AGG76591.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGRDNNAPSEAG FT ANRQGNVSCSFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNRKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDRSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N242" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N242" FT /protein_id="AGG76592.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLHTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQRTKGRRGSHTLNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYG4" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYG4" FT /protein_id="AGG76593.1" FT /translation="MEQAPEDQGPQREPYTEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1947 A; 921 C; 1254 G; 1094 T; 0 other; ctggtaacta gagatccctc agaccctttt gttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggaact agaacgattt gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta gacaaatact gacacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatcat tatataatac agtagcaact ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaaac agcagccagg tcagccaaaa 600 ttaccctata gtgcaaaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 agggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatct tgggattaaa 1020 taaaatagta aggatgtata gccccaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gaccatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agtcttggca gaagcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aatcaaagaa agagtgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 gccccctcag aagcaggagc caatagacaa ggaaatgtat cctgtagctt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctatgggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggctgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gtacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtgtt tgccataaag 2160 aaaaaagaca gtactaaatg gagaaagtta gtagatttca gagaacttaa taggaaaact 2220 caagatttct gggaagttca attaggaata ccccatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacgccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt actacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acagcctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga ggtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga tagaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaacgggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaagggaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact 4020 acagttaagg ctgcctgttg gtgggcgggg atcaagcagg aatttggcat cccctacaat 4080 ccccaaagtc aaggggtagt agaatccatg aataaagagt tgaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtgtt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctaaatt ggtagtaaca acatattggg gtctgcatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacaa atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaggacca agggccgcag agggagccat acactgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctaggctgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccctaata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312332; SV 1; linear; genomic RNA; STD; VRL; 5219 BP. XX AC KC312332; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_A4 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5219 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5219 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 9a3a1b816d7e1694afa3e78754659b93. XX FH Key Location/Qualifiers FH FT source 1..5219 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_A4" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4MYE5" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4MYE5" FT /protein_id="AGG76594.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGSNSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKSVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKDMYPITSLRSLFGNDPSSQ" FT gene <1504..4518 FT /gene="pol" FT CDS <1504..4518 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N771" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N771" FT /protein_id="AGG76595.1" FT /translation="FFRKNLAFPQGEAREFSSEQTRANSPTRRELQVWGRDNNPPSEAG FT ANRQGYVSHNFPQITLWQRPLVTVKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKK FT DSTKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFR FT KYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYM FT DDLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQP FT IVLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELE FT LAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAH FT TNDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTP FT PLVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTEL FT QAIQMALQDSGLEVNIVTDSQYALGIIQAQPDRSESEIVNQIIEQLIKKERVYLTWVPA FT HKGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIV FT ACCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQE FT TAYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMN FT KELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQ FT KQITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGK FT QMAGDDCVASRQDED" FT gene 4463..5041 FT /gene="vif" FT CDS 4463..5041 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N0A7" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N0A7" FT /protein_id="AGG76596.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLHTGERDWHLGQGVSIEWRKKRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHIMNGH" FT gene 4981..>5219 FT /gene="vpr" FT CDS 4981..>5219 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N247" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N247" FT /protein_id="AGG76597.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5219 BP; 1962 A; 926 C; 1239 G; 1092 T; 0 other; ctggtaacta gagatccctc agaccctgtt attcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaaa cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattagat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta aacaaatact ggcacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcctt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc aacagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatct tgggattaaa 1020 taaaatagta agaatgtata gccccaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gaccatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggcg gaggcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agagtgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgtaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg aaagacaggc 1500 taatttttta ggaaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 cccccctcag aagcaggagc caatagacaa ggatatgtat cccataactt ccctcagatc 1680 actctttggc aacgacccct cgtcacagta aaaatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtgtta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacatag agctataggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaaaaagca ttagtagaaa tttgcacaga aatggaaaag 2100 gaagggaaaa tttcaagaat tggacctgaa aatccataca atactccagt atttgccata 2160 aagaaaaaag acagtactaa atggagaaaa ttagtagatt tcagagaact taataagaaa 2220 actcaagatt tctgggaagt tcaattagga ataccccatc ccgcagggtt aaaaaagaaa 2280 aagtcagtaa cagtactgga tgtgggggat gcatattttt cagttccctt agataaagat 2340 ttcaggaagt atactgcatt taccatacct agtacaaaca atgagacacc agggattaga 2400 taccagtaca atgtgcttcc acagggatgg aaaggatcac cagcaatttt ccaaagtagc 2460 atgacaaaaa tcttagagcc tttcagacaa caaaatccag acatagtcat ctatcaatac 2520 atggatgatt tgtatgtagg atctgactta gaaatagggc agcatagaac aaaaatagag 2580 gaactgagac aacatttgtt gaggtgggga tttaccacac cagacaaaaa acatcagaaa 2640 gaacctccat tcctctggat gggctatgaa ctccatcctg ataaatggac tgtacagcct 2700 atagtgctgc cagaaaaaga tagttggact gtcaatgaca tacagaagtt agtgggaaaa 2760 ttgaattggg caagtcagat ttatgcaggg attaaagtaa ggcaattatg taaactcctt 2820 aggggaacca aggcactaac agaggtaata ccactaacag aagaagcaga gttagaactg 2880 gcagaaaaca gggaaattct aaaagaacca gtacatggag tatactatga cccatcaaaa 2940 gacttaatag cagaaataca gaagcagggg caaggccaat ggacatatca aatttatcaa 3000 gagccattta aaaatctaaa aacaggaaaa tatgcaagaa tgaggggtgc ccacactaat 3060 gatgtaaaac aattaacaga ggcagtgcaa aaaatagcca cagagagcat agtgatatgg 3120 ggaaagattc ctaaatttag actacccata caaaaagaga catgggaatc atggtggaca 3180 gactattggc aagccacctg gattcctgag tgggagtttg tcaatactcc tcccttagta 3240 aaattatggt accagttaga gaaagaaccc atagtaggag tagaaacttt ctatgtagat 3300 ggggcagcta acagggagac taaattagga aaagcaggat atgttactga tagaggaaga 3360 caaaaagttg tctccctaac tgacacaaca aatcagaaga ctgagttaca agcaattcag 3420 atggccttgc aggactcggg attagaagta aacatagtaa cagactcaca atatgcatta 3480 ggaatcattc aagcacaacc agatagaagt gaatcagaaa tagtcaatca aataatagaa 3540 cagttaataa aaaaggaacg ggtctacctg acatgggtac cagcacacaa aggaattgga 3600 ggaaatgaac aagtagataa gttagtcagt gctggaatca ggaaagtact atttttagat 3660 ggaatagata aggcccaaga agaacatgaa aaatatcaca gtaattggag agcaatggct 3720 agtgatttta acctgccacc tgtggtagca aaagaaatag tagcctgctg tgataaatgt 3780 caacaaaaag gagaggccat gcatggacaa gtagactgta gtccaggaat atggcaatta 3840 gattgtacac atctagaagg aaaagttatc ctggtagcag tgcatgtagc cagtggatat 3900 atagaagcag aagttattcc agcagagaca gggcaggaaa cagcatactt cctcttaaaa 3960 ttagcaggaa gatggccagt aaaaacaata catacagaca atggcagcaa cttcaccagt 4020 actacggtta aggctgcctg ttggtgggcg gggatcaagc aggaatttgg catcccctac 4080 aatccccaaa gtcaaggggt agtagaatct atgaataaag aattaaagaa aattatagga 4140 caggtcagag atcaggctga acatctcaag acagcagtac aaatggcagt attcattcac 4200 aattttaaaa gaaaaggggg gattggggga tacagtgcag gggaaagaat agtagacatg 4260 atagcaacag acatacaaac taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt 4320 cgggtttatt acagggacag cagagatcca ctttggaaag gaccagcaaa gcttctctgg 4380 aaaggtgaag gggcagtagt aatacaagat aatagtgaca taaaagtagt gccaagaaga 4440 aaagcaaaga taattaggga ttatggaaaa cagatggcag gtgatgattg tgtggcaagt 4500 agacaggatg aggattagag catggaaaag tctagtaaaa caccatatgt atgtttcaaa 4560 aaaagctcag ggatggtttt atagacatca ctatgacagt cgtcatccaa gaataagttc 4620 agaagtacac atcccactag gggaggctaa attggtagta acaacatatt ggggtctgca 4680 tacaggagaa agagactggc atttgggtca gggagtctcc atagaatgga ggaaaaagag 4740 atatagcaca caagtagacc ctaacttagc agaccaacta attcatctgt attactttga 4800 ttgtttttca gaatccgcta taagaaatgc cctattagga cacatagtta gacctaagtg 4860 tgcatatcaa gcaggacata acaaggtagg atctctacag tacttggcac tagtagcatt 4920 aacaacacca aaaaagataa agccaccttt gcctagtgtc gcaaaattga cagaagatag 4980 atggaacaag ccccagaaga ccaagggcca cagagggagc catataatga atggacacta 5040 gagcttttag aggagcttaa gaatgaagct gctagacact ttcctaggct gtggctccat 5100 ggtttagggc aacatatcta tgaaacatat ggggatactt gggcaggagt ggaagcccta 5160 ataagaattc tgcaacaatt actgtttatt catttcagaa ttgggtgtca acatagcag 5219 // ID KC312333; SV 1; linear; genomic RNA; STD; VRL; 5215 BP. XX AC KC312333; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_A5 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5215 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5215 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 417e5eac581079147bd3fb42f3a23a22. XX FH Key Location/Qualifiers FH FT source 1..5215 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_A5" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 208..1710 FT /gene="gag" FT CDS 208..1710 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4MYH0" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4MYH0" FT /protein_id="AGG76598.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAGGCRQILEQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQTTADTGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHTGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKSVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPVTSLRSLFGNDPSSQ" FT gene <1503..4514 FT /gene="pol" FT CDS <1503..4514 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4MYF1" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4MYF1" FT /protein_id="AGG76599.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDNNSPSEAG FT ANRQGDVSCNFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4459..5037 FT /gene="vif" FT CDS 4459..5037 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MZH9" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MZH9" FT /protein_id="AGG76600.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLHTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGNHTMNGH" FT gene 4977..>5215 FT /gene="vpr" FT CDS 4977..>5215 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N0B2" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N0B2" FT /protein_id="AGG76601.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5215 BP; 1955 A; 922 C; 1248 G; 1090 T; 0 other; tggtaactag agatccctca gaccctgtta ttcggtgtgc aaaatctcta gcagtggcgc 60 ccgaacaggg acttgaaagc gaaaggaaaa ccagaggagc tctctcgacg caggactcgg 120 cttgctgaag cgcgcacggc aagaggcgag gggtggcgac tggtgagtac gccaaacttt 180 tgactagcgg aggctagaag gagagagatg ggtgcgagag cgtcggtatt aagcgggggt 240 caattggata gatgggagaa aattcggtta aggccagggg gaaaaaagca atataggtta 300 aaacatatag tatgggcaag cagggagcta gaacgattcg cagtcaatcc tggcctgtta 360 gaaacagcag ggggctgtag acaaatactg gaacagctac aaccagccct tcagacagga 420 tcagatgaac ttagatcatt atataataca gtagcaaccc tctattgtgt acatcaaagg 480 atagaggtaa aagacaccaa ggaagcttta gagaaaatag aggaggagca aaataaaagt 540 aagaaaaagg cacagcaaac aacagctgac acaggaaaca gcagccaggt cagccaaaat 600 taccctatag tgcagaacct tcaggggcaa atggtacatc aggccatatc acctagaact 660 ttaaatgcat gggtaaaagt agtagaagag aaggccttca gcccagaagt aatacccatg 720 ttttcagcgt tatcagaagg agccacccca caagatttaa acaccatgct aaacacagtg 780 gggggacatc aagcagccat gcaaatgtta aaagagacca tcaatgatga agctgcagaa 840 tgggatagac tgcatccagt gcatacaggg cctgttgcac caggccagat gagagaacca 900 aggggaagtg acatagcagg aactactagt acccttcagg aacaaatagg atggatgaca 960 aataatccac ctatcccagt aggagagatc tataaaagat ggataatcct gggattaaat 1020 aaaatagtaa ggatgtatag ccctaccagc attctggaca taagacaagg accaaaggaa 1080 ccctttagag actatgtaga ccggttctat aaaactctaa gagccgagca ggcgtcacag 1140 gatgtaaaaa cttggatgac agaaaccttg ttggtccaaa atgcaaaccc agattgtaag 1200 actattttaa aagcattggg accagcagct acactagaag aaatgatgac agcatgtcag 1260 ggagtgggag gacccagcca taaagcaaga gttttggcag aagcaatgag ccaagcaaca 1320 aattcacctg ccataatgat gcagagaggc aattttagga accaaagaaa gagtgttaaa 1380 tgctttaatt gtggcaagga agggcacata gccagaaatt gcaaggcccc taggaaaaga 1440 ggctgttgga aatgtggaaa ggaaggacac caaatgaaag attgtactga gagacaggct 1500 aattttttag ggaaaatctg gccttcccac aaggggaggc cagggaattt ccttcagagc 1560 agaccagagc caacagcccc accagaagag agcttcaggt ttggggaaga gacaacaact 1620 ccccctcaga agcaggagcc aatagacaag gagatgtatc ctgtaacttc cctcagatca 1680 ctctttggca acgacccctc gtcacaataa agataggggg gcaactaaaa gaagctctat 1740 tagatacagg agcagatgat acagtattag aagaaatgac cctgccagga aaatggaaac 1800 caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat cagataccca 1860 tagaaatctg tggacacaga gctatgggta cggtattagt aggacctaca cctgtcaaca 1920 taattggaag aaatctgttg actcagattg gttgcacttt aaattttccc attagtccta 1980 ttgaaacggt accagtaaaa ttaaagccag gaatggatgg cccaaaagtt aaacaatggc 2040 cattgacaga agaaaaaata aaagcattag tagaaatttg tacagaaatg gaaaaggaag 2100 ggaaaatttc aagaattgga cctgaaaatc catacaatac tccagtattt gccataaaga 2160 agaaagacag tactaaatgg agaaaattag tagatttcag agaacttaat aagaaaactc 2220 aagatttctg ggaagttcaa ttaggaatac cacatcccgc agggctaaaa aagaaaaagt 2280 cagtaacagt actggatgtg ggggatgcat atttttcagt tcctttagat aaagatttca 2340 ggaagtatac tgcatttacc atacctagta caaacaatga gacaccaggg attagatatc 2400 agtacaatgt gcttccacag ggatggaaag gatcaccagc aatattccaa agtagcatga 2460 caaaaatctt agagcctttc agacaacaaa atccagacat agtcatctat caatacatgg 2520 atgatttgta tgtaggatct gacttagaaa tagggcagca tagaacgaag atagaggaac 2580 tgagacaaca tctgttgagg tggggattta ccacaccaga caaaaaacat cagaaagaac 2640 ctccattcct ctggatgggc tatgaactcc atcctgataa atggactgta cagcctatag 2700 tgctgccaga aaaagatagt tggactgtca atgacataca gaagttagtg ggaaaattga 2760 attgggcaag tcagatttat gcagggatta aagtaaggca attatgtaaa ctccttaggg 2820 gaaccaaggc actaacagag gtaataccac taacagaaga agcagagtta gaactggcag 2880 aaaacaggga aattctaaaa gaaccagtac atggagtgta ctatgaccca tcaaaagact 2940 taatagcaga aatacagaag caggggcaag gccaatggac atatcaaatt tatcaagagc 3000 catttaaaaa tctaaaaaca ggaaaatatg caagaatgag gggtgcccac actaatgatg 3060 taaaacaatt aacagaggca gtgcaaaaaa tagccacaga gagcatagtg atatggggaa 3120 agattcctaa atttagacta cccatacaaa aagagacatg ggaatcatgg tggacagact 3180 attggcaagc cacctggatt cctgagtggg aatttgtcaa tacccctccc ttagtaaaat 3240 tatggtacca gttagagaaa gaacccatag taggagtaga aactttctat gtagatgggg 3300 cagctaacag ggagactaaa ttaggaaaag caggatatgt tactgataga ggaagacaaa 3360 aagttgtctc cctaactgac acaacaaatc agaagactga gttacaagca attcagatgg 3420 ccttgcagga ctcgggatta gaagtaaaca tagtaacaga ctcacaatat gcattaggaa 3480 tcattcaagc acaaccagat aaaagtgaat cagaaatagt caatcaaata atagaacagt 3540 taataaaaaa ggaaagggtc tacctgacat gggtaccagc acacaaagga attggaggaa 3600 atgaacaagt agataagtta gtcagtgctg gaatcaggaa agtactattt ttagatggaa 3660 tagataaggc ccaagaagaa catgaaaaat atcacagtaa ttggagagct atggctagtg 3720 attttaacct gccacctgtg gtagcaaaag agatagtagc ctgctgtgat aaatgtcaac 3780 aaaaaggaga ggccatgcat ggacaagtag actgtagtcc aggaatatgg caattagatt 3840 gtacacatct agaaggaaaa gttatcctgg tagcagtgca tgtagccagt ggatatatag 3900 aagcagaagt tattccagca gagacagggc aggaaacagc atacttcctc ttaaaattag 3960 caggaagatg gccagtaaaa acaatacata cagacaatgg cagcaatttc accagtacta 4020 cagttaaggc tgcctgctgg tgggcgggga tcaagcaaga atttggcatc ccctacaatc 4080 cccaaagtca aggagtagta gaatctatga ataaagagtt aaagaaaatt ataggacagg 4140 taagagatca ggctgaacat ctcaagacag cagtacaaat ggcagtattc attcacaatt 4200 ttaaaagaaa aggggggatt gggggataca gtgcagggga aagaatagta gacatgatag 4260 caacagacat acaaactaaa gaattacaaa aacaaattac aaaaattcaa aattttcggg 4320 tttattacag ggacagcaga gatccacttt ggaaaggacc agcaaagctt ctctggaaag 4380 gtgaaggggc agtagtaata caagataata gtgacataaa agtagtgcca agaagaaaag 4440 caaagataat tagggattat ggaaaacaga tggcaggtga tgattgtgtg gcaagtagac 4500 aggatgagga ttagagcatg gaaaagtcta gtaaaacacc atatgtatgt ttcaaaaaag 4560 gctcagggat ggttttatag acatcactat gacagtcgtc atccaagaat aagttcagaa 4620 gtacacatcc cactagggga ggctacattg gtcgtaacaa catattgggg tctgcataca 4680 ggagaaagag actggcattt gggtcaggga gtctccatag aatggaggaa aaggagatat 4740 agcacacaag tagaccctaa cttagcagac caactaattc atctgtatta ctttgattgt 4800 ttttcagaat ccgctataag aaatgcctta ttaggacaaa tagttagacc taagtgtgca 4860 tatcaagcag gacataacaa ggtaggatct ctacagtact tggcactagt agcattaaca 4920 acaccaaaaa agataaagcc acctttgcct agtgtcgcaa aattgacaga ggatagatgg 4980 aacaagcccc agaagaccaa gggccacaga gggaaccata caatgaatgg acactagagc 5040 ttttagagga gcttaagaat gaagctgcta gacactttcc taggctgtgg ctccatggtt 5100 tagggcaaca tatctatgaa acatatgggg atacttgggc aggagtggaa gccctaataa 5160 gaattctgca acaactgctg tttattcatt tcagaattgg gtgtcaacat agcag 5215 // ID KC312334; SV 1; linear; genomic RNA; STD; VRL; 5213 BP. XX AC KC312334; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B1 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5213 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5213 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 13dc8d9353e37498cd38f7d1e9cf0cec. XX FH Key Location/Qualifiers FH FT source 1..5213 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B1" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1708 FT /gene="gag" FT CDS 209..1708 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N252" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N252" FT /protein_id="AGG76602.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQSKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKSVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTPPQKQEPIDKEMYPVASLRSLFGNDPSSQ" FT gene <1504..4512 FT /gene="pol" FT CDS <1504..4512 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4MYH5" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4MYH5" FT /protein_id="AGG76603.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGRDNSPSEAGA FT NRQGDVSCSFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIGG FT IGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETVP FT VKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKDS FT TKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRKY FT TAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMDD FT LYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPIV FT LPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELELA FT ENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHTN FT DVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPPL FT VKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYITDRGRQKVVSLTDTTNQKTELQA FT IQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAHK FT GIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVAC FT CDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQETA FT YFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNKE FT LKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIIDIIATDIQTKELQKQ FT ITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQM FT AGDDCVASRQDED" FT gene 4457..5035 FT /gene="vif" FT CDS 4457..5035 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MYF6" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MYF6" FT /protein_id="AGG76604.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLHTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNAILGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHIMNGH" FT gene 4975..>5213 FT /gene="vpr" FT CDS 4975..>5213 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N776" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N776" FT /protein_id="AGG76605.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5213 BP; 1957 A; 924 C; 1242 G; 1090 T; 0 other; ctggtaacta gagatccctc agaccctgtt tttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaaa cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta aacaaatact ggcacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agaaaaaata gaggaggagc aaagtaaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtacagaacc tccaggggca aatggtacat caggccatct cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccctaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gaccatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggca gaggcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agagtgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaac tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg aaagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaactcc 1620 ccctcagaag caggagccaa tagacaagga gatgtatcct gtagcttccc tcagatcact 1680 ctttggcaac gacccctcgt cacaataaag ataggggggc aactaaagga agctctatta 1740 gatacaggag cagatgatac agtattagaa gaaatgaccc tgccaggaaa atggaaacca 1800 aaaatgatag ggggaattgg aggttttatc aaagtaagac agtatgatca gatacccata 1860 gaaatctgtg gacatagagc tataggtacg gtattagtag gacctacacc tgtcaacata 1920 attggaagaa atctgttgac tcagattggt tgcactttaa attttcccat tagtcctatt 1980 gaaacggtac cagtaaaatt aaagccagga atggatggcc caaaagttaa acaatggcca 2040 ttgacagaag aaaaaataaa agcattagta gaaatttgca cagaaatgga aaaggaaggg 2100 aaaatttcaa gaattggacc tgaaaatcca tacaatactc cagtgtttgc cataaagaaa 2160 aaagacagta ctaaatggag aaaattagta gatttcagag aacttaataa gaaaactcaa 2220 gatttctggg aagttcaatt aggaataccc catcccgcag ggttaaaaaa gaaaaagtca 2280 gtaacagtac tggatgtggg ggatgcatat ttttcagttc ctttagataa agatttcagg 2340 aagtatactg catttaccat acctagtaca aacaatgaga caccagggat tagatatcag 2400 tacaatgtgc tgccacaggg atggaaagga tcaccagcaa tattccaaag tagcatgaca 2460 aaaatcttag agcctttcag acaacaaaac ccagacatag tcatctatca atacatggat 2520 gatttgtatg taggatctga cttagaaata gggcagcata gaacaaaaat agaggaattg 2580 agacaacatc tgttgaggtg gggatttacc acaccagaca aaaaacatca gaaagaacct 2640 ccattcctct ggatgggcta tgaactccat cctgataaat ggactgtaca gcctatagtg 2700 ctgccagaaa aagatagttg gactgtcaat gacatacaaa agttagtggg aaaattgaat 2760 tgggcaagtc agatttatgc agggattaaa gtaaggcaat tatgtaaact ccttagggga 2820 accaaggcac taacagaagt aataccacta acagaagaag cagagttaga actggcagaa 2880 aacagggaaa ttctaaaaga accagtacat ggagtgtact atgacccatc aaaagactta 2940 atagcagaaa tacagaagca ggggcaaggc caatggacat atcagattta tcaagagcca 3000 tttaaaaatc taaaaacagg aaaatatgca agaatgaggg gtgcccacac taatgatgta 3060 aaacaattaa cagaggcagt gcaaaaaata gccacagaga gcatagtgat atggggaaag 3120 attcctaaat ttagactacc catacaaaaa gagacatggg aatcatggtg gacagactat 3180 tggcaagcca cctggattcc tgagtgggag tttgtcaaca ctcctcccct agtaaaatta 3240 tggtaccagt tagagaaaga acccatagta ggagtagaaa ctttctatgt agatggggca 3300 gctaacaggg agactaaatt aggaaaagca ggatatatta ctgatagagg aagacaaaaa 3360 gttgtctccc taactgacac aacaaatcag aagactgagt tacaagcaat tcagatggct 3420 ttgcaggact cgggattaga agtaaacata gtaacagact cacaatatgc attaggaatc 3480 attcaagcac aaccagataa aagtgaatca gaaatagtca atcaaataat agaacagtta 3540 ataaaaaagg aacgggtcta cctgacatgg gtaccagcac acaaaggaat tggaggaaat 3600 gaacaagtag ataagttagt cagtgctgga atcaggaaag tactattttt agatggaata 3660 gataaggccc aagaagaaca tgaaaaatat cacagtaatt ggagagctat ggctagtgat 3720 tttaacctgc cacctgtggt agcaaaagaa atagtagcat gctgtgataa atgtcaacaa 3780 aaaggagagg ccatgcatgg acaagtagac tgtagtccag gaatatggca attagattgt 3840 acacacctag aaggaaaagt tatcctggta gcagtgcatg tagccagcgg atatatagaa 3900 gcagaagtta ttccagcaga gacagggcag gaaacagcat acttcctctt aaaattagca 3960 ggaagatggc cagtaaaaac aatacataca gacaatggca gcaattttac cagtactaca 4020 gttaaggctg cctgctggtg ggcggggatc aagcaagaat ttggcatccc ctacaatccc 4080 caaagtcaag gggtagtaga atctatgaat aaagagttaa agaaaattat aggacaggta 4140 agagatcagg ctgaacatct caagacagca gtacaaatgg cagtattcat tcacaatttt 4200 aaaagaaaag gggggattgg gggatacagt gcaggggaaa gaataataga cataatagca 4260 acagacatac aaactaaaga attacaaaaa caaattacaa aaattcaaaa ttttcgggtt 4320 tattacaggg acagcagaga tccactttgg aaaggaccag caaagcttct ctggaaaggt 4380 gaaggggcag tagtaataca agataatagt gacataaaag tagtgccaag aagaaaagca 4440 aagataatta gggattatgg aaaacagatg gcaggtgatg attgtgtggc aagtagacag 4500 gatgaggatt agagcatgga aaagtctagt aaaacaccat atgtatgttt caaaaaaggc 4560 tcagggatgg ttttatagac atcactatga cagtcgtcat ccaagaataa gttcagaagt 4620 acacatccca ctaggggagg ctaaattggt agtaacaaca tattggggtc tgcatacagg 4680 agaaagagac tggcatttgg gtcagggagt ctccatagaa tggaggaaaa ggagatatag 4740 cacacaagta gaccctaact tagcagacca actaattcat ctgtattact ttgattgttt 4800 ttcagaatcc gctataagaa atgccatatt aggacatata gttagaccta agtgtgcata 4860 tcaagcagga cataacaagg taggatctct acagtacttg gcactagtag cattaacaac 4920 accaaaaaag ataaagccac ctttgcctag tgtcgcaaaa ttgacagagg atagatggaa 4980 caagccccag aagaccaagg gccacagagg gagccatata atgaatggac actagagctt 5040 ttagaggagc ttaagaatga agctgctaga cactttccta ggctgtggct ccatggttta 5100 ggacaacata tctatgaaac atatggggat acttgggcag gagtggaagc cctaataaga 5160 attctgcaac aactgctgtt tattcatttc agaattgggt gtcaacatag cag 5213 // ID KC312335; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312335; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B10 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 4655cc4ebb71f8e6504e6a0b768e2c03. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B10" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N0B7" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N0B7" FT /protein_id="AGG76606.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPSQKQEPIDKDMYPMTSLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N260" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N260" FT /protein_id="AGG76607.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDSNSLSEAG FT ANRQGYVSHDFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT SAKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESAIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N7I4" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N7I4" FT /protein_id="AGG76608.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYF9" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYF9" FT /protein_id="AGG76609.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1958 A; 920 C; 1242 G; 1096 T; 0 other; ctggtaacta gagatccctc agaccctttt gttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaga aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta gacaaatact gacacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatcat tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaaac agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 caataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccccaccag cattctggat ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta caaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggcg gaggcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaaag aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt tccttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacagcaac 1620 tccctctcag aagcaggagc caatagacaa ggatatgtat cccatgactt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccttgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctatgggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacag taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtatt tgccataaag 2160 aagaaagaca gtgctaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagtcca attaggaata ccacatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acaacctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga agtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagcaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaaagggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aagccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt taccagtact 4020 acggttaagg ctgcctgctg gtgggcgggg atcaagcaag aatttggcat cccctataat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctaaatt ggttgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacaa atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggagccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctagactgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312336; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312336; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B11 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; a3805b4c69109e2d25b71749aa2c5758. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B11" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N779" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N779" FT /protein_id="AGG76610.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKNWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPSQKQEPIDKEMYPIASLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N0C1" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N0C1" FT /protein_id="AGG76611.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDSNSLSEAG FT ANRQGNVSHSFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTGEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQESFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIIDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKIVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N264" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N264" FT /protein_id="AGG76612.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYI3" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYI3" FT /protein_id="AGG76613.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1962 A; 923 C; 1239 G; 1092 T; 0 other; ctggtatcta gagatccctc agaccctgtt gttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaaa cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta aacaaatact ggcacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatcat tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaaac agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcta tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccctaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa aattggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 aactatttta aaagcattgg gaccagcagc tacactagaa gagatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttagca gaagcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttaga aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt tccttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacagcaac 1620 tccctctcag aagcaggagc caatagacaa ggaaatgtat cccatagctt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccttgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacatag agctataggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggctgcactt taaattttcc cattagtcct 1980 attgaaacag taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag gagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatataaca ctccagtatt tgccataaag 2160 aagaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagttca attaggaata ccccatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgcg tatttttcag ttcccttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagaaccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctacgaactc catcctgata aatggactgt acagcctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aactgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga agtaatacca ctaacagaag aagcagagct agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga cataccaaat ttatcaagag 3000 tcatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagat 3180 tattggcaag ccacctggat tcctgagtgg gagtttgtca atactcctcc cctagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gctttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaaagggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacacc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact 4020 acggttaagg ctgcctgttg gtgggcgggg atcaagcagg aatttggcat cccctacaat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagacc aggctgaaca tcttaagaca gcagtacaaa tggcagtatt catccacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaataat agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aaatagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctacatt ggtcgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acctagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tctgctataa gaaatgccct attaggacat atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggagccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctagactgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312337; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312337; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B13 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 04958cf98dab1e47c854b4b49ea605dc. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B13" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4MYG3" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4MYG3" FT /protein_id="AGG76614.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPSQKQEPIDKDMYPMTSLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N782" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N782" FT /protein_id="AGG76615.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDSNSLSEAG FT ANRQGYVSHDFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHNNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N0C6" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N0C6" FT /protein_id="AGG76616.1" FT /translation="MENRWQVMIVWQVDRMRIKAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGNHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N269" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N269" FT /protein_id="AGG76617.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1954 A; 924 C; 1243 G; 1095 T; 0 other; ctggtatcta gagatccctc agaccctgtt attcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta aacaaatact ggcacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaaac agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc tccaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagatcta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatct tgggattaaa 1020 taaaatagta agaatgtata gccctaccag cattttggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggcg gaagcaatga gccaagcgac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt tccttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacagcaac 1620 tccctctcag aagcaggagc caatagacaa ggatatgtat cccatgactt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctatgggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggctgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtgtt tgccataaag 2160 aaaaaagaca gcactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagttca attaggaata ccccatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcccttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caattttcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttat atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acagcctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga agtaatacca ctaacagaag aagcagagct agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gagtttgtca atactcctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaaagggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacaata attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact 4020 acggttaagg ctgcctgttg gtgggcgggg atcaagcagg aatttggcat cccctacaat 4080 ccccaaagtc aaggggtagt agaatctatg aataaggagt taaagaaaat tataggacaa 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agacccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attaaagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aagctacatt ggtcgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacat atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgttgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggaaccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctaggctgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312338; SV 1; linear; genomic RNA; STD; VRL; 5214 BP. XX AC KC312338; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B15 from USA gag protein (gag) gene, complete cds; DE nonfunctional pol protein (pol) gene, partial sequence; vif protein (vif) DE gene, complete cds; and vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5214 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5214 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 03636cef35437e2b57a2da20ae33d1a0. XX FH Key Location/Qualifiers FH FT source 1..5214 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B15" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4MYM9" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4MYM9" FT /protein_id="AGG76618.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPIASLKSLFGNDPSSQ" FT gene <1504..4513 FT /gene="pol" FT misc_feature <1504..4513 FT /gene="pol" FT /note="nonfunctional pol protein due to mutation" FT gene 4458..5036 FT /gene="vif" FT CDS 4458..5036 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MYG8" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MYG8" FT /protein_id="AGG76619.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDKWNKPQKIKGHRGSHTLNGH" FT gene 4976..>5214 FT /gene="vpr" FT CDS 4976..>5214 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N785" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N785" FT /protein_id="AGG76620.1" FT /translation="MEQAPEDQGPQREPYTEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5214 BP; 1953 A; 917 C; 1241 G; 1103 T; 0 other; ctggtaacta gagatccctc agaccctttt gttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaga aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta gacaaatact gacacagcta caaccagccc ttcagacggg 420 atcagatgaa cttagatcat tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaaac agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttt agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatct tgggattaaa 1020 taaaatagta agaatgtata gccctaccag tattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aagcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaatc cagattgtaa 1200 gactatttta aaagcattag gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggcg gaggcaatga gtcaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgtaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagtttcagg tttggggaag agacaacaac 1620 tccccctcag aagcaggagc caatagacaa ggaaatgtat cccatagctt ccctcaaatc 1680 actctttggc aacgacccct cgtcacagta aaaatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctatgggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 ggaaaaattt cacgaattgg acctgaaaat ccatacaata ctccagtatt tgccataaag 2160 aaaaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagttca attaggaata ccccatcccg cagggttaaa aaagaaaaaa 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acacaatgag acaccaggga ttagatatca 2400 atacaatgtg cttccacagg gatggaaagg atcaccagca atattccaaa gtagcatgac 2460 aaaaatctta gagcctttca gacaacaaaa tccagacata gtcatctatc aatacatgga 2520 tgatttgtat gtaggatctg acttagaaat agggcagcat agaacaaaga tagaggaact 2580 gagacaacat ctgttgaggt ggggatttac cacaccagac aaaaaacatc agaaagaacc 2640 tccattcctc tggatgggct atgaactcca tcctgataaa tggactgtgc agcctatagt 2700 gctgccagaa aaagatagtt ggactgtcaa tgacatacag aaattagtgg gaaaattgaa 2760 ttgggcaagt cagatttatg cagggattaa ggtaaggcaa ttatgtaaac tccttagggg 2820 aaccaaggca ctaacagaag taataccact aacagaagaa gcagagctag aactggcaga 2880 aaacagggaa attctaaaag aaccagtaca tggagtgtac tatgacccat caaaagactt 2940 aatagcagaa atacagaagc aggggcaagg ccaatggaca tatcaaattt atcaagagcc 3000 atttaaaaat ctaaaaacag gaaaatatgc aagaatgagg ggtgcccaca ctaatgatgt 3060 aaaacaatta acagaggcag tgcaaaaaat agccacagag agcatagtga tatggggaaa 3120 gattcctaaa tttagactac ccatacaaaa agagacatgg gaatcatggt ggacagacta 3180 ttggcaagcc acctggattc ctgagtggga atttgtcaat acccctccct tagtaaaatt 3240 atggtaccag ttagagaaag aacccatagt aggagtagaa actttctatg tagatggggc 3300 agctaacagg gagactaaat taggaaaagc aggatatgtt actgatagag gaagacaaaa 3360 agttgtctcc ctaactgaca caacaaatca gaagactgag ttacaagcaa ttcagatggc 3420 tttgcaggac tcgggattag aagtaaacat agtaacagac tcacaatatg cattaggaat 3480 cattcaagca caaccagata gaagtgaatc agaaatagtc aatcaaataa tagaacagtt 3540 aataaaaaag gaacgggtct acctgacatg ggtaccagca cacaaaggaa ttggaggaaa 3600 tgaacaagta gataagttag tcagtgctgg aatcaggaaa gtactatttt tagatggaat 3660 agataaggcc caagaagaac atgaaaaata tcacagtaat tggagagcta tggctagtga 3720 ttttaacctg ccacctgtgg tagcaaaaga aatagtagcc tgctgtgata aatgtcaaca 3780 aaaaggagag gccatgcatg gacaagtaga ctgtagtcca ggaatatggc aattagattg 3840 tacacatcta gaaggaaaag ttatcctggt agcagtgcat gtagccagtg gatatataga 3900 agcagaagtt attccagcag agacagggca ggaaacagca tacttcctct taaaattagc 3960 aggaagatgg ccagtaaaaa caatacatac agacaatggc agcaatttca ccagtactac 4020 ggttaaggct gcctgttggt gggcggggat caagcaggaa tttggcatcc cctacaatcc 4080 ccaaagtcaa ggagtagtag aatctatgaa taaagagtta aagaaaatta taggacaggt 4140 aagagatcag gctgaacatc tcaagacagc agtacaaatg gcagtgttta ttcacaattt 4200 taaaagaaaa ggggggattg ggggatacag tgcaggggaa agaatagtag acatgatagc 4260 aacagacata caaactaaag aattacaaaa acaaattaca aaaattcaaa attttcgggt 4320 ttattacagg gacagcagag atccactttg gaaaggacca gcaaagcttc tctggaaagg 4380 tgaaggggca gtagtaatac aagataatag tgacataaaa gtagtgccaa gaagaaaagc 4440 aaagataatt agggattatg gaaaacagat ggcaggtgat gattgtgtgg caagtagaca 4500 ggatgaggat tagagcatgg aaaagtctag taaaacacca tatgtatgtt tcaaaaaagg 4560 ctcagggatg gttttataga catcactatg acagtcgtca tccaagaata agttcagaag 4620 tacacatccc actaggggaa gctacattgg tcgtaacaac atattggggt ctgaatacag 4680 gagaaagaga ctggcatttg ggtcagggag tctccataga atggaggaaa aggagatata 4740 gcacacaagt agaccctaac ttagcagacc aactaattca tctgtattac tttgattgtt 4800 tttcagaatc cgctataaga aatgccttat taggacatat agttagacct aagtgtgcat 4860 atcaagcagg acataacaag gtaggatctc tacagtactt ggcactagta gcattaacaa 4920 caccaaaaaa gataaagcca cctttgccta gtgtcgcaaa attgacagag gataaatgga 4980 acaagcccca gaagatcaag ggccacagag ggagccatac actgaatgga cactagagct 5040 tttagaagag cttaagaatg aagctgctag acactttcct aggctgtggc tccatggttt 5100 aggacaacat atctatgaaa catatgggga tacttgggca ggagtggaag ccctaataag 5160 aattctgcaa caactgctgt ttattcattt cagaattggg tgtcaacata gcag 5214 // ID KC312339; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312339; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B17 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 866889a3c9b39635edd5a5bb0825097f. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B17" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N0D2" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N0D2" FT /protein_id="AGG76621.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQSKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKTVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPSQKQEPIDKDMYPMTSLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N274" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N274" FT /protein_id="AGG76622.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDSNSLSEAG FT ANRQGYVSHDFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESAIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N7I4" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N7I4" FT /protein_id="AGG76623.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYH4" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYH4" FT /protein_id="AGG76624.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1958 A; 921 C; 1241 G; 1096 T; 0 other; ctggtaacta gagatccctc agaccctttt gttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta aacaaatact ggcacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaagtaaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 caataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccccaccag cattctggat ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta caaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggcg gaggcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agactgttaa 1380 atgctttaat tgtggcaaag aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt tccttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacagcaac 1620 tccctctcag aagcaggagc caatagacaa ggatatgtat cccatgactt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccttgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctatgggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacag taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg gcctgaaaat ccatacaata ctccagtatt tgccataaag 2160 aagaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagtcca attaggaata ccacatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acaacctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga agtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcaaatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagcaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaaagggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aagccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agaaacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt taccagtact 4020 acggttaagg ctgcctgctg gtgggcgggg atcaagcaag aatttggcat cccctataat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctaaatt ggttgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacaa atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggagccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctagactgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312340; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312340; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B18 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 0ce84ef950674d3569e95db7c7aed0a8. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B18" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N789" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N789" FT /protein_id="AGG76625.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPVASLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N0D6" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N0D6" FT /protein_id="AGG76626.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGRDNNSPSEAG FT ANRQGDVSCSFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGXPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N280" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N280" FT /protein_id="AGG76627.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPVGEATLVVTTYWGLHTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNAILGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQRTKGHRGSHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYK1" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYK1" FT /protein_id="AGG76628.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1944 A; 919 C; 1254 G; 1098 T; 1 other; ctggtatcta gagatccctc agaccctgtt attcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gtgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaga aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta gacaaatact gacacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccagaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatct tgggattaaa 1020 taaaatagta aggatgtata gccccaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gagatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggca gaagcaatga gccaagcaac 1320 aaattcaccc gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 tccccctcag aagcaggagc caatagacaa ggagatgtat cctgtagctt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctataggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatttgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacagtgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatataaca ctccagtatt tgccataaag 2160 aaaaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagttca attaggaatr ccccatcccg cagggttaaa aaagaaaaaa 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagc acaaacaatg agacaccagg aattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagata tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa gatagaggaa 2580 ttgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acagcctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga ggtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa acaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaacgggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tggataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact 4020 acggttaagg ctgcctgttg gtgggcgggg atcaagcagg aatttggcat cccctacaat 4080 ccccaaagtc aaggggtagt agaatctatg aataaggagt taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccagtagggg aggctacatt ggtcgtaaca acatattggg gtctgcatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgccat attaggacat atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattgac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaggacca agggccacag agggagccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctaggctgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312341; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312341; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B2 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 32f05b2f370b84b858dca7cb79f99ca5. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B2" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4MYH8" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4MYH8" FT /protein_id="AGG76629.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAGGCRQILEQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQKGNFRNQRKSVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPVTSLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N791" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N791" FT /protein_id="AGG76630.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGRDNNSPSEAG FT ANRQGDVSCNFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDRSESEIVNKIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHIASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MZH9" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MZH9" FT /protein_id="AGG76631.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLHTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGNHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N286" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N286" FT /protein_id="AGG76632.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1959 A; 923 C; 1244 G; 1090 T; 0 other; ctggtaacta gagatccctc agaccctgtt attcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaaa cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gggggctgta gacaaatact ggaacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagctct agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtgcaaaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatct tgggattaaa 1020 taaaatagta aggatgtata gccccaccag cattttggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggcg gaggcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagaaagg caattttagg aaccaaagaa agagtgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg aaagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 tccccctcag aagcaggagc caatagacaa ggagatgtat cctgtaactt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctctg 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctataggt actgtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggctgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtatt tgccataaag 2160 aaaaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagttca gttaggaata ccccatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tattggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa ataggacagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acagcctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga ggtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcagat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagtagtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga tagaagtgaa tcagaaatag tcaataaaat aatagaacag 3540 ttaataaaaa aggaaagggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc aatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atatagccag tggatacata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaactt caccagtact 4020 acggttaagg ctgcctgttg gtgggcgggg atcaagcaag aatttggcat cccctacaat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccattagggg aggctacatt ggtcgtaaca acatattggg gtctacatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacaa atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagcgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggaaccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgct agacactttc ctaggctgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccctaata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312342; SV 1; linear; genomic RNA; STD; VRL; 5213 BP. XX AC KC312342; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B20 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5213 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5213 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; aed7e05104aecc511e439e7b5f23bb66. XX FH Key Location/Qualifiers FH FT source 1..5213 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B20" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 206..1708 FT /gene="gag" FT CDS 206..1708 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4MYK6" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4MYK6" FT /protein_id="AGG76633.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPVASLRSLFGNDPSSQ" FT gene <1501..4512 FT /gene="pol" FT CDS <1501..4512 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4MYI2" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4MYI2" FT /protein_id="AGG76634.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDNNSPSEAG FT ANRQGDVSCSFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNKIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEIIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIIDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4457..5035 FT /gene="vif" FT CDS 4457..5035 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N7I4" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N7I4" FT /protein_id="AGG76635.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHTMNGH" FT gene 4975..>5213 FT /gene="vpr" FT CDS 4975..>5213 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N0E3" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N0E3" FT /protein_id="AGG76636.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5213 BP; 1958 A; 917 C; 1242 G; 1096 T; 0 other; gtaactagag atccctcaga cccttttgtt cggtgtgcaa aatctctagc agtggcgccc 60 gaacagggac ttgaaagcga aaggaaaacc agaggagctc tctcgacgca ggactcggct 120 tgctgaagcg cgcacggcaa gaggcgaggg gtggcgactg gtgagtacgc caaacttttg 180 actagcggag gctagaagga gagagatggg tgcgagagcg tcggtattaa gcgggggtca 240 attggataga tgggaaaaaa ttcggttaag gccaggggga aaaaagcaat ataggttaaa 300 acatatagta tgggcaagca gggagctaga acgattcgca gtcaatcctg gcctgttaga 360 aacagcagag ggctgtagac aaatactgac acagctacaa ccagcccttc agacaggatc 420 agatgaactt agatcattat ataatacagt agcaaccctc tattgtgtac atcaaaggat 480 agaggtaaaa gacactaagg aagctttaga gaaaatagag gaggagcaaa ataaaagtaa 540 gaaaaaggca cagcaagcaa cagctgacac aggaaacagc agccaggtca gccaaaatta 600 ccctatagtg cagaaccttc aggggcaaat ggtacatcag gccatatcac ctagaacttt 660 aaatgcatgg gtaaaagtag tagaagagaa ggccttcagc ccagaagtaa tacccatgtt 720 ttcagcgtta tcagaaggag ccaccccaca agatttaaac accatgctaa acacagtggg 780 gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgatgaag ctgcagaatg 840 ggatagactg catccagtgc atgcagggcc tgttgcacca ggccagatga gagaaccaag 900 gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat ggatgaccaa 960 taatccacct atcccagtag gagagatcta taaaagatgg ataatcctgg gattaaataa 1020 aatagtaaga atgtatagcc ccaccagcat tctggatata agacaaggac caaaggaacc 1080 ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcagg cgtcacagga 1140 tgtaaaaact tggatgacag aaaccttgtt ggtccaaaat gcaaacccag attgtaagac 1200 tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag catgtcaggg 1260 agtgggagga cccagccata aagcaagagt tttggcggag gcaatgagcc aagcaacaaa 1320 ttcacctgcc ataatgatgc agagaggcaa ttttaggaac caaagaaaga ttgttaaatg 1380 ctttaattgt ggcaaagaag ggcacatagc cagaaattgc aaggccccta ggaaaagagg 1440 ctgttggaaa tgtggaaagg aaggacacca aatgaaagat tgtactgaga gacaggctaa 1500 ttttttaggg aaaatctggc cttcccacaa ggggaggcca gggaatttcc ttcagagcag 1560 accagagcca acagccccac cagaagagag cttcaggttt ggggaagaga caacaactcc 1620 ccctcagaag caggagccaa tagacaagga gatgtatcct gtagcttccc tcagatcact 1680 ctttggcaac gacccctcgt cacaataaag ataggggggc aactaaagga agctctatta 1740 gatacaggag cagatgatac agtattagaa gaaatgaccc tgccaggaaa atggaaacca 1800 aaaatgatag ggggaattgg aggttttatc aaagtaagac agtatgatca gatacccata 1860 gaaatctgtg gacacagagc tatgggtacg gtattagtag gacctacacc tgtcaacata 1920 attggaagaa atctgttgac tcagattggt tgcactttaa attttcccat tagtcctatt 1980 gaaacagtac cagtaaaatt aaagccagga atggatggcc caaaagttaa acaatggcca 2040 ttgacagaag aaaaaataaa agcattagta gaaatttgta cagaaatgga aaaggaaggg 2100 aaaatttcaa gaattgggcc tgaaaatcca tacaatactc cagtatttgc cataaagaaa 2160 aaagacagta ctaaatggag gaaattagta gatttcagag aacttaataa gaaaactcag 2220 gatttctggg aagttcaatt aggaatccca catcccgcag ggttaaaaaa gaaaaagtca 2280 gtaacagtac tggatgtggg ggatgcatat ttttcagttc ctttagataa agatttcagg 2340 aagtatactg catttaccat acctagtaca aacaatgaga caccagggat tagatatcag 2400 tacaatgtgc ttccacaggg atggaaagga tcaccagcaa tattccaaag tagcatgaca 2460 aaaatcttag agcctttcag acaacaaaat ccagacatag tcatctatca atacatggat 2520 gatttgtatg taggatctga cttagaaata gggcagcata gaacaaaaat agaggaactg 2580 agacaacatc tgttgaggtg gggatttacc acaccagaca aaaaacatca gaaagaacct 2640 ccattcctct ggatgggcta tgaactccat cctgataaat ggactgtaca acctatagtg 2700 ctgccagaaa aagatagttg gactgtcaat gacatacaga agttagtggg aaaattgaat 2760 tgggcaagtc agatttatgc agggattaaa gtaaggcaat tatgtaaact ccttagggga 2820 accaaggcac taacagaagt aataccacta acagaagaag cagagttaga actggcagaa 2880 aacagggaaa ttctaaaaga accagtacat ggagtgtact atgacccatc aaaagactta 2940 atagcagaaa tacagaagca ggggcaaggc caatggacat atcaaattta tcaagagcca 3000 tttaaaaatc taaaaacagg aaaatatgca agaatgaggg gtgcccacac taatgatgta 3060 aaacaattaa cagaggcagt gcaaaaaata gccacagaga gcatagtgat atggggaaag 3120 attcctaaat ttagactacc catacaaaaa gagacatggg aatcatggtg gacagactat 3180 tggcaagcca cctggattcc tgagtgggaa tttgtcaata cccctccctt agtaaaatta 3240 tggtaccagt tagagaaaga acccatagta ggagtagaaa ctttctatgt agatggggca 3300 gctaacaggg agactaaatt aggaaaagca ggatatgtta ctgatagagg aagacaaaaa 3360 gttgtctccc taactgacac aacaaatcag aagactgagt tacaagcaat tcagatggcc 3420 ttgcaggact cgggattaga agtaaacata gtaacagact cacaatatgc attaggaatc 3480 attcaagcac aaccagataa aagtgaatca gaaatagtca ataaaataat agagcagtta 3540 ataaaaaagg aacgggtcta cctgacatgg gtaccagcac acaaaggaat tggaggaaat 3600 gaacaagtag ataagttagt cagtgctgga atcaggaaag tactattttt agatggaata 3660 gataaggccc aagaagaaca tgaaaaatat cacagtaatt ggagagctat ggctagtgat 3720 tttaacctgc cacctgtggt agcaaaagaa atagtagcct gctgtgataa atgtcaacaa 3780 aaaggagaag ccatgcatgg acaagtagac tgtagtccag gaatatggca attagattgt 3840 acacatctag aaggaaaagt tatcctggta gcagtgcatg tagccagtgg atatatagaa 3900 gcagaaatta ttccagcaga gacagggcag gaaacagcat acttcctctt aaaattagca 3960 ggaagatggc cagtaaaaac aatacataca gacaatggca gcaatttcac cagtactacg 4020 gttaaggctg cctgttggtg ggcggggatc aagcaggaat ttggcatccc ctacaatccc 4080 caaagtcaag gggtagtaga atctatgaat aaagaattaa agaaaattat aggacaggta 4140 agagatcagg ctgaacatct caagacagca gtacaaatgg cagtattcat tcacaatttt 4200 aaaagaaaag gggggattgg gggatacagt gcaggggaaa gaataataga catgatagca 4260 acagacatac aaactaaaga attacaaaaa caaattacaa aaattcaaaa ttttcgggtt 4320 tattacaggg acagcagaga tccactttgg aaaggaccag caaagcttct ctggaaaggt 4380 gaaggggcag tagtaataca agataatagt gacataaaag tagtgccaag aagaaaagca 4440 aagataatta gggattatgg aaaacagatg gcaggtgatg attgtgtggc aagtagacag 4500 gatgaggatt agagcatgga aaagtctagt aaaacaccat atgtatgttt caaaaaaggc 4560 tcagggatgg ttttatagac atcactatga cagtcgtcat ccaagaataa gttcagaagt 4620 acacatccca ctaggggagg ctaaattggt tgtaacaaca tattggggtc tgaatacagg 4680 agaaagagac tggcatttgg gtcagggagt ctccatagaa tggaggaaaa ggagatatag 4740 cacacaagta gaccctaact tagcagacca actaattcat ctgtattact ttgattgttt 4800 ttcagaatcc gctataagaa atgccttatt aggacaaata gttagaccta agtgtgcata 4860 tcaagcagga cataacaagg taggatctct acagtacttg gcactagtag cattaacaac 4920 accaaaaaag ataaagccac ctttgcctag tgtcgcaaaa ttgacagagg atagatggaa 4980 caagccccag aagaccaagg gccacagagg gagccataca atgaatggac actagagctt 5040 ttagaggagc ttaagaatga agctgttaga cactttccta gactgtggct ccatggttta 5100 ggacaacata tctatgaaac atatggggat acttgggcag gagtggaagc cataataaga 5160 attctgcaac aactgctgtt tattcatttc agaattgggt gtcaacatag cag 5213 // ID KC312343; SV 1; linear; genomic RNA; STD; VRL; 5215 BP. XX AC KC312343; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B21 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5215 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5215 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 34b07a3cac9cdf242b783a72243cf455. XX FH Key Location/Qualifiers FH FT source 1..5215 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B21" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 208..1710 FT /gene="gag" FT CDS 208..1710 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N291" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N291" FT /protein_id="AGG76637.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADAGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKNWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPPQKQEPIDKDMYPITSLRSLFGNDPSSQ" FT gene <1503..4514 FT /gene="pol" FT CDS <1503..4514 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4MYL2" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4MYL2" FT /protein_id="AGG76638.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRGELQVWGRDSNSSSEAG FT ANRQGYVSHNFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4459..5037 FT /gene="vif" FT CDS 4459..5037 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MYI7" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MYI7" FT /protein_id="AGG76639.1" FT /translation="MENRWQVMIVWQVDRMRIKAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRKAILGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKTKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHIMNGH" FT gene 4977..>5215 FT /gene="vpr" FT CDS 4977..>5215 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N797" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N797" FT /protein_id="AGG76640.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5215 BP; 1954 A; 919 C; 1247 G; 1095 T; 0 other; tggtaactag agatccctca gaccctattg ttcggtgtgc aaaatctcta gcagtggcgc 60 ccgaacaggg acttgaaaac gaaaggaaaa ccagaggagc tctctcgacg caggactcgg 120 cttgctgaag cgcgcacggc aagaggcgag gggtggcgac tggtgagtac gccaaacttt 180 tgactagcgg aggctagaag gagagagatg ggtgcgagag cgtcggtatt aagcgggggt 240 caattggata gatgggagaa aattcggtta aggccagggg gaaaaaagca atataggtta 300 aaacatatag tatgggcaag cagggagcta gaacgattcg cagtcaatcc tggcctgtta 360 gaaacagcag agggctgtag acaaatactg actcagctac aaccagccct tcagacagga 420 tcagatgaac ttagatcact atataataca gtagcaaccc tctattgtgt acatcaaagg 480 atagaggtaa aagacaccaa ggaagcttta gagaaaatag aggaggagca aaataaaagt 540 aagaaaaagg cacagcaagc aacagctgac gcaggaaaca gcagccaggt cagccaaaat 600 taccctatag tgcagaacct tcaggggcaa atggtacatc aggctatatc acctagaact 660 ttaaatgcat gggtaaaagt agtagaagag aaggccttca gcccagaagt aatacccatg 720 ttttcagcgt tatcagaagg agccacccca caagatttaa acaccatgct aaacacagtg 780 gggggacatc aagcagccat gcaaatgtta aaagagacca tcaatgatga agctgcagaa 840 tgggatagac tgcatccagt gcatgcaggg cctgttgcac caggccagat gagagaacca 900 aggggaagtg acatagcagg aactactagt acccttcagg aacaaatagg atggatgaca 960 aataatccac ctatcccagt aggagagatc tataaaagat ggataatctt gggattaaat 1020 aaaatagtaa gaatgtatag ccctaccagc attctggaca taagacaagg accaaaggaa 1080 ccctttagag actatgtaga ccggttctat aaaactctaa gagccgagca ggcgtcacag 1140 gatgtaaaaa attggatgac agaaaccttg ttggtccaaa atgcaaaccc agattgtaag 1200 actattttaa aagcattggg accagcagct acactagaag agatgatgac agcatgtcag 1260 ggagtgggag gacccagcca taaagcaaga gttttggcag aagcaatgag ccaagcaaca 1320 aattcacctg ccataatgat gcagagaggc aattttagga accaaagaaa gattgttaaa 1380 tgctttaatt gtggcaagga agggcacata gccagaaatt gtaaggcccc taggaaaaga 1440 ggctgttgga aatgtggaaa ggaaggacac cagatgaaag attgtactga gagacaggct 1500 aattttttag ggaaaatctg gccttcccac aaggggaggc cagggaattt tcttcagagc 1560 agaccagagc caacagcccc accagaggag agcttcaggt ttggggaaga gacagcaact 1620 cctcctcaga agcaggagcc aatagacaag gatatgtatc ccataacttc cctcagatca 1680 ctctttggca acgacccctc gtcacaataa agataggggg gcaactaaag gaagctctat 1740 tagatacagg agcagatgat acagtattag aagaaatgac cctgccagga aaatggaaac 1800 caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat cagataccca 1860 tagaaatctg tggacataga gctataggta cggtattagt aggacctaca cctgtcaaca 1920 taattggaag aaatctgttg actcagattg gctgcacttt aaattttccc attagtccta 1980 ttgaaacggt accagtaaaa ttaaagccag gaatggatgg cccaaaagtt aaacaatggc 2040 cattgacaga agaaaaaata aaagcattag tagaaatttg cacagaaatg gaaaaggaag 2100 ggaaaatttc aagaattgga cctgaaaatc catataacac tccagtattt gccataaaga 2160 aaaaagacag tactaaatgg agaaaattag tggatttcag agaacttaat aagaaaactc 2220 aagatttctg ggaagttcaa ttaggaatac cccatcccgc agggttaaaa aagaaaaagt 2280 cagtaacagt actggatgtg ggggatgcat atttttcagt tcctttagat aaagatttca 2340 ggaagtatac tgcatttacc atacctagta caaacaatga gacaccaggg attagatatc 2400 aatacaatgt gcttccacag ggatggaaag gatcaccagc aatattccaa agtagcatga 2460 caaaaatctt agagcctttc agacaacaaa atccagacat agtcatctat caatacatgg 2520 acgatttgta tgtaggatct gacttagaaa tagggcagca tagaacaaaa atagaggaac 2580 tgagacaaca tctgttgagg tggggattta ccacaccaga caaaaaacat cagaaagaac 2640 ctccattcct ctggatgggc tatgaactcc atcctgataa atggactgta cagcctatag 2700 tgctgccaga aaaagatagt tggactgtca atgacataca gaagttagtg ggaaaattga 2760 attgggcaag tcagatttat gcagggatta aagtaaggca attatgtaaa ctccttaggg 2820 gaaccaaggc actaacagag gtaataccac taacagaaga agcagagtta gaactggcag 2880 aaaacaggga aattctaaaa gaaccagtac atggagtgta ctatgaccca tcaaaagact 2940 taatagcaga aatacagaag caggggcaag gccaatggac atatcaaatt tatcaagagc 3000 catttaaaaa tctaaaaaca ggaaaatatg caagaatgag gggtgcccac actaatgatg 3060 taaaacaatt aacagaggca gtgcaaaaaa tagccacaga gagcatagtg atatggggaa 3120 agattcctaa attcagacta cccatacaaa aagagacatg ggaatcatgg tggacagact 3180 attggcaagc cacctggatt cctgagtggg agtttgttaa tacccctccc ttagtaaaat 3240 tatggtacca gttagagaaa gaacccatag taggagtaga aactttctat gtagatgggg 3300 cagctaacag ggagactaaa ttaggaaaag caggatatgt tactgataga ggaagacaaa 3360 aagttgtctc cctaactgac acaacaaatc agaagactga gttacaagca attcagatgg 3420 ctttgcagga ctcgggatta gaagtaaaca tagtaacaga ctcacaatat gcattaggaa 3480 tcattcaagc acaaccagat aaaagtgaat cagaaatagt caatcaaata atagaacagt 3540 taataaaaaa ggaaagggtc tacctgacat gggtaccagc acacaaagga attggaggaa 3600 atgaacaagt agataagtta gtcagtgctg gaatcaggaa agtactattt ttagatggaa 3660 tagataaggc ccaagaagaa catgaaaaat atcacagtaa ttggagagct atggctagtg 3720 attttaacct gccacctgtg gtagcaaaag aaatagtagc ctgctgtgat aaatgtcaac 3780 aaaaaggaga ggccatgcat ggacaagtag actgtagtcc aggaatatgg caattagatt 3840 gtacacattt agaaggaaaa gttatcctgg tagcagtgca tgtagccagt ggatatatag 3900 aagcagaagt tattccagca gagacagggc aggaaacagc atacttcctc ttaaaattag 3960 caggaagatg gccagtaaaa acaatacata cagacaatgg cagcaatttc accagtacta 4020 cggttaaggc tgcctgttgg tgggcgggga tcaagcagga atttggcatc ccctacaatc 4080 cccaaagtca aggggtagta gaatctatga ataaggagtt aaagaaaatt ataggacagg 4140 taagagatca ggctgaacat ctcaagacag cagtacaaat ggcagtattc attcacaatt 4200 ttaaaagaaa aggggggatt gggggataca gtgcagggga aagaatagta gacatgatag 4260 caacagacat acaaactaaa gaattacaaa aacaaattac aaaaattcaa aattttcggg 4320 tttattacag ggacagcaga gacccacttt ggaaaggacc agcaaagctt ctctggaaag 4380 gtgaaggggc agtagtaata caagataata gtgacataaa agtagtgcca agaagaaaag 4440 caaagataat tagggattat ggaaaacaga tggcaggtga tgattgtgtg gcaagtagac 4500 aggatgagga ttaaagcatg gaaaagtcta gtaaaacacc atatgtatgt ttcaaaaaag 4560 gctcagggat ggttttatag acatcactat gacagtcgtc atccaagaat aagttcagaa 4620 gtacacatcc cactagggga agctacattg gtcgtaacaa catattgggg tctgaataca 4680 ggagaaagag actggcattt gggtcaggga gtctccatag aatggaggaa aaggagatat 4740 agcacacaag tagaccctaa cctagcagac caactaattc atctgtatta ctttgattgt 4800 ttttcagaat ctgctataag aaaagccata ttaggacata tagttagacc taagtgtgca 4860 tatcaagcag gacataacaa ggtaggatct ctacagtact tggcactagt agcattaaca 4920 acaccaaaaa agacaaagcc acctttgcct agtgtcgcaa aattgacaga ggatagatgg 4980 aacaagcccc agaagaccaa gggccacaga gggagccata taatgaatgg acactagagc 5040 ttttagagga gcttaagaat gaagctgcta gacactttcc taggctgtgg ctccatggct 5100 taggacaaca tatctatgaa acatatgggg atacttgggc aggagtggaa gccctaataa 5160 gaattctgca acaactgctg tttattcatt tcagaattgg gtgtcaacat agcag 5215 // ID KC312344; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312344; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B23 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 1327f57a8f756960b90d8f47ab8263e3. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B23" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N0E8" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N0E8" FT /protein_id="AGG76641.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKTQQATADTGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKSVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPVASLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N296" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N296" FT /protein_id="AGG76642.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGRDNNSPSEAG FT ANRQGDVSCSFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLKAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MYL9" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MYL9" FT /protein_id="AGG76643.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYISKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLHTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGNHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYJ2" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYJ2" FT /protein_id="AGG76644.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1951 A; 923 C; 1248 G; 1094 T; 0 other; ctggtaacta gagatccctc agaccctgtt tttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaaa cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta aacaaatact ggcacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatcat tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag acacagcaag caacagctga cacaggaaac agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc tccaggggca aatggtacat caagccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttt agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaacaatcca cctatcccag taggagaaat ctataaaaga tggataatct tgggattaaa 1020 taaaatagta agaatgtata gccctaccag cattttggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggcg gaggcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agagtgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg aaagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 tccccctcag aagcaggagc caatagacaa ggagatgtat cctgtagctt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctataggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagccc ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtgtt tgccataaag 2160 aaaaaagaca gtactaaatg gagaaaatta gtagatttta gagaacttaa taagaaaact 2220 caagatttct gggaagttca attaggaata ccacatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgct tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atttgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acagcctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga ggtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaaaagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttaggct acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gagtttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagacggg 3300 gcagctaata gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gctttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaacgggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaggaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga caaatgtcaa 3780 caaaaaggag aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact 4020 acggttaagg ctgcctgttg gtgggcgggg atcaagcagg aatttggcat cccctacaat 4080 ccccaaagtc aaggggtagt agaatccatg aataaagagt tgaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtgtt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaaggaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagcct agtaaaacac catatgtata tttcaaaaaa 4560 agctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctacatt ggtcgtaaca acatattggg gtctgcatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacat atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggaaccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgct agacactttc ctaggctgtg gctccatggg 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccctaata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312345; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312345; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B24 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 32501ffe9dfc0a3747397f4865fecf40. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B24" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N7A0" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N7A0" FT /protein_id="AGG76645.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAGGCRQILEQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQTTADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPVTSLRSLFGNDPLSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N0F3" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N0F3" FT /protein_id="AGG76646.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDNNSPSEAG FT ANRQGDVSCNFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N299" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N299" FT /protein_id="AGG76647.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYM4" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYM4" FT /protein_id="AGG76648.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1957 A; 919 C; 1244 G; 1096 T; 0 other; ctggtaacta gagatccctc agaccctgtt attcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaaa cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacggttc gccgtcaatc ctggcctgtt 360 agaaacagca gggggctgta gacaaatact ggaacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatcat tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaaa caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacac caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccctaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggca gaagcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt tccttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 tccccctcag aagcaggagc caatagacaa ggagatgtat cctgtaactt ccctcagatc 1680 actctttggc aacgacccct tgtcacaata aaaatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctataggt acggtattgg taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtgtt tgccataaag 2160 aaaaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagttca attaggaata ccacatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acaacctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaatta 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga ggtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtta atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaacgggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agaaacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact 4020 acggttaagg ctgcctgttg gtgggcgggg atcaagcagg aatttggcat cccctacaat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tcttaagaca gcagtacaaa tggcagtatt catccacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agacccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgatagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctaaatt ggttgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagatccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacat atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aagttgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggagccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgct agacactttc ctaggctgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccctaata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312346; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312346; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B3 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 70bd055f1d398b1623559beab0ea8e49. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B3" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4MYJ8" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4MYJ8" FT /protein_id="AGG76649.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKNWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPRKQEPIDKDMYPITSLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N7A3" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N7A3" FT /protein_id="AGG76650.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDNNSPSEAR FT ANRQGYVSHNFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWELWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIIDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N0I0" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N0I0" FT /protein_id="AGG76651.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNAILGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQRTKGHRGSHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N2A3" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N2A3" FT /protein_id="AGG76652.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1958 A; 917 C; 1242 G; 1099 T; 0 other; ctggtatcta gagatccctc agaccctgtt attcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaga aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta gacaaatact gacacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatcat tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaagggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtggaaga gaaggccttc agcccagaag taatacccat 720 gttttcagca ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatct tgggattaaa 1020 taaaatagta agaatgtata gccctaccag cattttggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa aattggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gagatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggca gaagcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt tccttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 tccccctcgg aagcaagagc caatagacaa ggatatgtat cccataactt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctataggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacagtgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatataaca ctccagtatt tgccataaag 2160 aaaaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagttca attaggaata ccccatcccg cagggttaaa aaagaaaaaa 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg aattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa gatagaggaa 2580 ttgagacaac atctgttgag gtggggattt accacgccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acagcctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga ggtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa acaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaattatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaag gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaacgggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agaaacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact 4020 acggttaagg ctgcctgttg gtgggcgggg atcaagcagg aatttggcat cccctacaat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaataat agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacat catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aagctacatt ggtcgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgccat attaggacat atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaggacca agggccacag agggagccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctaggctgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312347; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312347; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B4 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; a6c75b2cc5ee223ab0c9cacf21ab3ea8. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B4" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4MYM9" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4MYM9" FT /protein_id="AGG76653.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPIASLKSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4MYK4" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4MYK4" FT /protein_id="AGG76654.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGRDNNSPSEAG FT ANRQGNVSHSFPQITLWQRPLVTVKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPSGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIAELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPDKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDRSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGHVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIIDIIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIREYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N7A6" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N7A6" FT /protein_id="AGG76655.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGNHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N0G1" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N0G1" FT /protein_id="AGG76656.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1962 A; 930 C; 1234 G; 1090 T; 0 other; ctggtaacta gagatccctc agaccctgtt actcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacctgaaaa cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggct aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta gacaaatact gacacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatcat tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaaac agcagccagg tcagccaaaa 600 ttaccctata gtacagaacc ttcaggggca aatggtacat caggccatct cacccagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agtccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 taataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccctaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taagactcta agagccgagc aagcatcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gaccatttta aaagcattgg gaccagcagc tacactagaa gagatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggca gaagcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg aaagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 tccccctcag aagcaggagc caatagacaa ggaaatgtat cccatagctt ccctcaaatc 1680 actctttggc aacgacccct cgtcacagta aaaatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctataggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggctgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatataaca ctccagtatt tgccataaag 2160 aaaaaagaca gtactaaatg gagaaagtta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagttca attaggaata ccacatccct cagggttaaa gaagaaaaag 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgacttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagcggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acagcctata 2700 gtgctgccag acaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga agtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag agatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtacaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gagtttgtca atactcctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccttaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gctttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga tagaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggagagggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagaa aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccgcctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatacata 3900 gaagcagaag ttattccagc agaaacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact 4020 acagttaagg ctgcctgttg gtgggcgggg atcaagcagg aatttggcat cccctacaat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacac 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaataat agacataata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggaata tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aagctacatt ggtcgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacat atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggaaccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgct agacactttc ctaggctgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccctaata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312348; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312348; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B6 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; b1c2b3542b8a6d854d01a8ea3896e64d. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B6" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N2F2" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N2F2" FT /protein_id="AGG76657.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQSKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPSQKQEPIDKDMYPMTSLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4MYN4" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4MYN4" FT /protein_id="AGG76658.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDSNSLSEAG FT ANRQGYVSHDFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESAIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N7I4" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N7I4" FT /protein_id="AGG76659.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N7A8" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N7A8" FT /protein_id="AGG76660.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1957 A; 920 C; 1242 G; 1097 T; 0 other; ctggtaacta gagatccctc agaccctttt gttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta aacaaatact ggcacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaagtaaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 caataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccccaccag cattctggat ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta caaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggcg gaggcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaaag aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt tccttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacagcaac 1620 tccctctcag aagcaggagc caatagacaa ggatatgtat cccatgactt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccttgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctatgggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacag taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtatt tgccataaag 2160 aagaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagtcca attaggaata ccacatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acaacctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga agtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagcaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaaagggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aagccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caagaaacag catacttcct cttaaaatta 3960 gcagggagat ggccagtaaa aacaatacat acagacaatg gcagcaattt taccagtact 4020 acggttaagg ctgcctgctg gtgggcgggg atcaagcaag aatttggcat cccctataat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctaaatt ggttgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacaa atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggagccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctagactgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312349; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312349; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B8 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; ee9d5e41855dc712666529d35c17ebb0. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B8" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N0G7" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N0G7" FT /protein_id="AGG76661.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQSKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLKEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPSQKQEPIDKDMYPMTSLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N2B3" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N2B3" FT /protein_id="AGG76662.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDSNSLSEAG FT ANRQGYVSHDFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESAIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MYP1" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MYP1" FT /protein_id="AGG76663.1" FT /translation="MENRWQVMIVWQVDRMRIRAGKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVASTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYL5" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYL5" FT /protein_id="AGG76664.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1958 A; 921 C; 1242 G; 1095 T; 0 other; ctggtaacta gagatccctc agaccctttt gttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta aacaaatact ggcacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaagtaaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 caataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccccaccag cattctggat ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta caaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactaaaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggcg gaggcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaaag aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt tccttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacagcaac 1620 tccctctcag aagcaggagc caatagacaa ggatatgtat cccatgactt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccttgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctatgggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacag taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtatt tgccataaag 2160 aagaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagtcca attaggaata ccacatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acaacctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga agtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagcaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaaagggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aagccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt taccagtact 4020 acggttaagg ctgcctgctg gtgggcgggg atcaagcaag aatttggcat cccctataat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcag ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctaaatt ggttgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacaa atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcatcaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggagccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctagactgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312350; SV 1; linear; genomic RNA; STD; VRL; 5206 BP. XX AC KC312350; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_B9 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5206 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5206 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; c1326dc0c5c4bd05690d14675291ecbf. XX FH Key Location/Qualifiers FH FT source 1..5206 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_B9" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 208..1701 FT /gene="gag" FT CDS 208..1701 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N7B1" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N7B1" FT /protein_id="AGG76665.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQTTADTGSSSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKAFSP FT EVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGPVAP FT GQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILD FT IRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPAATL FT EEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRSQRKIVKCFNCGKEGHIA FT RNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAPPEE FT SFRFGEETTTPPQKQEPIDKDMYPITSLRSLFGNDPSSQ" FT gene <1494..4505 FT /gene="pol" FT CDS <1494..4505 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N0H3" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N0H3" FT /protein_id="AGG76666.1" FT /translation="FFRENLAFPQGEARKFSSEQTRANSPTRRELQVWGRDNNSPSEAG FT ANRQGYVSHNFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGRWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4450..5028 FT /gene="vif" FT CDS 4450..5028 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MYT4" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MYT4" FT /protein_id="AGG76667.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNAILGHIVSPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHIMNGH" FT gene 4968..>5206 FT /gene="vpr" FT CDS 4968..>5206 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYP8" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYP8" FT /protein_id="AGG76668.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5206 BP; 1954 A; 918 C; 1243 G; 1091 T; 0 other; tggtaactag agatccctca gaccctgtta ttcggtgtgc aaaatctcta gcagtggcgc 60 ccgaacaggg acttgaaagc gaaaggaaaa ccagaggagc tctctcgacg caggactcgg 120 cttgctgaag cgcgcacggc aagaggcgag gggtggcgac tggtgagtac gccaaacttt 180 tgactagcgg aggctagaag gagagagatg ggtgcgagag cgtcggtatt aagcgggggt 240 caattggata gatgggagaa aattcggtta aggccagggg gaaaaaagca atataggtta 300 aaacatatag tatgggcaag cagggagcta gaacgattcg cagtcaatcc tggcctgtta 360 gaaacagcag agggctgtag acaaatactg acacagctac aaccagccct tcagacagga 420 tcagatgaac ttagatcttt atataataca gtagcaaccc tctattgtgt acatcaaagg 480 atagaggtaa aagacaccaa ggaagcttta gagaaaatag aggaggagca aaataaaagt 540 aagaaaaagg cacagcaaac aacagctgac acaggaagca gcagccaaaa ttaccctata 600 gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac tttaaatgcg 660 tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat gttttcagcg 720 ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt ggggggacat 780 caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga atgggataga 840 ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc aaggggaagt 900 gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac aaataatcca 960 cctatcccag taggagagat ctataaaaga tggataatct tgggattaaa taaaatagta 1020 agaatgtata gccctaccag cattctggac ataagacaag gaccaaagga accctttaga 1080 gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca ggatgtaaaa 1140 acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa gaccatttta 1200 aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca gggagtggga 1260 ggacccagcc ataaagcaag agtcttggcg gaagcaatga gccaagcaac aaattcaccc 1320 gccataatga tgcagagagg caattttagg agccaaagaa agattgttaa atgctttaat 1380 tgtggcaagg aagggcacat agccagaaat tgtaaggccc ctaggaaaag aggctgttgg 1440 aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc taatttttta 1500 gggaaaatct ggccttccca caaggggagg ccaggaaatt ttcttcagag cagaccagag 1560 ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac tccccctcag 1620 aagcaggagc caatagacaa ggatatgtat cccataactt ccctcagatc actctttggc 1680 aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta ttagatacag 1740 gagcagatga tacagtatta gaagaaatga ccctgccagg aagatggaaa ccaaaaatga 1800 tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc atagaaatct 1860 gtggacacag agctatgggt acggtattag taggacctac acctgtcaac ataattggaa 1920 gaaatctgtt gactcagatt ggctgcactt taaattttcc cattagtcct attgaaacgg 1980 taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg ccattgacag 2040 aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa gggaaaattt 2100 caagaattgg acctgaaaat ccatacaata ctccagtatt tgccataaag aagaaagaca 2160 gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact caagatttct 2220 gggaagtcca attaggaata ccacatcccg cagggttaaa aaagaaaaag tcagtaacag 2280 tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc aggaagtata 2340 ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat cagtacaatg 2400 tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg acaaaaatct 2460 tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg gatgatttat 2520 atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa ctgagacaac 2580 atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa ccgccattcc 2640 tttggatggg ctatgaactc catcctgata aatggactgt gcagcctata gtgctgccag 2700 aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg aattgggcaa 2760 gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg ggaaccaagg 2820 cactaacaga agtaatacca ctaacagaag aagcagagtt agaactggca gaaaacaggg 2880 aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac ttaatagcag 2940 aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagaa ccatttaaaa 3000 atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat gtaaaacaat 3060 taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga aagattccta 3120 aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac tattggcaag 3180 ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa ttatggtacc 3240 agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg gcagctaaca 3300 gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa aaagttgtct 3360 ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg gctttgcagg 3420 actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga atcattcaag 3480 cacaaccaga taaaagtgaa tcagaaatag tcaatcaaat aatagaacag ttaataaaaa 3540 aggaaagggt ctacctgaca tgggtaccag cacacaaagg aattggagga aatgaacaag 3600 tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga atagataagg 3660 cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt gattttaacc 3720 tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa caaaaaggag 3780 aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat tgtacacacc 3840 tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata gaagcagaag 3900 ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta gcaggaagat 3960 ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact acggtcaagg 4020 ctgcctgctg gtgggcgggg atcaagcaag aatttggtat cccctacaat ccccaaagtc 4080 aaggggtagt agaatctatg aataaagagt taaagaaaat tataggacag gtaagagatc 4140 aggctgaaca tctcaagaca gcagtacaaa tggcagtatt catccacaat tttaaaagaa 4200 aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata gcaacagaca 4260 tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca 4320 gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa ggtgaagggg 4380 cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa gcaaagataa 4440 ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga caggatgagg 4500 attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa ggctcaggga 4560 tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga agtacacatc 4620 ccactagggg aggctaaatt ggttgtaaca acatattggg gtctgaatac aggagaaaga 4680 gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata tagcacacaa 4740 gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg tttttcagaa 4800 tccgctataa gaaatgccat attaggacat atagttagcc ctaagtgtgc atatcaagca 4860 ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac aacaccaaaa 4920 aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg gaacaagccc 4980 cagaagacca agggccacag agggagccat ataatgaatg gacactagag cttttagagg 5040 agcttaagaa tgaagctgtt agacactttc ctaggctgtg gctccatggt ttaggacaac 5100 atatctatga aacatatggg gatacttggg caggagtgga agccataata agaattctgc 5160 aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5206 // ID KC312351; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312351; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_C1 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; bfd191db360b024f91650bf2c2c5b2b2. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_C1" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4MYM1" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4MYM1" FT /protein_id="AGG76669.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQSKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKNWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPVASLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N7B4" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N7B4" FT /protein_id="AGG76670.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGRDNNSPSEAG FT ANRQGDVSCSFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQKFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIIDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N0I0" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N0I0" FT /protein_id="AGG76671.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNAILGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQRTKGHRGSHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N2C3" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N2C3" FT /protein_id="AGG76672.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1958 A; 918 C; 1243 G; 1097 T; 0 other; ctggtatcta gagatccctc agaccctgtt attcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gtgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaga aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta gacaaatact gacacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaagtaaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgagg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccctaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagctgagc aggcgtcaca 1140 ggatgtaaaa aattggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggca gaagcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgcttcaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 tccccctcag aagcaggagc caatagacaa ggagatgtat cctgtagctt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctataggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaagat aaaagcatta gtagaaattt gtacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg gcctgaaaat ccatacaata ctccagtatt tgccataaag 2160 aaaaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taaaaaaact 2220 caagatttct gggaagttca attaggaata ccccatcccg cagggttaaa aaagaaaaaa 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg aattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa gatagaggaa 2580 ttgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acagcctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga ggtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa acaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaacgggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact 4020 acggttaaag ctgcctgttg gtgggcgggg atcaagcaaa aatttggcat cccctacaac 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaataat agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aagctacatt ggtcgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgccat attaggacat atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaggacca agggccacag agggagccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctaggctgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312352; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312352; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_C10 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 185c35c18cc164591e7b29861c33f038. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_C10" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4MYQ3" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4MYQ3" FT /protein_id="AGG76673.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAGGCRQILEQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKSVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPPQKQEPIDKEMYPVTSLRSLFGNDPLSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4MYM8" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4MYM8" FT /protein_id="AGG76674.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGRDSNSPSEAG FT ANRQGNVSCNFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDRSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIIDIIATDIQTKELQK FT QITKIQNFRVYYRDSKDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N7B7" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N7B7" FT /protein_id="AGG76675.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLHTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQRTKGHRGSHTLNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N0I5" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N0I5" FT /protein_id="AGG76676.1" FT /translation="MEQAPEDQGPQREPYTEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1953 A; 920 C; 1245 G; 1098 T; 0 other; ctggtaacta gagatccctc agaccctgtt tttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacctgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaga aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gggggctgta gacaaatact ggaacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatcat tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtgcaaaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgct aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatct tgggattaaa 1020 taaaatagta agaatgtata gccccaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagtc ataaagcaag agttttggca gaagcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agagtgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg aaagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacagcaac 1620 tccccctcag aagcaggagc caatagacaa ggaaatgtat cctgtaactt ccctcagatc 1680 actctttggc aacgacccct tgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacatcg agctataggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatttgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacag taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaagat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtgtt tgccataaag 2160 aaaaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagttca attaggaata ccccatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tgctggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caattttcca aagtagcatg 2460 acaaaaatct tagaaccttt cagacaacaa aatccagata tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag atggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acagcctata 2700 gtgctgccag aaaaggatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga agtaatacca ctaacagaag aagcagagct agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttggtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga tagaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaaagggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aagccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaactt caccagtact 4020 acggttaagg ctgcctgttg gtgggcgggg atcaagcagg aatttggcat cccctacaat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tcttaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaataat agacataata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcaa agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctaaatt ggttgtaaca acatattggg gtctgcatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacat atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaggacca agggccacag agggagccat acactgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgct agacactttc ctaggctgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccctaata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312353; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312353; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_C11 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 6bd9bb0fb424769fcabedd10127c3343. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_C11" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N2D0" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N2D0" FT /protein_id="AGG76677.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQSKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPIASLKSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4MYQ8" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4MYQ8" FT /protein_id="AGG76678.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGGDNNSPSEAG FT ANRQGNVSHSFPQITLWQRPLVTVKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEQEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQTEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MYN3" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MYN3" FT /protein_id="AGG76679.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVITTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKIKGHRESHTLNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N7C0" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N7C0" FT /protein_id="AGG76680.1" FT /translation="MEQAPEDQGPQREPYTEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1952 A; 925 C; 1242 G; 1097 T; 0 other; ctggtgacta gagatccctc agaccctttt gttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaagaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gtgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaga aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta gacaaatact gacacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacta aggaagcttt agagaaaata gaggaggagc aaagtaaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccctaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggcg gaggcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgcttcaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggagg agacaacaac 1620 tccccctcag aagcaggagc caatagacaa ggaaatgtat cccatagctt ccctcaaatc 1680 actctttggc aacgacccct cgtcacagta aaaatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctataggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaagat aaaagcatta gtagaaattt gtacagaaat ggaacaggaa 2100 gggaaaattt caagaattgg gcctgaaaat ccatacaata ctccagtatt tgccataaag 2160 aaaaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagttca attaggaata ccccatcccg cagggttaaa aaagaaaaag 2280 tccgtaacag tactggatgt gggggatgca tatttttcag ttcccttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttat atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctaagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagacagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acagcctata 2700 gtgctgccag aaaaagatag ctggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga ggtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt attatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta agtttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atactcctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaacgggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagacaagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaagggg aagccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatccta gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact 4020 acggttaaag ctgcctgttg gtgggcgggg atcaagcaag aatttggcat cccctacaac 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tcttaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agacccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctaaatt ggttataaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacat atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagatca agggccacag agagagccat acactgaatg gacactagag 5040 cttttagagg aacttaagaa tgaagctgct agacactttc ctaggctgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccctaata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312354; SV 1; linear; genomic RNA; STD; VRL; 5213 BP. XX AC KC312354; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_C12 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5213 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5213 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 681ea8ea1c6d72ccc02a6ed887a7bad6. XX FH Key Location/Qualifiers FH FT source 1..5213 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_C12" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1708 FT /gene="gag" FT CDS 209..1708 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N0J2" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N0J2" FT /protein_id="AGG76681.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRSQRKSVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTPPQKQEPIDKEMYPVASLRSLFGNDPSSQ" FT gene <1504..4512 FT /gene="pol" FT CDS <1504..4512 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N2D5" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N2D5" FT /protein_id="AGG76682.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGRDNSPSEAGA FT NRQGDVSCSFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIGG FT IGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETVP FT VKLKPGMDGPKVKQWPLTEEKIKALVEICAEMEKEGKISRIGPENPYNTPVFAIKKKDS FT TKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKEFRKY FT TAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMDD FT LYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPIV FT LPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELELA FT ENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHTN FT DVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPPL FT VKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQA FT IQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAHK FT GIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVAC FT CDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQETA FT YFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNKE FT LKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQKQ FT ITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQM FT AGDDCVASRQDED" FT gene 4457..5035 FT /gene="vif" FT CDS 4457..5035 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MYR4" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MYR4" FT /protein_id="AGG76683.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLHTGERDWHLGQGVSIEWRKRRYRTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHIMNGH" FT gene 4975..>5213 FT /gene="vpr" FT CDS 4975..>5213 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYN8" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYN8" FT /protein_id="AGG76684.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAARHFPRLWLHGLGQH FT IYETYGDTWVGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5213 BP; 1944 A; 922 C; 1254 G; 1093 T; 0 other; ctggtaacta gagatccctc agaccctgtt tttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta aacaaatact ggcacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaaac agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagagg taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagaaat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta aggatgtata gccctaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcggc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggca gaagcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg agccaaagaa agagtgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg aaagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaactcc 1620 ccctcagaag caggagccaa tagacaagga gatgtatcct gtagcttccc tcagatcact 1680 ctttggcaac gacccctcgt cacaataaag ataggggggc aactaaagga agctctatta 1740 gatacaggag cagatgatac agtattagaa gaaatgaccc tgccaggaaa atggaaacca 1800 aaaatgatag ggggaattgg aggttttatc aaagtaagac agtatgatca gatacccata 1860 gaaatctgtg gacatagagc tataggtacg gtattagtag gacctacacc tgtcaacata 1920 attggaagaa atctgttgac tcagattggt tgcactttaa attttcccat tagtcctatt 1980 gaaacggtac cagtaaaatt aaagccagga atggatggcc caaaagttaa acaatggcca 2040 ttgacagaag agaaaataaa agcattagta gaaatttgcg cagaaatgga aaaggaaggg 2100 aaaatttcaa gaattggacc tgaaaatcca tacaatactc cagtgtttgc cataaagaaa 2160 aaagacagta ctaaatggag aaaattagta gatttcagag aacttaataa gaaaactcaa 2220 gatttctggg aagttcaatt aggaataccc catcccgcag ggttaaaaaa gaaaaagtca 2280 gtaacagtac tggatgtggg ggatgcatat ttttcagttc ctttagataa agagttcagg 2340 aagtatactg catttaccat acctagtaca aacaatgaga caccagggat tagatatcag 2400 tacaatgtgc tgccacaggg atggaaagga tcaccagcaa tattccaaag tagcatgaca 2460 aaaatcttag agcctttcag acaacaaaat ccagacatag tcatctatca atacatggat 2520 gatttgtatg taggatctga cttagaaata gggcagcata gaacaaaaat agaggaactg 2580 agacaacatc tgttgaggtg gggatttacc acaccagaca aaaaacatca gaaagaacct 2640 ccattcctct ggatgggcta tgaactccat cctgataaat ggactgtaca gcctatagtg 2700 ctgccagaaa aagatagttg gactgtcaat gacatacaga agttagtggg aaaattgaat 2760 tgggcaagtc agatttatgc agggattaaa gtaaggcaat tatgtaaact ccttagggga 2820 accaaggcac taacagaggt aataccacta acagaagaag cagagttaga actggcagaa 2880 aacagggaaa ttctaaaaga accagtacat ggagtgtact atgacccatc aaaagactta 2940 atagcagaga tacagaagca ggggcaaggc caatggacat atcaaattta tcaagagcca 3000 tttaaaaatc taaaaacagg aaaatatgca agaatgaggg gtgcccacac taatgatgta 3060 aaacaattaa cagaggcagt gcaaaaaata gccacagaga gcatagtaat atggggaaag 3120 attcctaaat ttagactacc catacaaaaa gagacatggg aatcatggtg gacagactat 3180 tggcaagcca cctggattcc tgagtgggaa tttgtcaata cccctccctt agtaaaatta 3240 tggtaccagt tagagaaaga acccatagta ggagtagaaa ctttctatgt agatggggca 3300 gctaacaggg agactaaatt aggaaaagca ggatatgtta ctgatagagg aagacaaaaa 3360 gttgtctccc taactgacac aacaaatcag aagactgagt tacaagcaat tcagatggcc 3420 ttgcaggact cgggattaga agtaaacata gtaacagact cacaatatgc attaggaatc 3480 attcaagcac aaccagataa aagtgaatca gaaatagtca atcaaataat agaacagtta 3540 ataaaaaagg aacgggtcta cctgacatgg gtaccagcac acaaaggaat tggaggaaat 3600 gaacaagtag ataagttagt cagtgctgga atcaggaaag tactattttt agatggaata 3660 gataaggccc aagaagaaca tgaaaaatat cacagtaatt ggagagctat ggctagtgat 3720 tttaacctgc cacctgtggt agcaaaagaa atagtagcct gctgtgataa atgtcaacaa 3780 aaaggagagg ccatgcatgg acaagtagac tgtagtccag gaatatggca attagattgt 3840 acacatctag aaggaaaagt tatcctggta gcagtgcatg tagccagtgg atacatagaa 3900 gcagaagtta ttccagcaga gacagggcag gaaacagcat acttcctctt aaaattagca 3960 ggaagatggc cagtaaaaac aatacataca gacaatggca gcaatttcac cagtactacg 4020 gttaaggctg cctgttggtg ggcggggatc aagcaggaat ttggcatccc ctacaatccc 4080 caaagtcaag gggtagtaga atccatgaat aaagagttga agaaaattat aggacaggta 4140 agagatcagg ctgaacatct caagacagca gtacaaatgg cagtgttcat tcacaatttt 4200 aaaagaaaag gggggattgg gggatacagt gcaggggaaa gaatagtaga catgatagca 4260 acagacatac aaactaaaga attacaaaaa caaattacaa aaattcaaaa ttttcgggtt 4320 tattacaggg acagcagaga tccactttgg aaaggaccag caaagcttct ctggaaaggt 4380 gaaggggcag tagtaataca agataatagt gacataaaag tagtgccaag aagaaaagca 4440 aagataatta gggattatgg aaaacagatg gcaggtgatg attgtgtggc aagtagacag 4500 gatgaggatt agagcatgga aaagtctagt aaaacaccat atgtatgttt caaaaaaggc 4560 tcagggatgg ttttatagac atcactatga cagtcgtcat ccaagaataa gttcagaagt 4620 acacatccca ctaggggagg ctacattggt cgtaacaaca tattggggtc tacatacagg 4680 agaaagagac tggcatttgg gtcagggagt ctccatagaa tggaggaaaa ggagatatag 4740 aacacaagta gaccctaacc tagcagacca actaattcat ctgtattact ttgattgttt 4800 ttcagaatct gctataagaa atgccctatt aggacatata gttagaccta agtgtgcata 4860 tcaagcagga cataacaagg taggatctct acagtacttg gcactagtag cattaacaac 4920 accaaaaaag ataaagccac ctttgcctag tgtcgcaaaa ttgacagagg atagatggaa 4980 caagccccag aagaccaagg gccacagagg gagccatata atgaatggac actagagctt 5040 ttagaggagc ttaagaatga agctgctaga cactttccta ggctgtggct ccatggttta 5100 ggacaacata tctatgaaac atatggggat acttgggtag gagtggaagc cctaataaga 5160 attctgcaac aactgctgtt tattcatttc agaattgggt gtcaacatag cag 5213 // ID KC312355; SV 1; linear; genomic RNA; STD; VRL; 5215 BP. XX AC KC312355; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_C13 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5215 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5215 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; d8ef86b396bd4fc7a0b1d992a3f4174e. XX FH Key Location/Qualifiers FH FT source 1..5215 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_C13" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 208..1710 FT /gene="gag" FT CDS 208..1710 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N7C2" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N7C2" FT /protein_id="AGG76685.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQSKSKKKAQQATADTGNSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRSNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPSQKQEPIDKDMYPMTSLRSLFGNDPSSQ" FT gene <1503..4514 FT /gene="pol" FT CDS <1503..4514 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N0J8" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N0J8" FT /protein_id="AGG76686.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDSNSLSEAG FT ANRQGYVSHDFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLLAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESAIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCTPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDKD" FT gene 4459..5037 FT /gene="vif" FT CDS 4459..5037 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N2E1" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N2E1" FT /protein_id="AGG76687.1" FT /translation="MENRWQVMIVWQVDRIRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHTMNGH" FT gene 4977..>5215 FT /gene="vpr" FT CDS 4977..>5215 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYR8" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYR8" FT /protein_id="AGG76688.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5215 BP; 1955 A; 922 C; 1241 G; 1097 T; 0 other; tggtaactag agatccctca gacccttttg ttcggtgtac aaaatctcta gcagtggcgc 60 ccgaacaggg acttgaaagc gaaaggaaaa ccagaggagc tctctcgacg caggactcgg 120 cttgctgaag cgcgcacggc aagaggcgag gggtggcgac tggtgagtac gccaaacttt 180 tgactagcgg aggctagaag gagagagatg ggtgcgagag cgtcggtatt aagcgggggt 240 caattggata gatgggaaaa aattcggtta aggccagggg gaaaaaagca atataggtta 300 aaacatatag tatgggcaag cagggagcta gaacgattcg cagtcaatcc tggcctgtta 360 gaaacagcag agggctgtaa acaaatactg gcacagctac aaccagccct tcagacagga 420 tcagatgaac ttagatcttt atataataca gtagcaaccc tctattgtgt acatcaaagg 480 atagaggtaa aagacaccaa ggaagcttta gagaaaatag aggaggagca aagtaaaagt 540 aagaaaaagg cacagcaagc aacagctgac acaggaaaca gcagccaggt cagccaaaat 600 taccctatag tgcagaacct tcaggggcaa atggtacatc aggccatatc acctagaact 660 ttaaatgcat gggtaaaagt agtagaagag aaggccttca gcccagaagt aatacccatg 720 ttttcagcgt tatcagaagg agccacccca caagatttaa acaccatgct aaacacagtg 780 gggggacatc aagcagccat gcaaatgtta aaagagacca tcaatgatga agctgcagaa 840 tgggatagac tgcatccagt gcatgcaggg cctgttgcac caggccagat gagagaacca 900 aggggaagtg acatagcagg aactactagt acccttcagg aacaaatagg atggatgacc 960 aataatccac ctatcccagt aggagagatc tataaaagat ggataatcct gggattaaat 1020 aaaatagtaa gaatgtatag ccccaccagc attctggata taagacaagg accaaaggaa 1080 ccctttagag actatgtaga ccggttctat aaaactctaa gagccgagca ggcgtcacag 1140 gatgtaaaaa cttggatgac agaaaccttg ttggtccaaa atgcaaaccc agattgtaag 1200 actattttaa aagcattggg accagcagct acactagaag aaatgatgac agcatgtcag 1260 ggagtgggag gacccagcca taaagcaaga gttttggcgg aggcaatgag ccaagcaaca 1320 aattcacctg ccataatgat gcagagaagc aattttagga accaaagaaa gattgttaaa 1380 tgctttaatt gtggcaaaga agggcacata gccagaaatt gcaaggcccc taggaaaaga 1440 ggctgttgga aatgtggaaa ggaaggacac caaatgaaag attgtactga gagacaggct 1500 aattttttag ggaaaatctg gccttcccac aaggggaggc cagggaattt ccttcagagc 1560 agaccagagc caacagcccc accagaagag agcttcaggt ttggggaaga gacagcaact 1620 ccctctcaga agcaggagcc aatagacaag gatatgtatc ccatgacttc cctcagatca 1680 ctctttggca acgacccctc gtcacaataa agataggggg gcaactaaag gaagctctat 1740 tagatacagg agcagatgat acagtattag aagaaatgac cctgccagga aaatggaaac 1800 caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat cagataccca 1860 tagaaatctg tggacacaga gctataggta cggtattagt aggacctaca cctgtcaaca 1920 taattggaag aaatctgttg actcagattg gttgcacttt aaattttccc attagtccta 1980 ttgaaacggt accagtaaaa ttaaagccag gaatggatgg cccaaaagtt aaacaatggc 2040 cattgacaga agaaaagata aaagcattag tagaaatttg tacagaaatg gaaaaggaag 2100 ggaaaatttc aagaattggg cctgaaaatc catacaatac tccagtattt gccataaaga 2160 aaaaagacag tactaaatgg aggaaattag tagatttcag agaacttaat aagaaaactc 2220 aggatttctg ggaagttcaa ttaggaatcc cacatcccgc agggttaaaa aagaaaaagt 2280 cagtaacagt gctggatgtg ggggatgcat atttttcagt tcccttagat aaagatttca 2340 ggaagtatac tgcatttacc atacctagta caaacaatga gacaccaggg attagatatc 2400 agtacaatgt gcttccacag ggatggaaag gatcaccagc aatattccaa agtagcatga 2460 caaaaatctt agaacctttc agacaacaaa atccagacat agtcatctat caatacatgg 2520 atgatttgta tgtaggatct gacttagaaa tagggcagca tagaacaaaa atagaggaac 2580 tgagacaaca tctgttgagg tggggattta ccacaccaga caaaaaacat cagaaagaac 2640 ctccattcct ctggatgggc tatgaactcc atcctgataa atggactgta cagcctatag 2700 tgctgccaga aaaagatagt tggactgtca atgacataca gaagttagtg ggaaaattga 2760 attgggcaag tcagatttat gcagggatta aagtaaggca attatgtaaa ctccttaggg 2820 gaaccaaggc actaacagaa gtaataccac taacagaaga agcagagcta gaactggcag 2880 aaaacaggga aattctaaaa gaaccagtac atggagtgta ctatgaccca tcaaaagact 2940 tattagcaga aatacagaag caggggcaag gccaatggac atatcaaatt tatcaagagc 3000 catttaaaaa tctaaaaaca ggaaaatatg caagaatgag gggtgcccac actaatgatg 3060 taaaacaatt aacagaggca gtgcaaaaaa tagccacaga gagcatagtg atatggggaa 3120 agattcctaa atttagacta cccatacaaa aagagacatg ggaatcatgg tggacagact 3180 attggcaagc cacctggatt cctgagtggg aatttgtcaa tacccctccc ttagtaaaat 3240 tatggtacca gttagagaaa gaacccatag taggagtaga aactttctat gtagatgggg 3300 cagctaacag ggagactaaa ttaggaaaag caggatatgt tactgataga ggaagacaaa 3360 aagttgtctc cctaactgac acaacaaatc agaagactga gttacaagca attcagatgg 3420 ccttgcagga ctcgggatta gaagtaaaca tagtaacaga ctcacaatat gcattaggaa 3480 tcattcaagc acaaccagat aaaagtgaat cagcaatagt caatcaaata atagaacagt 3540 taataaaaaa ggaaagggtc tacctgacat gggtaccagc acacaaagga attggaggaa 3600 atgaacaagt agataagtta gtcagtgctg gaatcaggaa agtactattt ttagatggaa 3660 tagataaggc ccaagaagaa catgaaaaat atcacagtaa ttggagagct atggctagtg 3720 attttaacct gccacctgtg gtagcaaaag aaatagtagc ctgctgtgat aaatgtcaac 3780 aaaaaggaga agccatgcat ggacaagtag actgtactcc aggaatatgg caattagatt 3840 gtacacatct agaaggaaaa gttatcctgg tagcagtgca tgtagccagt ggatatatag 3900 aagcagaagt tattccagca gagacagggc aggaaacagc atacttcctc ttaaaattag 3960 caggaagatg gccagtaaaa acaatacata cagacaatgg cagcaatttt accagtacta 4020 cggttaaggc tgcctgctgg tgggcgggga tcaagcaaga atttggcatc ccctataatc 4080 cccaaagtca aggggtagta gaatctatga ataaagaatt aaagaaaatt ataggacagg 4140 taagagatca ggctgaacat ctcaagacag cagtacaaat ggcagtattc attcacaatt 4200 ttaaaagaaa aggggggatt gggggataca gtgcagggga aagaatagta gacatgatag 4260 caacagacat acaaactaaa gaattacaaa aacaaattac aaaaattcaa aattttcggg 4320 tttattacag ggacagcaga gatccacttt ggaaaggacc agcaaagctt ctctggaaag 4380 gtgaaggggc agtagtaata caagataata gtgacataaa agtagtgcca agaagaaaag 4440 caaagataat tagggattat ggaaaacaga tggcaggtga tgattgtgtg gcaagtagac 4500 aggataagga ttagagcatg gaaaagtcta gtaaaacacc atatgtatgt ttcaaaaaag 4560 gctcagggat ggttttatag acatcactat gacagtcgtc atccaagaat aagttcagaa 4620 gtacacatcc cactagggga ggctaaattg gttgtaacaa catattgggg tctgaataca 4680 ggagaaagag actggcattt gggtcaggga gtctccatag aatggaggaa aaggagatat 4740 agcacacaag tagaccctaa cttagcagac caactaattc atctgtatta ctttgattgt 4800 ttttcagaat ccgctataag aaatgcctta ttaggacaaa tagttagacc taagtgtgca 4860 tatcaagcag gacataacaa ggtaggatct ctacagtact tggcactagt agcattaaca 4920 acaccaaaaa agataaagcc acctttgcct agtgtcgcaa aattgacaga ggacagatgg 4980 aacaagcccc agaagaccaa gggccacaga gggagccata caatgaatgg acactagagc 5040 ttttagagga gcttaagaat gaagctgtta gacactttcc tagactgtgg ctccatggtt 5100 taggacaaca tatctatgaa acatatgggg atacttgggc aggagtggaa gccataataa 5160 gaattctgca acaactgctg tttattcatt tcagaattgg gtgtcaacat agcag 5215 // ID KC312356; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312356; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_C14 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; b74590a5d2db7eba446f02728781f606. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_C14" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N2F2" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N2F2" FT /protein_id="AGG76689.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQSKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPSQKQEPIDKDMYPMTSLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N7C4" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N7C4" FT /protein_id="AGG76690.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDSNSLSEAG FT ANRQGYVSHDFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESAIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N7I4" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N7I4" FT /protein_id="AGG76691.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N2E8" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N2E8" FT /protein_id="AGG76692.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1955 A; 921 C; 1244 G; 1096 T; 0 other; ctggtaacta gagatccctc aggccctttt gttcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaaa aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta aacaaatact ggcacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaagtaaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 caataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccccaccag cattctggat ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta caaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggcg gaggcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaaag aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt tccttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacagcaac 1620 tccctctcag aagcaggagc caatagacaa ggatatgtat cccatgactt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccttgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctatgggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacag taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtatt tgccataaag 2160 aagaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaagact 2220 caagatttct gggaagtcca attaggaata ccacatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acaacctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga agtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa actaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagcaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaaagggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aagccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt taccagtact 4020 acggttaagg ctgcctgctg gtgggcgggg atcaagcaag aatttggcat cccctataat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctaaatt ggttgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgcctt attaggacaa atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggagccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctagactgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312357; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312357; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_C16 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; bfb069c6fc97210f7908f1bcc8d22787. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_C16" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N789" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N789" FT /protein_id="AGG76693.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPIDKEMYPVASLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4MYQ0" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4MYQ0" FT /protein_id="AGG76694.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGRDNNSPSEAG FT ANRQGDVSCSFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N7C7" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N7C7" FT /protein_id="AGG76695.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNAILGHIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHTMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N0L1" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N0L1" FT /protein_id="AGG76696.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1957 A; 919 C; 1243 G; 1097 T; 0 other; ctggtatcta gagatccctc agaccctgtt attcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacttgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gtgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaga aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta gacaaatact gacacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaag caacagctga cacaggaagc agcagccagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 caataatcca cctatcccag taggagagat ctataaaaga tggataatcc tgggattaaa 1020 taaaatagta agaatgtata gccccaccag cattctggat ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta caaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gactatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agttttggca gaagcaatga gccaagcaac 1320 aaattcacct gccataatga tgcagagagg caattttagg aaccaaagaa agattgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgcaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccagggaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 tccccctcag aagcaggagc caatagacaa ggagatgtat cctgtagctt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctgccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctatgggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggttgcactt taaattttcc cattagtcct 1980 attgaaacag taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtatt tgccataaag 2160 aagaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagtcca attaggaata ccacatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttgt atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 cctccattcc tctggatggg ctatgaactc catcctgata aatggactgt acaacctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga agtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagag 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagcatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gccttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaacgggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aagccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacatc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt taccagtact 4020 acggttaagg ctgcctgctg gtgggcgggg atcaagcaag aatttggcat cccctataat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagaat taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt cattcacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctaaatt ggttgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgccat attaggacat atagttagac ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattgac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggagccat acaatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctagactgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312358; SV 1; linear; genomic RNA; STD; VRL; 5213 BP. XX AC KC312358; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_C17 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5213 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5213 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 35de668de3034a20e76c2d62732ea7e6. XX FH Key Location/Qualifiers FH FT source 1..5213 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_C17" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 206..1708 FT /gene="gag" FT CDS 206..1708 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N2F2" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N2F2" FT /protein_id="AGG76697.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQSKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPSQKQEPIDKDMYPMTSLRSLFGNDPSSQ" FT gene <1501..4512 FT /gene="pol" FT CDS <1501..4512 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4MYS8" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4MYS8" FT /protein_id="AGG76698.1" FT /translation="FFRENLAFPQGEAREFPSEQTRANSPTRRELQVWGRDSNSLSEAG FT ANRQGYVSHDFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWELWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESAIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4457..5035 FT /gene="vif" FT CDS 4457..5035 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4N7I4" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4N7I4" FT /protein_id="AGG76699.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHTMNGH" FT gene 4975..>5213 FT /gene="vpr" FT CDS 4975..>5213 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4N7D0" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4N7D0" FT /protein_id="AGG76700.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5213 BP; 1955 A; 918 C; 1243 G; 1097 T; 0 other; gtaactagag atccctcaga cccttttgtt cggtgtgcaa aagctctagc agtggcgccc 60 gaacagggac ttgaaagcga aaggaaaacc agaggagctc tctcgacgca ggactcggct 120 tgctgaagcg cgcacggcaa gaggcgaggg gtggcgactg gtgagtacgc caaacttttg 180 actagcggag gctagaagga gagagatggg tgcgagagcg tcggtattaa gcgggggtca 240 attggataga tgggaaaaaa ttcggttaag gccaggggga aaaaagcaat ataggttaaa 300 acatatagta tgggcaagca gggagctaga acgattcgca gtcaatcctg gcctgttaga 360 aacagcagag ggctgtaaac aaatactggc acagctacaa ccagcccttc agacaggatc 420 agatgaactt agatctttat ataatacagt agcaaccctc tattgtgtac atcaaaggat 480 agaggtaaaa gacaccaagg aagctttaga gaaaatagag gaggagcaaa gtaaaagtaa 540 gaaaaaggca cagcaagcaa cagctgacac aggaagcagc agccaggtca gccaaaatta 600 ccctatagtg cagaaccttc aggggcaaat ggtacatcag gccatatcac ctagaacttt 660 aaatgcatgg gtaaaagtag tagaagagaa ggccttcagc ccagaagtaa tacccatgtt 720 ttcagcgtta tcagaaggag ccaccccaca agatttaaac accatgctaa acacagtggg 780 gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgatgaag ctgcagaatg 840 ggatagactg catccagtgc atgcagggcc tgttgcacca ggccagatga gagaaccaag 900 gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat ggatgaccaa 960 taatccacct atcccagtag gagagatcta taaaagatgg ataatcctgg gattaaataa 1020 aatagtaaga atgtatagcc ccaccagcat tctggatata agacaaggac caaaggaacc 1080 ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcagg cgtcacagga 1140 tgtaaaaact tggatgacag aaaccttgtt ggtccaaaat gcaaacccag attgtaagac 1200 tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag catgtcaggg 1260 agtgggagga cccagccata aagcaagagt tttggcggag gcaatgagcc aagcaacaaa 1320 ttcacctgcc ataatgatgc agagaggcaa ttttaggaac caaagaaaga ttgttaaatg 1380 ctttaattgt ggcaaagaag ggcacatagc cagaaattgc aaggccccta ggaaaagagg 1440 ctgttggaaa tgtggaaagg aaggacacca aatgaaagat tgtactgaga gacaggctaa 1500 ttttttaggg aaaatctggc cttcccacaa ggggaggcca gggaatttcc ttcagagcag 1560 accagagcca acagccccac cagaagagag cttcaggttt ggggaagaga cagcaactcc 1620 ctctcagaag caggagccaa tagacaagga tatgtatccc atgacttccc tcagatcact 1680 ctttggcaac gacccctcgt cacaataaag ataggggggc aactaaagga agctctatta 1740 gatacaggag cagatgatac agtattagaa gaaatgacct tgccaggaaa atggaaacca 1800 aaaatgatag ggggaattgg aggttttatc aaagtaagac agtatgatca gatacccata 1860 gaaatctgtg gacacagagc tatgggtacg gtattagtag gacctacacc tgtcaacata 1920 attggaagaa atctgttgac tcagattggt tgcactttaa attttcccat tagtcctatt 1980 gaaacagtac cagtaaaatt aaagccagga atggatggcc caaaagttaa acaatggcca 2040 ttgacagaag aaaaaataaa agcattagta gaaatttgca cagaaatgga aaaggaaggg 2100 aaaatttcaa gaattggacc tgaaaatcca tacaatactc cagtatttgc cataaagaag 2160 aaagacagta ctaaatggag aaaattagta gatttcagag aacttaataa gaaaactcaa 2220 gatttctggg aagtccaatt aggaatacca catcccgcag ggttaaaaaa gaaaaagtca 2280 gtaacagtac tggatgtggg ggatgcatat ttttcagttc ctttagataa agatttcagg 2340 aagtatactg catttaccat acctagtaca aacaatgaga caccagggat tagatatcag 2400 tacaatgtgc ttccacaggg atggaaagga tcaccagcaa tattccaaag tagcatgaca 2460 aaaatcttag agcctttcag acaacaaaat ccagacatag tcatctatca atacatggat 2520 gatttgtatg taggatctga cttagaaata gggcagcata gaacaaaaat agaggaactg 2580 agacaacatc tgttgaggtg gggatttacc acaccagaca aaaaacatca gaaagaacct 2640 ccattcctct ggatgggcta tgaactccat cctgataaat ggactgtaca acctatagtg 2700 ctgccagaaa aagatagttg gactgtcaat gacatacaga agttagtggg aaaattgaat 2760 tgggcaagtc agatttatgc agggattaaa gtaaggcaat tatgtaaact ccttagggga 2820 accaaggcac taacagaagt aataccacta acagaagaag cagagttaga actggcagaa 2880 aacagggaaa ttctaaaaga accagtacat ggagtgtact atgacccatc aaaagactta 2940 atagcagaaa tacagaagca ggggcaaggc caatggacat atcaaattta tcaagagcca 3000 tttaaaaatc taaaaacagg aaaatatgca agaatgaggg gtgcccacac taatgatgta 3060 aaacaattaa cagaggcagt gcaaaaaata gccacagaga gcatagtgat atggggaaag 3120 attcctaaat ttagactacc catacaaaaa gagacatggg aattatggtg gacagactat 3180 tggcaagcca cctggattcc tgagtgggaa tttgtcaata cccctccctt agtaaaatta 3240 tggtaccagt tagagaaaga acccatagta ggagtagaaa ctttctatgt agatggggca 3300 gctaacaggg agactaaatt aggaaaagca ggatatgtta ctgatagagg aagacaaaaa 3360 gttgtctccc taactgacac aacaaatcag aagactgagt tacaagcaat tcagatggcc 3420 ttgcaggact cgggattaga agtaaacata gtaacagact cacaatatgc attaggaatc 3480 attcaagcac aaccagataa aagtgaatca gcaatagtca atcaaataat agaacagtta 3540 ataaaaaagg aaagggtcta cctgacatgg gtaccagcac acaaaggaat tggaggaaat 3600 gaacaagtag ataagttagt cagtgctgga atcaggaaag tactattttt agatggaata 3660 gataaggccc aagaagaaca tgaaaaatat cacagtaatt ggagagctat ggctagtgat 3720 tttaacctgc cacctgtggt agcaaaagaa atagtagcct gctgtgataa atgtcaacaa 3780 aaaggagaag ccatgcatgg acaagtagac tgtagtcccg gaatatggca attagattgt 3840 acacatctag aaggaaaagt tatcctggta gcagtgcatg tagccagtgg atatatagaa 3900 gcagaagtta ttccagcaga gacagggcag gaaacagcat acttcctctt aaaattagca 3960 ggaagatggc cagtaaaaac aatacataca gacaatggca gcaattttac cagtactacg 4020 gttaaggctg cctgctggtg ggcggggatc aagcaagaat ttggcatccc ctataatccc 4080 caaagtcaag gggtagtaga atctatgaat aaagaattaa agaaaattat aggacaggta 4140 agagatcagg ctgaacatct caagacagca gtacaaatgg cagtattcat tcacaatttt 4200 aaaagaaaag gggggattgg gggatacagt gcaggggaaa gaatagtaga catgatagca 4260 acagacatac aaactaaaga attacaaaaa caaattacaa aaattcaaaa ttttcgggtt 4320 tattacaggg acagcagaga tccactttgg aaaggaccag caaagcttct ctggaaaggt 4380 gaaggggcag tagtaataca agataatagt gacataaaag tagtgccaag aagaaaagca 4440 aagataatta gggattatgg aaaacagatg gcaggtgatg attgtgtggc aagtagacag 4500 gatgaggatt agagcatgga aaagtctagt aaaacaccat atgtatgttt caaaaaaggc 4560 tcagggatgg ttttatagac atcactatga cagtcgtcat ccaagaataa gttcagaagt 4620 acacatccca ctaggggagg ctaaattggt tgtaacaaca tattggggtc tgaatacagg 4680 agaaagagac tggcatttgg gtcagggagt ctccatagaa tggaggaaaa ggagatatag 4740 cacacaagta gaccctaact tagcagacca actaattcat ctgtattact ttgattgttt 4800 ttcagaatcc gctataagaa atgccttatt aggacaaata gttagaccta agtgtgcata 4860 tcaagcagga cataacaagg taggatctct acagtacttg gcactagtag cattaacaac 4920 accaaaaaag ataaagccac ctttgcctag tgtcgcaaaa ttgacagagg atagatggaa 4980 caagccccag aagaccaagg gccacagagg gagccataca atgaatggac actagagctt 5040 ttagaggagc ttaagaatga agctgttaga cactttccta ggctgtggct ccatggttta 5100 ggacaacata tctatgaaac atatggggat acttgggcag gagtggaagc cataataaga 5160 attctgcaac aactgctgtt tattcatttc agaattgggt gtcaacatag cag 5213 // ID KC312359; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312359; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_C18 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; a1f776ec573c386a331a2b6c8f9a8bf7. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_C18" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N0L6" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N0L6" FT /protein_id="AGG76701.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQTTADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRSQRKIVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETTTPPQKQEPVDKDMYPITSLRSLFGNDPSSQ" FT gene <1504..4515 FT /gene="pol" FT CDS <1504..4515 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N2F9" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N2F9" FT /protein_id="AGG76702.1" FT /translation="FFRENLAFPQGEARKFSSEQTRANSPTRRELQVWGRDNNSPSEAG FT ASRQGYVSHNFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAMGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDKSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4460..5038 FT /gene="vif" FT CDS 4460..5038 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MYT4" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MYT4" FT /protein_id="AGG76703.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEAKLVVTTYWGLNTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNAILGHIVSPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGSHIMNGH" FT gene 4978..>5216 FT /gene="vpr" FT CDS 4978..>5216 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYR3" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYR3" FT /protein_id="AGG76704.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKNEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEAIIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5216 BP; 1958 A; 921 C; 1244 G; 1093 T; 0 other; ctggtaacta gagatccctc agaccctgtt attcggtgtg caaaatctct agcagtggcg 60 cccgaacagg gacctgaaag cgaaaggaaa accagaggag ctctctcgac gcaggactcg 120 gcttgctgaa gcgcgcacgg caagaggcga ggggtggcga ctggtgagta cgccaaactt 180 ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg 240 tcaattggat agatgggaga aaattcggtt aaggccaggg ggaaaaaagc aatataggtt 300 aaaacatata gtatgggcaa gcagggagct agaacgattc gcagtcaatc ctggcctgtt 360 agaaacagca gagggctgta gacaaatact gacacagcta caaccagccc ttcagacagg 420 atcagatgaa cttagatctt tatataatac agtagcaacc ctctattgtg tacatcaaag 480 gatagaggta aaagacacca aggaagcttt agagaaaata gaggaggagc aaaataaaag 540 taagaaaaag gcacagcaaa caacagctga cacaggaagc agcagtcagg tcagccaaaa 600 ttaccctata gtgcagaacc ttcaggggca aatggtacat caggccatat cacctagaac 660 tttaaatgca tgggtaaaag tagtagaaga gaaggccttc agcccagaag taatacccat 720 gttttcagcg ttatcagaag gagccacccc acaagattta aacaccatgc taaacacagt 780 ggggggacat caagcagcca tgcaaatgtt aaaagagacc atcaatgatg aagctgcaga 840 atgggataga ctgcatccag tgcatgcagg gcctgttgca ccaggccaga tgagagaacc 900 aaggggaagt gacatagcag gaactactag tacccttcag gaacaaatag gatggatgac 960 aaataatcca cctatcccag taggagagat ctataaaaga tggataatct tgggattaaa 1020 taaaatagta agaatgtata gccctaccag cattctggac ataagacaag gaccaaagga 1080 accctttaga gactatgtag accggttcta taaaactcta agagccgagc aggcgtcaca 1140 ggatgtaaaa acttggatga cagaaacctt gttggtccaa aatgcaaacc cagattgtaa 1200 gaccatttta aaagcattgg gaccagcagc tacactagaa gaaatgatga cagcatgtca 1260 gggagtggga ggacccagcc ataaagcaag agtcttggcg gaagcaatga gccaagcaac 1320 aaattcaccc gccataatga tgcagagagg caattttagg agccaaagaa agattgttaa 1380 atgctttaat tgtggcaagg aagggcacat agccagaaat tgtaaggccc ctaggaaaag 1440 aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa gattgtactg agagacaggc 1500 taatttttta gggaaaatct ggccttccca caaggggagg ccaggaaatt ttcttcagag 1560 cagaccagag ccaacagccc caccagaaga gagcttcagg tttggggaag agacaacaac 1620 tccccctcag aagcaggagc cagtagacaa ggatatgtat cccataactt ccctcagatc 1680 actctttggc aacgacccct cgtcacaata aagatagggg ggcaactaaa ggaagctcta 1740 ttagatacag gagcagatga tacagtatta gaagaaatga ccctaccagg aaaatggaaa 1800 ccaaaaatga tagggggaat tggaggtttt atcaaagtaa gacagtatga tcagataccc 1860 atagaaatct gtggacacag agctatgggt acggtattag taggacctac acctgtcaac 1920 ataattggaa gaaatctgtt gactcagatt ggctgcactt taaattttcc cattagtcct 1980 attgaaacgg taccagtaaa attaaagcca ggaatggatg gcccaaaagt taaacaatgg 2040 ccattgacag aagaaaaaat aaaagcatta gtagaaattt gcacagaaat ggaaaaggaa 2100 gggaaaattt caagaattgg acctgaaaat ccatacaata ctccagtatt tgccataaag 2160 aagaaagaca gtactaaatg gagaaaatta gtagatttca gagaacttaa taagaaaact 2220 caagatttct gggaagtcca attaggaata ccacatcccg cagggttaaa aaagaaaaag 2280 tcagtaacag tactggatgt gggggatgca tatttttcag ttcctttaga taaagatttc 2340 aggaagtata ctgcatttac catacctagt acaaacaatg agacaccagg gattagatat 2400 cagtacaatg tgcttccaca gggatggaaa ggatcaccag caatattcca aagtagcatg 2460 acaaaaatct tagagccttt cagacaacaa aatccagaca tagtcatcta tcaatacatg 2520 gatgatttat atgtaggatc tgacttagaa atagggcagc atagaacaaa aatagaggaa 2580 ctgagacaac atctgttgag gtggggattt accacaccag acaaaaaaca tcagaaagaa 2640 ccgccattcc tttggatggg ctatgaactc catcctgata aatggactgt gcagcctata 2700 gtgctgccag aaaaagatag ttggactgtc aatgacatac agaagttagt gggaaaattg 2760 aattgggcaa gtcagattta tgcagggatt aaagtaaggc aattatgtaa actccttagg 2820 ggaaccaagg cactaacaga agtaatacca ctaacagaag aagcagagtt agaactggca 2880 gaaaacaggg aaattctaaa agaaccagta catggagtgt actatgaccc atcaaaagac 2940 ttaatagcag aaatacagaa gcaggggcaa ggccaatgga catatcaaat ttatcaagaa 3000 ccatttaaaa atctaaaaac aggaaaatat gcaagaatga ggggtgccca cactaatgat 3060 gtaaaacaat taacagaggc agtgcaaaaa atagccacag agagtatagt gatatgggga 3120 aagattccta aatttagact acccatacaa aaagagacat gggaatcatg gtggacagac 3180 tattggcaag ccacctggat tcctgagtgg gaatttgtca atacccctcc cttagtaaaa 3240 ttatggtacc agttagagaa agaacccata gtaggagtag aaactttcta tgtagatggg 3300 gcagctaaca gggagactaa attaggaaaa gcaggatatg ttactgatag aggaagacaa 3360 aaagttgtct ccctaactga cacaacaaat cagaagactg agttacaagc aattcagatg 3420 gctttgcagg actcgggatt agaagtaaac atagtaacag actcacaata tgcattagga 3480 atcattcaag cacaaccaga taaaagtgaa tcagaaatag tcaatcaaat aatagaacag 3540 ttaataaaaa aggaaagggt ctacctgaca tgggtaccag cacacaaagg aattggagga 3600 aatgaacaag tagataagtt agtcagtgct ggaatcagga aagtactatt tttagatgga 3660 atagataagg cccaagaaga acatgaaaaa tatcacagta attggagagc tatggctagt 3720 gattttaacc tgccacctgt ggtagcaaaa gaaatagtag cctgctgtga taaatgtcaa 3780 caaaaaggag aggccatgca tggacaagta gactgtagtc caggaatatg gcaattagat 3840 tgtacacacc tagaaggaaa agttatcctg gtagcagtgc atgtagccag tggatatata 3900 gaagcagaag ttattccagc agagacaggg caggaaacag catacttcct cttaaaatta 3960 gcaggaagat ggccagtaaa aacaatacat acagacaatg gcagcaattt caccagtact 4020 acggtcaagg ctgcctgctg gtgggcgggg atcaagcaag aatttggtat cccctacaat 4080 ccccaaagtc aaggggtagt agaatctatg aataaagagt taaagaaaat tataggacag 4140 gtaagagatc aggctgaaca tctcaagaca gcagtacaaa tggcagtatt catccacaat 4200 tttaaaagaa aaggggggat tgggggatac agtgcagggg aaagaatagt agacatgata 4260 gcaacagaca tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg 4320 gtttattaca gggacagcag agatccactt tggaaaggac cagcaaagct tctctggaaa 4380 ggtgaagggg cagtagtaat acaagataat agtgacataa aagtagtgcc aagaagaaaa 4440 gcaaagataa ttagggatta tggaaaacag atggcaggtg atgattgtgt ggcaagtaga 4500 caggatgagg attagagcat ggaaaagtct agtaaaacac catatgtatg tttcaaaaaa 4560 ggctcaggga tggttttata gacatcacta tgacagtcgt catccaagaa taagttcaga 4620 agtacacatc ccactagggg aggctaaatt ggttgtaaca acatattggg gtctgaatac 4680 aggagaaaga gactggcatt tgggtcaggg agtctccata gaatggagga aaaggagata 4740 tagcacacaa gtagacccta acttagcaga ccaactaatt catctgtatt actttgattg 4800 tttttcagaa tccgctataa gaaatgccat attaggacat atagttagcc ctaagtgtgc 4860 atatcaagca ggacataaca aggtaggatc tctacagtac ttggcactag tagcattaac 4920 aacaccaaaa aagataaagc cacctttgcc tagtgtcgca aaattgacag aggatagatg 4980 gaacaagccc cagaagacca agggccacag agggagccat ataatgaatg gacactagag 5040 cttttagagg agcttaagaa tgaagctgtt agacactttc ctaggctgtg gctccatggt 5100 ttaggacaac atatctatga aacatatggg gatacttggg caggagtgga agccataata 5160 agaattctgc aacaactgct gtttattcat ttcagaattg ggtgtcaaca tagcag 5216 // ID KC312360; SV 1; linear; genomic RNA; STD; VRL; 5214 BP. XX AC KC312360; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_C19 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5214 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5214 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 2691b230b5b5fc407fc22bf2650bb21e. XX FH Key Location/Qualifiers FH FT source 1..5214 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_C19" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 207..1709 FT /gene="gag" FT CDS 207..1709 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4N7D3" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4N7D3" FT /protein_id="AGG76705.1" FT /translation="MGARASVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCRQILTQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQNKSKKKAQQTTADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKTWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKSVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT PEESFRFGEETATPPQKQEPIDKEMYPVASLRSLFGNDPSSQ" FT gene <1502..4513 FT /gene="pol" FT CDS <1502..4513 FT /codon_start=1 FT /gene="pol" FT /product="pol protein" FT /db_xref="GOA:M4N0M2" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010659" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:M4N0M2" FT /protein_id="AGG76706.1" FT /translation="FFRENLAFPQGEAREFSSEQTRANSPTRRELQVWGRDSNSPSEAG FT ANRQGNVSCSFPQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMTLPGKWKPKMIG FT GIGGFIKVRQYDQIPIEICGHRAIGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETV FT PVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISRIGPENPYNTPVFAIKKKD FT STKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRK FT YTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRQQNPDIVIYQYMD FT DLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI FT VLPEKDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLTEEAELEL FT AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHT FT NDVKQLTEAVQKIATESIVIWGKIPKFRLPIQKETWESWWTDYWQATWIPEWEFVNTPP FT LVKLWYQLEKEPIVGVETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTDTTNQKTELQ FT AIQMALQDSGLEVNIVTDSQYALGIIQAQPDRSESEIVNQIIEQLIKKERVYLTWVPAH FT KGIGGNEQVDKLVSAGIRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVA FT CCDKCQQKGEAMHGQVDCSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQET FT AYFLLKLAGRWPVKTIHTDNGSNFTSTTVKAACWWAGIKQEFGIPYNPQSQGVVESMNK FT ELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDMIATDIQTKELQK FT QITKIQNFRVYYRDSRDPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQ FT MAGDDCVASRQDED" FT gene 4458..5036 FT /gene="vif" FT CDS 4458..5036 FT /codon_start=1 FT /gene="vif" FT /product="vif protein" FT /db_xref="GOA:M4MZH9" FT /db_xref="InterPro:IPR000475" FT /db_xref="UniProtKB/TrEMBL:M4MZH9" FT /protein_id="AGG76707.1" FT /translation="MENRWQVMIVWQVDRMRIRAWKSLVKHHMYVSKKAQGWFYRHHYD FT SRHPRISSEVHIPLGEATLVVTTYWGLHTGERDWHLGQGVSIEWRKRRYSTQVDPNLAD FT QLIHLYYFDCFSESAIRNALLGQIVRPKCAYQAGHNKVGSLQYLALVALTTPKKIKPPL FT PSVAKLTEDRWNKPQKTKGHRGNHTMNGH" FT gene 4976..>5214 FT /gene="vpr" FT CDS 4976..>5214 FT /codon_start=1 FT /gene="vpr" FT /product="vpr protein" FT /db_xref="GOA:M4MYU1" FT /db_xref="InterPro:IPR000012" FT /db_xref="UniProtKB/TrEMBL:M4MYU1" FT /protein_id="AGG76708.1" FT /translation="MEQAPEDQGPQREPYNEWTLELLEELKSEAVRHFPRLWLHGLGQH FT IYETYGDTWAGVEALIRILQQLLFIHFRIGCQHS" XX SQ Sequence 5214 BP; 1955 A; 921 C; 1247 G; 1091 T; 0 other; ggtaactaga gatccctcag accctgttat tcggtgtgca aaatctctag cagtggcgcc 60 cgaacaggga cttgaaagcg aaaggaaaac cagaggagct ctctcgacgc aggactcggc 120 ttgctgaagc gcgcacggca agaggcgagg ggtggcgact ggtgagtacg ccaaactttt 180 gactagcgga ggctagaagg agagagatgg gtgcgagagc gtcggtatta agcgggggtc 240 aattggatag atgggaaaaa attcggttaa ggccaggggg aaaaaagcaa tataggttaa 300 aacatatagt atgggcaagc agggagctag aacgattcgc agtcaatcct ggcctgttag 360 aaacagcaga gggctgtaga caaatactga cacagttaca accagccctt cagacaggat 420 cagatgaact tagatcacta tataatacag tagcaaccct ctattgtgta catcaaagga 480 tagaggtaaa agacaccaag gaagctttag agaaaataga ggaggagcaa aataaaagta 540 agaaaaaggc acagcaaaca acagctgaca caggaagcag cagccaggtc agccaaaatt 600 accctatagt gcaaaacctt caggggcaaa tggtacatca ggccatatca cctagaactt 660 taaatgcatg ggtaaaagta gtagaagaga aggccttcag cccagaagta atacccatgt 720 tttcagcgtt atcagaagga gccaccccac aagatttaaa caccatgcta aacacagtgg 780 ggggacatca agcagccatg caaatgttaa aagagaccat caatgatgaa gctgcagaat 840 gggatagact gcatccagtg catgcagggc ctgttgcacc aggccagatg agagaaccaa 900 ggggaagtga catagcagga actactagta cccttcagga acaaatagga tggatgacaa 960 ataatccacc tatcccagta ggagagatct ataaaagatg gataatcttg ggattaaata 1020 aaatagtaag aatgtatagc cctaccagca ttttggacat aagacaagga ccaaaggaac 1080 cctttagaga ctatgtagac cggttctata aaactctaag agccgagcag gcgtcacagg 1140 atgtaaaaac ttggatgaca gaaaccttgt tggtccaaaa tgcaaaccca gattgtaaga 1200 ccattttaaa agcattggga ccagcagcta cactagaaga aatgatgaca gcatgtcagg 1260 gagtgggagg acccagccat aaagcaagag ttttggcgga ggcaatgagc caagcaacaa 1320 attcacctgc cataatgatg cagagaggca attttaggaa ccaaagaaag agtgttaaat 1380 gctttaattg tggcaaggaa gggcacatag ccagaaattg caaggcccct aggaaaagag 1440 gctgttggaa atgtggaaag gaaggacacc aaatgaaaga ttgtactgag agacaggcta 1500 attttttagg gaaaatctgg ccttcccaca aggggaggcc agggaatttt cttcagagca 1560 gaccagagcc aacagcccca ccagaagaga gcttcaggtt tggggaagag acagcaactc 1620 cccctcagaa gcaggagcca atagacaagg aaatgtatcc tgtagcttcc ctcagatcac 1680 tctttggcaa cgacccctcg tcacaataaa gatagggggg caactaaagg aagctctatt 1740 agatacagga gcagatgata cagtattaga agaaatgacc ctgccaggaa aatggaaacc 1800 aaaaatgata gggggaattg gaggttttat caaagtaaga cagtatgatc agatacccat 1860 agaaatctgt ggacacagag ctataggtac ggtattagta ggacctacac ctgtcaacat 1920 aattggaaga aatctgttga ctcagattgg ttgcacttta aattttccca ttagtcctat 1980 tgaaacggta ccagtaaaat taaagccagg aatggatggc ccaaaagtta aacaatggcc 2040 attgacagaa gaaaaaataa aagcattagt agaaatttgc acagaaatgg aaaaggaagg 2100 gaaaatttca agaattggac ctgaaaatcc atacaatact ccagtgtttg ctataaagaa 2160 aaaagacagt actaaatgga gaaaattagt agatttcaga gaacttaata agaaaactca 2220 agatttctgg gaagttcaat taggaatacc acatcccgca gggttaaaaa agaaaaagtc 2280 agtaacagta ctggatgtgg gggatgcata tttttcagtt cccttagata aagatttcag 2340 gaagtatact gcatttacca tacctagtac aaacaatgag acaccaggga ttagatatca 2400 gtacaatgtg cttccacagg gatggaaagg atcaccagca atattccaaa gtagcatgac 2460 aaaaatctta gagcctttca gacaacaaaa tccagacata gtcatctatc aatacatgga 2520 tgatttgtat gtaggatctg acttagaaat agggcagcat agaacaaaaa tagaggaact 2580 gagacaacat ctgttgaggt ggggatttac cacaccagac aaaaaacatc agaaagaacc 2640 tccattcctc tggatgggct atgaactcca tcctgataaa tggactgtac agcctatagt 2700 gctgccagaa aaagatagtt ggactgtcaa tgacatacag aagttagtgg gaaaattgaa 2760 ttgggcaagt cagatttatg cagggattaa agtaaggcaa ttatgtaaac tccttagggg 2820 aaccaaggca ctaacagagg taataccact aacagaagaa gcagagttag aactggcaga 2880 gaacagggaa attctaaaag aaccagtaca tggagtgtac tatgacccat caaaagactt 2940 aatagcagaa atacagaagc aggggcaagg ccaatggaca tatcaaattt atcaagagcc 3000 atttaaaaat ctaaaaacag gaaaatatgc aagaatgagg ggtgcccaca ctaatgatgt 3060 aaaacaatta acagaggcag tgcaaaaaat agccacagag agcatagtga tatggggaaa 3120 gattcctaaa tttagactac ccatacaaaa agagacatgg gaatcatggt ggacagacta 3180 ttggcaagcc acctggattc ctgagtggga atttgtcaat actcctcccc tagtaaaatt 3240 atggtaccag ttagagaaag aacccatagt aggagtagaa actttctatg tagatggggc 3300 agctaacagg gagactaaat taggaaaagc aggatatgtt actgatagag gaagacaaaa 3360 agttgtctcc ctaactgaca caacaaatca gaagactgag ttacaagcaa ttcagatggc 3420 tttgcaggac tcgggattag aagtaaacat agtaacagac tcacaatatg cattaggaat 3480 cattcaagca caaccagata gaagtgaatc agaaatagtc aatcaaataa tagaacagtt 3540 aataaaaaag gaaagggtct acctgacatg ggtaccagca cacaaaggaa ttggaggaaa 3600 tgaacaagta gataagttag tcagtgctgg aatcaggaaa gtactatttt tagatggaat 3660 agataaggcc caagaagaac atgaaaaata tcacagtaat tggagagcta tggctagtga 3720 ttttaacctg ccacctgtgg tagcaaaaga aatagtagcc tgctgtgata aatgtcaaca 3780 aaaaggagag gccatgcatg gacaagtaga ctgtagtcca ggaatatggc aattagattg 3840 tacacatcta gaaggaaaag ttatcctggt agcagtgcat gtagccagtg gatatataga 3900 agcagaagtt attccagcag aaacagggca ggaaacagca tacttcctct taaaattagc 3960 aggaagatgg ccagtaaaaa caatacatac agacaatggc agcaatttca ccagtactac 4020 ggttaaggct gcctgctggt gggcggggat caagcaagaa tttggcatcc cctacaatcc 4080 ccaaagtcaa ggggtagtag aatctatgaa taaagaatta aagaaaatta taggacaggt 4140 aagagatcag gctgaacatc ttaagacagc agtacaaatg gcagtattca tccacaattt 4200 taaaagaaaa ggggggattg ggggatacag tgcaggggaa agaatagtag acatgatagc 4260 aacagacata caaactaaag aattacaaaa acaaattaca aaaattcaaa attttcgggt 4320 ttattacagg gacagcagag atccactttg gaaaggacca gcaaagcttc tctggaaagg 4380 tgaaggggca gtagtaatac aagataatag tgacataaaa gtagtgccaa gaagaaaagc 4440 aaagataatt agggattatg gaaaacagat ggcaggtgat gattgtgtgg caagtagaca 4500 ggatgaggat tagagcatgg aaaagtctag taaaacacca tatgtatgtt tcaaaaaagg 4560 ctcagggatg gttttataga catcactatg acagtcgtca tccaagaata agttcagaag 4620 tacacatccc actaggggag gctacattgg tcgtaacaac atattggggt ctgcatacag 4680 gagaaagaga ctggcatcta ggtcagggag tctccataga atggaggaaa aggagatata 4740 gcacacaagt agaccctaac ttagcagacc aactaattca tctgtattac tttgattgtt 4800 tttcagaatc cgctataaga aatgccttat taggacaaat agttagacct aagtgtgcat 4860 atcaagcagg acataacaag gtaggatccc tacagtactt ggcactagta gcattaacaa 4920 caccaaaaaa gataaagcca cctttgccta gtgtcgcaaa attgacagag gatagatgga 4980 acaagcccca gaagaccaag ggccacagag ggaaccatac aatgaatgga cactagagct 5040 tttagaggag cttaagagtg aagctgttag acactttcct aggctgtggc tccatggttt 5100 agggcaacat atctatgaaa catatgggga tacttgggca ggagtggaag ccctaataag 5160 aattctgcaa caactgctgt ttattcattt cagaattggg tgtcaacata gcag 5214 // ID KC312361; SV 1; linear; genomic RNA; STD; VRL; 5216 BP. XX AC KC312361; XX DT 13-MAR-2013 (Rel. 116, Created) DT 26-APR-2013 (Rel. 116, Last updated, Version 2) XX DE HIV-1 isolate WARO_5_C2 from USA gag protein (gag) gene, complete cds; pol DE protein (pol) gene, partial cds; vif protein (vif) gene, complete cds; and DE vpr protein (vpr) gene, partial cds. XX KW . XX OS Human immunodeficiency virus 1 OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus. XX RN [1] RP 1-5216 RX DOI; 10.1073/pnas.1304288110. RX PUBMED; 23542380. RA Parrish N.F., Gao F., Li H., Giorgi E.E., Barbian H.J., Parrish E.H., RA Zajic L., Iyer S.S., Decker J.M., Kumar A., Hora B., Berg A., Cai F., RA Hopper J., Denny T.N., Ding H., Ochsenbauer C., Kappes J.C., Galimidi R.P., RA West A.P.Jr., Bjorkman P.J., Wilen C.B., Doms R.W., O'Brien M., RA Bhardwaj N., Borrow P., Haynes B.F., Muldoon M., Theiler J.P., Korber B., RA Shaw G.M., Hahn B.H.; RT "Phenotypic properties of transmitted founder HIV-1"; RL Proc. Natl. Acad. Sci. U.S.A. 110(17):6626-6633(2013). XX RN [2] RP 1-5216 RA Parrish N., Li H., Shaw G., Hahn B.; RT ; RL Submitted (05-DEC-2012) to the INSDC. RL Medicine, University of Pennsylania, 3610 Hamilton Walk, Philadelphia, PA RL 19104, USA XX DR MD5; 0a915ce5c566aa12ba66f9336f19d353. XX FH Key Location/Qualifiers FH FT source 1..5216 FT /organism="Human immunodeficiency virus 1" FT /host="Homo sapiens" FT /isolate="WARO_5_C2" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="17-Oct-2007" FT /db_xref="taxon:11676" FT gene 209..1711 FT /gene="gag" FT CDS 209..1711 FT /codon_start=1 FT /gene="gag" FT /product="gag protein" FT /db_xref="GOA:M4MYR9" FT /db_xref="InterPro:IPR000071" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012344" FT /db_xref="InterPro:IPR014817" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:M4MYR9" FT /protein_id="AGG76709.1" FT /translation="MGARTSVLSGGQLDRWEKIRLRPGGKKQYRLKHIVWASRELERFA FT VNPGLLETAEGCKQILAQLQPALQTGSDELRSLYNTVATLYCVHQRIEVKDTKEALEKI FT EEEQSKSKKKAQQATADTGSSSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKVVEEKA FT FSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINDEAAEWDRLHPVHAGP FT VAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTS FT ILDIRQGPKEPFRDYVDRFYKTLRAEQASQDVKNWMTETLLVQNANPDCKTILKALGPA FT ATLEEMMTACQGVGGPSHKARVLAEAMSQATNSPAIMMQRGNFRNQRKSVKCFNCGKEG FT HIARNCKAPRKRGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAP FT