-
Notifications
You must be signed in to change notification settings - Fork 0
/
nucleo_siv.txt
157 lines (156 loc) · 11.2 KB
/
nucleo_siv.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
>lcl|NC_001549.1_cds_NP_687035.1_1 [gene=gag-pol] [locus_tag=SIVgp1] [db_xref=GeneID:1490009] [protein=Gag-Pol] [exception=ribosomal slippage] [protein_id=NP_687035.1] [location=join(897..2197,2197..5314)] [gbkey=CDS]
ATGGGCGGGGGTCACTCAGCACTGTCAGGGAGAAGCCTCGACACGTTCGAGAAGATTAGGCTACGTCCGA
ACGGGAAAAAGAAGTACCAAATTAAACATTTAATATGGGCAGGAAAAGAAATGGAACGATTTGGGTTACA
TGAGAAACTTTTAGAAACAAAAGAAGGCTGTCAAAAAATCATAGAAGTTTTAACCCCGTTGGAACCGACA
GGCTCCGAGGGGCTAAAAGCTCTGTTTAATTTGTGCTGCGTCATTTGGTGCATTCACGCAGAACAGAAAG
TGAAAGACACAGAGGAAGCTGTAGTAACAGTTAAGCAACACTACCATCTAGTGGACAAAAATGAGAAAGC
AGCTAAAAAGAAAAATGAGACAACAGCGCCACCTGGTGGCGAATCAAGAAATTACCCAGTAGTAAATCAG
AATAATGCCTGGGTACACCAGCCTTTGTCTCCGCGCACGTTAAATGCGTGGGTCAAATGCGTGGAGGAAA
AAAGGTGGGGAGCAGAAGTAGTCCCCATGTTCCAAGCACTCTCAGAGGGATGTCTCTCCTATGATGTAAA
TCAGATGCTCAATGTAATAGGAGACCATCAGGGGGCATTACAAATTCTTAAGGAAGTCATTAATGAAGAA
GCAGCAGAGTGGGACAGGACACACAGACCACCAGCTGGCCCGTTACCAGCAGGGCAGCTAAGAGACCCGA
CAGGGTCAGATATAGCAGGAACTACCAGCTCAATTCAGGAACAAATAGAGTGGACCTTCAATGCCAATCC
AAGAATAGACGTAGGGGCACAATACAGAAAATGGGTTATTTTGGGCTTACAAAAGGTAGTGCAGATGTAC
AATCCCCAAAAGGTCCTAGACATTCGACAGGGACCTAAAGAACCCTTCCAGGACTATGTAGACAGATTCT
ATAAAGCCCTGAGAGCAGAACAAGCACCACAGGATGTTAAAAATTGGATGACACAAACTTTGCTTATCCA
GAATGCCAATCCGGATTGTAAATTGATTCTGAAAGGATTGGGAATGAATCCAACCTTGGAGGAAATGCTA
ATAGCTTGCCAGGGAGTAGGAGGGCCACAACATAAGGCTAAGCTAATGGTAGAAATGATGAGTAATGGAC
AGAATATGGTCCAAGTGGGACCTCAGAAAAAGGGCCCCCGAGGGCCGCTAAAATGCTTTAATTGTGGCAA
ATTTGGACATATGCAAAGGGAATGCAAGGCACCAAGACAGATCAAATGCTTTAAGTGCGGCAAAATTGGC
CATATGGCAAAAGACTGCAAGAATGGACAGGCAAATTTTTTTAGGGTATGGCCATTGGGGAGGAGCGAAA
CCAAGAAATTTTGTGCAATACAGAGGAGACACAGTTGGTCTGGAACCAACAGCCCCCCCAATGGAAACAG
CTTACGATCCAGCAAAGAAGCTCCTCCAGCAGTATGCAGAGAAGGGACAGCGCCTGAGAGAGGAGAGAGA
ACAGACAAGGAAACAGAAGGAGAAAGAAGTGGAGGATGTTTCCTTGAGCTCCCTCTTTGGAGGAGACCAA
TGAAACGAGTCATCATAGAAGGAACGCCAGTGCAAGCCTTGTTAGATACAGGAGCAGATGACACTATAAT
TCAAGAAAAGGACTTGCACTTTCCCCCACATAAACCATGGCGTTCCAAGGTAGTAGGAGGTATAGGAGGA
GGGATTCATGTCAAAGAATATCAGGGGGTACAAGTACAATTGGAGGATAAAATCATCACCGGCTCAATTC
TAATAGGAAGTACACCAATCAATATTATAGGAAGAAATATTTTAGCTCAGGCAGGCATGAAATTAGTTAT
GGGAGTTCTATCTAGTCAGATTGAGGAAACAAAAGTACAACTAAAAGAAGGGAAAGATGGACCTAAATTG
AAACAATGGCCCTTATCAAGAGAAAAAATTGAAGCTTTAACAGAAATATGCAAACAAATGGAAGAGGAGG
GAAAATTATCTAGGATAGGAGGAGAAAATCCTTATAATACACCAGTGTTTGCCATAAAGAAAAAGGATAA
AACACAATGGAGAATGCTTGTAGATTTCAGGGAACTAAACAAAGCTACTCAAGACTTTTTTGAGGTTCAG
CTGGGAATTCCTCACCCAGCGGGCCTTCAGAAAAAGAAGCAAATCACAGTAATAGACATAGGGGATGCCT
ATTATTCAATACCATTATGCAAGGAATTCAGAAAATATACAGCATTTACCATCCCCTCAGTAAATAATAC
AGGGCCAGGGATAAGGTATCAGTTCAATTGTCTGCCTCAGGGATGGAAAGGATCTCCTACAATTTTCCAG
AATACGGCAGCAAACATTTTAGAGGAGATCAAAAGGCACACTCCTGGGTTAGAAATTGTCCAATACATGG
ACGATTTGTGGTTGGCGTCAGACCATGATGAGACTAGACATAATCAACAGGTAGACATAGTAAGAAAGAT
GCTGCTAGAAAAAGGTCTAGAAACCCCAGACAAGAAAGTCCAAAGAGAACCGCCATGGGAATGGATGGGG
TATAAATTGCATCCGAATAAATGGACCATTAACAAAATAGAATTACCCCCCTTAGAAGGAGAATGGACAG
TAAACAAAATACAGAAGGTAGTAGGAGTTCTAAATTGGGCAAGTCAAATTTATCCAGGAATTAAAACCAA
ACATACCTGTGCCATGTTGAGAGGGAAAAAGAACCTCCTAGAAGAAATAGTATGGACAGAAGAGGCAGAG
GCAGAATATAAGAACAATCAAGGGATAGTGCAGGAAACACAAGAAGGAACATACTATGACCCTCTCAAAG
AATTAATAGCAACAGTTCAAAAGCAAGGAGAAGGGCAATGGACATACCAATTCACCCAAGAAGGGGCAGT
ATTAAAGGTGGGAAGATATGCCAAGCAAAGAGAAACTCATACTAATGATCTAAGGACTCTAGCACACCTT
GTCCAAAAAATCTGTAAGGAAGCACTTACCATTTGGGGAAGACTTCCACGAGTACAACTCCCAGTAGACA
AGAAAACATGGGATATGTGGTGGCAGGACTATTGGCAAGTATCCTGGATACCAGAATGGGAGTTTGTTAG
CACACCACTCCTAGTAAAACTGTGGTATTCCTTAGTAAAAGAACCAATCAAAGGAGAAGATGTTTATTAT
GTGGATGGGGCAGCATCCAAAGTGACCAAATTAGGTAAGGCAGGATATCTGTCAGAGAGAGGAAAAAGTA
GAATTAGGGAATTAGAAAACACCACTAACCAACAAGCAGAATTAACAGCAGTTAAGATGGCATTGGAGGA
CAGTGGAGAAAATGTAAATATAGTCACAGATTCTCAATATGTAATGAACATCTTGACAGCATGTCCACAG
GAAAGTAACTCACCCTTAGTGGAACAGATAATACAAGCCCTAATGAAAAAGAGGCAGGTCTACTTACAAT
GGGTACCAGCTCATAAGGGGATAGGAGGCAATACAGAAATAGATAAATTAGTAAGCAAAGGAATAAGACA
GATCCTCTTCTTAGATAGAATAGAAGAAGCACAAGATGACCATGCAAAGTACCATAACAATTGGAGAAGT
ATGGTACAGGAATTTGGATTACCTAATATAGTAGCAAAAGAGATAGTAGCGGCATGTCCCAAATGCCAAA
TAAGAGGAGAACCTAAGCATGGACAGGTAGACGCCTCCATTGAAACTTGGCAGATGGACTGCACCCATTT
AGAAGGAAAAGTTATAATAGTAGCAGTACATGTAGCCAGTGGATTCATAGAAGCAGAGGTGATCCCAAGA
GAAACTGGGAAGGAGACAGCACACTTTCTGCTGAAACTGTTAGCAAGATGGCCAGTGAAACATCTACACA
CTGATAATGGCCCAAACTTTACCTCTCAGAATGTGGCAGCGGTGTGCTGGTGGGGTAATATAGAGCACAC
CACTGGAATACCTTATAACCCACAGTCACAGGGTAGTGTAGAAAGCATGAACAGACAGCTCAAGGAAATC
ATCTCTCAAATAAGAGATGATTGTGAGAGATTGGAGACAGCAGTGCAAATGGCTACGCATATCCACAATT
TTAAAAGAAAGGGAGGAATAGGGGGTATCTCTAGTGCAGAAAGATTGGTTAATATGCTAACAACACAACT
AGAACTAAATACTCTACAAAACCAAATCCAAAAAATTTTGAATTTTAAGGTCTACTACAGAGAAGGTAGA
GATCCAGTGTGGAAAGGACCAGCGCGACTCATCTGGAAAGGAGAAGGCGCGGTGGTAATTAAAGAGGGGG
AAGACATCAAGGTAGTCCCCAGGAGAAAGGCTAAGATTATCAAAGATTATGGAGAGAGAAAAACAATGGA
TAGTGAGGGTAGTATGGAGGGTGTCAGAGAGGCAAATAAGCAGATGGAGGGGGATAGTGACTTACAAGAT
CAGGAATAA
>lcl|NC_001549.1_cds_NP_054369.1_2 [gene=gag-pol] [locus_tag=SIVgp1] [db_xref=GeneID:1490009] [protein=gag protein] [protein_id=NP_054369.1] [location=897..2438] [gbkey=CDS]
ATGGGCGGGGGTCACTCAGCACTGTCAGGGAGAAGCCTCGACACGTTCGAGAAGATTAGGCTACGTCCGA
ACGGGAAAAAGAAGTACCAAATTAAACATTTAATATGGGCAGGAAAAGAAATGGAACGATTTGGGTTACA
TGAGAAACTTTTAGAAACAAAAGAAGGCTGTCAAAAAATCATAGAAGTTTTAACCCCGTTGGAACCGACA
GGCTCCGAGGGGCTAAAAGCTCTGTTTAATTTGTGCTGCGTCATTTGGTGCATTCACGCAGAACAGAAAG
TGAAAGACACAGAGGAAGCTGTAGTAACAGTTAAGCAACACTACCATCTAGTGGACAAAAATGAGAAAGC
AGCTAAAAAGAAAAATGAGACAACAGCGCCACCTGGTGGCGAATCAAGAAATTACCCAGTAGTAAATCAG
AATAATGCCTGGGTACACCAGCCTTTGTCTCCGCGCACGTTAAATGCGTGGGTCAAATGCGTGGAGGAAA
AAAGGTGGGGAGCAGAAGTAGTCCCCATGTTCCAAGCACTCTCAGAGGGATGTCTCTCCTATGATGTAAA
TCAGATGCTCAATGTAATAGGAGACCATCAGGGGGCATTACAAATTCTTAAGGAAGTCATTAATGAAGAA
GCAGCAGAGTGGGACAGGACACACAGACCACCAGCTGGCCCGTTACCAGCAGGGCAGCTAAGAGACCCGA
CAGGGTCAGATATAGCAGGAACTACCAGCTCAATTCAGGAACAAATAGAGTGGACCTTCAATGCCAATCC
AAGAATAGACGTAGGGGCACAATACAGAAAATGGGTTATTTTGGGCTTACAAAAGGTAGTGCAGATGTAC
AATCCCCAAAAGGTCCTAGACATTCGACAGGGACCTAAAGAACCCTTCCAGGACTATGTAGACAGATTCT
ATAAAGCCCTGAGAGCAGAACAAGCACCACAGGATGTTAAAAATTGGATGACACAAACTTTGCTTATCCA
GAATGCCAATCCGGATTGTAAATTGATTCTGAAAGGATTGGGAATGAATCCAACCTTGGAGGAAATGCTA
ATAGCTTGCCAGGGAGTAGGAGGGCCACAACATAAGGCTAAGCTAATGGTAGAAATGATGAGTAATGGAC
AGAATATGGTCCAAGTGGGACCTCAGAAAAAGGGCCCCCGAGGGCCGCTAAAATGCTTTAATTGTGGCAA
ATTTGGACATATGCAAAGGGAATGCAAGGCACCAAGACAGATCAAATGCTTTAAGTGCGGCAAAATTGGC
CATATGGCAAAAGACTGCAAGAATGGACAGGCAAATTTTTTAGGGTATGGCCATTGGGGAGGAGCGAAAC
CAAGAAATTTTGTGCAATACAGAGGAGACACAGTTGGTCTGGAACCAACAGCCCCCCCAATGGAAACAGC
TTACGATCCAGCAAAGAAGCTCCTCCAGCAGTATGCAGAGAAGGGACAGCGCCTGAGAGAGGAGAGAGAA
CAGACAAGGAAACAGAAGGAGAAAGAAGTGGAGGATGTTTCCTTGAGCTCCCTCTTTGGAGGAGACCAAT
GA
>lcl|NC_001549.1_cds_NP_054370.1_3 [gene=vif] [locus_tag=SIVgp2] [db_xref=GeneID:1490005] [protein=vif protein] [protein_id=NP_054370.1] [location=5214..5873] [gbkey=CDS]
ATGGAGAGAGAAAAACAATGGATAGTGAGGGTAGTATGGAGGGTGTCAGAGAGGCAAATAAGCAGATGGA
GGGGGATAGTGACTTACAAGATCAGGAATAAACAATTGCCTTGGGAATACAGACATCATTGGCAGGTGCA
ATGGCAGTTTTGGACCTACAGCCAGTTCATTATCCCCTTATCAAAAGATGATTACATAGAAGTGAATATT
TATCACAACCTCACCCCAGAAAGAGGATGGCTCTCAAGTCATGGAGTAGGGTTATCCTATTACCATCAAA
AGGGATATAAGACAGAAGTAGATCCAGGAACAGCAGACAGAATGATACACCTATATTATTTTAACTGTTT
TACAGATAGAGCCATCCAACAGGCTATCAGAGGGGAGAAGTATACGTGGTGCACATTCAAGGAAGGACAT
AAAGGTCAGGTACAATCACTGCAACTTTTGGCACTAGTTGCATATACAAATGGCATCAGGAAGAGATCCA
AGAGAACCTTTACCAGGATGGCTGGAAATCTGGGATCTAGACAGGGAGCCATGGGACGAATGGCTACAAG
ACATGCTCAGGGATCTAAACGAAGAAGCCAGAAGGCACTTTGGAATGAACATGCTAATCCGAGTATGGAA
TTACTGTGTAGAGGAGGGAAGGAGACATAA
>lcl|NC_001549.1_cds_NP_054371.1_4 [gene=vpx] [locus_tag=SIVgp3] [db_xref=GeneID:1490006] [protein=vpx protein] [protein_id=NP_054371.1] [location=5683..6039] [gbkey=CDS]
ATGGCATCAGGAAGAGATCCAAGAGAACCTTTACCAGGATGGCTGGAAATCTGGGATCTAGACAGGGAGC
CATGGGACGAATGGCTACAAGACATGCTCAGGGATCTAAACGAAGAAGCCAGAAGGCACTTTGGAATGAA
CATGCTAATCCGAGTATGGAATTACTGTGTAGAGGAGGGAAGGAGACATAATACCCCATGGAATGAGATA
GGCTACAAGTACTATAGAATTGTTCAAAAGTCTATGTTTGTACATTTCAGATGTGGTTGTAGAAGGAGAG
GACCTTTTTCCCCTTACGAAGAGAGGAGAAATGGACAAGGAGGAGGAGCCCCACCCCCTCCTCCAGGACT
TGCATAG
>lcl|NC_001549.1_cds_NP_054372.1_5 [gene=env] [locus_tag=SIVgp6] [db_xref=GeneID:1490007] [protein=envelope protein] [protein_id=NP_054372.1] [location=6202..8766] [gbkey=CDS]
ATGGGGAGATTGCTTATAAAAATACTAATAATAGCAATAGGGATAAGTATAGGAATAGGTAACCTGTATG
TGACAGTGTTTTATGGAATCCCAGTATGGAAAAATTCAACAGTTCAGGCATTTTGCATGACGCCCAATAC
CAATATGTGGGCAACCACCAACTGCATACCAGATGATCATGATAATACAGAGGTGCCTCTAAACATTACA
GAAGCTTTCGAGGCTTGGGATAATCCGCTGGTAAAACAAGCAGAGAGTAATATACATCTACTCTTTGAAC
AAACGATGAGGCCTTGTGTTAAGCTCTCCCCCATATGTATTAAAATGTCCTGTGTAGAGCTGAATGGTAC
AGCCACGACAAAGGCCACCACTACTGCAACTACAACAATGACTACCCCCTGTCAGAATTGCAGTACAGAG
CAGATAGAAGGAGAAATGGCAGAGGAACCAGCATCCAACTGCACTTTTGCAATTGCAGGATATCAAAGAG
ATGTAAAAAAGAATTATAGCATGACCTGGTATGATCAGGAGTTAGTCTGCAATAATAAAACAGGAAGTGA
AAAGGGAAGTAAGGATTGTTACATGATACATTGTAATGATTCAGTGATAAAAGAAGCTTGTGATAAAACA
TATTGGGATACTTTAAGAGTAAGATACTGTGCACCAGCAGGGTATGCTTTGCTAAAATGTAATGATAAGG
ATTATAGAGGCTTTGCTCCAAAGTGCAAGAATGTTTCAGTAGTGCATTGTACTAGATTAATCAATACTAC
TATAACTACAGGGATAGGATTAAATGGTAGTAGATCAGAAAATAGAACAGAGATATGGCAGAAAGGAGGA
AATGATAATGATACAGTTATAATAAAGTTGAATAAGTTTTACAACTTGACAGTGAGATGCCGAAGACCTG
GTAATAAAACAGTGTTGCCAGTAACAATCATGGCAGGGTTAGTATTTCACTCTCAGAAATATAATACCAG
GTTAAAACAAGCGTGGTGCCACTTCCAAGGAGATTGGAAAGGGGCATGGAAAGAAGTCAGAGAAGAAGTA
AAGAAAGTGAAAAATCTTACAGAAGTAAGCATAGAAAATATACATCTGAGAAGGATATGGGGAGATCCAG
AATCAGCGAATTTTTGGTTCAATTGTCAAGGTGAATTTTTCTATTGTAAGATGGACTGGTTTATCAATTA
TCTAAACAATCGAACAGAAGATGCAGAAGGTACTAATAGGACCTGTGACAAAGGGAAGCCAGGACCAGGA
CCATGTGTTCAGAGAACTTATGTTGCCTGCCATATACGACAAGTAGTAAATGATTGGTACACTGTCTCTA
AAAAGGTATATGCTCCACCAAGGGAAGGTCATTTGGAGTGTAACTCATCAGTCACGGCACTATACGTGGC
AATAGATTATAACAACAAGTCTGGCCCAATAAATGTGACCCTAAGTCCTCAGGTACGCAGCATATGGGCG
TACGAACTGGGAGACTATAAATTAGTAGAGATAACACCAATTGGCTTTGCTCCTACAGATGTAAGAAGAT
ATACTGGCCCCACAAGAGAAAAAAGGGTGCCATTCGTGCTAGGGTTTCTAGGCTTCTTGGGAGCTGCTGG
AACTGCAATGGGCGCAGCGGCAACAACGCTGACAGTCCAGTCTCGGCATTTGCTTGCTGGGATATTGCAG
CAGCAGAAGAACTTGCTGGCGGCTGTGGAACAGCAACAACAGTTGTTGAAGCTGACCATTTGGGGTGTGA
AAAACCTCAATGCCCGCGTCACAGCTCTCGAGAAGTACCTAGAGGATCAGGCACGGCTAAATTCATGGGG
ATGTGCGTGGAAACAAGTATGTCACACCACAGTGCCATGGAAGTATAATAACACTCCTAAGTGGGACAAT
ATGACTTGGTTGGAGTGGGAGAGACAAATTAATGCCTTGGAAGGCAACATAACTCAACTATTGGAAGAAG
CACAAAATCAGGAATCAAAGAATCTGGATCTGTACCAGAAATTGGATGATTGGTCAGGGTTCTGGTCATG
GTTCTCACTGTCAACTTGGTTAGGCTATGTTAAAATAGGATTTTTAGTGATAGTGATTATTCTAGGATTA
AGATTTGCATGGGTATTATGGGGATGTATCAGAAATATTAGGCAGGGATATAATCCTCTCCCCCAGATCC
ATATCCACAGTTCAGCGGAACGGCCAGACAACGGAGGAGGGCAAGACAGAGGTGGAGAAAGCAGCAGCAG
CAAATTGATAAGATTGCAGGAAGAGTCCTCAACACCTTCGAGGATCAACAACTGGTGGCTCAACTTCAAG
AGCTGCAGCTTGAGAATAAGGACTTGGTGTTACAACATCTGCCTGACCCTCCTCATATTCATCAGGACAG
CAGTGGGATACCTGCAGTATGGGCTCCAGCAACTCCAAGAGGCAGCAACAGGGCTTGCTCAAGCTCTGGC
GAGGGCTGCGAGGGAAGCCTGGGGCAGACTGGGTGCTATTGTCCGATCCGCTTATCGGGCAGTCATCAAC
AGTCCAAGAAGAGTGCGGCAAGGCCTTGAAAAAGTCCTGGGGTAA
>lcl|NC_001549.1_cds_NP_054373.1_6 [gene=nef] [locus_tag=SIVgp7] [db_xref=GeneID:1490008] [protein=nef protein] [protein_id=NP_054373.1] [location=8600..9271] [gbkey=CDS]
ATGGGCTCCAGCAACTCCAAGAGGCAGCAACAGGGCTTGCTCAAGCTCTGGCGAGGGCTGCGAGGGAAGC
CTGGGGCAGACTGGGTGCTATTGTCCGATCCGCTTATCGGGCAGTCATCAACAGTCCAAGAAGAGTGCGG
CAAGGCCTTGAAAAAGTCCTGGGGTAAAGGTAAAATGACTCCAGACGGCCGCCGCCTGCAAGAAGGAGAC
ACCTTTGATGAGTGGGATGATGATGAAGAAGAAGTAGGCTTCCCTGTGCAACCTCGAGTCCCCTTAAGAC
AGATGACCTATAAATTAGCAGTGGACTTTTCCCACTTTTTAAAATCAAAGGGGGGACTGGATGGGATATA
TTACTCTGAAAGAAGAGAAAAGATCCTGAATTTGTATGCCTTGAACGAGTGGGGAATAATAGATGATTGG
CAAGCTTACTCACCAGGCCCGGGGATAAGGTACCCGAGAGTCTTTGGCTTCTGCTTTAAGCTAGTCCCAG
TGGACCTGCATGAGGAGGCACGCAACTGTGAGAGACACTGTCTGATGCATCCAGCACAGATGGGGGAAGA
TCCTGATGGAATAGATCATGGAGAAGTCTTGGTCTGGAAGTTTGACCCGAAGTTGGCGGTGGAGTACCGC
CCGGACATGTTTAAGGACATGCACGAACATGCAAAGCGCTAG