-
Notifications
You must be signed in to change notification settings - Fork 0
/
trainData.txt
547 lines (547 loc) · 16.4 KB
/
trainData.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
DLGPPISLERLDVGTNLGNAIAKLEAKELLESSD
HRIDLGPPISLERLDVGTNLGNAIAKLEAKELLE
IDLGPPISLERLDVGTNLGNAIAKLEAKELLESS
LERLDVGTNLGNAIAKLEAKELLESSDQILRSMK
LGPPISLERLDVGTNLGNAIAKLEAKELLESSDQ
LHRIDLGPPISLERLDVGTNLGNAIAKLEAKELL
PISLERLDVGTNLGNAIAKLEAKELLESSDQILR
PPISLERLDVGTNLGNAIAKLEAKELLESSDQIL
RIDLGPPISLERLDVGTNLGNAIAKLEAKELLES
SLERLDVGTNLGNAIAKLEAKELLESSDQILRSM
ALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDKQ
AVSKVLHLEGEVNKIALLSTNKAVVSLSNGVSV
EVNKIALLSTNKAVVSLSNGVSVLTSKVLDLKN
GEVNKIALLSTNKAVVSLSNGVSVLTSKVLDLK
IALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDK
KIALLSTNKAVVSLSNGVSVLTSKVLDLKNYID
LEGEVNKIALLSTNKAVVSLSNGVSVLTSKVLD
NKIALLSTNKAVVSLSNGVSVLTSKVLDLKNYI
SKVLHLEGEVNKIALLSTNKAVVSLSNGVSVLT
VAVSKVLHLEGEVNKIALLSTNKAVVSLSNGVS
VNKIALLSTNKAVVSLSNGVSVLTSKVLDLKNY
VSKVLHLEGEVNKIALLSTNKAVVSLSNGVSVL
ERKVDFLEENITALLEEAQIQQEKNMYELQKLNSW
EWERKVDFLEENITALLEEAQIQQEKNMYELQKLN
FLEENITALLEEAQIQQEKNMYELQKLNSWDVFGN
KVDFLEENITALLEEAQIQQEKNMYELQKLNSWDV
QEWERKVDFLEENITALLEEAQIQQEKNMYELQKL
RKVDFLEENITALLEEAQIQQEKNMYELQKLNSWD
VDFLEENITALLEEAQIQQEKNMYELQKLNSWDVF
WERKVDFLEENITALLEEAQIQQEKNMYELQKLNS
WQEWERKVDFLEENITALLEEAQIQQEKNMYELQK
EWYNQTKDLQQKFYEIIMDIEQNNVQGKKGIQQLQ
GEWYNQTKDLQQKFYEIIMDIEQNNVQGKKGIQQL
GNITLGEWYNQTKDLQQKFYEIIMDIEQNNVQG
ITLGEWYNQTKDLQQKFYEIIMDIEQNNVQGKKGI
IWNHGNITLGEWYNQTKDLQQKFYEIITMDIEQNNV
LGEWYNQTKDLQQKFYEIIMDIEQNNVQGKKGIQQ
NITLGEWYNQTKDLQQKFYEIIMDIEQNNVQGK
NQTKDLQQKFYEIIMDIEQNNVQGKKGIQQLQKWE
TKDLQQKFYEIIMDIEQNNVQGKKGIQQLQKWEDW
TLGEWYNQTKDLQQKFYEIIMDIEQNNVQGKKG
WNHGNITLGEWYNQTKDLQQKFYEIIMDIEQNNVQ
WYNQTKDLQQKFYEIIMDIEQNNVQGKKGIQQLQK
YNQTKDLQQKFYEIIMDIEQNNVQGKKGIQQLQKW
AGAGTGATAIGMVTQYHQVL
CNCTNSSSSYSGTKMACPSNRG
EGGTLGNWAREIWATLFKKA
EGPGLGNWAREIWATLFKKA
EGPTGGNWAREIWATLFKKA
EGPTLGGWAREIWATLFKKA
EGPTLGNWAGEIWATLFKKA
EGPTLGNWAREGWATLFKKA
EGPTLGNWAREIWAGLFKKA
EGPTLGNWAREIWATGFKKA
EGPTLGNWAREIWATLFKGA
EGPTLGNWAREIWATLFKKA
EGPTLGNWAREIWATLFKKG
EGPTLGNWAREIWATLGKKA
EGPTLGNWAREIWGTLFKKA
EGPTLGNWARGIWATLFKKA
EGPTLGNWGREIWATLFKKA
EIWATLFKKATRQCRRGRIW
GGPTLGNWAREIWATLFKKA
HVMLALATVLSIAGAGTGATAI
IGLKVEAMEKFLYTAFAMQE
IGNIPQYLKGLLGGILGIGL
LGNWAREIWATL
LGNWAREIWATLFK
LGNWAREIWATLFKKA
LLGGILGIGLGVLLLILCLP
LQKWEDWVRWIGNIPQYLKG
NQTKDLQQKFYEIIMDIEQN
NVQGKTGIQQLQKWEDWVRW
PDYLLVPEEVMEYKPRRKRAAI
PTLGNWAREIWATLFKKA
RGILRNWYNPFAGLRQSLEQ
SGTKMACPSNRGILRNWYNP
TGALKINNLRLVTLEHQVLV
TRQCRRGRIWKRWNETITGP
YKPRRKRAAIHVMLALATVLSI
AIKWEYVLLLFLL
IKWEYVLLLFLL
KWEYVLLLFLL
SFAIKWEYVLLLFLL
VSFAIKWEYVLLLFL
WEYVLLLFLL
SWLRDIWDWACEVLSDFK
SWLRDIWDWECEVLSDFK
SWLRDIWDWGCEVLSDFK
SWLRDIWDWICELLSDFK
SWLRDIWDWLCELLSDFK
SWLRDIWDWSCEVLSDFK
SWLRDIWDWVCEVLSDFK
SWLRDLWDWICELLSDFK
SWLRDLWDWICEVLSDFK
SWLRDLWDWLCELLSDFK
SWLRDLWDWLCEVLSDFK
SGSWLRDIWDWICEVLSDFK
GSWLRDIWDWICEVLSDFK
SWLRDIWDWICEVLSDFKT
SWLRDIWDWICEVLSDFKTW
SWRLIDWDWICEVLSDFK
SWLRDIWDWICEVL
KFDSLVECIWDWIDRLWS
KWLCRIWSWISDVLDDFE
SIWRDWVDLICEFLSDWK
SWLRDVWDWICTVLTDFK
SWLRDVWDWVCTILTDFK
DWLRIIWDWVCSVVSDFK
SWLWEVWDWVLHVLSDFK
SWLRDVWDWVCTVLSDFK
SWLRDIWDWISEVLSDFK
SWLRDIWDWIREVLSDFK
SWLRDIWDWIEEVLSDFK
SWLDDIWDWICEVLSDFE
SWLRDIWDWICKVLSDFK
SWLDRIWRWICKVLSRFE
SWLRDIWRWICKVLSRFK
SWLRRIWRWICKVLSRFK
ACKFWW
AEPERRNIKYL
AIPCGESCVWIPCISAAIGCSCKNKVCYR
AKITFTNNHPRTIWP
CGESCAMISFCFTEVIGCSCKNKVCYLNSIS
CGESCVFIPCITSVAGCSCKSKVCYRNGIP
CGSVFLVGQLFTFSPRHH
CLGVGSCNDFAGCGYAIVCFW
DCPNGPWVWVPAFCQAVGWG
DREINNYTSLIHSLIEESQNQQEKNEQELLELDKWA
EELAKKAEELAKKAEELAKKAEELAKKAWASLWNWF
EWDREINNYTSLIHSLIEESQNQQEKNEQELLELDK
FKCRRWQWRM
FKRIVQRIKDFLR
GADFQECMKEHSQKQHQHQG
GDPTFCGETCRVIPVCTYSAALGCTCDDRSDGLCKRN
GFCRCICTRGFCRCICTR
GFCRCLCRRGVCRCICTR
GFKRIVQRIKDFLRNLV
GFPCGESCVFIPCISAAIGCSCKNKVCYRN
GICRCICGRGICRCYCGR
GICRCICGRRICRCICGR
GICRCYCGRGICRCICGR
GIGGKILSGLKTALKGAAKELASTYLH
GIGTKILGGVKTALKGALKELASTYAN
GIKEFKREFQRIKDFLRNLV
GIKEFKRIVQRIKDFLRNLV
GIKQFKRIVQRIKDFLRNLV
GIKYFSMVGNWAKVLVVL
GIPCAESCVWIPCTVTALVGCSCSDKVCYN
GIPCGESCVFIPCLTTVAGCSCKNKVCYRN
GLPICGETCVGGTCNTPGCSCSWPVCTRN
GPWVWVPAFCQAVGWGDPIT
GRFKRFRKKFKKLFKKIS
GRFKRFRKPFKKLFKKIS
GTACGESCYVLPCFTVGCTCTSSQCFKN
GTKALTEVIPLTEEAEC
GTKWLTEWIPLC
GTKWLTEWIPLTAEAEC
GTKWLTEWIPLTAEC
GVCRCICGRGVCRCICGR
GVCRCICGRGVCRCICRR
GVIPCGESCVFIPCISAAIGCSCKNKVCYRN
GVPCGESCVFIPCITGVIGCSCSSNVCYLN
GYCRCICGRGICRCICGR
HAKFWW
HCAFWW
HCKAWW
HCKFWF
HCKFWG
HCKFWH
HCKFWI
HCKFWR
HCKFWV
HCKFWW
HCKFWY
IWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNE
KIMAKPSKFYEQLRGR
KIPCGESCVWIPCLTSVFNCKCENKVCYHD
KIPCGESCVWIPCVTSIFNCKCENKVCYHD
KPVSLSYRCPCR
KPVSLSYRCPCRF
KPVSLSYRCPCRFF
KQTENLADTY
LCDCPNGPWVWVPAFCQAVG
LEAIPCSIPPCFAFNKDFVF
LEAIPCSIPPCFAFNKPFVF
LEAIPCSIPPCLAFAKPFVF
LEAIPCSIPPCVFFGKPFVF
LEAIPCSIPPCVFFNKPFVF
LEAIPCSIPPCVGFGKPFVF
LEAIPCSIPPCVLFNKPFVF
LEAIPCSIPPECLFGKPFVF
LEAIPCSIPPEFLFGKPFVFLEAIPCSIPPEFLFGKPFVF
LEAIPISIPPEVFFGKPFVF
LEAIPMCIPPECFFNKPFVF
LEAIPMCIPPECLFGKPFVF
LEAIPMCIPPECLFNKPFVF
LEAIPMKIPPEFLFGKPFVF
LEAIPMSCPPEFCFGKPFVF
LEAIPMSIPPEFAFNKDFVF
LEAIPMSIPPEFLFGKPFVF
LEAIPMSIPPEIAFNKPFVF
LEAIPMSIPPELAFAKPFVF
LEAIPMSIPPEVKFNKPFVF
LIHSLIEESQNQQEKNEQELLELDKWASLWNWFNIT
LLGDLLRKSKEKIGKEFKRIVQRIKDFLRNLVPRTES
LLRIPQAIMDMIAGAHWG
LPLPAPSFHRTT
LSYRCPCRFF
MEWDREINNYTSLIHSLIEESQNQQEKNEQELLELD
NGVIPCGESCVFIPCISTLLGCSKNKVCYR
NMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQEL
NNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQE
NNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWN
NYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW
PNGPWVWVPAFCQAVGWGDP
PQITLRKKRRQRRRPPQVSFNFATLNF
PQITLRKKRRQRRRPPQVSFNFCTLNF
PTGERVWDRGNVTLLCDCPN
PVSLSYRCPCRFFE
QIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKN
QLLDVVKRQQEMLRLTVWGTKNLQARVTAIEKYLKDQ
QLLIRMIYKNI
QLLIRMIYKNILFYLVPGPGHGAEPERRNIKYL
RGGRLCYCRRRFCVCVGR
RGNVTLLCDCPNGPWVWVPA
RIPTGERVWDRGNVTLLCDC
RMIYKNILFYLVPGPGHGAEPERRNIKYL
SKEKIGKEFKRIVQRIKDFLR
SLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNI
SNQGGSPLPRSV
SSGLYHVTNDCPNSSIVY
SYRCPCRFFES
SYSMEHFRWGKPV
TFCGETCRVIPVCTYSAALGCTCDDRSDGLCKRNGDP
TLLCDCPNGPWVWVPAFCQA
TSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFN
TWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLE
VCYRNGIPCGESCVWIPCISAALGSCK
WDREINNYTSLIHSLIEESQNQQEKNEQELLELDKW
WDRGNVTLLCDCPNGPWVWV
WMEWDREIEALAKAAEALAKAAEALAKAAWASLWNWF
WMEWDREIEEAAKKLEEAAKKLEEAAKKLWASLWNWF
WMEWDREINNYTSLIGSLIEESQNQQEKNEQELLE
WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLE
WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL
WNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQ
WVWVPAFCQAVGWGDPITHW
YCKKCCYHCQ
YPGHITGHRMANMMMNW
YQCGQGG
YQLAIRMIYKNI
YQLLIRAIYKNI
YQLLIRMAYKNI
YQLLIRMIAKNI
YQLLIRMIY
YQLLIRMIYANI
YQLLIRMIYKAI
YQLLIRMIYKNA
YQLLIRMIYKNI
YTSLIHSLIEESQNLQEKNEQELLELDKWASLWNWF
YTSLIHSLIEESQNQQEKLEQELLELDKWASLWNWF
YTSLIHSLIEESQNQQEKNEQELLELDKWASLANWF
YTSLIHSLIEESQNQQEKNEQELLELDKWASLFNFF
YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNAF
YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNSF
YTSLIHSLIEESQNQQEKNEQELLELDKWASPWNWF
YTSLIHSLIEESQNQQEKNEQELLQLDKWASLWNWF
YTSLIHSLIEESQNQQEKNEQQLLELDKWASLWNWF
YTSLIHSLIEESQNQQEKNQQELLQLDKWASLWNWF
YTSLIHSLIEESQQQQEKNEQELLELDKWASLWNWF
YTSLIHSLIEQSQNQQEKNEQELLELDKWASLWNWF
YTSLIHSLIQESQNQQEKNEQELLELDKWASLWNWF
YTSLIQSLIEESQNQQEKNEQELLELDKWASLWNWF
YTSLIQSLIEESQNQQEKNEQQLLELDKWASLWNWF
ALWKTMLKKLGTMALHAGKAALGAAADTISQGTQ
GLFGVLAKVAAHVVPAIAEHF
GLLSVLGSVAKHVLPHVVPVIAEHL
CGRLLLRRQRRRAHQN
EWRKKRYSTQV
FFGKVLKLIRKIF
FFHHIFRGIVHVGKTIHRLVTG
GFKDLLKGAAKALVKTVLF
GFLDIIEKIAKSW
GFLSILKKVLPKVMAHMK
GFNEIVQDIEDFLQNLV
GIGAVLKVLTTGLPALISWIKRKRQQ
GIIDIAKKLFESW
GIWDTIKSMGKVFAGKILQNL
GIWSDLAEIIKKF
GLLGLLGSVVSHVVPAIVGHF
GLRRLLGRLLRRLGRLLLR
GLRSKIWLWVLLMIWQESNKFKKM
GLRSRIWLWVLLMIWQESNRFKRM
GWFDIIKKIASEL
GWFDVVKHIAKRF
ILGPVLGLVSRTLRRVLGIL
LLKELWTKIKGAGKAVLGKIKGLL
MTWEAWDRAIAEYAARIEALIRAAQEQQEKNEAALREL
MTWMAWDRAIANYAALIHALIEAAQNQQEKNEAALLEL
MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELL
MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL
SLSRFLRFLKIVYRRAF
TTWEAWDRAIAEYAARIEALIRAAQELQEKLEAALREL
TTWEAWDRAIAEYAARIEALIRAAQELQEKNEAALREL
TTWEAWDRAIAEYAARIEALIRAAQEQQEKLEAALREL
TTWEAWDRAIAEYAARIEALIRAAQEQQEKLEAVLREL
TTWEAWDRAIAEYAARIEALIRAAQEQQEKNEAALREL
TTWEAWDRAIAEYAARIEALIRALQELQEKLEAILREL
TTWEAWDRAIAEYAARIEALIRALQELQEKNEAALREL
TTWEAWDRAIAEYAARIEALIRALQELQEKNEAILREL
TTWEAWDRAIAEYAARIEALIRALQEQQEKNEAALREL
TTWEAWDRAIAEYAARIEALIRALQEQQEKNEAILREL
TTWEEWDREINEYTSRIESLIRESQEQQEKNEQELREL
VFQFLGRIIHHVGNFVHGFSHVF
WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF
YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF
YVSGKARGWFYRHHY
YVSGKARGWFYRHHYESPHPRISSEVHIPLGDARLV
AEGIGALFLGFLGAAGSTMGARSMTLTVQARQL
AVGIGALFLGFLGAAGSTMGARSMTLTVQARQL
AIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNKEIV
AKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVA
ALDPIDISIELNKAKSDLEESKEWIRRSNQKLDSI
ARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKS
AVALVEAKQARSDIEKLKEAIRDTNKAVQSVQSSI
DISIELNKAKSDLEESKEWIRRSNQKLDSIGNWHQ
DPIDISIELNKAKSDLEESKEWIRRSNQKLDSIGN
EAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLIV
KLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVN
KQARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAI
LNNSVALDPIDISIELNKAKSDLEESKEWIRRSNQ
LVEAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNL
RSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSV
SDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQ
SVALDPIDISIELNKAKSDLEESKEWIRRSNQKLD
TAAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQS
TLNNSVALDPIDISIELNKAKSDLEESKEWIRRSN
VEAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLI
ALDPIDISIELNKAKSDLEESKEWIRRSNQKLDS
AQITAAVALVEAKQARSDIEKLKEAIRDTNKAVQS
DPIDISIELNKAKSDLEESKEWIRRSNQKLDSIG
IDISIELNKAKSDLEESKEWIRRSNQKLDSIGNW
LDPIDISIELNKAKSDLEESKEWIRRSNQKLDSI
LKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNK
NSVALDPIDISIELNKAKSDLEESKEWIRRSNQK
PIDISIELNKAKSDLEESKEWIRRSNQKLDSIGN
QARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIK
FKCRRWQWRMKKLGA
FKCRRWQWRMKKLGAPSITCVRRAF
RFLVCWKQKIWGKARPSMCTRRARF
TKCFQWQRNMRKVRGPPVSCIKRDS
TKCFQWQWNMRKVRGPPVSCIKRDS
TKCRRWQRNMRKVRGPPVSCIKRDS
AAHLIDALYAEFLGGRVLTTPVVHRALFYASAVLRQPFLAGVPSA
CYCRIPACIAGERRYGTCIYQGRLWAFCC
FLPVLAGIAAKVVPALFCKITKKC
GHRRYFTFGGGYVYF
GIGKFLHSAGKFGKAFVGEIMKS
GLASTLTRWAHYNALIRAF
HEFVPLEVYTRHEIK
HRWRKRWRKWRWRKRWRK
KRWRKRWRKKRWRKRWRK
KTTSSIEFARLQFTY
KVLTTGLPALISWIKRKRQQ
LLGDFFRKSKEKIGKEFKRIVQRIKDFLRNLVPRTES
LRKLRKRLL
LRKRKRL
LRKRKRLLK
LRKRKRLRK
LRKRKRLRKLRKRKRLRK
LRTRKRGRK
LRTRKRGRKLRTRKRGRK
LRWRKRWRKLRWRKRWRK
LRWRKRWRKWRWRKRWRK
RARRSLLIASALCTSDVAAATNADLRTALARADHQKTLFWL
RLTRKRGLK
RRCICTTRTCRFPYRRLGTCIFQNRVYTFCC
RRWRKRWRKRRWRKRWRK
RRWRKRWRKWRWRKRWRK
RTRKGRK
RTRKGRR
RTRKRWRKRTRKRGRK
RWRKRGRKRWRKRGRK
RWRKRWRKRWRKRWRK
RWRKRWRKWRWRKRWRK
RWRKRWRWRKRWRWRKRW
TARLQLEARLQHLVAEILEREQSLA
TARLQLEARLQHLVAEILEREQSLALHALGYQLAFV
TTPKFTVAWDWVPKR
VFQFLGKIIKKVGNFVKGFSKVF
VVCACRRALCLPLERRAGFCRIRGRIHPLCCRR
WRWRKRWRK
WRWRKRWRKWRWRKRWRK
YDHIQDHVNTMFSRLATSWCLLQNKERALWAEAA
YFYNAK
LRKRKRLLRKRKRL
RTRKRGRRTRKRGR
GIGKFLHSAKKFGKAFVGEIMNS
ILPWKWPWWPWRR
GMASKAGAIAGKIAKVALKAL
GIGKFLKKAKKGIGAVLKVLTTGL
ALSKALSKALSKALSKALSKALSK
KKLLKKLKKLLKKL
IDISIELNKAKSDLEESKEWIRRSNQKLDSIGNWH
LDPIDISIELNKAKSDLEESKEWIRRSNQKLDSIG
NNSVALDPIDISIELNKAKSDLEESKEWIRRSNQK
NSVALDPIDISIELNKAKSDLEESKEWIRRSNQKL
PIDISIELNKAKSDLEESKEWIRRSNQKLDSIGNW
SIELNKAKSDLEESKEWIRRSNQKLDSIGNWHQSS
VALDPIDISIELNKAKSDLEESKEWIRRSNQKLDS
CATCEQIADSQHRSHRQMV
CATCQIADSHRSHRQMV
PPWCCCSPMKRASPPPAQSDLPATPKCPP
RRKKAAVALLPAVLLALLAP
RRKKAVLLALLAP
RRKKLPAVLLALLAP
RRKKPAVLLALLAP
RRKKVLLALLAP
RRKKAAVALLAVLLALLA
RRKKVALLAVLLALLA
GLLRKGGEKIGEKLKKIGQKIKNFFQKLVPQPEQ
RKAVLLALLA
RKKLAVLLALLA
RRKKAAAAAAAAA
RRKKLAVLLALLA
RRKKLLAVLLALLA
HGVSGHGQHGVHG
DLGPPISLERLDVGTNLGNAIAKLEDAKELLESSD
IDLGPPISLERLDVGTNLGNAIAKLEDAKELLESS
LERLDVGTNLGNAIAKLEDAKELLESSDQILRSMK
LGPPISLERLDVGTNLGNAIAKLEDAKELLESSDQ
LHRIDLGPPISLERLDVGTNLGNAIAKLEDAKELL
PISLERLDVGTNLGNAIAKLEDAKELLESSDQILR
PPISLERLDVGTNLGNAIAKLEDAKELLESSDQIL
RIDLGPPISLERLDVGTNLGNAIAKLEDAKELLES
SLERLDVGTNLGNAIAKLEDAKELLESSDQILRSM
LFRLIKSLIKRLVSAFK
DASISQVNEKINQSLAFIRKSDELLHNVNAGKSTT
DEFDASISQVNEKINQSLAFIRKSDELLHNVNAGK
EFDASISQVNEKINQSLAFIRKSDELLHNVNAGKS
FPSDEFDASISQVNEKINQSLAFIRKSDELLHNVN
FYDPLVFPSDEFDASISQVNEKINQSLAFIRKSDE
PLVFPSDEFDASISQVNEKINQSLAFIRKSDELLH
PSDEFDASISQVNEKINQSLAFIRKSDELLHNVNA
VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV
YDPLVFPSDEFDASISQVNEKINQSLAFIRKSDEL
DASISQVNEKINQSLAFIRKSDELLHNVNAGKSIT
DPLVFPSDEFDASISQVNEKINQSLAFIRKSDELL
KKRKRRFLGFLLGVGSA
SDEFDASISQVNEKINQSLAFIRKSDELLHNVNAG
SSVITSLGAIVSCYGKT
VITIELSNIKENKCNGAKVKLIKQELDKYKNAV
YTSVITIELSNIKENKCNGAKVKLIKQELDKYK
IQKEIDRLNEVAKNLNESLI
PTTFMLKYDENGTITDAVDC
QYGSFCTQLNRALSGIAAEQ
YQDVNCTDVSTAIHADQLTP
GVFVFNGTSWFITQRNFFS
GYFVQDDGEWKFTGSSYYY
GYHLMSFPQAAPHGVVFLHVTW
IQKEIDRLNEVAKNLNESLIDLQELGK
NGIGVTQNVLYENQKQIANQFNKAISQIQESLTTTSTA
AALYKKKIIKKLLES
KNGRKLCLDLQAALY
KWKLFKKIGIGKFLHAAKKF
KWKLFKKIGIGKFLHFAKKF
KWKLFKKIGIGKFLHWAKKF
RRKKAAVAALPAVLLALLAP
RRKKAAVALAPAVLLALLAP
RRKKAAVALKPAVLLALLAP
RRKKAAVALLKAVLLAALAP
RRKKAAVALLKAVLLALAAP
RRKKAAVALLKAVLLALKAP
RRKKAAVALLPAVALALLAP
RRKKAAVALLPAVKLALLAP
RRKKAAVALLPAVLAALLAP
RRKKAAVALLPAVLEALLAP
RRKKAAVALLPAVLKALLAP
RRKKAAVALLPAVLLAKLAP
RRKKAAVALLPAVLLALL
RRKKAAVALLPAVLLALLA
RRKKALLPAVLLALLAP
RRKKAVALLPAVLLALLAP
RRKKLLPAVLLALLAP
RRKKVALLPAVLLALLAP
RWRWRW
CDVIALLACHLNT
CDVIALLCHLNTPSF
CDVIALLCHLNTPSFNTTHYRESWY
DTRACDVIALLCHLNT
ALWKTLLKKVLKAAAK
ALWKTLLKKVLKAAAKAALNAVLVGANA
ALWMTLLKKVLKAAAK
ALWMTLLKKVLKAAAKAALNAVLVGANA
FDASISQVNEKINQSLAFIRKSDELLHNVNAGKST
KPKQIKPPLPSV
NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ
SVITIELSNIKENKCNGTDAKVKLIKQELDKYKNA
TSVITIELSNIKENKCNGTDAKVKLIKQELDKYKN
VITIELSNIKENKCNGTDAKVKLIKQELDKYKNAV
KDDPSQSANLLSEAKKLNESQAPKADNKFNKEQQNAFYEILHIPNLNEEQRNGFIQSLKDDPSQSANLLAEAKKLNDAQAPKAD
MGSSHHHHHHSSGVDNKFNKEQQNAFYEILHLPNLNEEQRNAFIQSLKDDSYIDTNNDGAYEGDELSGSQSANLL
MGSSHHHHHHSSGVDNKFNKEQQNAFYEILHLPNLNEEQRNAFIQSLKDDSYIDTNNDGAYEGDELSGSQSANLLAEAKKLNDAQAPK
HLPNLNEEQRNGFIQSLKDDPSQSANLLSEAKKLNESQAPKADNNFNKEQQNAFYEILHLPNLNEEQRNGFIQSLKDDPSQSANLLSEAKKLNESQAPKA
YEILHLPNLTEEQRNGFIQSLKDDPSVSKEILAEAKKLNDAQAPKEEDNNKPGKEDGNKPGKEDNNKPGKEDGNGVHVV
QKFYEILHLPNLTEEQRNGFIQSLKDDPSVSKDILVEAKKLNDSQAKPDYSEAQQNAFYEILHLPNLTEEQ
MKKKNIYSIRKLGVGIASVTLGTLLISGGVTPAANAAQHDEAQQNAFYQVLNMPNLNADQRNGFIQSLKDDPSQSANVLGEAKN
NAAQHDEAQQNAFYQVLNMPNLNADQRNGFIQSLKDDPSQSANVLGEAQKLNDSQAPKADAQQNKFNKDQQSA
APKADNNFNKEQQNAFYEILNMPNLNEEQRNGFIQSLKDDPSQSANLLAEAKKLNESQAPKADNKFNKE
KDDPSQSANLLSEAKKLNESQAPKADNKFNKEQQNAFYEILHIPNLNEEQRNGFIQSLKDDPSQSANLLAEAKKLNDAQAPKAD
QKFYEILHLPNLTEEQRNGFIQSLKDDPSVSKDILVEAKKLNDSQAKPDYSEAQQNAFYEILHLPNLTEEQ
MENKNFFSIRKLSIGVGSCLIASSLLVNTPSFAEETDNANINDAQQNAFYEILHLPNLTEEQ
NDSQAKPDYSEAQQNAFYEILHLPNLTEEQRNGFIQSLKDDPSVSKDILVEAKKLNDSQAKPDYSEAQQNA
MENKNFFSIRKLSIGVGSCLIASSLLVNTPSFAEETDNANINDAQQNAFYEILHLPNLTEEQQNGFIQSLKDDPS
MENKNFFSIRKLSIGVGSCLIASSLLVNTPSFAEETDNANINDAQQNAFYEILHLPNLTEE
VSKEILAEAKKLNDAQAPKEEDNNKPGKEDGNGVHVVKPGDTVND
ITEEQRIQYIKTLREHPECAQEVFSESLKDSKNPDRRVAQQNAFYNVLKNHNLTEQEKNNYIAQIKENPD
ENIGSYKQENPVDPDSYIYSSRVYYSLGLALSGRNFTEMFLDDMLLNLTPEQRNCFLQSIKDMSKAVALLDGKEVSLENLFILGRSYFFTGYI
MGSLTKDQQDEFDQIVNGTGLNEEQQNTLLQDKITEQNNNLTRSLKLETIKKIAKYVAKITGNSLKAKSLKAFINFLTNYEGKAEDGYGMH
ATLNSNAATTDTKFDDNVQAVLNKASQQNQEKQTKIAEIQNLSNLNQSQKDALVQEVKDSLHSQAQSVLEKAKTLDTKMKQLKDKVAEESTNKALDAYQN
MATNALNDGTAKQRLSETGTAKRRIRKVFGNIHEVVQMPNLIEVQRESYEQFLRSDPSIGYVSGLEKTLRSVFPIRDFAGTAEL
MKTAITLRPDAKGRVGINALARQLQDRLGGQTISGYTAEVTADGAILLRPRVEVDAQEASTLILGAEDREAFLQALASPPPPGAALKAAARAHARATRRR
KDDPSVSKEILAEAKKLNDAQAPKEEDNNKPGKEDNKKPGKEDGNKPGKEDGNKPGKEDGNKPGKEDGNGVHVVKPGDTVNDIAKANGTTA
EAIDKIKDAIKRIYDNKDDLKKIVDELPNLSEQEKEHFKDQIQNEDDPVKRNKIIREAQKINDQKQELINLINEQPNLS
NGNQKLADAKQDAKTTLGTLDHLNDAQKQALTTQVEQAPDIATVNNVKQNAQNLNNAMTNLNNALQDKTETLNSINFTDADQAKKDAYTNAVSHAEGI
QLYVESTQDHQQRLNGLRQVVNRTYRIGTTKRVEVSQGNVQTKKVLESTNLNIDDFVDDPLSYVKTPSNKVLGFYSTNANTNAFRPGGAQQLNEYQL
VGQANRLEDVQTVQTNGQALNNAMKGLRDSIANETTVKTSQNYTDASPNNQSTYNSAVSNAKGIINQTNNPTMDTSAITQATTQVNNAKNGLN
DSIANETTVKTSQNYTDASPNNQSTYNSAVSNAKGIINQTNNPTMDTSAITQATTQVNNAKNGLNGAENLRNAQNTAKQNLNTLSHLTNN
LEAAKQQASQSLGSLDNLNNAQKQTVTDQINGAHTVDEANQIKQNAQNLNTAMGNLKQAIADKDATKATVNFTDADQAKQQAYNTAVTNAE
MGNLKQAIADKDATKATVNFTDADQAKQQAYNTAVTNAENIISKANGGNATQAEVEQAIKQVNAAKQAL
LNGNENLEAAKQQASQSLGSLDNLNNAQKQTVTDQINGAHTVDEANQIKQNAQNLNTAMGNLKQAIADKDATKATVNFTDADQAKQQAYNTAVTNAEN
QIVISDRAKQSSSTGNESNSHLTIGYGTANHPFNSSTIGHKKKIDEDDDIDPLHMRHFRNNFGNVIKNAIGVVGISG
AMGNLKQAIADKDATKATVNFTDADQAKQQAYNTAVTNAENIISKANGGNATQAEVEQAIKQVNAAKQA
ALNGNENLEAAKQQASQSLGSLDNLNNAQKQTVTDQINGAHTVDEANQIKQNAQNLNTAMGNLKQAIA
DNAITAAKAILNKSTGPNTAQNAVEAALQRVNNAKDALNGDAKLIAAQNAAKQHLGTLTHITTAQRNDLTNQISQATNLA
IGTTDEKQAAMNQINEIVLETIRDINNAHTLQQVEAALNNGIARISAVQIVISD
LTDAINAAPTRTEVAQHVQTATELDHAMEILKNKVDQVNTDKAQPNYTEASTDKKEAVDQALQAAESITDPTNGSNANKDAVEQALTKLQEKENELNG
MTKKLQTTFIVLIIVLLALLGAYILYYSFRSKSPPPPLLTEEQKTQALIDSITPTTTTTSLSAQELKKINDSIAPRDKSKTINN
MFPYETDDPYMLVRDPELTAEQAAEKIQALADDLAGSHQLAAEQAVERRKVFWSETGRHHFLSRGPELTAE
MSSMKNFITEQQKAELERLHNSNRNGRVRDRIKAILLGYEGWSSAMIAQALRLHQITIAHHTRDVIAFVTRTWSIIFNIPGMNK
MKKTVFYEIDALADGREAFLCSHCHREASMRLVPAYCPSCGRKVRRVKDALGLAGSPVFYKLYHSTGRNPEAVEAFIACVQGDDGRMAACLAHALQPQA
MEEVHGLTPGKLASDPNTPVDKISIPKFTHQIYMGNQLVSSLDFESDYSLKPITEKSLFAFNGNEKTF
MNKKIKMLLLATGSIVTATAPVLLSAVAADEDINSFSNYTDPNNIQYGNIDYDILNTDM
EKSEWRKANGIPDKPEDYAVPEVKGYEWTEADKPLMNAFMSSMHAKNATQEQIDAMLQTYVSVAAESKVAQAEADKAAEVEVIDHLRT
EAKNTKQDPIYSSDTSDKKSAFDNAITESETKLDEHLKVNLSNLSAEQILEKAKEVQADIKTLDDEIKKLDGKKQALRDEINAYSP
ITEEQRIQYIKTLREHPECAQEVFSESLKDSKNPDRRVAQQNAFYNVLKNHNLTEQEKNNYIAQIKENPD