猿猴腺病毒的核酸和氨基酸序列,含有它们的载体以及用法 【发明背景】
腺病毒是一种基因组大小36千碱基(kb)的双链DNA病毒,由于其能在各种靶组织实现高效基因转移和大的转基因容量,已广泛用于基因转移用途。通常,缺失腺病毒的E1基因,用含有所选的启动子、感兴趣基因的cDNA序列和聚A信号序列的转基因盒替代,产生复制缺陷性重组腺病毒。
腺病毒的形态特征是具有由三种主要蛋白质、六邻体(II)、五邻体碱基(III)和缠绕纤维(IV)、以及许多其它小蛋白VI、VIII、IX、IIIa和IVa2组成的二十面体衣壳(W.C.Rurreii,J.Gen.Virol.,81:2573-2604.2000)。其病毒基因组是线性双链DNA,具有共价结合于有着倒置末端重复序列的5’端的一末端蛋白质。腺病毒DNA与高碱性蛋白质工程VII和一小肽终端mu密切相联。此DNA-蛋白质复合物包装了另一蛋白质V,并通过蛋白质VI提供了与衣壳的结构连接。此病毒还含有加工某些结构性蛋白质产生成熟传染性病毒所需的一种病毒编码的蛋白酶。
用于向细胞输送分子的重组腺病毒已有描述,见美国专利6,083,716所述的两种黑猩猩腺病毒。
本领域需要能避开人群中对所选腺病毒血清型预先存在的免疫力的和/或可用于反复给药和如果需要经二次疫苗接种能提高效价的更有效的载体
发明概述
本发明提供六个猿猴腺病毒的分离的核酸序列和氨基酸序列、含这些序列的载体和能表达猿猴腺病毒基因的细胞系。还提供采用本发明载体和细胞的一些方法。
本发明方法包括给予本发明地载体向哺乳动物输送一种或多种所选的异源基因。因为各种载体构建物衍生自猿猴而非人,非猿病人或病兽的免疫系统不能将此载体视作外源抗原立刻对其发生应答。因此采用本发明的组合物当给予非猿受试者时,得以更稳定地表达所选项的转基因。采用本发明的组合物作疫苗接种得以提供所选的抗原以引发保护性免疫应答。不希望受理论的束缚,本发明的腺病毒转导人树突状细胞的能力至少部分导致本发明的重组构建物能够引发免疫应答。本发明的重组猿猴腺病毒也可用于体外产生异源基因产物。这种基因产物本身的多种多样可用于如本文所述的各种目的。
本发明的这些和其它实施例及优点将在以下作更详细描述。
附图简述
图1提供了本发明黑猩猩腺病毒C1(SEQ ID NO:13)、黑猩猩腺病毒C68(Pan-9)(SEQ ID NO:14)和新的Pan5(SEQ ID NO:15)、Pan6(SEQ ID NO:16)、Pan7(SEQ IDNO:17)黑猩猩腺病毒序列的衣壳蛋白六邻体的L1换和部分L2环的氨基酸序列排列对比。相关的保守区是腺病毒各血清型之间保守的基础结构域部分。
图2提供了黑猩猩腺病毒C68(Pan-9)(SEQ ID NO:18)、Pan-6(SEQ ID NO:19)、Pan-7(SEQ ID NO:20)、Pan-5(SEQ ID NO:21)和人腺腺病毒血清型2(SEQ ID NO:22)和5(SEQ ID NO:23)的纤维瘤结构域的氨基酸序列的排列对比。
发明详述
本发明提供了最初分离自黑猩猩淋巴结的Ad Pan5(SEQ ID NO:1-4、15和21)、Ad Pan6(SEQ ID NO:5-8,16,19)和Ad血清型Pan7(SEQ ID NO:9-12,17,20)的新的核酸和氨基酸序列。本说明书的几个例子中,这些腺病毒分别以本文的C5,C6和C7为末端。本发明还提供最初分离自猕猴肾细胞的腺病毒SV1的序列(SEQ IDNO:24-28)。本发明还提供最初分离自猕猴肾细胞的腺病毒SV-25(SEQ ID NO:19-33)和SV-39(SEQ ID NO:34-37)的序列。
本发明提供新型腺病毒载体和产生这些载体的包装细胞系,用于在体外产生重组蛋白质或片段或其它制剂。本发明还提供用于输送治疗或疫苗目的的异源分子的组合物。这些治疗性或疫苗组合物含有携带插入的异源分子的腺病毒载体。此外,本发明的新型序列可用于提供产生重组腺病毒相关病毒(AAV)载体所需的辅助功能。因此本发明提供在此种产生方法中利用这些序列的辅助构建物、方法和细胞系
术语“基本上同源”或“基本上相似”指核酸或其片段时,表示当任选地将含相应核苷酸插入或缺失的序列与另一核酸(或其互补链)排列对比时,对比的序列至少约95-99%核苷酸序列相同。
术语“基本上同源”或“基本上相似”指氨基酸或其片段时,表示当任选地将含相应氨基酸插入或缺失的序列与另一氨基酸(或其互补链)排列对比时,对比的序列至少约95-99%核苷酸序列相同。优选同源性指蛋白质的全长序列,或其长度至少8个氨基酸,更佳至少15个氨基酸的片段。本文将描述合适片段例子。
本文核酸序列所用术语“序列同一性百分率”或“相同”指当作最大排列对比时两个序列中的残基相同。序列同一性比较的长度可包括全长基因组(如约36kb)、基因、蛋白质、亚基或酶开放阅阅读框的全长(例如见表中提供的腺病毒编码区),或者,需要的话,至少约500-5000个核苷酸的片段。然而,也可能需要比较较小片段,如至少约9个核苷酸,通常至少约20-24个核苷酸,至少约28-32个核苷酸,至少约36个或更多核苷酸的同一性。类似的,可不难确定某蛋白质全长或其片段的氨基酸序列的“序列同一性百分率”。一片段宜至少长约8个氨基酸,可长达700个氨基酸。本文将描述合适片段的例子。
用缺省设置,采用本文的算法和计算机程序不难测定同一性。优选这种同一性指蛋白质、酶、亚基或其至少长约8个氨基酸的片段。然而,这种同一性可基于较短的区域,适合产生相同的基因产物的区域。
如本文中所述,可用各种已发表的或可商品购得的多个序列排列对比程序,如“Clustal W”,通过inter网服务获得,进行此排列对比。或者,也可采Vector NTI公用事业公司的程序。本领域已知许多算法可用于测定核苷酸序列的同一性,包括上述程序中所含有的那些。另一例子,可采用Fasta(GCC第6.1版中的程序)比较多核苷酸的序列。Fasta提供了询问和检索之间最佳重叠区域的对比和序列同一性百分率。例如,可利用Fasta的缺失参数(六个字大小和评分矩阵的NOPAM系数),如GCC第6版中提供的(纳入本文参考文献),来测定核酸序列之间的序列同一性百分率。可获得类似的程序来进行氨基酸的排列对比。通常这些程序采用缺省设置,虽然如果需要本领域技术人员可换用其它设置。或者,本领域技术人员可采用至少能提供如上述算法和程序同样的同一性或对比水平的其它算法或计算机程序。
如本说明书和权利要求书所用,术语“包含”及其异体词,其中包括“含有”、“包括”等,包括其它组分、元件、整数、步骤等。术语“由…组成”不包括其它组分、元件、整数、步骤等。
I.猿猴腺病毒序列
本发明提供了分离自与腺病毒天然相关的其它病毒材料的Pan5、Pan6、Pan7、SV1、SV25和SV39的核酸序列和氨基酸序列。
A.核酸序列
本发明的Pan5核酸序列包括SEQ ID NO:1的核苷酸1-36462。本发明的Pan6核酸序列包括SEQ ID NO:5的核苷酸1-36604。本发明的Pan7核酸序列包括SEQ IDNO:9的核苷酸1-36535。本发明的SV1核酸序列包括SEQ ID NO:24的核苷酸1-34264。本发明的SV25核酸序列包括SEQ ID NO:29的核苷酸1-31044。本发明的SV39核酸序列包括SEQ ID NO:34的核苷酸1-34115。见纳入本文参考文献中的序列表。
本发明的核酸序列还包括SEQ ID NO:5、9、24、29和34序列互补的链,以及相应于这些序列图中序列及其互补链的RNA和cDNA。本发明还包括与序列表有95-98%以上,更优选约99-99.9%同源性或同一性的核酸序列。本发明的核酸序列还包括SEQ ID NO:5、9、24、29和34中提供的序列和它们的互补链的天然变种和基因工程修饰。这种修饰例如包括本领域已知的标记、甲基化和一个或多个天然核苷酸被简并核苷酸所取代。
本发明还包括Pan5、Pan6、Pan7、SV1、SV25和SV39序列的片段、它们的互补链、互补的cDNA和RNA。合适的片段至少长15个核苷酸并含有功能性片段,如生物学感兴趣的片段。例如,功能性片段可表达所需的腺病毒产物,或用于产生重组病毒载体。这种片段包含下表所列的基因序列和片段。
下表提供了本发明猿猴腺病毒序列的转录区和开放阅读框。对于某些基因,其转录区和开放阅读框(ORFs)位于与SEQ ID NO:5、9、24、29和34序列互补的链上。见例如E2b,E4和E2a。还显示了其编码蛋白的分子量。注意,E1a开放阅读框Pan5(SEQ ID NO:1的核苷酸576-1436)、Pan6(SEQ ID NO:5的核苷酸576-1437)和Pan7(SEQ ID NO:9的核苷酸576-1437)含有内部剪接位点。这些剪接位点在下表中标出: Ad Pan-5(SEQ ID NO:1) 区域 起始 (nt) 终止 (nt) 分子量 (道尔顿) ITR 1 120 - E1a 转录物 478 - 13S 576-664,1233-1436 28120 12S 576-1046,1233-1436 24389 9S 576-664,1233-1436 9962 转录物 1516 - E1b 转录物 1552 - 小T 1599 2171 22317 大T 1904 3412 55595 LX 3492 3920 14427 转录物 3959 - E2b 转录物 10349 - PTP 10349 8451 72930 聚合酶 8448 5083 127237 IVa2 5604 3980 50466 转录物 3960 - 28.1Kd 5155 5979 28141 Agnoprotein 7864 8580 25755 L1 转录物 10849 - 52/55D 10851 12025 - IIIa 12050 13819 65669 转录物 13832 - 转录物 13894 - L2 五邻体 13898 15490 59292 VII 15494 16078 21478 V 16123 17166 39568 Mu 17189 17422 8524 转录物 17422 - 转录物 17488 - L3 VI 17491 18222 26192 六邻体 18315 21116 104874 内切蛋白酶 20989 21783 28304 转录物 21811 - E2a 转录物 26782 - DBP 23386 21845 57358 转录物 21788 - L4 转录物 23406 - 100kD 23412 25805 88223 33kD同源 25525 26356 24538 VIII 26428 27111 24768 转录物 27421 - E3 转录物 26788 - Orf#1 27112 27432 12098 Orf#2 27385 28012 23040 Orf#3 27994 28527 19525 Orf#4 28557 29156 22567 Orf#5 29169 29783 22267 Orf#6 19798 30673 31458 Orf#7 30681 30956 10477 Orf#8 30962 31396 16523 Orf#9 31389 31796 15236 转录物 31837 - L5 转录物 32032 - 纤维 32035 33382 47670 转录物 33443 - E4 转录物 36135 - Orf7 33710 33462 9191 Orf6 34615 33710 35005 Orf4 34886 34521 13878 Orf3 35249 34896 13641 Orf2 35635 35246 14584 Orf4 36050 35676 13772 转录物 33437 - ITR 36343 36462 - Ad Pan-6(SEQ ID NO:5) 区域 起始 (nt) 终止 (nt) 分子量 (道尔顿) ITR 1 123 - E1a 转录物 478 - 13S 576-1143,1229-1437 28291 12S 576-1050,1229-1437 24634 9S 576-645,1229-1437 10102 转录物 1516 - E1b 转录物 1553 - 小T 1600 2172 22315 大T 1905 3413 55594 LX 3498 3926 14427 转录物 3965 - E2b 转录物 10341 - PTP 10340 8451 72570 聚合酶 8445 5089 126907 IVa2 5610 3986 50452 转录物 3966 - L1 转录物 10838 - 52/55kD 10840 12012 44205 IIIa 12036 13799 65460 转录物 13812 - 28.1Kd 5161 5985 28012 Agnoprotein 7870 8580 25382 L2 转录物 13874 - 五邻体 13878 15467 59314 VII 15471 16055 21508 V 16100 17137 39388 Mu 17160 17393 8506 转录物 17415 - L3 转录物 17466 - VI 17469 18188 25860 六邻体 18284 21112 106132 内切蛋白酶 21134 21754 23445 转录物 21803 - E2a 转录物 26780 - DBP 23375 21837 57299 转录物 21780 - L4 转录物 23398 - 100kD 23404 25806 88577 33kD同源 25523 26357 24609 VIII 26426 27109 24749 转录物 27419 - E3 转录物 26786 - Orf#1 27110 27430 12098 Orf#2 27384 28007 22880 Orf#3 27989 28519 19460 Orf#4 28553 29236 25403 Orf#5 29249 29860 22350 Orf#6 29875 30741 31028 Orf#7 30749 31024 10469 Orf#8 31030 31464 16540 Orf#9 31457 31864 15264 转录物 31907 - L5 转录物 32159 纤维 32162 33493 47364 转录物 33574 - E4 转录物 36276 - Orf7 33841 33593 9177 Orf6 34746 33841 35094 Orf4 35017 34652 13937 Orf3 35380 35027 13627 Orf2 35766 35377 14727 Orf4 36181 35807 13739 转录物 33558 - ITR 36482 36604 - Ad Pan-7(SEQ ID NO:9) 区域 起始 (nt) 终止 (nt) 分子量 (道尔顿) ITR 1 132 - E1a 转录物 478 - 13S 576-1143,1229-1437 28218 12S 576-1050,1229-1437 24561 9S 576-645,1229-1437 10102 转录物 1516 - E1b 转录物 1553 - 小T 1600 2178 22559 大T 1905 3419 55698 LX 3992 5616 50210 转录物 3971 - E2b 转录物 10340 - PTP 10340 8457 72297 聚合酶 8451 5095 126994 IVa2 3504 3932 14441 转录物 3972 - 28.1Kd 5167 5991 28028 Agnoprotein 7876 8586 25424 L1 转录物 10834 - 52/55kD 10836 12011 44302 IIIa 12035 13795 65339 转录物 13808 - L2 转录物 13870 - 五邻体 13874 15469 59494 VII 15473 16057 21339 V 16102 17139 39414 Mu 17167 17400 8506 转录物 17420 - L3 转录物 17467 - VI 17470 18198 26105 六邻体 18288 21086 104763 内切蛋白酶 21106 21732 23620 转录物 21781 - E2a 转录物 26764 - DBP 23353 21815 57199 转录物 21755 - L4 转录物 23370 - 100kD 23376 25781 88520 33kD同源 25489 26338 25155 VIII 26410 27093 24749 转录物 27403 - E3 转录物 26770 - Orf#1 27094 27414 12056 Orf#2 07368 27988 22667 Orf#3 27970 28500 19462 Orf#4 28530 29150 22999 Orf#5 29163 29777 22224 Orf#6 29792 30679 32153 Orf#7 30687 30962 10511 Orf#8 30968 31399 16388 Orf#9 31392 31799 15205 转录物 31842 - L5 转录物 32091 - 纤维 32094 33425 47344 转录物 33517 - E4 转录物 36208 - Orf7 33784 33536 9191 Orf6 34689 33784 35063 Orf4 34960 34595 13879 Orf3 35323 34970 13641 Orf2 35709 35320 14644 Orf4 36123 35749 13746 转录物 33501 - ITR 36404 36535 - Ad SV-1 (SEQ ID NO:24) Ad SV-25 (SEQ ID NO:29) Ad SV-39 (SEQ ID NO:34) 区域 起始 终止 起始 终止 起始 终止 ITR 1 106 1 133 1 150 E1a 352 1120 - - 404 1409 E1b 1301 2891 359 2273 1518 3877 E2b 9267 2882 9087 2754 10143 3868 E2a 24415 20281 24034 20086 25381 21228 E3 24974 27886 24791 25792 25790 29335 E4 33498 30881 30696 28163 33896 31157 ITR 34145 34264 30912 31044 33966 34115 Ad SV-1 (SEQ ID NO:24) Ad SV-25 (SEQ ID NO:29) Ad SV-39 (SEQ ID NO:34) 区域 起始 终止 起始 终止 起始 终止 ITR 1 106 1 133 1 150 L1 9513 12376 9343 12206 10416 13383 L2 12453 15757 12283 15696 13444 16877 L3 15910 20270 15768 20080 17783 21192 L4 21715 25603 21526 25420 22659 26427 L5 28059 30899 25320 28172 29513 31170 ITR 34145 34264 30912 31044 33966 34115 蛋白质 Ad SV-1,SEQ ID NO:24 起始 终止 分子量 ITR 1 106 - E1a 13S 459 953 18039 12S - E1b 小T - 大T 1301 2413 42293 LX 2391 2885 16882 IVa2 4354 2924 54087 聚合酶 6750 4027 102883 PTP 9257 7371 72413 Agnoprotein 6850 7455 20984 L1 52/55kD 9515 10642 42675 IIIa 10663 12372 636568 L2 五邻体 13454 13965 56725 VII 13968 14531 20397 V 14588 15625 39374 Mu 15645 15857 7568 L3 VI 15911 16753 30418 六邻体 16841 19636 104494 内切蛋白酶 19645 20262 23407 2a DBP 21700 20312 52107 L4 100kD 21721 24009 85508 VIII 24591 25292 25390 E3 Orf#1 25292 25609 11950 Orf#2 25563 26081 18940 Orf#3 26084 26893 30452 Orf#4 26908 27180 10232 Orf#5 27177 17512 12640 Orf#6 27505 27873 13639 L5 纤维#2 28059 29150 39472 纤维#1 29183 30867 61128 E4 Orf7 31098 30892 7837 Orf6 31982 31122 33921 Orf4 32277 31915 14338 Orf3 32629 32279 13386 Orf2 33018 32626 14753 Orf1 33423 33043 14301 ITR 34145 34264 蛋白质 Ad SV-25,SEQ ID NO:29 Ad SV-39,SEQ ID NO:34 起始 终止 分子量 起始 终止 分子量 ITR 1 133 - 1 150 - E1a 13S 492 1355 28585 12S 492 1355 25003 E1b 小T 478 1030 20274 1518 2075 21652 大T 829 2244 52310 1823 3349 55534 LX 2306 2716 13854 3434 3844 14075 IVa2 4208 2722 54675 3912 5141 46164 聚合酶 6581 3858 102839 7753 5033 103988 PTP 9087 7207 71326 10143 8335 69274 Agnoprotein 6681 7139 16025 L1 52/55kD 9345 10472 42703 10418 11608 44232 IIIa 10493 12202 63598 11574 13364 66078 L2 五邻体 12284 13901 56949 13448 16959 56292 VII 13806 14369 20369 14960 15517 20374 V 14426 15463 39289 15567 16628 39676 Mu 15483 15695 7598 16650 16871 7497 L3 VI 15749 16591 30347 16925 17695 28043 六邻体 16681 19446 104035 17785 20538 102579 内切蛋白酶 19455 20072 23338 20573 21181 22716 2a DBP 21511 20123 42189 22631 21231 53160 L4 100kD 21532 23829 85970 22659 25355 100362 VIII 24408 25109 25347 25410 26108 25229 E3 Orf#1 25109 25426 11890 26375 27484 42257 Orf#2 27580 28357 29785 Orf#3 28370 28645 10514 Orf#4 28863 29333 18835 Orf#5 Orf#6 L5 纤维#2 25380 26423 37529 纤维#1 26457 28136 60707 29515 31116 56382 E4 Orf7 31441 31118 11856 Orf6 29255 28395 33905 32292 31438 33437 Orf4 29550 29188 14399 32587 32222 13997 Orf3 29902 29552 13284 32954 32607 13353 Orf2 30291 29899 14853 33348 32959 14821 Orf1 30316 30696 14301 33764 33378 14235 ITR 30912 31044 33966 34115
Pan5、Pan6、Pan7、SV1、SV25和SV39腺病毒核酸序列可用于治疗制剂和构建各种载体系统和宿主细胞。如本文所用,载体所包含的适合的核酸分子包括,裸DNA、质粒、粘粒或游离体。这些序列和产物可单独应用,或与其它腺病毒序列或片段联用,或与其它腺病毒或非腺病毒的元件联用。本发明的腺病毒序列还可作为反义输送载体、基因治疗载体或疫苗载体应用。因此,本发明还提供含有本发明Ad序列的核酸分子、基因输送载体和宿主细胞。
例如,本发明包括含有本发明猿猴Ad ITR序列的核酸分子。另一实施例中,本发明提供含有本发明猿猴Ad序列编码所需Ad基因产物的核酸分子。本领域技术人员在阅读了本文提供的信息后不难明白采用本发明序列构建其它核酸分子。
一实施例中,本文鉴定的猿猴Ad基因区域可用于载体向细胞输送异源分子。例如,产生用于表达腺病毒衣壳蛋白(或其片段)的载体,目的是在包装宿主细胞中产生病毒载体。可设计顺式表达的这类载体。或者,可设计这类载体来提供稳定含有能表达所需腺病毒功能序列,如E1a、E1b、末端重复序列、E2a、E2b、E4、E4ORF6a区域的细胞。
此外,所述腺病毒基因序列或其片段可用于提供产生辅助依赖性病毒(如缺失了必须功能的腺病毒载体或腺相关病毒AAV)所必须的辅助功能。对于这种产生方法,本发明的猿猴腺病毒序列以类似于人Ad所述的方式用于此方法中。然而,由于本发明猿猴腺病毒和人Ad序列之间的序列差异,采用本发明的序列必须消除在带有人Ad E1功能的宿主细胞中与辅助功能同源重组的可能性,如在rAAV产生时可产生传染性腺病毒污染物的293细胞中。
在许多关于人腺病毒血清型的文献中已描述了利用腺病毒辅助功能产生rAAV的方法。见例如美国专利6,258,595和本文所引用参考文献。这些方法也可用于产生非人血清型AAV,包括非人灵长动物AAV血清型。本发明提供必须的辅助功能的猿猴腺病毒序列(如E1a、E1b、E2a和/或E4ORF6),在提供必须的腺病毒功能,同时尽量减少或消除与人类来源的rAAV-包装细胞中存在的任何其它腺病毒重组的可能性中,可能特别有用。因此这些rAAV产生方法中,可采用本发明选出的腺病毒序列的基因或开放阅读框。
或者,这些方法中可采用本发明的重组猿猴腺病毒载体。这类重组猿猴腺病毒载体可包括,例如杂交的黑猩猩Ad/AAV,其中黑猩猩Ad序列侧接由AAV 3’和/或5’ITR组成的rAAV表达盒,和在控制其表达的调控序列控制下的转基因。本领域技术人员会懂得本发明的其它猿猴腺病毒载体和/或基因序列还可用于产生依赖腺病毒辅助的rAAV和其它病毒。
在另一实施例中,设计了用于能在宿主细胞中输送和表达所选腺病毒基因产物以实现所需生理效应的核酸分子。例如,可将含有编码本发明腺病毒E1a蛋白序列的核酸分子输送给患者,用于癌症治疗。任选地,可用液体载体配制此分子,优选靶向癌细胞。这类制剂可与其它癌症治疗剂(如顺氯氨铂、紫杉醇等)联用。本领域技术人员不难明白,本文提供的腺病毒序列还有其它用途。
此外,本领域技术人员不难理解,可容易地采纳本发明的Ad序列用于各种病毒和非病毒载体系统以在体外、活体外或体内输送治疗性和免疫原性分子。例如,在各种rAd和非rAd载体系统中可采用本发明的Pan5、Pan6、Pan7、SV1、SV25和SV39猿猴Ad基因组。这些载体系统查包括,如质粒、慢病毒、逆转录病毒、痘病毒、痘苗病毒和腺病毒相关病毒系统。选择这些载体系统不是限制本发明。
本发明还提供用于产生本发明的猿猴和猿猴病毒衍生的蛋白质。载有多核苷酸的这种分子可包括裸DNA、质粒、病毒或其它基因元件形式的本发明猿猴Ad DNA序列。
B.本发明的腺病毒蛋白质
本发明还提供上述腺病毒的基因产物,如本发明腺病毒核酸编码的蛋白质、酶、及其片段。本发明还包括具有用其它方法产生的这些核酸序列编码的Pan5、Pan6、Pan7、SV1、SV25和SV39蛋白、酶及其片段。这些蛋白包括上表、图1和图2所鉴定的开放阅读框所编码的蛋白。
因此,本发明一方面提供基本纯的,即没有其它病毒和蛋白质的独特猿猴腺病毒蛋白质。优选这些蛋白质至少10%同源,更优选60%同源,最优选95%同源。
一实施例中,本发明提供独特的猿猴腺病毒衣壳蛋白。如本文所用,猿猴腺病毒衣壳蛋白包括上述含Pan5、Pan6、Pan7、SV1、SV25和SV39衣壳蛋白或其片段的任何腺病毒衣壳蛋白,包括但不限于,嵌合性衣壳蛋白、融合蛋白、人造衣壳蛋白、合成的衣壳蛋白和重组衣壳蛋白,不限于产生这些蛋白的方法。
适当地,这些猿猴腺病毒衍生的衣壳蛋白含有与这里所述的不同腺病毒血清型衣壳区或其片段,或修饰的猿猴衣壳蛋白或片段组合的一个或多个Pan5、Pan6、Pan7、SV1、SV25和SV39区域或及片段(如五邻体、六邻体、纤维或其片段)。“与嗜性改变相关的衣壳蛋白的修饰”在这里包括改变的衣壳蛋白,即五邻体、六邻体或纤维蛋白区域或其片段,如纤维区的球形突出区,或编码它们的多核苷酸,因此其特异性被改变。可用酏发明的一个或多个猿猴Ad或人或非人来源的其它Ad血清型来构建猿猴腺病毒衍生的衣壳。这类Ad可获自各种来源,包括ATCC、商业来源和学术来源,或此Ad序列可获自GenBank或其它合适来源。
本文提供本发明的猿猴腺病毒五邻体蛋白的氨基酸序列。SEQ ID NO:2中提供了Ad Pan5五邻体蛋白。SEQ ID NO:6中提供了Ad Pan7五邻体蛋白。SEQ ID NO:10中提供了Ad Pan6五邻体蛋白。SEQ ID NO:25中提供了SV1五邻体蛋白。SEQ ID NO:30中提供了SV25五邻体蛋白。SEQ ID NO:35中提供了SV39五邻体蛋白。任何这些五邻体蛋白或其特定片可用于各种目的。合适片段的例子包括根据以上提供的氨基酸编号和SEQ ID NO:2、SEQ ID NO:6、SEQ ID NO:25、SEQ ID NO:30或SEQ ID NO:35中的N-端和/或C-端截短的约50,100,150,或200个氨基酸的五邻体。其它合适的片段包括较短的内部片段、C-端或N-端片段。也可修饰该五邻体蛋白用于本领域技术人员已知的各种目的。
本发明还提供Pan5(SEQ ID NO:3)、Pan6(SEQ ID NO:7)、Pan7(SEQ ID NO:11)、SV1(SEQ ID NO:26)、SV25(SEQ ID NO:31)和/或SV39(SEQ ID NO:36)六邻体蛋白的氨基酸序列。适合的该六邻体蛋白或其特定片段可用于各种目的。合适片段的例子包括根据以上提供的氨基酸编号和SEQ ID NO:3,7,11,26,31和36中的N-端和/或C-端截短的约50,100,150,200,300,400或500个氨基酸的六邻体。其它合适的片段包括较短的内部片段、C-端或N-端片段。例如,一个合适的片段是该六邻体蛋白的环区(结构域),命名为DE1和FG1,或其超变区。这些片段包括跨越该猿猴病毒六邻体蛋白的氨基酸残基约125-443;约138-441;或较小片段如跨越残基约138-163;170-176;195-203;223-246;253-264;287-297;和404-430的区域,参见SEQ ID NO:3、7、11、26、31或36。本领域技术人员不难鉴定其它合适的片段。也可修饰该六邻体蛋白用于本领域技术人员已知的各种目的。因为该六邻体蛋白是腺病毒血清型的决定簇。人造六邻体蛋白可产生具有人造血清型的腺病毒。其它人造衣壳蛋白也可用本发明的黑猩猩Ad五邻体序列和/或纤维序列和/或其片段构建。
一实施例中,可能需要用本发明六邻体蛋白序列产生具有改变的六邻体蛋白的腺病毒。改变六邻体蛋白的一种适宜方法见美国专利5,922,315中所述,纳入本文参考文献。此法中,用另一腺病毒血清型至少一个环区改变此腺病毒的至少一个环区。因此被改变的腺病毒六邻体蛋白至少一个环区是本发明猿猴Ad六邻体的环区(如Pan7)。一实施例中,Pan7六邻体蛋白的一个环区被另一腺病毒血清型的一不区所替代。另一实施例中,用Pan7六邻体的该环区来替代另一腺病毒血清型的环区。合适的腺病毒血清型如本文所述不难从人或非人血清型中选择。选择Pan7目的只是为了说明,也同样选择本发明其它猿猴Ad六邻体蛋白,或用于改变另一Ad六邻体。选择合适的血清型不是限制本发明。本领域技术人员不难明白本发明的六邻体蛋白还有其它用途。
本发明还包括本发明的猿猴腺病毒的纤维蛋白。Ad Pan5的纤维蛋白具有SEQ IDNO:4的氨基酸序列。Ad Pan6的纤维蛋白具有SEQ ID NO:8的氨基酸序列。Ad Pan7的纤维蛋白具有SEQ ID NO:12的氨基酸序列。SV1具有两种纤维蛋白,纤维2具有SEQ ID NO:27的氨基酸序列;纤维1具有SEQ ID NO:28的氨基酸序列。SV-25也有两种纤维蛋白,纤维2具有SEQ ID NO:32的氨基酸序列;纤维1具有SEQ IDNO:33的氨基酸序列。SV-39纤维蛋白具有SEQ ID NO:37的氨基酸序列。
此纤维蛋白或其特定片段可适当的用于各种目的。一种合适的片段是跨越SEQID NO:4、8、12、28、32、33和37的氨基酸约247-425的纤维瘤。见图2。其它合适的片段例子包括具有根据以上提供的氨基酸编号和SEQ ID NO:4、8、12、28、32、33和37的约50、100、150或200个氨基酸的N-端和/或C-端截短的纤维。其它合适的片段包括内部片段。也可用本领域技术人员熟知的各种技术修饰此纤维蛋白。
本发明还包括本发明至少长8个氨基酸的独特蛋白片段。此外,本发明包括被引入后能提高Pan5、Pan6、Pan7、SV1、SV25或SV39基因产物产量和/或表达的此类修饰,如融合分子构建物,该构建物中,Pan5、Pan6、Pan7、SV1、SV25或SV39基因产物的全部或某片段融合(直接或通过一接头)于一融合伴侣而得到增强。其它合适的修饰包括但不限于截短一编码区(如一蛋白或酶)以去除通常被切断的前蛋白或原蛋白产生成熟的蛋白或酶。和/或突变编码区以提供分泌性基因产物。本领域技术人员不难明白还可进行其它修饰。本发明还包括与本文提供的Pan5、Pan6、Pan7、SV1、SV25或SV39蛋白同一性至少约95%-99%的蛋白质。
如本文所述,本发明的含本发明腺病毒衣壳蛋白的载体特别适合于以下应用,即用中和抗体来去除其它Ad血清型载体以及其它病毒载体的作用。本发明的rAd载体在反复基因治疗或为加强免疫应答(疫苗的效价)的重复给药中有特殊优点。
某些情况下可能需要采用Pan5、Pan6、Pan7、SV1、SV25或SV39基因产物(如衣壳蛋白或其片段)的一种或多种来产生抗体。术语“抗体”本文用于指能特异性结合某表位的免疫球蛋白分子。本发明的抗体能优先特异性结合Pan5、Pan6、Pan7、SV1、SV25或SV39表位而无交叉反应。本发明的抗体可有各种形式,如高亲和力多克隆抗体、单克隆抗体、合成抗体、嵌合性抗体、重组抗体和人源化抗体。这些抗体有免疫球蛋白IgG,IgM,IgA,IgD和IgE类型。
这些抗体也可采用本领域已知许多方法这之一产生。可用熟知的常规技术如Kohler和Milstein及其众多已知改进的技术来产生适合的抗体。可用已知的重组技术开发针对这些抗原的多在隆或单克隆抗体,以产生类似的所需高效价抗体(见例如PTC专利申请No.PTC/GB85/00392;英国专利申请公开号GB2188638;Amit等,Science.233:747-753,1986;Queen等,Proc.Nat’l.Acad.Sci.USA.86:10029-10033,1989;PTC专利申请PTC/WO9007861;和Riechmann等,Nature.332:323-327,1988;Huse等,Science.246:1275-1281,1998a)。或者,通过操作针对本发明抗原的动物或人抗体的互补决定区产生这类抗体,见例如,E Mark和Padlin的“单克隆抗体的人源化”(Humanization of Monoclonal Antibodies),第四章,实验药物手册,第13卷,单克隆抗体的药理学,Springer-Verlag(1994年6月);Harlow等,“抗体使用实验室手册”(Using Antibodies:A Laboratory Manual),Cold Spring Harbor Laboratory Press.NY;Harlow等,1989,“抗体实验室手册”(Antibodies:A Laboratory Manual).Cold Spring Harbor,New York:Harlow等,Proc.Natl.Acad.Sci.USA 85:5879-5883,1988;和Bird等,Science.242:423-426,1988。本发明还提供抗独特型抗体(Ab2)和抗-抗-独特型抗体(Ab3)。例如参见M.Wettendorff等,“用抗独特型抗体调节抗肿瘤免疫力”(Modulation of anti-tumor immunity by anti-idiotypic antibodies),收录在“独特型网络和疾病”(Idiotypic Network and Diseases),J.Cerny和J.Hiernaux编,1990,J.Am.Soc.Microbiol.,华盛顿特区:203-229页)。可用本邻域技术人员所熟知的技术生产这些抗独特型和抗-抗-独特型抗体。这些抗体可用于各种目的,包括诊断和临床方法及试剂盒。
在某些情况下可能需要在本发明的Pan5、Pan6、Pan7、SV1、SV25或SV39基因产物、抗体或其它构建物上引入一可检测标志可一个尾部。本文所用的可检测标志是单独或与另一分子反应时能提供可检测信号的分子。最理想的是该标志可肉眼检测,如通过荧光,便于在免疫组化分析或免疫荧光显微镜中使用。例如,合适的标志包括:异硫氰基荧光系(FITC)、藻红蛋白(PE)、别藻兰蛋白(APC)、coriphosphine-O(CPO)或串联染料,PE-花青-5(PC5)和PE-得克萨斯红(ECD)。所有这些荧光染料可从商品获得,它们的用途是本领域知道的。其它有用的标志是胶体金标志。其它有用的标志还包括放射活性化合物或元系。此外,标记还包括各种酶系统,它们能在试验中产生有色信号,如葡萄糖氧化酶(利用葡萄糖作底物)可释放过氧化氢,存在过氧化物酶和氢供体如四甲基联苯胺(TMB)时,此产物可产生呈兰色可见的氧化TMB。其它例子包括辣根过氧化物酶(HRP)、碱性磷酸酶(AP)、已糖激酶和能与葡萄糖-6-磷酸脱氢酶的结合,后者能与ATP、葡萄糖和NAD+反应,在其它产物中产生NADH,其因340nm波长吸光值增加而可检测。
本发明方法所用的其它标记系统可通过其它方法,如着色的乳胶微粒(Bangs实验室,印第安纳州)来检测,该微粒中包埋有染料用于代替酶与靶序列形成偶联物,从而提供可视信号,表明应用试验中存在所述复合物。
将标记与所需分子偶联或结合的方法是类似的常规方法,为本领域技术人员所知道标记物结合的已知方法已有描述(见例如,“荧光探针和研究用化学制剂手册”(Handbook of Fluorescent probes and Research Chemicals),第六版,R.P.M.Haugland,Molecular Probes有限公司,Eugene,OR,1996;“Pierce目录和手册之生活科学和分析研究产品”(Pierce Catalog and Handbook,Life Scienceand Analytical Research Products),Pierce Chemical公司,Rockford.IL,1994/1995)。标记和偶联方法的选择不限制本发明的范围。
本发明的序列、蛋白和片段可用任何适当的方法产生,包括重组产生、化学合成或其它合成方法。合适的生产技术是本领域技术人员报熟知的,见例如,Sambrook等,“分子克隆实验室手册”(Molecular Cloning:A Laboratory Manual),ColdSpring Harbor Press(纽约,冷泉港)。或者也可用熟知的固相肽合成法(Merrifield,J.Am.Chem.Soc.,85:2149,1962:Stewart和Young,“固相肽合成”(Solid Phase Peptide Synthesis)(Freeman,旧金山,1969),27-62页)来合成肽。这些和其它合适的生产方法都在本领域技术人员知识范围内,不限制本发明。
此外,本领域技术人员不难理解,查容易地采纳本发明的Ad序列用于各中病毒和非病毒载体系统,在体外、活体外或体内输送治疗和免疫原性分子。例如,一实施例中,可将本文所述猿猴Ad衣壳蛋白和其它腺病毒蛋白,用于基因、蛋白质和其它所需的诊断性、治疗性和免疫原性分子的基于非病毒性蛋白的输送。一个这样的实施例中,将本发明的蛋白质直接或间接连接于能靶向带有腺病毒受体的细胞的分子。优选可作为细胞表面受体的配体的衣壳蛋白,如六邻体、五邻体、纤维蛋白或其片段。可从本文所述治疗分子和它们的基因产物中选择适合用于输送的分子。各种接头,包括脂质、聚赖氨酸等可用作接头。例如,为了用类似于Medina-Kauwe LK等,Gene Ther.2001 May;8(10):795-803和Medina-Kauwe LK等,Gene Ther.2001 Dec;8(23):1753-1761。所述方法用猿猴五邻体序列产生一种融合蛋白,而容易地利用该猿猴五邻体蛋白。或者,可利用猿猴Ad蛋白IX的氨基酸序列作为向细胞表面靶向输送的载体,见美国专利申请20010047081。合适的配体包括CD40抗原、含RGD或含多聚赖氮酸的序列等。其它猿猴Ad蛋白,包括例如,六邻体蛋白和/或纤维蛋白,可用于这些目的和类似目的。
本发明的其它腺病毒蛋白可单独或与其它腺病毒蛋白联用于各种目的,本领域技术人员不难明白这一点。此外本领域技术人员不难理解本发明的腺病毒蛋白还有其它用途。
II重组腺病毒载体
本发明的组合物包括用于治疗或疫苗目的的能向细胞输送异源分子的载体。本文所用的载体包含的基因元件有,包括但不限于裸DNA、噬菌体、转座子、粘粒、游离体、质粒或病毒。这些载体含有猿猴腺病毒Pan5、Pan6、Pan7、SV1、SV25和/或SV39和小基因的DNA。“小基因”指所选异源基因和在宿主细胞中发生翻译、转录和/或基因产物表达所必须的其它调控元件的组合。
通常设计本发明腺病毒载体,以使该小基因位于核酸分子中含有与所选腺病毒基因天然在一起的其它腺病毒序列的区域中。如需要,可将该小基因插入现有基因区域中以破坏该区域的功能。或者,可将该小基因插入已部分或全部缺失腺病毒基因的区域。例如,可将该小基因置于功能性E1缺失或功能性E3缺失的部位,也可从其它部位选择。术语“功能性缺失”或“功能性缺省”指通过突变或修饰去除或损伤了该基因区足够数量的核苷酸,因而该基因区不再能产生功能性基因表达产物。如需要,可去除整个基因区域。基因破坏或缺失的其它合适部位在本申请其它地方讨论。
例如,为了产生能用于产生重组病毒的载体,该载体可含此小基因,和腺病毒基因组的5’端或3’端,或5’和3’端。所述腺病毒基因组的5’端含有包装和复制所必须的5’顺式元件,即5’反式末端重复(ITR)序列(其功能是复起点)和天然5’包装增强功能域(含有包装线性Ad基因组所需的序列和E1启动子的增强子元件)。所述腺病毒的3’端包含包装和衣壳化所必须的3’-顺式元件(包括ITRs)。重组腺病毒宜含有5’和3’-腺病毒顺式元件,该小基因宜位于5’和3’-腺病毒序列之间。本发明的任何腺病毒载体也可含其它腺病毒序列。
本发明的腺病毒载体宜含衍生自本发明腺病毒基因组的一种或多种腺病毒元件。一实施例中,该载体含腺病毒Pan5、Pan6、Pan7、SV1、SV25或SV39的ITR和同一腺病毒血清型的其它腺病毒序列。另一实施例中,该载体含不同腺病毒血清型而非提供此ITR腺病毒衍生的腺病毒序列。如本文所述,假型腺病毒指其衣壳蛋白来自不同血清型而非提供此ITR血清型的腺病毒。选择载体中存在的ITR血清型和其它腺病毒序列的血清型不意味限制本发明。各种腺病毒株可从美国模式培养物保藏所(马纳萨斯,弗吉尼亚)获得,或通过咨询各种商业和学术来源而获得。此外,许多菌株的序列可从各种数据库,包括例如,PubMed和GenBank获得。发表的文献中已描述了从其它猿猴或人的腺病毒制备的同源性腺病毒载体(见例如,美国专利5,240,846)。许多类型腺病毒的DNA序列可获自GenBank,包括Ad5(GenBank登录号M73260)。可获得任何已知腺病毒血清型。如血清型2.3.4.7.12和40的腺病毒序列,还包括目前已鉴定的人类型。在本发明的载体构建中也可采用已知能感染非人动物(如猿猴)的相似腺病毒。见例如,美国专利6,083,716。
在构建本文所述载体中所用的病毒序列、辅助病毒,如需要还有重组病毒颗粒、其它载体成分和序列可如上所述来获得。本发明的Pan5、Pan6、Pan7、SV1、SV25或SV39猿猴腺病毒序列可用于构建此类载体和用于制备此载体的细胞系。
对形成本发明载体的核酸序列的修饰,包括序列缺失、插入和可用标准分子生物学技术产生的其它突变,这些修饰属于本发明的范围。
A.“小基因”
采用方法来选择转基因,克隆并构建该小基因,将其插入病毒载体中是本领域的技术,本文说明如下:
1.转基因
转基因是一种编码多肽、蛋白质或其它感兴趣产物的核酸序列,与侧接该转基因的其它载体序列异源。可将该核酸编码序列以允许转基因在宿主细胞中转录、翻译和/或表达的方式,可操作性连接于调控成分。
转基因序列的组成取决于将要产生的载体的用途。例如,一种转基因序列含有一报告序列,其表达时产生一可检测信号。此类报告序列包括但不限于:编码β-内酰胺酶、β-半乳糖苷酶(LacZ)、碱性磷酸酶、胸苷激酶、绿色荧光蛋白(GFP)、氯霉素乙酰转移酶(CAT)、荧光素酶、膜结合蛋白,包括例如CD2、CD4、CD8、流感血凝素蛋白和其它本领域熟知的、对其存在着有或可用常规方法产生的高亲和力抗体的报告分子,以及含有适当融合于血凝素或Myc的抗原标志结构域的膜结合蛋白的融合蛋白质。这些编码序列当与能趋使其表达的调控元件连接时,可提供能用常规方法检测的信号,这些方法包括酶试验、放射图象、比色、荧光试验或其它光谱试验、荧光激活细胞选拣试验和免疫试验,包括酶连免疫吸附试验(ELISA)、放射免疫试验(RIA)和免疫组织化学。例如,当该标志序列是LacZ基因时,存在携带该信号的载体可用β-半乳糖苷酶活性检测。当该转基因是GFP或荧光素酶时,携带该信号的载体可在发光仪中通过观察颜色或发光产物来检测。
然而,需要的转基因是一种无标志序列,其编码的产物可用于生物学和医学,如蛋白质、肽、RNA、酶或催化性RNA。需要的RNA分子包括tRNA、dsRNA、核糖体RNA、催化性RNA和反义RNA。有用RNA序列的一个例子,是能抑制经治疗动物中的靶向核酸序列表达的序列。
该转基因可作为癌症治疗或疫苗,引发免疫应答和/或作为预防性疫苗,用于治疗例如遗传缺陷。本文所述免疫应答的引发指一种分子(如基因产物)引发对该分子的T细胞和/或体液免疫应答反应的能力。本发明还包括用多个转基因来纠正或减轻多亚基蛋白质所致疾病。某些情况下,可用不同的转基因来编码某蛋白质的各亚基,或编码不同的肽或蛋白质。当编码该蛋白的DNA分子很大时,如编码免疫球蛋白、血小板生长因子或肌营养不良性蛋白时。为了使细胞能产生此多亚基蛋白,可用含不同亚基每一种的重组病毒感染细胞。或者,可用同一转基因编码某蛋白的不同亚基。此时,一个转基因就包含了编码各亚基的DNA,编码各亚基的DNA被一内部核糖体进入位点(IRES)分隔。当编码各亚基的DNA分子很小时,如编码诸亚基和IRES的DNA总分子小于5kd时需要这样。作为IRES的替代物,可用含2A肽(翻译后其可自身切断)的序列来分隔此DNA。见例如,M.L.Donnelly等,J.Gen.Virol.,78(Pt1):13-21(1997.1);Fueler,S.等,Gene Ther.,8(11):864-73(2001.6);Klump H.等,Gene Ther.,8(11):811-17(2001.5)。此2A肽明显小于IRES,使其在间隔为有限因素时很适用。然而,所选的转基因可编码任何生物活性产物或其它产物,如研究所需的产物。
本领域技术人员不难选出适合的转基因。转基因的选择不应认为是对本发明的限制。
2.调控元件
除了上述微基因的主要元件外,该载体还包含操作性连接于此转基因的必须的常规调控元件,它们以允许此转基因在该质粒转染的细胞中,或在用本发明产生的病毒感染的细胞中转录、翻译和/或表达的方式调控。本文所用的“操作性相连接的”序列包括与感兴趣基因毗邻连接的表达调控序列和反式作用或远距离调控感兴趣基因的表达调控序列。
表达调控序列包括适当的转录启始、终止、启动子和增强子序列;有效的RNA加工信号如剪切和聚腺苷酸(polyA)信号;稳定胞浆mRNA的序列;增强翻译效率的序列(即Kozak共有序列);提高蛋白质稳定性的序列;以及当需要时能提高编码产物分泌的序列。大量的表达调控序列包括天然的、组成型、诱导型和/或组织特异性启动子,是本领域已知,可利用的。
组成型启动子的例子包括但不限于逆转录病毒罗斯肉瘤病毒(RSV)LTR启动子(任选地含RSV增强子)、巨细胞病毒(CMV)启动子(任选地含CMV增强子)(见例如,Boshart等,Cell.41:521-30,1985)SV40启动子、二氢叶酸还原酶启动子、β-肌动蛋白启动子、磷酸甘油激酶(PGK)启动子和EF1α启动子(Invitrogen)。
可诱导启动子可调节基因的表达,并受外源提供的化合物、环境因素如温度或存在的特定生理状态如急性期、细胞的特定分化阶段的调控,或只在复制性细胞中受调节。可诱导启动子和可诱导系统可从各种商业来源获得,包括但不限于Invitrogen、Clontech和Ariad。已报导了许多其它系统,本领域技术人员不难选择。例如,可诱导启动子包括锌-可诱导绵羊金属硫蛋白(MT)癖动子和地塞米松(Dex)可诱导小鼠乳腺瘤病毒(MMTV)启动子。其它可诱导系统包括T7聚合酶启动子系统(WO98/10088);蜕皮激素插入启动子(No等,Proc.Natl.Acad.Sci.USA,93:3346-3351,1996)、四环素阻抑系统(Gossen等,Proc.Natl.Acad.Sci.USA,89:5547-5551,1992),也见Harvey等,Curr.Opin.Chem.Biol.,2:512-518,1998)。其它系统包括FK506双体、采用castradiol、二酚基muri slerone的VP16或p65、RU486-可诱导系统(Wang等,Nat.biotech.15:239-243,1997)和Wang等,Gene Ther.,4:432-441,1997)和雷帕霉素可诱导系统(Magari等,J.Clin.Invest.,100:2865-2872,1997)。某些可诱导启动子的作用随时间而提高。此种情况时,可通过插入多个串联阻抑物,如TetR经IRES连接于TetR,来提高此系统的效果。或者,可等侍至少三天再筛选所需功能。可通过已知能增强此系统效果的方法来提高所需蛋白质的表达。例如,采用鸭肝炎病毒转录后调节元件(WPRE)。
另一实施例中,采用该转基因的天然启动子。当需要该基因的表达要模拟天然表达时可优选天然启动子。当该转基因的表达必须受温度或发育的调节时,或以组织特异性方式,或以对特异性转录刺激应答的方式时,可采用天然启动子。另一实施例中,可采用其它天然表达调控元件,如增强子元件、聚腺苷酸位点或Kozak共有序列来模拟天然表达。
转基因的另一实施例包括与组织特异性启动子操作性连接的转基因。例如,如需要在骨骼肌中表达,应采用在肌肉中有活性的启动子。这些启动子包括:编码骨骼β-肌动蛋白、肌球蛋白轻链2A、肌营养不良蛋白、肌酸激酶的启动子,以及活性比天然启动子高的合成性肌肉启动子(见Li等,Nat.Biotech.17:241-245,1999)。其它之中,启动子的例子有:已知的肝组织特异性启动子(白蛋白,Miyatake等,J.Viol.,71:5124-5132,1997);乙肝病毒核启动子(Sandig等,Gene Ther.,3:1002-1009,1996);甲胎蛋白(AFP)(Arbuthnot等,Hum.Gene.Ther.,7:1503-1514,1996);骨钙蛋白(Stein等,Mol.Biol.Rep.,24:185-196,1997);骨涎蛋白(Chen等,J.Bone.Miner.Res.,11:654-664,1996);淋巴细胞(CD2,Hansal等,J.Immunol.,161:1063-1068,1998);免疫球蛋白重链;T细胞受体链;神经特异性烊烯醇化酶(NSE)启动子(Andersen等,Cell.Mol.Neurobiol.,13:503-515,1993);神经纤丝轻链(Piccioli等,Proc.natl.Acad.Sci.USA,88:5611-5615,1991);和神经元特异性vgf基因(Piccioli等,neuron.15:373-384,1995)。
任选地,含编码治疗用和免疫原性产物转基因的载体,可包含可检测标记,或报告报告基因可含有编码遗传霉素、潮霉素或嘌呤霉素抗性序列。这些可选择性报告或标记基因(优选位于被包装入病毒粒子之中的病毒基因组之外)可用作信号,表明细菌细胞中存在该质粒。该载体的其它组分包括复制起点。这些和其它启动子与载体的选择是常规工作,许多这样的序列可以得到(见例如,Sambrook等及本文引用的参考文献)。
可用本文提供的序列,结合本领域技术人员已知的技术产生这些载体。这些技术包括cDNA的常规克隆技术,如教课书中所述(Sambrook等,“分子克隆实验手册”,Cold Spring Harbor Press,纽约冷泉港),采用腺病毒基因组重叠的寡核苷酸序列、聚合酶链反应和能提供所需核苷酸序列的任何适合方法。
III.重组病毒粒子的产生
一实施例中,用猿猴腺病毒质粒(或其它载体)来产生重组腺病毒颗粒。一实施例中,该重组腺病毒功能性缺失E1a或E1b基因,任选地含其它突变,如温度敏感性突变或其它基因缺失。其它实施例中,需要保留此重组腺病毒中的E1a和/或E1b区。完整的E1区可位于腺病毒基因组中其天然所位于的位置,或置于天然腺病毒基因组中的缺失部位(如E3区中)。
构建向人(或其它哺乳动物)细胞输送基因的有用的猿猴腺病毒载体时,此载体中可采用某范围的腺病毒核酸序列。例如,可从形成该重组病毒一部分的猿猴腺病毒序列中去掉腺病毒的延迟早期基因E3。猿猴E3的功能据信与该重组病毒颗粒的功能和产生无关。也可构建具有E4基因的至少ORF6区域缺失的猿猴腺病毒载体,更理想是缺失整个E4区,因为该区域功能过剩。本发明另一载体含缺失的延迟早期基因E2a。也可在猿猴腺病毒基因组中制作晚期基因L1-L5之一的缺失。类似的,中期基因IX和IVa2的缺失可用于某些目的。可在腺病毒的其它结构性或非结构性基因中制作其它缺失。上述缺失可单独应用,即用于本发明的腺病毒序列可只含一个区域的缺失。或者,可联合利用能破坏其生物活性的整个基因或其部分的缺失。例如,一示范性载体中,腺病毒序列缺失了E1基因和E4基因,或缺失E1,E2和E3基因,或E1和E3基因,或E1,E2a和E4基因以及缺失或不缺失E3基因等。如上所述可上联合利用这种缺失与其它突变,如温度敏感性突变,以达到所需结果。
可在缺乏病毒传染力和腺病毒粒子增殖所需的腺病毒基因产物时培养缺失任何必须的腺病毒序列(如E1a、E1b、E2a、E2b、E4 ORF6、L1、L2、L3、L4和L5)的腺病毒载体。在存在一种或多种辅助性构建物(如质粒或病毒)时或包装的宿主细胞中,培养所述腺病毒载体可提供这些辅助功能。见例如,1996.5.9公布的国际专利申请WO96/13597(纳入本文参考)中制备:“最小”人Ad载体所述技术。
1.辅助性病毒
因此,取决于所用的携带小基因的病毒载体中的猿猴腺病毒基因组成,辅助性腺病毒或非复制性病毒片段可能是必须的,以提供产生含该小基因的传染性重组病毒粒子所需的充分猿猴腺病毒基因序列。有用的辅助性病毒所含经过挑选的腺病毒基因序列在所述腺病毒载体构建物中不存在和/或不为转染此载体的包装细胞系所表达。一实施例中,该辅助病毒是复制缺陷型,含除上述序列以外的各种腺病毒基因。这类辅助病毒与E1表达细胞系联用为理想。
也可使辅助病毒形成聚阳离子偶联物,如Wu等,J.Biol.Chem.,264:16985-16987,1989;K.J.Fisher和J.M.Wilson.,Biochem.J.,299;49(1994.4.1)所述。辅助病毒可任选含第二报告小基因。许多这类报告基因是本领域已知的。辅助病毒上存在与腺病毒载体上转基因不同的报告基因,使得能独自监测Ad载体和辅助病毒。利用此第二报告子得以在纯化时分离产生的重组病毒与辅助病毒。
2.互补细胞系
为了产生缺失上述任一基因的重组猿猴腺病毒(Ad),如果此缺失基因区的功能为病毒复制和传染力所必须,应通过辅助病毒或细胞系,如互补或包装细胞系,补充此重组病毒。许多情况下可用表达人E1的细胞系反式互补此黑猩猩Ad载体。这有特殊优点,因为在目前可得的包装细胞中发现本发明黑猩猩Ad序列和人AdE1序列之间存在多样性,采用目前的含人E1细胞阻止了复制和生产过程中产生复制活性腺病毒。然而,某些情况下,需要利用能表达E1基因产物的细胞系,可用于产生E1缺失的猿猴腺病毒。已描述了这类细胞系,见美国专利6,083,716。
如需要,可采用本文提供的序列来产生包装细胞或细胞系,这些细胞在一启动子的转录控制下,能在所选亲代细胞系中,以最小程度表达Pan5、Pan6、Pan7、SV1、SV25或SV39的腺病毒E1基因。为此目的可采用诱导型或组成型启动子。选择能产生表达任何所需Ad Pan5、Pan6、Pan7、SV1、SV25或SV39基因的新细胞系。不受限制,此类亲代细胞系可以是其中的Hela(ATCC登录号CCL2)、A549(ATCC,CCL185)、HEK 293、KB(CCL17)、Detroit(如Detroit510、CCL72)和WI-38(CCL75)细胞。这些细胞可从美国模式培养物保藏所,10801 University Boulevard,马纳萨斯,弗吉尼亚,20110-2209获得。其它适合的亲代细胞系可从其它来源获得。
这种E1表达细胞系可用于产生重组猿朱病毒E1缺失载体。此外,或者,本发明提供的能表达一种或多种猿猴腺病毒基因产物,如E1a、E1b、E2a和/或E4ORF6,的细胞系可用产生重组猿猴病毒载体基本相同的方法构建。可用此细胞系反式互补缺失了编码这些产物所必须基因的腺病毒载体,或提供辅助依赖性病毒(如腺相关病毒)包装所必须的辅助功能。制备本发明宿主细胞涉及安装所选DNA序列等技术。这种安装可用常规技术进行。此类技术包括cDNA和基因组克隆,是众所周知的,见上述Sambrook等。利用腺病毒基因组的重叠寡核苷酸序列,结合聚合酶链反应、合成方法和其它合适的方法,可提供所需的核苷酸序列。
或者,通过腺病毒载体和/或辅助病毒反式提供必须的腺病毒基因产物。例如,可从生物中,包括原核细胞(如细菌)和真核细胞,包括昆虫细胞、酵母细胞和哺乳动物细胞。选择合适的宿主细胞。特别理想的宿主细胞选自哺乳动物细胞,包括但不限于A549、WEHI、3T3、10T1/2、HEK293细胞或PERC6(这二者表达功能性腺病毒E1)(Fallaux,FJ等,Hum Gene Ther,9:1909-1917,1998)、saos、C2C12、L细胞、HT1080、HepG2和原代成纤维细胞、肝细胞和衍生自哺乳动物,包括人、猴、小鼠、大鼠、家兔和仑鼠的成肌细胞。提供此细胞的哺乳动物种类选择不限于本发明,也不限于成纤维细胞、肝细胞、肿瘤细胞等哺乳动物细胞。
3.病毒粒子的安装和细胞系的转染
通常,当用转染输送含小基因的载体时,该载体输送量约5-100μg DNA,优选约10-50μg DNA,向约1×104-1×1013个细胞,优选向约105个细胞提供。然而,考虑所选载体、输送方法和所选宿主细胞等因素,可调整向宿主细胞输送的载体DNA相对量。
此载体可以是本领域已知的或上述的任何载体,包括裸DNA、质粒、噬菌体、转座子、粘粒、游离体、病毒等。将此载体引入宿主细胞可用本领域已知或上述方法进行,包括转染和感染。将一种或多种腺病毒基因稳定整合入宿主细胞有基因组中,作为游离体稳定表达,或短时表达。可在游离体上,或稳定整合的基因组中表达基因产物,某些基因产物可稳定表达,而其它短时表达。而且可独立地从组成型启动子、诱导型启动子或腺病毒天然启动子中为各腺病毒基因选择启动子。例如,生物或细胞的特定生理状态(即分化状态或复制或静止状态细胞),或外加因子可调节这些启动子。
可用本领域技术人员已知的技术和如本说明书所述,将该分子(质粒或病毒)引入宿主细胞。优选实施例中,采用标准技术,如CaPO4转染或电穿孔。
将所选腺病毒DNA序列以及转基因和其它载体元件装配入各种中间质粒和用此质粒和载体产生重组病毒颗粒,均可用常规技术进行。这类技术包括如教课书所述的常规cDNA克隆技术(Sambrook等,见上述)。利用腺病毒基因组的重叠寡核苷酸序列、聚合酶链反应、合成方法组,可提供所需的核苷酸序列。可采用标准的转染和共转染技术,如CaPO4沉淀技术。其它可用的常技术包括病毒基因的同源性重组、琼脂叠层病毒空斑、测定信号产生的方法等。
例如,构建和装配含所需小基因的病毒载体后,在存在辅助病毒时将此载体转染入包装细胞系中。辅肋和载体序列之间发生同源重组,使该载体中的腺病毒-转基因序列得以复制并包装入病毒粒子衣壳中,产生重组病毒载体颗粒。产生此类病毒颗粒的现有方法以转染为基础。然而,本发明不限于这类方法。可用产生的重组猿猴病毒将所选转基因转移到所选细胞中。重组病毒在包装细胞系中增殖的体内实验中本发明E1缺失重组猿猴腺病毒载体证明可用于将转基因转移到非猿猴,优选人的细胞中。
IV.重组腺病毒载体的用途
可用本发明的重组猿猴腺病毒载体将基因体外、活体外、的体内输送给病人或非猿猴病兽。
本文所述重组腺病毒载体可用作表达载体,来体外产生异源基因编码的产物。例如,可将含插入到E1缺失部位某基因的重组腺病毒载体,转染入上述E1表达细胞系中。或者,在其它所选细胞系中采用复制活性腺病毒。然后以常规方法培养转染细胞,使该重组腺病毒表达此启动子的基因产物,然后用蛋白分离和培养回收的常规技术,从培养液中回收基因产物。
本发明Pan5、Pan6、Pan7、SV1、SV25或SV39衍生的猿猴腺病毒载体提供了有效的基因转移载体,可在体外或活体外,将所选转基因输送给所选宿主细胞,甚至在该生物已有一种或多种AAV血清型的中和抗体时。一实施例中,在活体外混合rAAV和细胞,用常规方法培养已感染的细胞,将转导细胞重输回给病人。这些组合物特别适合治疗基因的输送和免疫接种,诱导保护性免疫力。
更常见,本发明的Pan5、Pan6、Pan7、SV1、SV25或SV39重组腺病毒载体用作输送治疗性和免疫原性分子,见以下所述。不难理解这两种用途。本发明的重组腺病毒特别适合反复输送重组腺病毒载体的治疗方案。此方案通常包括输送一系列病毒衣壳已改变的病毒载体。每次后续给药,或给予具体血清型衣壳预定次数(一、二、三、四或五次)后,病毒的衣壳都可能改变。此给药方案包括输送含第一猿猴腺病毒衣壳的rAd,输送含第二衣壳的rAd和输送含第三衣壳的rAd。本领域技术人员明白,有各种单独采用本发明Ad衣壳,联合采用,或与其它血清型Ad联用的方案。任选地,这种方案包括给予含其它非人灵长类动物腺病毒、人腺病毒衣壳或本文所述人造血清型衣壳的rAd。此给药方案的每一期包括给予一系列注射(或其它途径)一种Ad血清型衣壳,然后注射一系列另一Ad血清型衣壳。或者,本发明重组Ad载体可用于涉及其它非腺病毒介导的输送系统,包括其它病毒系统、非病毒输送系统、蛋白质、肽和其它生物活性分子。
以下章节集中描述可通过本发明腺病毒载体输送的示范性分子。
A.治疗分子Ad-介导的输送
一实施例中,按已发表的基因治疗方法给予人上述重组载体。可给予人的携带所选转基因的猿猴腺病毒载体,优选悬浮于生物相容性液体或药学上可接受的输送载体中。一种合适的载体包括灭菌盐水。本领域技术人员熟知的其它水性和非水性等渗灭菌注射液和水性与非水性灭菌悬药学上可接受的载体,可用于此目的。
给予足够量的猿猴腺病毒载体转导靶细胞,并提供足够水平的基因转移和表达,以年供治疗效益而无过多副作用,或医学上可接受的生理作用,这可由医疗领域技术人员来决定。常规的和药学上可接受的给药途径包括但不限于,直接输送给视网膜和其它眼内输药方法、输送给肝脏、吸入、鼻内、静脉内、肌肉内、气管内、皮下、皮内、直肠内、中服和其它非肠胃道给药途径。如需要,可联用给药途径,或根据转基因或病况调整。给药途径主要取决于所治疾病的性质。
病毒载体的剂量主要取决于所治疾病、患者年龄、体重和健康状况,患者之间可能不同。例如,成人或病兽此病毒载体的治疗有效量一般为100微升-100毫升含浓度约1×106-1×1015颗粒、约1×1011-1×1013颗粒、约1×109-1×1012颗粒病毒的载体。剂量范围取决于动物大小和给药途径。例如,肌内注射人和兽(约80kg)的适合剂量范围是一部位每毫升约1×109-5×1012颗粒。任选可多部位给药。另一实施例中,中服给药人和兽的适合剂量范围是一部位每毫升约1×1011-5×1015颗粒。本领域技术人员可根据给药途径确定这些剂量,此重组载体可用于治疗或疫苗应用。可监测转基因或免疫原的表达水平、特循环抗体水平,以给药的频率。本领域技术人员不难明白确定给药的时间和频率。
较优的给药方法包括给予该病毒载体的同时、之前、之后,给予合适量的短时作用免疫调节剂。所选的免疫调节剂本文定义为:能抑制针对本发明重组载体的中和性抗体产生的制剂,或能抑制可消除该载体的溶细胞性T淋巴细胞(CTL)的制剂。此免疫调节剂可干扰T辅助细胞亚组(Th1或Th2)和B细胞之间的相互反应,从而抑制中和抗体的形成。各种有用的免疫调节剂和其使用剂量已公开,见例如,Yang等,J.Viol.,70(9)(1996.9);1996.5.2公开的国际专利申请WO96/12406;和国际专利申请PCT/US96/03035,均纳入本文作参考。
1.治疗性转基因
此转基因编码的有用治疗性产品包括激素和生长分化因子,包括但不限于胰岛素、胰高血糖素、生长素(GH)、副甲状腺素(PTH)、生长激素释放因子(GRF)、促滤泡激gxi(FSH)、黄体生成素(LH)、人绒毛膜促性腺激素(hCG)、血管内皮生长激素(VEGF)、血管生成素、血管、粒细胞集落刺激因子(GCSF)、红细胞生成素(EPO)、结缔组织生长因子(CTGF)、碱性成纤维细胞生长因子(bFGF)、酸性成纤维细胞生长因子(aFGF)、表皮生长因子(EGF)、转化生长因子(TGF)、血小板生长因子(PDGF)、胰岛素生长因子I和II(IGF-I和IGF-II)、转化生长因子超家族之一包括TGF、活化素、抑制素、骨形态发生蛋白(BMP)之一BMP1-15、生长因子的调蛋白(heregluin)/神经调节蛋白/ASIA/神经分化因子(NDF)家族之一、神经生长因子(NGF)、脑衍生神经营养因子(BDNF)、神经营养素NT-3和NT-4/5、睫状神经营养因子(CNTF)、胶质细胞神经养顺子(GDNF)、neurturin、集聚蛋白、脑信号蛋白/脑衰蛋白家族之一、导蛋白-1和导蛋白-2、肝细胞生长因子(HGF)、肝配蛋白、头蛋白、sonic hedgehog和酪氨酸羟化酶。
其它有用的转基因产物包括能调节免疫系统的蛋白质,包括但不限于,细胞因子和淋巴因子,如血小板生成素(TPO)、白介素IL1-25(包括IL-2、IL-4、IL-12和IL-18)、单核细胞趋化蛋白、白血病抑制因子、粒细胞-巨噬细胞集落刺激因子、flk-2/flk3配体。本发明也采用免疫系统产生的基因产物,这包括但不限于,免疫球蛋白IgG、IgM、IgA、IgD和IgE、嵌合性免疫球蛋白、人源化抗体、单链抗体、T细胞受体、嵌合性T细胞受体、单链T细胞受体、MHCI类和II类分子、以及工程化免疫球蛋白和MHC分子。有用的基因产物还包括补体调节蛋白,如实补体调节蛋白、膜共因子蛋白(MCP)、衰变加速因子(DAF)、CR1\CF2和CD59。
有用的基因产物还包括激素、生长因子、细胞因子、淋巴因子、调节蛋白和免疫系统蛋白任何一种的受体。本发明的胆固醇调节受体包括低密度脂蛋白(LDL)受体、高密度脂蛋白(HDL)受体、极低密度脂蛋白(VLDL)受体和清除剂受体。本发明还包括基因产物,如类固醇激素受体超家族的成员,包括糖皮质激素受体和雌激素受体、维生素D受体和其它核受体。此外,有用的基因产物包括转录因子,如jun、fos、max、mad、血清反应因子(SRF)、AP-1、AP2、myb、MyoD、ZF5、NFAT、CREB、HNF-4、C/EBP、SP1、CCAAT-盒结合蛋白、干扰素调节因子(IRF-1)、Wilms肿瘤蛋白、ETS-结合蛋白、STAT、GATA-盒结合蛋白如GATA-3和翼状螺旋蛋白的forkhead家族。
其它有用的基因产物包括,氨甲酰基合成酶I、鸟氨酸转氨甲基酶、精氨琥珀酸合成酶、精氨琥珀酸裂解酶、精胺酶、富马酰乙酰乙酸氢化酶、苯丙氨酸羟化酶、α-抗胰蛋白酶、葡萄糖-6-磷酸酶、胆色素原脱氨酶、因子VIII、因子IX、胱硫胺β-合成酶、支链酮酸脱羧酶、白蛋白、异价辅酶A脱氢酶、丙酰辅酶A羧化酶、甲基丙二酸单酰辅酶A岐化酶、谷胺酰辅酶A脱氢酶、胰岛素、β-葡萄糖苷酶、丙酮酸羧化酶、磷酸化酶、磷酸化酶激酶、甘氨酸脱羧酶、H-蛋白、T-蛋白、囊性纤维化跨膜调节(CFTR)序列和营养不良蛋白cDNA序列。
其它有用的基因产物包括非天然存在多肽,如具有非天然氨基酸序列含有插入、缺失或氨基酸取代的嵌合性或杂交多肽。例如,单链工程化免疫球蛋白可用于某些免疫力低下患者。其它类型的非天然基因序列包括反义分子和催化性核酸,如能用于降低靶分子过度表达的核糖酶。
治疗以高度增殖细胞为特征的高增殖疾病,如癌症和牛皮癣特别需要降低和/或调节基因的表达。靶多肽包括与正常细胞相比在高增殖细胞中单独或以较高水平产生的多肽。靶抗原包括致癌基因,如myb、myc、fyn和转位基因如ber/abl、ras、src、P53、neu、trk和EGRF编码的多肽。除了癌基因产物可作为靶抗原外,抗癌治疗和保护性方案所用靶多肽包括B细胞淋巴瘤产生的抗体的可变区和T细胞淋巴瘤的T细胞受体的可变区,其在某些实施例中,也可用作身免疫病的靶抗原。可用作靶多肽的其它肿瘤相关多肽,例如肿瘤细胞中水平较高的多肽,包括单克隆抗体17-1A所识别的多肽和叶酸结合多肽。
其它适合用于治疗的多肽和蛋白质包括可用于治疗自身免疫疾病患者的多肽和蛋白质,机理是通过赋予对与自身免疫相关靶组织(包括细胞受体和产生自身抗体的细胞)广泛的保护性免疫应答。T细胞介导的自身免疫病包括类风湿关节炎(RA)、多发性硬皮病(MS)、斯耶格伦综合征、结节病、胰岛素依赖性糖尿病(IDDM)、自身免疫甲状腺炎、反应性关节炎、关节强硬性脊椎炎、硬皮病、多肌炎、皮肌炎、牛皮癣、脉管炎、韦格纳肉芽肿病、Crohn病和溃疡性结肠炎。这些疾病的每一种都以能结合内源性抗原并引起与自身免疫病相关的炎症性级联反应的T细胞受体(TCR)为特征。
本发明的猿猴腺病毒载体特别适合用于多次腺病毒输送转基因的治疗方案,如反复输送同一转基因的方案,或联合输送其它转基因的方案。这些方案包括给予Pan5、Pan6、Pan7、SV1、SV25或SV39猿猴腺病毒载体,随后再给予同一血清型腺病毒的载体。特别理想的方案包括给予本发明Pan5、Pan6、Pan7、SV1、SV25或SV39猿猴腺病毒载体,其中第一次给予的病毒载体血清型不同于后续一次或多次所用病毒载体的血清型。例如,治疗方案包括给予Pan5、Pan6、Pan7、SV1、SV25或SV39载体,并反复给予相同或不同血清型的一种或多种腺病毒载体。另一实施例中,治疗方宁包括给予一种腺病毒载体,随后反复给予本发明不同于第一次输送的腺病毒载体的Pan5、Pan6、Pan7、SV1、SV25或SV39载体,任选地还给予另一种相同的,或优选不同于前一次给药载体血清型的载体。这些方案不限于用本发明Pan5、Pan6、Pan7、SV1、SV25或SV39猿猴血清型载体构建的腺病毒载体。而且,这些方案可容易地采用其它血清型的载体,包括但不限于其它猿猴腺病毒血清型(如Pan9或C68,C1等),其它非人灵长动物腺现毒血清型或人腺病毒血清型,与本发明的Pan5、Pan6、Pan7、SV1、SV25或SV39载体联用。此说明书中讨论了这种猿猴、其它非人灵长动物主人腺病毒血清型。另外,这些治疗方案包括同时或依次输送本发明Pan5、Pan6、Pan7、SV1、SV25或SV39腺病毒载体,联用非腺病毒载体、非病毒载体和/或各种其它有用的治疗化合物或分子。本发明不限于这些治疗方案,本领域技术人员不维明白这些方案的不同。
B.Ad介导的免疫原性转基因输送
该重组猿猴腺病毒也可用作免疫原性组合物。本文的免疫组合物是将其输送给哺乳动物,优选灵长动物后,能对其输送的转基因产物产生体液(如抗体)或细胞(如细胞毒T细胞)免疫应答的组合物。本发明提供的重组猿猴Ad在其腺病毒序列中可含有编码所需免疫原基因的缺失。与人来源的腺病毒相比,这种猿猴腺病毒可能更适合作为不同种动物的重组活病毒疫苗,但不仅限于此用途。该重组腺病毒可作为预防或治疗疫苗,所针对的病原是其抗原对诱导免疫应答是决定性的并能限制该病原(已得到鉴定并且其cDNA可获得)传播。
如上所述以适当的输送载体配罅这种疫苗(或免疫原性)组合物。通常,该免疫原性组合物的剂量范围如以上治疗性组合物。可监测所选基因产物的免疫力水平以确定是否需要加强免疫。评估血清抗体效价后可能需要加强免疫。
任选地,配制本发明的疫苗组合物含有其它组合物,包括佐剂、稳定剂、PH调节剂、防腐剂等。这些组合物是疫苗领域技术人员所熟知的。适当的佐剂包括但不限于脂质体、铝盐、单磷酰脂质A和生物活性因子如细胞因子、白介素、趋化因子、配体及它们的优化组合。这些生物活性因子的某些可通过质粒或病毒在体内表达。例如,与只用编码某抗原的DNA疫苗首次免疫产生的免疫应答相比,给予佐剂和编码该抗原的首次DNA疫苗免疫,提高了抗原特异性免疫应答。
以“免疫原性剂量”给予该重组腺病毒,即重组腺病毒的剂量给药后能有效转染所需细胞并导致所选基因足够水平的表达而引发免疫应答。当提供保护性免疫力时,该重组腺病毒可作为疫苗组合物用于预防传染和/或复发疾病。
或者,或此外,本发明的载体可含有编码能引发对所选免疫原产生免疫应答的肽、多肽或蛋白质的转基因。预计本发明的重组腺病毒在诱导针对插入的该载体表达的异源抗原蛋白的溶细胞性T细胞和抗体就答中高度有效。
例如,可从各种病毒科选择免疫原。可产生理想免疫应答的理想病毒科的例子,包括小RNA病毒科,其包括引起普通感冒50%病例的鼻病毒属;肠病毒属包括脊髓灰质炎病毒、柯萨奇病毒、艾柯病毒;和人肠道病毒如甲肝病毒;和主要在非人动物中引起口足疾病的口疮病毒(apthovirus)属。小RNA病毒科中靶抗原包括VP1、VP2、VP3、VP4和VPG。另一病毒科包括calcivirus科,其包括流行性胃肠炎重要致病因子的Norwalk病毒群。其它用于靶向抗原诱导人和非人动物产生免疫应答的病毒科是披膜病毒科,其包括α-病毒属新培斯病毒、罗斯河病毒、委内瑞那西方和东方马脑炎病毒以及风疹病毒。黄病毒科包括登革热、黄热病、日本脑炎、圣露易斯脑炎和蜱传脑炎病毒。其它靶抗原可产生自丙肝病毒科或冠状病毒科,包括非人病毒如传染性支气管炎病毒(家禽)、猪传播胃肠炎病毒(猪)、猪血凝性脑脊髓炎病毒(猪)、猪传染性腹膜炎病毒(猫)、猪小肠冠状病毒(猫)、犬冠状病毒(狗)和引起普通感冒和/或非甲非乙或丙肝的人呼吸道冠状病毒。冠状病毒科中,靶抗原包括E1(也称为M或基质蛋白)、E2(也称为S或Spike蛋白)、E3(也称为HE或血凝素-elterose)、糖蛋白(不存在于所有冠状病毒中)或N(核衣壳)。其它抗原可靶向弹状病毒科,包括水泡病毒属(如水泡性口炎病毒)和狂犬病毒属(如狂犬病毒)。弹状病毒科中合适的抗原可衍生自G蛋白或N蛋白。线状病毒科包括出血热病毒如马堡和埃博拉病毒可能是适合的抗原来源。副粘病毒科包括I型副流感病毒、3型副流感病毒、牛3型副流感病毒、风疹病毒(腮腺炎病毒)、2型副流感病毒、4型副流感病毒、新城堡病毒(鸡)、引起麻疹和犬瘟病的牛瘟麻疹病毒和肺病毒包括呼吸道合胞病毒。流感病毒分类属于正粘病毒中合适的抗原来源(如HA蛋白、N1蛋白)。布尼亚病毒科包括布尼亚病毒属(加利福尼亚脑炎,La Crosse)、白蛉热病毒(里夫特裂谷热)、汉坦病毒(出血热病毒)、内罗必病毒(内罗必绵羊病)和各种未命名的bungaviruses。沙拉病毒科提供了LCM和拉沙热病毒的抗原来源。呼肠弧病毒科包括呼肠弧病毒属、轮状病毒(引起儿童急性胃肠炎)、环状病毒和cultivirus(科洛拉多蜱传热,Lebombo(人)、马退行性脑病、兰舌症)。
逆转录病毒科,包括人类和兽类疾病如猪白血病病毒、HTLVI、HTLVII、慢病毒属(包括人免疫缺陷病毒(HIV)、猿猴免疫缺陷病毒(SIV)、猪免疫缺陷病毒(FIV)、马传染性贫血病毒和泡沫病毒)等致肿瘤病毒亚科。慢病毒科中已报道了许多合适的抗原,不难选择。合适的HIV和SIV抗原例子包括但不限于:gag、pol、Vif、Vpx、VPR、Env、Tat、Nef和Rev蛋白,以及它们的各种片段。例如,合适的Env蛋白片段可包括如gp120、gp160、gp41等亚单位,或它们更小的至少长8个氨基酸的片段。类似地,可选择Tat蛋白的片段。(见美国专利5,891,994和6,193,981)。HIV和SIV蛋白还可见D.H.Barouch等,J.Viol.,75(5):2462-2467(2001年3月)和R.R.Amara等,Science.,292:69-74(2001.4.6)。另一实施例中,可用HIV和/或SIV免疫原性蛋白主产生融合蛋白或其它免疫原性分子。见例如,2001.8.2公布的WO01/54719和1999.4.8公布的WO99/16884中所述的HIV-1Tat和/或Nef融合蛋白和免疫方案。本发明不限于本文所述HIV和/或SIV免疫原性蛋白或肽。此外,已报道了这些蛋白的各种修饰,本领域技术人员不难制备。见例如,美国专利5,972,596所述的修饰的gag蛋白。另外,可单独或联合输送理想的HIV和/或SIV免疫原。这种联合可包括从一个载体或从多个载体表达。任选地,另一种组合可包括输送一种或多种表达的免疫和输送蛋白形式的一种或多种免疫原。下面将更详细讨论这种组合。
乳多空病毒科包括多瘤病毒亚科(BKU和JCU病毒)和乳头瘤病毒亚科(与癌症或乳头瘤恶变有关)。腺病毒亚科包括引起呼吸道和/或肠道疾病的病毒。细小病毒科包括猫细小病毒(猫肠炎)、猫传染性粒细胞减少症病毒、犬细小病毒和猪细小病毒。疱疹病毒科包括α-疱疹病毒亚科单纯疱疹病毒(HSVI,HSVII)属、水痘病毒属(假性狂犬病、水痘带状疱疹)和β-疱疹病毒亚科巨细胞病毒(HCMV,muromegalovirus)属和γ-疱疹病毒亚科淋巴隐病毒属、EBV(Burkitts淋巴瘤)、传染性鼻气管炎、Marek病病毒和鼻病毒。痘病毒科包括脊髓动物痘病毒亚科正痘病毒属(天花(Smallpox)和牛痘(Cowpox))、副痘病毒、禽痘病毒、羊痘病毒、兔痘病毒、猪痘病毒和虫媒痘病毒亚科。肝DNA病毒科包括乙肝病毒。查能是抗原适当来源的一种未分类病毒是δ-肝炎病毒。其它病毒来源包括禽传染性法氏囊病病毒和猪呼吸道生殖道综合征病毒。α-病毒属包括马动脉炎病毒和各种脑炎病毒。
本发明还包括可用于免疫人类和非人类动物的免疫原,以抵抗包括细菌、真菌、寄生性微生物或多细胞寄生虫(感染人和非人脊椎动物或形成癌细胞或肿瘤细胞)的其它病原。
病原菌包括致病性革兰阳性球菌,有肺炎球菌、葡萄球菌和链球菌。致病性革兰阴性球菌包括脑膜炎球菌,淋球菌。致病性肠道革兰阴性杆菌包括肠杆菌科、假单胞菌属、不动杆菌属和埃肯菌属、类鼻疽菌、沙门菌、志贺菌、嗜血杆菌属、莫拉菌属、杜克雷嗜血杆菌属(引起软下疳)、布鲁菌属、土拉热弗拉西丝氏菌(引起兔热病)、耶尔森菌科(巴斯德菌属)、链杆菌属、念珠棘虫属和螺菌属;阳性杆菌包括李斯特单核细胞增多症菌、猪红斑丹毒丝菌、白喉棒状杆菌属(白喉)、霍乱弧菌、炭疽杆菌(炭疽)、杜诺凡菌病菌(腹股沟肉芽肿)和巴尔通体病菌。致病性厌氧菌引起的疾病包括破伤风、肉毒、其它梭菌属病、结核病、麻风和其它分枝杆菌病。致病性螺旋体疾病包括梅毒、密螺旋体病、雅司、品他病和地方流行性梅毒及钩端螺施体病。高等病原菌和病原性真菌引起的其它感染疾病包括放线菌病、诺卡放线菌病、隐球菌病、芽生菌病、组织胞浆菌病、球孢子菌病、念珠菌病、曲霉菌病、毛霉菌病、孢子丝菌病、副球孢子菌病、petrielliiosis、球拟酵母菌病、足分枝菌病、着色芽生菌病和皮肤真菌病。立克氏体感染包括斑疹伤寒热、洛基山斑疹热、Q热和立克氏体痘。支原体和衣原体感染的例子包括支原体肺炎、性病性淋巴肉芽肿、鹦鹉热和围生期衣原体感染。致病性真核细胞包括致病性原虫和蠕虫,其产生的感染包括阿米巴虫病、疟疾、利什曼虫病、锥虫病、弓形虫病、卡氏肺囊虫病、Trichans、鼠弓形虫病巴贝虫病、梨形鞭毛虫病、旋毛虫病、丝虫病、血吸虫病、线虫病、吸虫病和绦虫感染。
疾病控制中心(CDC,美国卫生和人类服务部)已鉴定了可用于生物攻击潜在因子的许多这样的微生物和/或它们产生的毒素。例如,一些这样的生物因子包括:炭疽杆菌(炭疽)、梭状肉毒菌(肉毒)及其毒素;鼠耶尔森菌属(鼠疫)、天花、土拉热弗朗西丝菌(土拉热)、病毒性出血热(纤丝病毒属,如埃博拉、马堡病毒)和沙粒病毒(如拉沙热,Machupo),以上这些病原目前发类为A类因子:伯纳特柯克斯立克氏体(Q热)、布鲁菌(波状热)、鼻疽菌(马鼻疽)、假鼻疽菌、蓖麻(Ricinus communis)及其毒素(蓖麻毒素)、产气荚膜梭菌及其毒素(ε毒素)、葡萄球菌及其毒素(肠毒素B)、鹦鹉热衣原体(鹦鹉热)、水安全性威胁(如霍乱弧菌、细小隐孢子菌)、斑疹伤寒(普氏立克氏体)和病毒性脑炎(如α-病毒如委内端拉马脑炎、东方马脑炎、西方马脑炎;所有这些目前分类为B类病原;和日本病毒与汉坦病毒,其目前分类为C类病毒。此外,可鉴定这样分类或不同分类的其它生物并用于将来这种目的。不难理解本文所述的病毒载体和其它构建物可用于输送这些生物、病毒、其毒素或副产物的抗原,来治疗和预防这些生物因子的感染或其它不良反应。
给予本发明的载体来输送T细胞可变区的免疫原,以诱导针对包括CTL的免疫应答来消灭这些T细胞。在RA中,已特征鉴定到几个参与此病的TCR特定可变区。这些TCR包括V-3、V-14、V-17和Vα-17。因此输送编码至少这些多肽之一的核酸序列将诱导针对参与RA的靶T细胞的免疫应答。在MS中,已鉴定到几个参与此病的TCR特定可变区。这些TCR区包括V-7和Vα-10。因此输送编码至少这些多肽之一的核酸序列将诱导针对参与MS的靶T细胞的免疫应答。硬皮病中,已特征鉴定到几个参与此病的TCR特定可变区。这些TCR包括V-6、V-8、V-14、Vα-16、Vα-3C、Vα-7、Vα-14、Vα-15、Vα-16、Vα-28和Vα-12。因此输送编码至少这些多肽之一的重组腺病毒将诱导针对参与硬皮病的靶T细胞的免疫应答。
C.Ad介导的输送方法
可监测所选基因的治疗水平或免疫力水平以确定是否需要加强免疫评估CD8+T细胞应答后,或任选的血清抗体效价后,可能需要任选地进行加强免疫。任选地,单独给予或以各种联合方案,如与涉及其它活性成分的治疗方案或程序,或初免-加强方案联合,输送本发明的重组猿猴腺病毒。本专业已描述了各种这样的方案,可不难选择。
例如,初免-加强方案可能包括给予DNA载体(如质粒)初步激发免疫系统,第二次加强免疫给予传统抗原台蛋白质可携带编码该抗原序列的重组病毒。见例如,纳入参考文的2000.5.2发表的WO 00/11140。或者,一免疫接种方案包括给予本发明的重组猿猴腺病毒载体以加强对携带抗原或蛋白的载体的免疫应答。另外,一免疫接种方案包括给予蛋白质然后编码该抗原的载体加强。
一实施例中,本发明通过输送载有所选抗原的质粒DNA载体作初次免疫,然后用本发明的重组猿猴腺病毒载体加强诱导对该抗原的初次和加强免疫应答。一实施例中,该初免-加强方案包括表达初免和/或加强载体携带的多种蛋白质。见例如,R.R.Amara,Science,292:69-74(2001.4.6)所述用于产生对HIV、SIV免疫应答而表达蛋白亚单位的多种蛋白方案。例如,DNA初免可输送一个转录物的Gag、Pol、Vif、VPX、Vpr和Env、Tat和Rev。或者,可在本发明重组腺病毒构建物中输送SIV Gag、Pol和HIV-1 Env。其它方案见WO 99/16884和WO 01/54719。
然而,初免-加强方案不仅限于HIV免疫或输送这些抗原。例如,初免可包括输送要发明第一种黑猩猩载体,然后用第二种黑猩猩载体或用含蛋白形式的该抗原组合物加强。一实施例中,该初免-加强方案可提供对产生该抗原的病毒、细菌或其它生物的保护性应答。另一理想的实施例中,该初免-加强方案提供的治疗效果可用常规试验检测,检测是否还存在给药所治疗的疾病。
可在机体各种部位以剂量依赖方式给予初免组合物,这取决于诱导所需免疫应答的目标抗原。本发明不限于上述注射量或部位或药物载体。而且,该方案可包括初免和/或另强步骤,每一步骤可包括一次剂量给药,每小时、每天、每周、每月或每年一次。例如,哺乳动物可接受含载体中约10-50μg的质粒一剂或二剂。所需DNA组合物的剂量范围约1-10,000μg DNA载体。剂量可根据患者体重不同每公斤约1-1000μg DNA。输注量或部位根据哺乳动物种类和病况按需选择。
本文描述了输送抗原给哺乳动物的载体的合适的剂量单位。通过将载体悬浮或溶于药学上或生理上可接受的载体如等渗盐水、等渗盐溶液可其域它本领域技术人 员知道和制剂中制备该载体用于给药。合适的载体是本领域技术人员知道的,主要取决于给药途径。按上述途径给予哺乳动物本发明的组合物,采用生物可降解性和生物相容性聚合物缓释剂型,或用微胶团、凝胶和脂质体定位输送。任选地,本发明的初免步骤还包括给予初免组合物和适当量的本文定义的佐剂。
优选在给予哺乳动物受试者初免组合物后2-27周给予加强组合物。采用有效剂量的含有能够输送与初免DNA疫苗相同抗原的加强组合物,进行加强组合物的给药。此加强组合物可含衍生自同一病毒来源(如本发明的腺病毒序列)的可另一来源的重组病毒载体。可者,该“加强组合物”可以是含有初免DNA疫苗编码的但为蛋白或多肽形式的同一抗原的组合物,此组合物可在宿主中诱导免疫应答。另一实施例中,该加强组合物含有编码此抗原的DNA序列,该序列在调控序列的控制下指导其在哺乳动物中的表达,如众所周知的细菌载体或病毒载体。此加强组合物的主要要求是,其抗原应是初免组合物所编码的同一抗原,或交叉反应抗原。
另一实施例中,本发明的猿猴腺病毒载体也适合用于各种其它免疫和治疗方案。此方案可包括同时或依次输送本发明猿猴腺病毒载体和不同血清型衣壳的Ad载体。此方案中,同时或依次输送本发明的腺病毒载体和非Ad载体。此方案中,同时或依次输送本发明的腺病毒载体和蛋白质、肽和/或其它生物学有用的治疗或免疫原性组合物。本领域技术人员不难懂得这些用途。
以下实施例将阐明猿猴腺病毒的克隆和本发明示范性的重组腺病载体。这些实施例只是为了说明而非限制本发明的范围。
实施例1-病毒增殖
Pan5(ATCC登录号VR-591)、Pan6(ATCC登录号VR-592)、Pan7(ATCC登录号VR-593)病毒最初分离自黑猩猩淋巴结,在293细胞(ATCCCRL1573)中增殖。通常,这些细胞培养在含10%胎牛血清(FCS)(Sigma或Hyclone,Logan,UT)和1%青、链霉素(Sigma)的Dulbecco改进的Eagle培养基(DMEM,Sigma,St.Louis,MO)中。在含2%FCS的DMEM中培养24小时进行293细胞的感染。当100%细胞显示出病毒诱导的细胞病变作用(CPE)时离心收集并浓缩感染细胞。在10mM Tris(PH8.0)中重悬细胞团并经三轮冻融裂解。经二次氯化铯密度梯度超离心后获得现毒制品,将病毒贮存液用10mM Tris/100mM NaCl/50%甘油稀释至1-5×1012颗粒/ml,并-70℃保存。
293细胞增殖这些腺病毒的能力出乎依据其它非人腺病毒血清型作出的预计。
病毒 产量(8×108细胞产生的病毒颗粒数)
Pan5 8.8×1013
Pan6 1.6×1014
Pan7 8.8×1013
实施例2-病毒基因组DNA的特征
分离实施例1纯化病毒制品的基因组DNA,按制造商说明用HindIII或BamH1限制性酶消化。结果(未显示)表明本发明的Pan5、Pan6、Pan7基因组和发表的Pan9(C68)基因组显示了不同的限制性酶切模式,表明彼此不同。
测定了Pan5、Pan6和Pan7的核苷酸序列。SEQ ID NO:1中报告了Pan5 DNA上链的核苷酸序列。SEQ ID NO:5中报告了Pan6 DNA上链的核苷酸序列。SEQ ID NO:9中报告了Pan57DNA上链的核苷酸序列。
用上述常设置的”Clustal W”程序,测定与已知腺病毒序列的同源性鉴定了此病毒DNA序列中的调控和编码区。见上表提供的腺病毒序列。翻译开放读码框并与前已报导的腺病毒蛋白序列Ad4、Ad5、Ad7、Ad12主Ad40比较同源性,检验预测的氨基酸序列。
此序列的分析表明基因组的结构类似于人腺病毒,与人Ad4最相似。然而注意到黑猩猩腺病毒和其它已知腺病毒,包括AdHu4之间在六邻体超变区中存在实质性差异。这些差异与已获得的血清交叉反应数据很相符(见下)。
图1显示六邻体一部分序列的排列对比。所示部分是六邻体相应于向外倾延伸环DE1和FG1的区域,此地发现为血清型之间变异最大部分。也存在对六邻体(相应于发表的AdC68序列的残基308-368,见美国专利6,083,716)的基础有贡献的介入部分和血清型之间高保守部分。下表小结六邻体蛋白中氨基酸的配对比较: 比较 六邻体氨基酸的相似性(%) #1 #2 AdC5 AdC7 99.0 AdC5 AdC68 98.3 AdC5 AdC6 88.0 AdC5 AdC1 84.9 AdC6 AdC7 87.7 AdC6 AdC68 87.3 AdC6 AdC1 84.9 AdC7 AdC68 97.5 AdC7 AdC1 84.8 AdC8 AdC1 84.9
分析了黑猩猩腺病毒的纤维瘤区(负责受体结合)显示结构上整体类似(图2)。
人Ad5和C68的E1蛋白之间(见下表)序列同一性程度与人Ad5和Pan5、Pan6和Pan7之间相似。 比较 六邻体氨基酸的相似性(%) #1 #2 AdHu5 AdC5 36.6 AdHu5 AdC6 28.5 AdHu5 AdC7 34.9 AdHu5 AdC68 35.6 AdHu5 AdC1 35.6 AdC5 AdC6 68.3 AdC5 AdC7 96.9 AdC5 AdC68 80.4 AdC5 AdC1 51.3 AdC6 AdC7 69.3 AdC6 AdC68 59.4 AdC6 AdC1 37.7 AdC7 AdC68 81.5 AdC7 AdC1 51.0 AdC68 AdC1 54.9 与人Ad5的序列同一性 E1b小T蛋白 E1b大T蛋白 C68 47.3% 55.8% Pan-5 43.2% 54.5% Pan-6 45.3% 54.5% Pan-7 46.4% 53.8%
用以下实施例所述分子克隆方法产生了复制缺陷性AdC5、AdC6和AdC7,它产中间的小基因盒被插入到E1a和E1b基因位置。挽救该重组病毒的分子克隆并培养在293细胞中作大规模纯化,采用已发表的CsCI半沉淀方法(K.Fisher等,J.Virol.,70:520,1996)。将50块平板约1×109个293细胞产生的载体用相应的病毒感染。分光光度法测定病毒颗粒浓度确定产量。构建E1缺陷载体后,决定用HEK293细胞(表达人腺病互血清型5E1功能)反式互补该新型病毒载体的E1缺失,以产生高效价贮存液。一些这种重组病毒的病毒产量举例见下表。
通过巨细胞病毒启动子表达了这些载体、β-半乳糖苷酶(LacZ)、绿荧光蛋白(GFP)、α-1-抗胰蛋白(AIAT)、埃博拉糖蛋白(ebo)、缺少跨膜和胞浆结构域的埃博拉可溶性糖蛋白变体(sEbo)和埃博拉糖蛋白的三个缺失突变(Ebo△2、Ebo△3和Ebo△4)的转基因。下一中ND表示该项研究没做。转基因病毒骨架/载体产量(病毒颗粒×1013)AdHu5 AdC7 AdC68 ADC6 CMVLacZ 1.5 1.4 3.3 6.1 CMVGFP 2.5 3.6 8 10 CMVAIAT 3.7 6 10 ND CMVEbo 1.1 4.3 ND ND CMVsEbo 4.9 5.4 ND ND CMVEbo△2 1 9.3 ND ND CMVEbo△3 0.8 9.5 ND ND CMVEbo△4 1.4 6.2 ND ND
人腺病毒E1反式互补本发明E1缺失黑猩猩病毒有很大优点,因为它得以产生本发明的E1缺失黑猩猩腺病毒,同时减少或消除了同源重组的风险,因为如本文所述人Ad和黑猩猩腺病毒之间存在序列差异。
实施例3-Pan5、6和7病毒的血清学研究
因为六邻体超变区中的差异,预测C5、C6和C7病毒在血清学上与人腺病毒包括AdHu4不同。
1.野生型病毒的交叉反应性
为了筛选野生型病毒以测定抗体交叉反应,采用了复制活性病毒并测定了对细胞病变作用(CPE)的抑制。简言之,将贮存的5×1012颗粒/ml的腺病毒制品(Adhu5、Pan-5、Pan-6、Pan-7和Pan-68)作1/600稀释用于试验。选此病毒浓度是因为无中和时48小时内它可产生100%CPE。将病毒加到293细胞(4×104细胞/孔,96孔板)前,加入1∶20稀释血清。读取存在与不存在CPE时的试验数据,完全中和为无细胞病变。结果小结于下表中。事实上9/36份人血清可中和Adhu5诱导的CPE,这与先前对人群中和抗体的估计相一致。表中数字表明有中和作用的个体数(分子)和受筛检总数(分母),ND为未测。
用1∶20稀释的血清中和
人 猴 黑猩猩
(N=36) (N=52) (N=20)
Adhu5 9/36 ND ND
AdC68 1/36 0/52 12/20
Pan5 0/36 0/52 10/20
Pan6 0/36 0/52 9/20
Pan7 0/36 0/52 12/20
筛检所有人血清中35/36为AdC68中和阴性,而36/36为Pan-5、Pan-6和Pan-7中和阴性;筛检的52只猴中没有一只显示可中和黑猩猩腺病毒;优选这些猴子作为评价HIV疫苗的临床前模型。20只黑猩猩有9-12只可基本中和黑猩猩的一种或另一种腺病毒,这与它们确实是流行性黑猩猩特定病原体的事实相符。令人囊兴趣的是黑猩猩的中和抗体只针对Pan-5、Pan-6或AdC68,从而支持以下假设:几种这些黑猩猩腺病毒载体彼此无中和作用,在血清型上不同。
对20只黑猩猩血清样品进行了相同试验。50%样品血清学上有反应,对Pan-5反应程度不同;对Pan-6为40%;对pan-7为55%;对C68为60%。阳性血清样品中,一只对所有四种黑猩猩病毒有强中和活性。
2.与重组病毒的交叉中和作用
获得各猿猴腺病毒的高效价我克隆抗体以更精确地估计不同血清型间交叉中和的程度。用含前述C68黑猩猩腺病毒作为辅佐的GFP重组病毒肌内免疫接种家兔。然后测定血清对本发明三种黑猩猩腺病毒AdC5、AdC6和AdC7每一种的中和活性。给家兔每公斤肌内注射5×1012病毒颗粒的C68CMVGFP质粒,五周后用相同剂量加强免疫。9周时间点内收集的血液显示对C68及Pan-5和Pan-7而非Pan-6有极强的中和活性(见下表),表明给予C68(或Pan-5和Pan-7)疫苗,然后用Pan-6载体加强可能有效。然而,已现此内在相关性水平不一定能用再给予来防止,抗病毒抗体效价不如此家兔达到的那样高。下表中,+表示33%CPE;++表示66%CPE;+++表示100%CPE。用以下病毒感染293细胞Pan-5Pan-6Pan-7Pan-9(C68)C68 GFP血清稀释度 -+++---1/20 -+++---1/40-+++---1/80-+++---1/160-+++---1/320-+++---1/640-+++---1/1,280-+++---1/2,560-+++---1/5,120++++---1/10,240++++++--1/20,480++++++++--1/40,960++++++++++1/81,920+++++++++++++1/163,840+++++++++++++++1/327,680+++++++++++++++1/665,360+++++++++++++++1/1,320,720+++++++++++++++1/2,621,440
3.检测中和性抗体的定量试验
用一种更具定量性的试验检测依据GFP载体转导产生的中和抗体来验证以上结果。简言之,用5×1010颗粒/ml的Pan-5、Pan-6、Pan-7和C68肌内或静脉内免疫C57BL/6小鼠。测定第28天血清1/20和1/80稀释度对C68CMVGFP的交叉中和活性。结果,当测定人免疫球蛋白药物制品对Pan5、6和7,及C68的血清反应性时,检测到一些低水平的抗Pan7和C68中和活性。36份人血清样品作了相同测试,以1/20稀释度测定血清样品。结果表明只有一个人具有对C68的明显中和活性。未测到对Pan5、Pan6或Pan7的中和活性。
4.体外交叉中和测定
测定了针对腺病毒Pan-5、Pan-6、Pan-7和C68各自高效价兔多克隆抗体对猿猴腺病毒的交叉中和作用。能
肌内注射1013全颗粒的每种黑猩猩腺病毒免疫家兔,40天后用同样剂量加弗氏不完全佐剂加强。将系列倍比稀释的血清液与表达GFP的各相应黑猩猩腺病毒109基因组拷贝一起培育,并当加入到293细胞时测定GFP表达的减弱,来分析血清中是否存在中和抗体。记录能导致GFP表达减少50%的血清稀释度,作为抗该具体病毒的中和抗体效价。
结果见表中所示。此数据与对六邻体氨基酸序列分析所得预测相符,表明与黑猩猩其它腺病毒相比,Ad Pan-6可能是最常见的血清型。用以下109基因组拷贝感染293细胞用以下免疫家兔得到的血清Ad Pan-5Ad Pan-6Ad Pan-7ADC68Ad Pan-5 Ad Pan-6 Ad Pan-7 AdC681/5120 <1/20 1/2560 1/2560无中和作用 1/20,480 <1/20 <1/201/2560 1/60 1/63,840 1/2560无中和作用 <1/20 <1/20 1/5120
为了确定以与猿猴腺病毒起交叉反应的抗体是否可能在人中低流行,测定了猿猴腺病毒SV1、SV39和SV25当与商品化合并的人免疫球蛋白一起培育时能否能抵抗中和作用。也用Adhu5和黑猩猩腺病毒Pan-5、Pan-6、Pan-7和C68进行了同一试验。进一步研究中,测定了用黑猩猩腺病毒C5、C6、C7和C68之一免疫小鼠的血清交叉中和猿猴腺病毒SV-15、SV-23、SA-17和狒狒腺病毒的能力。任何情况下没发现交叉反应。
实施例4-重组E1-缺失Pan5载体的产生
用定点诱变破坏pX(Clontech)的bla基因区中的FspI位点制备了修饰的pX质粒。该修饰质粒称为pX’,是一种3000bp的环形质粒,含f1起始序列和氨苄表霉素抗性基因(AmpR-cds)。
A.Pan-5腺病毒质粒的产生
产生能将Pan5DNA片段依次克隆入pX’的多聚接头。用MluI和EcoRI消化后,此多聚接头替代了原来的pX’多聚接头。将Pan5的钝端FseI片段插入多聚接头的Smal和FseI位点。此片段含腺病毒基因组(bp 1-3606,SEQ ID NO:1)的5’端。用侧接pShuttle(Clontech)的I-Ceu和PI-Sce位点的一短序列置换Pan5(bp455-3484,SEQ ID NO:1)的SnaBI-FspI片段,以消除所述腺病毒基因组中的E1区。将EcoRI-平头片段(bp28656-36462,SEQ ID NO:1)插入该多聚接头的EcoRI和EcoRV位点(提供所述腺病毒基因组的3’端,将FseI-MlnI片段(bp3606-15135,SEQ IDNO:1)插入此多聚接头中,并将MluI-EcoRI片段插入此多聚接头(bp15135-28658,SEQ ID NO:1)。任选地,将一需要的转基因插入该新产生的pX’Pan5△E1载体的I-CeuI和PI-SceI位点。
B.产生pX’Pan5△E1的另一方法
如上所述从Clontech获得衍生自pAdX腺病毒质粒的初始质粒pX。然后,缺失掉pX’的PacI-XhoI区并将该平头Pan5多聚接头插入FspI位点产生pX’PLNK(2994bp)。将Pan5r的5’端FseI区(bp 1-3607,SEQ ID NO:1)插入pX’LNK的SmaI和FseI位点,产生pX’Pan5-5’质粒(6591 bp)。切下pX’Pan5-5’的SnaBi-NdeI区并用Ceu/Sce盒代替,经PCR扩增从pRCS产生pX’Pan5-5’△E1(4374 bp)。简言之,PCR扩增pRCS(3113 bp)的含I-CeuI和PI-SceI罕见刻纹头位点的序列。此3’PCR引物将NdeI位点引入该PCR产物中。
为延伸pX’Pan5 △E1(4374 bp)中的Pan5DNA,加入Pan5的FseI-MluI区(bp3607-15135,SEQ ID NO:1),产生pX’Pan5-5’Mlu(15900)。将Pan5序列的残留MluI-3’端(bp 15135-36462,SEQ ID NO:1)加到该载体多聚接头的MluI和EcoRV位点之间,形成pX’Pan5△E1,其含E1区缺失的全长Pan5序列。
C.重组病毒的产生
为了从pX’Pan5△E1产生重组腺病毒,将质粒与辅助病毒表达的E1共转染,或从E1-表达包装细胞系,如293细胸系或上述制备的细胞系产生。包装细胞中的E1表达得以复制Pan5△E1并将其包装入病毒粒子衣壳中。另一实施例中,已被pX’Pan5△E1转染的包装细胞用上述带有感兴趣转基因的腺病毒载体转染。在辅助病毒与质粒之间发生同源重组,使得载体中的腺病毒转基因被置换并包装入病毒粒子衣壳中,产生该重组腺病毒。
转染后琼脂覆盖2周,病毒形成空斑,扩大,筛检转基因的表达。再经几轮空斑纯化,然后再扩增培养物。最后收获细胞,制备病毒提取物,含所需转基因的该重组黑猩猩腺病毒用CsCi梯度浮力密度超离心纯化,或用本领域技术人员已知的其它方法纯化。
实施例5-重组E1缺失Pan6载体的产生
A.构建Pan-6腺病毒质粒的方法
1.未端片段的克隆
用链霉蛋白酶和蛋白酶K处理及酚抽提除去Pan-6病毒的蛋白质。如Berkner和Sharp,NucleiCAcids Rasearch,11:6003,1983所述,将合成的12bp Pme I接头连接于此病毒DNA。然后用XbaI消化此病毒DNA分离得到5’未端片段(6043bp)。将Ad6XbaI5’片段连接在SmaI和XbaI位点处的pX接头,形成pX-Ad Pan6-0-16.5。带PmeI接头的病毒DNA也用PacI消化,分离得到6475 bp的3’端片段并克隆入连接在PacI和SmaI位点的pX中,得到pXAd Pan6-82-100。
2.缺失5’克隆的E1
为了缺失掉E1(m.u.1.2-9),用跨越经BsiWi和XbaI处理的m.u.9-16.7片段的PCR片段代替pX-Ad Pan6-0-16.5中的BsiWi-XbaI片段,产生pX-Ad-Pan6m.u.0-1,9-16.5。
3.融合5’和3’克隆并产生一锚着位点以接受中等大小HindIII片段
首先,将Pan6基因组的2ndXbaI片段(4350 bp,m.u.16.5-28)插入pX-Ad-Pan6m.u.0-1,9-16.5中的XbaI位点,进一步扩大5’克隆pX-Ad-Pan6m.n.0-1,9-16.5。此构建物命名为pXAd-Pan6-mu 0-1,9-28。
其次,将覆盖Pan6基因组m.u.41-82的15026 bp MluI/PacI片段插入pXAdPan6-82-100的MluI/PacI位点,产生pXAd Pan6-m.u.41-100。
然后,分离pXAd-Pan6-mn 0-1,9-28的8167 bp HindIII/Eco 47III Pan6片段,并亚克隆入HindIII和XbaI平头位点处的pXAd Pan6-m.u.41-100。此5’和3’融合克隆称为pXAd Pan6mu0-1,9-19.5,64-100。
4.该基因组的中等大小片段插入此融合克隆
将Pan6的16335 bp HindIII片段(m.u.19.5-64)插入pXAd Pan6mu0-1,9-19.5,64-100的HindIII位点,形成pXAd Pan6-0-1,9-100。
5.将PKGFP可选择标记引入最终构建物中,指导感兴趣基因的克隆和重组转化物的绿/白选择。
用SapI和DrAIII消化然后经补平反应,分离pShuttle-pKGFP(bare)的表达GFP的小基因盒,此盒在Lac启动子控制下并侧接编码限制酶PI-SceI和I-CeuI的罕见内含子的识别位点。pShuttle-pKGFP(bare)质粒长4126 bp,含ColE1-ori、卡那霉抗性基因、plac、lacZ启动子-GFPmut3-1 cds(Clontech)。将此盒亚克隆入SrfI cut和平头pXAd Pan6-0-1,9-100中。该最终构建物称为pX-Pan6-pKGFPm.u.0-1,9-100,可通过直接和绿/白选择与pShuttlepKGFP基因载体结合,用于产生携带感兴趣基因的E1缺失重组pan6分子克隆。
B.产生Pan-6质粒的另一种方法
1.5’未端片段的克隆
如上所述用链霉蛋白酶和蛋白酶K处理及酚抽提除去Pan-6病毒的蛋白质,并如所述将合成的12bp Pme I接头连接于此病毒DNA。分离Ad Pan5’XbaI片段并连接入pX,形成A部分所述的pX-Ad Pan6-0-16.5(9022 bp)。
2.缺失5’克隆的E1
为了缺失E1(m.u.1.2-9),用SnaBI和NdeI消化pX-Ad Pan6-0-16.5,除去编码E1a和E1b蛋白的区域(3442-6310 bp)。然后有BriWI消化制品中的此载体,用携带可选择标记的小基因盒补平。
3.引入可选择标记
如上所述分离pShuttle-pKGFP(bare)的表达GFP的小基因盒,此盒在Lac启动子控制下并侧接编码限制酶PI-XceI和I-CeuI的罕见内含子的识别位点。然后将DraIII-SapI片段连接于消化的pX-Ad Pan6-0-16.5,形成pX-Ad Pan6 MU 0-16.5
△E1(7749bp)。
4.延长Pan-6腺病毒序列
使pX-Ad Pan6 MU 0-16.5△E1经XbaI消化得以插入XbaI-RsrII接头。分离Ad Pan6基因组的XbaI/RsrII消化片段(mu 28-100,26240 bp)并连接入XbaI/RsrII消化的pX-Ad Pan6 MU 0-16.5△E1,提供pX-Ad Pan6 MU 0-1,9-16.5,28-100。然后将Pan6基因组(mu 16.5-28,4350 bp)的第二个XbaI片段连接入此质粒中,形成pX-Ad Pan6 MU 0-1,9-100(38551 bp)。
C.重组腺病毒的产生
为了从部分A和B所述制备的E1缺失Pan6质粒产生重组腺病毒,将该质粒与辅助病毒表达的E1共转染,或从E1-表达包装细胞系,如293细胸系或上述制备的细胞系产生。包装细胞中的E1表达得以复制和将Pan6-pKGFP mu.0-1,9-100包装入病毒粒子衣壳中。或者,将已经pX-Pan6-pKGFP mu.0-1,9-100转染的该包装细胞,用上述携带另一感兴趣转基因的腺病毒载体转染。
实施例6-组E1-缺失Pan7载体的产生
A.Pan7质粒的产生
将含限制位点paCI-SmaI-FseI-MluI-EcoRV-PacI的合成接头克隆入已经EcoRI和NdeI切割的pBR322中。将Ad Pan7的左端(bp 1-3618)克隆到SmaI和FseI位点之间的接头中。然后用SnaBI和NdeI切割从此克隆左端切下腺病毒E1,并在此位置插入pShuttle(Clontech)的I-CeuI-GFP-PI-SceI盒。所得质粒用FseI和MluI切割,插入Ad Pan7的片段FseI(bp 3618)至MluI(bp 155114),以延长其左端。将21421bp的Ad Pan7右端片段(从MluI位点bp 15114开始)插入到上述质粒的MluI和EcoRV位点之间,产生缺失腺病毒Pan7E1的完全分子克隆,完成此构建物(pPan7pGFP),其适合于产生重组腺病毒。任选地,将所需转基因插入此新建pPan7载体质粒的I-ceuI和PI-SceI位点。
B.构建E1-缺失的Pan7病毒载体
为了从pPan7△E1产生此重组腺病毒,将该质粒与辅助病毒表达的E1共转染,或从E1-表达包装细胞系,如293细胸系或上述制备的细胞系产生。包装细胞中的E1表达得以复制和将Pan7△E1包装入病毒粒子衣壳中。另一实施例中,将已经pX’-Pan7△E1转染的该包装细胞,用上述携带感兴趣转基因的腺病毒载体转染。在辅助病毒与质粒之间发生同源重组,使得载体中的腺病毒转基因被置换并包装入病毒粒子衣壳中,产生该重组腺病毒。如上所述进行转染和纯化。
实施例7-生表达E1基因的质粒载体
构建编码PanE1区基因的质粒载体,利用这些质粒产生表达病毒E1蛋白的稳定细胞系。基本上如实施例4所述,将Pan5的E1区克隆入pX’中,再用pShuttle(Clontech)的片段置换此区。该表达质粒含Pan5腺病毒基因组序列,此序列跨越Pan5基因组序列中的至少bp 1-3959。因此,该表达质粒含编码黑猩猩在异源启动子控制下的Ad Pan5的E1a和E1b序列。可利用上表确定的Ad Pan6和Ad Pan7E1区产生类似的表达质粒。
实施例8-生表达黑猩猩腺病毒E1蛋白的细胞系
用实施例6的质粒转染Hela(ATCCAcc.No.CCL2)产生表达病毒E1蛋白的细胞系。这些细胞系通过共转染上述基因组病毒DNA和表达质粒,用于产生E1缺失重组黑猩猩腺病毒。用其它腺病毒如人腺病毒常规方法,进行这些细胞系的转染以及重组黑猩猩腺病毒的纯化(见例如,Horwitz,见上和其它标准教课书)。
A.表达Pan5E1蛋白的细胞系
采用CellphectTM试剂盒(PharmaciAUppsala,Sweden)按厂家程序用10μg pX-Pan51-E1 DNA转染10cm平皿中的Hela细胞。转染22小时后,使细胞经历3分钟甘油休克(15%甘油,以Hepes缓冲盐水配,PH7.5),用含10%FCS,1%Pen-Strep的DMEM(Hela)或F12K(A549;Life Technologies有限公司,GrandIsland,NY)培养液洗一次,然后在上述培养液中37℃培养6小时。将转染后的细胞按1∶20,1∶40,1∶80,1∶160和1∶320比率分装在一式二份的15cm平板中。37℃培养过夜,培养液中加入浓度lug/ml的G418(Life Technologies有限公司)。每五天换液一次,转染后20天分离克隆。
分离HelaE1细胞克隆,并测定其促进腺相关病毒(AVV)感染和下述重组LacZ蛋白表达的能力。
B.筛选有表达细胞系的AAV促进试验
AAV需要腺病毒表达的蛋白质来完成其生命周期。腺病毒的E4蛋白和E4区-编码的ORF6蛋白是促进AAV感染所必须的。采用基于AAV促进的E1表达试验。简言之,鉴定腺病毒有
表达方法包括在分开的培养物中感染推测的腺病毒E1表达细胞和不含腺病毒序列的细胞,与二者一起腺相关病毒(AAV)在适当时间内可表达标记基因,及aAV表达人腺病毒E4基因的ORF6。测定所得细胞中标记基因活性选出具有比对照细胞高得多标记物活性的那些强胞,作为经验证的E1表达细胞。下述实验中,标记基因为LacZ基因,标记物活性为呈现兰色。
例如,用携带标记基因如AV.LacZ(K.Fisher等,J.Virol.,70:520,1996)的AAV载体,或表达人5的ORF6区(AV.orf6)的AAV载体,以每个细胞100个基因组感染上述细胞系及未感染的对照细胞(Hela)。此质粒的DNA序列可产生一种新的重组腺相关病毒(rAAV),其含LacZ转基因和AdE4ORF6的开放阅读框,所表达的产物能促进产生rAAV基因组DNA的单链(ss)转变为双链(ds)。在含2%FCS、1%Pen-Strep的培养液中37℃培养这些载体4小时,此时刻加入含10%FCS的等量培养液。本领域技术人员应理解此试验第一个AAV载体中可采用符何标记基因(或报告基因),如碱性磷酸酶、荧光素酶及其它。也可用抗体酶试验来定量抗原水平,当该标记物表达抗原时。此试验不受标记基因身份的限制。感染后20-24小时用标准方法染色细胞观察LacZ活性。4小时后用显微镜观察细胞,具有比A549或hela对照细胞显著更兰色的细胞系评为阳性。
实施例9-将转基因输送给宿主细胞
然后利用得到的实施例4,5或6所述重组黑猩猩腺病毒将转基因输送给哺乳动物,优选人细胞。例如,纯化该重组病毒后,以每个细胞MOI50感染人胚肾293细胞。感染后24小时验证GFP表达。
A.小鼠模型中通过Pan-6、Pan-7和Pan-9载体的基因转移
比较了小鼠肝定向基因转移,小鼠肺定向基因转移,小鼠肌肉定向基因转移中重组黑猩猩腺病毒的基因转移效率和毒理学图谱。
采用本文技术构建了人Ad5、黑猩猩Pan6、黑猩猩Pan7和黑猩猩Pan9(C68)的含有在CMV控制下LacZ的E1缺失腺病毒载体。如下将此载体输送给免疫缺陷NCR裸鼠(每项研究80只)。对于肝脏研究,在尾静脉内注射100μl(1×1011颗粒)。对于肺研究,气管内输送50μl(5×1010颗粒)。对二肌肉研究,胫动脉内注射25μl(5×1010颗粒)。注射载体后3、7、14和28天年死小鼠(每次每组5只动物),每次处死收集肝/肺/肌肉组织并准备冷冻和石蜡包埋。冷冻切片作X-gal染色,石蜡切片作H&E染色进行组织学分析。每次进行终未取血,血清样品作肝功能试验。
此实验观察到将基因转移至肝和至肺中,黑猩猩腺病毒Pan-6、pan-7和Pan-9效果不如huAd5。然而这在某些情况需减少huAd5所见的肝毒性时,可能是需要的。肌肉中的基因转移效率血清之间差别不大。
B.反复给予血清型在Adhu5、Pan-6、Pan-7和Pan-9载体间转换的腺病毒载体的可行性小鼠研究
尾静脉给予小鼠(C57/B16;4只/组)基于huad5、Pan-6、Pan-7和Pan-9(H5.040CMVLacZ、Pan6.000CMVLacZ、Pan7.000CMVLacZ、Pan9.000CMVLacZ;1011颗粒/注射)的LacZ载体。30天后,小鼠再给予表达α1-抗胰蛋白酶的腺病毒载体(H5.040CMVhA1AT、Pan6.000CMVhA1at、Pan7.000CMVhA1At、Pan9.000CMVhA1At;1011颗粒/注射)。测定再次给药后3天和7天的血清α1-抗胰蛋白酶来监测再给予载体的转导是否成功。
测定了分别基于huAd5、Pan-6、Pan-7和Pan-9的腺病毒载体在存在其它血清型中和抗体时转导小鼠肝脏的能力。结果见下表: 第一次注射第二次注射交叉中和作用 Adhu5 Adhu5 有(对照+) Pan-6 无 Pan-7 无 Pan-9(C68) 无 Pan-6 Adhu5 无 Pan-6 有(对照+) Pan-7 有 Pan-9(C68) 无 Pan-7 Adhu5 无 Pan-6 有 pan-7 有(对照+) Pan-9(C68) 有 Pan-9 Adhu5 无 Pan-6 无 Pan-7 有 Pan-9(C68) 有(对照+)
在存在其它血清型中和抗体时,这些载体能转导小鼠肝脏载体。
因此,用huAd5免疫不阻止用黑猩猩腺病毒载体Pan-6、Pan-7或Pan-9(C68)任一种再给药。此实验也表明Pan-7的抗原相关性谱位于Pan-6和Pan-9之间,与二者有交叉反应。然而,Pan-6和Pan-9彼此不能中和。这是基于同源性比较的令人惊奇的结果,表明Pan-6完全不同于Pan-7和Pan-9。产生的抗Pan-9抗血清表明与Pan-6无交叉是和作用,但能某种程度中和Pan-7,更加说明Pan-6不同于其它血清型。
实施例10-重组E1缺失的SV-25载体的产生
构建含除工程化E1缺失之外的完全SV-25基因组。位于E1缺失部位的限制性酶I-CeuI和PI-SceI位点,允许插入穿梭质粒中的转基因,此穿梭质粒中的转基因盒测接插入有此二酶的识别位点。
将含有限制位点SwaI-SnaBI-SpeI-AflII-EcoRV-SwaI的合成接头克隆入已用EcoRI和NdeI切割的pBR322中。通过一起退火二个合成性寡聚物:SV25T(5’-AATTTAAATACGTAGCGCACTATGCGCGCTAAGCGCGGATATCATTTAAA-3’,SEQ ID NO:38)和SV25B(5’-TATTTAAATGATATCCGCGCTTAGCGCGACTAGTGCGCTACGTATTTA-3’,SEQ ID NO:39),并将其插入到已用EcoRI和NdeI消化的pBR22中。将Ad SV25的左端(bp1-1057,SEQ ID NO:29)克隆到SnaBI和SpeI位点之间的上述接头中。将Ad SV25的右端(bp 28059-31042,SEQ ID NO:29)克隆到AflII和EcoRV位点之间的接头中。然后如下在此克隆左端EcoRI位点(bp547)至XhoI(bp 2031)之间切割腺病毒E1。将用PCR从pShuttle(Clontech)产生的I-CeuI-PI-SceI盒插入EcoRI和SpeI位点之间。然后将Ad SV-25的1054 bp XhoI片段(bp 2031-12185,SEQ ID NO:29)插入SpeI位点。用HindIII消化得到的质粒,将18344 bp Ad SV-25 HindIII片段(bp 11984-30328,SEQ ID NO:29)插入,完成此构建,产生缺失E1的腺病毒SV25的完整分子克隆,其适合产生重组腺病毒。相同任选地,将所需转基因插入该新建pSV25载体质粒的I-CeuI和PI-SceI位点。
为产生带有标记基因的Ad SV25,用限制性酶I-CeuI和PI-SceI切割质粒pShuttle(Clontech)中先已克隆的GFP(绿荧光蛋白)表达盒,并连接入已用相同酶消化的之SV25(或本文所述另一黑猩猩Ad质粒)中。用SwaI消化得到的质粒(pSV25GFP)以分离细菌质粒骨架,并转染入E1感受态细胞系HEK293中。约10天后观察到细胸病变作用,表明存在复制活性病毒。将转染培养物加到新鲜培养物上,证实成功产生了基于表达GFP的腺病毒载体Ad SV25。观察细胞群中的绿色荧光确定存在次代感染细胞。
实施例11-构建E3缺失的Pan-5、Pan-6、Pan-7和C68载体
为了提高此腺病毒载体的克隆容量,可缺失掉其E3区,因不此区的编码基因为病毒培养增殖所不需要。为此目的,制备了Pan-5、Pan-6、Pan-7和C68的E3缺失版本(缺失含E31-9缺失的3.5kb Nru-AveII片段)。
A.E3缺失的Pan5载体
用AvrII内切核酸酶处理E1缺失的pPan5-pKGFP质粒,分离到含E3区的5.8kb片段,并通过AvrII缺失重环化pPan5-pKGFP,形成构建物pPan5-pKGFP-E3-AvrII。随后将5.8kbAvrII片段亚克隆入pSL-Pan5-E3-AvrII中,为用NruI消化进一步缺失E3区。这导致质粒pSL-Pan5-E3-缺失。从pSL-Pan5-E3-缺失质粒中去除4.3kbAvrII/SpeI片段,并插入到pPan5-pKGFP-E3-AvrIIr AvrII位点,产生最终构建物pPan5-E3-pKGFP。此最终构建物中,实现了E3区的3.1kb缺失。
B.Pan6载体中的E3缺失
用SbfI和NotI消化E1缺失的pPan6-pKGFP分子克隆,分离到19.3kb的片段,将其连回SbfI位点。用Eco47III和SwaI处理得到的构建物pPan6-Sbf I-E3,产生pPan6-E3。最后将pPan6-pKGFP的Sbf I消化后的21kbSbf I片段亚克隆入pPan6-E3中,产生含含E3中4kb缺失的pPan6-E3-pKGFP。
C.E3缺失的Pan7和Pan9载体
用同样方法实现二载体中的E3缺失。首先,将跨越E3区的5.8kb AvrII片段亚克隆为pSL-1180,随后用NruI消化而缺失E3。用SpeI和AvrII处理所得质粒获得4.4kb片段,将其克隆入pPan7-pPKGFP和pPan9-pKGFP的AvrII位点,分别取代原先的含AvrII片段的E3。最终的pPan7-E3-pKGFP和pPan9-E3-pKGFP构物含有3.5kbE3-缺失。
实施例12-构建E3-和E4-缺失的Pan-7载体
虽然腺病毒缺失了E1区(第一代腺病毒载体)使它们复制不完全,但所述腺病毒载体骨架基因的表达未完全消除。缺失E4区大大减弱了此残余基因的表达,可能具有安全性优点。构建了含2.5kb缺失的E4缺失Pan-7载体(缺失含E4ORFI-ORF7的PvuII-Agel片段)。用HEK293-细胞系产生此病毒的高效价贮存液,其除E1外,可表达基本的E4基因(orf6)。
1.Pan7分子克隆中的E4缺失
缺失掉pPan7-pKGFP中的19kb XbaI片段,产生pPan7-XbaI,用AgeI和Pvu II部分消化从其缺失2.5kbE4片段,产生pPan7-XbAI-E4。在二次克隆步骤中,加入来自pPan7-pKGFP构建物的19kb XbAI和15kb I-CeuI/MluI二片段,从pPan7-XbAI-E4产生pPan7-E4-pKGFP质粒。
2.将E3和E4缺失引入Pan9载体
在EcoRI消化和身连接后通过挽回pPan9-pKGFP的11 kbEcoRI片段,产生含E4区的11kb质粒pPan9-EcoRI。通过AgeI消化/填充和PvuII部分消化和自身连接,从此构建物缺失掉E4区,产生pPan9-EcoRI-E4。多pPan9-pKGFP分离得到23kb EcoRI片段,并插入到pPan9-EcoRI-E4的EcoRI位点然后加入pPan9-pKGFP的5.8kb AvrII片段,形成最终产物pPan9-E3-E4-pKGF。与野生型Pan9的基因组大小相比,此E1-E3-E4-缺失载体具有高达8kb的转基因容量》
3.将带有感兴趣基因,包括报告基因、Ebo的糖蛋白和核蛋白的小基因盒际入Pan载体的分子克隆中
采用高效定向克隆和绿/白选择程序产生重组病毒的分子克隆。简言之,通过筛选重组物的白色集落,将感兴趣的基因克隆入pShuttlepKGFP中。然后,将此小基因盒转移到黑猩猩腺病含各种缺失的毒骨架质粒pPanX-pKGFP中,便于与pKGFP盒在I-CeuI和PI-SceI位点交换和筛选正确重组体的少数白色集落。
4.挽救早期区域中带多种缺失的Pan载体分子克隆和病毒增殖
为了挽救黑猩猩腺病毒载体E1-E3-缺失的分子克隆,用适当的限制性酶线性化这些克隆,并转染入调节性293细胞中。一但在转染细胞中观察到完全的细胞病变作用,收集粗裂解物并在293细胞中扩增成大规模感染。CsCI半沉降法纯化病毒。
对于E1-E4和E1-E2-E4-缺失的Pan载体,采用10-3细胞、293为基础的1-E4-感受态细胞系来挽救和增殖这些载体。在培养液中加入150μM ZnSO4诱导10-3细胞中的E4ORF6基因表达。
实施例13-用表达野生型和变体EboZ GP的腺病毒载体作疫苗接种
产生含埃博拉包膜嵌合体有AdHu5或AdC7载体,用于C57BL/6小鼠体内免疫实验。用分子克隆方法产生具有不同病毒骨架的重组病毒,其中E1缺失位置插入了小基因盒。挽救所有重组病毒的分子克隆,培养在293细胞中用CsCI半沉降法大规模纯化。选择并产生5个AdHu5或Ad Pan7(C7)编码的EboZ变体,肌内Ad注射后评价它们的相对免疫原性。在初次疫苗研究中评价了wtEbo(一种可溶性Ebo变体)、Ebo△1、Ebo△2、Ebo△3、Ebo△4、Ebo△5、Ebo△6、Ebo△7和Ebo△8。数据小结于下表中,通过分光光度法读数建立了感染的293细胞产生和扩增的病毒颗粒数(每ml,或总数)。
表:产生编码EboZ变体的Adhu5或AdC7腺病毒载体
HuAd5 AdC7
效价 总产量 效价 总产量
基因 (VP×1012/ml) (VP×1012) (VP×1012/ml) (VP×1012)
Ebo wt 2.6 12 4.3 43
EboS 4.9 49 4.6 55
Ebo△2 2.1 9 5.8 93
Ebo△3 1.7 8 5.3 95
Ebo△4 3 12 4.1 62
肌肉内给予C57BL/6小鼠以上载体(1011基因组拷贝/细胞),评价28天后存在的中和抗体(VNA0)作为抗Ebola包膜糖蛋白免疫应答的第一次测量值。VNA此地定义为能抑制具有野生型Ebola包膜伪型的HIV载体所介导的Hela细胞转导的血清抗体(效价)。
检测到抗EboZ伪型的NVA,Ad Pan7(C7)产生的比AdHu5产生的效价更高。就转基因靶子而言,EboZ△3诱导了最高的VNA。数据小结于下表中,提供了抗HIV-EboZ-GFP伪型的中和抗体效价(稀释度倒数)(N=5只动物/每组)。
VNA效价
EboZ野生型 EboZs EboZ△3
AdHu5 12 16 12
AdC7 44 12 140
实施例14-Pan7介导的Ebola蛋白表达
进行了评价Pan-7载体表达Ebola包膜蛋白和Ebola核抗原的小鼠研究。直接评价了C57BI/6小鼠肌肉注射(IM)表达Ebola env构物之一的Adhu5或Pan-7后的中和抗体。
A.评价IM注射表达Ebola env构建物的Adhu5或Pan-7后C57B1/6小鼠的CTL
1.用Ebola病毒攻击小鼠的实验
通过观察免疫小鼠血清所介导的慢病毒(HIV)载体伪型中和作用分析了对Ebola包膜的中和性抗体(NAB)应答,此伪型载体含Ebola包膜糖蛋白的几种构建物(eEbo、NTD2、NTD3、VTD4)。C57BL/6或BALB/c小鼠接受了一次肌肉注射,每只小鼠注射C7(Ad Pan-7)编码Ebola包膜变体5×1010颗粒。接种疫苗后30天评价中和抗体。简言之,将编码β-半乳糖苷酶的Ebola Zaire伪型HIV载体与不同稀度的热灭活小鼠血清一起37℃培育2小时。与血培育后,用EboZ-HIV-LacZ感染Hela细胞37℃16小时。β-半乳糖苷酶阳性的转导Hela细胞用X-gal染色显示感染性。当观察到β-半乳糖苷酶阳性的兰色细胞数降低50%时的血清稀释度为中和抗体效价。收集一次肌肉注射(IM)5×1010颗粒/每动物免疫后30天的血清。测定所有各组小鼠搞Ebola伪型HIV的中和抗体,抗体效价范围是:Ad-EboZ(Adhu5表达的EboZ)、Ad-NTD3(Adhu5表达的NTD3)和C7-sEbo(Ad Pan-7表达的可溶性EboZ)为20,至C7-NTD3(Ad Pan-7表达的可溶性NTD3)和C7-NTD4(Ad Pan-7表达的可溶性NTD3)为130以上。在BABL/c小鼠中同样免疫方案产生了对Ad-和C7-NTD2和NTD4较低的中和抗体效价。
B.细胞免疫应答
评价了每只小鼠I.M注射5×1010个C7-LacZ或C7-Ebola包膜变体颗粒后8天,C57BL/6小鼠对Ebola包膜的细胞免疫应答。小鼠I.M疫苗注射5×1010个C7编码的LacZ或C7-Ebla包膜变体颗粒。疫苗接种后8天收集免疫小鼠的脾淋巴细胞,体外用饲养细胞(用编码野生型Ebola包膜的人腺病毒血清型5感染的未治疗小鼠的脾淋巴细胞并经放射照光)刺激。用EboZ表达子转染的51Cr-标记的同系C57细胞进行标准5-小时CTL试验。
所有编码Ebola包膜变体的Ad Pan-7观察到阳性MHV-限制性细胞毒T淋巴细胞(CTL),NTD2、NTD3或NTD4免疫小鼠有较高应答。C7编码Ebola包膜变体免疫小鼠的效应细胞能识别EboZ转染的靶细胞并产生回忆性CTL应答,特异性溶解高达30%。而原初或LacZ免疫对照小鼠的效应细胞溶解不到5%,从而证实溶解是Ebola包膜抗原特异性的。
C.保护性研究
评价编码EboZ变体C7(Ad Pan-7)是否为成功的疫苗的大多数方法是评估用小鼠适应性Ebola Zaire病毒致死性攻击后,能否保护小鼠避免失重和死亡。如前所述每只小鼠用一剂5×1010个颗粒免疫BALB/c小鼠,21目寸光天后用200LD50小鼠适应性Ebola Zaire病毒攻击接种疫苗小鼠。攻击后5-9天之间所有对照小鼠(载体和C7-LacZ)死亡。相反,除一只外(C7-sEbo组)所有接种疫苗小鼠用Ebola Zaire攻击后仍存活。
接种C7-sEbo疫苗后4-7天观察到失重。也注意到接种C7-sEbo、NTD2和NTD3疫苗的小鼠在4-7天之间有竖毛和轻至重度呆滞疾病症状。接种C7-EboZ和C7-NTD4的小鼠无疾病症状。总之,一剂C7-EboZ和C7-NTD4完全保护了免疫小鼠避免患病和死亡,可能是由于显著的T细胞介导免疫力。
所有引用的文献纳入本文作参考。对本发明的许多修饰和变化均包括在本说明书的范围内,这是本领哉技术人员明白的。对本发明组合物和方法的修改和变化,如不同小基因的选择或载体或免疫调节剂的选择均在本发明附属权利要地书的范围内。
序列表
<110>宾夕法尼亚州立大学托管会(The Trustees of the University of Pennsylvania)
J.M.威尔森(Wilson,James M.)
G.高(Gao,Guangping)
S.罗伊(Roy,Soumitra)
<120>猿猴腺病毒的核酸和氨基酸序列,含有它们的载体以及用法
<130>UPN-02677PCT
<150>US 60/331,951
<151>2001-11-21
<150>US 60/366,798
<151>2002-03-22
<160>39
<170>PatentIn version 3.1
<210>1
<211>36462
<212>DNA
<213>黑猩猩腺病毒血清型Pan5
<220>
<221>CDS
<222>(13898)..(15490)
<223>L2五邻体
<220>
<221>CDS
<222>(18315)..(21116)
<223>L3六邻体
<220>
<221>CDS
<222>(32035)..(33372)
<223>L5纤维
<400>1
catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg aggtatttga 60
atttggggat gcggggcggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120
tgacgttttg atgacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300
aatgaggaag tgaaaatctg agtaattccg cgtttatggc agggaggagt atttgccgag 360
ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420
ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480
tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540
cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600
gtaatgtttt cctggctact gggaacgaga ttctggaact ggtggtggac gccatgatgg 660
gtgacgaccc tccggagccc cctaccccat ttgaagcgcc ttcgctgtac gatttgtatg 720
atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780
gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840
cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900
aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960
aggaggcgat tcgagctgca gcgaaccagg gagtgaaaac agcgagcgag ggctttagcc 1020
tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080
ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140
acagtaagtg tgattaactt tagctgggga ggcagagggt gactgggtgc tgactggttt 1200
atttatgtat atgtttttta tgtgtaggtc ccgtctctga cgtagatgag acccccacta 1260
cagagtgcat ttcatcaccc ccagaaattg gcgaggaacc gcccgaagat attattcata 1320
gaccagttgc agtgagagtc accgggcgta gagcagctgt ggagagtttg gatgacttgc 1380
tacagggtgg ggatgaacct ttggacttgt gtacccggaa acgccccagg cactaagtgc 1440
cacacatgtg tgtttactta aggtgatgtc agtatttata gggtgtggag tgcaataaaa 1500
tccgtgttga ctttaagtgc gtggtttatg actcaggggt ggggactgtg ggtatataag 1560
caggtgcaga cctgtgtggt cagttcagag caggactcat ggagatctgg acagtcttgg 1620
aagactttca ccagactaga cagctgctag agaactcatc ggagggagtc tcttacctgt 1680
ggagattctg cttcggtggg cctctagcta agctagtcta tagggccaag caggattata 1740
aggatcaatt tgaggatatt ttgagagagt gtcctggtat ttttgactct ctcaacttgg 1800
gccatcagtc tcactttaac cagagtattc tgagagccct tgacttttct actcctggca 1860
gaactaccgc cgcggtagcc ttttttgcct ttatccttga caaatggagt caagaaaccc 1920
atttcagcag ggattaccgt ctggactgct tagcagtagc tttgtggaga acatggaggt 1980
gccagcgcct gaatgcaatc tccggctact tgccagtaca gccggtagac acgctgagga 2040
tcctgagtct ccagtcaccc caggaacacc aacgccgcca gcagccgcag caggagcagc 2100
agcaagagga ggaccgagaa gagaacctga gagccggtct ggaccctccg gtggcggagg 2160
aggaggagta gctgacttgt ttcccgagct gcgccgggtg ctgactaggt cttccagtgg 2220
acgggagagg gggattaagc gggagaggca tgaggagact agccacagaa ctgaactgac 2280
tgtcagtctg atgagtcgca ggcgcccaga atcggtgtgg tggcatgagg tgcagtcgca 2340
ggggatagat gaggtctcag tgatgcatga gaaatattcc ctagaacaag tcaagacttg 2400
ttggttggag cccgaggatg attgggaggt agccatcagg aattatgcca agctggctct 2460
gaggccagac aagaagtaca agattaccaa actgattaat atcagaaatt cctgctacat 2520
ttcagggaat ggggccgagg tggagatcag tacccaggag agggtggcct tcagatgctg 2580
catgatgaat atgtacccgg gggtggtggg catggaggga gtcaccttta tgaacgcgag 2640
gttcaggggt gatgggtata atggggtggt ctttatggcc aacaccaagc tgacagtgca 2700
cggatgctcc ttctttggct tcaataacat gtgcattgag gcctggggca gtgtttcagt 2760
gaggggatgc agtttttcag ccaactggat gggggtcgtg ggcagaacca agagcatggt 2820
gtcagtgaag aaatgcctgt tcgagaggtg ccacctgggg gtgatgagcg agggcgaagc 2880
caaagtcaaa cactgcgcct ctaccgagac gggctgcttt gtactgatca agggcaatgc 2940
caaagtcaag cataatatga tctgtggggc ctcggatgag cgcggctacc agatgctgac 3000
ctgcgccggt gggaacagcc atatgctagc caccgtgcat gtggcctcgc acccccgcaa 3060
gacatggccc gagttcgagc acaacgtcat gacccgctgc aatgtgcacc tggggtcccg 3120
ccgaggcatg ttcatgccct accagtgcaa catgcaattt gtgaaggtgc tgctggagcc 3180
cgatgccatg tccagagtga gcctgacggg ggtgtttgac atgaatgtgg agctgtggaa 3240
aattctgaga tatgatgaat ccaagaccag gtgccgggcc tgcgaatgcg gaggcaagca 3300
cgccaggctt cagcccgtgt gtgtggaggt gacggaggac ctgcgacccg atcatttggt 3360
gttgtcctgc aacgggacgg agttcggctc cagcggggaa gaatctgact agagtgagta 3420
gtgtttggga ctgggtggga gcctgcatga tgggcagaat gactaaaatc tgtgtttttc 3480
tgcgcagcag catgagcgga agcgcctcct ttgagggagg ggtattcagc ccttatctga 3540
cggggcgtct cccctcctgg gcgggagtgc gtcagaatgt gatgggatcc acggtggacg 3600
gccggcccgt gcagcccgcg aactcttcaa ccctgaccta cgcgaccctg agctcctcgt 3660
ccgtggacgc agctgccgcc gcagctgctg cttccgccgc cagcgccgtg cgcggaatgg 3720
ccctgggcgc cggctactac agctctctgg tggccaactc gagttccacc aataatcccg 3780
ccagcctgaa cgaggagaag ctgctgctgc tgatggccca gctcgaggcc ctgacccagc 3840
gcctgggcga gctgacccag caggtggctc agctgcaggc ggagacgcgg gccgcggttg 3900
ccacggtgaa aaccaaataa aaaatgaatc aataaataaa cggagacggt tgttgatttt 3960
aacacagagt cttgaatctt tatttgattt ttcgcgcgcg gtaggccctg gaccaccggt 4020
ctcgatcatt gagcacccgg tggatctttt ccaggacccg gtagaggtgg gcttggatgt 4080
tgaggtacat gggcatgagc ccgtcccggg ggtggaggta gctccattgc agggcctcgt 4140
gctcgggggt ggtgttgtaa atcacccagt catagcaggg gcgcagggcg tggtgctgca 4200
cgatgtcctt gaggaggaga ctgatggcca cgggcagccc cttggtgtag gtgttgacga 4260
acctgttgag ctgggaggga tgcatgcggg gggagatgag atgcatcttg gcctggatct 4320
tgagattggc gatgttcccg cccagatccc gccgggggtt catgttgtgc aggaccacca 4380
gcacggtgta tccggtgcac ttggggaatt tgtcatgcaa cttggaaggg aaggcgtgaa 4440
agaatttgga gacgcccttg tgaccgccca ggttttccat gcactcatcc atgatgatgg 4500
cgatgggccc gtgggcggcg gcttgggcaa agacgtttcg ggggtcggac acatcgtagt 4560
tgtggtcctg ggtgagctcg tcataggcca ttttaatgaa tttggggcgg agggtgcccg 4620
actgggggac gaaggtgccc tcgatcccgg gggcgtagtt gccctcgcag atctgcatct 4680
cccaggcctt gagctcggag ggggggatca tgtccacctg cggggcgatg aaaaaaacgg 4740
tttccggggc gggggagatg agctgggccg aaagcaggtt ccggagcagc tgggacttgc 4800
cgcagccggt ggggccgtag atgaccccga tgaccggctg caggtggtag ttgagggaga 4860
gacagctgcc gtcctcgcgg aggagggggg ccacctcgtt catcatctcg cgcacatgca 4920
tgttctcgcg cacgagttcc gccaggaggc gctcgccccc aagcgagagg agctcttgca 4980
gcgaggcgaa gtttttcagc ggcttgagcc cgtcggccat gggcattttg gagagggtct 5040
gttgcaagag ttccagacgg tcccagagct cggtgatgtg ctctagggca tctcgatcca 5100
gcagacctcc tcgtttcgcg ggttggggcg actgcgggag tagggcacca ggcgatgggc 5160
gtccagcgag gccagggtcc ggtccttcca ggggcgcagg gtccgcgtca gcgtggtctc 5220
cgtcacggtg aaggggtgcg cgccgggctg ggcgcttgcg agggtgcgct tcaggctcat 5280
ccggctggtc gagaaccgct cccggtcggc gccctgcgcg tcggccaggt agcaattgag 5340
catgagttcg tagttgagcg cctcggccgc gtggcccttg gcgcggagct tacctttgga 5400
agtgtgtccg cagacgggac agaggaggga cttgagggcg tagagcttgg gggcgaggaa 5460
gacggactcg ggggcgtagg cgtccgcgcc gcagctggcg cagacggtct cgcactccac 5520
gagccaggtg aggtctggcc ggtcggggtc aaaaacgagg tttcctccgt gctttttgat 5580
gcgtttctta cctctggtct ccatgagctc gtgtccccgc tgggtgacaa agaggctgtc 5640
cgtgtccccg tagaccgact ttatgggccg gtcctcgagc ggggtgccgc ggtcctcgtc 5700
gtagaggaac cccgcccact ccgagacgaa ggcccgggtc caggccagca cgaaggaggc 5760
cacgtgggag gggtagcggt cgttgtccac cagcgggtcc accttctcca gggtatgcaa 5820
gcacatgtcc ccctcgtcca catccaggaa ggtgattggc ttgtaagtgt aggccacgtg 5880
accgggggtc ccggccgggg gggtataaaa gggggcgggc ccctgctcgt cctcactgtc 5940
ttccggatcg ctgtccagga gcgccagctg ttggggtagg tattccctct cgaaggcggg 6000
catgacctcg gcactcaggt tgtcagtttc tagaaacgag gaggatttga tattgacggt 6060
gccgttggag acgcctttca tgagcccctc gtccatctgg tcagaaaaga cgatcttttt 6120
gttgtcgagc ttggtggcga aggagccgta gagggcgttg gagagcagct tggcgatgga 6180
gcgcatggtc tggttctttt ccttgtcggc gcgctccttg gcggcgatgt tgagctgcac 6240
gtactcgcgc gccacgcact tccattcggg gaagacggtg gtgagcttgt cgggcacgat 6300
tctgacccgc cagccgcggt tgtgcagggt gatgaggtcc acgctggtgg ccacctcgcc 6360
gcgcaggggc tcgttggtcc agcagaggcg cccgcccttg cgcgagcaga aggggggcag 6420
cgggtccagc atgagctcgt cgggggggtc ggcgtccacg gtgaagatgc cgggcaggag 6480
ctcggggtcg aagtagctga tgcaggtgcc cagatcgtcc agcgccgctt gccagtcgcg 6540
cacggccagc gcgcgctcgt aggggctgag gggcgtgccc cagggcatgg ggtgcgtgag 6600
cgcggaggcg tacatgccgc agatgtcgta gacgtagagg ggctcctcga ggacgccgat 6660
gtaggtgggg tagcagcgcc ccccgcggat gctggcgcgc acgtagtcgt acagctcgtg 6720
cgagggcgcg aggagcccgg tgccgaggtt ggagcgctgc ggcttttcgg cgcggtagac 6780
gatctggcgg aagatggcgt gggagttgga ggagatggtg ggcctctgga agatgttgaa 6840
gtgggcgtgg ggcagtccga ccgagtccct gatgaagtgg gcgtaggagt cctgcagctt 6900
ggcgacgagc tcggcggtga cgaggacgtc cagggcgcag tagtcgaggg tctcttggat 6960
gatgtcgtac ttgagctggc ccttctgctt ccacagctcg cggttgagaa ggaactcttc 7020
gcggtccttc cagtactctt cgagggggaa cccgtcctga tcggcacggt aagagcccac 7080
catgtagaac tggttgacgg ccttgtaggc gcagcagccc ttctccacgg ggagggcgta 7140
agcttgcgcg gccttgcgca gggaggtgtg ggtgagggcg aaggtgtcgc gcaccatgac 7200
cttgaggaac tggtgcttga agtcgaggtc gtcgcagccg ccctgctccc agagctggaa 7260
gtccgtgcgc ttcttgtagg cggggttggg caaagcgaaa gtaacatcgt tgaagaggat 7320
cttgcccgcg cggggcatga agttgcgagt gatgcggaaa ggctggggca cctcggcccg 7380
gttgttgatg acctgggcgg cgaggacgat ctcgtcgaag ccgttgatgt tgtgcccgac 7440
gatgtagagt tccacgaatc gcgggcggcc cttgacgtgg ggcagcttct tgagctcgtc 7500
gtaggtgagc tcggcggggt cgctgaggcc gtgctgctcg agggcccagt cggcgaggtg 7560
ggggttggcg ccgaggaagg aagtccagag atccacggcc agggcggtct gcaagcggtc 7620
ccggtactga cggaactgct ggcccacggc cattttttcg ggggtgacgc agtagaaggt 7680
gcgggggtcg ccgtgccagc ggtcccactt gagctggagg gcgaggtcgt gggcgagctc 7740
gacgagcggc gggtccccgg agagtttcat gaccagcatg aaggggacga gctgcttgcc 7800
gaaggacccc atccaggtgt aggtttccac gtcgtaggtg aggaagagcc tttcggtgcg 7860
aggatgcgag ccgatgggga agaactggat ctcctgccac cagttggagg aatggctgtt 7920
gatgtgatgg aagtagaaat gccgacggcg cgccgagcac tcgtgcttgt gtttatacaa 7980
gcgtccgcag tgctcgcaac gctgcacggg atgcacgtgc tgcacgagct gtacctgggt 8040
tcctttgacg aggaatttca gtgggcagtg gagcgctggc ggctgcatct ggtgctgtac 8100
tacgtcctgg ccatcggcgt ggccatcgtc tgcctcgatg gtggtcatgc tgacgaggcc 8160
gcgcgggagg caggtccaga cctcggctcg gacgggtcgg agagcgagga cgagggcgcg 8220
caggccggag ctgtccaggg tcctgagacg ctgcggagtc aggtcagtgg gcagcggcgg 8280
cgcgcggttg acttgcagga gcttttccag ggcgcgcggg aggtccagat ggtacttgat 8340
ctccacggcg ccgttggtgg cgacgtccac ggcttgcagg gtcccgtgcc cctggggcgc 8400
caccaccgtg ccccgtttct tcttgggtgc tggcggcggc ggctccatgc ttagaagcgg 8460
cggcgaggac gcgcgccggg cggcaggggc ggctcggggc ccggaggcag gggcggcagg 8520
ggcacgtcgg cgccgcgcgc gggcaggttc tggtactgcg cccggagaag actggcgtga 8580
gcgacgacgc gacggttgac gtcctggatc tgacgcctct gggtgaaggc cacgggaccc 8640
gtgagtttga acctgaaaga gagttcgaca gaatcaatct cggtatcgtt gacggcggcc 8700
tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt aggcgatctc ggtcatgaac 8760
tgctcgatct cctcctcctg aaggtctccg cgaccggcgc gctcgacggt ggccgcgagg 8820
tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgccggcctc gttccagacg 8880
cggctgtaga ccacggctcc gtcggggtcg cgcgcgcgca tgaccacctg ggcgaggttg 8940
agctcgacgt ggcgcgtgaa gaccgcgtag ttgcagaggc gctggtagag gtagttgagc 9000
gtggtggcga tgtgctcggt gacgaagaag tacatgatcc agcggcggag cggcatctcg 9060
ctgacgtcgc ccagggcttc caagcgctcc atggcctcgt agaagtccac ggcgaagttg 9120
aaaaactggg agttgcgcgc cgagacggtc aactcctcct ccagaagacg gatgagctcg 9180
gcgatggtgg cgcgcacctc gcgctcgaag gccccggggg gctcctcttc ttccatctcc 9240
tcctcctctt ccatctcctc cactaacatc tcttctactt cctcctcagg aggcggcggc 9300
gggggagggg ccctgcgtcg ccggcggcgc acgggcagac ggtcgatgaa gcgctcgatg 9360
gtctccccgc gccggcgacg catggtctcg gtgacggcgc gcccgtcctc gcggggccgc 9420
agcgtgaaga cgccgccgcg catctccagg tggccgccgg gggggtctcc gttgggcagg 9480
gagagggcgc tgacgatgca tcttatcaat tggcccgtag ggactccgcg caaggacctg 9540
agcgtctcga gatccacggg atccgaaaac cgctgaacga aggcttcgag ccagtcgcag 9600
tcgcaaggta ggctgagccc ggtttcttgt tcttcgggta tttggtcggg aggcgggcgg 9660
gcgatgctgc tggtgatgaa gttgaagtag gcggtcctga gacggcggat ggtggcgagg 9720
agcaccaggt ccttgggccc ggcttgctgg atgcgcagac ggtcggccat gccccaggcg 9780
tggtcctgac acctggcgag gtccttgtag tagtcctgca tgagccgctc cacgggcacc 9840
tcctcctcgc ccgcgcggcc gtgcatgcgc gtgagcccga acccgcgctg cggctggacg 9900
agcgccaggt cggcgacgac gcgctcggcg aggatggcct gctggatctg ggtgagggtg 9960
gtctggaagt cgtcgaagtc gacgaagcgg tggtaggctc cggtgttgat ggtgtaggag 10020
cagttggcca tgacggacca gttgacggtc tggtggccgg ggcgcacgag ctcgtggtac 10080
ttgaggcgcg agtaggcgcg cgtgtcgaag atgtagtcgt tgcaggtgcg cacgaggtac 10140
tggtatccga cgaggaagtg cggcggcggc tggcggtaga gcggccatcg ctcggtggcg 10200
ggggcgccgg gcgcgaggtc ctcgagcatg aggcggtggt agccgtagat gtacctggac 10260
atccaggtga tgccggcggc ggtggtggag gcgcgcggga actcgcggac gcggttccag 10320
atgttgcgca gcggcaggaa gtagttcatg gtggccgcgg tctggcccgt gaggcgcgcg 10380
cagtcgtgga tgctctagac atacgggcaa aaacgaaagc ggtcagcggc tcgactccgt 10440
ggcctggagg ctaagcgaac gggttgggct gcgcgtgtac cccggttcga gtccctgctc 10500
gaatcaggct ggagccgcag ctaacgtggt actggcactc ccgtctcgac ccaagcctgc 10560
taacgaaacc tccaggatac ggaggcgggt cgttttggcc attttcgtca ggccggaaat 10620
gaaactagta agcgcggaaa gcggccgtcc gcgatggctc gctgccgtag tctggagaaa 10680
gaatcgccag ggttgcgttg cggtgtgccc cggttcgagc ctcagcgctc ggcgccggcc 10740
ggattccgcg gctaacgtgg gcgtggctgc cccgtcgttt ccaagacccc ttagccagcc 10800
gacttctcca gttacggagc gagcccctct ttttcttgtg tttttgccag atgcatcccg 10860
tactgcggca gatgcgcccc caccctccac cacaaccgcc cctaccgcag cagcagcaac 10920
agccggcgct tctgcccccg ccccagcagc agcagccagc cactaccgcg gcggccgccg 10980
tgagcggagc cggcgttcag tatgacctgg ccttggaaga gggcgagggg ctggcgcggc 11040
tgggggcgtc gtcgccggag cggcacccgc gcgtgcagat gaaaagggac gctcgcgagg 11100
cctacgtgcc caagcagaac ctgttcagag acaggagcgg cgaggagccc gaggagatgc 11160
gcgcctcccg cttccacgcg gggcgggagc tgcggcgcgg cctggaccga aagcgggtgc 11220
tgagggacga ggatttcgag gcggacgagc tgacggggat cagccccgcg cgcgcgcacg 11280
tggccgcggc caacctggtc acggcgtacg agcagaccgt gaaggaggag agcaacttcc 11340
aaaaatcctt caacaaccac gtgcgcacgc tgatcgcgcg cgaggaggtg accctgggcc 11400
tgatgcacct gtgggacctg ctggaggcca tcgtgcagaa ccccacgagc aagccgctga 11460
cggcgcagct gtttctggtg gtgcagcaca gtcgggacaa cgagacgttc agggaggcgc 11520
tgctgaatat caccgagccc gagggccgct ggctcctgga cctggtgaac attctgcaga 11580
gcatcgtggt gcaggagcgc gggctgccgc tgtccgagaa gctggcggcc atcaacttct 11640
cggtgctgag cctgggcaag tactacgcta ggaagatcta caagaccccg tacgtgccca 11700
tagacaagga ggtgaagatc gacgggtttt acatgcgcat gaccctgaaa gtgctgaccc 11760
tgagcgacga tctgggggtg taccgcaacg acaggatgca ccgcgcggtg agcgccagcc 11820
gccggcgcga gctgagcgac caggagctga tgcacagcct gcagcgggcc ctgaccgggg 11880
ccgggaccga gggggagagc tactttgaca tgggcgcgga cctgcgctgg cagcctagcc 11940
gccgggcctt ggaagctgcc ggcggttccc cctacgtgga ggaggtggac gatgaggagg 12000
aggagggcga gtacctggaa gactgatggc gcgaccgtat ttttgctaga tgcagcaaca 12060
gccaccgccg cctcctgatc ccgcgatgcg ggcggcgctg cagagccagc cgtccggcat 12120
taactcctcg gacgattgga cccaggccat gcaacgcatc atggcgctga cgacccgcaa 12180
tcccgaagcc tttagacagc agcctcaggc caaccgactc tcggccatcc tggaggccgt 12240
ggtgccctcg cgctcgaacc ccacgcacga gaaggtgctg gccatcgtga acgcgctggt 12300
ggagaacaag gccatccgcg gcgacgaggc cgggctggtg tacaacgcgc tgctggagcg 12360
cgtggcccgc tacaacagca ccaacgtgca gacgaacctg gaccgcatgg tgaccgacgt 12420
gcgcgaggcg gtgtcgcagc gcgagcggtt ccaccgcgag tcgaacctgg gctccatggt 12480
ggcgctgaac gccttcctga gcacgcagcc cgccaacgtg ccccggggcc aggaggacta 12540
caccaacttc atcagcgcgc tgcggctgat ggtggccgag gtgccccaga gcgaggtgta 12600
ccagtcgggg ccggactact tcttccagac cagtcgccag ggcttgcaga ccgtgaacct 12660
gagccaggct ttcaagaact tgcagggact gtggggcgtg caggccccgg tcggggaccg 12720
cgcgacggtg tcgagcctgc tgacgccgaa ctcgcgcctg ctgctgctgc tggtggcgcc 12780
cttcacggac agcggcagcg tgagccgcga ctcgtacctg ggctacctgc ttaacctgta 12840
ccgcgaggcc atcgggcagg cgcacgtgga cgagcagacc taccaggaga tcacccacgt 12900
gagccgcgcg ctgggccagg aggacccggg caacctggag gccaccctga acttcctgct 12960
gaccaaccgg tcgcagaaga tcccgcccca gtacgcgctg agcaccgagg aggagcgcat 13020
cctgcgctac gtgcagcaga gcgtggggct gttcctgatg caggaggggg ccacgcccag 13080
cgccgcgctc gacatgaccg cgcgcaacat ggagcccagc atgtacgccc gcaaccgccc 13140
gttcatcaat aagctgatgg actacttgca tcgggcggcc gccatgaact cggactactt 13200
taccaacgcc atcttgaacc cgcactggct cccgccgccc gggttctaca cgggcgagta 13260
cgacatgccc gaccccaacg acgggttcct gtgggacgac gtggacagca gcgtgttctc 13320
gccgcgcccc accaccacca ccgtgtggaa gaaagagggc ggggaccggc ggccgtcctc 13380
ggcgctgtcc ggtcgcgcgg gtgctgccgc ggcggtgccc gaggccgcca gccccttccc 13440
gagcctgccc ttttcgctga acagcgtgcg cagcagcgag ctgggtcggc tgacgcggcc 13500
gcgcctgctg ggcgaggagg agtacctgaa cgactccttg cttcggcccg agcgcgagaa 13560
gaacttcccc aataacggga tagagagcct ggtggacaag atgagccgct ggaagacgta 13620
cgcgcacgag cacagggacg agccccgagc tagcagcagc accggcgcca cccgtagacg 13680
ccagcggcac gacaggcagc ggggtctggt gtgggacgat gaggattccg ccgacgacag 13740
cagcgtgttg gacttgggtg ggagtggtgg tggtaacccg ttcgctcacc tgcgcccccg 13800
tatcgggcgc ctgatgtaag aatctgaaaa aataaaagac ggtactcacc aaggccatgg 13860
cgaccagcgt gcgttcttct ctgttgtttg tagtagt atg atg agg cgc gtg tac 13915
Met Met Arg Arg Val Tyr
1 5
ccg gag ggt cct cct ccc tcg tac gag agc gtg atg cag cag gcg gtg 13963
Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Gln Gln Ala Val
10 15 20
gcg gcg gcg atg cag ccc ccg ctg gag gcg cct tac gtg ccc ccg cgg 14011
Ala Ala Ala Met Gln Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg
25 30 35
tac ctg gcg cct acg gag ggg cgg aac agc att cgt tac tcg gag ctg 14059
Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu
40 45 50
gca ccc ttg tac gat acc acc cgg ttg tac ctg gtg gac aac aag tcg 14107
Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser
55 60 65 70
gcg gac atc gcc tcg ctg aac tac cag aac gac cac agc aac ttc ctg 14155
Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu
75 80 85
acc acc gtg gtg cag aac aac gat ttc acc ccc acg gag gcc agc acc 14203
Thr Thr Val Val Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr
90 95 100
cag acc atc aac ttt gac gag cgc tcg cgg tgg ggc ggc cag ctg aaa 14251
Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys
105 110 115
acc atc atg cac acc aac atg ccc aac gtg aac gag ttc atg tac agc 14299
Thr Ile Met His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser
120 125 130
aac aag ttc aag gcg cgg gtg atg gtc tcg cgc aag acc ccc aac ggg 14347
Asn Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly
135 140 145 150
gtc aca gta aca gat ggt agt cag gac gag ctg acc tac gag tgg gtg 14395
Val Thr Val Thr Asp Gly Ser Gln Asp Glu Leu Thr Tyr Glu Trp Val
155 160 165
gag ttt gag ctg ccc gag ggc aac ttc tcg gtg acc atg acc atc gat 14443
Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp
170 175 180
ctg atg aac aac gcc atc atc gac aac tac ttg gcg gtg ggg cgg cag 14491
Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln
185 190 195
aac ggg gtg ctg gag agc gac atc ggc gtg aag ttc gac acg cgc aac 14539
Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn
200 205 210
ttc cgg ctg ggc tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg 14587
Phe Arg Leu Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val
215 220 225 230
tac acc aac gag gcc ttc cac ccc gac atc gtc ctg ctg ccc ggc tgc 14635
Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys
235 240 245
ggc gtg gac ttc acc gag agc cgc ctc agc aac ctg ctg ggc atc cgc 14683
Gly Val Asp Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg
250 255 260
aag cgg cag ccc ttc cag gag ggc ttc cag atc ctg tac gag gac ctg 14731
Lys Arg Gln Pro Phe Gln Glu Gly Phe Gln Ile Leu Tyr Glu Asp Leu
265 270 275
gag ggg ggc aac atc ccc gcg ctg ctg gac gtg gac gcc tac gag aaa 14779
Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys
280 285 290
agc aag gag gat agc gcc gcc gcg gcg acc gca gcc gtg gcc acc gcc 14827
Ser Lys Glu Asp Ser Ala Ala Ala AlaThr Ala Ala Val Ala Thr Ala
295 300 305 310
tct acc gag gtg cgg ggc gat aat ttt gct agc gcc gcg aca ctg gca 14875
Ser Thr Glu Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Thr Leu Ala
315 320 325
gcg gcc gag gcg gct gaa acc gaa agt aag ata gtg atc cag ccg gtg 14923
Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys Ile Val Ile Gln Pro Val
330 335 340
gag aag gac agc aag gag agg agc tac aac gtg ctc gcg gac aag aaa 14971
Glu Lys Asp Ser Lys Glu Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys
345 350 355
aac acc gcc tac cgc agc tgg tac ctg gcc tac aac tac ggc gac ccc 15019
Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro
360 365 370
gag aag ggc gtg cgc tcc tgg acg ctg ctc acc acc tcg gac gtc acc 15067
Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr
375 380 385 390
tgc ggc gtg gag caa gtc tac tgg tcg ctg ccc gac atg atg caa gac 15115
Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp
395 400 405
ccg gtc acc ttc cgc tcc acg cgt caa gtt agc aac tac ccg gtg gtg 15163
Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Tyr Pro Val Val
410 415 420
ggc gcc gag ctc ctg ccc gtc tac tcc aag agc ttc ttc aac gag cag 15211
Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln
425 430 435
gcc gtc tac tcg cag cag ctg cgc gcc ttc acc tcg ctc acg cac gtc 15259
Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val
440 445 450
ttc aac cgc ttc ccc gag aac cag atc ctc gtt cgc ccg ccc gcg ccc 15307
Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro
455 460 465 470
acc att acc acc gtc agt gaa aac gtt cct gct ctc aca gat cac ggg 15355
Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly
475 480 485
acc ctg ccg ctg cgc agc agt atc cgg gga gtc cag cgc gtg acc gtc 15403
Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val Thr Val
490 495 500
act gac gcc aga cgc cgc acc tgc ccc tac gtc tac aag gcc ctg ggc 15451
Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly
505 510 515
gta gtc gcg ccg cgc gtc ctc tcg agc cgc acc ttc taa aaaatgtcca 15500
Val Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe
520 525 530
ttctcatctc gcccagtaat aacaccggtt ggggcctgcg cgcgcccagc aagatgtacg 15560
gaggcgctcg ccaacgctcc acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc 15620
cctggggcgc cctcaagggc cgcgtgcgct cgcgcaccac cgtcgacgac gtgatcgacc 15680
aggtggtggc cgacgcgcgc aactacacgc ccgccgccgc gcccgtctcc accgtggacg 15740
ccgtcatcga cagcgtggtg gccgacgcgc gccggtacgc ccgcgccaag agccggcggc 15800
ggcgcatcgc ccggcggcac cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc 15860
gcagggccag gcgcacggga cgcagggcca tgctcagggc ggccagacgc gcggcctccg 15920
gcagcagcag cgccggcagg acccgcagac gcgcggccac ggcggcggcg gcggccatcg 15980
ccagcatgtc ccgcccgcgg cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg 16040
tgcgcgtgcc cgtgcgcacc cgcccccctc gcacttgaag atgctgactt cgcgatgttg 16100
atgtgtccca gcggcgagga ggatgtccaa gcgcaaattc aaggaagaga tgctccaggt 16160
catcgcgcct gagatctacg gcccggcggc ggtgaaggag gaaagaaagc cccgcaaact 16220
gaagcgggtc aaaaaggaca aaaaggagga ggaagatgtg gacggactgg tggagtttgt 16280
gcgcgagttc gccccccggc ggcgcgtgca gtggcgcggg cggaaagtga aaccggtgct 16340
gcgacccggc accacggtgg tcttcacgcc cggcgagcgt tccggctccg cctccaagcg 16400
ctcctacgac gaggtgtacg gggacgagga catcctcgag caggcggccg aacgtctggg 16460
cgagtttgct tacggcaagc gcagccgccc cgcgcccttg aaagaggagg cggtgtccat 16520
cccgctggac cacggcaacc ccacgccgag cctgaagccg gtgaccctgc agcaggtgct 16580
gcctggtgcg gcgccgcgcc ggggcttcaa gcgcgagggc ggcgaggatc tgtacccgac 16640
catgcagctg atggtgccca agcgccagaa gctggaggac gtgctggagc acatgaaggt 16700
ggaccccgag gtgcagcccg aggtcaaggt gcggcccatc aagcaggtgg ccccgggcct 16760
gggcgtgcag accgtggaca tcaagatccc cacggagccc atggaaacgc agaccgagcc 16820
cgtgaagccc agcaccagca ccatggaggt gcagacggat ccctggatgc cggcaccggc 16880
ttccaccacc cgccgaagac gcaagtacgg cgcggccagc ctgctgatgc ccaactacgc 16940
gctgcatcct tccatcatcc ccacgccggg ctaccgcggc acgcgcttct accgcggcta 17000
caccagcagc cgccgccgca agaccaccac ccgccgccgc cgtcgtcgca cccgccgcag 17060
cagcaccgcg acttccgccg ccgccctggt gcggagagtg taccgcagcg ggcgcgagcc 17120
tctgaccctg ccgcgcgcgc gctaccaccc gagcatcgcc atttaactac cgcctcctac 17180
ttgcagatat ggccctcaca tgccgcctcc gcgtccccat tacgggctac cgaggaagaa 17240
agccgcgccg tagaaggctg acggggaacg ggctgcgtcg ccatcaccac cggcggcggc 17300
gcgccatcag caagcggttg gggggaggct tcctgcccgc gctgatgccc atcatcgccg 17360
cggcgatcgg ggcgatcccc ggcatagctt ccgtggcggt gcaggcctct cagcgccact 17420
gagacacagc ttggaaaatt tgtaataaaa aatggactga cgctcctggt cctgtgatgt 17480
gtgtttttag atggaagaca tcaatttttc gtccctggca ccgcgacacg gcacgcggcc 17540
gtttatgggc acctggagcg acatcggcaa cagccaactg aacgggggcg ccttcaattg 17600
gagcagtctc tggagcgggc ttaagaattt cgggtccacg ctcaaaacct atggcaacaa 17660
ggcgtggaac agcagcacag ggcaggcgct gagggaaaag ctgaaagagc agaacttcca 17720
gcagaaggtg gtcgatggcc tggcctcggg catcaacggg gtggtggacc tggccaacca 17780
ggccgtgcag aaacagatca acagccgcct ggacgcggtc ccgcccgcgg ggtccgtgga 17840
gatgccccag gtggaggagg agctgcctcc cctggacaag cgcggcgaca agcgaccgcg 17900
tcccgacgcg gaggagacgc tgctgacgca cacggacgag ccgcccccgt acgaggaggc 17960
ggtgaaactg ggtctgccca ccacgcggcc cgtggcgcct ctggccaccg gggtgctgaa 18020
acccagcagc agcagcagcc agcccgcgac cctggacttg cctccgcctg cttcccgccc 18080
ctccacagtg gctaagcccc tgccgccggt ggccgtcgcg tcgcgcgccc cccgaggccg 18140
cccccaggcg aactggcaga gcactctgaa cagcatcgtg ggtctgggag tgcagagtgt 18200
gaagcgccgc cgctgctatt aaaagacact gtagcgctta acttgcttgt ctgtgtgtat 18260
atgtatgtcc gccgaccaga aggaggagga agaggcgcgt cgccgagttg caag atg 18317
Met
gcc acc cca tcg atg ctg ccc cag tgg gcg tac atg cac atc gcc gga 18365
Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala Gly
535 540 545
cag gac gct tcg gag tac ctg agt ccg ggt ctg gtg cag ttc gcc cgc 18413
Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala Arg
550 555 560
gcc aca gac acc tac ttc agt ctg ggg aac aag ttt agg aac ccc acg 18461
Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr
565 570 575
gtg gcg ccc acg cac gat gtg acc acc gac cgc agc cag cgg ctg acg 18509
Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr
580 585 590 595
ctg cgc ttc gtg ccc gtg gac cgc gag gac aac acc tac tcg tac aaa 18557
Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr Lys
600 605 610
gtg cgc tac acg ctg gcc gtg ggc gac aac cgc gtg ctg gac atg gcc 18605
Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala
615 620 625
agc acc tac ttt gac atc cgc ggc gtg ctg gat cgg ggc cct agc ttc 18653
Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser Phe
630 635 640
aaa ccc tac tcc ggc acc gct tac aac agc ctg gct ccc aag gga gcg 18701
Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala
645 650 655
ccc aac act tgc cag tgg aca tat aaa gct gat ggt gat act ggt aca 18749
Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Asp Gly Asp Thr Gly Thr
660 665 670 675
gaa aaa acc tat aca tat gga aat gcg cct gtg caa ggc att agt att 18797
Glu LysThr Tyr Thr Tyr Gly Asn Ala Pro Val Gln Gly Ile Ser Ile
680 685 690
aca aaa gat ggt att caa ctt gga act gac act gat gat cag ccc att 18845
Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Thr Asp Asp Gln Pro Ile
695 700 705
tat gca gat aaa act tat caa cca gag cct caa gtg ggt gat gct gaa 18893
Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Ala Glu
710 715 720
tgg cat gac atc act ggt act gat gaa aaa tat gga ggc aga gct ctc 18941
Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu
725 730 735
aag cct gac acc aaa atg aag ccc tgc tat ggt tct ttt gcc aag cct 18989
Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro
740 745 750 755
acc aat aaa gaa gga ggt cag gca aat gtg aaa acc gaa aca ggc ggt 19037
Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys Thr Glu Thr Gly Gly
760 765 770
acc aaa gaa tat gac att gac atg gca ttc ttc gat aat cga agt gca 19085
Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser Ala
775 780 785
gct gcg gct ggc ctg gcc cca gaa att gtt ttg tat act gag aat gtg 19133
Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn Val
790 795 800
gat ctg gaa act cca gat act cat att gta tac aag gcg ggc aca gat 19181
Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr Asp
805 810 815
gac agc agc tct tct atc aat ttg ggt cag cag tcc atg ccc aac aga 19229
Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln Ser Met Pro Asn Arg
820 825 830 835
ccc aac tac att ggc ttt aga gac aac ttt atc ggg ctc atg tac tac 19277
Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr
840 845 850
aac agc act ggc aac atg ggc gtg ctg gct ggt cag gcc tcc cag ctg 19325
Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu
855 860 865
aat gct gtg gtg gac ttg cag gac aga aac act gaa ctg tcc tac cag 19373
Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln
870 875 880
ctc ttg ctt gac tct ctg ggc gac aga acc agg tat ttc agt atg tgg 19421
Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp
885 890 895
aat cag gcg gtg gac agc tat gac ccc gat gtg cgc att att gaa aat 19469
Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn
900 905 910 915
cac ggt gtg gag gat gaa ctc cct aac tat tgc ttc ccc ctg gat gct 19517
His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala
920 925 930
gtg ggt aga act gat act tac cag gga att aag gcc aat ggt gct gat 19565
Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys Ala Asn Gly Ala Asp
935 940 945
caa acc acc tgg acc aaa gat gat act gtt aat gat gct aat gaa ttg 19613
Gln Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu Leu
950 955 960
ggc aag ggc aat cct ttc gcc atg gag atc aac atc cag gcc aac ctg 19661
Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn Leu
965 970 975
tgg cgg aac ttc ctc tac gcg aac gtg gcg ctg tac ctg ccc gac tcc 19709
Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser
980 985 990 995
tac aag tac acg ccg gcc aac atc acg ctg ccg acc aac acc aac 19754
Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn
1000 1005 1010
acc tac gat tac atg aac ggc cgc gtg gtg gcg ccc tcg ctg gtg 19799
Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val
1015 1020 1025
gac gcc tac atc aac atc ggg gcg cgc tgg tcg ctg gac ccc atg 19844
Asp Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met
1030 1035 1040
gac aac gtc aac ccc ttc aac cac cac cgc aac gcg ggc ctg cgc 19889
Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg
1045 1050 1055
tac cgc tcc atg ctc ctg ggc aac ggg cgc tac gtg ccc ttc cac 19934
Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His
1060 1065 1070
atc cag gtg ccc caa aag ttc ttc gcc atc aag agc ctc ctg ctc 19979
Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu
1075 1080 1085
ctg ccc ggg tcc tac acc tac gag tgg aac ttc cgc aag gac gtc 20024
Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val
1090 1095 1100
aac atg atc ctg cag agc tcc ctc ggc aac gac ctg cgc acg gac 20069
Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp
1105 1110 1115
ggg gcc tcc atc gcc ttc acc agc atc aac ctc tac gcc acc ttc 20114
Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe
1120 1125 1130
ttc ccc atg gcg cac aac acc gcc tcc acg ctc gag gcc atg ctg 20159
Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu
1135 1140 1145
cgc aac gac acc aac gac cag tcc ttc aac gac tac ctc tcg gcg 20204
Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala
1150 1155 1160
gcc aac atg ctc tac ccc atc ccg gcc aac gcc acc aac gtg ccc 20249
Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro
1165 1170 1175
atc tcc atc ccc tcg cgc aac tgg gcc gcc ttc cgc gga tgg tcc 20294
Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser
1180 1185 1190
ttc acg cgc ctc aag acc cgc gag acg ccc tcg ctc ggc tcc ggg 20339
Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly
1195 1200 1205
ttc gac ccc tac ttc gtc tac tcg ggc tcc atc ccc tac ctc gac 20384
Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp
1210 1215 1220
ggc acc ttc tac ctc aac cac acc ttc aag aag gtc tcc atc acc 20429
Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr
1225 1230 1235
ttc gac tcc tcc gtc agc tgg ccc ggc aac gac cgc ctc ctg acg 20474
Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr
1240 1245 1250
ccc aac gag ttc gaa atc aag cgc acc gtc gac gga gag ggg tac 20519
Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr
1255 1260 1265
aac gtg gcc cag tgc aac atg acc aag gac tgg ttc ctg gtc cag 20564
Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln
1270 1275 1280
atg ctg gcc cac tac aac atc ggc tac cag ggc ttc tac gtg ccc 20609
Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro
1285 1290 1295
gag ggc tac aag gac cgc atg tac tcc ttc ttc cgc aac ttc cag 20654
Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln
1300 1305 1310
ccc atg agc cgc cag gtc gtg gac gag gtc aac tac aag gac tac 20699
Pro Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr
1315 1320 1325
cag gcc gtc acc ctg gcc tac cag cac aac aac tcg ggc ttc gtc 20744
Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val
1330 1335 1340
ggc tac ctc gcg ccc acc atg cgc cag gga cag ccc tac ccc gcc 20789
Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala
1345 1350 1355
aac tac ccc tac ccg ctc atc ggc aag agc gcc gtc gcc agc gtc 20834
Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Ala Ser Val
1360 1365 1370
acc cag aaa aag ttc ctc tgc gac cgg gtc atg tgg cgc atc ccc 20879
Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro
1375 1380 1385
ttc tcc agc aac ttc atg tcc atg ggc gcg ctc acc gac ctc ggc 20924
Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly
1390 1395 1400
cag aac atg ctc tac gcc aac tcc gcc cac gcg cta gac atg aat 20969
Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn
1405 1410 1415
ttc gaa gtc gac ccc atg gat gag tcc acc ctt ctc tat gtt gtc 21014
Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val
1420 1425 1430
ttc gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac cgc ggc 21059
Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro His Arg Gly
1435 1440 1445
gtc atc gag gcc gtc tac ctg cgc acg ccc ttc tcg gcc ggc aac 21104
Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn
1450 1455 1460
gcc acc acc taa gccccgctct tgcttcttgc aagatgacgg cctgtgcggg 21156
Ala Thr Thr
ctccggcgag caggagctca gggccatcct ccgcgacctg ggctgcgggc cctg1cttcct 21216
gggcaccttc gacaagcgct tcccgggatt catggccccg cacaagctgg cctgcgccat 21276
cgtcaacacg gccggccgcg agaccggggg cgagcactgg ctggccttcg cctggaaccc 21336
gcgctcccac acctgctacc tcttcgaccc cttcgggttc tcggacgagc gcctcaagca 21396
gatctaccag ttcgagtacg agggcctgct gcgccgcagc gccctggcca ccgaggaccg 21456
ctgcgtcacc ctggaaaagt ccacccagac cgtgcagggt ccgcgctcgg ccgcctgcgg 21516
gctcttctgc tgcatgttcc tgcacgcctt cgtgcactgg cccgaccgcc ccatggacaa 21576
gaaccccacc atgaacttgc tgacgggggt gcccaacggc atgctccagt cgccccaggt 21636
ggaacccacc ctgcgccgca accaggaggc gctctaccgc ttcctcaacg cccactccgc 21696
ctactttcgc tcccaccgcg cgcgcatcga gaaggccacc gccttcgacc gcatgaatca 21756
agacatgtaa accgtgtgtg tatgtgaatg ctttattcat aataaacagc acatgtttat 21816
gccacctttt ctgaggctct gactttattt agaaatcgaa ggggttctgc cggctctcgg 21876
cgtgccccgc gggcagggat acgttgcgga actggtactt gggcagccac ttgaactcgg 21936
ggatcagcag cttcggcacg gggaggtcgg ggaacgagtc gctccacagc ttgcgcgtga 21996
gttgcagggc gcccagcagg tcgggcgcgg agatcttgaa atcgcagttg ggacccgcgt 22056
tctgcgcgcg ggagttgcgg tacacggggt tgcagcactg gaacaccatc agggccgggt 22116
gcttcacgct cgccagcacc gtcgcgtcgg tgatgccctc cacgtccaga tcctcggcgt 22176
tggccatccc gaagggggtc atcttgcagg tctgccgccc catgctgggc acgcagccgg 22236
gcttgtggtt gcaatcgcag tgcaggggga tcagcatcat ctgggcctgc tcggagctca 22296
tgcccgggta catggccttc atgaaagcct ccagctggcg gaaggcctgc tgcgccttgc 22356
cgccctcggt gaagaagacc ccgcaggact tgctagagaa ctggttggtg gcgcagccgg 22416
cgtcgtgcac gcagcagcgc gcgtcgttgt tggccagctg caccacgctg cgcccccagc 22476
ggttctgggt gatcttggcc cggtcggggt tctccttcag cgcgcgctgc ccgttctcgc 22536
tcgccacatc catctcgatc gtgtgctcct tctggatcat cacggtcccg tgcaggcatc 22596
gcagcttgcc ctcggcctcg gtgcacccgt gcagccacag cgcgcagccg gtgcactccc 22656
agttcttgtg ggcgatctgg gagtgcgagt gcacgaagcc ctgcaggaag cggcccatca 22716
tcgtggtcag ggtcttgttg ctggtgaagg tcagcgggat gccgcggtgc tcctcgttca 22776
catacaggtg gcagatgcgg cggtacacct cgccctgctc gggcatcagc tggaaggcgg 22836
acttcaggtc gctctccacg cggtaccggt ccatcagcag cgtcatgact tccatgccct 22896
tctcccaggc cgagacgatc ggcaggctca gggggttctt caccgccgtt gtcatcttag 22956
tcgccgccgc tgaggtcagg gggtcgttct cgtccagggt ctcaaacact cgcttgccgt 23016
ccttctcggt gatgcgcacg gggggaaagc tgaagcccac ggccgccagc tcctcctcgg 23076
cctgcctttc gtcctcgctg tcctggctga tgtcttgcaa aggcacatgc ttggtcttgc 23136
ggggtttctt tttgggcggc agaggcggcg gcggagacgt gctgggcgag cgcgagttct 23196
cgctcaccac gactatttct tcttcttggc cgtcgtccga gaccacgcgg cggtaggcat 23256
gcctcttctg gggcagaggc ggaggcgacg ggctctcgcg gttcggcggg cggctggcag 23316
agccccttcc gcgttcgggg gtgcgctcct ggcggcgctg ctctgactga cttcctccgc 23376
ggccggccat tgtgttctcc tagggagcaa caagcatgga gactcagcca tcgtcgccaa 23436
catcgccatc tgcccccgcc gccgccgacg agaaccagca gcagaatgaa agcttaaccg 23496
ccccgccgcc cagccccacc tccgacgccg ccgcggcccc agacatgcaa gagatggagg 23556
aatccatcga gattgacctg ggctacgtga cgcccgcgga gcacgaggag gagctggcag 23616
cgcgcttttc agccccggaa gagaaccacc aagagcagcc agagcaggaa gcagagagcg 23676
agcagcagca ggctgggctc gagcatggcg actacctgag cggggcagag gacgtgctca 23736
tcaagcatct ggcccgccaa tgcatcatcg tcaaggacgc gctgctcgac cgcgccgagg 23796
tgcccctcag cgtggcggag ctcagccgcg cctacgagcg caacctcttc tcgccgcgcg 23856
tgccccccaa gcgccagccc aacggcacct gcgagcccaa cccgcgcctc aacttctacc 23916
cggtcttcgc ggtgcccgag gccctggcca cctaccacct ctttttcaag aaccaaagga 23976
tccccgtctc ctgccgcgcc aaccgcaccc gcgccgacgc cctgctcaac ctgggtcccg 24036
gcgcccgcct acctgatatc gcctccttgg aagaggttcc caagatcttc gagggtctgg 24096
gcagcgacga gactcgggcc gcgaacgctc tgcaaggaag cggagaggag catgagcacc 24156
acagcgccct ggtggagttg gaaggcgaca acgcgcgcct ggcggtgctc aagcgcacgg 24216
tcgagctgac ccacttcgcc tacccggcgc tcaacctgcc ccccaaggtc atgagcgccg 24276
tcatggacca ggtgctcatc aagcgcgcct cgcccctctc ggatgaggac atgcaggacc 24336
ccgagagctc ggacgagggc aagcccgtgg tcagcgacga gcagctggcg cgctggctgg 24396
gagcgagtag caccccccag agcttggaag agcggcgcaa gctcatgatg gccgtggtcc 24456
tggtgaccgt ggagctggag tgtctgcgcc gcttcttcgc cgacgcagag accctgcgca 24516
aggtcgagga gaacctgcac tacctcttca ggcacgggtt tgtgcgccag gcctgcaaga 24576
tctccaacgt ggagctgacc aacctggtct cctacatggg catcctgcac gagaaccgcc 24636
tggggcagaa cgtgctgcac accaccctgc gcggggaggc ccgccgcgac tacatccgcg 24696
actgcgtcta cctgtacctc tgccacacct ggcagacggg catgggcgtg tggcagcagt 24756
gcctggagga gcagaacctg aaagagctct gcaagctcct gcagaagaac ctgaaggccc 24816
tgtggaccgg gttcgacgag cgcaccaccg cctcggacct ggccgacctc atcttccccg 24876
agcgcctgcg gctgacgctg cgcaacggac tgcccgactt tatgagtcaa agcatgttgc 24936
aaaactttcg ctctttcatc ctcgaacgct ccgggatcct gcccgccacc tgctccgcgc 24996
tgccctcgga cttcgtgccg ctgaccttcc gcgagtgccc cccgccgctc tggagccact 25056
gctacctgct gcgcctggcc aactacctgg cctaccactc ggacgtgatc gaggacgtca 25116
gcggcgaggg tctgctcgag tgccactgcc gctgcaacct ctgcacgccg caccgctccc 25176
tggcctgcaa cccccagctg ctgagcgaga cccagatcat cggcaccttc gagttgcaag 25236
gccccggcga gggcaagggg ggtctgaaac tcaccccggg gctgtggacc tcggcctact 25296
tgcgcaagtt cgtgcccgag gactaccatc ccttcgagat caggttctac gaggaccaat 25356
cccagccgcc caaggccgaa ctgtcggcct gcgtcatcac ccagggggcc atcctggccc 25416
aattgcaagc catccagaaa tcccgccaag aatttctgct gaaaaagggc cacggggtct 25476
acctggaccc ccagaccgga gaggagctca accccagctt cccccaggat gccccgagga 25536
agcagcaaga agctgaaagt ggagctgccg ccgccggagg atttggagga agactgggag 25596
agcagtcagg cagaggagga ggagatggaa gactgggaca gcactcaggc agaggaggac 25656
agcctgcaag acagtctgga agacgaggtg gaggaggagg cagaggaaga agcagccgcc 25716
gccagaccgt cgtcctcggc ggagaaagca agcagcacgg ataccatctc cgctccgggt 25776
cggggtcgcg gcgaccgggc ccacagtagg tgggacgaga ccgggcgctt cccgaacccc 25836
accacccaga ccggtaagaa ggagcggcag ggatacaagt cctggcgggg gcacaaaaac 25896
gccatcgtct cctgcttgca agcctgcggg ggcaacatct ccttcacccg ccgctacctg 25956
ctcttccacc gcggggtgaa cttcccccgc aacatcttgc attactaccg tcacctccac 26016
agcccctact actgtttcca agaagaggca gaaacccagc agcagcagaa aaccagcggc 26076
agcagcagct agaaaatcca cagcggcggc aggtggactg aggatcgcag cgaacgagcc 26136
ggcgcagacc cgggagctga ggaaccggat ctttcccacc ctctatgcca tcttccagca 26196
gagtcggggg caggagcagg aactgaaagt caagaaccgt tctctgcgct cgctcacccg 26256
cagttgtctg tatcacaaga gcgaagacca acttcagcgc actctcgagg acgccgaggc 26316
tctcttcaac aagtactgcg cgctcactct taaagagtag cccgcgcccg cccacacacg 26376
gaaaaaggcg ggaattacgt caccacctgc gcccttcgcc cgaccatcat catgagcaaa 26436
gagattccca cgccttacat gtggagctac cagccccaga tgggcctggc cgccggcgcc 26496
gcccaggact actccacccg catgaactgg ctcagcgccg ggcccgcgat gatctcacgg 26556
gtgaatgaca tccgcgcccg ccgaaaccag atactcctag aacagtcagc gatcaccgcc 26616
acgccccgcc atcaccttaa tccgcgtaat tggcccgccg ccctggtgta ccaggaaatt 26676
ccccagccca cgaccgtact acttccgcga gacgcccagg ccgaagtcca gctgactaac 26736
tcaggtgtcc agctggccgg cggcgccgcc ctgtgtcgtc accgccccgc tcagggtata 26796
aagcggctgg tgatccgagg cagaggcaca cagctcaacg acgaggtggt gagctcttcg 26856
ctgggtctgc gacctgacgg agtcttccaa ctcgccggat cggggagatc ttccttcacg 26916
cctcgtcagg ccgtcctgac tttggagagt tcgtcctcgc agccccgctc gggtggcatc 26976
ggcactctcc agttcgtgga ggagttcact ccctcggtct acttcaaccc cttctccggc 27036
tcccccggcc actacccgga cgagttcatc ccgaacttcg acgccatcag cgagtcggtg 27096
gacggctacg attgaatgtc ccatggtggc gcagctgacc tagctcggct tcgacacctg 27156
gaccactgcc gccgcttccg ctgcttcgct cgggatctcg ccgagtttgc ctactttgag 27216
ctgcccgagg agcaccctca gggcccggcc cacggagtgc ggatcatcgt cgaagggggc 27276
ctcgactccc acctgcttcg gatcttcagc cagcgaccga tcctggtcga gcgcgagcaa 27336
ggacagaccc ttctgaccct gtactgcatc tgcaaccacc ccggcctgca tgaaagtctt 27396
tgttgtctgc tgtgtactga gtataataaa agctgagatc agcgactact ccggactcga 27456
ttgtggtgtt cctgctatca accggtccct gttcttcacc gggaacgaga ccgagctcca 27516
gcttcagtgt aagccccaca agaagtacct cacctggctg ttccagggct ccccgatcgc 27576
cgttgtcaac cactgcgaca acgacggagt cctgctgagc ggccccgcca accttacttt 27636
ttccacccgc agaagcaagc tccagctctt ccaacccttc ctccccggga cctatcagtg 27696
cgtctcggga ccctgccatc acaccttcca cctgatcccg aataccacag cgccgctccc 27756
cgctactaac aaccaaacta cccaccatcg ccaccgtcgc gacctttctg aatctaacac 27816
taccacccac accggaggtg agctccgagg tcgaccaacc tctgggattt actacggccc 27876
ctgggaggtg gtggggttaa tagcgctagg cctagttgtg ggtgggcttt tggctctctg 27936
ctacctatac ctcccttgct gttcgtactt agtggtgctg tgttgctggt ttaagaaatg 27996
gggaagatca ccctagtgag ctgcggtgcg ctggtggcgg tggtggtgtt ttcgattgtg 28056
ggactgggcg gcgcggctgt agtgaaggag aaggccgatc cctgcttgca tttcaatccc 28116
gacaattgcc agctgagttt tcagcccgat ggcaatcggt gcgcggtgct gatcaagtgc 28176
ggatgggaat gcgagaacgt gagaatcgag tacaataaca agactcggaa caatactctc 28236
gcgtccgtgt ggcagcccgg ggaccccgag tggtacaccg tctctgtccc cggtgctgac 28296
ggctccccgc gcaccgtgaa caatactttc atttttgcgc acatgtgcga cacggtcatg 28356
tggatgagca agcagtacga tatgtggccc cccacgaagg agaacatcgt ggtcttctcc 28416
atcgcttaca gcgcgtgcac ggcgctaatc accgctatcg tgtgcctgag cattcacatg 28476
ctcatcgcta ttcgccccag aaataatgcc gaaaaagaga aacagccata acacgttttt 28536
tcacacacct ttttcagacc atggcctctg ttaaattttt gcttttattt gccagtctca 28596
ttactgttat aagtaatgag aaactcacta tttacattgg cactaaccac actttagacg 28656
gaattccaaa atcctcatgg tattgctatt ttgatcaaga tccagactta actatagaac 28716
tgtgtggtaa caagggaaaa aatacaagca ttcatttaat taactttaat tgcggagaca 28776
atttgaaatt aattaatatc actaaagagt atggaggtat gtattactat gttgcagaaa 28836
ataacaacat gcagttttat gaagttactg taactaatcc caccacacct agaacaacaa 28896
caaccaccac cacaaaaact acacctgtta ccactatgca gctcactacc aataacattt 28956
ttgccatgcg tcaaatggtc aacaatagca ctcaacccac cccacccagt gaggaaattc 29016
ccaaatccat gattggcatt attgttgctg tagtggtgtg catgttgatc atcgccttgt 29076
gcatggtgta ctatgccttc tgctacagaa agcacagact gaacgacaag ctggaacact 29136
tactaagtgt tgaattttaa ttttttagaa ccatgaagat cctaggcctt ttaatttttt 29196
ctatcattac ctctgctcta tgcaattctg acaatgagga cgttactgtc gttgtcggaa 29256
ccaattatac actgaaaggt ccagcgaagg gtatgctttc gtggtattgc tggtttggaa 29316
ctgacgagca acagacagag ctctgcaatg ctcaaaaagg caaaacctca aattctaaaa 29376
tctctaatta tcaatgcaat ggcactgact tagtactgct caatgtcacg aaagcatatg 29436
ctggcagcta cacctgccct ggagatgata ctgagaacat gattttttac aaagtggaag 29496
tggttgatcc cactactcca cctccaccca ccacaactac tcacaccaca cacacagaac 29556
aaaccacagc agaggaggca gcaaagttag ccttgcaggt ccaagacagt tcatttgttg 29616
gcattacccc tacacctgat cagcggtgtc cggggctgct cgtcagcggc attgtcggtg 29676
tgctttcggg attagcagtc ataatcatct gcatgttcat ttttgcttgc tgctatagaa 29736
ggctttaccg acaaaaatca gacccactgc tgaacctcta tgtttaattt tttccagagc 29796
catgaaggca gttagcactc tagttttttg ttctttgatt ggcactgttt ttagtgttag 29856
ctttttgaaa caaatcaatg ttactgaggg ggaaaatgtg acactggtag gcgtagaggg 29916
tgctcaaaat accacctgga caaaattcca tctagatggg tggaaagaaa tttgcacctg 29976
gaatgtcagt acttatacat gtgaaggagt taatcttacc attgtcaatg tcagccaaat 30036
tcaaaagggt tggattaaag ggcaatctgt tagtgttagc aatagtgggt actataccca 30096
gcatactctt atctatgaca ttatagttat accactgcct acacctagcc cacctagcac 30156
taccacacag acaacccaca ctacacaaac aaccacatac agtacatcaa atcagcctac 30216
caccactaca acagcagagg ttgccagctc gtctggggtc cgagtggcat ttttgatgtt 30276
ggccccatct agcagtccca ctgctagtac caatgagcag actactgaat ttttgtccac 30336
tgtcgagagc cacaccacag ctacctcgag tgccttctct agcaccgcca atctatcctc 30396
gctttcctct acaccaatca gtcccgctac tactcctacc cccgctattc tccccactcc 30456
cctgaagcaa acagacggcg acatgcaatg gcagatcacc ctgctcattg tgatcgggtt 30516
ggtcatcctg gccgtgttgc tctactacat cttctgccgc cgcattccca acgcgcaccg 30576
caagccggcc tacaagccca tcgttgtcgg gcagccggag ccgcttcagg tggaaggggg 30636
tctaaggaat cttctcttct cttttacagt atggtgattg aattatgatt cctagacaaa 30696
tcttgatcac tattcttatc tgcctcctcc aagtctgtgc caccctcgct ctggtggcca 30756
acgccagtcc agactgtatt gggcccttcg cctcctacgt gctctttgcc ttcatcacct 30816
gcatctgctg ctgtagcata gtctgcctgc ttatcacctt cttccagttc attgactgga 30876
tctttgtgcg catcgcctac ctgcgccacc acccccagta ccgcgaccag cgagtggcgc 30936
ggctgctcag gatcctctga taagcatgcg ggctctgcta cttctcgcgc ttctgctgtt 30996
agtgctcccc cgtcccgtcg acccccggac ccccacccag tcccccgagg aggtccgcaa 31056
atgcaaattc caagaaccct ggaaattcct caaatgctac cgccaaaaat cagacatgca 31116
tcccagctgg atcatgatca ttgggatcgt gaacattctg gcctgcaccc tcatctcctt 31176
tgtgatttac ccctgctttg actttggttg gaactcgcca gaggcgctct atctcccgcc 31236
tgaacctgac acaccaccac agcaacctca ggcacacgca ctaccaccac caccacagcc 31296
taggccacaa tacatgccca tattagacta tgaggccgag ccacagcgac ccatgctccc 31356
cgctattagt tacttcaatc taaccggcgg agatgactga cccactggcc aacaacaacg 31416
tcaacgacct tctcctggac atggacggcc gcgcctcgga gcagcgactc gcccaacttc 31476
gcattcgcca gcagcaggag agagccgtca aggagctgca ggacggcata gccatccacc 31536
agtgcaagaa aggcatcttc tgcctggtga aacaggccaa gatctcctac gaggtcaccc 31596
agaccgacca tcgcctctcc tacgagctcc tgcagcagcg ccagaagttc acctgcctgg 31656
tcggagtcaa ccccatcgtc atcacccagc agtcgggcga taccaagggg tgcatccact 31716
gctcctgcga ctcccccgac tgcgtccaca ctctgatcaa gaccctctgc ggcctccgcg 31776
acctcctccc catgaactaa tcaccccctt atccagtgaa ataaagatca tattgatgat 31836
ttgagtttaa taaaaataaa gaatcactta cttgaaatct gataccaggt ctctgtccat 31896
gttttctgcc aacaccactt cactcccctc ttcccagctc tggtactgca ggccccggcg 31956
ggctgcaaac ttcctccaca ccctgaaggg gatgtcaaat tcctcctgtc cctcaatctt 32016
cattttatct tctatcag atg tcc aaa aag cgc gtc cgg gtg gat gat gac 32067
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp
1465 1470
ttc gac ccc gtc tac ccc tac gat gca gac aac gca ccg acc gtg 32112
Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val
1475 1480 1485
ccc ttc atc aac ccc ccc ttc gtc tct tca gat gga ttc caa gag 32157
Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gln Glu
1490 1495 1500
aag ccc ctg ggg gtg ctg tcc ctg cgt ctg gcc gat ccc gtc acc 32202
Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr
1505 1510 1515
acc aag aac ggg gaa atc acc ctc aag ctg gga gat ggg gtg gac 32247
Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly Asp Gly Val Asp
1520 1525 1530
ctc gac tcc tcg gga aaa ctc atc tcc aac acg gcc acc aag gcc 32292
Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr Ala Thr Lys Ala
1535 1540 1545
gcc gcc cct ctc agt ttt tcc aac aac acc att tcc ctt aac atg 32337
Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr Ile Ser Leu Asn Met
1550 1555 1560
gat acc cct ttt tac aac aac aat gga aag tta ggc atg aaa gtc 32382
Asp Thr Pro Phe Tyr Asn Asn Asn Gly Lys Leu Gly Met Lys Val
1565 1570 1575
act gct cca ctg aag ata cta gac aca gac ttg cta aaa aca ctt 32427
Thr Ala Pro Leu Lys Ile Leu Asp Thr Asp Leu Leu Lys Thr Leu
1580 1585 1590
gtt gta gct tat gga caa ggt tta gga aca aac acc act ggt gcc 32472
Val Val Ala Tyr Gly Gln Gly Leu Gly Thr Asn Thr Thr Gly Ala
1595 1600 1605
ctt gtt gcc caa cta gca tcc cca ctt gct ttt gat agc aat agc 32517
Leu Val Ala Gln Leu Ala Ser Pro Leu Ala Phe Asp Ser Asn Ser
1610 1615 1620
aaa att gcc ctt aat tta ggc aat gga cca ttg aaa gtg gat gca 32562
Lys Ile Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala
1625 1630 1635
aat aga ctg aac atc aat tgc aat aga gga ctc tat gtt act acc 32607
Asn Arg Leu Asn Ile Asn Cys Asn Arg Gly Leu Tyr Val Thr Thr
1640 1645 1650
aca aaa gat gca ctg gaa gcc aat ata agt tgg gct aat gct atg 32652
Thr Lys Asp Ala Leu Glu Ala Asn Ile Ser Trp Ala Asn Ala Met
1655 1660 1665
aca ttt ata gga aat gcc atg ggt gtc aat att gat aca caa aaa 32697
Thr Phe Ile Gly Asn Ala Met Gly Val Asn Ile Asp Thr Gln Lys
1670 1675 1680
ggc ttg caa ttt ggc acc act agt acc gtc gca gat gtt aaa aac 32742
Gly Leu Gln Phe Gly Thr Thr Ser Thr Val Ala Asp Val Lys Asn
1685 1690 1695
gct tac ccc ata caa atc aaa ctt gga gct ggt ctc aca ttt gac 32787
Ala Tyr Pro Ile Gln Ile Lys Leu Gly Ala Gly Leu Thr Phe Asp
1700 1705 1710
agc aca ggt gca att gtt gca tgg aac aaa gat gat gac aag ctt 32832
Ser Thr Gly Ala Ile Val Ala Trp Asn Lys Asp Asp Asp Lys Leu
1715 1720 1725
aca cta tgg acc aca gcc gac ccc tct cca aat tgt cac ata tat 32877
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys His Ile Tyr
1730 1735 1740
tct gaa aag gat gct aag ctt aca ctt tgc ttg aca aag tgt ggc 32922
Ser Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly
1745 1750 1755
agt cag att ctg ggc act gtt tcc ctc ata gct gtt gat act ggc 32967
Ser Gln Ile Leu Gly Thr Val Ser Leu Ile Ala Val Asp Thr Gly
1760 1765 1770
agt tta aat ccc ata aca gga aca gta acc act gct ctt gtc tca 33012
Ser Leu Asn Pro Ile Thr Gly Thr Val Thr Thr Ala Leu Val Ser
1775 1780 1785
ctt aaa ttc gat gca aat gga gtt ttg caa agc agc tca aca cta 33057
Leu Lys Phe Asp Ala Asn Gly Val Leu Gln Ser Ser Ser Thr Leu
1790 1795 1800
gac tca gac tat tgg aat ttc aga cag gga gat gtt aca cct gct 33102
Asp Ser Asp Tyr Trp Asn Phe Arg Gln Gly Asp Val Thr Pro Ala
1805 1810 1815
gaa gcc tat act aat gct ata ggt ttc atg ccc aat cta aaa gca 33147
Glu Ala Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Leu Lys Ala
1820 1825 1830
tac cct aaa aac aca agt gga gct gca aaa agt cac att gtt ggg 33192
Tyr Pro Lys Asn Thr Ser Gly Ala Ala Lys Ser His Ile Val Gly
1835 1840 1845
aaa gtg tac cta cat ggg gat aca ggc aaa cca ctg gac ctc att 33237
Lys Val Tyr Leu His Gly Asp Thr Gly Lys Pro Leu Asp Leu Ile
1850 1855 1860
att act ttc aat gaa aca agt gat gaa tct tgc act tac tgt att 33282
Ile Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys Ile
1865 1870 1875
aac ttt caa tgg cag tgg ggg gct gat caa tat aaa aat gaa aca 33327
Asn Phe Gln Trp Gln Trp Gly Ala Asp Gln Tyr Lys Asn Glu Thr
1880 1885 1890
ctt gcc gtc agt tca ttc acc ttt tcc tat att gct aaa gaa taa 33372
Leu Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu
1895 1900 1905
accccactct gtaccccatc tctgtctatg gaaaaaactc tgaaacacaa aataaaataa 33432
agttcaagtg ttttattgat tcaacagttt tacaggattc gagcagttat ttttcctcca 33492
ccctcccagg acatggaata caccaccctc tccccccgca cagccttgaa catctgaatg 33552
ccattggtga tggacatgct tttggtctcc acgttccaca cagtttcaga gcgagccagt 33612
ctcgggtcgg tcagggagat gaaaccctcc gggcactccc gcatctgcac ctcacagctc 33672
aacagctgag gattgtcctc ggtggtcggg atcacggtta tctggaagaa gcagaagagc 33732
ggcggtggga atcatagtcc gcgaacggga tcggccggtg gtgtcgcatc aggccccgca 33792
gcagtcgctg tcgccgccgc tccgtcaagc tgctgctcag ggggtccggg tccagggact 33852
ccctcagcat gatgcccacg gccctcagca tcagtcgtct ggtgcggcgg gcgcagcagc 33912
gcatgcggat ctcgctcagg tcgctgcagt acgtgcaaca caggaccacc aggttgttca 33972
acagtccata gttcaacacg ctccagccga aactcatcgc gggaaggatg ctacccacgt 34032
ggccgtcgta ccagatcctc aggtaaatca agtggcgccc cctccagaac acgctgccca 34092
tgtacatgat ctccttgggc atgtggcggt tcaccacctc ccggtaccac atcaccctct 34152
ggttgaacat gcagccccgg atgatcctgc ggaaccacag ggccagcacc gccccgcccg 34212
ccatgcagcg aagagacccc gggtcccgac aatggcaatg gaggacccac cgctcgtacc 34272
cgtggatcat ctgggagctg aacaagtcta tgttggcaca gcacaggcat atgctcatgc 34332
atctcttcag cactctcagc tcctcggggg tcaaaaccat atcccagggc acggggaact 34392
cttgcaggac agcgaacccc gcagaacagg gcaatcctcg cacataactt acattgtgca 34452
tggacagggt atcgcaatca ggcagcaccg ggtgatcctc caccagagaa gcgcgggtct 34512
cggtctcctc acagcgtggt aagggggccg gccgatacgg gtgatggcgg gacgcggctg 34572
atcgtgttcg cgaccgtgtt atgatgcagt tgctttcgga cattttcgta cttgctgtag 34632
cagaacctgg tccgggcgct gcacaccgat cgccggcggc ggtcccggcg cttggaacgc 34692
tcggtgttga agttgtaaaa cagccactct ctcagaccgt gcagcagatc tagggcctca 34752
ggagtgatga agatcccatc atgcctgatg gctctaatca catcgaccac cgtggaatgg 34812
gccagaccca gccagatgat gcaattttgt tgggtttcgg tgacggcggg ggagggaaga 34872
acaggaagaa ccatgattaa cttttaatcc aaacggtctc ggagcacttc aaaatgaaga 34932
tcgcggagat ggcacctctc gcccccgctg tgttggtgga aaataacagc caggtcaaag 34992
gtgatacggt tctcgagatg ttccacggtg gcttccagca aagcctccac gcgcacatcc 35052
agaaacaaga caatagcgaa agcgggaggg ttctctaatt cctcaatcat catgttacac 35112
tcctgcacca tccccagata attttcattt ttccagcctt gaatgattcg aactagttcc 35172
tgaggtaaat ccaagccagc catgataaag agctcgcgca gagcgccctc caccggcatt 35232
cttaagcaca ccctcataat tccaagatat tctgctcctg gttcacctgc agcagattga 35292
caagcggaat atcaaaatct ctgccgcgat ccctaagctc ctccctcagc aataactgta 35352
agtactcttt catatcctct ccgaaatttt tagccatagg accaccagga ataagattag 35412
ggcaagccac agtacagata aaccgaagtc ctccccagtg agcattgcca aatgcaagac 35472
tgctataagc atgctggcta gacccggtga tatcttccag ataactggac agaaaatcgc 35532
ccaggcaatt tttaagaaaa tcaacaaaag aaaaatcctc caggtgcacg tttagagcct 35592
cgggaacaac gatggagtaa atgcaagcgg tgcgttccag catggttagt tagctgatct 35652
gtagaaaaaa acaaaaatga acattaaacc atgctagcct ggcgaacagg tgggtaaatc 35712
gttctctcca gcaccaggca ggccacgggg tctccggcac gaccctcgta aaaattgtcg 35772
ctatgattga aaaccatcac agagagacgt tcccggtggc cggcgtgaat gattcgacaa 35832
gatgaataca cccccggaac attggcgtcc gcgagtgaaa aaaagcgccc aaggaagcaa 35892
taaggcacta caatgctcag tctcaagtcc agcaaagcga tgccatgcgg atgaagcaca 35952
aaattctcag gtgcgtacaa aatgtaatta ctcccctcct gcacaggcag caaagccccc 36012
gatccctcca ggtacacata caaagcctca gcgtccatag cttaccgagc agcagcacac 36072
aacaggcgca agagtcagag aaaggctgag ctctaacctg tccacccgct ctctgctcaa 36132
tatatagccc agatctacac tgacgtaaag gccaaagtct aaaaataccc gccaaataat 36192
cacacacgcc cagcacacgc ccagaaaccg gtgacacact caaaaaaata cgcgcacttc 36252
ctcaaacgcc caaactgccg tcatttccgg gttcccacgc tacgtcatca aaattcgact 36312
ttcaaattcc gtcgaccgtt aaaaacgtcg cccgccccgc ccctaacggt cgccgctccc 36372
gcagccaatc accgccccgc atccccaaat tcaaatacct catttgcata ttaacgcgca 36432
ccaaaagttt gaggtatatt attgatgatg 36462
<210>2
<211>530
<212>PRT
<213>黑猩猩腺病毒血清型Pan5
<400>2
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Val Ala Ala Ala Met Gln Pro Pro Leu Glu Ala
20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser
35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60
Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr
85 90 95
Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110
Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val
115 120 125
Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser
130 135 140
Arg Lys Thr Pro Asn Gly Val Thr Val Thr Asp Gly Ser Gln Asp Glu
145 150 155 160
Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser
165 170 175
Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr
180 185 190
Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly Val
195 200 205
Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Glu
210 215 220
Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp Ile
225 230 235 240
Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu Ser
245 250 255
Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu Gly Phe Gln
260 265 270
Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp
275 280 285
Val Asp Ala Tyr Glu Lys Ser Lys Glu Asp Ser Ala Ala Ala Ala Thr
290 295 300
Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe Ala
305 310 315 320
Ser Ala Ala Thr Leu Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys
325 330 335
Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Glu Arg Ser Tyr Asn
340 345 350
Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala
355 360 365
Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu
370 375 380
Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu
385 390 395 400
Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val
405 410 415
Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys
420 425 430
Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe
435 440 445
Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu
450 455 460
Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro
465 470 475 480
Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly
485 490 495
Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr
500 505 510
Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser Arg
515 520 525
Thr Phe
530
<210>3
<211>933
<212>PRT
<213>黑猩猩腺病毒血清型Pan5
<400>3
Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala
1 5 l0 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Asp Gly Asp Thr Gly
130 135 140
Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gln Gly Ile Ser
145 150 155 160
Ile Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Thr Asp Asp Gln Pro
165 170 175
Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Ala
180 185 190
Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala
195 200 205
Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys
210 215 220
Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys Thr Glu Thr Gly
225 230 235 240
Gly Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser
245 250 255
Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn
260 265 270
Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr
275 280 285
Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln Ser Met Pro Asn
290 295 300
Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr
305 310 315 320
Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln
325 330 335
Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr
340 345 350
Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met
355 360 365
Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu
370 375 380
Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp
385 390 395 400
Ala Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys Ala Asn Gly Ala
405 410 415
Asp Gln Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu
420 425 430
Leu Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn
435 440 445
Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp
450 455 460
Ser Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn
465 470 475 480
Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp
485 490 495
Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn
500 505 510
Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser
515 520 525
Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro
530 535 540
Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr
545 550 555 560
Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser
565 570 575
Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr
580 585 590
Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala
595 600 605
Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe
610 615 620
Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn
625 630 635 640
Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe
645 650 655
Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu
660 665 670
Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr
675 680 685
Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile
690 695 700
Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr
705 710 715 720
Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn
725 730 735
Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu
740 745 750
Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr
755 760 765
Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg
770 775 780
Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu
785 790 795 800
Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr
805 810 815
Met Arg Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile
820 825 830
Gly Lys Ser Ala Val Ala Ser Val Thr Gln Lys Lys Phe Leu Cys Asp
835 840 845
Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly
850 855 860
Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His
865 870 875 880
Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu
885 890 895
Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro
900 905 910
His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala
915 920 925
Gly Asn Ala Thr Thr
930
<210>4
<211>445
<212>PRT
<213>黑猩猩腺病毒血清型Pan5
<400>4
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60
Lys Leu Gly Asp Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser
65 70 75 80
Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr
85 90 95
Ile Ser Leu Asn Met Asp Thr Pro Phe Tyr Asn Asn Asn Gly Lys Leu
100 105 110
Gly Met Lys Val Thr Ala Pro Leu Lys Ile Leu Asp Thr Asp Leu Leu
115 120 125
Lys Thr Leu Val Val Ala Tyr Gly Gln Gly Leu Gly Thr Asn Thr Thr
130 135 140
Gly Ala Leu Val Ala Gln Leu Ala Ser Pro Leu Ala Phe Asp Ser Asn
145 150 155 160
Ser Lys Ile Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala
165 170 175
Asn Arg Leu Asn Ile Asn Cys Asn Arg Gly Leu Tyr Val Thr Thr Thr
180 185 190
Lys Asp Ala Leu Glu Ala Asn Ile Ser Trp Ala Asn Ala Met Thr Phe
195 200 205
Ile Gly Asn Ala Met Gly Val Asn Ile Asp Thr Gln Lys Gly Leu Gln
210 215 220
Phe Gly Thr Thr Ser Thr Val Ala Asp Val Lys Asn Ala Tyr Pro Ile
225 230 235 240
Gln Ile Lys Leu Gly Ala Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile
245 250 255
Val Ala Trp Asn Lys Asp Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala
260 265 270
Asp Pro Ser Pro Asn Cys His Ile Tyr Ser Glu Lys Asp Ala Lys Leu
275 280 285
Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly Thr Val Ser
290 295 300
Leu Ile Ala Val Asp Thr Gly Ser Leu Asn Pro Ile Thr Gly Thr Val
305 310 315 320
Thr Thr Ala Leu Val Ser Leu Lys Phe Asp Ala Asn Gly Val Leu Gln
325 330 335
Ser Ser Ser Thr Leu Asp Ser Asp Tyr Trp Asn Phe Arg Gln Gly Asp
340 345 350
Val Thr Pro Ala Glu Ala Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn
355 360 365
Leu Lys Ala Tyr Pro Lys Asn Thr Ser Gly Ala Ala Lys Ser His Ile
370 375 380
Val Gly Lys Val Tyr Leu His Gly Asp Thr Gly Lys Pro Leu Asp Leu
385 390 395 400
Ile Ile Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys Ile
405 410 415
Asn Phe Gln Trp Gln Trp Gly Ala Asp Gln Tyr Lys Asn Glu Thr Leu
420 425 430
Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu
435 440 445
<210>5
<211>36604
<212>DNA
<213>黑猩猩腺病毒血清型Pan6
<220>
<221>CDS
<222>(13878)..(15467)
<223>L2五邻体
<220>
<221>CDS
<222>(18284)..(21112)
<223>L3六邻体
<220>
<221>CDS
<222>(32162)..(33493)
<223>L5纤维
<400>5
catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60
atttggggag ggaggaaggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120
tgacgttttg atgacgtggc tatgaggcgg agccggtttg caagttctcg tgggaaaagt 180
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300
aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360
ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420
ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480
tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540
cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600
gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660
gtgacgaccc tccagagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720
atctggaggt ggatgtgccc gagagcgacc ctaacgagga ggcggtgaat gatttgttta 780
gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840
cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900
aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960
aggaggcgat tcgagctgcg gtgaaccagg gagtgaaaac tgcgggcgag agctttagcc 1020
tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080
ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140
acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200
atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260
tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320
agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380
ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440
ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500
atccgtgttg actttaagtg cgtgttttat gactcagggg tggggactgt gggtatataa 1560
gcaggtgcag acctgtgtgg tcagttcaga gcaggactca tggagatctg gactgtcttg 1620
gaagactttc accagactag acagttgcta gagaactcat cggagggagt ctcttacctg 1680
tggagattct gcttcggtgg gcctctagct aagctagtct atagggccaa acaggattat 1740
aaggaacaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaacttg 1800
ggccatcagt ctcactttaa ccagagtatt ctgagagccc ttgacttttc tactcctggc 1860
agaactaccg ccgcggtagc cttttttgcc tttattcttg acaaatggag tcaagaaacc 1920
catttcagca gggattaccg tctggactgc ttagcagtag ctttgtggag aacatggagg 1980
tgccagcgcc tgaatgcaat ctccggctac ttgccagtac agccggtaga cacgctgagg 2040
atcctgagtc tccagtcacc ccaggaacac caacgccgcc agcagccgca gcaggagcag 2100
cagcaagagg aggaccgaga agagaacccg agagccggtc tggaccctcc ggtggcggag 2160
gaggaggagt agctgacttg tttcccgagc tgcgccgggt gctgactagg tcttccagtg 2220
gacgggagag ggggattaag cgggagaggc atgaggagac tagccacaga actgaactga 2280
ctgtcagtct gatgagccgc aggcgcccag aatcggtgtg gtggcatgag gtgcagtcgc 2340
aggggataga tgaggtctcg gtgatgcatg agaaatattc cctagaacaa gtcaagactt 2400
gttggttgga gcccgaggat gattgggagg tagccatcag gaattatgcc aagctggctc 2460
tgaagccaga caagaagtac aagattacca aactgattaa tatcagaaat tcctgctaca 2520
tttcagggaa tggggccgag gtggagatca gtacccagga gagggtggcc ttcagatgtt 2580
gtatgatgaa tatgtacccg ggggtggtgg gcatggaggg agtcaccttt atgaacacga 2640
ggttcagggg tgatgggtat aatggggtgg tctttatggc caacaccaag ctgacagtgc 2700
acggatgctc cttctttggc ttcaataaca tgtgcatcga ggcctggggc agtgtttcag 2760
tgaggggatg cagcttttca gccaactgga tgggggtcgt gggcagaacc aagagcaagg 2820
tgtcagtgaa gaaatgcctg ttcgagaggt gccacctggg ggtgatgagc gagggcgaag 2880
ccaaagtcaa acactgcgcc tctaccgaga cgggctgctt tgtgctgatc aagggcaatg 2940
cccaagtcaa gcataacatg atctgtgggg cctcggatga gcgcggctac cagatgctga 3000
cctgcgccgg tgggaacagc catatgctgg ccaccgtgca tgtggcctcg cacccccgca 3060
agacatggcc cgagttcgag cacaacgtca tgacccgctg caatgtgcac ctgggctccc 3120
gccgaggcat gttcatgccc taccagtgca acatgcaatt tgtgaaggtg ctgctggagc 3180
ccgatgccat gtccagagtg agcctgacgg gggtgtttga catgaatgtg gagctgtgga 3240
aaattctgag atatgatgaa tccaagacca ggtgccgggc ctgcgaatgc ggaggcaagc 3300
acgccaggct tcagcccgtg tgtgtggagg tgacggagga cctgcgaccc gatcatttgg 3360
tgttgtcctg caacgggacg gagttcggct ccagcgggga agaatctgac tagagtgagt 3420
agtgtttggg gctgggtgtg agcctgcatg aggggcagaa tgactaaaat ctgtggtttt 3480
ctgtgtgttg cagcagcatg agcggaagcg cctcctttga gggaggggta ttcagccctt 3540
atctgacggg gcgtctcccc tcctgggcgg gagtgcgtca gaatgtgatg ggatccacgg 3600
tggacggccg gcccgtgcag cccgcgaact cttcaaccct gacctacgcg accctgagct 3660
cctcgtccgt ggacgcagct gccgccgcag ctgctgcttc cgccgccagc gccgtgcgcg 3720
gaatggccct gggcgccggc tactacagct ctctggtggc caactcgagt tccaccaata 3780
atcccgccag cctgaacgag gagaagctgc tgctgctgat ggcccagctc gaggccctga 3840
cccagcgcct gggcgagctg acccagcagg tggctcagct gcaggcggag acgcgggccg 3900
cggttgccac ggtgaaaacc aaataaaaaa tgaatcaata aataaacgga gacggttgtt 3960
gattttaaca cagagtcttg aatctttatt tgatttttcg cgcgcggtag gccctggacc 4020
accggtctcg atcattgagc acccggtgga tcttttccag gacccggtag aggtgggctt 4080
ggatgttgag gtacatgggc atgagcccgt cccgggggtg gaggtagctc cattgcaggg 4140
cctcgtgctc ggggatggtg ttgtaaatca cccagtcata gcaggggcgc agggcgtggt 4200
gctgcacgat gtccttgagg aggagactga tggccacggg cagccccttg gtgtaggtgt 4260
tgacgaacct gttgagctgg gagggatgca tgcgggggga gatgagatgc atcttggcct 4320
ggatcttgag attggcgatg ttcccgccca gatcccgccg ggggttcatg ttgtgcagga 4380
ccaccagcac ggtgtatccg gtgcacttgg ggaatttgtc atgcaacttg gaagggaagg 4440
cgtgaaagaa tttggagacg cccttgtgac cgcccaggtt ttccatgcac tcatccatga 4500
tgatggcgat gggcccgtgg gcggcggcct gggcaaagac gtttcggggg tcggacacat 4560
cgtagttgtg gtcctgggtg agctcgtcat aggccatttt aatgaatttg gggcggaggg 4620
tgcccgactg ggggacgaag gtgccctcga tcccgggggc gtagttgccc tcgcagatct 4680
gcatctccca ggccttgagc tcggaggggg ggatcatgtc cacctgcggg gcgatgaaaa 4740
aaacggtttc cggggcgggg gagatgagct gggccgaaag caggttccgg agcagctggg 4800
acttgccgca accggtgggg ccgtagatga ccccgatgac cggctgcagg tggtagttga 4860
gggagagaca gctgccgtcc tcgcggagga ggggggccac ctcgttcatc atctcgcgca 4920
catgcatgtt ctcgcgcacg agttccgcca ggaggcgctc gccccccagc gagaggagct 4980
cttgcagcga ggcgaagttt ttcagcggct tgagtccgtc ggccatgggc attttggaga 5040
gggtctgttg caagagttcc agacggtccc agagctcggt gatgtgctct agggcatctc 5100
gatccagcag acctcctcgt ttcgcgggtt ggggcgactg cgggagtagg gcaccaggcg 5160
atgggcgtcc agcgaggcca gggtccggtc cttccagggc cgcagggtcc gcgtcagcgt 5220
ggtctccgtc acggtgaagg ggtgcgcgcc gggctgggcg cttgcgaggg tgcgcttcag 5280
gctcatccgg ctggtcgaga accgctcccg gtcggcgccc tgcgcgtcgg ccaggtagca 5340
attgagcatg agttcgtagt tgagcgcctc ggccgcgtgg cccttggcgc ggagcttacc 5400
tttggaagtg tgtccgcaga cgggacagag gagggacttg agggcgtaga gcttgggggc 5460
gaggaagacg gactcggggg cgtaggcgtc cgcgccgcag ctggcgcaga cggtctcgca 5520
ctccacgagc caggtgaggt cggggcggtt ggggtcaaaa acgaggtttc ctccgtgctt 5580
tttgatgcgt ttcttacctc tggtctccat gagctcgtgt ccccgctggg tgacaaagag 5640
gctgtccgtg tccccgtaga ccgactttat gggccggtcc tcgagcgggg tgccgcggtc 5700
ctcgtcgtag aggaaccccg cccactccga gacgaaggcc cgggtccagg ccagcacgaa 5760
ggaggccacg tgggaggggt agcggtcgtt gtccaccagc gggtccacct tctccagggt 5820
atgcaagcac atgtccccct cgtccacatc caggaaggtg attggcttgt aagtgtaggc 5880
cacgtgaccg ggggtcccgg ccgggggggt ataaaagggg gcgggcccct gctcgtcctc 5940
actgtcttcc ggatcgctgt ccaggagcgc cagctgttgg ggtaggtatt ccctctcgaa 6000
ggcgggcatg acctcggcac tcaggttgtc agtttctaga aacgaggagg atttgatatt 6060
gacggtgccg ttggagacgc ctttcatgag cccctcgtcc atttggtcag aaaagacgat 6120
ctttttgttg tcgagcttgg tggcgaagga gccgtagagg gcgttggaga gcagcttggc 6180
gatggagcgc atggtctggt tcttttcctt gtcggcgcgc tccttggcgg cgatgttgag 6240
ctgcacgtac tcgcgcgcca cgcacttcca ttcggggaag acggtggtga gctcgtcggg 6300
cacgattctg acccgccagc cgcggttgtg cagggtgatg aggtccacgc tggtggccac 6360
ctcgccgcgc aggggctcgt tggtccagca gaggcgcccg cccttgcgcg agcagaaggg 6420
gggcagcggg tccagcatga gctcgtcggg ggggtcggcg tccacggtga agatgccggg 6480
caggagctcg gggtcgaagt agctgatgca ggtgcccaga ttgtccagcg ccgcttgcca 6540
gtcgcgcacg gccagcgcgc gctcgtaggg gctgaggggc gtgccccagg gcatggggtg 6600
cgtgagcgcg gaggcgtaca tgccgcagat gtcgtagacg tagaggggct cctcgaggac 6660
gccgatgtag gtggggtagc agcgcccccc gcggatgctg gcgcgcacgt agtcgtacag 6720
ctcgtgcgag ggcgcgagga gccccgtgcc gaggttggag cgttgcggct tttcggcgcg 6780
gtagacgatc tggcggaaga tggcgtggga gttggaggag atggtgggcc tttggaagat 6840
gttgaagtgg gcgtggggca ggccgaccga gtccctgatg aagtgggcgt aggagtcctg 6900
cagcttggcg acgagctcgg cggtgacgag gacgtccagg gcgcagtagt cgagggtctc 6960
ttggatgatg tcatacttga gctggccctt ctgcttccac agctcgcggt tgagaaggaa 7020
ctcttcgcgg tccttccagt actcttcgag ggggaacccg tcctgatcgg cacggtaaga 7080
gcccaccatg tagaactggt tgacggcctt gtaggcgcag cagcccttct ccacggggag 7140
ggcgtaagct tgcgcggcct tgcgcaggga ggtgtgggtg agggcgaagg tgtcgcgcac 7200
catgaccttg aggaactggt gcttgaagtc gaggtcgtcg cagccgccct gctcccagag 7260
ttggaagtcc gtgcgcttct tgtaggcggg gttaggcaaa gcgaaagtaa catcgttgaa 7320
gaggatcttg cccgcgcggg gcatgaagtt gcgagtgatg cggaaaggct ggggcacctc 7380
ggcccggttg ttgatgacct gggcggcgag gacgatctcg tcgaagccgt tgatgttgtg 7440
cccgacgatg tagagttcca cgaatcgcgg gcggcccttg acgtggggca gcttcttgag 7500
ctcgtcgtag gtgagctcgg cggggtcgct gagcccgtgc tgctcgaggg cccagtcggc 7560
gacgtggggg ttggcgctga ggaaggaagt ccagagatcc acggccaggg cggtctgcaa 7620
gcggtcccgg tactgacgga actgttggcc cacggccatt ttttcggggg tgacgcagta 7680
gaaggtgcgg gggtcgccgt gccagcggtc ccacttgagc tggagggcga ggtcgtgggc 7740
gagctcgacg agcggcgggt ccccggagag tttcatgacc agcatgaagg ggacgagctg 7800
cttgccgaag gaccccatcc aggtgtaggt ttccacatcg taggtgagga agagcctttc 7860
ggtgcgagga tgcgagccga tggggaagaa ctggatctcc tgccaccagt tggaggaatg 7920
gctgttgatg tgatggaagt agaaatgccg acggcgcgcc gagcactcgt gcttgtgttt 7980
atacaagcgt ccgcagtgct cgcaacgctg cacgggatgc acgtgctgca cgagctgtac 8040
ctgggttcct ttggcgagga atttcagtgg gcagtggagc gctggcggct gcatctcgtg 8100
ctgtactacg tcttggccat cggcgtggcc atcgtctgcc tcgatggtgg tcatgctgac 8160
gagcccgcgc gggaggcagg tccagacctc ggctcggacg ggtcggagag cgaggacgag 8220
ggcgcgcagg ccggagctgt ccagggtcct gagacgctgc ggagtcaggt cagtgggcag 8280
cggcggcgcg cggttgactt gcaggagctt ttccagggcg cgcgggaggt ccagatggta 8340
cttgatctcc acggcgccgt tggtggctac gtccacggct tgcagggtgc cgtgcccctg 8400
gggcgccacc accgtgcccc gtttcttctt gggcgctgct tccatgtcgg tcagaagcgg 8460
cggcgaggac gcgcgccggg cggcaggggc ggctcggggc ccggaggcag gggcggcagg 8520
ggcacgtcgg cgccgcgcgc gggcaggttc tggtactgcg cccggagaag actggcgtga 8580
gcgacgacgc gacggttgac gtcctggatc tgacgcctct gggtgaaggc cacgggaccc 8640
gtgagtttga acctgaaaga gagttcgaca gaatcaatct cggtatcgtt gacggcggcc 8700
tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt aggcgatctc ggtcatgaac 8760
tgctcgatct cctcctcctg aaggtctccg cggccggcgc gctcgacggt ggccgcgagg 8820
tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgccggcctc gttccagacg 8880
cggctgtaga ccacggctcc gtcggggtcg cgcgcgcgca tgaccacctg ggcgaggttg 8940
agctcgacgt ggcgcgtgaa gaccgcgtag ttgcagaggc gctggtagag gtagttgagc 9000
gtggtggcga tgtgctcggt gacgaagaag tacatgatcc agcggcggag cggcatctcg 9060
ctgacgtcgc ccagggcttc caagcgttcc atggcctcgt agaagtccac ggcgaagttg 9120
aaaaactggg agttgcgcgc cgagacggtc aactcctcct ccagaagacg gatgagctcg 9180
gcgatggtgg cgcgcacctc gcgctcgaag gccccggggg gctcctcttc catctcctcc 9240
tcttcctcct ccactaacat ctcttctact tcctcctcag gaggcggtgg cgggggaggg 9300
gccctgcgtc gccggcggcg cacgggcaga cggtcgatga agcgctcgat ggtctccccg 9360
cgccggcgac gcatggtctc ggtgacggcg cgcccgtcct cgcggggccg cagcatgaag 9420
acgccgccgc gcatctccag gtggccgccg ggggggtctc cgttgggcag ggagagggcg 9480
ctgacgatgc atcttatcaa ttgacccgta gggactccgc gcaaggacct gagcgtctcg 9540
agatccacgg gatccgaaaa ccgctgaacg aaggcttcga gccagtcgca gtcgcaaggt 9600
aggctgagcc cggtttcttg ttcttcgggt atttggtcgg gaggcgggcg ggcgatgctg 9660
ctggtgatga agttgaagta ggcggtcctg agacggcgga tggtggcgag gagcaccagg 9720
tccttgggcc cggcttgctg gatgcgcaga cggtcggcca tgccccaggc gtggtcctga 9780
cacctggcga ggtccttgta gtagtcctgc atgagccgct ccacgggcac ctcctcctcg 9840
cccgcgcggc cgtgcatgcg cgtgagcccg aacccgcgct gcggctggac gagcgccagg 9900
tcggcgacga cgcgctcggt gaggatggcc tgctggatct gggtgagggt ggtctggaag 9960
tcgtcgaagt cgacgaagcg gtggtaggct ccggtgttga tggtgtagga gcagttggcc 10020
atgacggacc agttgacggt ctggtggccg ggtcgcacga gctcgtggta cttgaggcgc 10080
gagtaggcgc gcgtgtcgaa gatgtagtcg ttgcaggcgc gcacgaggta ctggtatccg 10140
acgaggaagt gcggcggcgg ctggcggtag agcggccatc gctcggtggc gggggcgccg 10200
ggcgcgaggt cctcgagcat gaggcggtgg tagccgtaga tgtacctgga catccaggtg 10260
atgccggcgg cggtggtgga ggcgcgcggg aactcgcgga cgcggttcca gatgttgcgc 10320
agcggcagga agtagttcat ggtggccgcg gtctggcccg tgaggcgcgc gcagtcgtgg 10380
atgctctaga catacgggca aaaacgaaag cggtcagcgg ctcgactccg tggcctggag 10440
gctaagcgaa cgggttgggc tgcgcgtgta ccccggttcg aatctcgaat caggctggag 10500
ccgcagctaa cgtggtactg gcactcccgt ctcgacccaa gcctgctaac gaaacctcca 10560
ggatacggag gcgggtcgtt ttttggcctt ggtcgctggt catgaaaaac tagtaagcgc 10620
ggaaagcggc cgcccgcgat ggctcgctgc cgtagtctgg agaaagaatc gccagggttg 10680
cgttgcggtg tgccccggtt cgagcctcag cgctcggcgc cggccggatt ccgcggctaa 10740
cgtgggcgtg gctgccccgt cgtttccaag accccttagc cagccgactt ctccagttac 10800
ggagcgagcc cctctttttt tttcttgtgt ttttgccaga tgcatcccgt actgcggcag 10860
atgcgccccc accctccacc acaaccgccc ctaccgcagc agcagcaaca gccggcgctt 10920
ctgcccccgc cccagcagca gccagccact accgcggcgg ccgccgtgag cggagccggc 10980
gttcagtatg acctggcctt ggaagagggc gaggggctgg cgcggctggg ggcgtcgtcg 11040
ccggagcggc acccgcgcgt gcagatgaaa agggacgctc gcgaggccta cgtgcccaag 11100
cagaacctgt tcagagacag gagcggcgag gagcccgagg agatgcgcgc ctcccgcttc 11160
cacgcggggc gggagctgcg gcgcggcctg gaccgaaagc gggtgctgag ggacgaggat 11220
ttcgaggcgg acgagctgac ggggatcagc cccgcgcgcg cgcacgtggc cgcggccaac 11280
ctggtcacgg cgtacgagca gaccgtgaag gaggagagca acttccaaaa atccttcaac 11340
aaccacgtgc gcacgctgat cgcgcgcgag gaggtgaccc tgggcctgat gcacctgtgg 11400
gacctgctgg aggccatcgt gcagaacccc acgagcaagc cgctgacggc gcagctgttt 11460
ctggtggtgc agcacagtcg ggacaacgag acgttcaggg aggcgctgct gaatatcacc 11520
gagcccgagg gccgctggct cctggacctg gtgaacattt tgcagagcat cgtggtgcag 11580
gagcgcgggc tgccgctgtc cgagaagctg gcggccatca acttctcggt gctgagtctg 11640
ggcaagtact acgctaggaa gatctacaag accccgtacg tgcccataga caaggaggtg 11700
aagatcgacg ggttttacat gcgcatgacc ctgaaagtgc tgaccctgag cgacgatctg 11760
ggggtgtacc gcaacgacag gatgcaccgc gcggtgagcg ccagccgccg gcgcgagctg 11820
agcgaccagg agctgatgca cagcctgcag cgggccctga ccggggccgg gaccgagggg 11880
gagagctact ttgacatggg cgcggacctg cgctggcagc ccagccgccg ggccttggaa 11940
gctgccggcg gttcccccta cgtggaggag gtggacgatg aggaggagga gggcgagtac 12000
ctggaagact gatggcgcga ccgtattttt gctagatgca gcaacagcca ccgccgccgc 12060
ctcctgatcc cgcgatgcgg gcggcgctgc agagccagcc gtccggcatt aactcctcgg 12120
acgattggac ccaggccatg caacgcatca tggcgctgac gacccgcaat cccgaagcct 12180
ttagacagca gcctcaggcc aaccggctct cggccatcct ggaggccgtg gtgccctcgc 12240
gctcgaaccc cacgcacgag aaggtgctgg ccatcgtgaa cgcgctggtg gagaacaagg 12300
ccatccgcgg tgacgaggcc gggctggtgt acaacgcgct gctggagcgc gtggcccgct 12360
acaacagcac caacgtgcag acgaacctgg accgcatggt gaccgacgtg cgcgaggcgg 12420
tgtcgcagcg cgagcggttc caccgcgagt cgaacctggg ctccatggtg gcgctgaacg 12480
ccttcctgag cacgcagccc gccaacgtgc cccggggcca ggaggactac accaacttca 12540
tcagcgcgct gcggctgatg gtggccgagg tgccccagag cgaggtgtac cagtcggggc 12600
cggactactt cttccagacc agtcgccagg gcttgcagac cgtgaacctg agccaggctt 12660
tcaagaactt gcagggactg tggggcgtgc aggccccggt cggggaccgc gcgacggtgt 12720
cgagcctgct gacgccgaac tcgcgcctgc tgctgctgct ggtggcgccc ttcacggaca 12780
gcggcagcgt gagccgcgac tcgtacctgg gctacctgct taacctgtac cgcgaggcca 12840
tcggacaggc gcacgtggac gagcagacct accaggagat cacccacgtg agccgcgcgc 12900
tgggccagga ggacccgggc aacctggagg ccaccctgaa cttcctgctg accaaccggt 12960
cgcagaagat cccgccccag tacgcgctga gcaccgagga ggagcgcatc ctgcgctacg 13020
tgcagcagag cgtggggctg ttcctgatgc aggagggggc cacgcccagc gcggcgctcg 13080
acatgaccgc gcgcaacatg gagcccagca tgtacgcccg caaccgcccg ttcatcaata 13140
agctgatgga ctacttgcat cgggcggccg ccatgaactc ggactacttt accaacgcca 13200
tcttgaaccc gcactggctc ccgccgcccg ggttctacac gggcgagtac gacatgcccg 13260
accccaacga cgggttcctg tgggacgacg tggacagcag cgtgttctcg ccgcgtccag 13320
gaaccaatgc cgtgtggaag aaagagggcg gggaccggcg gccgtcctcg gcgctgtccg 13380
gtcgcgcggg tgctgccgcg gcggtgcccg aggccgccag ccccttcccg agcctgccct 13440
tttcgctgaa cagcgtgcgc agcagcgagc tgggtcggct gacgcgaccg cgcctgctgg 13500
gcgaggagga gtacctgaac gactccttgt tgaggcccga gcgcgagaag aacttcccca 13560
ataacgggat agagagcctg gtggacaaga tgagccgctg gaagacgtac gcgcacgagc 13620
acagggacga gccccgagct agcagcgcag gcacccgtag acgccagcgg cacgacaggc 13680
agcggggact ggtgtgggac gatgaggatt ccgccgacga cagcagcgtg ttggacttgg 13740
gtgggagtgg tggtaacccg ttcgctcacc tgcgcccccg tatcgggcgc ctgatgtaag 13800
aatctgaaaa aataaaagac ggtactcacc aaggccatgg cgaccagcgt gcgttcttct 13860
ctgttgtttg tagtagt atg atg agg cgc gtg tac ccg gag ggt cct cct 13910
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro
1 5 10
ccc tcg tac gag agc gtg atg cag cag gcg gtg gcg gcg gcg atg cag 13958
Pro Ser Tyr Glu Ser Val Met Gln Gln Ala Val Ala Ala Ala Met Gln
15 20 25
ccc ccg ctg gag gcg cct tac gtg ccc ccg cgg tac ctg gcg cct acg 14006
Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr
30 35 40
gag ggg cgg aac agc att cgt tac tcg gag ctg gca ccc ttg tac gat 14054
Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp
45 50 55
acc acc cgg ttg tac ctg gtg gac aac aag tcg gca gac atc gcc tcg 14102
Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Aa Asp Ile Ala Ser
60 65 70 75
ctg aac tac cag aac gac cac agc aac ttc ctg acc acc gtg gtg cag 14150
Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln
80 85 90
aac aac gat ttc acc ccc acg gag gcc agc acc cag acc atc aac ttt 14198
Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe
95 100 105
gac gag cgc tcg cgg tgg ggc ggc cag ctg aaa acc atc atg cac acc 14246
Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr
110 115 120
aac atg ccc aac gtg aac gag ttc atg tac agc aac aag ttc aag gcg 14294
Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala
125 130 135
cgg gtg atg gtc tcg cgc aag acc ccc aac ggg gtg gat gat gat tat 14342
Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Asp Asp Asp Tyr
140 145 150 155
gat ggt agt cag gac gag ctg acc tac gag tgg gtg gag ttt gag ctg 14390
Asp Gly Ser Gln Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu
160 165 170
ccc gag ggc aac ttc tcg gtg acc atg acc atc gat ctg atg aac aac 14438
Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn
175 180 185
gcc atc atc gac aac tac ttg gcg gtg ggg cgg cag aac ggg gtg ctg 14486
Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu
190 195 200
gag agc gac atc ggc gtg aag ttc gac acg cgc aac ttc cgg ctg ggc 14534
Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly
205 210 215
tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg tac acc aac gag 14582
Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu
220 225 230 235
gcc ttc cac ccc gac atc gtc ctg ctg ccc ggc tgc ggc gtg gac ttc 14630
Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe
240 245 250
acc gag agc cgc ctc agc aac ctg ctg ggc atc cgc aag cgg cag ccc 14678
Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro
255 260 265
ttc cag gag ggc ttc cag atc ctg tac gag gac ctg gag ggg ggc aac 14726
Phe Gln Glu Gly Phe Gln Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn
270 275 280
atc ccc gcg ctc ttg gat gtc gaa gcc tac gag aaa agc aag gag gat 14774
Ile Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Asp
285 290 295
agc acc gcc gcg gcg acc gca gcc gtg gcc acc gcc tct acc gag gtg 14822
Ser Thr Ala Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val
300 305 310 315
cgg ggc gat aat ttt gct agc gct gcg gca gcg gcc gag gcg gct gaa 14870
Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Ala Ala Glu Ala Ala Glu
320 325 330
acc gaa agt aag ata gtc atc cag ccg gtg gag aag gac agc aag gac 14918
Thr Glu Ser Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asp
335 340 345
agg agc tac aac gtg ctc gcg gac aag aaa aac acc gcc tac cgc agc 14966
Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser
350 355 360
tgg tac ctg gcc tac aac tac ggc gac ccc gag aag ggc gtg cgc tcc 15014
Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser
365 370 375
tgg acg ctg ctc acc acc tcg gac gtc acc tgc ggc gtg gag caa gtc 15062
Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val
380 385 390 395
tac tgg tcg ctg ccc gac atg atg caa gac ccg gtc acc ttc cgc tcc 15110
Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser
400 405 410
acg cgt caa gtt agc aac tac ccg gtg gtg ggc gcc gag ctc ctg ccc 15158
Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro
415 420 425
gtc tac tcc aag agc ttc ttc aac gag cag gcc gtc tac tcg cag cag 15206
Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln
430 435 440
ctg cgc gcc ttc acc tcg ctc acg cac gtc ttc aac cgc ttc ccc gag 15254
Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu
445 450 455
aac cag atc ctc gtc cgc ccg ccc gcg ccc acc att acc acc gtc agt 15302
Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser
460 465 470 475
gaa aac gtt cct gct ctc aca gat cac ggg acc ctg ccg ctg cgc agc 15350
Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser
480 485 490
agt atc cgg gga gtc cag cgc gtg acc gtc act gac gcc aga cgc cgc 15398
Ser Ile Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg
495 500 505
acc tgc ccc tac gtc tac aag gcc ctg ggc gta gtc gcg ccg cgc gtc 15446
Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val
510 515 520
ctc tcg agc cgc acc ttc taa aaaatgtcca ttctcatctc gcccagtaat 15497
Leu Ser Ser Arg Thr Phe
525
aacaccggtt ggggcctgcg cgcgcccagc aagatgtacg gaggcgctcg ccaacgctcc 15557
acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc cctggggcgc cctcaagggc 15617
cgcgtgcgct cgcgcaccac cgtcgacgac gtgatcgacc aggtggtggc cgacgcgcgc 15677
aactacacgc ccgccgccgc gcccgtctcc accgtggacg ccgtcatcga cagcgtggtg 15737
gccgacgcgc gccggtacgc ccgcaccaag agccggcggc ggcgcatcgc ccggcggcac 15797
cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc gcagggccag gcgcacggga 15857
cgcagggcca tgctcagggc ggccagacgc gcggcctccg gcagcagcag cgccggcagg 15917
acccgcagac gcgcggccac ggcggcggcg gcggccatcg ccagcatgtc ccgcccgcgg 15977
cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg tgcgcgtgcc cgtgcgcacc 16037
cgcccccctc gcacttgaag atgctgactt cgcgatgttg atgtgtccca gcggcgagga 16097
ggatgtccaa gcgcaaatac aaggaagaga tgctccaggt catcgcgcct gagatctacg 16157
gccccgcggc ggcggtgaag gaggaaagaa agccccgcaa actgaagcgggtcaaaaagg 16217
acaaaaagga ggaggaagat gacggactgg tggagtttgt gcgcgagttc gccccccggc 16277
ggcgcgtgca gtggcgcggg cggaaagtga aaccggtgct gcggcccggc accacggtgg 16337
tcttcacgcc cggcgagcgt tccggctccg cctccaagcg ctcctacgac gaggtgtacg 16397
gggacgagga catcctcgag caggcggtcg agcgtctggg cgagtttgcg tacggcaagc 16457
gcagccgccc cgcgcccttg aaagaggagg cggtgtccat cccgctggac cacggcaacc 16517
ccacgccgag cctgaagccg gtgaccctgc agcaggtgct accgagcgcg gcgccgcgcc 16577
ggggcttcaa gcgcgagggc ggcgaggatc tgtacccgac catgcagctg atggtgccca 16637
agcgccagaa gctggaggac gtgctggagc acatgaaggt ggaccccgag gtgcagcccg 16697
aggtcaaggt gcggcccatc aagcaggtgg ccccgggcct gggcgtgcag accgtggaca 16757
tcaagatccc cacggagccc atggaaacgc agaccgagcc cgtgaagccc agcaccagca 16817
ccatggaggt gcagacggat ccctggatgc cagcaccagc ttccaccagc actcgccgaa 16877
gacgcaagta cggcgcggcc agcctgctga tgcccaacta cgcgctgcat ccttccatca 16937
tccccacgcc gggctaccgc ggcacgcgct tctaccgcgg ctacaccagc agccgccgcc 16997
gcaagaccac cacccgccgc cgtcgtcgca gccgccgcag cagcaccgcg acttccgcct 17057
tggtgcggag agtgtatcgc agcgggcgcg agcctctgac cctgccgcgc gcgcgctacc 17117
acccgagcat cgccatttaa ctaccgcctc ctacttgcag atatggccct cacatgccgc 17177
ctccgcgtcc ccattacggg ctaccgagga agaaagccgc gccgtagaag gctgacgggg 17237
aacgggctgc gtcgccatca ccaccggcgg cggcgcgcca tcagcaagcg gttgggggga 17297
ggcttcctgc ccgcgctgat ccccatcatc gccgcggcga tcggggcgat ccccggcata 17357
gcttccgtgg cggtgcaggc ctctcagcgc cactgagaca caaaaaagca tggatttgta 17417
ataaaaaaaa aaatggactg acgctcctgg tcctgtgatg tgtgttttta gatggaagac 17477
atcaattttt cgtccctggc accgcgacac ggcacgcggc cgtttatggg cacctggagc 17537
gacatcggca acagccaact gaacgggggc gccttcaatt ggagcagtct ctggagcggg 17597
cttaagaatt tcgggtccac gctcaaaacc tatggcaaca aggcgtggaa cagcagcaca 17657
gggcaggcgc tgagggaaaa gctgaaagaa cagaacttcc agcagaaggt ggttgatggc 17717
ctggcctcag gcatcaacgg ggtggttgac ctggccaacc aggccgtgca gaaacagatc 17777
aacagccgcc tggacgcggt cccgcccgcg gggtccgtgg agatgcccca ggtggaggag 17837
gagctgcctc ccctggacaa gcgcggcgac aagcgaccgc gtcccgacgc ggaggagacg 17897
ctgctgacgc acacggacga gccgcccccg tacgaggagg cggtgaaact gggcctgccc 17957
accacgcggc ccgtggcgcc tctggccacc ggagtgctga aacccagcag cagccagccc 18017
gcgaccctgg acttgcctcc gcctcgcccc tccacagtgg ctaagcccct gccgccggtg 18077
gccgtcgcgt cgcgcgcccc ccgaggccgc ccccaggcga actggcagag cactctgaac 18137
agcatcgtgg gtctgggagt gcagagtgtg aagcgccgcc gctgctatta aaagacactg 18197
tagcgcttaa cttgcttgtc tgtgtgtata tgtatgtccg ccgaccagaa ggaggagtgt 18257
gaagaggcgc gtcgccgagt tgcaag atg gcc acc cca tcg atg ctg ccc cag 18310
Met Ala Thr Pro Ser Met Leu Pro Gln
530 535
tgg gcg tac atg cac atc gcc gga cag gac gct tcg gag tac ctg agt 18358
Trp Ala Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu Ser
540 545 550
ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt ctg 18406
Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu
555 560 565 570
ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg acc 18454
Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val Thr
575 580 585
acc gac cgc agc cag cgg ctg acg ctg cgc ttc gtg ccc gtg gac cgc 18502
Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val Pro Val Asp Arg
590 595 600
gag gac aac acc tac tcg tac aaa gtg cgc tac acg ctg gcc gtg ggc 18550
Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly
605 610 615
gac aac cgc gtg ctg gac atg gcc agc acc tac ttt gac atc cgc ggc 18598
Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly
620 625 630
gtg ctg gac cgg ggc cct agc ttc aaa ccc tac tct ggc acc gcc tac 18646
Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr
635 640 645 650
aac agc cta gct ccc aag gga gct ccc aat tcc agc cag tgg gag caa 18694
Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Ser Ser Gln Trp Glu Gln
655 660 665
gca aaa aca ggc aat ggg gga act atg gaa aca cac aca tat ggt gtg 18742
Ala Lys Thr Gly Asn Gly Gly Thr Met Glu Thr His Thr Tyr Gly Val
670 675 680
gcc cca atg ggc gga gag aat att aca aaa gat ggt ctt caa att gga 18790
Ala Pro Met Gly Gly Glu Asn Ile Thr Lys Asp Gly Leu Gln Ile Gly
685 690 695
act gac gtt aca gcg aat cag aat aaa cca att tat gcc gac aaa aca 18838
Thr Asp Val Thr Ala Asn Gln Asn Lys Pro Ile Tyr Ala Asp Lys Thr
700 705 710
ttt caa cca gaa ccg caa gta gga gaa gaa aat tgg caa gaa act gaa 18886
Phe Gln Pro Glu Pro Gln Val Gly Glu Glu Asn Trp Gln Glu Thr Glu
715 720 725 730
aac ttt tat ggc ggt aga gct ctt aaa aaa gac aca aac atg aaa cct 18934
Asn Phe Tyr Gly Gly Arg Ala Leu Lys Lys Asp Thr Asn Met Lys Pro
735 740 745
tgc tat ggc tcc tat gct aga ccc acc aat gaa aaa gga ggt caa gct 18982
Cys Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Glu Lys Gly Gly Gln Ala
750 755 760
aaa ctt aaa gtt gga gat gat gga gtt cca acc aaa gaa ttc gac ata 19030
Lys Leu Lys Val Gly Asp Asp Gly Val Pro Thr Lys Glu Phe Asp Ile
765 770 775
gac ctg gct ttc ttt gat act ccc ggt ggc acc gtg aac ggt caa gac 19078
Asp Leu Ala Phe Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gln Asp
780 785 790
gag tat aaa gca gac att gtc atg tat acc gaa aac acg tat ttg gaa 19126
Glu Tyr Lys Ala Asp Ile Val Met Tyr Thr Glu Asn Thr Tyr Leu Glu
795 800 805 810
act cca gac acg cat gtg gta tac aaa cea ggc aag gat gat gca agt 19174
Thr Pro Asp Thr His Val Val Tyr Lys Pro Gly Lys Asp Asp Ala Ser
815 820 825
tct gaa att aac ctg gtt cag cag tct atg ccc aac aga ccc aac tac 19222
Ser Glu Ile Asn Leu Val Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr
830 835 840
att ggg ttc agg gac aac ttt atc ggt ctt atg tac tac aac agc act 19270
Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr
845 850 855
ggc aat atg ggt gtg ctt gct ggt cag gcc tcc cag ctg aat gct gtg 19318
Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val
860 865 870
gtt gat ttg caa gac aga aac acc gag ctg tcc tac cag ctc ttg ctt 19366
Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu
875 880 885 890
gac tct ttg ggt gac aga acc cgg tat ttc agt atg tgg aac cag gcg 19414
Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala
895 900 905
gtg gac agt tat gac ccc gat gtg cgc atc atc gaa aac cat ggt gtg 19462
Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val
910 915 920
gag gat gaa ttg cca aac tat tgc ttc ccc ttg gac ggc tct ggc act 19510
Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ser Gly Thr
925 930 935
aac gcc gca tac caa ggt gtg aaa gta aaa gat ggt caa gat ggt gat 19558
Asn Ala Ala Tyr Gln Gly Val Lys Val Lys Asp Gly Gln Asp Gly Asp
940 945 950
gtt gag agt gaa tgg gaa aat gac gat act gtt gca gct cga aat caa 19606
Val Glu Ser Glu Trp Glu Asn Asp Asp Thr Val Ala Ala Arg Asn Gln
955 960 965 970
tta tgt aaa ggt aac att ttc gcc atg gag att aat ctc cag gct aac 19654
Leu Cys Lys Gly Asn Ile Phe Ala Met Glu Ile Asn Leu Gln Ala Asn
975 980 985
ctg tgg aga agt ttc ctc tac tcg aac gtg gcc ctg tac ctg ccc gac 19702
Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp
990 995 1000
tcc tac aag tac acg ccg acc aac gtc acg ctg ccg acc aac acc 19747
Ser Tyr Lys Tyr Thr Pro Thr Asn Val Thr Leu Pro Thr Asn Thr
1005 1010 1015
aac acc tac gat tac atg aat ggc aga gtg aca cct ccc tcg ctg 19792
Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val Thr Pro Pro Ser Leu
1020 1025 1030
gta gac gcc tac ctc aac atc ggg gcg cgc tgg tcg ctg gac ccc 19837
Val Asp Ala Tyr Leu Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro
1035 1040 1045
atg gac aac gtc aac ccc ttc aac cac cac cgc aac gcg ggc ctg 19882
Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu
1050 1055 1060
cgc tac cgc tcc atg ctc ctg ggc aac ggg cgc tac gtg ccc ttc 19927
Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe
1065 1070 1075
cac atc cag gtg ccc caa aag ttt ttc gcc atc aag agc ctc ctg 19972
His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu
1080 1085 1090
ctc ctg ccc ggg tcc tac acc tac gag tgg aac ttc cgc aag gac 20017
Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp
1095 1100 1105
gtc aac atg atc ctg cag agc tcc cta ggc aac gac ctg cgc acg 20062
Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr
1110 1115 1120
gac ggg gcc tcc atc gcc ttc acc agc atc aac ctc tac gcc acc 20107
Asp Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn Leu Tyr Ala Thr
1125 1130 1135
ttc ttc ccc atg gcg cac aac acc gcc tcc acg ctc gag gcc atg 20152
Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met
1140 1145 1150
ctg cgc aac gac acc aac gac cag tcc ttc aac gac tac ctc tcg 20197
Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser
1155 1160 1165
gcg gcc aac atg ctc tac ccc atc ccg gcc aac gcc acc aac gtg 20242
Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val
1170 1175 1180
ccc atc tcc atc ccc tcg cgc aac tgg gcc gcc ttc cgc gga tgg 20287
Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp
1185 1190 1195
tcc ttc acg cgc ctg aag acc cgc gag acg ccc tcg ctc ggc tcc 20332
Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser
1200 1205 1210
ggg ttc gac ccc tac ttc gtc tac tcg ggc tcc atc ccc tac cta 20377
Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu
1215 1220 1225
gac ggc acc ttc tac ctc aac cac acc ttc aag aag gtc tcc atc 20422
Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile
1230 1235 1240
acc ttc gac tcc tcc gtc agc tgg ccc ggc aac gac cgc ctc ctg 20467
Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu
1245 1250 1255
acg ccc aac gag ttc gaa atc aag cgc acc gtc gac gga gag gga 20512
Thr Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly
1260 1265 1270
tac aac gtg gcc cag tgc aac atg acc aag gac tgg ttc ctg gtc 20557
Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val
1275 1280 1285
cag atg ctg gcc cac tac aac atc ggc tac cag ggc ttc tac gtg 20602
Gln Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val
1290 1295 1300
ccc gag ggc tac aag gac cgc atg tac tcc ttc ttc cgc aac ttc 20647
Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe
1305 1310 1315
cag ccc atg agc cgc cag gtc gtg gac gag gtc aac tac aag gac 20692
Gln Pro Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys Asp
1320 1325 1330
tac cag gcc gtc acc ctg gcc tac cag cac aac aac tcg ggc ttc 20737
Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe
1335 1340 1345
gtc ggc tac ctc gcg ccc acc atg cgc cag ggc cag ccc tac ccc 20782
Val Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro
1350 1355 1360
gcc aac tac ccc tac ccg ctc atc ggc aag agc gcc gtc gcc agc 20827
Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Ala Ser
1365 1370 1375
gtc acc cag aaa aag ttc ctc tgc gac cgg gtc atg tgg cgc atc 20872
Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile
1380 1385 1390
ccc ttc tcc agc aac ttc atg tcc atg ggc gcg ctc acc gac ctc 20917
Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu
1395 1400 1405
ggc cag aac atg ctc tac gcc aac tcc gcc cac gcg cta gac atg 20962
Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met
1410 1415 1420
aat ttc gaa gtc gac ccc atg gat gag tcc acc ctt ctc tat gtt 21007
Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val
1425 1430 1435
gtc ttc gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac cgc 21052
Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro His Arg
1440 1445 1450
ggc gtc atc gaa gcc gtc tac ctg cgc acg ccc ttc tcg gcc ggc 21097
Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly
1455 1460 1465
aac gcc acc acc taa gccgctcttg cttcttgcaa gatgacggcg ggctccggcg 21152
Asn Ala Thr Thr
1470
agcaggagct cagggccatc ctccgcgacc tgggctgcgg gccctgcttc ctgggcacct 21212
tcgacaagcg cttccctgga ttcatggccc cgcacaagct ggcctgcgcc atcgtgaaca 21272
cggccggccg cgagaccggg ggcgagcact ggctggcctt cgcctggaac ccgcgctccc 21332
acacatgcta cctcttcgac cccttcgggt tctcggacga gcgcctcaag cagatctacc 21392
agttcgagta cgagggcctg ctgcgtcgca gcgccctggc caccgaggac cgctgcgtca 21452
ccctggaaaa gtccacccag accgtgcagg gtccgcgctc ggccgcctgc gggctcttct 21512
gctgcatgtt cctgcacgcc ttcgtgcact ggcccgaccg ccccatggac aagaacccca 21572
ccatgaactt actgacgggg gtgcccaacg gcatgctcca gtcgccccag gtggaaccca 21632
ccctgcgccg caaccaggaa gcgctctacc gcttcctcaa tgcccactcc gcctactttc 21692
gctcccaccg cgcgcgcatc gagaaggcca ccgccttcga ccgcatgaat caagacatgt 21752
aaaaaaccgg tgtgtgtatg tgaatgcttt attcataata aacagcacat gtttatgcca 21812
ccttctctga ggctctgact ttatttagaa atcgaagggg ttctgccggc tctcggcatg 21872
gcccgcgggc agggatacgt tgcggaactg gtacttgggc agccacttga actcggggat 21932
cagcagcttg ggcacgggga ggtcggggaa cgagtcgctc cacagcttgc gcgtgagttg 21992
cagggcgccc agcaggtcgg gcgcggagat cttgaaatcg cagttgggac ccgcgttctg 22052
cgcgcgagag ttgcggtaca cggggttgca gcactggaac accatcaggg ccgggtgctt 22112
cacgcttgcc agcaccgtcg cgtcggtgat gccctccacg tccagatcct cggcgttggc 22172
catcccgaag ggggtcatct tgcaggtctg ccgccccatg ctgggcacgc agccgggctt 22232
gtggttgcaa tcgcagtgca gggggatcag catcatctgg gcctgctcgg agctcatgcc 22292
cgggtacatg gccttcatga aagcctccag ctggcggaag gcctgctgcg ccttgccgcc 22352
ctcggtgaag aagaccccgc aggacttgct agagaactgg ttggtggcgc agccggcgtc 22412
gtgcacgcag cagcgcgcgt cgttgttggc cagctgcacc acgctgcgcc cccagcggtt 22472
ctgggtgatc ttggcccggt tggggttctc cttcagcgcg cgctgcccgt tctcgctcgc 22532
cacatccatc tcgatagtgt gctccttctg gatcatcacg gtcccgtgca ggcaccgcag 22592
cttgccctcg gcttcggtgc agccgtgcag ccacagcgcg cagccggtgc actcccagtt 22652
cttgtgggcg atctgggagt gcgagtgcac gaagccctgc aggaagcggc ccatcatcgc 22712
ggtcagggtc ttgttgctgg tgaaggtcag cgggatgccg cggtgctcct cgttcacata 22772
caggtggcag atgcggcggt acacctcgcc ctgctcgggc atcagctgga aggcggactt 22832
caggtcgctc tccacgcggt accggtccat cagcagcgtc atcacttcca tgcccttctc 22892
ccaggccgaa acgatcggca ggctcagggg gttcttcacc gccattgtca tcttagtcgc 22952
cgccgccgag gtcagggggt cgttctcgtc cagggtctca aacactcgct tgccgtcctt 23012
ctcgatgatg cgcacggggg gaaagctgaa gcccacggcc gccagctcct cctcggcctg 23072
cctttcgtcc tcgctgtcct ggctgatgtc ttgcaaaggc acatgcttgg tcttgcgggg 23132
tttctttttg ggcggcagag gcggcggcga tgtgctggga gagcgcgagt tctcgttcac 23192
cacgactatt tcttcttctt ggccgtcgtc cgagaccacg cggcggtagg catgcctctt 23252
ctggggcaga ggcggaggcg acgggctctc gcggttcggc gggcggctgg cagagcccct 23312
tccgcgttcg ggggtgcgct cctggcggcg ctgctctgac tgacttcctc cgcggccggc 23372
cattgtgttc tcctagggag caacaacaag catggagact cagccatcgt cgccaacatc 23432
gccatctgcc cccgccgcca ccgccgacga gaaccagcag cagaatgaaa gcttaaccgc 23492
cccgccgccc agccccacct ccgacgccgc ggccccagac atgcaagaga tggaggaatc 23552
catcgagatt gacctgggct acgtgacgcc cgcggagcac gaggaggagc tggcagcgcg 23612
cttttcagcc ccggaagaga accaccaaga gcagccagag caggaagcag agaacgagca 23672
gaaccaggct gggcacgagc atggcgacta cctgagcggg gcagaggacg tgctcatcaa 23732
gcatctggcc cgccaatgca tcatcgtcaa ggacgcgctg ctcgaccgcg ccgaggtgcc 23792
cctcagcgtg gcggagctca gccgcgccta cgagcgcaac ctcttctcgc cgcgcgtgcc 23852
ccccaagcgc cagcccaacg gcacctgtga gcccaacccg cgcctcaact tctacccggt 23912
cttcgcggtg cccgaggccc tggccaccta ccacctcttt ttcaagaacc aaaggatccc 23972
cgtctcctgc cgcgccaacc gcacccgcgc cgacgccctg ctcaacctgg gccccggcgc 24032
ccgcctacct gatatcacct ccttggaaga ggttcccaag atcttcgagg gtctgggcag 24092
cgacgagact cgggccgcga acgctctgca aggaagcgga gaggagcatg agcaccacag 24152
cgccctggtg gagttggaag gcgacaacgc gcgcctggcg gtcctcaagc gcacggtcga 24212
gctgacccac ttcgcctacc cggcgctcaa cctgcccccc aaggtcatga gcgccgtcat 24272
ggaccaggtg ctcatcaagc gcgcctcgcc cctctcggag gaggagatgc aggaccccga 24332
gagttcggac gagggcaagc ccgtggtcag cgacgagcag ctggcgcgct ggctgggagc 24392
gagtagcacc ccccagagcc tggaagagcg gcgcaagctc atgatggccg tggtcctggt 24452
gaccgtggag ctggagtgtc tgcgccgctt ctttgccgac gcggagaccc tgcgcaaggt 24512
cgaggagaac ctgcactacc tcttcaggca cgggttcgtg cgccaggcct gcaagatctc 24572
caacgtggag ctgaccaacc tggtctccta catgggcatc ctgcacgaga accgcctggg 24632
gcaaaacgtg ctgcacacca ccctgcgcgg ggaggcccgc cgcgactaca tccgcgactg 24692
cgtctacctg tacctctgcc acacctggca gacgggcatg ggcgtgtggc agcagtgcct 24752
ggaggagcag aacctgaaag agctctgcaa gctcctgcag aagaacctca aggccctgtg 24812
gaccgggttc gacgagcgta ccaccgcctc ggacctggcc gacctcatct tccccgagcg 24872
cctgcggctg acgctgcgca acgggctgcc cgactttatg agccaaagca tgttgcaaaa 24932
ctttcgctct ttcatcctcg aacgctccgg gatcctgccc gccacctgct ccgcgctgcc 24992
ctcggacttc gtgccgctga ccttccgcga gtgccccccg ccgctctgga gccactgcta 25052
cttgctgcgc ctggccaact acctggccta ccactcggac gtgatcgagg acgtcagcgg 25112
cgagggtctg ctggagtgcc actgccgctg caacctctgc acgccgcacc gctccctggc 25172
ctgcaacccc cagctgctga gcgagaccca gatcatcggc accttcgagt tgcaaggccc 25232
cggcgacggc gagggcaagg ggggtctgaa actcaccccg gggctgtgga cctcggccta 25292
cttgcgcaag ttcgtgcccg aggactacca tcccttcgag atcaggttct acgaggacca 25352
atcccagccg cccaaggccg agctgtcggc ctgcgtcatc acccaggggg ccatcctggc 25412
ccaattgcaa gccatccaga aatcccgcca agaatttctg ctgaaaaagg gccacggggt 25472
ctacttggac ccccagaccg gagaggagct caaccccagc ttcccccagg atgccccgag 25532
gaagcagcaa gaagctgaaa gtggagctgc cgccgccgga ggatttggag gaagactggg 25592
agagcagtca ggcagaggag gaggagatgg aagactggga cagcactcag gcagaggagg 25652
acagcctgca agacagtctg gaggaggaag acgaggtgga ggaggcagag gaagaagcag 25712
ccgccgccag accgtcgtcc tcggcggaga aagcaagcag cacggatacc atctccgctc 25772
cgggtcgggg tcgcggcggc cgggcccaca gtaggtggga cgagaccggg cgcttcccga 25832
accccaccac ccagaccggt aagaaggagc ggcagggata caagtcctgg cgggggcaca 25892
aaaacgccat cgtctcctgc ttgcaagcct gcgggggcaa catctccttc acccggcgct 25952
acctgctctt ccaccgcggg gtgaacttcc cccgcaacat cttgcattac taccgtcacc 26012
tccacagccc ctactactgt ttccaagaag aggcagaaac ccagcagcag cagaaaacca 26072
gcggcagcag cagctagaaa atccacagcg gcggcaggtg gactgaggat cgcggcgaac 26132
gagccggcgc agacccggga gctgaggaac cggatctttc ccaccctcta tgccatcttc 26192
cagcagagtc gggggcagga gcaggaactg aaagtcaaga accgttctct gcgctcgctc 26252
acccgcagtt gtctgtatca caagagcgaa gaccaacttc agcgcactct cgaggacgcc 26312
gaggctctct tcaacaagta ctgcgcgctc actcttaaag agtagcccgc gcccgcccac 26372
acacggaaaa aggcgggaat tacgtcacca cctgcgccct tcgcccgacc atcatgagca 26432
aagagattcc cacgccttac atgtggagct accagcccca gatgggcctg gccgccggcg 26492
ccgcccagga ctactccacc cgcatgaact ggctcagtgc cgggcccgcg atgatctcac 26552
gggtgaatga catccgcgcc caccgaaacc agatactcct agaacagtca gcgatcaccg 26612
ccacgccccg ccatcacctt aatccgcgta attggcccgc cgccctggtg taccaggaaa 26672
ttccccagcc cacgaccgta ctacttccgc gagacgccca ggccgaagtc cagctgacta 26732
actcaggtgt ccagctggcc ggcggcgccg ccctgtgtcg tcaccgcccc gctcagggta 26792
taaagcggct ggtgatccga ggcagaggca cacagctcaa cgacgaggtg gtgagctctt 26852
cgctgggtct gcgacctgac ggagtcttcc aactcgccgg atcggggaga tcttccttca 26912
cgcctcgtca ggccgtcctg actttggaga gttcgtcctc gcagccccgc tcgggcggca 26972
tcggcactct ccagttcgtg gaggagttca ctccctcggt ctacttcaac cccttctccg 27032
gctcccccgg ccactacccg gacgagttca tcccgaactt cgacgccatc agcgagtcgg 27092
tggacggcta cgattgaatg tcccatggtg gcgcagctga cctagctcgg cttcgacacc 27152
tggaccactg ccgccgcttc cgctgcttcg ctcgggatct cgccgagttt gcctactttg 27212
agctgcccga ggagcaccct cagggcccag cccacggagt gcggatcatc gtcgaagggg 27272
gcctcgactc ccacctgctt cggatcttca gccagcgacc gatcctggtc gagcgcgaac 27332
aaggacagac ccttcttact ttgtactgca tctgcaacca ccccggcctg catgaaagtc 27392
tttgttgtct gctgtgtact gagtataata aaagctgaga tcagcgacta ctccggactc 27452
gattgtggtg ttcctgctat caaccggtcc ctgttcttca ccgggaacga gaccgagctc 27512
cagctccagt gtaagcccca caagaagtac ctcacctggc tgttccaggg ctccccgatc 27572
gccgttgtca accactgcga caacgacgga gtcctgctga gcggccctgc caaccttact 27632
ttttccaccc gcagaagcaa gctccagctc ttccaaccct tcctccccgg gacctatcag 27692
tgcgtctcag gaccctgcca tcacaccttc cacctgatcc cgaataccac agcgccgctc 27752
cccgctacta acaaccaaac tacccaccaa cgccaccgtc gcgacctttc ctctgaatct 27812
aataccacta ccggaggtga gctccgaggt cgaccaacct ctgggattta ctacggcccc 27872
tgggaggtgg tggggttaat agcgctaggc ctagttgcgg gtgggctttt ggttctctgc 27932
tacctatacc tcccttgctg ttcgtactta gtggtgctgt gttgctggtt taagaaatgg 27992
ggaagatcac cctagtgagc tgcggtgcgc tggtggcggt gttgctttcg attgtgggac 28052
tgggcggcgc ggctgtagtg aaggagaagg ccgatccctg cttgcatttc aatcccaaca 28112
aatgccagct gagttttcag cccgatggca atcggtgcgc ggtactgatc aagtgcggat 28172
gggaatgcga gaacgtgaga atcgagtaca ataacaagac tcggaacaat actctcgcgt 28232
ccgtgtggca gcccggggac cccgagtggt acaccgtctc tgtccccggt gctgacggct 28292
ccccgcgcac cgtgaataat actttcattt ttgcgcacat gtgcaacacg gtcatgtgga 28352
tgagcaagca gtacgatatg tggcccccca cgaaggagaa catcgtggtc ttctccatcg 28412
cttacagcct gtgcacggcg ctaatcaccg ctatcgtgtg cctgagcatt cacatgctca 28472
tcgctattcg ccccagaaat aatgccgaga aagagaaaca gccataacac gttttttcac 28532
acaccttgtt tttacagaca atgcgtctgt taaatttttt aaacattgtg ctcagtattg 28592
cttatgcctc tggttatgca aacatacaga aaacccttta tgtaggatct gatggtacac 28652
tagagggtac ccaatcacaa gccaaggttg catggtattt ttatagaacc aacactgatc 28712
cagttaaact ttgtaagggt gaattgccgc gtacacataa aactccactt acatttagtt 28772
gcagcaataa taatcttaca cttttttcaa ttacaaaaca atatactggt acttattaca 28832
gtacaaactt tcatacagga caagataaat attatactgt taaggtagaa aatcctacca 28892
ctcctagaac taccaccacc accactactg caaagcccac tgtgaaaact acaactagga 28952
ccaccacaac tacagaaacc accaccagca caacacttgc tgcaactaca cacacacaca 29012
ctaagctaac cttacagacc actaatgatt tgatcgccct gctgcaaaag ggggataaca 29072
gcaccacttc caatgaggag atacccaaat ccatgattgg cattattgtt gctgtagtgg 29132
tgtgcatgtt gatcatcgcc ttgtgcatgg tgtactatgc cttctgctac agaaagcaca 29192
gactgaacga caagctggaa cacttactaa gtgttgaatt ttaatttttt agaaccatga 29252
agatcctagg cctttttagt ttttctatca ttacctctgc tctttgtgaa tcagtggata 29312
gagatgttac tattaccact ggttctaatt atacactgaa agggccaccc tcaggtatgc 29372
tttcgtggta ttgctatttt ggaactgaca ctgatcaaac tgaattatgc aattttcaaa 29432
aaggcaaaac ctcaaactct aaaatctcta attatcaatg caatggcact gatctgatac 29492
tactcaatgt cacgaaagca tatggtggca gttattattg ccctggacaa aacactgaag 29552
aaatgatttt ttacaaagtg gaagtggttg atcccactac accacccacc accacaacta 29612
ttcataccac acacacagaa caaacaccag aggcaacaga agcagagttg gccttccagg 29672
ttcacggaga ttcctttgct gtcaataccc ctacacccga tcagcggtgt ccggggccgc 29732
tagtcagcgg cattgtcggt gtgctttcgg gattagcagt cataatcatc tgcatgttca 29792
tttttgcttg ctgctataga aggctttacc gacaaaaatc agacccactg ctgaacctct 29852
atgtttaatt ttttccagag ccatgaaggc agttagcgct ctagtttttt gttctttgat 29912
tggcattgtt tttaatagta aaattaccag agttagcttt attaaacatg ttaatgtaac 29972
tgaaggagat aacatcacac tagcaggtgt agaaggtgct caaaacacca cctggacaaa 30032
ataccatcta ggatggagag atatttgcac ctggaatgta acttattatt gcataggagt 30092
taatcttacc attgttaacg ctaaccaatc tcagaatggg ttaattaaag gacagagtgt 30152
tagtgtgacc agtgatgggt actataccca gcatagtttt aactacaaca ttactgtcat 30212
accactgcct acgcctagcc cacctagcac taccacacag acaaccacat acagtacatc 30272
aaatcagcct accaccacta cagcagcaga ggttgccagc tcgtctgggg tccgagtggc 30332
atttttgatg ttggccccat ctagcagtcc cactgctagt accaatgagc agactactga 30392
atttttgtcc actgtcgaga gccacaccac agctacctcc agtgccttct ctagcaccgc 30452
caatctctcc tcgctttcct ctacaccaat cagccccgct actactccta gccccgctcc 30512
tcttcccact cccctgaagc aaacagacgg cggcatgcaa tggcagatca ccctgctcat 30572
tgtgatcggg ttggtcatcc tggccgtgtt gctctactac atcttctgcc gccgcattcc 30632
caacgcgcac cgcaagccgg cctacaagcc catcgttatc gggcagccgg agccgcttca 30692
ggtggaaggg ggtctaagga atcttctctt ctcttttaca gtatggtgat tgaactatga 30752
ttcctagaca attcttgatc actattctta tctgcctcct ccaagtctgt gccaccctcg 30812
ctctggtggc caacgccagt ccagactgta ttgggccctt cgcctcctac gtgctctttg 30872
ccttcgtcac ctgcatctgc tgctgtagca tagtctgcct gcttatcacc ttcttccagt 30932
tcattgactg gatctttgtg cgcatcgcct acctgcgcca ccacccccag taccgcgacc 30992
agcgagtggc gcagctgctc aggctcctct gataagcatg cgggctctgc tacttctcgc 31052
gcttctgctg ttagtgctcc cccgtcccgt cgacccccgg tcccccactc agtcccccga 31112
ggaggttcgc aaatgcaaat tccaagaacc ctggaaattc ctcaaatgct accgccaaaa 31172
atcagacatg catcccagct ggatcatgat cattgggatc gtgaacattc tggcctgcac 31232
cctcatctcc tttgtgattt acccctgctt tgactttggt tggaactcgc cagaggcgct 31292
ctatctcccg cctgaacctg acacaccacc acagcagcaa cctcaggcac acgcactacc 31352
accaccacag cctaggccac aatacatgcc catattagac tatgaggccg agccacagcg 31412
acccatgctc cccgctatta gttacttcaa tctaaccggc ggagatgact gacccactgg 31472
ccaataacaa cgtcaacgac cttctcctgg acatggacgg ccgcgcctcg gagcagcgac 31532
tcgcccaact tcgcattcgt cagcagcagg agagagccgt caaggagctg caggacggca 31592
tagccatcca ccagtgcaag agaggcatct tctgcctggt gaaacaggcc aagatctcct 31652
acgaggtcac ccagaccgac catcgcctct cctacgagct cctgcagcag cgccagaagt 31712
tcacctgcct ggtcggagtc aaccccatcg tcatcaccca gcagtcgggc gataccaagg 31772
ggtgcatcca ctgctcctgc gactcccccg actgcgtcca cactctgatc aagaccctct 31832
gcggcctccg cgacctcctc cccatgaact aatcaccccc ttatccagtg aaataaagat 31892
catattgatg atgatttaaa taaaaaaaat aatcatttga tttgaaataa agatacaatc 31952
atattgatga tttgagttta acaaaaataa agaatcactt acttgaaatc tgataccagg 32012
tctctgtcca tgttttctgc caacaccacc tcactcccct cttcccagct ctggtactgc 32072
aggccccggc gggctgcaaa cttcctccac acgctgaagg ggatgtcaaa ttcctcctgt 32132
ccctcaatct tcattttatc ttctatcag atg tcc aaa aag cgc gtc cgg gtg 32185
Met Ser Lys Lys Arg Val Arg Val
1475
gat gat gac ttc gac ccc gtc tac ccc tac gat gca gac aac gca 32230
Asp Asp Asp Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala
1480 1485 1490
ccg acc gtg ccc ttc atc aac ccc ccc ttc gtc tct tca gat gga 32275
Pro Thr Val Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly
1495 1500 1505
ttc caa gag aag ccc ctg ggg gtg ttg tcc ctg cga ctg gct gac 32320
Phe Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp
1510 1515 1520
ccc gtc acc acc aag aac ggg gaa atc acc ctc aag ctg gga gag 32365
Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly Glu
1525 1530 1535
ggg gtg gac ctc gac tcg tcg gga aaa ctc atc tcc aac acg gcc 32410
Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr Ala
1540 1545 1550
acc aag gcc gcc gcc cct ctc agt att tca aac aac acc att tcc 32455
Thr Lys Ala Ala Ala Pro Leu Ser Ile Ser Asn Asn Thr Ile Ser
1555 1560 1565
ctt aaa act gct gcc cct ttc tac aac aac aat gga act tta agc 32500
Leu Lys Thr Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu Ser
1570 1575 1580
ctc aat gtc tcc aca cca tta gca gta ttt ccc aca ttt aac act 32545
Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr
1585 1590 1595
tta ggc ata agt ctt gga aac ggt ctt cag act tca aat aag ttg 32590
Leu Gly Ile Ser Leu Gly Asn Gly Leu Gln Thr Ser Asn Lys Leu
1600 1605 1610
ttg act gta caa cta act cat cct ctt aca ttc agc tca aat agc 32635
Leu Thr Val Gln Leu Thr His Pro Leu Thr Phe Ser Ser Asn Ser
1615 1620 1625
atc aca gta aaa aca gac aaa ggg cta tat att aac tcc agt gga 32680
Ile Thr Val Lys Thr Asp Lys Gly Leu Tyr Ile Asn Ser Ser Gly
1630 1635 1640
aac aga gga ctt gag gct aat ata agc cta aaa aga gga cta gtt 32725
Asn Arg Gly Leu Glu Ala Asn Ile Ser Leu Lys Arg Gly Leu Val
1645 1650 1655
ttt gac ggt aat gct att gca aca tat att gga aat ggc tta gac 32770
Phe Asp Gly Asn Ala Ile Ala Thr Tyr Ile Gly Asn Gly Leu Asp
1660 1665 1670
tat gga tct tat gat agt gat gga aaa aca aga ccc gta att acc 32815
Tyr Gly Ser Tyr Asp Ser Asp Gly Lys Thr Arg Pro Val Ile Thr
1675 1680 1685
aaa att gga gca gga tta aat ttt gat gct aac aaa gca ata gct 32860
Lys Ile Gly Ala Gly Leu Asn Phe Asp Ala Asn Lys Ala Ile Ala
1690 1695 1700
gtc aaa cta ggc aca ggt tta agt ttt gac tcc gct ggt gcc ttg 32905
Val Lys Leu Gly Thr Gly Leu Ser Phe Asp Ser Ala Gly Ala Leu
1705 1710 1715
aca gct gga aac aaa cag gat gac aag cta aca ctt tgg act acc 32950
Thr Ala Gly Asn Lys Gln Asp Asp Lys Leu Thr Leu Trp Thr Thr
1720 1725 1730
cct gac cca agc cct aat tgt caa tta ctt tca gac aga gat gcc 32995
Pro Asp Pro Ser Pro Asn Cys Gln Leu Leu Ser Asp Arg Asp Ala
1735 1740 1745
aaa ttt act ctc tgt ctt aca aaa tgc ggt agt caa ata cta ggc 33040
Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly
1750 1755 1760
act gtg gca gtg gcg gct gtt act gta gga tca gca cta aat cca 33085
Thr Val Ala Val Ala Ala Val Thr Val Gly Ser Ala Leu Asn Pro
1765 1770 1775
att aat gac aca gtc aaa agc gcc ata gtt ttc ctt aga ttt gat 33130
Ile Asn Asp Thr Val Lys Ser Ala Ile Val Phe Leu Arg Phe Asp
1780 1785 1790
tcc gat ggt gta ctc atg tca aac tca tca atg gta ggt gat tac 33175
Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly Asp Tyr
1795 1800 1805
tgg aac ttt agg gag gga cag acc act caa agt gta gcc tat aca 33220
Trp Asn Phe Arg Glu Gly Gln Thr Thr Gln Ser Val Ala Tyr Thr
1810 1815 1820
aat gct gtg gga ttc atg cca aat ata ggt gca tat cca aaa acc 33265
Asn Ala Val Gly Phe Met Pro Asn Ile Gly Ala Tyr Pro Lys Thr
1825 1830 1835
caa agt aaa aca cct aaa aat agc ata gtc agt cag gta tat tta 33310
Gln Ser Lys Thr Pro Lys Asn Ser Ile Val Ser Gln Val Tyr Leu
1840 1845 1850
act gga gaa act act atg cca atg aca cta acc ata act ttc aat 33355
Thr Gly Glu Thr Thr Met Pro Met Thr Leu Thr Ile Thr Phe Asn
1855 1860 1865
ggc act gat gaa aaa gac aca acc cca gtt agc acc tac tct atg 33400
Gly Thr Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met
1870 1875 1880
act ttt aca tgg cag tgg act gga gac tat aag gac aaa aat att 33445
Thr Phe Thr Trp Gln Trp Thr Gly Asp Tyr Lys Asp Lys Asn Ile
1885 1890 1895
acc ttt gct acc aac tca ttc tct ttt tcc tac atc gcc cag gaa 33490
Thr Phe Ala Thr Asn Ser Phe Ser Phe Ser Tyr Ile Ala Gln Glu
1900 1905 1910
taa tcccacccag caagccaacc ccttttccca ccacctttgt ctatatggaa 33543
actctgaaac agaaaaataa agttcaagtg ttttattgaa tcaacagttt tacaggactc 33603
gagcagttat ttttcctcca ccctcccagg acatggaata caccaccctc tccccccgca 33663
cagccttgaa catctgaatg ccattggtga tggacatgct tttggtctcc acgttccaca 33723
cagtttcaga gcgagccagt ctcggatcgg tcagggagat gaaaccctcc gggcactccc 33783
gcatctgcac ctcacagctc aacagctgag gattgtcctc ggtggtcggg atcacggtta 33843
tctggaagaa gcagaagagc ggcggtggga atcatagtcc gcgaacggga tcggccggtg 33903
gtgtcgcatc aggccccgca gcagtcgctg ccgccgccgc tccgtcaagc tgctgctcag 33963
ggggttcggg tccagggact ccctcagcat gatgcccacg gccctcagca tcagtcgtct 34023
ggtgcggcgg gcgcagcagc gcatgcgaat ctcgctcagg tcactgcagt acgtgcaaca 34083
caggaccacc aggttgttca acagtccata gttcaacacg ctccagccga aactcatcgc 34143
gggaaggatg ctacccacgt ggccgtcgta ccagatcctc aggtaaatca agtggcgctc 34203
cctccagaag acgctgccca tgtacatgat ctccttgggc atgtggcggt tcaccacctc 34263
ccggtaccac atcaccctct ggttgaacat gcagccccgg atgatcctgc ggaaccacag 34323
ggccagcacc gccccgcccg ccatgcagcg aagagacccc ggatcccggc aatgacaatg 34383
gaggacccac cgctcgtacc cgtggatcat ctgggagctg aacaagtcta tgttggcaca 34443
gcacaggcat atgctcatgc atctcttcag cactctcagc tcctcggggg tcaaaaccat 34503
atcccagggc acggggaact cttgcaggac agcgaacccc gcagaacagg gcaatcctcg 34563
cacataactt acattgtgca tggacagggt atcgcaatca ggcagcaccg ggtgatcctc 34623
caccagagaa gcgcgggtct cggtctcctc acagcgtggt aagggggccg gccgatacgg 34683
gtgatggcgg gacgcggctg atcgtg~ct cgaccgtgtc atgatgcagt tgctttcgga 34743
cattttcgta cttgctgtag cagaacctgg tccgggcgct gcacaccgat cgccggcggc 34803
ggtctcggcg cttggaacgc tcggtgttaa agttgtaaaa cagccactct ctcagaccgt 34863
gcagcagatc tagggcctca ggagtgatga agatcccatc atgcctgata gctctgatca 34923
catcgaccac cgtggaatgg gccaggccca gccagatgat gcaattttgt tgggtttcgg 34983
tgacggcggg ggagggaaga acaggaagaa ccatgattaa cttttaatcc aaacggtctc 35043
ggagcacttc aaaatgaagg tcacggagat ggcacctctc gcccccgctg tgttggtgga 35103
aaataacagc caggtcaaag gtgatacggt tctcgagatg ttccacggtg gcttccagca 35163
aagcctccac gcgcacatcc agaaacaaga caatagcgaa agcgggaggg ttctctaatt 35223
cctcaaccat catgttacac tcctgcacca tccccagata attttcattt ttccagcctt 35283
gaatgattcg aactagttcc tgaggtaaat ccaagccagc catgataaaa agctcgcgca 35343
gagcaccctc caccggcatt cttaagcaca ccctcataat tccaagatat tctgctcctg 35403
gttcacctgc agcagattga caagcggaat atcaaaatct ctgccgcgat ccctgagctc 35463
ctccctcagc aataactgta agtactcttt catatcgtct ccgaaatttt tagccatagg 35523
acccccagga ataagagaag ggcaagccac attacagata aaccgaagtc ccccccagtg 35583
agcattgcca aatgtaagat tgaaataagc atgctggcta gacccggtga tatcttccag 35643
ataactggac agaaaatcgg gtaagcaatt tttaagaaaa tcaacaaaag aaaaatcttc 35703
caggtgcacg tttagggcct cgggaacaac gatggagtaa gtgcaagggg tgcgttccag 35763
catggttagt tagctgatct gtaaaaaaac aaaaaataaa acattaaacc atgctagcct 35823
ggcgaacagg tgggtaaatc gttctctcca gcaccaggca ggccacgggg tctccggcgc 35883
gaccctcgta aaaattgtcg ctatgattga aaaccatcac agagagacgt tcccggtggc 35943
cggcgtgaat gattcgagaa gaagcataca cccccggaac attggagtcc gtgagtgaaa 36003
aaaagcggcc gaggaagcaa tgaggcacta caacgctcac tctcaagtcc agcaaagcga 36063
tgccatgcgg atgaagcaca aaattttcag gtgcgtaaaa aatgtaatta ctcccctcct 36123
gcacaggcag cgaagctccc gatccctcca gatacacata caaagcctca gcgtccatag 36183
cttaccgagc ggcagcagca gcggcacaca acaggcgcaa gagtcagaga aaagactgag 36243
ctctaacctg tccgcccgct ctctgctcaa tatatagccc cagatctaca ctgacgtaaa 36303
ggccaaagtc taaaaatacc cgccaaataa tcacacacgc ccagcacacg cccagaaacc 36363
ggtgacacac tcagaaaaat acgcgcactt cctcaaacgg ccaaactgcc gtcatttccg 36423
ggttcccacg ctacgtcatc aaaacacgac tttcaaattc cgtcgaccgt taaaaacatc 36483
acccgccccg cccctaacgg tcgccgctcc cgcagccaat caccttcctc cctccccaaa 36543
ttcaaacagc tcatttgcat attaacgcgc accaaaagtt tgaggtatat tattgatgat 36603
g 36604
<210>6
<211>529
<212>PRT
<2 13>黑猩猩腺病毒血清型Pan6
<400>6
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Val Ala Ala Ala Met Gln Pro Pro Leu Glu Ala
20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser
35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60
Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr
85 90 95
Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110
Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val
115 120 125
Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser
130 135 140
Arg Lys Thr Pro Asn Gly Val Asp Asp Asp Tyr Asp Gly Ser Gln Asp
145 150 155 160
Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe
165 170 175
Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn
180 185 190
Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly
195 200 205
Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr
210 215 220
Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp
225 230 235 240
Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu
245 250 255
Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu Gly Phe
260 265 270
Gln Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu
275 280 285
Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Asp Ser Thr Ala Ala Ala
290 295 300
Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe
305 310 315 320
Ala Ser Ala Ala Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys Ile
325 330 335
Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr Asn Val
340 345 350
Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr
355 360 365
Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr
370 375 380
Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro
385 390 395 400
Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser
405 410 415
Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser
420 425 430
Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr
435 440 445
Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val
450 455 460
Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala
465 470 475 480
Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val
485 490 495
Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val
500 505 510
Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser Arg Thr
515 520 525
Phe
<210>7
<211>942
<212>PRT
<213>黑猩猩腺病毒血清型Pan6
<400>7
Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Ser Ser Gln Trp Glu Gln Ala Lys Thr Gly Asn Gly Gly
130 135 140
Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met Gly Gly Glu Asn
145 150 155 160
Ile Thr Lys Asp Gly Leu Gln Ile Gly Thr Asp Val Thr Ala Asn Gln
165 170 175
Asn Lys Pro Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Val
180 185 190
Gly Glu Glu Asn Trp Gln Glu Thr Glu Asn Phe Tyr Gly Gly Arg Ala
195 200 205
Leu Lys Lys Asp Thr Asn Met Lys Pro Cys Tyr Gly Ser Tyr Ala Arg
210 215 220
Pro Thr Asn Glu Lys Gly Gly Gln Ala Lys Leu Lys Val Gly Asp Asp
225 230 235 240
Gly Val Pro Thr Lys Glu Phe Asp Ile Asp Leu Ala Phe Phe Asp Thr
245 250 255
Pro Gly Gly Thr Val Asn Gly Gln Asp Glu Tyr Lys Ala Asp Ile Val
260 265 270
Met Tyr Thr Glu Asn Thr Tyr Leu Glu Thr Pro Asp Thr His Val Val
275 280 285
Tyr Lys Pro Gly Lys Asp Asp Ala Ser Ser Glu Ile Asn Leu Val Gln
290 295 300
Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe
305 310 315 320
Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala
325 330 335
Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn
340 345 350
Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr
355 360 365
Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp
370 375 380
Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr
385 390 395 400
Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala Tyr Gln Gly Val
405 410 415
Lys Val Lys Asp Gly Gln Asp Gly Asp Val Glu Ser Glu Trp Glu Asn
420 425 430
Asp Asp Thr Val Ala Ala Arg Asn Gln Leu Cys Lys Gly Asn Ile Phe
435 440 445
Ala Met Glu Ile Asn Leu Gln Ala Asn Leu Trp Arg Ser Phe Leu Tyr
450 455 460
Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Thr
465 470 475 480
Asn Val Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly
485 490 495
Arg Val Thr Pro Pro Ser Leu Val Asp Ala Tyr Leu Asn Ile Gly Ala
500 505 510
Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His
515 520 525
Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg
530 535 540
Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys
545 550 555 560
Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg
565 570 575
Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg
580 585 590
Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn Leu Tyr Ala Thr
595 600 605
Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu
610 615 620
Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala
625 630 635 640
Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser
645 650 655
Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg
660 665 670
Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr
675 680 685
Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu
690 695 700
Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser
705 710 715 720
Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys
725 730 735
Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr
740 745 750
Lys Asp Trp Phe Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr
755 760 765
Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe
770 775 780
Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Val Asn
785 790 795 800
Tyr Lys Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser
805 810 815
Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr
820 825 830
Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Ala Ser
835 840 845
Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro
850 855 860
Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln
865 870 875 880
Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu
885 890 895
Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val
900 905 910
Phe Asp Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala
915 920 925
Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
930 935 940
<210>8
<211>443
<212>PRT
<213>黑猩猩腺病毒血清型Pan6
<400>8
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60
Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser
65 70 75 80
Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Ile Ser Asn Asn Thr
85 90 95
Ile Ser Leu Lys Thr Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu
100 105 110
Ser Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr
115 120 125
Leu Gly Ile Ser Leu Gly Asn Gly Leu Gln Thr Ser Asn Lys Leu Leu
130 135 140
Thr Val Gln Leu Thr His Pro Leu Thr Phe Ser Ser Asn Ser Ile Thr
145 150 155 160
Val Lys Thr Asp Lys Gly Leu Tyr Ile Asn Ser Ser Gly Asn Arg Gly
165 170 175
Leu Glu Ala Asn Ile Ser Leu Lys Arg Gly Leu Val Phe Asp Gly Asn
180 185 190
Ala Ile Ala Thr Tyr Ile Gly Asn Gly Leu Asp Tyr Gly Ser Tyr Asp
195 200 205
Ser Asp Gly Lys Thr Arg Pro Val Ile Thr Lys Ile Gly Ala Gly Leu
210 215 220
Asn Phe Asp Ala Asn Lys Ala Ile Ala Val Lys Leu Gly Thr Gly Leu
225 230 235 240
Ser Phe Asp Ser Ala Gly Ala Leu Thr Ala Gly Asn Lys Gln Asp Asp
245 250 255
Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gln Leu
260 265 270
Leu Ser Asp Arg Asp Ala Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly
275 280 285
Ser Gln Ile Leu Gly Thr Val Ala Val Ala Ala Val Thr Val Gly Ser
290 295 300
Ala Leu Asn Pro Ile Asn Asp Thr Val Lys Ser Ala Ile Val Phe Leu
305 310 315 320
Arg Phe Asp Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly
325 330 335
Asp Tyr Trp Asn Phe Arg Glu Gly Gln Thr Thr Gln Ser Val Ala Tyr
340 345 350
Thr Asn Ala Val Gly Phe Met Pro Asn Ile Gly Ala Tyr Pro Lys Thr
355 360 365
Gln Ser Lys Thr Pro Lys Asn Ser Ile Val Ser Gln Val Tyr Leu Thr
370 375 380
Gly Glu Thr Thr Met Pro Met Thr Leu Thr Ile Thr Phe Asn Gly Thr
385 390 395 400
Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met Thr Phe Thr
405 410 415
Trp Gln Trp Thr Gly Asp Tyr Lys Asp Lys Asn Ile Thr Phe Ala Thr
420 425 430
Asn Ser Phe Ser Phe Ser Tyr Ile Ala Gln Glu
435 440
<210>9
<211>36535
<212>DNA
<213>黑猩猩腺病毒血清型Pan7
<220>
<221>CDS
<222>(13874)..(15469)
<223>L2五邻体
<220>
<221>CDS
<222>(18288)..(21086)
<223>L3六邻体
<220>
<221>CDS
<222>(32094)..(33425)
<223>L5纤维
<400>9
catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60
atttggggag ggaggaaggt gattggccga gagacgggcg accgttaggg gcggggcggg 120
tgacgttttt aatacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300
aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360
ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420
ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480
tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540
cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600
gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660
gtggcgaccc tcctgagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720
atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780
gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840
cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900
aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960
aggaggcgat tcgagctgca tcgaaccagg gagtgaaagc tgcgggcgaa agctttagcc 1020
tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080
ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140
acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200
atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260
tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320
agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380
ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440
ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500
atccgtgttg actttaagtg cgtggtttat gactcagggg tggggactgt gggtatataa 1560
gcaggtgcag acctgtgtgg tcagttcaga gcaggactca tggagatctg gacggtcttg 1620
gaagactttc accagactag acagctgcta gagaactcat cggagggggt ctcttacctg 1680
tggagattct gcttcggtgg gcctctagct aagctagtct atagggccaa acaggattat 1740
aaggatcaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaacttg 1800
ggccatcagt ctcactttaa ccagagtatt ctgagagccc ttgacttttc tactcctggc 1860
agaactaccg ccgcggtagc cttttttgcc tttatccttg acaaatggag tcaagaaacc 1920
catttcagca gggattaccg tctggactgc ttagcagtag ctttgtggag aacatggagg 1980
tgccagcgcc tgaatgcaat ctccggctac ttgccagtac agccggtaga cacgctgagg 2040
atcctgagtc tccagtcacc ccaggaacac caacgccgcc agcagccgca gcaggagcag 2100
cagcaagagg aggaggagga tcgagaagag aacccgagag ccggtctgga ccctccggtg 2160
gcggaggagg aggagtagct gacttgtttc ccgagctgcg ccgggtgctg actaggtctt 2220
ccagtggacg ggagaggggg attaagcggg agaggcatga ggagactagc cacagaactg 2280
aactgactgt cagtctgatg agccgcaggc gcccagaatc ggtgtggtgg catgaggttc 2340
agtcgcaggg gatagatgag gtctcggtga tgcatgagaa atattccctg gaacaagtca 2400
agacttgttg gttggagcct gaggatgatt gggaggtagc catcaggaat tatgccaagc 2460
tggctctgaa gccagacaag aagtacaaga ttaccaaact gattaatatc agaaattcct 2520
gctacatttc agggaatggg gccgaggtgg agatcagtac ccaggagagg gtggccttca 2580
gatgttgtat gatgaatatg tacccggggg tggtgggcat ggagggagtc acctttatga 2640
acgcgaggtt caggggtgat gggtataatg gggtggtctt tatggccaac accaagctga 2700
cagtgcacgg atgctccttc tttgggttca ataacatgtg catcgaggcc tggggcagtg 2760
tttcagtgag gggatgcagc ttttcagcca actggatggg ggtcgtgggc agaaccaaga 2820
gcaaggtgtc agtgaagaaa tgcctgttcg agaggtgcca cctgggggtg atgagcgagg 2880
gcgaagccaa agtcaaacac tgcgcctcta ctgagacggg ctgctttgtg ctgatcaagg 2940
gcaatgccca agtcaagcat aacatgatct gtggggcctc ggatgagcgc ggctaccaga 3000
tgctgacctg cgccggtggg aacagccata tgctggccac cgtgcatgtg acctcgcacc 3060
cccgcaagac atggcccgag ttcgagcaca acgtcatgac ccgatgcaat gtgcacctgg 3120
ggtcccgccg aggcatgttc atgccctacc agtgcaacat gcaatttgtg aaggtgctgc 3180
tggagcccga tgccatgtcc agagtgagcc tgacgggggt gtttgacatg aatgtggagc 3240
tgtggaaaat tctgagatat gatgaatcca agaccaggtg ccgggcctgc gaatgcggag 3300
gcaagcacgc caggcttcag cccgtgtgtg tggaggtgac ggaggacctg cgacccgatc 3360
atttggtgtt gtcctgcaac gggacggagt tcggctccag cggggaagaa tctgactaga 3420
gtgagtagtg tttgggggag gtggagggct tgtatgaggg gcagaatgac taaaatctgt 3480
gtttttctgt gtgttgcagc agcatgagcg gaagcgcctc ctttgaggga ggggtattca 3540
gcccttatct gacggggcgt ctcccctcct gggcgggagt gcgtcagaat gtgatgggat 3600
ccacggtgga cggccggccc gtgcagcccg cgaactcttc aaccctgacc tacgcgaccc 3660
tgagctcctc gtccgtggac gcagctgccg ccgcagctgc tgcttccgcc gccagcgccg 3720
tgcgcggaat ggccctgggc gccggctact acagctctct ggtggccaac tcgacttcca 3780
ccaataatcc cgccagcctg aacgaggaga agctgctgct gctgatggcc cagctcgagg 3840
ccctgaccca gcgcctgggc gagctgaccc agcaggtggc tcagctgcag gcggagacgc 3900
gggccgcggt tgccacggtg aaaaccaaat aaaaaatgaa tcaataaata aacggagacg 3960
gttgttgatt ttaacacaga gtcttgaatc tttatttgat ttttcgcgcg cggtaggccc 4020
tggaccaccg gtctcgatca ttgagcaccc ggtggatttt ttccaggacc cggtagaggt 4080
gggcttggat gttgaggtac atgggcatga gcccgtcccg ggggtggagg tagctccatt 4140
gcagggcctc gtgctcgggg gtggtgttgt aaatcaccca gtcatagcag gggcgcaggg 4200
cgtggtgctg cacgatgtcc ttgaggagga gactgatggc cacgggcagc cccttggtgt 4260
aggtgttgac gaacctgttg agctgggagg gatgcatgcg gggggagatg agatgcatct 4320
tggcctggat cttgagattg gcgatgttcc cgcccagatc ccgccggggg ttcatgttgt 4380
gcaggaccac cagcacggtg tatccggtgc acttggggaa tttgtcatgc aacttggaag 4440
ggaaggcgtg aaagaatttg gagacgccct tgtgaccgcc caggttttcc atgcactcat 4500
ccatgatgat ggcgatgggc ccgtgggcgg cggcctgggc aaagacgttt cgggggtcgg 4560
acacatcgta gttgtggtcc tgggtgagct cgtcataggc cattttaatg aatttggggc 4620
ggagggtgcc cgactggggg acgaaggtgc cctcgatccc gggggcgtag ttgccctcgc 4680
agatctgcat ctcccaggcc ttgagctcgg agggggggat catgtccacc tgcggggcga 4740
tgaaaaaaac ggtttccggg gcgggggaga tgagctgggc cgaaagcagg ttccggagca 4800
gctgggactt gccgcagccg gtggggccgt agatgacccc gatgaccggc tgcaggtggt 4860
agttgaggga gagacagctg ccgtcctcgc ggaggagggg ggccacctcg ttcatcatct 4920
cgcgcacatg catgttctcg cgcacgagtt ccgccaggag gcgctcgccc cccagcgaga 4980
ggagctcttg cagcgaggcg aagtttttca gcggcttgag yccgtcggcc atgggcattt 5040
tggagagggt ctgttgcaag agttccagac ggtcccagag ctcggtgatg tgctctaggg 5100
catctcgatc cagcagacct cctcgtttcg cgggttgggg cgactgcggg agtagggcac 5160
caggcgatgg gcgtccagcg aggccagggt ccggtccttc cagggtcgca gggtccgcgt 5220
cagcgtggtc tccgtcacgg tgaaggggtg cgcgccgggc tgggcgcttg cgagggtgcg 5280
cttcaggctc atccggctgg tcgagaaccg ctcccggtcg gcgccctgcg cgtcggccag 5340
gtagcaattg agcatgagtt cgtagttgag cgcctcggcc gcgtggccct tggcgcggag 5400
cttacctttg gaagtgtgtc cgcagacggg acagaggagg gacttgaggg cgtagagctt 5460
gggggcgagg aagacggact cgggggcgta ggcgtccgcg ccgcagctgg cgcagacggt 5520
ctcgcactcc acgagccagg tgaggtcggg ccggttgggg tcaaaaacga ggtttcctcc 5580
gtgctttttg atgcgtttct tacctctggt ctccatgagc tcgtgtcccc gctgggtgac 5640
aaagaggctg tccgtgtccc cgtagaccga ctttatgggc cggtcctcga gcggggtgcc 5700
gcggtcctcg tcgtagagga accccgccca ctccgagacg aaggcccggg tccaggccag 5760
cacgaaggag gccacgtggg aggggtagcg gtcgttgtcc accagcgggt ccaccttctc 5820
cagggtatgc aagcacatgt ccccctcgtc cacatccagg aaggtgattg gcttgtaagt 5880
gtaggccacg tgaccggggg tcccggccgg gggggtataa aagggggcgg gcccctgctc 5940
gtcctcactg tcttccggat cgctgtccag gagcgccagc tgttggggta ggtattccct 6000
ctcgaaggct ggcataacct cggcactcag gttgtcagtt tctagaaacg aggaggattt 6060
gatattgacg gtgccgttgg agacgccttt catgagcccc tcgtccatct ggtcagaaaa 6120
gacgatcttt ttgttgtcga gcttggtggc gaaggagccg tagagggcgt tggagaggag 6180
cttggcgatg gagcgcatgg tctggttctt ttccttgtcg gcgcgctcct tggcggcgat 6240
gttgagctgc acgtactcgc gcgccacgca cttccattcg gggaagacgg tggtgagctc 6300
gtcgggcacg attctgaccc gccagccgcg gttgtgcagg gtgatgaggt ccacgctggt 6360
ggccacctcg ccgcgcaggg gctcgttggt ccagcagagg cgcccgccct tgcgcgagca 6420
gaaggggggc agcgggtcca gcatgagctc gtcggggggg tcggcgtcca cggtgaagat 6480
gccgggcaga agctcggggt cgaagtagct gatgcaggtg tccagatcgt ccagcgccgc 6540
ttgccagtcg cgcacggcca gcgcgcgctc gtaggggctg aggggcgtgc cccagggcat 6600
ggggtgcgtg agcgcggagg cgtacatgcc gcagatgtcg tagacgtaga ggggctcctc 6660
gaggacgccg atgtaggtgg ggtagcagcg ccccccgcgg atgctggcgc gcacgtagtc 6720
gtacagctcg tgcgagggcg cgaggagccc cgtgccgagg ttggagcgtt gcggcttttc 6780
ggcgcggtag acgatctggc ggaagatggc gtgggagttg gaggagatgg tgggcctctg 6840
gaagatgttg aagtgggcgt ggggcaggcc gaccgagtcc ctgatgaagt gggcgtagga 6900
gtcctgcagc ttggcgacga gctcggcggt gacgaggacg tccagggcgc agtagtcgag 6960
ggtctcttgg atgatgtcgt acttgagctg gcccttctgc ttccacagct cgcggttgag 7020
aaggaactct tcgcggtcct tccagtactc ttcgaggggg aacccgtcct gatcggcacg 7080
gtaagagccc accatgtaga actggttgac ggccttgtag gcgcagcagc ccttctccac 7140
ggggagggcg taagcttgtg cggccttgcg cagggaggtg tgggtgaggg cgaaggtgtc 7200
gcgcaccatg accttgagga actggtgctt gaagtcgagg tcgtcgcagc cgccctgctc 7260
ccagagctgg aagtccgtgc gcttcttgta ggcggggttg ggcaaagcga aagtaacatc 7320
gttgaagagg atcttgcccg cgcggggcat gaagttgcga gtgatgcgga aaggctgggg 7380
cacctcggcc cggttgttga tgacctgggc ggcgaggacg atctcgtcga agccgttgat 7440
gttgtgcccg acgatgtaga gttccacgaa tcgcgggcgg cccttaacgt ggggcagctt 7500
cttgagctcg tcgtaggtga gctcggcggg gtcgctgagc ccgtgctgct cgagggccca 7560
gtcggcgacg tgggggttgg cgctgaggaa ggaagtccag agatccacgg ccagggcggt 7620
ctgcaagcgg tcccggtact gacggaactg ctggcccacg gccatttttt cgggggtgac 7680
gcagtagaag gtgcgggggt cgccgtgcca gcggtcccac ttgagctgga gggcgaggtc 7740
gtgggcgagc tcgacgagcg gcgggtcccc ggagagtttc atgaccagca tgaaggggac 7800
gagctgcttg ccgaaggacc ccatccaggt gtaggtttcc acatcgtagg tgaggaagag 7860
cctttcggtg cgaggatgcg agccgatggg gaagaactgg atctcctgcc accagttgga 7920
ggaatggctg ttgatgtgat ggaagtagaa atgccgacgg cgcgccgagc actcgtgctt 7980
gtgtttatac aagcgtccgc agtgctcgca acgctgcacg ggatgcacgt gctgcacgag 8040
ctgtacctgg gttcctttga cgaggaattt cagtgggcag tggagcgctg gcggctgcat 8100
ctggtgctgt actacgtcct ggccatcggc gtggccatcg tctgcctcga tggtggtcat 8160
gctgacgagc ccgcgcggga ggcaggtcca gacttcggct cggacgggtc ggagagcgag 8220
gacgagggcg cgcaggccgg agctgtccag ggtcctgaga cgctgcggag tcaggtcagt 8280
gggcagcggc ggcgcgcggt tgacttgcag gagcttttcc agggcgcgcg ggaggtccag 8340
atggtacttg atctccacgg cgccgttggt ggcgacgtcc acggcttgca gggtcccgtg 8400
cccctggggc gccaccaccg tgccccgttt cttcttgggc gctgcttcca tgccggtcag 8460
aagcggcggc gaggacgcgc gccgggcggc aggggcggct cgggacccgg aggcaggggc 8520
ggcaggggca cgtcggcgcc gcgcgcgggc aggttctggt actgcgcccg gagaagactg 8580
gcgtgagcga cgacgcgacg gttgacgtcc tggatctgac gcctctgggt gaaggccacg 8640
ggacccgtga gtttgaacct gaaagagagt tcgacagaat caatctcggt atcgttgacg 8700
gcggcctgcc gcaggatctc ttgcacgtcg cccgagttgt cctggtaggc gatctcggtc 8760
atgaactgct cgatctcctc ctcctgaagg tctccgcggc cggcgcgctc gacggtggcc 8820
gcgaggtcgt tggagatgcg gcccatgagc tgcgagaagg cgttcatgcc ggcctcgttc 8880
cagacgcggc tgtagaccac ggctccgtcg gggtcgcgcg cgcgcatgac cacctgggcg 8940
aggttgagct cgacgtggcg cgtgaagacc gcgtagttgc agaggcgctg gtagaggtag 9000
ttgagcgtgg tggcgatgtg ctcggtgacg aagaagtaca tgatccagcg gcggagcggc 9060
atctcgctga cgtcgcccag ggcttccaag cgctccatgg cctcgtagaa gtccacggcg 9120
aagttgaaaa actgggagtt gcgcgccgag acggtcaact cctcctccag aagacggatg 9180
agctcagcga tggtggcgcg cacctcgcgc tcgaaggccc cggggggctc ctcttcttcc 9240
atctcttcct cctccactaa catctcttct acttcctcct caggaggcgg cggcggggga 9300
ggggccctgc gtcgccggcg gcgcacgggc agacggtcga tgaagcgctc gatggtctcc 9360
ccgcgccggc gacgcatggt ctcggtgacg gcgcgcccgt cctcgcgggg ccgcagcgtg 9420
aagacgccgc cgcgcatctc caggtggccg ccgggggggt ctccgttggg cagggagagg 9480
gcgctgacga tgcatcttat caattggccc gtagggactc cgcgcaagga cctgagcgtc 9540
tcgagatcca cgggatccga aaaccgctga acgaaggctt cgagccagtc gcagtcgcaa 9600
ggtaggctga gcccggtttc ttgttcttcg gggatttcgg gaggcgggcg ggcgatgctg 9660
ctggtgatga agttgaagta ggcggtcctg agacggcgga tggtggcgag gagcaccagg 9720
tccttgggcc cggcttgctg gatgcgcaga cggtcggcca tgccccaggc gtggtcctga 9780
cacctggcga ggtccttgta gtagtcctgc atgagccgct ccacgggcac ctcctcctcg 9840
cccgcgcggc cgtgcatgcg cgtgagcccg aacccgcgct ggggctggac gagcgccagg 9900
tcggcgacga cgcgctcggc gaggatggcc tgctgtatct gggtgagggt ggtctggaag 9960
tcgtcgaagt cgacgaagcg gtggtaggct ccggtgttga tggtatagga gcagttggcc 10020
atgacggacc agttgacggt ctggtggccg ggtcgcacga gctcgtggta cttgaggcgc 10080
gagtaggcgc gcgtgtcgaa gatgtagtcg ttgcaggtgc gcacgaggta ctggtatccg 10140
acgaggaagt gcggcggcgg ctggcggtag agcggccatc gctcggtggc gggggcgccg 10200
ggcgcgaggt cctcgagcat gaggcggtgg tagccgtaga tgtacctgga catccaggtg 10260
atgccggcgg cggtggtgga ggcgcgcggg aactcgcgga cgcggttcca gatgttgcgc 10320
agcggcagga agtagttcat ggtggccgcg gtctggcccg tgaggcgcgc gcagtcgtgg 10380
atgctctaga catacgggca aaaacgaaag cggtcagcgg ctcgactccg tggcctggag 10440
gctaagcgaa cgggttgggc tgcgcgtgta ccccggttcg aatctcgaat caggctggag 10500
ccgcagctaa cgtggtactg gcactcccgt ctcgacccaa gcctgctaac gaaacctcca 10560
ggatacggag gcgggtcgtt ttttggcctt ggtcgctggt catgaaaaac tagtaagcgc 10620
ggaaagcgac cgcccgcgat ggctcgctgc cgtagtctgg agaaagaatc gccagggttg 10680
cgttgcggtg tgccccggtt cgagcctcag cgctcggcgc cggccggatt ccgcggctaa 10740
cgtgggcgtg gctgccccgt cgtttccaag accccttagc cagccgactt ctccagttac 10800
ggagcgagcc cctctttttc ttgtgttttt gccagatgca tcccgtactg cggcagatgc 10860
gcccccaccc tccacctcaa ccgcccctac cgccgcagca gcagcaacag ccggcgcttc 10920
tgcccccgcc ccagcagcag ccagccacta ccgcggcggc cgccgtgagc ggagccggcg 10980
ttcagtatga cctggccttg gaagagggcg aggggctggc gcggctgggg gcgtcgtcgc 11040
cggagcggca cccgcgcgtg cagatgaaaa gggacgctcg cgaggcctac gtgcccaagc 11100
agaacctgtt cagagacagg agcggcgagg agcccgagga gatgcgcgcc tcccgcttcc 11160
acgcggggcg ggagctgcgg cgcggcctgg accgaaagcg ggtgctgagg gacgaggatt 11220
tcgaggcgga cgagctgacg gggatcagcc ccgcgcgcgc gcacgtggcc gcggccaacc 11280
tggtcacggc gtacgagcag accgtgaagg aggagagcaa cttccaaaaa tccttcaaca 11340
accacgtgcg cacgctgatc gcgcgcgagg aggtgaccct gggcctgatg cacctgtggg 11400
acctgctgga ggccatcgtg cagaacccca cgagcaagcc gctgacggcg cagctgtttc 11460
tggtggtgca gcacagtcgg gacaacgaga cgttcaggga ggcgctgctg aatatcaccg 11520
agcccgaggg ccgctggctc ctggacctgg tgaacattct gcagagcatc gtggtgcagg 11580
agcgcgggct gccgctgtcc gagaagctgg cggctatcaa cttctcggtg ctgagcctgg 11640
gcaagtacta cgctaggaag atctacaaga ccccgtacgt gcccatagac aaggaggtga 11700
agatcgacgg gttttacatg cgcatgaccc tgaaagtgct gaccctgagc gacgatctgg 11760
gggtgtaccg caacgacagg atgcaccgcg cggtgagcgc cagccgccgg cgcgagctga 11820
gcgaccagga gctgatgcac agcctgcagc gggccctgac cggggccggg accgaggggg 11880
agagctactt tgacatgggc gcggacctgc gctggcagcc cagccgccgg gccttggaag 11940
ctgccggcgg ttccccctac gtggaggagg tggacgatga ggaggaggag ggcgagtacc 12000
tggaagactg atggcgcgac cgtatttttg ctagatgcag caacagccac cgcctcctga 12060
tcccgcgatg cgggcggcgc tgcagagcca gccgtccggc attaactcct cggacgattg 12120
gacccaggcc atgcaacgca tcatggcgct gacgacccgc aatcccgaag cctttagaca 12180
gcagcctcag gccaaccggc tctcggccat cctggaggcc gtggtgccct cgcgctcgaa 12240
ccccacgcac gagaaggtgc tggccatcgt gaacgcgctg gtggagaaca aggccatccg 12300
cggcgacgag gccgggctgg tgtacaacgc gctgctggag cgcgtggccc gctacaacag 12360
caccaacgtg cagacgaacc tggaccgcat ggtgaccgac gtgcgcgagg cggtgtcgca 12420
gcgcgagcgg ttccaccgcg agtcgaacct gggctccatg gtggcgctga acgccttcct 12480
gagcacgcag cccgccaacg tgccccgggg ccaggaggac tacaccaact tcatcagcgc 12540
gctgcggctg atggtggccg aggtgcccca gagcgaggtg taccagtcgg ggccggacta 12600
cttcttccag accagtcgcc agggcttgca gaccgtgaac ctgagccagg ctttcaagaa 12660
cttgcaggga ctgtggggcg tgcaggcccc ggtcggggac cgcgcgacgg tgtcgagcct 12720
gctgacgccg aactcgcgcc tgctgctgct gctggtggcg cccttcacgg acagcggcag 12780
cgtgagccgc gactcgtacc tgggctacct gcttaacctg taccgcgagg ccatcgggca 12840
ggcgcacgtg gacgagcaga cctaccagga gatcacccac gtgagccgcg cgctgggcca 12900
ggaggacccg ggcaacctgg aggccaccct gaacttcctg ctgaccaacc ggtcgcagaa 12960
gatcccgccc cagtacgcgc tgagcaccga ggaggagcgc atcctgcgct acgtgcagca 13020
gagcgtgggg ctgttcctga tgcaggaggg ggccacgccc agcgccgcgc tcgacatgac 13080
cgcgcgcaac atggagccca gcatgtacgc tcgcaaccgc ccgttcatca ataagctgat 13140
ggactacttg catcgggcgg ccgccatgaa ctcggactac tttaccaacg ccatcttgaa 13200
cccgcactgg ctcccgccgc ccgggttcta cacgggcgag tacgacatgc ccgaccccaa 13260
cgacgggttc ctgtgggacg acgtggacag cagcgtgttc tcgccgcgcc ccgccaccac 13320
cgtgtggaag aaagagggcg gggaccggcg gccgtcctcg gcgctgtccg gtcgcgcggg 13380
tgctgccgcg gcggtgcctg aggccgccag ccccttcccg agcctgccct tttcgctgaa 13440
cagcgtgcgc agcagcgagc tgggtcggct gacgcggccg cgcctgctgg gcgaggagga 13500
gtacctgaac gactccttgt tgaggcccga gcgcgagaag aacttcccca ataacgggat 13560
agagagcctg gtggacaaga tgagccgctg gaagacgtac gcgcacgagc acagggacga 13620
gccccgagct agcagcagcg caggcacccg tagacgccag cgacacgaca ggcagcgggg 13680
tctggtgtgg gacgatgagg attccgccga cgacagcagc gtgttggact tgggtgggag 13740
tggtggtggt aacccgttcg ctcacttgcg cccccgtatc gggcgcctga tgtaagaatc 13800
tgaaaaaata aaaaacggta ctcaccaagg ccatggcgac cagcgtgcgt tcttctctgt 13860
tgtttgtagt agt atg atg agg cgc gtg tac ccg gag ggt cct cct ccc 13909
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro
1 5 10
tcg tac gag agc gtg atg cag cag gcg gtg gcg gcg gcg atg cag ccc 13957
Ser Tyr Glu Ser Val Met Gln Gln Ala Val Ala Ala Ala Met Gln Pro
15 20 25
ccg ctg gag gcg cct tac gtg ccc ccg cgg tac ctg gcg cct acg gag 14005
Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu
30 35 40
ggg cgg aac agc att cgt tac tcg gag ctg gca ccc ttg tac gat acc 14053
Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr
45 50 55 60
acc cgg ttg tac ctg gtg gac aac aag tcg gcg gac atc gcc tcg ctg 14101
Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu
65 70 75
aac tac cag aac gac cac agc aac ttc ctg acc acc gtg gtg cag aac 14149
Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn
80 85 90
aac gat ttc acc ccc acg gag gcc agc acc cag acc atc aac ttt gac 14197
Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp
95 100 105
gag cgc tcg cgg tgg ggc ggc cag ctg aaa acc atc atg cac acc aac 14245
Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn
110 115 120
atg ccc aac gtg aac gag ttc atg tac agc aac aag ttc aag gcg cgg 14293
Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg
125 130 135 140
gtg atg gtc tcg cgc aag acc ccc aat ggg gtc gcg gtg gat gag aat 14341
Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Ala Val Asp Glu Asn
145 150 155
tat gat ggt agt cag gac gag ctg act tac gag tgg gtg gag ttt gag 14389
Tyr Asp Gly Ser Gln Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu
160 165 170
ctg ccc gag ggc aac ttc tcg gtg acc atg acc atc gat ctg atg aac 14437
Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn
175 180 185
aac gcc atc atc gac aac tac ttg gcg gtg ggg cgt cag aac ggg gtg 14485
Asn Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val
190 195 200
ctg gag agc gac atc ggc gtg aag ttc gac acg cgc aac ttc cgg ctg 14533
Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu
205 210 215 220
ggc tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg tac acc aac 14581
Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn
225 230 235
gag gcc ttc cac ccc gac atc gtc ctg ctg ccc ggc tgc ggc gtg gac 14629
Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp
240 245 250
ttc acc gag agc cgc ctc agc aac ctg ctg ggc atc cgc aag cgg cag 14677
Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln
255 260 265
ccc ttc cag gag ggc ttc cag atc ctg tac gag gac ctg gag ggg ggc 14725
Pro Phe Gln Glu Gly Phe Gln Ile Leu Tyr Glu Asp Leu Glu Gly Gly
270 275 280
aac atc ccc gcg ctc ttg gat gtc gaa gcc tat gag aaa agc aag gag 14773
Asn Ile Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu
285 290 295 300
gag gcc gcc gca gcg gcg acc gca gcc gtg gcc acc gcc tct acc gag 14821
Glu Ala Ala Ala Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu
305 310 315
gtg cgg ggc gat aat ttt gct agc gcc gcg gca gtg gcc gag gcg gct 14869
Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Val Ala Glu Ala Ala
320 325 330
gaa acc gaa agt aag ata gtc atc cag ccg gtg gag aag gac agc aag 14917
Glu Thr Glu Ser Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys
335 340 345
gac agg agc tac aac gtg ctc gcg gac aag aaa aac acc gcc tac cgc 14965
Asp Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg
350 355 360
agc tgg tac ctg gcc tac aac tac ggc gac ccc gag aag ggc gtg cgc 15013
Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg
365 370 375 380
tcc tgg acg ctg ctc acc acc tcg gac gtc acc tgc ggc gtg gag caa 15061
Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln
385 390 395
gtc tac tgg tcg ctg ccc gac atg atg caa gac ccg gtc acc ttc cgc 15109
Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg
400 405 410
tcc acg cgt caa gtt agc aac tac ccg gtg gtg ggc gcc gag ctc ctg 15157
Ser Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu
415 420 425
ccc gtc tac tcc aag agc ttc ttc aac gag cag gcc gtc tac tcg cag 15205
Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln
430 435 440
cag ctg cgc gcc ttc acc tcg ctc acg cac gtc ttc aac cgc ttc ccc 15253
Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro
445 450 455 460
gag aac cag atc ctc gtc cgc ccg ccc gcg ccc acc att acc acc gtc 15301
Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val
465 470 475
agt gaa aac gtt cct gct ctc aca gat cac ggg acc ctg ccg ctg cgc 15349
Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg
480 485 490
agc agt atc cgg gga gtc cag cgc gtg acc gtc act gac gcc aga cgc 15397
Ser Ser Ile Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg
495 500 505
cgc acc tgc ccc tac gtc tac aag gcc ctg ggc gta gtc gcg ccg cgc 15445
Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg
510 515 520
gtc ctc tcg agc cgc acc ttc taa aaaatgtcca ttctcatctc gcccagtaat 15499
Val Leu Ser Ser Arg Thr Phe
525 530
aacaccggtt ggggcctgcg cgcgcccagc aagatgtacg gaggcgctcg ccaacgctcc 15559
acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc cctggggcgc cctcaagggc 15619
cgcgtgcgct cgcgcaccac cgtcgacgac gtgatcgacc aggtggtggc cgacgcgcgc 15679
aactacacgc ccgccgccgc gcccgcctcc accgtggacg ccgtcatcga cagcgtggtg 15739
gccgatgcgc gccggtacgc ccgcgccaag agccggcggc ggcgcatcgc ccggcggcac 15799
cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc gcagggccag gcgcacggga 15859
cgcagggcca tgctcagggc ggccagacgc gcggcctccg gcagcagcag cgccggcagg 15919
acccgcagac gcgcggccac ggcggcggcg gcggccatcg ccagcatgtc ccgcccgcgg 15979
cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg tgcgcgtgcc cgtgcgcacc 16039
cgcccccctc gcacttgaag atgctgactt cgcgatgttg atgtgtccca gcggcgagga 16099
ggatgtccaa gcgcaaatac aaggaagaga tgctccaggt catcgcgcct gagatctacg 16159
gccccgcggt gaaggaggaa agaaagcccc gcaaactgaa gcgggtcaaa aaggacaaaa 16219
aggaggagga agatgtggac ggactggtgg agtttgtgcg cgagttcgcc ccccggcggc 16279
gcgtgcagtg gcgcgggcgg aaagtgaaac cggtgctgcg gcccggcacc acggtggtct 16339
tcacgcccgg cgagcgttcc ggctccgcct ccaagcgctc ctacgacgag gtgtacgggg 16399
acgaggacat cctcgagcag gcggtcgagc gtctgggcga gtttgcttac ggcaagcgca 16459
gccgccccgc gcccttgaaa gaggaggcgg tgtccatccc gctggaccac ggcaacccca 16519
cgccgagcct gaagccggtg accctgcagc aggtgctgcc gagcgcggcg ccgcgccggg 16579
gcttcaagcg cgagggcggc gaggatctgt acccgaccat gcagctgatg gtgcccaagc 16639
gccagaagct ggaggacgtg ctggagcaca tgaaggtgga ccccgaggtg cagcccgagg 16699
tcaaggtgcg gcccatcaag caggtggccc cgggcctggg cgtgcagacc gtggacatca 16759
agatccccac ggagcccatg gaaacgcaga ccgagcccgt gaagcccagc accagcacca 16819
tggaggtgca gacggatccc tggatgccgg cgccggcttc caccactcgc cgaagacgca 16879
agtacggcgc ggccagcctg ctgatgccca actacgcgct gcatccttcc atcatcccca 16939
cgccgggcta ccgcggcacg cgcttctacc gcggctacac cagcagccgc cgcaagacca 16999
ccacccgccg ccgccgtcgt cgcacccgcc gcagcagcac cgcgacttcc gccgccgccc 17059
tggtgcggag agtgtaccgc agcgggcgcg agcctctgac cctgccgcgc gcgcgctacc 17119
acccgagcat cgccatttaa ctctgccgtc gcctcctact tgcagatatg gccctcacat 17179
gccgcctccg cgtccccatt acgggctacc gaggaagaaa gccgcgccgt agaaggctga 17239
cggggaacgg gctgcgtcgc catcaccacc ggcggcggcg cgccatcagc aagcggttgg 17299
ggggaggctt cctgcccgcg ctgatcccca tcatcgccgc ggcgatcggg gcgatccccg 17359
gcatagcttc cgtggcggtg caggcctctc agcgccactg agacacagct tggaaaattt 17419
gtaataaaaa aatggactga cgctcctggt cctgtgatgt gtgtttttag atggaagaca 17479
tcaatttttc gtccctggca ccgcgacacg gcacgcggcc gtttatgggc acctggagcg 17539
acatcggcaa cagccaactg aacgggggcg ccttcaattg gagcagtctc tggagcgggc 17599
ttaagaattt cgggtccacg ctcaaaacct atggcaacaa ggcgtggaac agcagcacag 17659
ggcaggcgct gagggaaaag ctgaaagagc agaacttcca gcagaaggtg gtcgatggcc 17719
tggcctcggg catcaacggg gtggtggacc tggccaacca ggccgtgcag aaacagatca 17779
acagccgcct ggacgcggtc ccgcccgcgg ggtccgtgga gatgccccag gtggaggagg 17839
agctgcctcc cctggacaag cgcggcgaca agcgaccgcg tcccgacgcg gaggagacgc 17899
tgctgacgca cacggacgag ccgcccccgt acgaggaggc ggtgaaactg ggtctgccca 17959
ccacgcggcc cgtggcgcct ctggccaccg gggtgctgaa acccagcagc agcagccagc 18019
ccgcgaccct ggacttgcct ccgcctgctt cccgcccctc cacagtggct aagcccctgc 18079
cgccggtggc cgtcgcgtcg cgcgcccccc gaggccgccc ccaggcgaac tggcagagca 18139
ctctgaacag catcgtgggt ctgggagtgc agagtgtgaa gcgccgccgc tgctattaaa 18199
agacactgta gcgcttaact tgcttgtctg tgtgtatatg tatgtccgcc gaccagaagg 18259
aggaagaggc gcgtcgccga gttgcaag atg gcc acc cca tcg atg ctg ccc 18311
Met Ala Thr Pro Ser Met Leu Pro
535
cag tgg gcg tac atg cac atc gcc gga cag gac gct tcg gag tac ctg 18359
Gln Trp Ala Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu
540 545 550 555
agt ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt 18407
Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser
560 565 570
ctg ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg 18455
Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val
575 580 585
acc acc gac cgc agc cag cgg ctg acg ctg cgc ttc gtg ccc gtg gac 18503
Thr Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val Pro Val Asp
590 595 600
cgc gag gac aac acc tac tcg tac aaa gtg cgc tac acg ctg gcc gtg 18551
Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val
605 610 615
ggc gac aac cgc gtg ctg gac atg gcc agc acc tac ttt gac atc cgc 18599
Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg
620 625 630 635
ggc gtg ctg gat cgg ggg ccc agc ttc aaa ccc tac tcc ggc acc gcc 18647
Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala
640 645 650
tac aac agc ctg gct ccc aag gga gcg ccc aac act tgc cag tgg aca 18695
Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Thr Cys Gln Trp Thr
655 660 665
tat aaa gct ggt gat act gat aca gaa aaa acc tat aca tat gga aat 18743
Tyr Lys Ala Gly Asp Thr Asp Thr Glu Lys Thr Tyr Thr Tyr Gly Asn
670 675 680
gca cct gtg caa ggc att agc att aca aag gat ggt att caa ctt gga 18791
Ala Pro Val Gln Gly Ile Ser Ile Thr Lys Asp Gly Ile Gln Leu Gly
685 690 695
act gac agc gat ggt cag gca atc tat gca gac gaa act tat caa cca 18839
Thr Asp Ser Asp Gly Gln Ala Ile Tyr Ala Asp Glu Thr Tyr Gln Pro
700 705 710 715
gag cct caa gtg ggt gat gct gaa tgg cat gac atc act ggt act gat 18887
Glu Pro Gln Val Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp
720 725 730
gaa aaa tat gga ggc aga gct ctt aag cct gac acc aaa atg aag cct 18935
Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro
735 740 745
tgc tat ggt tct ttt gcc aag cct acc aat aaa gaa gga ggc cag gca 18983
Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala
750 755 760
aat gtg aaa acc gaa aca ggc ggt acc aaa gaa tat gac att gac atg 19031
Asn Val Lys Thr Glu Thr Gly Gly Thr Lys Glu Tyr Asp Ile Asp Met
765 770 775
gca ttc ttc gat aat cga agt gca gct gcc gcc ggc cta gcc cca gaa 19079
Ala Phe Phe Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu
780 785 790 795
att gtt ttg tat act gag aat gtg gat ctg gaa act cca gat acc cat 19127
Ile Val Leu Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His
800 805 810
att gta tac aag gca ggt aca gat gac agt agc tct tct atc aat ttg 19175
Ile Val Tyr Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu
815 820 825
ggt cag cag tcc atg ccc aac aga ccc aac tac att ggc ttc aga gac 19223
Gly Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp
830 835 840
aac ttt atc ggt ctg atg tac tac aac agc act ggc aat atg ggt gta 19271
Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val
845 850 855
ctg gct gga cag gcc tcc cag ctg aat gct gtg gtg gac ttg cag gac 19319
Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp
860 865 870 875
aga aac acc gaa ctg tcc tac cag ctc ttg ctt gac tct ctg ggt gac 19367
Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp
880 885 890
aga acc agg tat ttc agt atg tgg aat cag gcg gtg gac agt tat gac 19415
Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp
895 900 905
ccc gat gtg cgc att att gaa aat cac ggt gtg gag gat gaa ctt cct 19463
Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro
910 915 920
aac tat tgc ttc ccc ctg gat gct gtg ggt aga act gat act tac cag 19511
Asn Tyr Cys Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gln
925 930 935
gga att aag gcc aat ggt gat aat caa acc acc tgg acc aaa gat gat 19559
Gly Ile Lys Ala Asn Gly Asp Asn Gln Thr Thr Trp Thr Lys Asp Asp
940 945 950 955
act gtt aat gat gct aat gaa ttg ggc aag ggc aat cct ttc gcc atg 19607
Thr Val Asn Asp Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe Ala Met
960 965 970
gag atc aac atc cag gcc aac ctg tgg cgg aac ttc ctc tac gcg aac 19655
Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn
975 980 985
gtg gcg ctg tac ctg ccc gac tcc tac aag tac acg ccg gcc aac atc 19703
Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Ala Asn Ile
990 995 1000
acg ctg ccc acc aac acc aac acc tac gat tac atg aac ggc cgc 19748
Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg
1005 1010 1015
gtg gtg gcg ccc tcg ctg gtg gac gcc tac atc aac atc ggg gcg 19793
Val Val Ala Pro Ser Leu Val Asp Ala Tyr Ile Asn Ile Gly Ala
1020 1025 1030
cgc tgg tcg ctg gac ccc atg gac aac gtc aac ccc ttc aac cac 19838
Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His
1035 1040 1045
cac cgc aac gcg ggc ctg cga tac cgc tcc atg ctc ctg ggc aac 19883
His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn
1050 1055 1060
ggg cgc tac gtg ccc ttc cac atc cag gtg ccc caa aag ttt ttc 19928
Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe
1065 1070 1075
gcc atc aag agc ctc ctg ctc ctg ccc ggg tcc tac acc tac gag 19973
Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu
1080 1085 1090
tgg aac ttc cgc aag gac gtc aac atg atc ctg cag agc tcc ctc 20018
Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu
1095 1100 1105
ggc aac gac ctg cgc acg gac ggg gcc tcc atc gcc ttc acc agc 20063
Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser
1110 1115 1120
atc aac ctc tac gcc acc ttc ttc ccc atg gcg cac aac acc gcc 20108
Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala
1125 1130 1135
tcc acg ctc gag gcc atg ctg cgc aac gac acc aac gac cag tcc 20153
Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser
1140 1145 1150
ttc aac gac tac ctc tcg gcg gcc aac atg ctc tac ccc atc ccg 20198
Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro
1155 1160 1165
gcc aac gcc acc aac gtg ccc atc tcc atc ccc tcg cgc aac tgg 20243
Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp
1170 1175 1180
gcc gcc ttc cgc ggc tgg tcc ttc acg cgc ctc aag acc cgc gag 20288
Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu
1185 1190 1195
acg ccc tcg ctc ggc tcc ggg ttc gac ccc tac ttc gtc tac tcg 20333
Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser
1200 1205 1210
ggc tcc atc ccc tac ctc gac ggc acc ttc tac ctc aac cac acc 20378
Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr
1215 1220 1225
ttc aag aag gtc tcc atc acc ttc gac tcc tcc gtc agc tgg ccc 20423
Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser Trp Pro
1230 1235 1240
ggc aac gac cgc ctc ctg acg ccc aac gag ttc gaa atc aag cgc 20468
Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg
1245 1250 1255
acc gtc gac gga gag ggg tac aac gtg gcc cag tgc aac atg acc 20513
Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr
1260 1265 1270
aag gac tgg ttc ctg gtc cag atg ctg gcc cac tac aac atc ggc 20558
Lys Asp Trp Phe Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly
1275 1280 1285
tac cag ggc ttc tac gtg ccc gag ggc tac aag gac cgc atg tac 20603
Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr
1290 1295 1300
tcc ttc ttc cgc aac ttc cag ccc atg agc cgc cag gtc gtg gac 20648
Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp
1305 1310 1315
gag gtc aac tac aag gac tac cag gcc gtc acc ctg gcc tac cag 20693
Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln
1320 1325 1330
cac aac aac tcg ggc ttc gtc ggc tac ctc gcg ccc acc atg cgc 20738
His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg
1335 1340 1345
cag ggc cag ccc tac ccc gcc aac tac ccc tac ccg ctc atc ggc 20783
Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly
1350 1355 1360
aag agc gcc gtc gcc agc gtc acc cag aaa aag ttc ctc tgc gac 20828
Lys Ser Ala Val Ala Ser Val Thr Gln Lys Lys Phe Leu Cys Asp
1365 1370 1375
cgg gtc atg tgg cgc atc ccc ttc tcc agc aac ttc atg tcc atg 20873
Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met
1380 1385 1390
ggc gcg ctc acc gac ctc ggc cag aac atg ctc tac gcc aac tcc 20918
Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser
1395 1400 1405
gcc cac gcg cta gac atg aat ttc gaa gtc gac ccc atg gat gag 20963
Ala His Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu
1410 1415 1420
tcc acc ctt ctc tat gtt gtc ttc gaa gtc ttc gac gtc gtc cga 21008
Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg
1425 1430 1435
gtg cac cag ccc cac cgc ggc gtc atc gag gcc gtc tac ctg cgc 21053
Val His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg
1440 1445 1450
acg ccc ttc tcg gcc ggc aac gcc acc acc taa gcctcttgct 21096
Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
1455 1460
tcttgcaaga tgacggcctg cgcgggctcc ggcgagcagg agctcagggc catcctccgc 21156
gacctgggct gcgggccctg cttcctgggc accttcgaca agcgcttccc gggattcatg 21216
gccccgcaca agctggcctg cgccatcgtc aacacggccg gccgcgagac cgggggcgag 21276
cactggctgg ccttcgcctg gaacccgcgc tcccacacct gctacctctt cgaccccttc 21336
gggttctcgg acgagcgcct caagcagatc taccagttcg agtacgaggg cctgctgcgt 21396
cgcagcgccc tggccaccga ggaccgctgc gtcaccctgg aaaagtccac ccagaccgtg 21456
cagggtccgc gctcggccgc ctgcgggctc ttctgctgca tgttcctgca cgccttcgtg 21516
cactggcccg accgccccat ggacaagaac cccaccatga acttgctgac gggggtgccc 21576
aacggcatgc tccagtcgcc ccaggtggaa cccaccctgc gccgcaacca ggaggcgctc 21636
taccgcttcc tcaacgccca ctccgcctac tttcgctccc accgcgcgcg catcgagaag 21696
gccaccgcct tcgaccgcat gaatcaagac atgtaatccg gtgtgtgtat gtgaatgctt 21756
tattcatcat aataaacagc acatgtttat gccaccttct ctgaggctct gactttattt 21816
agaaatcgaa ggggttctgc cggctctcgg catggcccgc gggcagggat acgttgcgga 21876
actggtactt gggcagccac ttgaactcgg ggatcagcag cttcggcacg gggaggtcgg 21936
ggaacgagtc gctccacagc ttgcgcgtga gttgcagggc gcccagcagg tcgggcgcgg 21996
agatcttgaa atcgcagttg ggacccgcgt tctgcgcgcg agagttacgg tacacggggt 22056
tgcagcactg gaacaccatc agggccgggt gcttcacgct cgccagcacc gtcgcgtcgg 22116
tgatgccctc cacgtccaga tcctcggcgt tggccatccc gaagggggtc atcttgcagg 22176
tctgccgccc catgctgggc acgcagccgg gcttgtggtt gcaatcgcag tgcaggggga 22236
tcagcatcat ctgggcctgc tcggagctca tgcccgggta catggccttc atgaaagcct 22296
ccagctggcg gaaggcctgc tgcgccttgc cgccctcggt gaagaagacc ccgcaggact 22356
tgctagagaa ctggttggtg gcgcagccag cgtcgtgcac gcagcagcgc gcgtcgttgt 22416
tggccagctg caccacgctg cgcccccagc ggttctgggt gatcttggcc cggtcggggt 22476
tctccttcag cgcgcgctgc ccgttctcgc tcgccacatc catctcgatc gtgtgctcct 22536
tctggatcat cacggtcccg tgcaggcacc gcagcttgcc ctcggcctcg gtgcacccgt 22596
gcagccacag cgcgcagccg gtgctctccc agttcttgtg ggcgatctgg gagtgcgagt 22656
gcacgaagcc ctgcaggaag cggcccatca tcgtggtcag ggtcttgttg ctggtgaagg 22716
tcagcggaat gccgcggtgc tcctcgttca catacaggtg gcagatacgg cggtacacct 22776
cgccctgctc gggcatcagc tggaaggcgg acttcaggtc gctctccacg cggtaccggt 22836
ccatcagcag cgtcatcact tccatgccct tctcccaggc cgaaacgatc ggcaggctca 22896
gggggttctt caccgttgtc atcttagtcg ccgccgccga agtcaggggg tcgttctcgt 22956
ccagggtctc aaacactcgc ttgccgtcct tctcggtgat gcgcacgggg ggaaagctga 23016
agcccacggc cgccagctcc tcctcggcct gcctttcgtc ctcgctgtcc tggctgatgt 23076
cttgcaaagg cacatgcttg gtcttgcggg gtttcttttt gggcggcaga ggcggcggcg 23136
gagacgtgct gggcgagcgc gagttctcgc tcaccacgac tatttcttct ccttggccgt 23196
cgtccgagac cacgcggcgg taggcatgcc tcttctgggg cagaggcgga ggcgacgggc 23256
tctcgcggtt cggcgggcgg ctggcagagc cccttccgcg ttcgggggtg cgctcctggc 23316
ggcgctgctc tgactgactt cctccgcggc cggccattgt gttctcctag ggagcaagca 23376
tggagactca gccatcgtcg ccaacatcgc catctgcccc cgccgccgcc gacgagaacc 23436
agcagcagca gaatgaaagc ttaaccgccc cgccgcccag ccccacctcc gacgccgcag 23496
ccccagacat gcaagagatg gaggaatcca tcgagattga cctgggctac gtgacgcccg 23556
cggagcacga ggaggagctg gcagcgcgct tttcagcccc ggaagagaac caccaagagc 23616
agccagagca ggaagcagag agcgagcaga accaggctgg gctcgagcat ggcgactacc 23676
tgagcggggc agaggacgtg ctcatcaagc atctggcccg ccaatgcatc atcgtcaagg 23736
acgcgctgct cgaccgcgcc gaggtgcccc tcagcgtggc ggagctcagc cgcgcctacg 23796
agcgcaacct cttctcgccg cgcgtgcccc ccaagcgcca gcccaacggc acctgcgagc 23856
ccaacccgcg cctcaacttc tacccggtct tcgcggtgcc cgaggccctg gccacctacc 23916
acctcttttt caagaaccaa aggatccccg tctcctgccg cgccaaccgc acccgcgccg 23976
acgccctgct caacctgggc cccggcgccc gcctacctga tatcgcctcc ttggaagagg 24036
ttcccaagat cttcgagggt ctgggcagcg acgagactcg ggccgcgaac gctctgcaag 24096
gaagcggaga ggagcatgag caccacagcg ccctggtgga gttggaaggc gacaacgcgc 24156
gcctggcggt cctcaagcgc acggtcgagc tgacccactt cgcctacccg gcgctcaacc 24216
tgccccccaa ggtcatgagc gccgtcatgg accaggtgct catcaagcgc gcctcgcccc 24276
tctcggagga ggagatgcag gaccccgaga gctcggacga gggcaagccc gtggtcagcg 24336
acgagcagct ggcgcgctgg ctgggagcga gtagcacccc ccagagcctg gaagagcggc 24396
gcaagctcat gatggccgtg gtcctggtga ccgtggagct ggagtgtctg cgccgcttct 24456
tcgccgacgc ggagaccctg cgcaaggtcg aggagaacct gcactacctc ttcagacacg 24516
ggttcgtgcg ccaggcctgc aagatctcca acgtggagct gaccaacctg gtctcctaca 24576
tgggcatcct gcacgagaac cgcctggggc agaacgtgct gcacaccacc ctgcgcgggg 24636
aggcccgccg cgactacatc cgcgactgcg tctacctgta cctctgccac acctggcaga 24696
cgggcatggg cgtgtggcag cagtgcctgg aggagcagaa cctgaaagag ctctgcaagc 24756
tcctgcagaa gaacctcaag gccctgtgga ccgggttcga cgagcgcacc accgccgcgg 24816
acctggccga cctcatcttc cccgagcgcc tgcggctgac gctgcgcaac gggctgcccg 24876
actttatgag ccaaagcatg ttgcaaaact ttcgctcttt catcctcgaa cgctccggga 24936
tcctgcccgc cacctgctcc gcgctgccct cggacttcgt gccgctgacc ttccgcgagt 24996
gccccccgcc gctctggagc cactgctacc tgctgcgcct ggccaactac ctggcctacc 25056
actcggacgt gatcgaggac gtcagcggcg agggcctgct cgagtgccac tgccgctgca 25116
acctctgcac gccgcaccgc tccctggcct gcaaccccca gctgctgagc gagacccaga 25176
tcatcggcac cttcgagttg caaggccccg gcgagggcaa ggggggtctg aaactcaccc 25236
cggggctgtg gacctcggcc tacttgcgca agttcgtgcc cgaggactac catcccttcg 25296
agatcaggtt ctacgaggac caatcccagc cgcccaaggc cgagctgtcg gcctgcgtca 25356
tcacccaggg ggccatcctg gcccaattgc aagccatcca gaaatcccgc caagaatttc 25416
tgctgaaaaa gggccacggg gtctacttgg acccccagac cggagaggag ctcaacccca 25476
gcttccccca ggatgccccg aggaagcagc aagaagctga aagtggagct gccgccgccg 25536
ccggaggatt tggaggaaga ctgggagagc agtcaggcag aggaggagga gatggaagac 25596
tgggacagca ctcaggcaga ggaggacagc ctgcaagaca gtctggagga ggaagacgag 25656
gtggaggagg cagaggaaga agcagccgcc gccagaccgt cgtcctcggc ggaggaggag 25716
aaagcaagca gcacggatac catctccgct ccgggtcggg gtcgcggcgg ccgggcccac 25776
agtagatggg acgagaccgg gcgcttcccg aaccccacca cccagaccgg taagaaggag 25836
cggcagggat acaagtcctg gcgggggcac aaaaacgcca tcgtctcctg cttgcaagcc 25896
tgcgggggca acatctcctt cacccggcgc tacctgctct tccaccgcgg ggtgaacttc 25956
ccccgcaaca tcttgcatta ctaccgtcac ctccacagcc cctactactg tttccaagaa 26016
gaggcagaaa cccagcagca gcagcagcag cagaaaacca gcggcagcag ctagaaaatc 26076
cacagcggcg gcaggtggac tgaggatcgc ggcgaacgag ccggcgcaga cccgggagct 26136
gaggaaccgg atctttccca ccctctatgc catcttccag cagagtcggg ggcaagagca 26196
ggaactgaaa gtcaagaacc gttctctgcg ctcgctcacc cgcagttgtc tgtatcacaa 26256
gagcgaagac caacttcagc gcactctcga ggacgccgag gctctcttca acaagtactg 26316
cgcgctcact cttaaagagt agcccgcgcc cgcccacaca cggaaaaagg cgggaattac 26376
gtcaccacct gcgcccttcg cccgaccatc atcatgagca aagagattcc cacgccttac 26436
atgtggagct accagcccca gatgggcctg gccgccggcg ccgcccagga ctactccacc 26496
cgcatgaact ggctcagtgc cgggcccgcg atgatctcac gggtgaatga catccgcgcc 26556
caccgaaacc agatactcct agaacagtca gcgatcaccg ccacgccccg ccatcacctt 26616
aatccgcgta attggcccgc cgccctggtg taccaggaaa ttccccagcc cacgaccgta 26676
ctacttccgc gagacgccca ggccgaagtc cagctgacta actcaggtgt ccagctggcc 26736
ggcggcgccg ccctgtgtcg tcaccgcccc gctcagggta taaagcggct ggtgatccga 26796
ggcagaggca cacagctcaa cgacgaggtg gtgagctctt cgctgggtct gcgacctgac 26856
ggagtcttcc aactcgccgg atcggggaga tcttccttca cgcctcgtca ggccgtcctg 26916
actttggaga gttcgtcctc gcagccccgc tcgggtggca tcggcactct ccagttcgtg 26976
gaggagttca ctccctcggt ctacttcaac cccttctccg gctcccccgg ccactacccg 27036
gacgagttca tcccgaactt cgacgccatc agcgagtcgg tggacggcta cgattgaatg 27096
tcccatggtg gcgcggctga cctagctcgg cttcgacacc tggaccactg ccgccgcttc 27156
cgctgcttcg ctcgggatct cgccgagttt gcctactttg agctgcccga ggagcaccct 27216
cagggcccgg cccacggagt gcggatcgtc gtcgaagggg gtctcgactc ccacctgctt 27276
cggatcttca gccagcgtcc gatcctggcc gagcgcgagc aaggacagac ccttctgacc 27336
ctgtactgca tctgcaacca ccccggcctg catgaaagtc tttgttgtct gctgtgtact 27396
gagtataata aaagctgaga tcagcgacta ctccggactt ccgtgtgttc ctgctatcaa 27456
ccagtccctg ttcttcaccg ggaacgagac cgagctccag ctccagtgta agccccacaa 27516
gaagtacctc acctggctgt tccagggctc tccgatcgcc gttgtcaacc actgcgacaa 27576
cgacggagtc ctgctgagcg gccctgccaa ccttactttt tccacccgca gaagcaagct 27636
ccagctcttc caacccttcc tccccgggac ctatcagtgc gtctcgggac cctgccatca 27696
caccttccac ctgatcccga ataccacagc gtcgctcccc gctactaaca accaaactac 27756
ccaccaacgc caccgtcgcg acctttcctc tgggtctaat accactaccg gaggtgagct 27816
ccgaggtcga ccaacctctg ggatttacta cggcccctgg gaggtggtag ggttaatagc 27876
gctaggccta gttgcgggtg ggcttttggc tctctgctac ctatacctcc cttgctgttc 27936
gtacttagtg gtgctgtgtt gctggtttaa gaaatgggga agatcaccct agtgagctgc 27996
ggtgtgctgg tggcggtggt gctttcgatt gtgggactgg gcggcgcggc tgtagtgaag 28056
gagaaggccg atccctgctt gcatttcaat cccgacaaat gccagctgag ttttcagccc 28116
gatggcaatc ggtgcgcggt gctgatcaag tgcggatggg aatgcgagaa cgtgagaatc 28176
gagtacaata acaagactcg gaacaatact ctcgcgtccg tgtggcagcc cggggacccc 28236
gagtggtaca ccgtctctgt ccccggtgct gacggctccc cgcgcaccgt gaataatact 28296
ttcatttttg cgcacatgtg cgacacggtc atgtggatga gcaagcagta cgatatgtgg 28356
ccccccacga aggagaacat cgtggtcttc tccatcgctt acagcgtgtg cacggcgcta 28416
atcaccgcta tcgtgtgcct gagcattcac atgctcatcg ctattcgccc cagaaataat 28476
gccgaaaaag aaaaacagcc ataacacgtt ttttcacaca cctttttcag accatggcct 28536
ctgttaaatt tttgctttta tttgccagtc tcattgccgt cattcatgga atgagtaatg 28596
agaaaattac tatttacact ggcactaatc acacattgaa aggtccagaa aaagccacag 28656
aagtttcatg gtattgttat tttaatgaat cagatgtatc tactgaactc tgtggaaaca 28716
ataacaaaaa aaatgagagc attactctca tcaagtttca atgtggatct gacttaaccc 28776
taattaacat cactagagac tatgtaggta tgtattatgg aactacagca ggcatttcgg 28836
acatggaatt ttatcaagtt tctgtgtctg aacccaccac gcctagaatg accacaacca 28896
caaaaactac acctgttacc actatacagc tcactaccaa tggctttctt gccatgcttc 28956
aagtggctga aaatagcacc agcattcaac ccaccccacc cagtgaggaa attcccagat 29016
ccatgattgg cattattgtt gctgtagtgg tgtgcatgtt gatcatcgcc ttgtgcatgg 29076
tgtactatgc cttctgctac agaaagcaca gactgaacga caagctggaa cacttactaa 29136
gtgttgaatt ttaatttttt agaaccatga agatcctagg ccttttagtt ttttctatca 29196
ttacctctgc tctatgcaat tctgacaatg aggacgttac tgtcgttgtc ggatcaaatt 29256
atacactaaa aggtccagca aaaggtatgc tttcgtggta ttgttggttc ggaactgacg 29316
agcaacagac agaactttgc aatgctcaaa aaggcaaaac ctcaaattct aaaatctcta 29376
attatcaatg caatggcact gacttagtat tgctcaatgt cacgaaagca tatgctggca 29436
gttacacctg ccctggagat gatgccgaca atatgatttt ttacaaagtg gaagtggttg 29496
atcccactac tccaccgccc accaccacaa ctactcatac cacacacaca gaacaaacac 29556
cagaggcagc agaagcagag ttggccttcc aggttcacgg agattccttt gctgtcaata 29616
cccctacacc cgatcagcgg tgtccggggc tgctcgtcag cggcattgtc ggtgtgcttt 29676
cgggattagc agtcataatc atctgcatgt tcatttttgc ttgctgctat agaaggcttt 29736
accgacaaaa atcagaccca ctgctgaacc tctatgttta attttttcca gagccatgaa 29796
ggcagttagc gctctagttt tttgttcttt gattggcatt gtttttagtg ctgggttttt 29856
gaaaaatctt accatttatg aaggtgagaa tgccactcta gtgggcatca gtggtcaaaa 29916
tgtcagctgg ctaaaatacc atctagatgg gtggaaagac atttgcgatt ggaatgtcac 29976
tgtgtataca tgtaatggag ttaacctcac cattactaat gccacccaag atcagaatgg 30036
taggtttaag ggccagagtt tcactagaaa taatgggtat gaatcccata acatgtttat 30096
ctatgacgtc actgtcatca gaaatgagac tgccaccacc acacagatgc ccactacaca 30156
cagttctacc actactacca tgcaaaccac acagacaacc actacatcaa ctcagcatat 30216
gaccaccact acagcagcaa agccaagtag tgcagcgcct cagccccagg ctttggcttt 30276
gaaagctgca caacctagta caactactag gaccaatgag cagactactg aatttttgtc 30336
cactgtcgag agccacacca cagctacctc cagtgccttc tctagcaccg ccaatctctc 30396
ctcgctttcc tctacaccaa tcagtcccgc tactactccc accccagctc ttctccccac 30456
tcccctgaag caaactgagg acagcggcat gcaatggcag atcaccctgc tcattgtgat 30516
cgggttggtc atcctggccg tgttgctcta ctacatcttc tgccgccgca ttcccaacgc 30576
gcaccgcaaa ccggcctaca agcccatcgt tatcgggcag ccggagccgc ttcaggtgga 30636
agggggtcta aggaatcttc tcttctcttt tacagtatgg tgattgaact atgattccta 30696
gacaattctt gatcactatt cttatctgcc tcctccaagt ctgtgccacc ctcgctctgg 30756
tggccaacgc cagtccagac tgtattgggc ccttcgcctc ctacgtgctc tttgccttca 30816
tcacctgcat ctgctgctgt agcatagtct gcctgcttat caccttcttc cagttcattg 30876
actggatctt tgtgcgcatc gcctacctgc gccaccaccc ccagtaccgc gaccagcgag 30936
tggcgcggct gctcaggctc ctctgataag catgcgggct ctgctacttc tcgcgcttct 30996
gctgttagtg ctcccccgcc ccgtcgaccc ccggtccccc actcagtccc ccgaagaggt 31056
ccgcaaatgc aaattccaag aaccctggaa attcctcaaa tgctaccgcc aaaaatcaga 31116
catgcttccc agctggatca tgatcattgg gatcgtgaac attctggcct gcaccctcat 31176
ctcctttgtg atttacccct gctttgactt tggttggaac tcgccagagg cgctctatct 31236
cccgcctgaa cctgacacac caccacagca acctcaggca cacgcactac caccaccaca 31296
gcctaggcca caatacatgc ccatattaga ctatgaggcc gagccacagc gacccatgct 31356
ccccgctatt agttacttca atctaaccgg cggagatgac tgacccactg gccaacaaca 31416
acgtcaacga ccttctcctg gacatggacg gccgcgcctc ggagcagcga ctcgcccaac 31476
ttcgcattcg ccagcagcag gagagagccg tcaaggagct gcaggacggc atagccatcc 31536
accagtgcaa gaaaggcatc ttctgcctgg tgaaacaggc caagatctcc tacgaggtca 31596
ccccgaccga ccatcgcctc tcctacgagc tcctgcagca gcgccagaag ttcacctgcc 31656
tggtcggagt caaccccatc gtcatcaccc agcagtcggg cgataccaag gggtgcatcc 31716
actgctcctg cgactccccc gactgcgtcc acactctgat c~gaccctc tgcggcctcc 31776
gcgacctcct ccccatgaac taatcacccc cttatccagt gaaataaata tcatattgat 31836
gatgatttaa ataaaaaata atcatttgat ttgaaataaa gatacaatca tattgatgat 31896
ttgagtttta aaaaataaag aatcacttac ttgaaatctg ataccaggtc tctgtccatg 31956
ttttctgcca acaccacctc actcccctct tcccagctct ggtactgcag accccggcgg 32016
gctgcaaact tcctccacac gctgaagggg atgtcaaatt cctcctgtcc ctcaatcttc 32076
attttatctt ctatcag atg tcc aaa aag cgc gtc cgg gtg gat gat gac 32126
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp
1465 1470
ttc gac ccc gtc tac ccc tac gat gca gac aac gca ccg acc gtg 32171
Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val
1475 1480 1485
ccc ttc atc aac ccc ccc ttc gtc tct tca gat gga ttc caa gag 32216
Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gln Glu
1490 1495 1500
aag ccc ctg ggg gtg ctg tcc ctg cga ctg gct gac ccc gtc acc 32261
Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr
1505 1510 1515
acc aag aac ggg gaa atc acc ctc aag ctg gga gag ggg gtg gac 32306
Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly Glu Gly Val Asp
1520 1525 1530
ctc gac tcc tcg gga aaa ctc atc tcc aac acg gcc acc aag gcc 32351
Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr Ala Thr Lys Ala
1535 1540 1545
gcc gcc cct ctc agt ttt tcc aac aac acc att tcc ctt aac atg 32396
Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr Ile Ser Leu Asn Met
1550 1555 1560
gat acc cct ctt tat acc aaa gat gga aaa tta tcc tta caa gtt 32441
Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu Ser Leu Gln Val
1565 1570 1575
tct cca ccg tta aac ata tta aaa tca acc att ctg aac aca tta 32486
Ser Pro Pro Leu Asn Ile Leu Lys Ser Thr Ile Leu Asn Thr Leu
1580 1585 1590
gct gta gct tat gga tca ggt tta gga ctg agt ggt ggc act gct 32531
Ala Val Ala Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly Thr Ala
1595 1600 1605
ctt gca gta cag ttg gcc tct cca ctc act ttt gat gaa aaa gga 32576
Leu Ala Val Gln Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys Gly
1610 1615 1620
aat att aaa att aac cta gcc agt ggt cca tta aca gtt gat gca 32621
Asn Ile Lys Ile Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala
1625 1630 1635
agt cga ctt agt atc aac tgc aaa aga ggg gtc act gtc act acc 32666
Ser Arg Leu Ser Ile Asn Cys Lys Arg Gly Val Thr Val Thr Thr
1640 1645 1650
tca gga gat gca att gaa agc aac ata agc tgg cct aaa ggt ata 32711
Ser Gly Asp Ala Ile Glu Ser Asn Ile Ser Trp Pro Lys Gly Ile
1655 1660 1665
aga ttt gaa ggt aat ggc ata gct gca aac att ggc aga gga ttg 32756
Arg Phe Glu Gly Asn Gly Ile Ala Ala Asn Ile Gly Arg Gly Leu
1670 1675 1680
gaa ttt gga acc act agt aca gag act gat gtc aca gat gca tac 32801
Glu Phe Gly Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr
1685 1690 1695
cca att caa gtt aaa ttg ggt act ggc ctt acc ttt gac agt aca 32846
Pro Ile Gln Val Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr
1700 1705 1710
ggc gcc att gtt gct tgg aac aaa gag gat gat aaa ctt aca tta 32891
Gly Ala Ile Val Ala Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu
1715 1720 1725
tgg acc aca gcc gac ccc tcg cca aat tgc aaa ata tac tct gaa 32936
Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Lys Ile Tyr Ser Glu
1730 1735 1740
aaa gat gcc aaa ctc aca ctt tgc ttg aca aag tgt gga agt caa 32981
Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln
1745 1750 1755
att ctg ggt act gtg act gta ttg gca gtg aat aat gga agt ctc 33026
Ile Leu Gly Thr Val Thr Val Leu Ala Val Asn Asn Gly Ser Leu
1760 1765 1770
aac cca atc aca aac aca gta agc act gca ctc gtc tcc ctc aag 33071
Asn Pro Ile Thr Asn Thr Val Ser Thr Ala Leu Val Ser Leu Lys
1775 1780 1785
ttt gat gca agt gga gtt ttg cta agc agc tcc aca tta gac aaa 33116
Phe Asp Ala Ser Gly Val Leu Leu Ser Ser Ser Thr Leu Asp Lys
1790 1795 1800
gaa tat tgg aac ttc aga aag gga gat gtt aca cct gct gag ccc 33161
Glu Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Glu Pro
1805 1810 1815
tat act aat gct ata ggt ttt atg cct aac ata aag gcc tat cct 33206
Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Ile Lys Ala Tyr Pro
1820 1825 1830
aaa aac aca tct gca gct tca aaa agc cat att gtc agt caa gtt 33251
Lys Asn Thr Ser Ala Ala Ser Lys Ser His Ile Val Ser Gln Val
1835 1840 1845
tat ctc aat ggg gat gag gcc aaa cca ctg atg ctg att att act 33296
Tyr Leu Asn Gly Asp Glu Ala Lys Pro Leu Met Leu Ile Ile Thr
1850 1855 1860
ttt aat gaa act gag gat gca act tgc acc tac agt atc act ttt 33341
Phe Asn Glu Thr Glu Asp Ala Thr Cys Thr Tyr Ser Ile Thr Phe
1865 1870 1875
caa tgg aaa tgg gat agt act aag tac aca ggt gaa aca ctt gct 33386
Gln Trp Lys Trp Asp Ser Thr Lys Tyr Thr Gly Glu Thr Leu Ala
1880 1885 1890
acc agc tcc ttc acc ttc tcc tac atc gcc caa gaa tga acactgtatc 33435
Thr Ser Ser Phe Thr Phe Ser Tyr Ile Ala Gln Glu
1895 1900 1905
ccaccctgca tgccaaccct tcccacccca ctctgtctat ggaaaaaact ctgaagcaca 33495
aaataaaata aagttcaagt gttttattga ttcaacagtt ttacaggatt cgagcagtta 33555
tttttcctcc accctcccag gacatggaat acaccaccct ctccccccgc acagccttga 33615
acatctgaat gccattggtg atggacatgc ttttggtctc cacgttccac acagtttcag 33675
agcgagccag tctcgggtcg gtcagggaga tgaaaccctc cgggcactcc cgcatctgca 33735
cctcacagct caacagctga ggattgtcct cggtggtcgg gatcacggtt atctggaaga 33795
agcagaagag cggcggtggg aatcatagtc cgcgaacggg atcggccggt ggtgtcgcat 33855
caggccccgc agcagtcgct gccgccgccg ctccgtcaag ctgctgctca gggggtccgg 33915
gtccagggac tccctcagca tgatgcccac ggccctcagc atcagtcgtc tggtgcggcg 33975
ggcgcagcag cgcatgcgga tctcgctcag gtcgctgcag tacgtgcaac acaggaccac 34035
caggttgttc aacagtccat agttcaacac gctccagccg aaactcatcg cgggaaggat 34095
gctacccacg tggccgtcgt accagatcct caggtaaatc aagtggcgct ccctccagaa 34155
cacgctgccc acgtacatga tctccttggg catgtggcgg ttcaccacct cccggtacca 34215
catcaccctc tggttgaaca tgcagccccg gatgatcctg cggaaccaca gggccagcac 34275
cgccccgccc gccatgcagc gaagagaccc cgggtcccgg caatggcaat ggaggaccca 34335
ccgctcgtac ccgtggatca tctgggagct gaacaagtct atgttggcac agcacaggca 34395
tatgctcatg catctcttca gcactctcag ctcctcgggg gtcaaaacca tatcccaggg 34455
cacggggaac tcttgcagga cagcgaaccc cgcagaacag ggcaatcctc gcacataact 34515
tacattgtgc atggacaggg tatcgcaatc aggcagcacc gggtgatcct ccaccagaga 34575
agcgcgggtc tcggtctcct cacagcgtgg taagggggcc ggccgatacg ggtgatggcg 34635
ggacgcggct gatcgtgttc gcgaccgtgt catgatgcag ttgctttcgg acattttcgt 34695
acttgctgta gcagaacctg gtccgggcgc tgcacaccga tcgccggcgg cggtcccggc 34755
gcttggaacg ctcggtgttg aaattgtaaa acagccactc tctcagaccg tgcagcagat 34815
ctagggcctc aggagtgatg aagatcccat catgcctgat agctctgatc acatcgacca 34875
ccgtggaatg ggccagaccc agccagatga tgcaattttg ttgggtttcg gtgacggcgg 34935
gggagggaag aacaggaaga accatgatta acttttaatc caaacggtct cggagcactt 34995
caaaatgaag gtcgcggaga tggcacctct cgcccccgct gtgttggtgg aaaataacag 35055
ccaggtcaaa ggtgatacgg ttctcgagat gttccacggt ggcttccagc aaagcctcca 35115
cgcgcacatc cagaaacaag acaatagcga aagcgggagg gttctctaat tcctcaatca 35175
tcatgttaca ctcctgcacc atccccagat aattttcatt tttccagcct tgaatgattc 35235
gaactagttc ctgaggtaaa tccaagccag ccatgataaa gagctcgcgc agagcgccct 35295
ccaccggcat tcttaagcac accctcataa ttccaagata ttctgctcct ggttcacctg 35355
cagcagattg acaagcggaa tatcaaaatc tctgccgcga tccctaagct cctccctcag 35415
caataactgt aagtactctt tcatatcctc tccgaaattt ttagccatag gaccaccagg 35475
aataagatta gggcaagcca cagtacagat aaaccgaagt cctccccagt gagcattgcc 35535
aaatgcaaga ctgctataag catgctggct agacccggtg atatcttcca gataactgga 35595
cagaaaatca cccaggcaat ttttaagaaa atcaacaaaa gaaaaatcct ccaggtgcac 35655
gtttagagcc tcgggaacaa cgatgaagta aatgcaagcg gtgcgttcca gcatggttag 35715
ttagctgatc tgtaaaaaac aaaaaataaa acattaaacc atgctagcct ggcgaacagg 35775
tgggtaaatc gttctctcca gcaccaggca ggccacgggg tctccggcgc gaccctcgta 35835
aaaattgtcg ctatgattga aaaccatcac agagagacgt tcccggtggc cggcgtgaat 35895
gattcgacaa gatgaataca cccccggaac attggcgtcc gcgagtgaaa aaaagcgccc 35955
gaggaagcaa taaggcacta caatgctcag tctcaagtcc agcaaagcga tgccatgcgg 36015
atgaagcaca aaatcctcag gtgcgtacaa aatgtaatta ctcccctcct gcacaggcag 36075
cgaagccccc gatccctcca gatacacata caaagcctca gcgtccatag cttaccgagc 36135
agcagcacac aacaggcgca agagtcagag aaaggctgag ctctaacctg tccacccgct 36195
ctctgctcaa tatatagccc agatctacac tgacgtaaag gccaaagtct aaaaataccc 36255
gccaaataat cacacacgcc cagcacacgc ccagaaaccg gtgacacact caaaaaaata 36315
cgcgcacttc ctcaaacgcc caaactgccg tcatttccgg gttcccacgc tacgtcatcg 36375
gaattcgact ttcaaattcc gtcgaccgtt aaaaacgtca cccgccccgc ccctaacggt 36435
cgcccgtctc tcggccaatc accttcctcc ctccccaaat tcaaacagct catttgcata 36495
ttaacgcgca ccaaaagttt gaggtatatt attgatgatg 36535
<210>10
<211>531
<212>PRT
<213>黑猩猩腺病毒血清型Pan7
<400>10
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Val Ala Ala Ala Met Gln Pro Pro Leu Glu Ala
20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser
35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60
Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr
85 90 95
Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110
Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val
115 120 125
Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser
130 135 140
Arg Lys Thr Pro Asn Gly Val Ala Val Asp Glu Asn Tyr Asp Gly Ser
145 150 155 160
Gln Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly
165 170 175
Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile
180 185 190
Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp
195 200 205
Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro
210 215 220
Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His
225 230 235 240
Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser
245 250 255
Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu
260 265 270
Gly Phe Gln Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala
275 280 285
Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Glu Ala Ala Ala
290 295 300
Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp
305 310 315 320
Asn Phe Ala Ser Ala Ala Ala Val Ala Glu Ala Ala Glu Thr Glu Ser
325 330 335
Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr
340 345 350
Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu
355 360 365
Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu
370 375 380
Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser
385 390 395 400
Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln
405 410 415
Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser
420 425 430
Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala
435 440 445
Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile
450 455 460
Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val
465 470 475 480
Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg
485 490 495
Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro
500 505 510
Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser
515 520 525
Arg Thr Phe
530
<210>11
<211>932
<212>PRT
<213>黑猩猩腺病毒血清型Pan7
<400>11
Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Gly Asp Thr Asp Thr
130 135 140
Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gln Gly Ile Ser Ile
145 150 155 160
Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Ser Asp Gly Gln Ala Ile
165 170 175
Tyr Ala Asp Glu Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Ala Glu
180 185 190
Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu
195 200 205
Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro
210 215 220
Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys Thr Glu Thr Gly Gly
225 230 235 240
Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser Ala
245 250 255
Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn Val
260 265 270
Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr Asp
275 280 285
Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln Ser Met Pro Asn Arg
290 295 300
Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr
305 310 315 320
Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu
325 330 335
Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln
340 345 350
Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp
355 360 365
Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn
370 375 380
His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala
385 390 395 400
Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys Ala Asn Gly Asp Asn
405 410 415
Gln Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu Leu
420 425 430
Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn Leu
435 440 445
Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser
450 455 460
Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn Thr
465 470 475 480
Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp Ala
485 490 495
Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val
500 505 510
Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met
515 520 525
Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln
530 535 540
Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr
545 550 555 560
Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser Ser
565 570 575
Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser
580 585 590
Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala Ser
595 600 605
Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn
610 615 620
Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala
625 630 635 640
Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg
645 650 655
Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly
660 665 670
Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu
675 680 685
Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr
690 695 700
Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro
705 710 715 720
Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val
725 730 735
Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu Ala
740 745 750
His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys
755 760 765
Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln
770 775 780
Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu Ala
785 790 795 800
Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met
805 810 815
Arg Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly
820 825 830
Lys Ser Ala Val Ala Ser Val Thr Gln Lys Lys Phe Leu Cys Asp Arg
835 840 845
Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala
850 855 860
Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala
865 870 875 880
Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu
885 890 895
Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro His
900 905 910
Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly
915 920 925
Asn Ala Thr Thr
930
<210>12
<211>443
<212>PRT
<213>黑猩猩腺病毒血清型Pan7
<400>12
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60
Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser
65 70 75 80
Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr
85 90 95
Ile Ser Leu Asn Met Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu
100 105 110
Ser Leu Gln Val Ser Pro Pro Leu Asn Ile Leu Lys Ser Thr Ile Leu
115 120 125
Asn Thr Leu Ala Val Ala Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly
130 135 140
Thr Ala Leu Ala Val Gln Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys
145 150 155 160
Gly Asn Ile Lys Ile Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala
165 170 175
Ser Arg Leu Ser Ile Asn Cys Lys Arg Gly Val Thr Val Thr Thr Ser
180 185 190
Gly Asp Ala Ile Glu Ser Asn Ile Ser Trp Pro Lys Gly Ile Arg Phe
195 200 205
Glu Gly Asn Gly Ile Ala Ala Asn Ile Gly Arg Gly Leu Glu Phe Gly
210 215 220
Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr Pro Ile Gln Val
225 230 235 240
Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile Val Ala
245 250 255
Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala Asp Pro
260 265 270
Ser Pro Asn Cys Lys Ile Tyr Ser Glu Lys Asp Ala Lys Leu Thr Leu
275 280 285
Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly Thr Val Thr Val Leu
290 295 300
Ala Val Asn Asn Gly Ser Leu Asn Pro Ile Thr Asn Thr Val Ser Thr
305 310 315 320
Ala Leu Val Ser Leu Lys Phe Asp Ala Ser Gly Val Leu Leu Ser Ser
325 330 335
Ser Thr Leu Asp Lys Glu Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr
340 345 350
Pro Ala Glu Pro Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Ile Lys
355 360 365
Ala Tyr Pro Lys Asn Thr Ser Ala Ala Ser Lys Ser His Ile Val Ser
370 375 380
Gln Val Tyr Leu Asn Gly Asp Glu Ala Lys Pro Leu Met Leu Ile Ile
385 390 395 400
Thr Phe Asn Glu Thr Glu Asp Ala Thr Cys Thr Tyr Ser Ile Thr Phe
405 410 415
Gln Trp Lys Trp Asp Ser Thr Lys Tyr Thr Gly Glu Thr Leu Ala Thr
420 425 430
Ser Ser Phe Thr Phe Ser Tyr Ile Ala Gln Glu
435 440
<210>13
<211>338
<212>PRT
<213>猿猴血清型C1
<400>13
Ala Pro Lys Gly Ala Pro Asn Thr Ser Gln Trp Leu Asp Lys Gly Val
1 5 10 15
Thr Thr Thr Asp Asn Asn Thr Glu Asn Gly Asp Glu Glu Asp Glu Val
20 25 30
Ala Glu Glu Gly Glu Glu Glu Lys Gln Ala Thr Tyr Thr Phe Gly Asn
35 40 45
Ala Pro Val Lys Ala Glu Ala Glu Ile Thr Lys Glu Gly Leu Pro Ile
50 55 60
Gly Leu Glu Val Pro Ser Glu Gly Asp Pro Lys Pro Ile Tyr Ala Asp
65 70 75 80
Lys Leu Tyr Gln Pro Glu Pro Gln Val Gly Glu Glu Ser Trp Thr Asp
85 90 95
Thr Asp Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Glu
100 105 110
Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Val
115 120 125
Lys Gly Gly Gln Ala Lys Val Lys Lys Val Glu Glu Gly Lys Val Glu
130 135 140
Tyr Asp Ile Asp Met Asn Phe Phe Asp Leu Arg Ser Gln Lys Thr Gly
145 150 155 160
Leu Lys Pro Lys Ile Val Met Tyr Ala Glu Asn Val Asp Leu Glu Thr
165 170 175
Pro Asp Thr His Val Val Tyr Lys Pro Gly Ala Ser Asp Ala Ser Ser
180 185 190
His Ala Asn Leu Gly Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile
195 200 205
Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly
210 215 220
Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val
225 230 235 240
Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp
245 250 255
Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val
260 265 270
Asp Ser Tyr Asp Pro Asp Val Arg Val Ile Glu Asn His Gly Val Glu
275 280 285
Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Val Gly Pro Arg
290 295 300
Thr Asp Ser Tyr Lys Gly Ile Glu Thr Asn Gly Asp Glu Asn Thr Thr
305 310 315 320
Trp Lys Asp Leu Asp Pro Asn Gly Ile Ser Glu Leu Ala Lys Gly Asn
325 330 335
Pro Phe
<210>14
<211>315
<212>PRT
<213>黑猩猩腺病毒Pan-9
<400>14
Ala Pro Lys Gly Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Asp
1 5 10 15
Gly Glu Thr Ala Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val
20 25 30
Gln Gly Ile Asn Ile Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Thr
35 40 45
Asp Asp Gln Pro Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln
50 55 60
Val Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr
65 70 75 80
Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly
85 90 95
Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys
100 105 110
Thr Gly Thr Gly Thr Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe
115 120 125
Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu
130 135 140
Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr
145 150 155 160
Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln
165 170 175
Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile
180 185 190
Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly
195 200 205
Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr
210 215 220
Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg
225 230 235 240
Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val
245 250 255
Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys
260 265 270
Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys
27 280 285
Ala Asn Gly Thr Asp Gln Thr Thr Trp Thr Lys Asp Asp Ser Val Asn
290 295 300
Asp Ala Asn Glu Ile Gly Lys Gly Asn Pro Phe
305 310 315
<210>15
<211>315
<212>PRT
<213>黑猩猩腺病毒Pan-5
<400>15
Ala Pro Lys Gly Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Asp
1 5 10 15
Gly Asp Thr Gly Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val
20 25 30
Gln Gly Ile Ser Ile Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Thr
35 40 45
Asp Asp Gln Pro Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln
50 55 60
Val Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr
65 70 75 80
Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly
85 90 95
Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys
100 105 110
Thr Glu Thr Gly Gly Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe
115 120 125
Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu
130 135 140
Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr
145 150 155 160
Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln
165 170 175
Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile
180 185 190
Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly
195 200 205
Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr
210 215 220
Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg
225 230 235 240
Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val
245 250 255
Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys
260 265 270
Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys
275 280 285
Ala Asn Gly Ala Asp Gln Thr Thr Trp Thr Lys Asp Asp Thr Val Asn
290 295 300
Asp Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe
305 310 315
<210>16
<211>324
<212>PRT
<213>黑猩猩腺病毒Pan-6
<400>16
Ala Pro Lys Gly Ala Pro Asn Ser Ser Gln Trp Glu Gln Ala Lys Thr
1 5 10 15
Gly Asn Gly Gly Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met
20 25 30
Gly Gly Glu Asn Ile Thr Lys Asp Gly Leu Gln Ile Gly Thr Asp Val
35 40 45
Thr Ala Asn Gln Asn Lys Pro Ile Tyr Ala Asp Lys Thr Phe Gln Pro
50 55 60
Glu Pro Gln Val Gly Glu Glu Asn Trp Gln Glu Thr Glu Asn Phe Tyr
65 70 75 80
Gly Gly Arg Ala Leu Lys Lys Asp Thr Lys Met Lys Pro Cys Tyr Gly
85 90 95
Ser Tyr Ala Arg Pro Thr Asn Glu Lys Gly Gly Gln Ala Lys Leu Lys
100 105 110
Val Gly Asp Asp Gly Val Pro Thr Lys Glu Phe Asp Ile Asp Leu Ala
115 120 125
Phe Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gln Asp Glu Tyr Lys
130 135 140
Ala Asp Ile Val Met Tyr Thr Glu Asn Thr Tyr Leu Glu Thr Pro Asp
145 150 155 160
Thr His Val Val Tyr Lys Pro Gly Lys Asp Asp Ala Ser Ser Glu Ile
165 170 175
Asn Leu Val Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe
180 185 190
Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met
195 200 205
Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu
210 215 220
Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu
225 230 235 240
Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser
245 250 255
Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu
260 265 270
Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala
275 280 285
Tyr Gln Gly Val Lys Val Lys Asp Gly Gln Asp Gly Asp Val Glu Ser
290 295 300
Glu Trp Glu Asn Asp Asp Thr Val Ala Ala Arg Asn Gln Leu Cys Lys
305 310 315 320
Gly Asn Ile Phe
<210>17
<211>314
<212>PRT
<213>黑猩猩腺病毒Pan-7
<400>17
Ala Pro Lys Gly Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Gly
1 5 10 15
Asp Thr Asp Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gln
20 25 30
Gly Ile Ser Ile Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Ser Asp
35 40 45
Gly Gln Ala Ile Tyr Ala Asp Glu Thr Tyr Gln Pro Glu Pro Gln Val
50 55 60
Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly
65 70 75 80
Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser
85 90 95
Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys Thr
100 105 110
Glu Thr Gly Gly Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp
115 120 125
Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr
130 135 140
Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys
145 150 155 160
Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln Ser
165 170 175
Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly
180 185 190
Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln
195 200 205
Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu
210 215 220
Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr
225 230 235 240
Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg
245 250 255
Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe
260 265 270
Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys Ala
275 280 285
Asn Gly Asp Asn Gln Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp
290 295 300
Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe
305 310
<210>18
<211>179
<212>PRT
<213>黑猩猩腺病毒Pan9
<400>18
Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gln Ile Leu Ala
1 5 10 15
Glu Asn Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln
20 25 30
Ile Leu Ala Thr Val Ser Val Leu Val Val Gly Ser Gly Asn Leu Asn
35 40 45
Pro Ile Thr Gly Thr Val Ser Ser Ala Gln Val Phe Leu Arg Phe Asp
50 55 60
Ala Asn Gly Val Leu Leu Thr Glu His Ser Thr Leu Lys Lys Tyr Trp
65 70 75 80
Gly Tyr Arg Gln Gly Asp Ser Ile Asp Gly Thr Pro Tyr Thr Asn Ala
85 90 95
Val Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Ser Gln Ser Ser
100 105 110
Thr Thr Lys Asn Asn Ile Val Gly Gln Val Tyr Met Asn Gly Asp Val
115 120 125
Ser Lys Pro Met Leu Leu Thr Ile Thr Leu Asn Gly Thr Asp Asp Ser
130 135 140
Asn Ser Thr Tyr Ser Met Ser Phe Ser Tyr Thr Trp Thr Asn Gly Ser
145 150 155 160
Tyr Val Gly Ala Thr Phe Gly Ala Asn Ser Tyr Thr Phe Ser Tyr Ile
165 170 175
Ala Gln Glu
<210>19
<211>185
<212>PRT
<213>黑猩猩腺病毒Pan6
<400>19
Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gln Leu Leu Ser
1 5 10 15
Asp Arg Asp Ala Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln
20 25 30
Ile Leu Gly Thr Val Ala Val Ala Ala Val Thr Val Gly Ser Ala Leu
35 40 45
Asn Pro Ile Asn Asp Thr Val Lys Ser Ala Ile Val Phe Leu Arg Phe
50 55 60
Asp Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly Asp Tyr
65 70 75 80
Trp Asn Phe Arg Glu Gly Gln Thr Thr Gln Ser Val Ala Tyr Thr Asn
85 90 95
Ala Val Gly Phe Met Pro Asn Ile Gly Ala Tyr Pro Lys Thr Gln Ser
100 105 110
Lys Thr Pro Lys Asn Ser Ile Val Ser Gln Val Tyr Leu Thr Gly Glu
115 120 125
Thr Thr Met Pro Met Thr Leu Thr Ile Thr Phe Asn Gly Thr Asp Glu
130 135 140
Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met Thr Phe Thr Trp Gln
145 150 155 160
Trp Thr Gly Asp Tyr Lys Asp Lys Asn Ile Thr Phe Ala Thr Asn Ser
165 170 175
Phe Ser Phe Ser Tyr Ile Ala Gln Glu
180 185
<210>20
<211>179
<212>PRT
<213>黑猩猩腺病毒Pan7
<400>20
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Lys Ile Tyr Ser
1 5 10 15
Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln
20 25 30
Ile Leu Gly Thr Val Thr Val Leu Ala Val Asn Asn Gly Ser Leu Asn
35 40 45
Pro Ile Thr Asn Thr Val Ser Thr Ala Leu Val Ser Leu Lys Phe Asp
50 55 60
Ala Ser Gly Val Leu Leu Ser Ser Ser Thr Leu Asp Lys Glu Tyr Trp
65 70 75 80
Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Glu Pro Tyr Thr Asn Ala
85 90 95
Ile Gly Phe Met Pro Asn Ile Lys Ala Tyr Pro Lys Asn Thr Ser Ala
100 105 110
Ala Ser Lys Ser His Ile Val Ser Gln Val Tyr Leu Asn Gly Asp Glu
115 120 125
Ala Lys Pro Leu Met Leu Ile Ile Thr Phe Asn Glu Thr Glu Asp Ala
130 135 140
Thr Cys Thr Tyr Ser Ile Thr Phe Gln Trp Lys Trp Asp Ser Thr Lys
145 150 155 160
Tyr Thr Gly Glu Thr Leu Ala Thr Ser Ser Phe Thr Phe Ser Tyr Ile
165 170 175
Ala Gln Glu
<210>21
<211>179
<212>PRT
<213>黑猩猩腺病毒Pan5
<400>21
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys His Ile Tyr Ser
1 5 10 15
Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln
20 25 30
Ile Leu Gly Thr Val Ser Leu Ile Ala Val Asp Thr Gly Ser Leu Asn
35 40 45
Pro Ile Thr Gly Thr Val Thr Thr Ala Leu Val Ser Leu Lys Phe Asp
50 55 60
Ala Asn Gly Val Leu Gln Ser Ser Ser Thr Leu Asp Ser Asp Tyr Trp
65 70 75 80
Asn Phe Arg Gln Gly Asp Val Thr Pro Ala Glu Ala Tyr Thr Asn Ala
85 90 95
Ile Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Asn Thr Ser Gly
100 105 110
Ala Ala Lys Ser His Ile Val Gly Lys Val Tyr Leu His Gly Asp Thr
115 120 125
Gly Lys Pro Leu Asp Leu Ile Ile Thr Phe Asn Glu Thr Ser Asp Glu
130 135 140
Ser Cys Thr Tyr Cys Ile Asn Phe Gln Trp Gln Trp Gly Ala Asp Gln
145 150 155 160
Tyr Lys Asn Glu Thr Leu Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile
165 170 175
Ala Lys Glu
<210>22
<211>183
<212>PRT
<213>人腺病毒Ad 2
<400>22
Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Arg Ile His Ser
1 5 10 15
Asp Asn Asp Cys Lys Phe Thr Leu Val Leu Thr Lys Cys Gly Ser Gln
20 25 30
Val Leu Ala Thr Val Ala Ala Leu Ala Val Ser Gly Asp Leu Ser Ser
35 40 45
Met Thr Gly Thr Val Ala Ser Val Ser Ile Phe Leu Arg Phe Asp Gln
50 55 60
Asn Gly Val Leu Met Glu Asn Ser Ser Leu Lys Lys His Tyr Trp Asn
65 70 75 80
Phe Arg Asn Gly Asn Ser Thr Asn Ala Asn Pro Tyr Thr Asn Ala Val
85 90 95
Gly Phe Met Pro Asn Leu Leu Ala Tyr Pro Lys Thr Gln Ser Gln Thr
100 105 110
Ala Lys Asn Asn Ile Val Ser Gln Val Tyr Leu His Gly Asp Lys Thr
115 120 125
Lys Pro Met Ile Leu Thr Ile Thr Leu Asn Gly Thr Ser Glu Ser Thr
130 135 140
Glu Thr Ser Glu Val Ser Thr Tyr Ser Met Ser Phe Thr Trp Ser Trp
145 150 155 160
Glu Ser Gly Lys Tyr Thr Thr Glu Thr Phe Ala Thr Asn Ser Tyr Thr
165 170 175
Phe Ser Tyr Ile Ala Gln Glu
180
<210>23
<211>182
<212>PRT
<213>人腺病毒Ad 5
<400>23
Thr Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala
1 5 10 15
Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gln
20 25 30
Ile Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro
35 40 45
Ile Ser Gly Thr Val Gln Ser Ala His Leu Ile Ile Arg Phe Asp Glu
50 55 60
Asn Gly Val Leu Ile Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn
65 70 75 80
Phe Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val
85 90 95
Gly Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr
100 105 110
Ala Lys Ser Asn Ile Val Ser Gln Val Tyr Leu Asn Gly Asp Lys Thr
115 120 125
Lys Pro Val Thr Leu Thr Ile Thr Leu Asn Gly Thr Gln Glu Thr Gly
130 135 140
Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser
145 150 155 160
Gly His Asn Tyr Ile Asn Glu Ile Phe Ala Thr Ser Ser Tyr Thr Glu
165 170 175
Ser Tyr Ile Ala Gln Glu
180
<210>24
<211>34264
<212>DNA
<213>猿猴腺病毒SV-1
<220>
<221>CDS
<222>(12454)..(13965)
<223>L2五邻体
<220>
<221>CDS
<222>(16841)..(19636)
<223>L3六邻体
<220>
<221>CDS
<222>(28059)..(29150)
<223>L5纤维#2
<220>
<221>CDS
<222>(29183)..(30865)
<223>L5纤维#1
<400>24
tccttattct ggaaacgtgc caatatgata atgagcgggg aggagcgagg cggggccggg 60
gtgacgtgcg gtgacgtggg gtgacgcggg gtggcgcgag ggcggggcgg gagtggggag 120
gcgcttagtt tttacgtatg cggaaggagg ttttataccg gaagttgggt aatttgggcg 180
tatacttgta agttttgtgt aatttggcgc gaaaaccggg taatgaggaa gttgaggtta 240
atatgtactt tttatgactg ggcggaattt ctgctgatca gcagtgaact ttgggcgctg 300
acggggaggt ttcgctacgt ggcagtacca cgagaaggct caaaggtccc atttattgta 360
ctcctcagcg ttttcgctgg gtatttaaac gctgtcagat catcaagagg ccactcttga 420
gtgccggcga gtagagtttt ctcctccgcg ctgccgcgat gaggctggtt cccgagatgt 480
acggtgtttt ctgcagcgag acggcccgga actcagatga gctgcttaat acagatctgc 540
tggatgttcc caactcgcct gtggcttcgc ctccgtcgct tcatgatctt ttcgatgtgg 600
aagtggatcc accgcaagat cccaacgagg acgcggtaaa cagtatgttc cctgaatgtc 660
tgtttgaggc ggctgaggag ggttctcaca gcagtgaaga gagcagacgg ggagaggaac 720
tggacttgaa atgctacgag gaatgtctgc cttctagcga ttctgaaacg gaacagacag 780
ggggagacgg ctgtgagtcg gcaatgaaaa atgaacttgt attagactgt ccagaacatc 840
ctggtcatgg ctgccgtgcc tgtgcttttc atagaaatgc cagcggaaat cctgagactc 900
tatgtgctct gtgttatctg cgccttacca gcgattttgt atacagtaag taaagtgttt 960
tcattggcgt acggtagggg attcgttgaa gtgctttgtg acttattatg tgtcattatt 1020
tctaggtgac gtgtccgacg tggaagggga aggagataga tcaggggctg ctaattctcc 1080
ttgcactttg ggggctgtgg ttccagttgg catttttaaa ccgagtggtg gaggagaacg 1140
agccggagga gaccgagaat ctgagagccg gcctggaccc tccagtggaa gactaggtgc 1200
tgaggatgat cctgaagagg ggactagtgg gggtgctagg aaaaagcaaa aaactgagcc 1260
tgaacctaga aactttttga atgagttgac tgtaagccta atgaatcggc agcgtcctga 1320
gacggtgttt tggactgagt tggaggatga gttcaagaag ggggaattaa acctcttgta 1380
caagtatggg tttgagcagt tgaaaactca ctggttggag ccgtgggagg atatggaaat 1440
ggctctagac acctttgcta aagtggctct gcggccggat aaagtttaca ctattcgccg 1500
cactgttaat ataaaaaaga gtgtttatgt tatcggccat ggagctctgg tgcaggtgca 1560
gaccccagac cgggtggctt tcaattgcgg catgcagagt ttgggccccg gggtgatagg 1620
tttgaatgga gttacatttc aaaatgtcag gtttactggt gatgatttta atggctctgt 1680
gtttgtgact agcacccagc taaccctcca cggtgtttac ttttttaact ttaacaatac 1740
atgtgtggag tcatggggta gggtgtctct gaggggctgc agttttcatg gttgctggaa 1800
ggcggtggtg ggaagaatta aaagtgtcat gtctgtgaag aaatgcatat ttgaacgctg 1860
tgtgatagct ctagcagtag aggggtacgg acggatcagg aataacgccg catctgagaa 1920
tggatgtttt cttttgctga aaggtacggc cagcgttaag cataatatga tttgcggcag 1980
cggcctgtgc ccctcgcagc tcttaacttg cgcagatgga aactgtcaca ccttgcgcac 2040
cgtgcacata gtgtcccact cgcgccgcac ctggccaaca tttgagcaca atatgctcat 2100
gcgttgcgcc gttcacctag gtgctagacg cggcgtgttt atgccttatc aatgtaactt 2160
tagtcatact aagattttgc tggaaactga ttccttccct cgagtatgtt tcaatggggt 2220
gtttgacatg tcaatggaac tttttaaagt gataagatat gatgaaacca agtctcgttg 2280
tcgctcatgt gaatgcggag ctaatcattt gaggttgtat cctgtaaccc tgaacgttac 2340
cgaggagctg aggacggacc accacatgct gtcttgcctg cgtaccgact atgaatccag 2400
cgatgaggag tgaggtgagg ggcggagcca caaagggtat aaaggggcat gaggggtggg 2460
cgcggtgttt caaaatgagc gggacgacgg acggcaatgc gtttgagggg ggagtgttca 2520
gcccatatct gacatctcgt cttccttcct gggcaggagt tcgtcagaat gtagtgggct 2580
ccaccgtgga cggacggccg gtcgcccctg caaattccgc caccctcacc tatgccaccg 2640
tgggatcatc gttggacact gccgcggcag ctgccgcttc tgctgccgct tctactgctc 2700
gcggcatggc ggctgatttt ggactatata accaactggc cactgcagct gtggcgtctc 2760
ggtctctggt tcaagaagat gccctgaatg tgatcttgac tcgcctggag atcatgtcac 2820
gtcgcctgga cgaactggct gcgcagatat cccaagctaa ccccgatacc gcttcagaat 2880
cttaaaataa agacaaacaa atttgttgaa aagtaaaatg gctttatttg ttttttttgg 2940
ctcggtaggc tcgggtccac ctgtctcggt cgttaaggac tttgtgtatg ttttccaaaa 3000
cacggtacag atgggcttgg atgttcaagt acatgggcat gaggccatct ttggggtgga 3060
gataggacca ctgaagagcg tcatgttccg gggtggtatt gtaaatcacc cagtcgtagc 3120
agggtttttg agcgtggaac tggaatatgt ccttcaggag caggctaatg gccaagggta 3180
gacccttagt gtaggtgttt acaaagcggt tgagctggga gggatgcatg cggggggaga 3240
tgatatgcat cttggcttgg attttgaggt tagctatgtt accacccagg tctctgcggg 3300
ggttcatgtt atgaaggacc accagcacgg tatagccagt gcatttgggg aacttgtcat 3360
gcagtttgga ggggaaggcg tggaagaatt tagatacccc cttgtgcccc cctaggtttt 3420
ccatgcactc atccataata atggcaatgg gacccctggc ggccgcttta gcaaacacgt 3480
tttgggggtt ggaaacatca tagttttgct ctagagtgag ctcatcatag gccatcttta 3540
caaagcgggg taggagggtg cccgactggg ggatgatagt tccatctggg cctggagcgt 3600
agttgccctc acagatctgc atctcccagg ccttaatttc cgaggggggg atcatgtcca 3660
cctggggggc gataaaaaac acggtttctg gcggggggtt aatgagctgg gtggaaagca 3720
agttacgcaa cagctgggat ttgccgcaac cggtgggacc gtagatgacc ccgatgacgg 3780
gttgcagctg gtagttcaga gaggaacagc tgccgtcggg gcgcaggagg ggagctacct 3840
cattcatcat gcttctgaca tgtttatttt cactcactaa gttttgcaag agcctctccc 3900
cacccaggga taagagttct tccaggctgt tgaagtgttt cagcggtttc aggccgtcgg 3960
ccatgggcat cttttcaagc gactgacgaa gcaagtacag tcggtcccag agctcggtga 4020
cgtgctctat ggaatctcga tccagcagac ttcttggttt cgggggttgg gccgactttc 4080
gctgtagggc accagccggt gggcgtccag ggccgcgagg gttctgtcct tccagggtct 4140
cagcgttcgg gtgagggtgg tctcggtgac ggtgaaggga tgagccccgg gctgggcgct 4200
tgcgagggtg cgcttcaggc tcatcctgct ggtgctgaag cgggcgtcgt ctccctgtga 4260
gtcggccaga tagcaacgaa gcatgaggtc gtagctgagg gactcggccg cgtgtccctt 4320
ggcgcgcagc tttcccttgg aaacgtgctg acatttggtg cagtgcagac acttgagggc 4380
gtagagtttt ggggccagga agaccgactc gggcgagtag gcgtcggctc cgcactgagc 4440
gcagacggtc tcgcactcca ccagccacgt gagctcgggt ttagcgggat caaaaaccaa 4500
gttgcctcca ttttttttga tgcgtttctt accttgcgtc tccatgagtc tgtgtcccgc 4560
ttccgtgaca aaaaggctgt cggtatcccc gtagaccgac ttgagggggc gatcttccaa 4620
aggtgttccg aggtcttccg cgtacaggaa ctgggaccac tccgagacaa aggctcgggt 4680
ccaggctaac acgaaggagg cgatctgcga ggggtatctg tcgttttcaa tgagggggtc 4740
caccttttcc agggtgtgca gacacaggtc gtcctcctcc gcgtccacga aggtgattgg 4800
cttgtaagtg taggtcacgt gacccgcacc cccccaaggg gtataaaagg gggcgtgccc 4860
actctccccg tcactttctt ccgcatcgct gtggaccaga gccagctgtt cgggtgagta 4920
ggccctctca aaagccggca tgatttcggc gctcaagttg tcagtttcta caaacgaggt 4980
ggatttgata ttcacgtgcc ccgcggcgat gcttttgatg gtggaggggt ccatctgatc 5040
agaaaacacg atctttttat tgtcaagttt ggtggcgaaa gacccgtaga gggcgttgga 5100
aagcaacttg gcgatggagc gcagggtctg atttttctcc cgatcggccc tctccttggc 5160
ggcgatgttg agttgcacgt actcgcgggc cacgcaccgc cactcgggga acacggcggt 5220
gcgctcgtcg ggcaggatgc gcacgcgcca gccgcggttg tgcagggtga tgaggtccac 5280
gctggtggcc acctccccgc ggaggggctc gttggtccaa cacaatcgcc ccccttttct 5340
ggagcagaac ggaggcaggg gatctagcaa gttggcgggc ggggggtcgg cgtcgatggt 5400
aaatatgccg ggtagcagaa ttttattaaa ataatcgatt tcggtgtccg tgtcttgcaa 5460
cgcgtcttcc cacttcttca ccgccagggc cctttcgtag ggattcaggg gcggtcccca 5520
gggcatgggg tgggtcaggg ccgaggcgta catgccgcag atgtcgtaca cgtacagggg 5580
ctccctcaac accccgatgt aagtggggta acagcgcccc ccgcggatgc tggctcgcac 5640
gtagtcgtac atctcgtgag agggagccat gagcccgtct cccaagtggg tcttgtgggg 5700
tttttcggcc cggtagagga tctgcctgaa gatggcgtgg gagttggaag agatagtggg 5760
gcgttggaag acgttaaagt tggctccggg cagtcccacg gagtcttgga tgaactgggc 5820
gtaggattcc cggagcttgt ccaccagggc tgcggttacc agcacgtcga gagcgcagta 5880
gtccaacgtc tcgcggacca ggttgtaggc cgtctcttgt tttttctccc acagttcgcg 5940
attgaggagg tattcctcgc ggtctttcca gtactcttcg gcgggaaatc ctttttcgtc 6000
cgctcggtaa gaacctaaca tgtaaaattc gttcacggct ttgtatggac aacagccttt 6060
ttctaccggc agggcgtacg cttgagcggc ctttctgaga gaggtgtggg tgagggcgaa 6120
ggtgtcccgc accatcactt tcaggtactg atgtttgaag tccgtgtcgt cgcaggcgcc 6180
ctgttcccac agcgtgaagt cggtgcgctt tttctgcctg ggattgggga gggcgaatgt 6240
gacgtcgtta aagaggattt tcccggcgcg gggcatgaag ttgcgagaga tcctgaaggg 6300
tccgggcacg tccgagcggt tgttgatgac ttgcgccgcc aggacgatct cgtcgaagcc 6360
gttgatgttg tggcccacga tgtaaagttc gataaagcgc ggctgtccct tgagggccgg 6420
cgcttttttc aactcctcgt aggtgagaca gtccggcgag gagagaccca gctccgcccg 6480
ggcccagtcg gagagctgag ggttagccgc gaggaaagag ctccacaggt caagggctag 6540
cagagtttgc aagcggtcgc ggaactcgcg aaactttttc cccacggcca ttttctccgg 6600
cgtcaccacg tagaaagtgc aggggcggtc gttccagacg tcccatcgga gctctagggc 6660
cagctcgcag gcttgacgaa cgagggtctc ctcgcccgag acgtgcatga ccagcatgaa 6720
gggtaccaac tgtttcccga acgagcccat ccatgtgtag gtttctacgt cgtaggtgac 6780
aaagagccgc tgggtgcgcg cgtgggagcc gatcgggaag aagctgatct cctgccacca 6840
gttggaggaa tgggtgttga tgtggtgaaa gtagaagtcc cgccggcgca cagagcattc 6900
gtgctgatgt ttgtaaaagc gaccgcagta gtcgcagcgc tgcacgctct gtatctcctg 6960
aatgagatgc gcttttcgcc cgcgcaccag aaaccggagg gggaagttga gacgggggct 7020
tggtggggcg gcatcccctt cgccttggcg gtgggagtct gcgtctgcgc cctccttctc 7080
tgggtggacg acggtgggga cgacgacgcc ccgggtgccg caagtccaga tctccgccac 7140
ggaggggcgc aggcgttgca ggaggggacg cagctgcccg ctgtccaggg agtcgagggc 7200
ggccgcgctg aggtcggcgg gaagcgtttg caagttcact ttcagaagac cggtaagagc 7260
gtgagccagg tgcacatggt acttgatttc caggggggtg ttggaagagg cgtccacggc 7320
gtagaggagg ccgtgtccgc gcggggccac caccgtgccc cgaggaggtt ttatctcact 7380
cgtcgagggc gagcgccggg gggtagaggc ggctctgcgc cggggggcag cggaggcagt 7440
ggcacgtttt cgtgaggatt cggcagcggt tgatgacgag cccggagact gctggcgtgg 7500
gcgacgacgc ggcggttgag gtcctggatg tgccgtctct gcgtgaagac caccggcccc 7560
cgggtcctga acctgaaaga gagttccaca gaatcaatgt ctgcatcgtt aacggcggcc 7620
tgcctgagga tctcctgtac gtcgcccgag ttgtcttgat aggcgatctc ggccatgaac 7680
tgctccactt cttcctcgcg gaggtcgccg tggcccgctc gctccacggt ggcggccagg 7740
tcgttggaga tgcgacgcat gagttgagag aaggcgttga ggccgttctc gttccacacg 7800
cggctgtaca ccacgtttcc gaaggagtcg cgcgctcgca tgaccacctg ggccacgttg 7860
agttccacgt ggcgggcgaa gacggcgtag tttctgaggc gctggaagag gtagttgagc 7920
gtggtggcga tgtgctcgca gacgaagaag tacatgatcc agcgccgcag ggtcatctcg 7980
ttgatgtctc cgatggcttc gagacgctcc atggcctcgt agaagtcgac ggcgaagttg 8040
aaaaattggg agttgcgggc ggccaccgtg agttcttctt gcaggaggcg gatgagatcg 8100
gcgaccgtgt cgcgcacctc ctgctcgaaa gcgccccgag gcgcctctgc ttcttcctcc 8160
ggctcctcct cttccagggg cacgggttcc tccggcagct ctgcgacggg gacggggcgg 8220
cgacgtcgtc gtctgaccgg caggcggtcc acgaagcgct cgatcatttc gccgcgccgg 8280
cgacgcatgg tctcggtgac ggcgcgtccg ttttcgcgag gtcgcagttc gaagacgccg 8340
ccgcgcagag cgcccccgtg cagggagggt aagtggttag ggccgtcggg cagggacacg 8400
gcgctgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga tctgagaacg 8460
tcgaggtcga cgggatccga gaacttctct aggaaagcgt ctatccaatc gcagtcgcaa 8520
ggtaagctga ggacggtggg ccgctggggg gcgtccgcgg gcagttggga ggtgatgctg 8580
ctgatgatgt aattaaagta ggcggtcttc aggcggcgga tggtggcgag gaggaccacg 8640
tctttgggcc cggcctgttg aatgcgcagg cgctcggcca tgccccaggc ctcgctctga 8700
cagcgacgca ggtctttgta gtagtcttgc atcagtctct ccaccggaac ctctgcttct 8760
cccctgtctg ccatgcgagt cgagccgaac ccccgcaggg gctgcagcaa cgctaggtcg 8820
gccacgaccc tctcggccag cacggcctgt tggatctgcg tgagggtggt ctggaagtcg 8880
tccaggtcca cgaagcggtg ataggccccc gtgttgatgg tgtaggtgca gttggccatg 8940
acggaccagt tgacgacttg catgccgggt tgggtgatct ccgtgtactt gaggcgcgag 9000
taggcgcggg actcgaacac gtagtcgttg catgtgcgta ccagatactg gtagccaacc 9060
aggaagtggg gaggcggttc tcggtacagg ggccagccga ctgtggcggg ggcgccgggg 9120
gacaggtcgt ccagcatgag gcgatggtag tggtagatgt agcgggagag ccaggtgatg 9180
ccggccgagg tggtcgcggc cctggtgaat tcgcggacgc ggttccagat gttgcgcagg 9240
gggcgaaagc gctccatggt gggcacgctc tgccccgtga ggcgggcgca atcttgtacg 9300
ctctagatgg aaaaaagaca gggcggtcat cgactccctt ccgtagctcg gggggtaaag 9360
tcgcaagggt gcggcggcgg ggaaccccgg ttcgagaccg gccggatccg ccgctcccga 9420
tgcgcctggc cccgcatcca cgacgtccgc gtcgagaccc agccgcgacg ctccgcccca 9480
atacggaggg gagtcttttg gtgttttttc gtagatgcat ccggtgctgc ggcagatgcg 9540
acctcagacg cccaccacca ccgccgcggc ggcagtaaac ctgagcggag gcggtgacag 9600
ggaggaggag gagctggctt tagacctgga agagggagag gggctggccc ggctgggagc 9660
gccgtcccca gagagacacc ctagggttca gctcgtgagg gacgccaggc aggcttttgt 9720
gccgaagcag aacctgttta gggaccgcag cggtcaggag gcggaggaga tgcgcgattg 9780
caggtttcgg gcgggtagag agctgagggc gggcttcgat cgggagcggc tcctgagggc 9840
ggaggatttc gagcccgacg agcgttctgg ggtgagcccg gcccgcgctc acgtctcggc 9900
ggccaacctg gtgagcgcgt acgagcagac ggtgaacgag gagcgcaact tccaaaagag 9960
ctttaacaat cacgtgagga ccctgatcgc gagggaggag gtgaccatcg ggctgatgca 10020
tctgtgggac ttcgtggagg cctacgtgca gaacccggcc agcaaacctc tgacggccca 10080
gctgttcctg atcgtgcagc acagccgcga caacgagacg ttccgcgacg ccatgttgaa 10140
catcgcggag cccgagggtc gctggctctt ggatctgatt aacatcctgc agagcatcgt 10200
ggtgcaggag aggggcctca gcttagcgga caaggtggcg gccattaact attcgatgca 10260
gagcctgggg aagttctacg ctcgcaagat ctacaagagc ccttacgtgc ccatagacaa 10320
ggaggtgaag atagacagct tttacatgcg catggcgctg aaggtgctga cgctgagcga 10380
cgacctcggc gtgtaccgta acgacaagat ccacaaggcg gtgagcgcca gccgccggcg 10440
ggagctgagc gacagggagc tgatgcacag cctgcagagg gcgctggcgg gcgccgggga 10500
cgaggagcgc gaggcttact tcgacatggg agccgatctg cagtggcgtc ccagcgcgcg 10560
cgccttggag gcggcgggct accccgacga ggaggatcgg gacgatttgg aggaggcagg 10620
cgagtacgag gacgaagcct gaccgggcag gtgttgtttt agatgcagcg gccggcggac 10680
ggggccaccg cggatcccgc acttttggca tccatgcaga gtcaaccttc gggcgtgacc 10740
gcctccgatg actgggcggc ggccatggac cgcattatgg cgctgactac ccgcaacccc 10800
gaggctttta gacagcaacc ccaggccaac cgtttttcgg ccatcttgga agcggtggtg 10860
ccctcccgca ccaaccccac acacgagaaa gtcctgacta tcgtgaacgc cctggtagac 10920
agcaaggcca tccgccgcga cgaggcgggc ttgatttaca acgctctgct ggaacgggtg 10980
gcgcgctaca acagcactaa cgttcagacc aatctggatc gcctcaccac cgacgtgaag 11040
gaggcgctgg ctcagaagga gcggtttctg agggacagca atctgggctc tctggtggca 11100
ctcaacgcct tcctgagcac gcagccggcc aacgtgcccc gcgggcagga ggactacgtg 11160
agcttcatca gcgctctgag gctgctggtg tccgaggtgc cccagagcga ggtgtatcag 11220
tctgggccgg attacttctt ccagacgtcc cgacagggct tgcaaacggt gaacctgact 11280
caggccttta aaaacttgca aggcatgtgg ggcgttaagg ccccggtggg cgatcgagcc 11340
accatctcca gtctgctgac ccccaacact cgcctgctgc tgctcttgat cgcgccgttc 11400
accaacagta gcactatcag ccgtgactcg tacctgggtc atctcatcac tttgtaccgc 11460
gaggccatcg gtcaggctca gatcgacgag cacacatatc aggagatcac taacgtgagc 11520
cgggccctgg gtcaggaaga taccggcagc ctggaagcca cgttgaactt tttgctaacc 11580
aaccggaggc aaaaaatacc ctcccagttt acgttaagcg ccgaggagga gaggattctg 11640
cgatacgtgc agcagtccgt gagtctgtac ttgatgcggg agggcgccac cgcttccacg 11700
gctttagaca tgacggctcg gaacatggaa ccgtcctttt actccgccca ccggccgttc 11760
attaaccgtc tgatggacta cttccatcgc gcggccgcca tgaacgggga gtacttcacc 11820
aatgccatcc tgaatccgca ttggatgccc ccgtccggct tctacaccgg cgagtttgac 11880
ctgcccgaag ccgacgacgg ctttctttgg gacgacgtgt ccgacagcat tttcacgccg 11940
ggcaatcgcc gattccagaa gaaggagggc ggagacgagc tccccctctc cagcgtggag 12000
gcggcctcta ggggagagag tccctttccc agtctgtctt ccgccagcag tggtcgggta 12060
acgcgcccgc ggttgccggg ggagagcgac tacctgaacg accccttgct gcggccggct 12120
aggaagaaaa atttccccaa caacggggtg gaaagcttgg tggataaaat gaatcgttgg 12180
aagacctacg cccaggagca gcgggagtgg gaggacagtc agccgcgacc gctggttccg 12240
ccgcactggc gtcgtcagag agaagacccg gacgactccg cagacgatag tagcgtgttg 12300
gacctgggag ggagcggagc caaccccttt gctcacttgc aacccaaggg gcgttccagt 12360
cgcctctact aataaaaaag acgcggaaac ttaccagagc catggccaca gcgtgtgtcc 12420
tttcttcctc tctttcttcc tcggcgcggc aga atg aga aga gcg gtg aga gtc 12474
Met Arg Arg Ala Val Arg Val
1 5
acg ccg gcg gcg tat gag ggt ccg ccc cct tct tac gaa agc gtg atg 12522
Thr Pro Ala Ala Tyr Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met
10 15 20
gga tca gcg aac gtg ccg gcc acg ctg gag gcg cct tac gtt cct ccc 12570
Gly Ser Ala Asn Val Pro Ala Thr Leu Glu Ala Pro Tyr Val Pro Pro
25 30 35
aga tac ctg gga cct acg gag ggc aga aac agc atc cgt tac tcc gag 12618
Arg Tyr Leu Gly Pro Thr Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu
40 45 50 55
ctg gca ccc ctg tac gat acc acc aag gtg tac ctg gtg gac aac aag 12666
Leu Ala Pro Leu Tyr Asp Thr Thr Lys Val Tyr Leu Val Asp Asn Lys
60 65 70
tcg gcg gac atc gcc tcc ctg aat tat caa aac gat cac agc aat ttt 12714
Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn Asp His Ser Asn Phe
75 80 85
ctg act acc gtg gtg cag aac aat gac ttc acc ccg acg gag gcg ggc 12762
Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala Gly
90 95 100
acg cag acc att aac ttt gac gag cgt tcc cgc tgg ggc ggt cag ctg 12810
Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu
105 110 115
aaa acc atc ctg cac acc aac atg ccc aac atc aac gag ttc atg tcc 12858
Lys Thr Ile Leu His Thr Asn Met Pro Asn Ile Asn Glu Phe Met Ser
120 125 130 135
acc aac aag ttc agg gcc agg ctg atg gtt aaa aag gct gaa aac cag 12906
Thr Asn Lys Phe Arg Ala Arg Leu Met Val Lys Lys Ala Glu Asn Gln
140 145 150
cct ccc gag tac gaa tgg ttt gag ttc acc att ccc gag ggc aac tat 12954
Pro Pro Glu Tyr Glu Trp Phe Glu Phe Thr Ile Pro Glu Gly Asn Tyr
155 160 165
tcc gag acc atg act atc gat ctg atg aac aat gcg atc gtg gac aat 13002
Ser Glu Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Val Asp Asn
170 175 180
tac ctg caa gtg ggg agg cag aac ggg gta ttg gaa agc gat atc ggc 13050
Tyr Leu Gln Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly
185 190 195
gta aaa ttt gat acc aga aac ttc cga ctg ggg tgg gat ccc gtg acc 13098
Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr
200 205 210 215
aag ctg gtg atg cca ggc gtg tac acc aac gag gct ttt cac ccc gac 13146
Lys Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp
220 225 230
atc gtg ctg ctg ccg ggg tgc ggt gtg gac ttc act cag agc cgt ttg 13194
Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Gln Ser Arg Leu
235 240 245
agt aac ctg tta ggg atc aga aag cgc cgc ccc ttc caa gag ggc ttt 13242
Ser Asn Leu Leu Gly Ile Arg Lys Arg Arg Pro Phe Gln Glu Gly Phe
250 255 260
cag atc atg tat gag gac ctg gaa gga ggt aac att cca ggt ttg cta 13290
Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Gly Leu Leu
265 270 275
gac gtg ccg gcg tat gaa gag agt gtt aaa cag gcg gag gcg cag gga 13338
Asp Val Pro Ala Tyr Glu Glu Ser Val Lys Gln Ala Glu Ala Gln Gly
280 285 290 295
cga gag att cga ggc gac acc ttt gcc acg gaa cct cac gaa ctg gta 13386
Arg Glu Ile Arg Gly Asp Thr Phe Ala Thr Glu Pro His Glu Leu Val
300 305 310
ata aaa cct ctg gaa caa gac agt aaa aaa cgg agt tac aac att ata 13434
Ile Lys Pro Leu Glu Gln Asp Ser Lys Lys Arg Ser Tyr Asn Ile Ile
315 320 325
tcc ggc act atg aat acc ttg tac cgg agc tgg ttt ctg gct tac aac 13482
Ser Gly Thr Met Asn Thr Leu Tyr Arg Ser Trp Phe Leu Ala Tyr Asn
330 335 340
tac ggg gat ccc gaa aag gga gtg aga tca tgg acc ata ctc acc acc 13530
Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Ile Leu Thr Thr
345 350 355
acg gac gtg acc tgc ggc tcg cag caa gtg tac tgg tcc ctg ccg gat 13578
Thr Asp Val Thr Cys Gly Ser Gln Gln Val Tyr Trp Ser Leu Pro Asp
360 365 370 375
atg atg caa gac ccg gtc acc ttc cgc ccc tcc acc caa gtc agc aac 13626
Met Met Gln Asp Pro Val Thr Phe Arg Pro Ser Thr Gln Val Ser Asn
380 385 390
ttc ccg gtg gtg ggc acc gag ctg ctg ccc gtc cat gcc aag agc ttc 13674
Phe Pro Val Val Gly Thr Glu Leu Leu Pro Val His Ala Lys Ser Phe
395 400 405
tac aac gaa cag gcc gtc tac tcg caa ctc att cgc cag tcc acc gcg 13722
Tyr Asn Glu Gln Ala Val Tyr Ser Gln Leu Ile Arg Gln Ser Thr Ala
410 415 420
ctt acc cac gtg ttc aat cgc ttt ccc gag aac cag att ctg gtg cgc 13770
Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg
425 430 435
cct ccc gct cct acc att acc acc gtc agt gaa aac gtt ccc gcc ctc 13818
Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu
440 445 450 455
aca gat cac gga acc ctg ccg ctg cgc agc agt atc agt gga gtt cag 13866
Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Ser Gly Val Gln
460 465 470
cgc gtg acc atc acc gac gcc aga cgt cga acc tgt ccc tac gtt tac 13914
Arg Val Thr Ile Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr
475 480 485
aaa gct ctt ggc gta gtg gct cct aaa gtg ctc tct agt cgc acc ttc 13962
Lys Ala Leu Gly Val Val Ala Pro Lys Val Leu Ser Ser Arg Thr Phe
490 495 500
taa acatgtccat cctcatctct cccgataaca acaccggctg gggactgggc 14015
tccggcaaga tgtacggcgg agccaaaagg cgctccagtc agcacccagt tcgagttcgg 14075
ggccacttcc gtgctccctg gggagcttac aagcgaggac tctcgggccg aacggcggta 14135
gacgatacca tagatgccgt gattgccgac gcccgccggt acaaccccgg accggtcgct 14195
agcgccgcct ccaccgtgga ttccgtgatc gacagcgtgg tagctggcgc tcgggcctat 14255
gctcgccgca agaggcggct gcatcggaga cgtcgcccca ccgccgccat gctggcagcc 14315
agggccgtgc tgaggcgggc ccggagggta ggcagaaggg ctatgcgccg cgctgccgcc 14375
aacgccgccg ccgggagggc ccgccgacag gctgcccgcc aggctgctgc cgccatcgct 14435
agcatggcca gacccaggag agggaacgtg tactgggtgc gcgattctgt gacgggagtc 14495
cgagtgccgg tgcgcagccg acctccccga agttagaaga tccaagctgc gaagacggcg 14555
gtactgagtc tccctgttgt tatcagccca acatgagcaa gcgcaagttt aaagaagaac 14615
tgctgcagac gctggtgcct gagatctatg gccctccgga cgtgaagcct gacattaagc 14675
cccgcgatat caagcgtgtt aaaaagcggg aaaagaaaga ggaactcgcg gtggtagacg 14735
atggcggagt ggaatttatt aggagtttcg ccccgcgacg cagggttcaa tggaaagggc 14795
ggcgggtaca acgcgttttg aggccgggca ccgcggtagt ttttaccccg ggagagcggt 14855
cggccgttag gggtttcaaa aggcagtacg acgaggtgta cggcgacgag gacatattgg 14915
aacaggcggc tcaacagatc ggagaatttg cctacggaaa gcgttcgcgt cgcgaagacc 14975
tggccatcgc tttagacagc ggcaacccca cgcccagcct caaacctgtg acgctgcagc 15035
aggtgctccc cgtgagcgcc agcacggaca gcaagagggg aataaaaaga gaaatggaag 15095
atctgcagcc caccatccag ctcatggtcc ctaaacggca gaggctggaa gaggtcctgg 15155
agaaaatgaa agtggaccca agcatagagc cggacgtcaa agtcaggccg atcaaagaag 15215
tggcccctgg tctcggggtg cagacggtgg atatccagat ccccgtcacg tcagcttcga 15275
ccgccgtgga agccatggaa acgcaaacgg aaacccctgc cgcgatcggt accagggaag 15335
tggcgttgca aaccgacccc tggtacgaat acgccgcccc tcggcgtcag aggcgacccg 15395
ctcgttacgg ccccgccaac gccatcatgc cagaatatgc gctgcatccg tctatcctgc 15455
ccacccccgg ctaccgggga gtgacgtatc gcccgtcagg aacccgccgc cgaacccgtc 15515
gccgccgccg ctcccgtcgt gctctggccc ccgtgtcggt gcgccgcgta acacgccggg 15575
gaaagacagt taccattccc aacccgcgct accaccctag catcctttaa tgactctgcc 15635
gttttgcaga tggctctgac ttgccgcgtg cgccttcccg ttccgcacta tcgaggaaga 15695
tctcgtcgta ggagaggcat ggcgggtagt ggtcgccggc gggctttgcg caggcgcatg 15755
aaaggcggaa ttttacccgc tctgataccc ataatcgccg ccgccatcgg tgccataccc 15815
ggcgtcgctt cagtggcctt gcaagcagct cgtaataaat aaacgaaggc ttttgcactt 15875
atgtcctggt cctgactatt ttatgcagaa agagcatgga agacatcaat tttacgtcgc 15935
tggctccgcg gcacggctcg cggccgctca tgggcacctg gaacgacatc ggcaccagtc 15995
agctcaacgg gggcgctttc aattggggga gcctttggag cggcattaaa aactttggct 16055
ccacgattaa atcctacggc agcaaagcct ggaacagtag tgctggtcag atgctccgag 16115
ataaactgaa ggacaccaac ttccaagaaa aagtggtcaa tggggtggtg accggcatcc 16175
acggcgcggt agatctcgcc aaccaagcgg tgcagaaaga gattgacagg cgtttggaaa 16235
gctcgcgggt gccgccgcag agaggggatg aggtggaggt cgaggaagta gaagtagagg 16295
aaaagctgcc cccgctggag aaagttcccg gtgcgcctcc gagaccgcag aagcgaccca 16355
ggccagaact agaagaaact ctggtgacgg agagcaagga gcctccctcg tacgagcaag 16415
ccttgaaaga gggcgcctct ccaccctacc caatgacaaa accgatcgcg cctatggctc 16475
ggccggtgta cgggaaggac tacaagcctg tcacgctaga gctccccccg ccgccaccgc 16535
cgccccccac gcgcccgacc gttccccccc ccctgccggc tccgtcggcg ggacccgtgt 16595
ccgcacccgt cgccgtgcct ctgccagccg cccgcccagt ggccgtggcc actgccagaa 16655
accccagagg ccagagagga gccaactggc aaagcacgct gaacagcatc gtgggcctgg 16715
gagtgaaaag cctgaaacgc cgccgttgct attattaaaa gtgtagctaa aaaatttccc 16775
gttgtatacg cctcctatgt taccgccaga gacgcgtgac tgtcgccgcg agcgccgctt 16835
tcaag atg gcc acc cca tcg atg atg ccg cag tgg tct tac atg cac atc 16885
Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile
505 510 515
gcc ggg cag gac gcc tcg gag tac ctg agc ccc ggt ctc gtg cag ttc 16933
Ala Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe
520 525 530
gcc cgc gcc acc gac acc tac ttc agc ttg gga aac aag ttt aga aac 16981
Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn
535 540 545 550
ccc acc gtg gcc ccc acc cac gat gta acc acg gac cgc tcg caa agg 17029
Pro Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg
555 560 565
ctg acc ctg cgt ttt gtg ccc gta gac cgg gag gac acc gcg tac tct 17077
Leu Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser
570 575 580
tac aaa gtg cgc tac acg ctg gcc gta ggg gac aac cga gtg ctg gac 17125
Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp
585 590 595
atg gcc agc acc tac ttt gac atc cgg gga gtg ctg gat cgc ggt ccc 17173
Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro
600 605 610
agt ttt aag ccc tac tcg ggt acc gcg tac aat tcc ctg gct ccc aag 17221
Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys
615 620 625 630
ggc gct ccc aac cct gca gaa tgg acg aat tca gac agc aaa gtt aaa 17269
Gly Ala Pro Asn Pro Ala Glu Trp Thr Asn Ser Asp Ser Lys Val Lys
635 640 645
gtg agg gca cag gcg cct ttt gtt agc tcg tat ggt gct aca gcg att 17317
Val Arg Ala Gln Ala Pro Phe Val Ser Ser Tyr Gly Ala Thr Ala Ile
650 655 660
aca aaa gag ggt att cag gtg gga gta acc tta aca gac tcc gga tca 17365
Thr Lys Glu Gly Ile Gln Val Gly Val Thr Leu Thr Asp Ser Gly Ser
665 670 675
aca cca cag tat gca gat aaa acg tat cag cct gag ccg caa att gga 17413
Thr Pro Gln Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Ile Gly
680 685 690
gaa cta cag tgg aac agc gat gtt gga acc gat gac aaa ata gca gga 17461
Glu Leu Gln Trp Asn Ser Asp Val Gly Thr Asp Asp Lys Ile Ala Gly
695 700 705 710
aga gtg cta aag aaa aca acg ccc atg ttc cct tgt tac ggc tca tat 17509
Arg Val Leu Lys Lys Thr Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr
715 720 725
gcc agg ccc act aat gaa aaa gga gga cag gca aca ccg tcc gct agt 17557
Ala Arg Pro Thr Asn Glu Lys Gly Gly Gln Ala Thr Pro Ser Ala Ser
730 735 740
caa gac gtg caa aat ccc gaa tta caa ttt ttt gcc tct act aat gtc 17605
Gln Asp Val Gln Asn Pro Glu Leu Gln Phe Phe Ala Ser Thr Asn Val
745 750 755
gcc aat aca cca aaa gca gtt cta tat gcg gag gac gtg tca att gaa 17653
Ala Asn Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ser Ile Glu
760 765 770
gcg cca gac act cac ttg gtg ttc aaa cca aca gtc act gaa ggc att 17701
Ala Pro Asp Thr His Leu Val Phe Lys Pro Thr Val Thr Glu Gly Ile
775 780 785 790
aca agt tca gag gct cta ctg acc caa caa gct gct ccc aac cgt cca 17749
Thr Ser Ser Glu Ala Leu Leu Thr Gln Gln Ala Ala Pro Asn Arg Pro
795 800 805
aac tac ata gcc ttt aga gat aat ttt att ggt ctc atg tac tac aat 17797
Asn Tyr Ile Ala Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn
810 815 820
agc aca ggt aac atg gga gta ctg gca ggc cag gct tct cag cta aat 17845
Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn
825 830 835
gca gtt gtt gac ctg caa gac aga aat act gag ctg tcc tac caa ctc 17893
Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu
840 845 850
atg ttg gac gcc ctc gga gac cgc agt cgg tac ttt tct atg tgg aac 17941
Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn
855 860 865 870
caa gct gtg gat agt tac gat cct gat gta aga atc ata gaa aac cat 17989
Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His
875 880 885
ggc gta gaa gat gaa ttg cct aat tat tgc ttt cct ttg gga ggc atg 18037
Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met
890 895 900
gca gta acc gac acc tac tcg cct ata aag gtt aat gga gga ggc aat 18085
Ala Val Thr Asp Thr Tyr Ser Pro Ile Lys Val Asn Gly Gly Gly Asn
905 910 915
gga tgg gaa gcc aat aac ggc gtt ttc acc gaa aga gga gtg gaa ata 18133
Gly Trp Glu Ala Asn Asn Gly Val Phe Thr Glu Arg Gly Val Glu Ile
920 925 930
ggt tca ggg aac atg ttt gcc atg gag att aac ctg caa gcc aac cta 18181
Gly Ser Gly Asn Met Phe Ala Met Glu Ile Asn Leu Gln Ala Asn Leu
935 940 945 950
tgg cgt agc ttt ctg tac tcc aat att ggg ctg tac ctg cca gac tct 18229
Trp Arg Ser Phe Leu Tyr Ser Asn Ile Gly Leu Tyr Leu Pro Asp Ser
955 960 965
ctc aaa atc act cct gac aac atc aca ctc cca gag aac aaa aac acc 18277
Leu Lys Ile Thr Pro Asp Asn Ile Thr Leu Pro Glu Asn Lys Asn Thr
970 975 980
tat cag tat atg aac ggt cgc gtg acg cca ccc ggg ctg gtt gac acc 18325
Tyr Gln Tyr Met Asn Gly Arg Val Thr Pro Pro Gly Leu Val Asp Thr
985 990 995
tac gtt aac gtg ggc gcg cgc tgg tcc ccc gat gtc atg gac agt 18370
Tyr Val Asn Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser
1000 1005 1010
att aac cct ttt aat cac cac cgc aac gcc gga ctc cgc tac cgt 18415
Ile Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg
1015 1020 1025
tcc atg ctc ctg gga aac gga cgc tac gtg ccc ttc cac atc cag 18460
Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln
1030 1035 1040
gtg ccc cag aaa ttc ttt gca att aaa aac ctg ctg ctg ctc ccc 18505
Val Pro Gln Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro
1045 1050 1055
ggt tcc tac acc tac gag tgg aac ttc cgc aag gac gtg aac atg 18550
Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met
1060 1065 1070
atc ttg cag agc tcg ctg ggc aat gac ctg cga gtg gac ggg gcc 18595
Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala
1075 1080 1085
agc atc cgc ttc gac agc atc aac ctg tac gcc aac ttt ttc ccc 18640
Ser Ile Arg Phe Asp Ser Ile Asn Leu Tyr Ala Asn Phe Phe Pro
1090 1095 1100
atg gcc cac aac acg gcc tcc acc ctg gaa gcc atg ctg cgc aac 18685
Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn
1105 1110 1115
gac acc aac gac caa tct ttc aac gac tac ctg tgc gcg gcc aac 18730
Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn
1120 1125 1130
atg ctg tac ccc atc ccc gcc aac gcc acc agc gtg ccc atc tcc 18775
Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Ser Val Pro Ile Ser
1135 1140 1145
att ccc tct cgc aac tgg gca gcc ttc agg ggc tgg agt ttc acc 18820
Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr
1150 1155 1160
cgc ctc aaa acc aag gag acc ccc tcg ctg ggc tcc ggg ttc gac 18865
Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp
1165 1170 1175
ccc tac ttc gtc tac tcc ggc tcc atc ccc tac ctg gac ggc acc 18910
Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr
1180 1185 1190
ttc tac ctc aac cat act ttc aaa aag gtg tca atc atg ttc gac 18955
Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Met Phe Asp
1195 1200 1205
tcc tcc gtc agc tgg ccc ggc aac gac cgt ctg ctg acg ccc aac 19000
Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn
1210 1215 1220
gag ttc gaa atc aag cgt tcg gtg gac ggt gaa ggg tac aac gtg 19045
Glu Phe Glu Ile Lys Arg Ser Val Asp Gly Glu Gly Tyr Asn Val
1225 1230 1235
gct cag agc aac atg acc aag gac tgg ttc ctg att cag atg ctc 19090
Ala Gln Ser Asn Met Thr Lys Asp Trp Phe Leu Ile Gln Met Leu
1240 1245 1250
agc cac tac aac atc ggc tac cag ggc ttc tac gtg ccc gaa aat 19135
Ser His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Asn
1255 1260 1265
tac aag gac cgc atg tac tct ttc ttc aga aac ttc caa ccc atg 19180
Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met
1270 1275 1280
agc cgc caa att gta gat tca acg gct tac act aat tat cag gat 19225
Ser Arg Gln Ile Val Asp Ser Thr Ala Tyr Thr Asn Tyr Gln Asp
1285 1290 1295
gtg aaa ctg cca tac cag cat aac aac tca ggg ttc gtg ggc tac 19270
Val Lys Leu Pro Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr
1300 1305 1310
atg gga ccc acc atg cga gag ggg cag gcc tac ccg gcc aac tat 19315
Met Gly Pro Thr Met Arg Glu Gly Gln Ala Tyr Pro Ala Asn Tyr
1315 1320 1325
ccc tat ccc ctg att ggg gcc acc gcc gtg ccc agc ctc acg cag 19360
Pro Tyr Pro Leu Ile Gly Ala Thr Ala Val Pro Ser Leu Thr Gln
1330 1335 1340
aaa aag ttc ctc tgc gac cgg gtg atg tgg agg atc ccc ttc tct 19405
Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser
1345 1350 1355
agc aac ttc atg tct atg ggc tcc ctc acc gac ctg ggg cag aac 19450
Ser Asn Phe Met Ser Met Gly Ser Leu Thr Asp Leu Gly Gln Asn
1360 1365 1370
atg ctg tac gcc aac tcc gct cac gcc ttg gat atg acc ttt gag 19495
Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu
1375 1380 1385
gtg gat ccc atg gat gag ccc acg ctt ctc tat gtt ctg ttt gaa 19540
Val Asp Pro Met Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu
1390 1395 1400
gtc ttc gac gtg gtg cgc atc cac cag ccg cac cgc ggc gtc atc 19585
Val Phe Asp Val Val Arg Ile His Gln Pro His Arg Gly Val Ile
1405 1410 1415
gag gcc gtc tac ctg cgc aca cct ttc tct gcc ggt aac gcc acc 19630
Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr
1420 1425 1430
acc taa agaagccgat gggctccagc gaacaggagc tgcaggccat tgttcgcgac 19686
Thr
ctgggctgcg ggccctactt tttgggcacc ttcgacaagc gttttcccgg cttcatgtcc 19746
ccccacaagc cggcctgtgc catcgttaac acggccggac gggagaccgg gggggtccac 19806
tggctcgcct tcgcctggaa cccgcgtaac cgcacctgct acctgttcga cccttttggt 19866
ttctccgacg aaaggctgaa gcagatctac cagttcgagt acgaggggct cctcaagcgc 19926
agcgctctgg cctccacgcc cgaccactgc gtcaccctgg aaaagtccac ccaaacggtc 19986
caggggcccc tctcggccgc ctgcgggctc ttctgttgca tgtttttgca cgccttcgtg 20046
cactggcctc acacccccat ggatcacaac cccaccatgg atctgctcac cggagtgccc 20106
aacagcatgc ttcacagccc ccaggtcgcc cccaccctgc gccgtaacca ggaacacctg 20166
tatcgctttc tggggaaaca ctctgcctat tttcgccgcc accggcagcg catcgaacgg 20226
gccacggcct tcgaaagcat gagccaaaga gtgtaatcaa taaaaaacat ttttatttga 20286
catgatacgc gcttctggcg ttttattaaa aatcgaaggg ttcgagggag gggtcctcgt 20346
gcccgctggg gagggacacg ttgcgatact ggaaacgggc gctccaacga aactcgggga 20406
tcaccagccg cggcaggggc acgtcttcta ggttctgctt ccaaaactgc cgcaccagct 20466
gcagggctcc catgacgtcg ggcgccgata tcttgaagtc gcagttaggg ccggagctcc 20526
cgcggctgtt gcggaacacg gggttggcac actggaacac cagcacgccg gggttgtgga 20586
tactggccag ggccgtcggg tcggtcacct ccgacgcatc cagatcctcg gcgttgctca 20646
gggcaaacgg ggtcagcttg cacatctgcc gcccaatctg gggtactagg tcgcgcttgt 20706
tgaggcagtc gcagcgcaga gggatcagga tgcgtcgctg cccgcgttgc atgatagggt 20766
aactcgccgc caggaactcc tccatttgac ggaaggccat ctgggctttg ccgccctcgg 20826
tgtagaatag cccgcaggac ttgctagaga atacgttatg accgcagttg acgtcctccg 20886
cgcagcagcg ggcgtcttcg ttcttcagct gaaccacgtt gcggccccaa cggttctgga 20946
ccaccttggc tctagtgggg tgctccttca gcgcccgctg tccgttctcg ctggttacat 21006
ccatttccaa cacgtgctcc ttgcagacca tctccactcc gtggaagcaa aacaggacgc 21066
cctcctgctg ggtactgcga tgctcccata cggcgcatcc ggtgggctcc cagctcttgt 21126
gttttacccc cgcgtaggct tccatgtaag ccataaggaa tctgcccatc agctcggtga 21186
aggtcttctg gttggtgaag gttagcggca ggccgcggtg ctcctcgttc aaccaagttt 21246
gacagatctt gcggtacacc gctccctggt cgggcagaaa cttaaaagcc gctctgctgt 21306
cgttgtctac gtggaacttc tccattaaca tcatcatggt ttccataccc ttctcccacg 21366
ctgtcaccag tggtttgctg tcggggttct tcaccaacac ggcggtagag gggccctcgc 21426
cggccccgac gtccttcatg gtcattcttt gaaactccac ggagccgtcc gcgcgacgta 21486
ctctgcgcac cggagggtag ctgaagccca cctccaccac ggtgccttcg ccctcgctgt 21546
cggagacaat ctccggggat ggcggcggcg cgggtgtcgc cttgcgagcc ttcttcttgg 21606
gagggagctg aggcgcctcc tgctcgcgct cggggctcat ctcccgcaag tagggggtaa 21666
tggagctgcc tgcttggttc tgacggttgg ccattgtatc ctaggcagaa agacatggag 21726
cttatgcgcg aggaaacttt aaccgccccg tcccccgtca gcgacgaaga tgtcatcgtc 21786
gaacaggacc cgggctacgt tacgccgccc gaggatctgg aggggcctga ccggcgcgac 21846
gctagtgagc ggcaggaaaa tgagaaagag gaggcctgct acctcctgga aggcgacgtt 21906
ttgctaaagc atttcgccag gcagagcacc atagttaagg aggccttgca agaccgctcc 21966
gaggtgccct tggacgtcgc cgcgctctcc caggcctacg aggcgaacct tttctcgcct 22026
cgagtgcctc cgaagagaca gcccaacggc acctgcgagc ccaacccgcg actcaacttc 22086
taccccgtgt tcgccgtacc agaggcgctg gccacctatc acattttttt caaaaaccaa 22146
cgcatccccc tatcgtgccg ggccaaccgc accgcggccg ataggaatct caggcttaaa 22206
aacggagcca acatacctga tatcacgtcg ctggaggaag tgcccaagat tttcgagggt 22266
ctgggtcgag atgagaagcg ggcggcgaac gctctgcaga aagaacagaa agagagtcag 22326
aacgtgctgg tggagctgga gggggacaac gcgcgtctgg ccgtcctcaa acgctgcata 22386
gaagtctccc acttcgccta ccccgccctc aacttgccac ccaaagttat gaaatcggtc 22446
atggatcagc tgctcatcaa gagagctgag cccctggatc ccgaccaccc cgaggcggaa 22506
aactcagagg acggaaagcc cgtcgtcagc gacgaggagc tcgagcggtg gctggaaacc 22566
agggaccccc aacagttgca agagaggcgc aagatgatga tggcggccgt gctggtcacc 22626
gtggagctgg aatgcctgca acggtttttc agcgacgtgg agacgctacg caaaatcggg 22686
gaatccctgc actacacctt ccgccagggc tacgtccgcc aggcctgcaa gatctccaac 22746
gtggagctca gcaacctggt ctcctacatg ggcatcctcc acgagaaccg gctggggcag 22806
agcgtgctgc actgcacctt gcaaggcgag gcgcggcggg actacgtgcg agactgcatc 22866
tacctcttcc tcaccctcac ctggcagacc gccatgggcg tctggcagca gtgcttggaa 22926
gagagaaacc tcaaagagct agacaaactc ctctgccgcc agcggcgcgc cctgtggtcc 22986
ggtttcagcg agcgcacggt cgccagcgct ctggcggaca tcatcttccc ggagcgcctg 23046
atgaaaacct tgcaaaacgg cctgccggat ttcatcagtc aaagcatttt gcaaaacttc 23106
cgctcttttg tcctggaacg ctccgggatc ttgcccgcca tgagctgcgc gctaccttct 23166
gactttgtcc ccctctccta ccgcgagtgc cctcccccac tgtggagcca ctgctacctc 23226
ttccaactgg ccaactttct ggcctaccac tccgacctca tggaagacgt aagcggagag 23286
ggtttactgg agtgccactg ccgctgcaac ctgtgcaccc cccacagatc gctggcctgc 23346
aacaccgagc tactcagcga aacccaggtc ataggtacct tcgagatcca ggggccccag 23406
cagcaagagg gtgcttccgg cttgaagctc actccggcgc tgtggacctc ggcttactta 23466
cgcaaatttg tagccgagga ctaccacgcc cacaaaattc agttttacga agaccaatct 23526
cgaccaccga aagcccccct cacggcctgc gtcatcaccc agagcaagat cctggcccaa 23586
ttgcaatcca tcaaccaagc gcgccgcgat ttccttttga aaaagggtcg gggggtgtac 23646
ctggaccccc agaccggcga ggaactcaac ccgtccacac tctccgtcga agcagccccc 23706
ccgagacatg ccgcccaagg gaaccgccaa gcagctgatc gctcggcaga gagcgaagaa 23766
gcaagagctg ctccagcagc aggtggagga cgaggaagag atgtgggaca gccaggcaga 23826
ggaggtgtca gaggacgagg aggagatgga aagctgggac agcctagacg aggaggagga 23886
cgagctttca gaggaagagg cgaccgaaga aaaaccacct gcatccagcg cgccttctct 23946
gagccgacag ccgaagcccc ggcccccgac gcccccggcc ggctcactca aagccagccg 24006
taggtgggac gccaccgaat ctccagcggc agcggcaacg gcagcgggta aggccaaacg 24066
cgagcggcgg gggtattgct cctggcgggc ccacaaaagc agtattgtga actgcttgca 24126
acactgcggg ggaaacatct cctttgcccg acgctacctc ctcttccatc acggtgtggc 24186
cttccctcgc aacgttctct attattaccg tcatctctac agcccctacg aaacgctcgg 24246
agaaaaaagc taaggcctcc tccgccgcga ggaaaaactc cgccgccgct gccgccgcca 24306
aggatccacc ggccaccgaa gagctgagaa agcgcatctt tcccactctg tatgctatct 24366
ttcagcaaag ccgcgggcag caccctcagc gcgaactgaa aataaaaaac cgctccttcc 24426
gctcgctcac ccgcagctgt ctgtaccaca agagagaaga ccagctgcag cgcaccctgg 24486
acgacgccga agcactgttc agcaaatact gctcagcgtc tcttaaagac taaaagaccc 24546
gcgctttttc cccctcggcc gccaaaaccc acgtcatcgc cagcatgagc aaggagattc 24606
ccacccccta catgtggagc tatcagcccc agatgggcct ggccgcgggg gccgcccagg 24666
actactccag caagatgaac tggctcagcg ccggccccca catgatctca cgagttaacg 24726
gcatccgagc ccaccgaaac cagattctct tagaacaggc ggcaatcacc gccacacccc 24786
ggcgccaact caacccgcct agttggcccg ccgcccaggt gtatcaggaa aatccccgcc 24846
cgaccacagt cctcctgcca cgcgacgcgg aggccgaagt cctcatgact aactctgggg 24906
tacaattagc gggcgggtcc aggtacgcca ggtacagagg tcgggccgct ccttactctc 24966
ccgggagtat aaagagggtg atcattcgag gccgaggtat ccagctcaac gacgagacgg 25026
tgagctcctc aaccggtctc agacctgacg gagtcttcca gctcggagga gcgggccgct 25086
cttccttcac cactcgccag gcctacctga ccctgcagag ctcttcctcg cagccgcgct 25146
ccgggggaat cggcactctc cagttcgtgg aagagttcgt tccctccgtc tacttcaacc 25206
ccttctccgg ctcgcctgga cgctacccgg acgccttcat tcccaacttt gacgcagtga 25266
gtgaatccgt ggacggctac gactgatgac agatggtgcg gccgtgagag ctcggctgcg 25326
acatctgcat cactgccgtc agcctcgctg ctacgctcgg gaggcgatcg tcttcagcta 25386
ctttgagctg ccggacgagc accctcaggg tccggctcac gggttgaaac tcgagatcga 25446
gaacgcgctc gagtctcgcc tcatcgacac cttcaccgcc cgacctctcc tggtagaaat 25506
ccaacggggg atcactacca tcaccctgtt ctgcatctgc cccacgcccg gattacatga 25566
agatctgtgt tgtcatcttt gcgctcagtt taataaaaac tgaacttttt gccgcacctt 25626
caacgccatc tgtgatttct acaacaaaaa gttcttctgg caaaggtaca caaactgtat 25686
tttattctaa ttctacctca tctatcgtgc tgaactgcgc ctgcactaac gaacttatcc 25746
agtggattgc aaacggtagt gtgtgcaagt acttttgggg gaacgatata gttagtagaa 25806
ataacagcct ttgcgagcac tgcaactcct ccacactaat cctttatccc ccatttgtta 25866
ctggatggta tatgtgcgtt ggctccggtt taaatcctag ttgctttcat aagtggtttc 25926
tacaaaaaga gacccttccc aacaattctg tttctttttt cgccctatcc tactgctgtt 25986
ctccctctgg ttactctttc aaacctctaa ttggtatttt agctttgata ctcataatct 26046
ttattaactt tataataatt aacaacttac agtaaacatg cttgttctac tgctcgccac 26106
atctttcgct ctctctcacg ccagaacaag tattgttggc gcaggttaca atgcaactct 26166
tcaatctgct tacatgccag attccgacca gataccccat attacgtggt acttacaaac 26226
ctccaaacct aattcttcat tttatgaagg aaacaaactc tgcgatgact ccgacaacag 26286
aacgcacaca tttccccacc cttcactaca attcgaatgc gtaaacaaaa gcttgaagct 26346
ttacaactta aagccttcag attctggctt gtaccatgct gtagttgaaa aaagtaattt 26406
agaagtccac agtgattaca ttgaattgac ggttgtggac ctgccacctc caaaatgtga 26466
ggtttcctcc tcttaccttg aagttcaagg cgtggatgcc tactgcctca tacacattaa 26526
ctgcagcaac tctaaatatc cagctagaat ttactataat ggacaggaaa gtaatctttt 26586
ttattattta acaacaagcg ctggtaacgg taaacagtta cctgactatt ttactgctgt 26646
tgttgaattt tccacctaca gagaaacgta tgccaagcgg ccttacaatt tctcataccc 26706
gtttaacgac ctttgcaatg aaatacaagc gctcgaaact ggaactgatt ttactccaat 26766
tttcattgct gccattgttg taagcttaat taccattatt gtcagcctag cattttactg 26826
cttttacaag cccaaaaacc ctaagtttga aaaacttaaa ctaaaacctg tcattcaaca 26886
agtgtgattt tgttttccag catggtagct gcatttctac ttctcctctg tctacccatc 26946
attttcgtct cttcaacttt cgccgcagtt tcccacctgg aaccagagtg cctaccgcct 27006
tttgacgtgt atctgattct cacctttgtt tgttgtatat ccatttgcag tatagcctgc 27066
ttttttataa caatctttca agccgccgac tatttttacg tgcgaattgc ttactttaga 27126
caccatcctg aatacagaaa tcaaaacgtt gcctccttac tttgtttggc atgattaagt 27186
tattgctgat acttaattat ttacccctaa tcaactgtaa ttgtccattc accaaaccct 27246
ggtcattcta cacctgttat gataaaatcc ccgacactcc tgttgcttgg ctttacgcag 27306
ccaccgccgc tttggtattt atatctactt gccttggagt aaaattgtat tttattttac 27366
acactgggtg gctacatccc agagaagatt tacctagata tcctcttgta aacgcttttc 27426
aattacagcc tctgcctcct cctgatcttc ttcctcgagc tccctctatt gtgagctact 27486
ttcaactcac cggtggagat gactgactct caggacatta atattagtgt ggaaagaata 27546
gctgctcagc gtcagcgaga aacgcgagtg ttggaatacc tggaactaca gcaacttaaa 27606
gagtcccact ggtgtgagaa aggagtgctg tgccatgtta agcaggcagc cctttcctac 27666
gatgtcagcg ttcagggaca tgaactgtct tacactttgc ctttgcagaa acaaaccttc 27726
tgcaccatga tgggctctac ctccatcaca atcacccaac aagccgggcc tgtagagggg 27786
gctatcctct gtcactgtca cgcacctgat tgcatgtcca aactaatcaa aactctctgt 27846
gctttaggtg atatttttaa ggtgtaaatc aataataaac ttaccttaaa tttgacaaca 27906
aatttctggt gacatcattc agcagcacca ctttaccctc ttcccagctc tcgtatggga 27966
tgcgatagtg ggtggcaaac ttcctccaaa ccctaaaaga aatattggta tccacttcct 28026
tgtcctcacc cacaattttc atcttttcat ag atg aaa aga acc aga gtt gat 28079
Met Lys Arg Thr Arg Val Asp
1435 1440
gaa gac ttc aac ccc gtc tac ccc tat gac acc aca acc act cct 28124
Glu Asp Phe Asn Pro Val Tyr Pro Tyr Asp Thr Thr Thr Thr Pro
1445 1450 1455
gca gtt ccc ttt ata tca ccc ccc ttt gta aac agc gat ggt ctt 28169
Ala Val Pro Phe Ile Ser Pro Pro Phe Val Asn Ser Asp Gly Leu
1460 1465 1470
cag gaa aac ccc cca ggt gtt tta agt ctg cga ata gct aaa ccc 28214
Gln Glu Asn Pro Pro Gly Val Leu Ser Leu Arg Ile Ala Lys Pro
1475 1480 1485
cta tat ttc gac atg gag aga aaa cta gcc ctt tca ctt gga aga 28259
Leu Tyr Phe Asp Met Glu Arg Lys Leu Ala Leu Ser Leu Gly Arg
1490 1495 1500
ggg ttg aca att acc gcc gcc gga caa tta gaa agt acg cag agc 28304
Gly Leu Thr Ile Thr Ala Ala Gly Gln Leu Glu Ser Thr Gln Ser
1505 1510 1515
gta caa acc aac cca ccg ttg ata att acc aac aac aac aca ctg 28349
Val Gln Thr Asn Pro Pro Leu Ile Ile Thr Asn Asn Asn Thr Leu
1520 1525 1530
acc cta cgt cat tct ccc ccc tta aac cta act gac aat agc tta 28394
Thr Leu Arg His Ser Pro Pro Leu Asn Leu Thr Asp Asn Ser Leu
1535 1540 1545
gtg cta ggc tac tcg agt cct ctc cgc gtc aca gac aac aaa ctt 28439
Val Leu Gly Tyr Ser Ser Pro Leu Arg Val Thr Asp Asn Lys Leu
1550 1555 1560
aca ttt aac ttc aca tca cca ctc cgt tat gaa aat gaa aac ctt 28484
Thr Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Glu Asn Leu
1565 1570 1575
act ttt aac tat aca gag cct ctt aaa ctt ata aat aac agc ctt 28529
Thr Phe Asn Tyr Thr Glu Pro Leu Lys Leu Ile Asn Asn Ser Leu
1580 1585 1590
gcc att gac atc aat tcc tca aaa ggc ctt agt agc gtc gga ggc 28574
Ala Ile Asp Ile Asn Ser Ser Lys Gly Leu Ser Ser Val Gly Gly
1595 1600 1605
tca cta gct gta aac ctg agt tca gac tta aag ttt gac agc aac 28619
Ser Leu Ala Val Asn Leu Ser Ser Asp Leu Lys Phe Asp Ser Asn
1610 1615 1620
gga tcc ata gct ttt ggc ata caa acc ctg tgg acc gct ccg acc 28664
Gly Ser Ile Ala Phe Gly Ile Gln Thr Leu Trp Thr Ala Pro Thr
1625 1630 1635
tcg act ggc aac tgc acc gtc tac agc gag ggc gat tcc cta ctt 28709
Ser Thr Gly Asn Cys Thr Val Tyr Ser Glu Gly Asp Ser Leu Leu
1640 1645 1650
agt ctc tgt tta acc aaa tgc gga gct cac gtc tta gga agt gta 28754
Ser Leu Cys Leu Thr Lys Cys Gly Ala His Val Leu Gly Ser Val
1655 1660 1665
agt tta acc ggt tta aca gga acc ata acc caa atg act gat att 28799
Ser Leu Thr Gly Leu Thr Gly Thr Ile Thr Gln Met Thr Asp Ile
1670 1675 1680
tct gtc acc att caa ttt aca ttt gac aac aat ggt aag cta cta 28844
Ser Val Thr Ile Gln Phe Thr Phe Asp Asn Asn Gly Lys Leu Leu
1685 1690 1695
agc tct cca ctt ata aac aac gcc ttt agt att cga cag aat gac 28889
Ser Ser Pro Leu Ile Asn Asn Ala Phe Ser Ile Arg Gln Asn Asp
1700 1705 1710
agt acg gcc tca aac cct acc tac aac gcc ctg gcg ttt atg cct 28934
Ser Thr Ala Ser Asn Pro Thr Tyr Asn Ala Leu Ala Phe Met Pro
1715 1720 1725
aac agt acc ata tat gca aga ggg gga ggt ggt gaa cca cga aac 28979
Asn Ser Thr Ile Tyr Ala Arg Gly Gly Gly Gly Glu Pro Arg Asn
1730 1735 1740
aac tac tac gtc caa acg tat ctt agg gga aat gtt caa aaa cca 29024
Asn Tyr Tyr Val Gln Thr Tyr Leu Arg Gly Asn Val Gln Lys Pro
1745 1750 1755
atc att ctt act gta acc tac aac tca gtc gcc aca gga tat tcc 29069
Ile Ile Leu Thr Val Thr Tyr Asn Ser Val Ala Thr Gly Tyr Ser
1760 1765 1770
tta tct ttt aag tgg act gct ctt gca cgt gaa aag ttt gca acc 29114
Leu Ser Phe Lys Trp Thr Ala Leu Ala Arg Glu Lys Phe Ala Thr
1775 1780 1785
cca aca acc tcg ttt tgc tac att aca gaa caa taa aaccgtgtac 29160
Pro Thr Thr Ser Phe Cys Tyr Ile Thr Glu Gln
1790 1795
cccaccgttt cgtttttttc ag atg aaa cgg gcg aga gtt gat gaa gac 29209
Met Lys Arg Ala Arg Val Asp Glu Asp
1800 1805
ttc aac cca gtg tac cct tat gac ccc cca cat gct cct gtt atg 29254
Phe Asn Pro Val Tyr Pro Tyr Asp Pro Pro His Ala Pro Val Met
1810 1815 1820
ccc ttc att act cca cct ttt acc tcc tcg gat ggg ttg cag gaa 29299
Pro Phe Ile Thr Pro Pro Phe Thr Ser Ser Asp Gly Leu Gln Glu
1825 1830 1835
aaa cca ctt gga gtg tta agt tta aac tac aga gat ccc att act 29344
Lys Pro Leu Gly Val Leu Ser Leu Asn Tyr Arg Asp Pro Ile Thr
1840 1845 1850
acg caa aat gag tct ctt aca att aaa cta gga aac ggc ctc act 29389
Thr Gln Asn Glu Ser Leu Thr Ile Lys Leu Gly Asn Gly Leu Thr
1855 1860 1865
cta gac aac cag gga caa cta aca tca acc gct ggc gaa gta gaa 29434
Leu Asp Asn Gln Gly Gln Leu Thr Ser Thr Ala Gly Glu Val Glu
1870 1875 1880
cct cca ctc act aac gct aac aac aaa ctt gca ctg gtc tat agc 29479
Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu Val Tyr Ser
1885 1890 1895
gat cct tta gca gta aag cgc aac agc cta acc tta tcg cac acc 29524
Asp Pro Leu Ala Val Lys Arg Asn Ser Leu Thr Leu Ser His Thr
1900 1905 1910
gct ccc ctt gtt att gct gat aac tct tta gca ttg caa gtt tca 29569
Ala Pro Leu Val Ile Ala Asp Asn Ser Leu Ala Leu Gln Val Ser
1915 1920 1925
gag cct att ttt ata aat gac aag gac aaa cta gcc ctg caa aca 29614
Glu Pro Ile Phe Ile Asn Asp Lys Asp Lys Leu Ala Leu Gln Thr
1930 1935 1940
gcc gcg ccc ctt gta act aac gct ggc acc ctt cgc tta caa agc 29659
Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gln Ser
1945 1950 1955
gcc gcc cct tta ggc att gca gac caa acc cta aaa ctc ctg ttt 29704
Ala Ala Pro Leu Gly Ile Ala Asp Gln Thr Leu Lys Leu Leu Phe
1960 1965 1970
acc aac cct ttg tac ttg cag aat aac ttt ctc acg tta gcc att 29749
Thr Asn Pro Leu Tyr Leu Gln Asn Asn Phe Leu Thr Leu Ala Ile
1975 1980 1985
gaa cga ccc ctt gcc att acc aat act gga aag ctg gct cta cag 29794
Glu Arg Pro Leu Ala Ile Thr Asn Thr Gly Lys Leu Ala Leu Gln
1990 1995 2000
ctc tcc cca ccg cta caa aca gca gac aca ggc ttg act ttg caa 29839
Leu Ser Pro Pro Leu Gln Thr Ala Asp Thr Gly Leu Thr Leu Gln
2005 2010 2015
acc aac gtg cca tta act gta agc aac ggg acc cta ggc tta gcc 29884
Thr Asn Val Pro Leu Thr Val Ser Asn Gly Thr Leu Gly Leu Ala
2020 2025 2030
ata aag cgc cca ctt att att cag gac aac aac ttg ttt ttg gac 29929
Ile Lys Arg Pro Leu Ile Ile Gln Asp Asn Asn Leu Phe Leu Asp
2035 2040 2045
ttc aga gct ccc ctg cgt ctt ttc aac agc gac cca gta cta ggg 29974
Phe Arg Ala Pro Leu Arg Leu Phe Asn Ser Asp Pro Val Leu Gly
2050 2055 2060
ctt aac ttt tac acc cct ctt gcg gta cgc gat gag gcg ctc act 30019
Leu Asn Phe Tyr Thr Pro Leu Ala Val Arg Asp Glu Ala Leu Thr
2065 2070 2075
gtt aac aca ggc cgc ggc ctc aca gtg agt tac gat ggt tta att 30064
Val Asn Thr Gly Arg Gly Leu Thr Val Ser Tyr Asp Gly Leu Ile
2080 2085 2090
tta aat ctt ggt aag gat ctt cgc ttt gac aac aac acc gtt tct 30109
Leu Asn Leu Gly Lys Asp Leu Arg Phe Asp Asn Asn Thr Val Ser
2095 2100 2105
gtc gct ctt agt gct gct ttg cct tta caa tac act gat cag ctt 30154
Val Ala Leu Ser Ala Ala Leu Pro Leu Gln Tyr Thr Asp Gln Leu
2110 2115 2120
cgc ctt aac gtg ggc gct ggg ctg cgt tac aat cca gtg agt aag 30199
Arg Leu Asn Val Gly Ala Gly Leu Arg Tyr Asn Pro Val Ser Lys
2125 2130 2135
aaa ttg gac gtg aac ccc aat caa aac aag ggt tta acc tgg gaa 30244
Lys Leu Asp Val Asn Pro Asn Gln Asn Lys Gly Leu Thr Trp Glu
2140 2145 2150
aat gac tac ctc att gta aag cta gga aat gga tta ggt ttt gat 30289
Asn Asp Tyr Leu Ile Val Lys Leu Gly Asn Gly Leu Gly Phe Asp
2155 2160 2165
ggc gat gga aac ata gct gtt tct cct caa gtt aca tcg cct gac 30334
Gly Asp Gly Asn Ile Ala Val Ser Pro Gln Val Thr Ser Pro Asp
2170 2175 2180
acc tta tgg acc act gcc gac cca tcc ccc aat tgt tcc atc tac 30379
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser Ile Tyr
2185 2190 2195
act gat tta gat gcc aaa atg tgg ctc tcg ttg gta aaa caa ggg 30424
Thr Asp Leu Asp Ala Lys Met Trp Leu Ser Leu Val Lys Gln Gly
2200 2205 2210
ggt gtg gtt cac ggt tct gtt gct tta aaa gca ttg aaa gga acc 30469
Gly Val Val His Gly Ser Val Ala Leu Lys Ala Leu Lys Gly Thr
2215 2220 2225
cta ttg agt cct acg gaa agc gcc att gtt att ata cta cat ttt 30514
Leu Leu Ser Pro Thr Glu Ser Ala Ile Val Ile Ile Leu His Phe
2230 2235 2240
gac aat tat gga gtg cga att ctc aat tat ccc act ttg ggc act 30559
Asp Asn Tyr Gly Val Arg Ile Leu Asn Tyr Pro Thr Leu Gly Thr
2245 2250 2255
caa ggc acg ttg gga aat aat gca act tgg ggt tat agg cag gga 30604
Gln Gly Thr Leu Gly Asn Asn Ala Thr Trp Gly Tyr Arg Gln Gly
2260 2265 2270
gaa tct gca gac act aat gta ctc aat gca cta gca ttt atg ccc 30649
Glu Ser Ala Asp Thr Asn Val Leu Asn Ala Leu Ala Phe Met Pro
2275 2280 2285
agt tca aaa agg tac cca aga ggg cgt gga agc gaa gtt cag aat 30694
Ser Ser Lys Arg Tyr Pro Arg Gly Arg Gly Ser Glu Val Gln Asn
2290 2295 2300
caa act gtg ggc tac act tgt ata cag ggt gac ttt tct atg ccc 30739
Gln Thr Val Gly Tyr Thr Cys Ile Gln Gly Asp Phe Ser Met Pro
2305 2310 2315
gta ccg tac caa ata cag tac aac tat gga cca act ggc tac tcc 30784
Val Pro Tyr Gln Ile Gln Tyr Asn Tyr Gly Pro Thr Gly Tyr Ser
2320 2325 2330
ttt aaa ttt att tgg aga act gtt tca aga caa cca ttt gac atc 30829
Phe Lys Phe Ile Trp Arg Thr Val Ser Arg Gln Pro Phe Asp Ile
2335 2340 2345
cca tgc tgt ttt ttc tct tac att acg gaa gaa taa aacaactttt 30875
Pro Cys Cys Phe Phe Ser Tyr Ile Thr Glu Glu
2350 2355
tctttttatt ttctttttat tttacacgca cagtaaggct tcctccaccc ttccatctca 30935
cagcatacac cagcctctcc cccttcatgg cagtaaactg ttgtgagtca gtccggtatt 30995
tgggagttaa gatccaaaca gtctctttgg tgatgaaaca tggatccgtg atggacacaa 31055
atccctggga caggttctcc aacgtttcgg taaaaaactg catgccgccc tacaaaacaa 31115
acaggttcag gctctccacg ggttatctcc ccgatcaaac tcagacagag taaaggtgcg 31175
atgatgttcc actaaaccac gcaggtggcg ctgtctgaac ctctcggtgc gactcctgtg 31235
aggctggtaa gaagttagat tgtccagcag cctcacagca tggatcatca gtctacgagt 31295
gcgtctggcg cagcagcgca tctgaatctc actgagattc cggcaagaat cgcacaccat 31355
cacaatcagg ttgttcatga tcccatagct gaacacgctc cagccaaagc tcattcgctc 31415
caacagcgcc accgcgtgtc cgtccaacct tactttaaca taaatcaggt gtctgccgcg 31475
tacaaacatg ctacccgcat acagaacctc ccggggcaaa cccctgttca ccacctgcct 31535
gtaccaggga aacctcacat ttatcaggga gccatagata gccattttaa accaattagc 31595
taacaccgcc ccaccagctc tacactgaag agaaccggga gagttacaat gacagtgaat 31655
aatccatctc tcataacccc taatggtctg atggaaatcc agatctaacg tggcacagca 31715
gatacacact ttcatataca ttttcatcac atgtttttcc caggccgtta aaatacaatc 31775
ccaatacacg ggccactcct gcagtacaat aaagctaata caagatggta tactcctcac 31835
ctcactaaca ttgtgcatgt tcatattttc acattctaag taccgagagt tctcctctac 31895
aacagcactg ccgcggtcct cacaaggtgg tagctggtga cgattgtaag gagccagtct 31955
gcagcgatac cgtctgtcgc gttgcatcgt agaccaggga ccgacgcact tcctcgtact 32015
tgtagtagca gaaccacgtc cgctgccagc acgtctccaa gtaacgccgg tccctgcgtc 32075
gctcacgctc cctcctcaac gcaaagtgca accactcttg taatccacac agatccctct 32135
cggcctccgg ggcgatgcac acctcaaacc tacagatgtc tcggtacagt tccaaacacg 32195
tagtgagggc gagttccaac caagacagac agcctgatct atcccgacac actggaggtg 32255
gaggaagaca cggaagaggc atgttattcc aagcgattca ccaacgggtc gaaatgaaga 32315
tcccgaagat gacaacggtc gcctccggag ccctgatgga atttaacagc cagatcaaac 32375
attatgcgat tttccaggct atcaatcgcg gcctccaaaa gagcctggac ccgcacttcc 32435
acaaacacca gcaaagcaaa agcgttatta tcaaactctt cgatcatcaa gctgcaggac 32495
tgtacaatgc ccaagtaatt ttcatttctc cactcgcgaa tgatgtcgcg gcaaatagtc 32555
tgaaggttca tgccgtgcat attaaaaagc tccgaaaggg cgccctctat agccatgcgt 32615
agacacacca tcatgactgc aagatatcgg gctcctgaga cacctgcagc agatttaaca 32675
gacccaggtc aggttgctct ccgcgatcgc gaatctccat ccgcaaagtc atttgcaaat 32735
aattaaatag atctgcgccg actaaatctg ttaactccgc gctaggaact aaatcaggtg 32795
tggctacgca gcacaaaagt tccagggatg gcgccaaact cactagaacc gctcccgagt 32855
agcaaaactg atgaatggga gtaacacagt gtaaaatgtt cagccaaaaa tcactaagct 32915
gctcctttaa aaagtccagt acttctatat tcagttcgtg caagtactga agcaactgtg 32975
cgggaatatg cacagcaaaa aaaatagggc ggctcagata catgttgacc taaaataaaa 33035
agaatcatta aactaaagaa gcctggcgaa cggtgggata tatgacacgc tccagcagca 33095
ggcaagcaac cggctgtccc cgggaaccgc ggtaaaattc atccgaatga ttaaaaagaa 33155
caacagagac ttcccaccat gtactcggtt ggatctcctg agcacagagc aatacccccc 33215
tcacattcat atccgctaca gaaaaaaaac gtcccagata cccagcggga atatccaacg 33275
acagctgcaa agacagcaaa acaatccctc tgggagcaat cacaaaatcc tccggtgaaa 33335
aaagcacata catattagaa taaccctgtt gctggggcaa aaaggcccgt cgtcccagca 33395
aatgcacata aatatgttca tcagccattg ccccgtctta ccgcgtaaac agccacgaaa 33455
aaatcgagct aaaatccacc caacagccta tagctatata tacactccac ccaatgacgc 33515
taataccgca ccacccacga ccaaagttca cccacaccca caaaacccgc gaaaatccag 33575
cgccgtcagc acttccgcaa tttcagtctc acaacgtcac ttccgcgcgc cttttcactt 33635
tcccacacac gcccttcgcc cgcccgccct cgcgccaccc cgcgtcaccc cacgtcaccg 33695
cacgtcaccc cggccccgcc tcgctcctcc ccgctcatta tcatattggc acgtttccag 33755
aataaggtat attattgatg cagcaaaaca atccctctgg gagcaatcac aaaatcctcc 33815
ggtgaaaaaa gcacatacat attagaataa ccctgttgct ggggcaaaaa ggcccgtcgt 33875
cccagcaaat gcacataaat atgttcatca gccattgccc cgtcttaccg cgtaaacagc 33935
cacgaaaaaa tcgagctaaa atccacccaa cagcctatag ctatatatac actccaccca 33995
atgacgctaa taccgcacca cccacgacca aagttcaccc acacccacaa aacccgcgaa 34055
aatccagcgc cgtcagcact tccgcaattt cagtctcaca acgtcacttc cgcgcgcctt 34115
ttcactttcc cacacacgcc cttcgcccgc ccgccctcgc gccaccccgc gtcaccccac 34175
gtcaccgcac gtcaccccgg ccccgcctcg ctcctccccg ctcattatca tattggcacg 34235
tttccagaat aaggtatatt attgatgca 34264
<210>25
<211>503
<212>PRT
<213>猿猴腺病毒SV-1
<400>25
Met Arg Arg Ala Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro
1 5 10 15
Pro Ser Tyr Glu Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu
20 25 30
Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg
35 40 45
Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys
50 55 60
Val Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr
65 70 75 80
Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp
85 90 95
Phe Thr Pro Thr Glu Ala Gly Thr Gln Thr Ile Asn Phe Asp Glu Arg
100 105 110
Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Leu His Thr Asn Met Pro
115 120 125
Asn Ile Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Arg Leu Met
130 135 140
Val Lys Lys Ala Glu Asn Gln Pro Pro Glu Tyr Glu Trp Phe Glu Phe
145 150 155 160
Thr Ile Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr Ile Asp Leu Met
165 170 175
Asn Asn Ala Ile Val Asp Asn Tyr Leu Gln Val Gly Arg Gln Asn Gly
180 185 190
Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg
195 200 205
Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val Tyr Thr
210 215 220
Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val
225 230 235 240
Asp Phe Thr Gln Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg
245 250 255
Arg Pro Phe Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly
260 265 270
Gly Asn Ile Pro Gly Leu Leu Asp Val Pro Ala Tyr Glu Glu Ser Val
275 280 285
Lys Gln Ala Glu Ala Gln Gly Arg Glu Ile Arg Gly Asp Thr Phe Ala
290 295 300
Thr Glu Pro His Glu Leu Val Ile Lys Pro Leu Glu Gln Asp Ser Lys
305 310 315 320
Lys Arg Ser Tyr Asn Ile Ile Ser Gly Thr Met Asn Thr Leu Tyr Arg
325 330 335
Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg
340 345 350
Ser Trp Thr Ile Leu Thr Thr Thr Asp Val Thr Cys Gly Ser Gln Gln
355 360 365
Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg
370 375 380
Pro Ser Thr Gln Val Ser Asn Phe Pro Val Val Gly Thr Glu Leu Leu
385 390 395 400
Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gln Ala Val Tyr Ser Gln
405 410 415
Leu Ile Arg Gln Ser Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro
420 425 430
Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val
435 440 445
Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg
450 455 460
Ser Ser Ile Ser Gly Val Gln Arg Val Thr Ile Thr Asp Ala Arg Arg
465 470 475 480
Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Lys
485 490 495
Val Leu Ser Ser Arg Thr Phe
500
<210>26
<211>931
<212>PRT
<213>猿猴腺病毒SV-1
<400>26
Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Pro Ala Glu Trp Thr Asn Ser Asp Ser Lys Val Lys Val
130 135 140
Arg Ala Gln Ala Pro Phe Val Ser Ser Tyr Gly Ala Thr Ala Ile Thr
145 150 155 160
Lys Glu Gly Ile Gln Val Gly Val Thr Leu Thr Asp Ser Gly Ser Thr
165 170 175
Pro Gln Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Ile Gly Glu
180 185 190
Leu Gln Trp Asn Ser Asp Val Gly Thr Asp Asp Lys Ile Ala Gly Arg
195 200 205
Val Leu Lys Lys Thr Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr Ala
210 215 220
Arg Pro Thr Asn Glu Lys Gly Gly Gln Ala Thr Pro Ser Ala Ser Gln
225 230 235 240
Asp Val Gln Asn Pro Glu Leu Gln Phe Phe Ala Ser Thr Asn Val Ala
245 250 255
Asn Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ser Ile Glu Ala
260 265 270
Pro Asp Thr His Leu Val Phe Lys Pro Thr Val Thr Glu Gly Ile Thr
275 280 285
Ser Ser Glu Ala Leu Leu Thr Gln Gln Ala Ala Pro Asn Arg Pro Asn
290 295 300
Tyr Ile Ala Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser
305 310 315 320
Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala
325 330 335
Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Met
340 345 350
Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gln
355 360 365
Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly
370 375 380
Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met Ala
385 390 395 400
Val Thr Asp Thr Tyr Ser Pro Ile Lys Val Asn Gly Gly Gly Asn Gly
405 410 415
Trp Glu Ala Asn Asn Gly Val Phe Thr Glu Arg Gly Val Glu Ile Gly
420 425 430
Ser Gly Asn Met Phe Ala Met Glu Ile Asn Leu Gln Ala Asn Leu Trp
435 440 445
Arg Ser Phe Leu Tyr Ser Asn Ile Gly Leu Tyr Leu Pro Asp Ser Leu
450 455 460
Lys Ile Thr Pro Asp Asn Ile Thr Leu Pro Glu Asn Lys Asn Thr Tyr
465 470 475 480
Gln Tyr Met Asn Gly Arg Val Thr Pro Pro Gly Leu Val Asp Thr Tyr
485 490 495
Val Asn Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser Ile Asn
500 505 510
Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu
515 520 525
Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys
530 535 540
Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr
545 550 555 560
Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu
565 570 575
Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Ile Arg Phe Asp Ser Ile
580 585 590
Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr
595 600 605
Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp
610 615 620
Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr
625 630 635 640
Ser Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly
645 650 655
Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser
660 665 670
Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp
675 680 685
Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Met Phe
690 695 700
Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn
705 710 715 720
Glu Phe Glu Ile Lys Arg Ser Val Asp Gly Glu Gly Tyr Asn Val Ala
725 730 735
Gln Ser Asn Met Thr Lys Asp Trp Phe Leu Ile Gln Met Leu Ser His
740 745 750
Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Asn Tyr Lys Asp
755 760 765
Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Ile
770 775 780
Val Asp Ser Thr Ala Tyr Thr Asn Tyr Gln Asp Val Lys Leu Pro Tyr
785 790 795 800
Gln His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr Met Arg
805 810 815
Glu Gly Gln Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Ala
820 825 830
Thr Ala Val Pro Ser Leu Thr Gln Lys Lys Phe Leu Cys Asp Arg Val
835 840 845
Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ser Leu
850 855 860
Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu
865 870 875 880
Asp Met Thr Phe Glu Val Asp Pro Met Asp Glu Pro Thr Leu Leu Tyr
885 890 895
Val Leu Phe Glu Val Phe Asp Val Val Arg Ile His Gln Pro His Arg
900 905 910
Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn
915 920 925
Ala Thr Thr
930
<210>27
<211>363
<212>PRT
<213>猿猴腺病毒SV-1
<400>27
Met Lys Arg Thr Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr
1 5 10 15
Asp Thr Thr Thr Thr Pro Ala Val Pro Phe Ile Ser Pro Pro Phe Val
20 25 30
Asn Ser Asp Gly Leu Gln Glu Asn Pro Pro Gly Val Leu Ser Leu Arg
35 40 45
Ile Ala Lys Pro Leu Tyr Phe Asp Met Glu Arg Lys Leu Ala Leu Ser
50 55 60
Leu Gly Arg Gly Leu Thr Ile Thr Ala Ala Gly Gln Leu Glu Ser Thr
65 70 75 80
Gln Ser Val Gln Thr Asn Pro Pro Leu Ile Ile Thr Asn Asn Asn Thr
85 90 95
Leu Thr Leu Arg His Ser Pro Pro Leu Asn Leu Thr Asp Asn Ser Leu
100 105 110
Val Leu Gly Tyr Ser Ser Pro Leu Arg Val Thr Asp Asn Lys Leu Thr
115 120 125
Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Glu Asn Leu Thr Phe
130 135 140
Asn Tyr Thr Glu Pro Leu Lys Leu Ile Asn Asn Ser Leu Ala Ile Asp
145 150 155 160
Ile Asn Ser Ser Lys Gly Leu Ser Ser Val Gly Gly Ser Leu Ala Val
165 170 175
Asn Leu Ser Ser Asp Leu Lys Phe Asp Ser Asn Gly Ser Ile Ala Phe
180 185 190
Gly Ile Gln Thr Leu Trp Thr Ala Pro Thr Ser Thr Gly Asn Cys Thr
195 200 205
Val Tyr Ser Glu Gly Asp Ser Leu Leu Ser Leu Cys Leu Thr Lys Cys
210 215 220
Gly Ala His Val Leu Gly Ser Val Ser Leu Thr Gly Leu Thr Gly Thr
225 230 235 240
Ile Thr Gln Met Thr Asp Ile Ser Val Thr Ile Gln Phe Thr Phe Asp
245 250 255
Asn Asn Gly Lys Leu Leu Ser Ser Pro Leu Ile Asn Asn Ala Phe Ser
260 265 270
Ile Arg Gln Asn Asp Ser Thr Ala Ser Asn Pro Thr Tyr Asn Ala Leu
275 280 285
Ala Phe Met Pro Asn Ser Thr Ile Tyr Ala Arg Gly Gly Gly Gly Glu
290 295 300
Pro Arg Asn Asn Tyr Tyr Val Gln Thr Tyr Leu Arg Gly Asn Val Gln
305 310 315 320
Lys Pro Ile Ile Leu Thr Val Thr Tyr Asn Ser Val Ala Thr Gly Tyr
325 330 335
Ser Leu Ser Phe Lys Trp Thr Ala Leu Ala Arg Glu Lys Phe Ala Thr
340 345 350
Pro Thr Thr Ser Phe Cys Tyr Ile Thr Glu Gln
355 360
<210>28
<211>560
<212>PRT
<213>猿猴腺病毒SV-1
<400>28
Met Lys Arg Ala Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr
1 5 10 15
Asp Pro Pro His Ala Pro Val Met Pro Phe Ile Thr Pro Pro Phe Thr
20 25 30
Ser Ser Asp Gly Leu Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Asn
35 40 45
Tyr Arg Asp Pro Ile Thr Thr Gln Asn Glu Ser Leu Thr Ile Lys Leu
50 55 60
Gly Asn Gly Leu Thr Leu Asp Asn Gln Gly Gln Leu Thr Ser Thr Ala
65 70 75 80
Gly Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu
85 90 95
Val Tyr Ser Asp Pro Leu Ala Val Lys Arg Asn Ser Leu Thr Leu Ser
100 105 110
His Thr Ala Pro Leu Val Ile Ala Asp Asn Ser Leu Ala Leu Gln Val
115 120 125
Ser Glu Pro Ile Phe Ile Asn Asp Lys Asp Lys Leu Ala Leu Gln Thr
130 135 140
Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gln Ser Ala
145 150 155 160
Ala Pro Leu Gly Ile Ala Asp Gln Thr Leu Lys Leu Leu Phe Thr Asn
165 170 175
Pro Leu Tyr Leu Gln Asn Asn Phe Leu Thr Leu Ala Ile Glu Arg Pro
180 185 190
Leu Ala Ile Thr Asn Thr Gly Lys Leu Ala Leu Gln Leu Ser Pro Pro
195 200 205
Leu Gln Thr Ala Asp Thr Gly Leu Thr Leu Gln Thr Asn Val Pro Leu
210 215 220
Thr Val Ser Asn Gly Thr Leu Gly Leu Ala Ile Lys Arg Pro Leu Ile
225 230 235 240
Ile Gln Asp Asn Asn Leu Phe Leu Asp Phe Arg Ala Pro Leu Arg Leu
245 250 255
Phe Asn Ser Asp Pro Val Leu Gly Leu Asn Phe Tyr Thr Pro Leu Ala
260 265 270
Val Arg Asp Glu Ala Leu Thr Val Asn Thr Gly Arg Gly Leu Thr Val
275 280 285
Ser Tyr Asp Gly Leu Ile Leu Asn Leu Gly Lys Asp Leu Arg Phe Asp
290 295 300
Asn Asn Thr Val Ser Val Ala Leu Ser Ala Ala Leu Pro Leu Gln Tyr
305 310 315 320
Thr Asp Gln Leu Arg Leu Asn Val Gly Ala Gly Leu Arg Tyr Asn Pro
325 330 335
Val Ser Lys Lys Leu Asp Val Asn Pro Asn Gln Asn Lys Gly Leu Thr
340 345 350
Trp Glu Asn Asp Tyr Leu Ile Val Lys Leu Gly Asn Gly Leu Gly Phe
355 360 365
Asp Gly Asp Gly Asn Ile Ala Val Ser Pro Gln Val Thr Ser Pro Asp
370 375 380
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser Ile Tyr Thr
385 390 395 400
Asp Leu Asp Ala Lys Met Trp Leu Ser Leu Val Lys Gln Gly Gly Val
405 410 415
Val His Gly Ser Val Ala Leu Lys Ala Leu Lys Gly Thr Leu Leu Ser
420 425 430
Pro Thr Glu Ser Ala Ile Val Ile Ile Leu His Phe Asp Asn Tyr Gly
435 440 445
Val Arg Ile Leu Asn Tyr Pro Thr Leu Gly Thr Gln Gly Thr Leu Gly
450 455 460
Asn Asn Ala Thr Trp Gly Tyr Arg Gln Gly Glu Ser Ala Asp Thr Asn
465 470 475 480
Val Leu Asn Ala Leu Ala Phe Met Pro Ser Ser Lys Arg Tyr Pro Arg
485 490 495
Gly Arg Gly Ser Glu Val Gln Asn Gln Thr Val Gly Tyr Thr Cys Ile
500 505 510
Gln Gly Asp Phe Ser Met Pro Val Pro Tyr Gln Ile Gln Tyr Asn Tyr
515 520 525
Gly Pro Thr Gly Tyr Ser Phe Lys Phe Ile Trp Arg Thr Val Ser Arg
530 535 540
Gln Pro Phe Asp Ile Pro Cys Cys Phe Phe Ser Tyr Ile Thr Glu Glu
545 550 555 560
<210>29
<211>31044
<212>DNA
<213>猿猴腺病毒SV-25
<220>
<221>CDS
<222>(12284)..(13801)
<223>五邻体
<220>
<221>CDS
<222>(16681)..(19446)
<223>六邻体
<220>
<221>CDS
<222>(25380)..(26423)
<223>纤维#2
<220>
<221>CDS
<222>(26457)..(28136)
<223>纤维#1
<400>29
catcatcaat aatatacctt attctggaaa cgtgccaata tgataatgag cggggaggag 60
cgaggcgggg ccggggtgac gtgcggtgac gcggggtggc gcgagggcgg ggcgaagggc 120
gcgggtgtgt gtgtgggagg cgcttagttt ttacgtatgc ggaaggaggt tttataccgg 180
aagatgggta atttgggcgt atacttgtaa gttttgtgta atttggcgcg aaaactgggt 240
aatgaggaag ttgaggttaa tatgtacttt ttatgactgg gcggaatttc tgctgatcag 300
cagtgaactt tgggcgctga cggggaggtt tcgctacgtg acagtaccac gagaaggctc 360
aaaggtccca tttattgtac tcttcagcgt tttcgctggg tatttaaacg ctgtcagatc 420
atcaagaggc cactcttgag tgctggcgag aagagttttc tcctccgtgc tgccacgatg 480
aggctggtcc ccgagatgta cggtgttttt agcgacgaga cggtgcgtaa ctcagatgac 540
ctgctgaatt cagacgcgct ggaaatttcc aattcgcctg tgctttcgcc gccgtcactt 600
cacgacctgt ttgtgttttg gctcaacgct tagcaacgtg ttatataggg tcaagaagga 660
gcaggagacg cagtttgcta ggctgttggc cgatactcct ggagtttttg tggctctgga 720
tctaggccat cactctcttt tccaagagaa aattatcaaa aacttaactt ttacgtctcc 780
tggtcgcacg gttgcttccg ctgcctttat tacctatatt ttggatcaat ggagcaacag 840
cgacagccac ctgtcgtggg agtacatgct ggattacatg tcgatggcgc tgtggagggc 900
catgctgcgg aggagggttt gcatttactt gcgggcgcag cctccgcggc tggaccgagt 960
ggaggaggag gacgagccgg gggagaccga gaacctgagg gccgggctgg accctccaac 1020
ggaggactag gtgctgagga tgatcccgaa gaggggacta gtggggctag gaagaagcaa 1080
aagactgagt ctgaacctcg aaactttttg aatgagttga ctgtgagttt gatgaatcgt 1140
cagcgtccgg agacaatttt ctggtctgaa ttggaggagg aattcaggag gggggaactg 1200
aacctgctat acaagtatgg gtttgaacag ttaaaaactc actggttgga gccgtgggag 1260
gattttgaaa ccgccttgga cacttttgct aaagtggctc tgcggccgga taaggtttac 1320
actatccgcc gcactgttaa cataaagaag agtgtttatg ttataggcca tggagctctg 1380
gtgcaggtgc aaaccgtcga ccgggtggcc tttagttgcg gtatgcaaaa tctgggcccc 1440
ggggtgatag gcttaaatgg tgtaacattt cacaatgtaa ggtttactgg tgaaagtttt 1500
aacggctctg tgtttgcaaa taacacacag ctgacgctcc acggcgttta cttttttaac 1560
tttaataaca catgtgtgga gtcgtggggc agggtgtctt tgaggggctg ctgttttcac 1620
ggctgctgga aggcggtggt gggaagactt aaaagtgtaa catctgtaaa aaaatgcgtg 1680
tttgagcggt gtgtgttggc tttaactgtg gagggctgtg gacgcattag gaataatgcg 1740
gcgtctgaga atggatgttt tcttttgcta aaaggcacgg ctagtattaa gcataacatg 1800
atatgcggca gcggtctgta cccttcacag ctgttaactt gcgcggatgg aaactgtcag 1860
accttgcgca ccgtgcacat agcgtcccac cagcgccgcg cctggccaac attcgagcac 1920
aatatgctta tgcgttgtgc cgtccacttg ggccctaggc gaggcgtgtt tgtgccttac 1980
cagtgtaact ttagccatac caagatttta ctagaacctg ataccttctc tcgagtgtgt 2040
ttcaatgggg tgtttgacat gtcaatggaa ctgtttaaag tgataagata tgatgaatcc 2100
aagtctcgtt gtcgcccatg tgaatgcgga gctaatcatc tgaggttgta tcctgtaacc 2160
ctaaacgtta ccgaggagct gaggacggat caccacatgt tgtcctgcct gcgcaccgac 2220
tatgaatcca gcgacgagga gtgaggtgag gggcggagcc acaaagggta taaaggggcg 2280
tgaggggtgg gtgtgatgat tcaaaatgag cgggacgacg gacggcaacg cgtttgaggg 2340
tggagtgttc agcccttatc tgacatctcg tcttccttcc tgggcaggag tgcgtcagaa 2400
tgtagtgggc tccaccgtgg acggacgacc ggtcgcccct gcaaattccg ccaccctcac 2460
ctatgccacc gtgggatcat cgttggacac tgccgcggca gctgccgctt ctgctgccgc 2520
ttctactgct cgcggcatgg cggctgattt tggactgtat aaccaactgg ccactgcagc 2580
tgtggcgtct cggtctctgg ttcaagaaga tgccctgaat gtgatcctga ctcgcctgga 2640
gatcatgtca cgtcgcttgg acgaactggc tgcgcagata tcccaagcta accccgatac 2700
cacttcagaa tcctaaaata aagacaaaca aatatgttga aaagtaaaat ggctttattt 2760
gttttttttg gctcggtagg ctcgggtcca cctgtctcgg tcgttaagaa ctttgtgtat 2820
gttttccaaa acacggtaca gatgggcttg gatgttcaag tacatgggca tgaggccatc 2880
tttggggtga agataggacc attgaagagc gtcatgctcc ggggtggtgt tgtaaattac 2940
ccagtcgtag cagggtttct gggcgtggaa ctggaagatg tcctttagga gtaggctgat 3000
ggccaagggc aggcccttag tgtaggtgtt tacaaagcgg ttaagctggg agggatgcat 3060
gcggggggag atgatatgca tcttggcttg gatcttgagg ttagctatgt taccacccag 3120
gtctctgcgg gggttcatgt tatgaaggac caccagcacg gtgtagccgg tgcatttggg 3180
gaacttgtca tgcagtttgg aggggaaggc gtggaagaat ttagagaccc ccttgtggcc 3240
ccctaggttt tccatgcact catccataat gatggcaatg ggacccctgg cggccgcttt 3300
ggcaaacacg ttttgggggt tggaaacatc atagttttgc tctagagtga gctcatcata 3360
ggccatctta acaaagcggg gtaggagggt gcccgactgg gggatgatag ttccatctgg 3420
gcctggggcg tagttaccct cacagatctg catctcccag gccttaattt ccgagggggg 3480
tatcatgtcc acctgggggg caataaagaa cacggtttct ggcgggggat tgatgagctg 3540
ggtggaaagc aagttacgca gcagttgaga tttgccacag ccggtggggc cgtagatgac 3600
cccgatgacg ggttgcagct ggtagttgag agaggaacag ctgccgtcgg ggcgcaggag 3660
gggggctacc tcattcatca tgcttctaac atgtttattt tcactcacta agttttgcaa 3720
gagcctctcc ccacccaggg ataagagttc ttccaggctg ttgaagtgtt tcagcggttt 3780
taggccgtcg gccatgggca tcttttcgag cgactgacga agcaagtaca gtcggtccca 3840
gagctcggtg acgtgctcta tggaatctcg atccagcaga cttcttggtt gcgggggttg 3900
ggtcgacttt cgctgtaggg caccagccgg tgggcgtcca gggccgcgag ggttctgtcc 3960
ttccagggtc tcagcgtccg ggtgagggtg gtctcggtga cggtgaaggg atgagccccg 4020
ggctgggcgc ttgcgagggt gcgcttcagg ctcatcctgc tggtgctgaa gcggacgtcg 4080
tctccctgtg agtcggccag atagcaacga agcatgaggt cgtagctgag ggactcggcc 4140
gcgtgtccct tggcgcgcag ctttcccttg gaaacgtgct gacatttggt gcagtgcaga 4200
cattggaggg cgtagagttt gggggccagg aagaccgact cgggcgagta ggcgtcggct 4260
ccgcactgag cgcagacggt ctcgcactcc actagccacg tgagctcggg tttagcggga 4320
tcaaaaacca agttgcctcc attttttttg atgcgtttct taccttgcgt ttccatgagt 4380
ttgtggcccg cttccgtgac aaaaaggctg tcggtgtctc cgtagacaga cttgaggggg 4440
cgatcttcca aaggtgttcc gaggtcttcc gcgtacagga actgggacca ctccgagacg 4500
aaggctctgg tccaggctaa cacgaaggag gcaatctgcg aggggtatct gtcgttttca 4560
atgagggggt ccaccttttc cagggtgtgc agacacaggt cgtcctcctc cgcgtccacg 4620
aaggtgattg gcttgtaagt gtaggtcacg tgatctgcac cccccaaagg ggtataaaag 4680
ggggcgtgcc caccctctcc gtcactttct tccgcatcgc tgtggaccag agccagctgt 4740
tcgggtgagt aggccctctc aaaagccggc atgatctcgg cgctcaagtt gtcagtttct 4800
acaaacgagg tggatttgat attcacgtgc cccgcggcga tgcttttgat ggtggagggg 4860
tccatctgat cagaaaacac gatctttttg ttgtcaagtt tggtggcgaa agacccgtag 4920
agggcgttgg aaagcaactt ggcgatggag cgcagggtct gatttttctc ccgatcggcc 4980
ctctccttgg cggcgatgtt gagttgcacg tactcccggg ccgcgcaccg ccactcgggg 5040
aacacggcgg tgcgctcgtc gggcaggatg cgcacgcgcc agccgcgatt gtgcagggtg 5100
atgaggtcca cgctggtagc cacctccccg cggaggggct cgttggtcca acacaatcgc 5160
cccccttttc tggagcagaa cggaggcagg ggatctagca agttggcggg cggggggtcg 5220
gcgtcgatgg tgaagatacc gggtagcagg atcttattaa aataatcgat ttcggtgtcc 5280
gtgtcttgca acgcgtcttc ccacttcttc accgccaggg ccctttcgta gggattcagg 5340
ggcggtcccc agggcatggg gtgggtcagg gccgaggcgt acatgccgca gatgtcatac 5400
acgtacaggg gttccctcaa caccccgatg taagtggggt aacagcgccc cccgcggatg 5460
ctggctcgca cgtagtcgta catctcgcgc gagggagcca tgaggccgtc tcccaagtgg 5520
gtcttgtggg gtttttcggc ccggtagagg atctgtctga agatggcgtg ggagttggaa 5580
gagatggtgg ggcgttggaa gacgttaaag ttggccccgg gtagtcccac ggagtcttgg 5640
atgaactggg cgtaggattc ccggagtttg tccaccaggg cggcggtcac cagcacgtcg 5700
agagcgcagt agtccaacgt ctcgcggacc aggttgtagg ccgtctcttg ttttttctcc 5760
cacagttcgc ggttgaggag gtattcctcg cggtctttcc agtactcttc ggcgggaaat 5820
cctttttcgt ccgctcggta agaacctaac atgtaaaatt cgttcaccgc tttgtatgga 5880
caacagcctt tttctaccgg cagggcgtac gcttgagcgg cctttctgag agaggtgtgg 5940
gtgagggcga aggtgtcccg caccatcact ttcaggtact gatgtttgaa gtccgtgtcg 6000
tcgcaggcgc cctgttccca cagcgtgaag tcggtgcgct ttttctgcct gggattgggg 6060
agggcgaagg tgacatcgtt aaagagtatt ttcccggcgc ggggcatgaa gttgcgagag 6120
atcctgaagg gcccgggcac gtccgagcgg ttgttgatga cctgcgccgc caggacgatc 6180
tcgtcgaagc cgttgatgtt gtgacccacg atgtaaagtt cgatgaagcg cggctgtccc 6240
ttgagggccg gcgctttttt caactcctcg taggtgagac agtccggcga ggagagaccc 6300
agctcagccc gggcccagtc ggagagttga ggattagccg caaggaagga gctccataga 6360
tccaaggcca ggagagtttg caagcggtcg cggaactcgc ggaacttttt ccccacggcc 6420
attttctccg gtgtcactac gtaaaaggtg ttggggcggt tgttccacac gtcccatcgg 6480
agctctaggg ccagctcgca ggcttggcga acgagggtct cctcgccaga gacgtgcatg 6540
accagcataa agggtaccaa ctgtttcccg aacgagccca tccatgtgta ggtttctacg 6600
tcgtaggtga caaagagccg ctgggtgcgc gcgtgggagc cgatcggaaa gaagctgatc 6660
tcctgccacc agctggagga atgggtgtta atgtggtgga agtagaagtc ccgccggcgc 6720
acagagcatt cgtgctgatg tttgtaaaag cgaccgcagt agtcgcagcg ctgcacgctc 6780
tgtatctcct gaacgagatg cgcttttcgc ccgcgcacca gaaaccggag ggggaagttg 6840
agacgggggg ctggtggggc gacatcccct tcgccttggc ggtgggagtc tgcgtctgcg 6900
tcctccttct ctgggtggac gacggtgggg acgacgacgc cccgggtgcc gcaagtccag 6960
atctccgcca cggaggggtg caggcgctgc aggaggggac gcagctgccc gctgtccagg 7020
gagtcgaggg aagtcgcgct gaggtcggcg ggaagcgttt gcaagttcac tttcagaaga 7080
ccggtaagag cgtgagccag gtgcagatgg tacttgattt ccaggggggt gttggatgaa 7140
gcgtccacgg cgtagaggag tccgtgtccg cgcggggcca ccaccgtgcc ccgaggaggt 7200
tttatctcac tcgtcgaggg cgagcgccgg ggggtagagg cggctctgcg ccggggggca 7260
gcggaggcag aggcacgttt tcgtgaggat tcggcagcgg ttgatgacga gcccggagac 7320
tgctggcgtg ggcgacgacg cggcggttga ggtcctggat gtgccgtctc tgcgtgaaga 7380
ccaccggccc ccgggtcctg aacctaaaga gagttccaca gaatcaatgt ctgcatcgtt 7440
aacggcggcc tgcctgagga tctcctgcac gtcgcccgag ttgtcctgat aggcgatctc 7500
ggccatgaac tgttccactt cttcctcgcg gaggtcaccg tggcccgctc gctccacggt 7560
ggcggccagg tcgttggaga tgcggcgcat gagttgagag aaggcgttga ggccgttctc 7620
gttccacacg cggctgtaca ccacgtttcc gaaggagtcg cgcgctcgca tgaccacctg 7680
ggccacgttg agttccacgt ggcgggcgaa gacggcgtag tttctgaggc gctggaagag 7740
gtagttgagc gtggtggcga tgtgctcgca gacgaagaag tacataatcc agcgccgcag 7800
ggtcatctcg ttgatgtctc cgatggcttc gagacgctcc atggcctcgt agaagtcgac 7860
ggcgaagttg aaaaattggg agttgcgggc ggccaccgtg agttcttctt gcaggaggcg 7920
gatgagatcg gcgaccgtgt cgcgcacctc ctgttcgaaa gcgccccgag gcgcctctgc 7980
ttcttcctcc ggctcctcct cttccagggg ctcgggttcc tccggcagct ctgcgacggg 8040
gacggggcgg cgacgtcgtc gtctgaccgg caggcggtcc acgaagcgct cgatcatttc 8100
gccgcgccgg cgacgcatgg tctcggtgac ggcgcgtccg ttttcgcgag gtcgcagttc 8160
gaagacgccg ccgcgcagag cgcccccgtg cagggagggt aagtggttag ggccgtcggg 8220
cagggacacg gcgctgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga 8280
tctgagaacg tcgaggtcga cgggatccga gaacttctct aggaaagcgt ctatccaatc 8340
gcaatcgcaa ggtaagctga gaacggtggg tcgctggggg gcgttcgcgg gcagttggga 8400
ggtgatgctg ctgatgatgt aattaaagta ggcggtcttc aggcggcgga tggtggcgag 8460
gaggaccacg tctttgggcc cggcctgttg aatgcgcagg cgctcggcca tgccccaggc 8520
ctcgctctga cagcgacgca ggtctttgta gaagtcttgc atcagtctct ccaccggaac 8580
ctctgcttct cccctgtctg ccatgcgagt cgagccgaac ccccgcaggg gctgcagcaa 8640
cgctaggtcg gccacgaccc tttcggccag cacggcctgt tgaatctgcg tgagggtggc 8700
ctggaagtcg tccaggtcca cgaagcggtg ataggccccc gtgttgatgg tgtaggtgca 8760
gttggccatg acggaccagt tgacgacttg catgccgggt tgggtgatct ccgtgtactt 8820
gaggcgcgag taggccctgg actcgaacac gtagtcgttg catgtgcgca ccagatactg 8880
gtagccgacc aggaagtgag gaggcggctc tcggtacagg ggccagccaa cggtggcggg 8940
ggcgccgggg gacaggtcgt ccagcatgag gcggtggtag tggtagatgt agcgggagag 9000
ccaggtgatg ccggccgagg tggttgcggc cctggtgaat tcgcggacgc ggttccagat 9060
gttgcgcagg ggaccaaagc gctccatggt gggcacgctc tgccccgtga ggcgggcgca 9120
atcttgtacg ctctagatgg aaaaaagaca gggcggtcat cgactccttt ccgtagcttg 9180
gggggtaaag tcgcaagggt gcggcggcgg ggaaccccgg ttcgagaccg gccggatccg 9240
ccgctcccga tgcgcctggc cccgcatcca cgacgtccgc gccgagaccc agccgcgacg 9300
ctccgcccca atacggaggg gagtcttttg gtgttttttc gtagatgcat ccggtgctgc 9360
ggcagatgcg accccagacg cccactacca ccgccgtggc ggcagtaaac ctgagcggag 9420
gcggtgacag ggaggaggaa gagctggctt tagacctgga agagggagag gggctggccc 9480
ggctgggagc gccatcccca gagagacacc ctagggttca gctcgtgagg gacgccaggc 9540
aggcttttgt gccgaagcag aacctgttta gggaccgcag cggtcaggag gcggaggaga 9600
tgcgcgattg caggtttcgg gcgggcagag agctcagggc gggcttcgat cgggagcggc 9660
tcctgagggc ggaggatttc gagcccgacg agcgttctgg ggtgagcccg gcccgcgctc 9720
acgtatcggc ggccaacctg gtgagcgcgt acgagcagac ggtgaacgag gagcgcaact 9780
tccaaaagag ctttaacaat cacgtgagga ccctgatcgc gagggaggag gtgaccatcg 9840
ggctgatgca tctgtgggac ttcgtggagg cctacgtgca gaacccggct agcaaacccc 9900
tgacggccca gctgttcctg atcgtgcagc acagccgcga caacgagacg ttccgcgacg 9960
ccatgttgaa catcgcggag cccgagggtc gctggctctt ggatctgatt aacatcctgc 10020
agagcatcgt ggtgcaggag aggggcctga gtttagcgga caaggtggcg gccattaact 10080
attcgatgca gagcctgggg aagttctacg ctcgcaagat ctacaagagc ccttacgtgc 10140
ccatagacaa ggaggtgaag atagacagct tttacatgcg catggcgctg aaggtgctga 10200
cgctgagcga cgacctcggc gtgtaccgta acgacaagat ccacaaggcg gtgagcgcca 10260
gccgccggcg ggagctgagc gacagggagc tgatgcacag cctgcagagg gcgctggcgg 10320
gcgccgggga cgaggagcgc gaggcttact tcgacatggg agccgatctg cagtggcgtc 10380
ccagcgcgcg cgccttggag gcggcgggtt atcccgacga ggaggatcgg gacgatttgg 10440
aggaggcagg cgagtacgag gacgaagcct gaccgggcag gtgttgtttt agatgcagcg 10500
gccggcggac gggaccaccg cggatcccgc acttttggca tccatgcaga gtcaaccttc 10560
gggcgtgacc gcctccgatg actgggcggc ggccatggac cgcatcatgg cgctgaccac 10620
ccgcaacccc gaggctttta ggcagcaacc ccaggccaac cgtttttcgg ccatcttgga 10680
agcggtggtg ccgtcgcgca ccaacccgac gcacgagaaa gtcctgacta tcgtgaacgc 10740
cctggtagac agcaaggcca tccgccgtga cgaggcgggc ttgatttaca acgctctttt 10800
ggaacgcgtg gcgcgctaca acagcactaa cgtgcagacc aatctggacc gcctcaccac 10860
cgacgtgaag gaggcgctgg cgcagaagga gcggtttctg agggacagta atctgggctc 10920
tctggtggca ctgaacgcct tcctgagctc acagccggcc aacgtgcccc gcgggcagga 10980
ggattacgtg agcttcatca gcgctctgag actgctggtg tccgaggtgc cccagagcga 11040
ggtgtaccag tctgggccgg attacttttt ccagacgtcc cgacagggct tgcaaacggt 11100
gaacctgact caggccttta aaaacttgca aggcatgtgg ggggtcaagg ccccggtggg 11160
cgatcgcgcc actatctcca gtctgctgac ccccaacact cgcctgctgc tgctcttgat 11220
cgcaccgttt accaacagta gcactatcag ccgtgactcg tacctgggtc atctcatcac 11280
tctgtaccgc gaggccatcg gccaggctca gatcgacgag catacgtatc aggagattac 11340
taacgtgagc cgtgccctgg gtcaggaaga taccggcagc ctggaagcca cgttgaactt 11400
tttgctaacc aaccggaggc aaaaaatacc ctcccagttc acgttaagcg ccgaggagga 11460
gaggattctg cgatacgtgc agcagtccgt gagcctgtac ttgatgcgcg agggcgccac 11520
cgcttccacg gctttagaca tgacggctcg gaacatggaa ccgtcctttt actccgccca 11580
ccggccgttc attaaccgtc tgatggacta cttccatcgc gcggccgcca tgaacgggga 11640
gtacttcacc aatgccatcc tgaatccgca ttggatgccc ccgtccggct tctacaccgg 11700
ggagtttgac ctgcccgaag ccgacgacgg ctttctgtgg gacgacgtgt ccgatagcat 11760
tttcacgccg gctaatcgcc gattccagaa gaaggagggc ggagacgagc tccccctctc 11820
cagcgtggaa gcggcctcaa ggggagagag tccctttcca agtctgtctt ccgccagtag 11880
cggtcgggta acgcgtccac ggttgccggg ggagagcgac tacctgaacg accccttgct 11940
gcgaccggct agaaagaaaa attttcccaa taacggggtg gaaagcttgg tggataaaat 12000
gaatcgttgg aagacgtacg cccaggagca gcgggagtgg gaggacagtc agccgcggcc 12060
gctggtaccg ccgcattggc gtcgccagag agaagacccg gacgactccg cagacgatag 12120
tagcgtgttg gacctgggag ggagcggagc caaccccttt gctcacttgc aacccaaggg 12180
gcgctcgagt cgcctgtatt aataaaaaag acgcggaaac ttaccagagc catggccaca 12240
gcgtgtgtgc tttcttcctc tctttcttcc tcggcgcggc aga atg aga aga gcg 12295
Met Arg Arg Ala
1
gtg aga gtc acg ccg gcg gcg tat gag ggc ccg ccc cct tct tac gaa 12343
Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro Pro Ser Tyr Glu
5 10 15 20
agc gtg atg gga tca gcg aac gtg ccg gcc acg ctg gag gcg cct tac 12391
Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu Glu Ala Pro Tyr
25 30 35
gtt cct ccc aga tac ctg gga cct acg gag ggc aga aac agc atc cgt 12439
Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg Asn Ser Ile Arg
40 45 50
tac tcc gag ctg gcg ccc ctg tac gat acc acc aag gtg tac ctg gtg 12487
Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys Val Tyr Leu Val
55 60 65
gac aac aag tcg gcg gac atc gcc tcc ctg aat tac caa aac gat cac 12535
Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn Asp His
70 75 80
agt aac ttt ctg act acc gtg gtg cag aac aat gac ttc acc ccg acg 12583
Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr Pro Thr
85 90 95 100
gag gcg ggc acg cag acc att aac ttt gac gag cgt tcc cgc tgg ggc 12631
Glu Ala Gly Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly
105 110 115
ggt cag ctg aaa acc atc ctg cac acc aac atg ccc aac atc aac gag 12679
Gly Gln Leu Lys Thr Ile Leu His Thr Asn Met Pro Asn Ile Asn Glu
120 125 130
ttc atg tcc acc aac aag ttc agg gct aag ctg atg gta gaa aaa agt 12727
Phe Met Ser Thr Asn Lys Phe Arg Ala Lys Leu Met Val Glu Lys Ser
135 140 145
aat gcg gaa act cgg cag ccc cga tac gag tgg ttc gag ttt acc att 12775
Asn Ala Glu Thr Arg Gln Pro Arg Tyr Glu Trp Phe Glu Phe Thr Ile
150 155 160
cca gag ggc aac tat tcc gaa act atg act atc gat ctc atg aat aac 12823
Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr Ile Asp Leu Met Asn Asn
165 170 175 180
gcg atc gtg gac aat tac ctg caa gtg ggg aga cag aac ggg gtg ctg 12871
Ala Ile Val Asp Asn Tyr Leu Gln Val Gly Arg Gln Asn Gly Val Leu
185 190 195
gaa agc gat atc ggc gtg aaa ttc gat acc aga aac ttc cga ctg ggg 12919
Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly
200 205 210
tgg gat ccc gtg acc aag ctg gtg atg cca ggc gtg tac acc aac gag 12967
Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val Tyr Thr Asn Glu
215 220 225
gct ttt cac ccg gac atc gtg ctg ctg ccg ggg tgc ggt gtg gac ttc 13015
Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe
230 23 240
act cag agc cgt ttg agt aac ctg tta gga att aga aag cgc cgc ccc 13063
Thr Gln Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Arg Pro
245 250 255 260
ttc caa gag ggc ttt caa atc atg tat gag gac ctg gag gga ggt aat 13111
Phe Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn
265 270 275
ata ccc gcc tta ctg gac gtg tcg aag tac gaa gct agc ata caa cgc 13159
Ile Pro Ala Leu Leu Asp Val Ser Lys Tyr Glu Ala Ser Ile Gln Arg
280 285 290
gcc aaa gcg gag ggt aga gag att cgg gga gac acc ttt gcg gta gct 13207
Ala Lys Ala Glu Gly Arg Glu Ile Arg Gly Asp Thr Phe Ala Val Ala
295 300 305
ccc cag gac ctg gaa ata gtg cct tta act aaa gac agc aaa gac aga 13255
Pro Gln Asp Leu Glu Ile Val Pro Leu Thr Lys Asp Ser Lys Asp Arg
310 315 320
agc tac aat att ata aac aac acg acg gac acc ctg tat cgg agc tgg 13303
Ser Tyr Asn Ile Ile Asn Asn Thr Thr Asp Thr Leu Tyr Arg Ser Trp
325 330 335 340
ttt ctg gct tac aac tac gga gac ccc gag aaa gga gtg aga tca tgg 13351
Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp
345 350 355
acc ata ctc acc acc acg gac gtg acc tgt ggc tcg cag caa gtg tac 13399
Thr Ile Leu Thr Thr Thr Asp Val Thr Cys Gly Ser Gln Gln Val Tyr
360 365 370
tgg tcc ctg ccg gat atg atg caa gac ccg gtc acc ttc cgc ccc tcc 13447
Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Pro Ser
375 380 385
acc caa gtc agc aac ttc ccg gtg gtg ggc acc gag ctg ctg ccc gtc 13495
Thr Gln Val Ser Asn Phe Pro Val Val Gly Thr Glu Leu Leu Pro Val
390 395 400
cat gcc aag agc ttc tac aac gag cag gcc gtc tac tcg caa ctt att 13543
His Ala Lys Ser Phe Tyr Asn Glu Gln Ala Val Tyr Ser Gln Leu Ile
405 410 415 420
cgc cag tcc acc gcg ctt acc cac gtg ttc aat cgc ttt ccc gag aac 13591
Arg Gln Ser Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn
425 430 435
cag att ctg gtg cgc cct ccc gct cct acc att acc acc gtc agt gaa 13639
Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu
440 445 450
aac gtt ccc gcc ctc aca gat cac gga acc ctg ccg ctg cgc agc agt 13687
Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser
455 460 465
atc agt gga gtt cag cgc gtg acc atc acc gac gcc aga cgt cga acc 13735
Ile Ser Gly Val Gln Arg Val Thr Ile Thr Asp Ala Arg Arg Arg Thr
470 475 480
tgc ccc tac gtt tac aaa gcg ctt ggc gtg gtg gct cct aaa gtt ctt 13783
Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Lys Val Leu
485 490 495 500
tct agt cgc acc ttc taa aaacatgtcc atcctcatct ctcccgataa 13831
Ser Ser Arg Thr Phe
505
caacaccggc tggggactgg gctccggcaa gatgtacggc ggagccaaaa ggcgctccag 13891
tcagcaccca gttcgagttc ggggccactt ccgcgctcct tggggagctt acaagcgagg 13951
actctcgggt cgaacggctg tagacgatac catagatgcc gtgattgccg acgcccgccg 14011
gtacaacccc ggaccggtcg ctagcgccgc ctccaccgtg gattccgtga tcgacagcgt 14071
ggtagccggc gctcgggcct atgctcgccg caagaggcgg ctgcatcgga gacgtcgccc 14131
caccgccgcc atgctggcag ccagggccgt gctgaggcgg gcccggaggg caggcagaag 14191
ggctatgcgc cgcgctgccg ccaacgccgc cgccgggagg gcccgccgac aggctgcccg 14251
ccaggctgcc gctgccatcg ctagcatggc cagacccagg agagggaacg tgtactgggt 14311
gcgtgattct gtgacgggag tccgagtgcc ggtgcgcagc cgacctcccc gaagttagaa 14371
gatccaagct gcgaagacgg cggtactgag tctccctgtt gttatcagcc caacatgagc 14431
aagcgcaagt ttaaagaaga actgctgcag acgctggtgc ctgagatcta tggccctccg 14491
gacgtgaagc cagacattaa gccccgcgat atcaagcgtg ttaaaaagcg ggaaaagaaa 14551
gaggaactcg cggtggtaga cgatggcgga gtggaattta ttaggagttt cgccccgcga 14611
cgcagggttc aatggaaagg gcggcgggta caacgcgttt tgaggccggg caccgcggta 14671
gtttttaccc cgggagagcg gtcggccgtt aggggtttca aaaggcagta cgacgaggtg 14731
tacggcgacg aggacatatt ggaacaggcg gctcaacaga tcggagaatt tgcctacgga 14791
aagcgttcgc gtcgcgaaga cctggccatc gccttagaca gcggcaaccc cacgcccagc 14851
ctcaaacccg tgacgctgca gcaggtgctt cccgtgagcg ccagcacgga cagcaagagg 14911
gggattaaga gagaaatgga agatctgcat cccaccatcc aactcatggt ccctaaacgg 14971
cagaggctgg aagaggtcct ggagaagatg aaagtggacc ccagcataga gccggatgta 15031
aaagtcagac ctattaagga agtggccccc ggtcttgggg tgcaaacggt ggacattcaa 15091
atccccgtca ccaccgcttc aaccgccgtg gaagctatgg aaacgcaaac ggagacccct 15151
gccgcgatcg gtaccaggga agtggcgttg caaacggagc cttggtacga atacgcagcc 15211
cctcggcgtc agaggcgttc cgctcgttac ggccccgcca acgccatcat gccagaatat 15271
gcgctgcatc cgtctattct gcccactccc ggataccggg gtgtgacgta tcgcccgtct 15331
ggaacccgcc gccgaacccg tcgccgccgc cgctcccgtc gcgctctggc ccccgtgtcg 15391
gtgcggcgtg tgacccgccg gggaaagaca gtcgtcattc ccaacccgcg ttaccaccct 15451
agcatccttt aataactctg ccgttttgca gatggctctg acttgccgcg tgcgccttcc 15511
cgttccgcac tatcgaggaa gatctcgtcg taggagaggc atgacgggca gtggtcgccg 15571
gcgggctttg cgcaggcgca tgaaaggcgg aattttaccc gccctgatac ccataattgc 15631
cgccgccatc ggtgccatac ccggcgttgc ttcagtggcg ttgcaagcag ctcgtaataa 15691
ataaacaaag gcttttgcac ttatgacctg gtcctgacta ttttatgcag aaagagcatg 15751
gaagacatca attttacgtc gctggctccg cggcacggct cgcggccgct catgggcacc 15811
tggaacgaca tcggcaccag tcagctcaac gggggcgctt tcaattgggg gagcctttgg 15871
agcggcatta aaaactttgg ctccacgatt aaatcctacg gcagcaaagc ctggaacagt 15931
agtgctggtc agatgctccg agataaactg aaggacacca acttccaaga aaaagtggtc 15991
aatggggtgg tgaccggcat ccacggcgcg gtagatctcg ccaaccaagc ggtgcagaaa 16051
gagattgaca ggcgtttgga aagctcgcgg gtgccgccgc agagagggga tgaggtggag 16111
gtcgaggaag tagaagtaga ggaaaagctg cccccgctgg agaaagttcc cggtgcgcct 16171
ccgagaccgc agaagcggcc caggccagaa ctagaagaga ctctggtgac ggagagcaag 16231
gagcctccct cgtacgagca agccttgaaa gagggcgcct ctccaccctc ctacccgatg 16291
actaagccga tcgcacccat ggctcgaccg gtgtacggca aggattacaa gcccgtcacg 16351
ctagagctgc ccccaccgcc ccccacgcgc ccgaccgtcc cccccctgcc gactccgtcg 16411
gcggccgcgg cgggacccgt gtccgcacca tccgctgtgc ctctgccagc cgcccgtcca 16471
gtggccgtgg ccactgccag aaaccccaga ggccagagag gagccaactg gcaaagcacg 16531
ctgaacagca tcgtgggcct gggagtgaaa agcctgaaac gccgccgttg ctattattaa 16591
aaaagtgtag ctaaaaagtc tcccgttgta tacgcctcct atgttaccgc cagagacgag 16651
tgactgtcgc cgcgagcgcc gctttcaag atg gcc acc cca tcg atg atg ccg 16704
Met Ala Thr Pro Ser Met Met Pro
510
cag tgg tct tac atg cac atc gcc ggc cag gac gcc tcg gag tac ctg 16752
Gln Trp Ser Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu
515 520 525
agt ccc ggc ctc gtg cag ttt gcc cgc gcc acc gac acc tac ttc agc 16800
Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser
530 535 540 545
ttg gga aac aag ttt aga aac ccc acc gtg gcc ccc acc cac gat gtg 16848
Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val
550 555 560
acc acg gac cgc tcg cag agg ctg acc ctg cgc ttt gtg ccc gta gac 16896
Thr Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val Pro Val Asp
565 570 575
cgg gag gac acc gcg tac tct tac aaa gtg cgc tac acg ttg gcc gta 16944
Arg Glu Asp Thr Ala Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val
580 585 590
ggg gac aac cga gtg ctg gac atg gcc agc acc tac ttt gac atc cgg 16992
Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg
595 600 605
ggg gtg ctg gat cgg ggt ccc agc ttc aag ccc tat tcc ggc acc gct 17040
Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala
610 615 620 625
tac aac tcc ctg gcc ccc aag gga gct ccc aac ccc tcg gaa tgg acg 17088
Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Pro Ser Glu Trp Thr
630 635 640
gac act tcc gac aac aaa ctt aaa gca tat gct cag gct ccc tac cag 17136
Asp Thr Ser Asp Asn Lys Leu Lys Ala Tyr Ala Gln Ala Pro Tyr Gln
645 650 655
agt caa gga ctt aca aag gat ggt att cag gtt ggg cta gtt gtg aca 17184
Ser Gln Gly Leu Thr Lys Asp Gly Ile Gln Val Gly Leu Val Val Thr
660 665 670
gag tca gga caa aca ccc caa tat gca aac aaa gtg tac caa ccc gag 17232
Glu Ser Gly Gln Thr Pro Gln Tyr Ala Asn Lys Val Tyr Gln Pro Glu
675 680 685
cca caa att ggg gaa aac caa tgg aat tta gaa caa gaa gat aaa gcg 17280
Pro Gln Ile Gly Glu Asn Gln Trp Asn Leu Glu Gln Glu Asp Lys Ala
690 695 700 705
gcg gga aga gtc cta aag aaa gat acc cct atg ttt ccc tgc tat ggg 17328
Ala Gly Arg Val Leu Lys Lys Asp Thr Pro Met Phe Pro Cys Tyr Gly
710 715 720
tca tat gcc agg ccc aca aac gaa caa gga ggg cag gca aaa aac caa 17376
Ser Tyr Ala Arg Pro Thr Asn Glu Gln Gly Gly Gln Ala Lys Asn Gln
725 730 735
gaa gta gat tta cag ttt ttt gcc act ccg ggc gac acc cag aac acg 17424
Glu Val Asp Leu Gln Phe Phe Ala Thr Pro Gly Asp Thr Gln Asn Thr
740 745 750
gct aaa gtg gta ctt tat gct gaa aat gtc aac ctg gaa act cca gat 17472
Ala Lys Val Val Leu Tyr Ala Glu Asn Val Asn Leu Glu Thr Pro Asp
755 760 765
act cac tta gtg ttt aaa ccc gat gac gac agc acc agt tca aaa ctt 17520
Thr His Leu Val Phe Lys Pro Asp Asp Asp Ser Thr Ser Ser Lys Leu
770 775 780 785
ctt ctt ggg cag cag gct gca cct aac aga ccc aac tac ata ggt ttt 17568
Leu Leu Gly Gln Gln Ala Ala Pro Asn Arg Pro Asn Tyr Ile Gly Phe
790 795 800
aga gat aat ttt att ggt tta atg tac tac aat agc act gga aac atg 17616
Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met
805 810 815
ggc gtg ctg gcc gga cag gct tct caa ttg aat gcc gta gtc gac ttg 17664
Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu
820 825 830
cag gac aga aac acc gag ttg tcc tac cag ctg atg ctg gac gca ctg 17712
Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Met Leu Asp Ala Leu
835 840 845
ggg gat cgc agc cga tat ttt tca atg tgg aat cag gca gta gac agc 17760
Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser
850 855 860 865
tat gac cca gac gtt aga att ata gaa aac cac gga gtg gaa gac gaa 17808
Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu
870 875 880
ctg cca aac tat tgt ttt cct ctg gga gga atg gtg gtg act gac aat 17856
Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met Val Val Thr Asp Asn
885 890 895
tac aac tct gtg acg cct caa aat gga ggc agt gga aat aca tgg cag 17904
Tyr Asn Ser Val Thr Pro Gln Asn Gly Gly Ser Gly Asn Thr Trp Gln
900 905 910
gca gac aat act aca ttt agt caa aga gga gcg cag att ggc tcc gga 17952
Ala Asp Asn Thr Thr Phe Ser Gln Arg Gly Ala Gln Ile Gly Ser Gly
915 920 925
aac atg ttt gcc ctg gaa att aac cta cag gcc aac ctc tgg cgc ggc 18000
Asn Met Phe Ala Leu Glu Ile Asn Leu Gln Ala Asn Leu Trp Arg Gly
930 935 940 945
ttc ttg tat tcc aat att ggg ttg tat ctt cca gac tct ctg aaa atc 18048
Phe Leu Tyr Ser Asn Ile Gly Leu Tyr Leu Pro Asp Ser Leu Lys Ile
950 955 960
acc ccc gac aac atc acg ctg cca gaa aac aaa aac act tat cag tac 18096
Thr Pro Asp Asn Ile Thr Leu Pro Glu Asn Lys Asn Thr Tyr Gln Tyr
965 970 975
atg aac ggt cgc gta acg cca ccc ggg ctc ata gac acc tat gta aac 18144
Met Asn Gly Arg Val Thr Pro Pro Gly Leu Ile Asp Thr Tyr Val Asn
980 985 990
gtg ggc gcg cgc tgg tcc ccc gat gtc atg gac agc att aac ccc ttc 18192
Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser Ile Asn Pro Phe
995 1000 1005
aac cac cac cgt aac gcg ggc ttg cgc tac cgc tcc atg ctc ttg 18237
Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu
1010 1015 1020
ggc aac ggc cgt tat gtg cct ttt cac att cag gtg ccc caa aaa 18282
Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys
1025 1030 1035
ttc ttt gcc att aaa aac ctg ctg ctt ctc ccc ggt tcc tat acc 18327
Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr Thr
1040 1045 1050
tat gag tgg aac ttc cgc aag gat gtc aac atg atc ctg cag agc 18372
Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser
1055 1060 1065
tcg ctg ggt aat gac ctg cga gtg gac ggg gcc agc ata cgc ttt 18417
Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Ile Arg Phe
1070 1075 1080
gac agc att aac ctg tat gcc aac ttt ttt ccc atg gcc cac aac 18462
Asp Ser Ile Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn
1085 1090 1095
acg gcc tct acc ctg gaa gcc atg ctg cgc aac gac acc aat gac 18507
Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp
1100 1105 1110
cag tcc ttc aac gac tac ctg tgc gcg gct aac atg ctg tac ccc 18552
Gln Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro
1115 1120 1125
atc ccc gcc aac gcc acc agc gtg ccc att tct att cct tct cgg 18597
Ile Pro Ala Asn Ala Thr Ser Val Pro Ile Ser Ile Pro Ser Arg
1130 1135 1140
aac tgg gct gcc ttc agg ggc tgg agt ttt act cgc ctc aaa acc 18642
Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr
1145 1150 1155
aag gag act ccc tcg ctg ggc tcc ggt ttt gac ccc tac ttt gtt 18687
Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val
1160 1165 1170
tac tcc ggc tcc att ccc tac cta gat ggc acc ttt tac ctc aac 18732
Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn
1175 1180 1185
cac act ttc aaa aag gtg tct att atg ttt gac tcc tcg gtt agc 18777
His Thr Phe Lys Lys Val Ser Ile Met Phe Asp Ser Ser Val Ser
1190 1195 1200
tgg ccc ggc aac gac cgc ctg cta acg ccc aac gag ttc gaa att 18822
Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile
1205 1210 1215
aag cgt tcc gtg gac ggt gaa ggg tac aac gtg gcc cag agc aac 18867
Lys Arg Ser Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Ser Asn
1220 1225 1230
atg acc aag gac tgg ttt cta att caa atg ctc agt cac tat aat 18912
Met Thr Lys Asp Trp Phe Leu Ile Gln Met Leu Ser His Tyr Asn
1235 1240 1245
ata ggt tac cag ggc ttc tat gtg ccc gag aac tac aag gac cgc 18957
Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Asn Tyr Lys Asp Arg
1250 1255 1260
atg tac tcc ttc ttc cgc aac ttc caa cca atg agc cgg cag gtg 19002
Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Val
1265 1270 1275
gta gat acc gtg act tat aca gac tac aaa gat gtc aag ctc ccc 19047
Val Asp Thr Val Thr Tyr Thr Asp Tyr Lys Asp Val Lys Leu Pro
1280 1285 1290
tac caa cac aac aac tca ggg ttc gtg ggc tac atg gga ccc acc 19092
Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr
1295 1300 1305
atg cga gag gga cag gcc tac ccg gcc aac tat ccc tac ccc ctg 19137
Met Arg Glu Gly Gln Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu
1310 1315 1320
atc gga gag act gcc gta ccc agc ctc acg cag aaa aag ttc ctc 19182
Ile Gly Glu Thr Ala Val Pro Ser Leu Thr Gln Lys Lys Phe Leu
1325 1330 1335
tgc gac cgg gtg atg tgg agg ata ccc ttc tct agc aac ttt atg 19227
Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met
1340 1345 1350
tcg atg ggc tcc ctc acc gac ctg ggg cag aac atg ctg tac gcc 19272
Ser Met Gly Ser Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala
1355 1360 1365
aac tcc gct cac gcc ttg gac atg act ttt gag gtg gat ccc atg 19317
Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met
1370 1375 1380
gat gag ccc acg ctt ctc tat gtt ctg ttt gaa gtc ttc gac gtg 19362
Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu Val Phe Asp Val
1385 1390 1395
gtg cgc atc cac cag ccg cac cgc ggc gtc atc gag gcc gtc tac 19407
Val Arg Ile His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr
1400 1405 1410
ctg cgc aca cct ttc tct gcc ggt aac gcc acc acc taa agaagctgat 19456
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
1415 1420 1425
gggttccagc gaacaggagt tgcaggccat tgttcgcgac ctgggctgcg ggccctgctt 19516
tttgggcacc ttcgacaagc gttttcccgg attcatgtcc ccccacaagc cggcctgcgc 19576
catcgttaac acggccggac gggagacagg gggggtgcac tggctcgcct tcgcctggaa 19636
cccgcgcaac cgcacctgct acctgttcga cccttttggt ttctccgacg aaaggctgaa 19696
gcagatctac caattcgagt acgaggggct cctcaagcgc agcgctctgg cctccacgcc 19756
cgaccactgc gtcaccctgg aaaagtccac ccagacggtc caggggcccc tctcggccgc 19816
ctgcgggctt ttctgttgca tgtttttgca cgccttcgtg cactggcctc acacccccat 19876
ggagcgcaac cccaccatgg atctgctcac cggagtgccc aacagcatgc ttcacagtcc 19936
ccaggtcgcc cccaccctgc gtcgcaatca ggaccacctg tatcgctttc tggggaaaca 19996
ctctgcctat ttccgccgcc accggcagcg catcgaacag gccacggcct tcgaaagcat 20056
gagccaaaga gtgtaatcaa taaaaaccgt ttttatttga catgatacgc gcttctggcg 20116
tttttattaa aaatcgaagg gttcgaggga ggggtcctcg tgcccgctgg ggagggacac 20176
gttgcggtac tggaatcggg cgctccaacg aaactcgggg atcaccagcc gcggcagggc 20236
cacgtcttcc atgttctgct tccaaaactg tcgcaccagc tgcagggctc ccatcacgtc 20296
gggcgctgag atcttgaagt cgcagttagg gccggagccc ccgcggctgt tgcggaacac 20356
ggggttggca cactggaaca ccaacacgct ggggttgtgg atactagcca gggccgtcgg 20416
gtcggtcacc tccgatgcat ccagatcctc ggcattgctc agggcgaacg gggtcagctt 20476
gcacatctgc cgcccgatct ggggtaccag gtcgcgcttg ttgaggcagt cgcagcgcag 20536
agggatgagg atgcgacgct gcccgcgttg catgatgggg taactcgccg ccaggaactc 20596
ctctatctga cggaaggcca tctgggcctt gacgccctcg gtgaaaaata gcccacagga 20656
cttgctggaa aacacgttat tgccacagtt gatgtcttcc gcgcagcagc gcgcatcttc 20716
gttcttcagc tgaaccacgt tgcgacccca gcggttctga accaccttgg ctttcgtggg 20776
atgctccttc agcgcccgct gtccgttctc gctggtcaca tccatttcca ccacgtgctc 20836
cttgcagacc atctccactc cgtggaaaca gaacagaatg ccctcctgtt gggtattgcg 20896
atgctcccac acggcgcacc cggtggactc ccagctcttg tgtttcaccc ccgcgtaggc 20956
ttccatgtaa gccattagaa atctgcccat cagctcagtg aaggtcttct ggttggtgaa 21016
ggttagcggc aggccgcggt gttcctcgtt caaccaagtt tgacagatct tgcggtacac 21076
ggctccctgg tcgggcagaa acttaaaagt cgttctgctc tcgttgtcca cgtggaactt 21136
ctccatcaac atcgtcatga cttccatgcc cttctcccag gcagtcacca gcggcgcgct 21196
ctcggggttc ttcaccaaca cggcggtgga ggggccctcg ccggccccga cgtccttcat 21256
ggacattttt tgaaactcca cggtgccgtc cgcgcggcgt actctgcgca tcggagggta 21316
gctgaagccc acctccatga cggtgctttc gccctcgctg tcggagacga tctccgggga 21376
gggcggcgga acgggggcag acttgcgagc cttcttcttg ggagggagcg gaggcacctc 21436
ctgctcgcgc tcgggactca tctcccgcaa gtagggggtg atggagcttc ctggttggtt 21496
ctgacggttg gccattgtat cctaggcaga aagacatgga gcttatgcgc gaggaaactt 21556
taaccgcccc gtcccccgtc agcgacgaag aggtcatcgt cgaacaggac ccgggctacg 21616
ttacgccgcc cgaggatctg gaggggccct tagacgaccg gcgcgacgct agtgagcggc 21676
aggaaaatga gaaagaggag gaggagggct gctacctcct ggaaggcgac gttttgctaa 21736
agcatttcgc caggcagagc accatactca aggaggcctt gcaagaccgc tccgaggtgc 21796
ccttggacgt cgccgcgctc tcccaggcct acgaggcgaa ccttttctcg ccccgagtgc 21856
ctccgaagag acagcccaac ggcacctgcg agcccaaccc gcgactcaac ttctaccccg 21916
tgttcgccgt gcccgaggcg ctggccacct accacatctt tttcaaaaac cagcgcattc 21976
ccctttcctg ccgggccaac cgcaccgcgg ccgataggaa gctaacactc agaaacggag 22036
tcagcatacc tgatatcacg tcactggagg aagtgcctaa gatcttcgag ggtctgggtc 22096
gagatgagaa gcgggcggcg aacgctctgc agaaagaaca gaaagagagt cagaacgtgc 22156
tggtggagct ggagggggac aacgcgcgtc tgaccgtcct caaacgttgc atagaagttt 22216
cccacttcgc ctacccggcc ctcaacctgc cgcccaaagt tatgaaatcg gtcatggacc 22276
agctactcat caagagagct gagcccctga atcccgacca ccctgaggcg gaaaactcag 22336
aggacggaaa gcccgtcgtc agcgacgagg agctcgagcg gtggctggaa accagggacc 22396
cccagcagtt gcaagagagg cgcaagatga tgatggcggc cgtgctggtc acggtggagc 22456
tagaatgcct gcaacggttt ttcagcgacg tggagacgct acgcaaaatc ggggagtccc 22516
tgcactacac cttccgccag ggctacgttc gccaggcctg caaaatctcc aacgtagagc 22576
tcagcaacct ggtttcctac atgggcatcc tccacgagaa ccggctgggg cagagcgtgc 22636
tgcactgcac cttgcaaggc gaggcgcgaa gggactacgt ccgagactgc gtctacctct 22696
tcctcaccct cacctggcag accgccatgg gcgtgtggca gcagtgcttg gaagagagaa 22756
acctcaaaga gctggacaaa ctcctctgcc gccagcggcg ggccctctgg accggcttca 22816
gcgagcgcac ggtcgcctgc gccctggcag acatcatttt cccagaacgc ctgatgaaaa 22876
ccttgcagaa cggcctgccg gatttcatca gtcagagcat cttgcaaaac ttccgctcct 22936
tcgtcctgga gcgctccggg atcttgcccg ccatgagctg cgcgctgcct tctgactttg 22996
tccccctttc ctaccgcgag tgccctcccc cactgtggag ccactgctac ctcttccaac 23056
tggccaactt tctggcctac cactccgacc tcatggaaga cgtgagcgga gaggggctgc 23116
tcgagtgcca ctgccgctgc aacctctgca ccccccacag atcgctggcc tgcaacaccg 23176
agctgctcag cgaaacccag gtcataggta ccttcgagat ccaggggccc cagcagcaag 23236
agggtgcttc cggcttgaag ctcactccgg cgctgtggac ctcggcttac ttacgcaaat 23296
ttgtagccga ggactaccac gcccacaaaa ttcagtttta cgaagaccaa tctcgaccac 23356
cgaaagcccc cctcacggcc tgcgtcatca cccagagcaa aatcctggcc caattgcaat 23416
ccatcaacca agcgcgccga gatttccttt tgaaaaaggg tcggggggtg tacctggacc 23476
cccagaccgg cgaggaactc aacccgtcca cactttccgt cgaagcagcc cccccgagac 23536
atgccaccca agggaaccgc caagcagctg atcgctcggc agagagcgaa gaagcaagag 23596
ctgctccagc agcaggtgga ggacgaggaa gagctgtggg acagccaggc agaggaggtg 23656
tcagaggacg aggaggagat ggaaagctgg gacagcctag acgaggagga cgagctttca 23716
gaggaagagg cgaccgaaga aaaaccacct gcatccagcg cgccttctct gagccgacag 23776
ccgaagcccc ggcccccgac gcccccggcc ggctcactca aagccagccg taggtgggac 23836
gccaccggat ctccagcggc agcggcaacg gcagcgggta aggccaaacg cgagcggcgg 23896
gggtattgct cctggcggac ccacaaaagc agtatcgtga actgcttgca acactgcggg 23956
ggaaacatct cctttgcccg acgctacctc ctcttccatc acggtgtggc cttccctcgc 24016
aacgttctct attattaccg tcatctctac agcccctacg aaacgctcgg agaaaaaagc 24076
taaggcctcc tctgccgcga ggaaaaactc cgccgccgct gccgccaagg atccgccggc 24136
caccgaggag ctgagaaagc gcatctttcc cactctgtat gctatctttc agcaaagccg 24196
cgggcagcac cctcagcgcg aactgaaaat aaaaaaccgc tccttccgct cactcacccg 24256
cagctgtctg taccacaaga gagaagacca gctgcagcgc accctggacg acgccgaagc 24316
actgttcagc aaatactgct cagcgtctct taaagactaa aagacccgcg ctttttcccc 24376
ctcgggcgcc aaaacccacg tcatcgccag catgagcaag gagattccca ccccttacat 24436
gtggagctat cagccccaga tgggcctggc cgcgggggcc gcccaggact actccagcaa 24496
aatgaactgg ctcagcgccg gcccccacat gatctcacga gttaacggca tccgagccca 24556
ccgaaaccag atcctcttag aacaggcggc aatcaccgcc acaccccggc gccaactcaa 24616
cccgcccagt tggcccgccg cccaggtgta tcaggaaact ccccgcccga ccacagtcct 24676
cctgccacgc gacgcggagg ccgaagtcct catgactaac tctggggtac aattagcggg 24736
cgggtccagg tacgccaggt acagaggtcg ggccgctcct tactctcccg ggagtataaa 24796
gagggtgatc attcgaggcc gaggtatcca gctcaacgac gaggcggtga gctcctcaac 24856
cggtctcaga cctgacggag tcttccagct cggaggagcg ggccgctctt ccttcaccac 24916
tcgccaggcc tacctgaccc tgcagagctc ttcctcgcag ccgcgctccg ggggaatcgg 24976
cactctccag ttcgtggaag agttcgtccc ctccgtctac ttcaacccgt tttccggctc 25036
acctggacgc tacccggacg ccttcattcc caactttgac gcagtgagtg aatccgtgga 25096
cggctacgac tgatgacaga tggtgcggcc gtgagagctc ggctgcgaca tctgcatcac 25156
tgccgccagc ctcgctgcta cgctcgggag gcgatcgtgt tcagctactt tgagctgccg 25216
gacgagcacc ctcagggacc ggctcacggg ttgaaactcg agattgagaa cgcgcttgag 25276
tctcacctca tcgacgcctt caccgcccgg cctctcctgg tagaaaccga acgcgggatc 25336
actaccatca ccctgttctg catctgcccc acgcccggat tac atg aag atc tgt 25391
Met Lys Ile Cys
1430
gtt gtc atc ttt gcg ctc agt tta ata aaa act gaa ctt ttt gcc 25436
Val Val Ile Phe Ala Leu Ser Leu Ile Lys Thr Glu Leu Phe Ala
1435 1440 1445
gta cct tca acg cca cgc gtt gtt tct cct tgt gaa aaa acc cca 25481
Val Pro Ser Thr Pro Arg Val Val Ser Pro Cys Glu Lys Thr Pro
1450 1455 1460
gga gtc ctt aac tta cac ata gca aaa ccc ttg tat ttt acc ata 25526
Gly Val Leu Asn Leu His Ile Ala Lys Pro Leu Tyr Phe Thr Ile
1465 1470 1475
gaa aaa caa cta gcc ctt tca att gga aaa ggg tta aca att tct 25571
Glu Lys Gln Leu Ala Leu Ser Ile Gly Lys Gly Leu Thr Ile Ser
1480 1485 1490
gct aca gga cag ttg gaa agc aca gca agc gta cag gac agc gct 25616
Ala Thr Gly Gln Leu Glu Ser Thr Ala Ser Val Gln Asp Ser Ala
1495 1500 1505
aca cca ccc cta cgt ggt att tcc cct tta aag ctg aca gac aac 25661
Thr Pro Pro Leu Arg Gly Ile Ser Pro Leu Lys Leu Thr Asp Asn
1510 1515 1520
ggt tta aca tta agc tat tca gat ccc ctg cgt gtg gta ggt gac 25706
Gly Leu Thr Leu Ser Tyr Ser Asp Pro Leu Arg Val Val Gly Asp
1525 1530 1535
caa ctt acg ttt aat ttt act tct cca cta cgt tac gaa aat ggc 25751
Gln Leu Thr Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Gly
1540 1545 1550
agt ctt aca ttc aac tac act tct ccc atg aca cta ata aac aac 25796
Ser Leu Thr Phe Asn Tyr Thr Ser Pro Met Thr Leu Ile Asn Asn
1555 1560 1565
agt ctt gct att aac gtc aat acc tcc aaa ggc ctc agt agt gac 25841
Ser Leu Ala Ile Asn Val Asn Thr Ser Lys Gly Leu Ser Ser Asp
1570 1575 1580
aac ggc aca ctc gct gta aat gtt act cca gat ttt aga ttt aac 25886
Asn Gly Thr Leu Ala Val Asn Val Thr Pro Asp Phe Arg Phe Asn
1585 1590 1595
agc tct ggt gcc tta act ttt ggc ata caa agt cta tgg act ttt 25931
Ser Ser Gly Ala Leu Thr Phe Gly Ile Gln Ser Leu Trp Thr Phe
1600 1605 1610
cca acc aaa act cct aac tgt acc gtg ttt acc gaa agt gac tcc 25976
Pro Thr Lys Thr Pro Asn Cys Thr Val Phe Thr Glu Ser Asp Ser
1615 1620 1625
ctg ctg agt ctt tgc ttg act aaa tgc gga gct cac gta ctt gga 26021
Leu Leu Ser Leu Cys Leu Thr Lys Cys Gly Ala His Val Leu Gly
1630 1635 1640
agc gtg agt tta agc gga gtg gca gga acc atg cta aaa atg acc 26066
Ser Val Ser Leu Ser Gly Val Ala Gly Thr Met Leu Lys Met Thr
1645 1650 1655
cac act tct gtt acc gtt cag ttt tcg ttt gat gac agt ggt aaa 26111
His Thr Ser Val Thr Val Gln Phe Ser Phe Asp Asp Ser Gly Lys
1660 1665 1670
cta ata ttc tct cca ctt gcg aac aac act tgg ggt gtt cga caa 26156
Leu Ile Phe Ser Pro Leu Ala Asn Asn Thr Trp Gly Val Arg Gln
1675 1680 1685
agc gag agt ccg ttg ccc aac cca tcc ttc aac gct ctc acg ttt 26201
Ser Glu Ser Pro Leu Pro Asn Pro Ser Phe Asn Ala Leu Thr Phe
1690 1695 1700
atg cca aac agt acc att tat tct aga gga gca agt aac gaa cct 26246
Met Pro Asn Ser Thr Ile Tyr Ser Arg Gly Ala Ser Asn Glu Pro
1705 1710 1715
caa aac aat tat tat gtc cag acg tat ctt aga ggc aac gtg cga 26291
Gln Asn Asn Tyr Tyr Val Gln Thr Tyr Leu Arg Gly Asn Val Arg
1720 1725 1730
aag cca att cta cta act gtt acc tac aac tca gtt aat tca gga 26336
Lys Pro Ile Leu Leu Thr Val Thr Tyr Asn Ser Val Asn Ser Gly
1735 1740 1745
tat tcc tta act ttt aaa tgg gat gct gtc gcc aat gaa aaa ttt 26381
Tyr Ser Leu Thr Phe Lys Trp Asp Ala Val Ala Asn Glu Lys Phe
1750 1755 1760
gcc act cct aca tct tcg ttt tgc tat gtt gca gag caa taa 26423
Ala Thr Pro Thr Ser Ser Phe Cys Tyr Val Ala Glu Gln
1765 1770
aaccctgtta ccccaccgtc tcgttttttt cag atg aaa cga gcg aga gtt 26474
Met Lys Arg AlaArg Val
1775
gat gaa gac ttc aac cca gtg tac cct tat gac ccc cca tac gct 26519
Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr Asp Pro Pro Tyr Ala
1780 1785 1790
ccc gtc atg ccc ttc att act ccg cct ttt acc tcc tcg gat ggg 26564
Pro Val Met Pro Phe Ile Thr Pro Pro Phe Thr Ser Ser Asp Gly
1795 1800 1805
ttg cag gaa aaa cca ctt gga gtg tta agt tta aac tac agg gat 26609
Leu Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Asn Tyr Arg Asp
1810 1815 1820
ccc att act aca caa aat ggg tct ctc acg tta aaa cta gga aac 26654
Pro Ile Thr Thr Gln Asn Gly Ser Leu Thr Leu Lys Leu Gly Asn
1825 1830 1835
ggc ctc act cta aac aac cag gga cag tta aca tca act gct ggc 26699
Gly Leu Thr Leu Asn Asn Gln Gly Gln Leu Thr Ser Thr Ala Gly
1840 1845 1850
gaa gtg gag cct ccg ctc act aat gct aac aac aaa ctt gca cta 26744
Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu
1855 1860 1865
gcc tat agc gaa cca tta gca gta aaa agc aac cgc cta act cta 26789
Ala Tyr Ser Glu Pro Leu Ala Val Lys Ser Asn Arg Leu Thr Leu
1870 1875 1880
tca cac acc gct ccc ctt gtc atc gct aat aat tct tta gcg ttg 26834
Ser His Thr Ala Pro Leu Val Ile Ala Asn Asn Ser Leu Ala Leu
1885 1890 1895
caa gtt tca gag cct att ttt gta aat gac gat gac aag cta gcc 26879
Gln Val Ser Glu Pro Ile Phe Val Asn Asp Asp Asp Lys Leu Ala
1900 1905 1910
ctg cag aca gcc gcc ccc ctt gta acc aac gct ggc acc ctt cgc 26924
Leu Gln Thr Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg
1915 1920 1925
tta cag agc gct gcc cct tta gga ttg gtt gaa aat act ctt aaa 26969
Leu Gln Ser Ala Ala Pro Leu Gly Leu Val Glu Asn Thr Leu Lys
1930 1935 1940
ctg ctg ttt tct aaa ccc ttg tat ttg caa aat gat ttt ctt gca 27014
Leu Leu Phe Ser Lys Pro Leu Tyr Leu Gln Asn Asp Phe Leu Ala
1945 1950 1955
tta gcc att gaa cgc ccc ctg gct gta gca gcc gca ggt act ctg 27059
Leu Ala Ile Glu Arg Pro Leu Ala Val Ala Ala Ala Gly Thr Leu
1960 1965 1970
acc cta caa ctt act cct cca tta aag act aac gat gac ggg cta 27104
Thr Leu Gln Leu Thr Pro Pro Leu Lys Thr Asn Asp Asp Gly Leu
1975 1980 1985
aca cta tcc aca gtc gag cca tta act gta aaa aac gga aac cta 27149
Thr Leu Ser Thr Val Glu Pro Leu Thr Val Lys Asn Gly Asn Leu
1990 1995 2000
ggc ttg caa ata tcg cgc cct tta gtt gtt caa aac aac ggc ctt 27194
Gly Leu Gln Ile Ser Arg Pro Leu Val Val Gln Asn Asn Gly Leu
2005 2010 2015
tcg ctt gct att acc ccc ccg ctg cgt ttg ttt aac agc gac ccc 27239
Ser Leu Ala Ile Thr Pro Pro Leu Arg Leu Phe Asn Ser Asp Pro
2020 2025 2030
gtt ctt ggt ttg ggc ttc act ttt ccc cta gct gtc aca aac aac 27284
Val Leu Gly Leu Gly Phe Thr Phe Pro Leu Ala Val Thr Asn Asn
2035 2040 2045
ctc ctc tcc tta aac atg gga gac gga gtt aaa ctt acc tat aat 27329
Leu Leu Ser Leu Asn Met Gly Asp Gly Val Lys Leu Thr Tyr Asn
2050 2055 2060
aaa cta aca gcc aat ttg ggt agg gat tta caa ttt gaa aac ggt 27374
Lys Leu Thr Ala Asn Leu Gly Arg Asp Leu Gln Phe Glu Asn Gly
2065 2070 2075
gcg att gcc gta acg ctt act gcc gaa tta cct ttg caa tac act 27419
Ala Ile Ala Val Thr Leu Thr Ala Glu Leu Pro Leu Gln Tyr Thr
2080 2085 2090
aac aaa ctt caa ctg aat att gga gct ggc ctt cgt tac aat gga 27464
Asn Lys Leu Gln Leu Asn Ile Gly Ala Gly Leu Arg Tyr Asn Gly
2095 2100 2105
gcc agc aga aaa cta gat gta aac att aac caa aat aaa ggc tta 27509
Ala Ser Arg Lys Leu Asp Val Asn Ile Asn Gln Asn Lys Gly Leu
2110 2115 2120
act tgg gac aac gat gca gtt att ccc aaa cta gga tcg ggc tta 27554
Thr Trp Asp Asn Asp Ala Val Ile Pro Lys Leu Gly Ser Gly Leu
2125 2130 2135
caa ttt gac cct aat ggc aac atc gct gtt atc cct gaa acc gtg 27599
Gln Phe Asp Pro Asn Gly Asn Ile Ala Val Ile Pro Glu Thr Val
2140 2145 2150
aag ccg caa acg tta tgg acg act gca gat ccc tcg cct aac tgc 27644
Lys Pro Gln Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys
2155 2160 2165
tca gtg tac cag gac ttg gat gcc agg ctg tgg ctc gct ctt gtt 27689
Ser Val Tyr Gln Asp Leu Asp Ala Arg Leu Trp Leu Ala Leu Val
2170 2175 2180
aaa agt ggc gac atg gtg cat gga agc att gcc cta aaa gcc cta 27734
Lys Ser Gly Asp Met Val His Gly Ser Ile Ala Leu Lys Ala Leu
2185 2190 2195
aaa ggg acg ttg cta aat cct aca gcc agc tac att tcc att gtg 27779
Lys Gly Thr Leu Leu Asn Pro Thr Ala Ser Tyr Ile Ser Ile Val
2200 2205 2210
ata tat ttt tac agc aac gga gtc agg cgt acc aac tat cca acg 27824
Ile Tyr Phe Tyr Ser Asn Gly Val Arg Arg Thr Asn Tyr Pro Thr
2215 2220 2225
ttt gac aac gaa ggc acc tta gct aac agc gcc act tgg gga tac 27869
Phe Asp Asn Glu Gly Thr Leu Ala Asn Ser Ala Thr Trp Gly Tyr
2230 2235 2240
cga cag ggg caa tct gct aac act aat gtg acc aat gcc act gaa 27914
Arg Gln Gly Gln Ser Ala Asn Thr Asn Val Thr Asn Ala Thr Glu
2245 2250 2255
ttt atg ccc agc tca agc agg tac ccc gtg aat aaa gga gac aac 27959
Phe Met Pro Ser Ser Ser Arg Tyr Pro Val Asn Lys Gly Asp Asn
2260 2265 2270
att caa aat caa tct ttt tca tac acc tgt att aaa gga gat ttt 28004
Ile Gln Asn Gln Ser Phe Ser Tyr Thr Cys Ile Lys Gly Asp Phe
2275 2280 2285
gct atg cct gtc ccg ttc cgt gta aca tat aat cac gcc ctg gaa 28049
Ala Met Pro Val Pro Phe Arg Val Thr Tyr Asn His Ala Leu Glu
2290 2295 2300
ggg tat tcc ctt aag ttc acc tgg cgc gtt gta gcc aat cag gcc 28094
Gly Tyr Ser Leu Lys Phe Thr Trp Arg Val Val Ala Asn Gln Ala
2305 2310 2315
ttt gat att cct tgc tgt tca ttt tca tac atc aca gaa taa 28136
Phe Asp Ile Pro Cys Cys Ser Phe Ser Tyr Ile Thr Glu
2320 2325 2330
aaaaccactt tttcatttta atttcttttt attttacacg aacagtgaga cttcctccac 28196
ccttccattt gacagcatac accagcctct cccccttcat agcagtaaac tgttgtgaat 28256
cagtccggta tttgggagtt aaaatccaaa cagtctcttt ggtgatgaaa cgtcgatcag 28316
taatggacac aaatccctgg gacaggtttt ccaacgtttc ggtgaaaaac tgcacaccgc 28376
cctacaaaac aaacaggttc aggctctcca cgggttatct ccccgatcaa actcagacag 28436
ggtaaaggtg cggtggtgtt ccactaaacc acgcaggtgg cgctgtctga acctctcggt 28496
gcgactcctg tgaggctggt aagaagttag attgtccagt agcctcacag catgtatcat 28556
cagtctacga gtgcgtctgg cgcagcagcg catctgaatc tcactgagat tccggcaaga 28616
atcgcacacc atcacaatca ggttgttcat gatcccatag ctgaacacgc tccagccaaa 28676
gctcattcgc tccaacagcg ccaccgcgtg tccgtccaac cttactttaa cataaatcag 28736
gtgtctgccg cgtacaaaca tgctacccac atacagaact tcccggggca ggcccctgtt 28796
caccacctgt ctgtaccagg gaaacctcac atttatcagg gagccataga tggccatttt 28856
aaaccaatta gctaataccg ccccaccagc tctacactga agagaaccgg gagagttaca 28916
atgacagtga ataatccatc tctcataacc cctgatggtc tgatgaaaat ctagatctaa 28976
cgtggcacaa caaatacaca ctttcatata cattttcata acatgttttt cccaggccgt 29036
taaaatacaa tcccaataca cgggccactc ctgcagtaca ataaagctaa tacaagatgg 29096
tatactcctc acctcactga cactgtgcat gttcatattt tcacattcta agtaccgaga 29156
gttctcctct acagcagcac tgctgcggtc ctcacaaggt ggtagctggt gatgattgta 29216
gggggccagt ctgcagcgat accgtctgtc gcgttgcatc gtagaccagg aaccgacgca 29276
cctcctcgta cttgtggtag cagaaccacg tccgctgcca gcacgtctcc acgtaacgcc 29336
ggtccctgcg tcgctcacgc tccctcctca atgcaaagtg caaccactct tgtaatccac 29396
acagatccct ctcggcctcc ggggtgatgc acacctcaaa cctacagatg tctcggtaca 29456
gttccaaaca cgtagtgagg gcgagttcca accaagacag acagcctgat ctatcccgac 29516
acactggagg tggaggaaga cacggaagag gcatgttatt ccaagcgatt caccaacggg 29576
tcgaaatgaa gatcccgaag atgacaacgg tcgcctccgg agccctgatg gaatttaaca 29636
gccagatcaa acgttatgcg attctccaag ctatcgatcg ccgcttccaa aagagcctgg 29696
acccgcactt ccacaaacac cagcaaagca aaagcactat tatcaaactc ttcaatcatc 29756
aagctgcagg actgtacaat gcctaagtaa ttttcgtttc tccactcgcg aatgatgtcg 29816
cggcagatag tctgaaggtt catcccgtgc agggtaaaaa gctccgaaag ggcgccctct 29876
acagccatgc gtagacacac catcatgact gcaagatatc gggctcctga gacacctgca 29936
gcagatttaa cagatcaagg tcaggttgct ctccgcgatc acgaatctcc atccgcaagg 29996
tcatttgcaa aaaattaaat aaatctatgc cgactagatc tgtcaactcc gcattaggaa 30056
ccaaatcagg tgtggctacg cagcacaaaa gttccaggga tggtgccaaa ctcactagaa 30116
ccgctcccga gtaacaaaac tgatgaatgg gagtaacaca gtgtaaaatg tgcaaccaaa 30176
aatcactaag gtgctccttt aaaaagtcca gtacttctat attcagtccg tgcaagtact 30236
gaagcaactg tgcgggaata tgcacaacaa aaaaaatagg gcggctcaga tacatgttga 30296
cctaaaataa aaagaatcat taaactaaag aagcttggcg aacggtggga taaatgacac 30356
gctccagcag cagacaggca accggctgtc cccgggaacc gcggtaaaat tcatccgaat 30416
gattaaaaag aacaacagaa acttcccacc atgtactcgg ttggatctcc tgagcacaca 30476
gcaatacccc cctcacattc atgtccgcca cagaaaaaaa acgtcccaga tacccagcgg 30536
ggatatccaa cgacagctgc aaagacagca aaacaatccc tctgggagcg atcacaaaat 30596
cctccggtga aaaaagcaca tacatattag aataaccctg ttgctggggc aaaaaggccc 30656
ggcgtcccag caaatgcaca taaatatgtt catcagccat tgccccgtct taccgcgtaa 30716
tcagccacga aaaaatcgag ctaaaattca cccaacagcc tatagctata tatacactcc 30776
gcccaatgac gctaataccg caccacccac gaccaaagtt cacccacacc cacaaaaccc 30836
gcgaaaatcc agcgccgtca gcacttccgc aatttcagtc tcacaacgtc acttccgcgc 30896
gccttttcac attcccacac acacccgcgc ccttcgcccc gccctcgcgc caccccgcgt 30956
caccgcacgt caccccggcc ccgcctcgct cctccccgct cattatcata ttggcacgtt 31016
tccagaataa ggtatattat tgatgatg 31044
<210>30
<211>505
<212>PRT
<213>猿猴腺病毒SV-25
<400>30
Met Arg Arg Ala Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro
1 5 10 15
Pro Ser Tyr Glu Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu
20 25 30
Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg
35 40 45
Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys
50 55 60
Val Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr
65 70 75 80
Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp
85 90 95
Phe Thr Pro Thr Glu Ala Gly Thr Gln Thr Ile Asn Phe Asp Glu Arg
100 105 110
Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Leu His Thr Asn Met Pro
115 120 125
Asn Ile Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Lys Leu Met
130 135 140
Val Glu Lys Ser Asn Ala Glu Thr Arg Gln Pro Arg Tyr Glu Trp Phe
145 150 155 160
Glu Phe Thr Ile Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr Ile Asp
165 170 175
Leu Met Asn Asn Ala Ile Val Asp Asn Tyr Leu Gln Val Gly Arg Gln
180 185 190
Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn
195 200 205
Phe Arg Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val
210 215 220
Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys
225 230 235 240
Gly Val Asp Phe Thr Gln Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg
245 250 255
Lys Arg Arg Pro Phe Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu
260 265 270
Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Ser Lys Tyr Glu Ala
275 280 285
Ser Ile Gln Arg Ala Lys Ala Glu Gly Arg Glu Ile Arg Gly Asp Thr
290 295 300
Phe Ala Val Ala Pro Gln Asp Leu Glu Ile Val Pro Leu Thr Lys Asp
305 310 315 320
Ser Lys Asp Arg Ser Tyr Asn Ile Ile Asn Asn Thr Thr Asp Thr Leu
325 330 335
Tyr Arg Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly
340 345 350
Val Arg Ser Trp Thr Ile Leu Thr Thr Thr Asp Val Thr Cys Gly Ser
355 360 365
Gln Gln Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr
370 375 380
Phe Arg Pro Ser Thr Gln Val Ser Asn Phe Pro Val Val Gly Thr Glu
385 390 395 400
Leu Leu Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gln Ala Val Tyr
405 410 415
Ser Gln Leu Ile Arg Gln Ser Thr Ala Leu Thr His Val Phe Asn Arg
420 425 430
Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr
435 440 445
Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro
450 455 460
Leu Arg Ser Ser Ile Ser Gly Val Gln Arg Val Thr Ile Thr Asp Ala
465 470 475 480
Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala
485 490 495
Pro Lys Val Leu Ser Ser Arg Thr Phe
500 505
<210>31
<211>921
<212>PRT
<213>猿猴腺病毒SV-25
<400>31
Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Pro Ser Glu Trp Thr Asp Thr Ser Asp Asn Lys Leu Lys
130 135 140
Ala Tyr Ala Gln Ala Pro Tyr Gln Ser Gln Gly Leu Thr Lys Asp Gly
145 150 155 160
Ile Gln Val Gly Leu Val Val Thr Glu Ser Gly Gln Thr Pro Gln Tyr
165 170 175
Ala Asn Lys Val Tyr Gln Pro Glu Pro Gln Ile Gly Glu Asn Gln Trp
180 185 190
Asn Leu Glu Gln Glu Asp Lys Ala Ala Gly Arg Val Leu Lys Lys Asp
195 200 205
Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Glu
210 215 220
Gln Gly Gly Gln Ala Lys Asn Gln Glu Val Asp Leu Gln Phe Phe Ala
225 230 235 240
Thr Pro Gly Asp Thr Gln Asn Thr Ala Lys Val Val Leu Tyr Ala Glu
245 250 255
Asn Val Asn Leu Glu Thr Pro Asp Thr His Leu Val Phe Lys Pro Asp
260 265 270
Asp Asp Ser Thr Ser Ser Lys Leu Leu Leu Gly Gln Gln Ala Ala Pro
275 280 285
Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met
290 295 300
Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser
305 310 315 320
Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser
325 330 335
Tyr Gln Leu Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser
340 345 350
Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile
355 360 365
Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu
370 375 380
Gly Gly Met Val Val Thr Asp Asn Tyr Asn Ser Val Thr Pro Gln Asn
385 390 395 400
Gly Gly Ser Gly Asn Thr Trp Gln Ala Asp Asn Thr Thr Phe Ser Gln
405 410 415
Arg Gly Ala Gln Ile Gly Ser Gly Asn Met Phe Ala Leu Glu Ile Asn
420 425 430
Leu Gln Ala Asn Leu Trp Arg Gly Phe Leu Tyr Ser Asn Ile Gly Leu
435 440 445
Tyr Leu Pro Asp Ser Leu Lys Ile Thr Pro Asp Asn Ile Thr Leu Pro
450 455 460
Glu Asn Lys Asn Thr Tyr Gln Tyr Met Asn Gly Arg Val Thr Pro Pro
465 470 475 480
Gly Leu Ile Asp Thr Tyr Val Asn Val Gly Ala Arg Trp Ser Pro Asp
485 490 495
Val Met Asp Ser Ile Asn Pro Phe Asn His His Arg Asn Ala Gly Leu
500 505 510
Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His
515 520 525
Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu
530 535 540
Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met
545 550 555 560
Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser
565 570 575
Ile Arg Phe Asp Ser Ile Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala
580 585 590
His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn
595 600 605
Asp Gln Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro
610 615 620
Ile Pro Ala Asn Ala Thr Ser Val Pro Ile Ser Ile Pro Ser Arg Asn
625 630 635 640
Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu
645 650 655
Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly
660 665 670
Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys
675 680 685
Lys Val Ser Ile Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp
690 695 700
Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg Ser Val Asp Gly
705 710 715 720
Glu Gly Tyr Asn Val Ala Gln Ser Asn Met Thr Lys Asp Trp Phe Leu
725 730 735
Ile Gln Met Leu Ser His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val
740 745 750
Pro Glu Asn Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln
755 760 765
Pro Met Ser Arg Gln Val Val Asp Thr Val Thr Tyr Thr Asp Tyr Lys
770 775 780
Asp Val Lys Leu Pro Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr
785 790 795 800
Met Gly Pro Thr Met Arg Glu Gly Gln Ala Tyr Pro Ala Asn Tyr Pro
805 810 815
Tyr Pro Leu Ile Gly Glu Thr Ala Val Pro Ser Leu Thr Gln Lys Lys
820 825 830
Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe
835 840 845
Met Ser Met Gly Ser Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala
850 855 860
Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met Asp
865 870 875 880
Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu Val Phe Asp Val Val Arg
885 890 895
Ile His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr
900 905 910
Pro Phe Ser Ala Gly Asn Ala Thr Thr
915 920
<210>32
<211>347
<212>PRT
<213>猿猴腺病毒SV-25
<400>32
Met Lys Ile Cys Val Val Ile Phe Ala Leu Ser Leu Ile Lys Thr Glu
1 5 10 15
Leu Phe Ala Val Pro Ser Thr Pro Arg Val Val Ser Pro Cys Glu Lys
20 25 30
Thr Pro Gly Val Leu Asn Leu His Ile Ala Lys Pro Leu Tyr Phe Thr
35 40 45
Ile Glu Lys Gln Leu Ala Leu Ser Ile Gly Lys Gly Leu Thr Ile Ser
50 55 60
Ala Thr Gly Gln Leu Glu Ser Thr Ala Ser Val Gln Asp Ser Ala Thr
65 70 75 80
Pro Pro Leu Arg Gly Ile Ser Pro Leu Lys Leu Thr Asp Asn Gly Leu
85 90 95
Thr Leu Ser Tyr Ser Asp Pro Leu Arg Val Val Gly Asp Gln Leu Thr
100 105 110
Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Gly Ser Leu Thr Phe
115 120 125
Asn Tyr Thr Ser Pro Met Thr Leu Ile Asn Asn Ser Leu Ala Ile Asn
130 135 140
Val Asn Thr Ser Lys Gly Leu Ser Ser Asp Asn Gly Thr Leu Ala Val
145 150 155 160
Asn Val Thr Pro Asp Phe Arg Phe Asn Ser Ser Gly Ala Leu Thr Phe
165 170 175
Gly Ile Gln Ser Leu Trp Thr Phe Pro Thr Lys Thr Pro Asn Cys Thr
180 185 190
Val Phe Thr Glu Ser Asp Ser Leu Leu Ser Leu Cys Leu Thr Lys Cys
195 200 205
Gly Ala His Val Leu Gly Ser Val Ser Leu Ser Gly Val Ala Gly Thr
210 215 220
Met Leu Lys Met Thr His Thr Ser Val Thr Val Gln Phe Ser Phe Asp
225 230 235 240
Asp Ser Gly Lys Leu Ile Phe Ser Pro Leu Ala Asn Asn Thr Trp Gly
245 250 255
Val Arg Gln Ser Glu Ser Pro Leu Pro Asn Pro Ser Phe Asn Ala Leu
260 265 270
Thr Phe Met Pro Asn Ser Thr Ile Tyr Ser Arg Gly Ala Ser Asn Glu
275 280 285
Pro Gln Asn Asn Tyr Tyr Val Gln Thr Tyr Leu Arg Gly Asn Val Arg
290 295 300
Lys Pro Ile Leu Leu Thr Val Thr Tyr Asn Ser Val Asn Ser Gly Tyr
305 310 315 320
Ser Leu Thr Phe Lys Trp Asp Ala Val Ala Asn Glu Lys Phe Ala Thr
325 330 335
Pro Thr Ser Ser Phe Cys Tyr Val Ala Glu Gln
340 345
<210>33
<211>559
<212>PRT
<213>猿猴腺病毒SV-25
<400>33
Met Lys Arg Ala Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr
1 5 10 15
Asp Pro Pro Tyr Ala Pro Val Met Pro Phe Ile Thr Pro Pro Phe Thr
20 25 30
Ser Ser Asp Gly Leu Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Asn
35 40 45
Tyr Arg Asp Pro Ile Thr Thr Gln Asn Gly Ser Leu Thr Leu Lys Leu
50 55 60
Gly Asn Gly Leu Thr Leu Asn Asn Gln Gly Gln Leu Thr Ser Thr Ala
65 70 75 80
Gly Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu
85 90 95
Ala Tyr Ser Glu Pro Leu Ala Val Lys Ser Asn Arg Leu Thr Leu Ser
100 105 110
His Thr Ala Pro Leu Val Ile Ala Asn Asn Ser Leu Ala Leu Gln Val
115 120 125
Ser Glu Pro Ile Phe Val Asn Asp Asp Asp Lys Leu Ala Leu Gln Thr
130 135 140
Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gln Ser Ala
145 150 155 160
Ala Pro Leu Gly Leu Val Glu Asn Thr Leu Lys Leu Leu Phe Ser Lys
165 170 175
Pro Leu Tyr Leu Gln Asn Asp Phe Leu Ala Leu Ala Ile Glu Arg Pro
180 185 190
Leu Ala Val Ala Ala Ala Gly Thr Leu Thr Leu Gln Leu Thr Pro Pro
195 200 205
Leu Lys Thr Asn Asp Asp Gly Leu Thr Leu Ser Thr Val Glu Pro Leu
210 215 220
Thr Val Lys Asn Gly Asn Leu Gly Leu Gln Ile Ser Arg Pro Leu Val
225 230 235 240
Val Gln Asn Asn Gly Leu Ser Leu Ala Ile Thr Pro Pro Leu Arg Leu
245 250 255
Phe Asn Ser Asp Pro Val Leu Gly Leu Gly Phe Thr Phe Pro Leu Ala
260 265 270
Val Thr Asn Asn Leu Leu Ser Leu Asn Met Gly Asp Gly Val Lys Leu
275 280 285
Thr Tyr Asn Lys Leu Thr Ala Asn Leu Gly Arg Asp Leu Gln Phe Glu
290 295 300
Asn Gly Ala Ile Ala Val Thr Leu Thr Ala Glu Leu Pro Leu Gln Tyr
305 310 315 320
Thr Asn Lys Leu Gln Leu Asn Ile Gly Ala Gly Leu Arg Tyr Asn Gly
325 330 335
Ala Ser Arg Lys Leu Asp Val Asn Ile Asn Gln Asn Lys Gly Leu Thr
340 345 350
Trp Asp Asn Asp Ala Val Ile Pro Lys Leu Gly Ser Gly Leu Gln Phe
355 360 365
Asp Pro Asn Gly Asn Ile Ala Val Ile Pro Glu Thr Val Lys Pro Gln
370 375 380
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser Val Tyr Gln
385 390 395 400
Asp Leu Asp Ala Arg Leu Trp Leu Ala Leu Val Lys Ser Gly Asp Met
405 410 415
Val His Gly Ser Ile Ala Leu Lys Ala Leu Lys Gly Thr Leu Leu Asn
420 425 430
Pro Thr Ala Ser Tyr Ile Ser Ile Val Ile Tyr Phe Tyr Ser Asn Gly
435 440 445
Val Arg Arg Thr Asn Tyr Pro Thr Phe Asp Asn Glu Gly Thr Leu Ala
450 455 460
Asn Ser Ala Thr Trp Gly Tyr Arg Gln Gly Gln Ser Ala Asn Thr Asn
465 470 475 480
Val Thr Asn Ala Thr Glu Phe Met Pro Ser Ser Ser Arg Tyr Pro Val
485 490 495
Asn Lys Gly Asp Asn Ile Gln Asn Gln Ser Phe Ser Tyr Thr Cys Ile
500 505 510
Lys Gly Asp Phe Ala Met Pro Val Pro Phe Arg Val Thr Tyr Asn His
515 520 525
Ala Leu Glu Gly Tyr Ser Leu Lys Phe Thr Trp Arg Val Val Ala Asn
530 535 540
Gln Ala Phe Asp Ile Pro Cys Cys Ser Phe Ser Tyr Ile Thr Glu
545 550 555
<210>34
<211>34115
<212>DNA
<213>猿猴腺病毒SV-39
<220>
<221>CDS
<222>(13448)..(14959)
<223>L2五邻体
<220>
<221>CDS
<222>(17785)..(20538)
<223>L3六邻体
<220>
<221>CDS
<222>(29515)..(31116)
<223>L5纤维#1
<400>34
catcatcaat ataacaccgc aagatggcga ccgagttaac atgcaaatga ggtgggcgga 60
gttacgcgac ctttgtcttg ggaacgcgga agtgggcgcg gcgggtttcg gggaggagcg 120
cggggcgggg cgggcgtgtc gcgcggcggt gacgcgccgg ggacccggaa attgagtagt 180
ttttattcat tttgcaagtt tttctgtaca ttttggcgcg aaaactgaaa cgaggaagtg 240
aaaagtgaaa aatgccgagg tagtcaccgg gtggagatct gacctttgcc gtgtggagtt 300
tacccgctga cgtgtgggtt tcggtctcta ttttttcact gtggttttcc gggtacggtc 360
aaaggtcccc attttatgac tccacgtcag ctgatcgcta gggtatttaa tgcgcctcag 420
accgtcaaga ggccactctt gagtgccggc gagaagagtt ttctcctccg cgttccgcca 480
actgtgaaaa aatgaggaac ttcttgctat ctccggggct gccagcgacc gtagccgccg 540
agctgttgga ggacattgtt accggagctc tgggagacga tcctcaggtg atttctcact 600
tttgtgaaga ttttagtctt catgatctct atgatattga tccgggtgtt gaggggcaag 660
aggatgaatg gctggagtct gtggatgggt tttttccgga cgctatgctg ctagaggctg 720
atttgccacc acctcacaac tctcacactg agcccgagtc agctgctatt cctgaattgt 780
catcaggtga acttgacttg gcttgttacg agactatgcc tccggagtcg gatgaggagg 840
acagcgggat cagcgatccc acggctttta tggtctctaa ggcgattgct atactaaaag 900
aagatgatga tggcgatgat ggatttcgac tggacgctcc ggcggtgccg gggagagact 960
gtaagtcctg tgaataccac cgggatcgta ccggagaccc gtctatgttg tgttctctgt 1020
gttatctccg tcttaacgct gcttttgtct acagtaagtg ttttgtgctt ttttaccctg 1080
tggctttgtt gagtttattt ttttctgtgt ctcatagggt gttgtttatt ataggtcctg 1140
tttcagatgt ggaggaacct gatagtacta ctggaaatga ggaggaaaag ccctccccgc 1200
cgaaactaac tcagcgctgc agacctaata ttttgagacc ctcggcccag cgtgtgtcat 1260
cccggaaacg tgctgctgtt aattgcatag aagatttatt ggaagagccc actgaacctt 1320
tggacttgtc cttaaagcga ccccgcccgc agtagggcgc ggtgccagtt ttttctctct 1380
agcttccggg tgactcagtg caataaaaat tttcttggca acaggtgtat gtgtttactt 1440
tacgggcggg aagggattag gggagtataa agctggaggg gaaaaatctg aggctgtcag 1500
atcgagtgag aagttccatg gacttgtacg agagcctaga gaatctaagt tctttgcgac 1560
gtttgctgga ggaggcctcc gacagaacct cttacatttg gaggtttctg ttcggttccc 1620
ctctgagtcg ctttttgcac cgggtgaagc gagagcacct gacggaattt gatgggcttt 1680
tagagcagct gcctggactg tttgattctt tgaatctcgg ccaccggacg ctgctagagg 1740
agaggctttt tccacaattg gacttttcct ctccaggccg tctgtgttca gcgcttgctt 1800
ttgctgtaca tctgttggac agatggaacg agcagacgca gctcagcccg ggttacactc 1860
tggacttcct gacgctatgc ctatggaagt tcggaatcag gagggggagg aagctgtacg 1920
ggcgcttggt ggagaggcat ccgtctctgc gccagcagcg tctgcaagct caagtgctgc 1980
tgaggcggga ggatctggaa gccatttcgg aggaggagag cggcatggaa gagaagaatc 2040
cgagagcggg gctggaccct ccggcggagg agtagggggg ataccggacc cttttcctga 2100
gttggctttg ggggcggtgg ggggcgcttc tgtggtacgt gaggatgaag aggggcgcca 2160
acgcggtcag aagagggagc attttgagtc ctcgactttc ttggctgatg taaccgtggc 2220
cctgatggcg aaaaacaggc tggaggtggt gtggtacccg gaagtatggg aggactttga 2280
gaagggggac ttgcacctgc tggaaaaata taactttgag caggtgaaaa catactggat 2340
gaacccggat gaggactggg aggtggtttt gaaccgatac ggcaaggtag ctctgcgtcc 2400
cgactgtcgc taccaggttc gcgacaaggt ggtcctgcga cgcaacgtgt acctgttggg 2460
caacggcgcc accgtggaga tggtggaccc cagaaggggt ggttttgtgg ccaatatgca 2520
agaaatgtgc cctggggtgg tgggcttgtc tggggtgact tttcatagtg tgaggtttag 2580
cggtagcaat tttgggggtg tggttattac cgcgaacact cctgtggtcc tgcataattg 2640
ctactttttt ggcttcagca acacctgtgt ggaaatgagg gtgggaggca aagtgcgcgg 2700
gtgttccttt tacgcttgct ggaagggggt ggtgagccag ggtaaggcta aagtgtctgt 2760
tcacaagtgt atgttggaga gatgcacctt gggcatttcc agtgagggct tcctccacgc 2820
cagcgacaac gtggcttctg acaacggctg cgcctttctt atcaagggag ggggtcgcat 2880
ctgtcacaac atgatatgcg gccctgggga tgtcccccca aagccttacc agatggttac 2940
ctgcacagat ggcaaggtgc gcatgctcaa gcctgtgcac attgtgggcc accggcgcca 3000
ccgctggcca gagtttgaac acaatgtgat gacccgctgt agcttgtacc tgggaggcag 3060
gcgaggagtt ttcttgccca gacagtgtaa cctggcccac tgcaacgtga tcatggaaca 3120
atccgccgct acccaggttt gctttggagg aatatttgat ataagcatgg tggtgtataa 3180
gatcctgcgc tacgacgact gtcgggctcg tactcgaacc tgcgactgcg gagcctctca 3240
cctgtgtaac ctgactgtga tggggatggt gactgaggag gtgcgactgg accactgtca 3300
gcactcttgc ctgcgggagg agttttcttc ctcggacgag gaggactagg taggtggttg 3360
gggcgtggcc agcgagaggg tgggctataa aggggaggtg tcggctgacg ctgtcttctg 3420
tttttcaggt accatgagcg gatcaagcag ccagaccgcg ctgagcttcg acggggccgt 3480
gtacagcccc tttctgacgg ggcgcttgcc tgcctgggcc ggagtgcgtc agaatgttac 3540
cggttcgacc gtggacggac gtcccgtgga tccatctaac gctgcttcta tgcgctacgc 3600
tactatcagc acatctactc tggacagcgc cgctgccgcc gcagccgcca cctcagccgc 3660
tctctccgcc gccaagatca tggctattaa cccaagcctt tacagccctg tatccgtgga 3720
cacctcagcc ctggagcttt accggcgaga tctagctcaa gtggtggacc aactcgcagc 3780
cgtgagccaa cagttgcagc tggtgtcgac ccgagtggag caactttccc gccctcccca 3840
gtaaccgcaa aaattcaata aacagaattt aataaacagc acttgagaaa agtttaaact 3900
tgtggttgac tttattcctg gatagctggg gggagggaac ggcgggaacg gtaagacctg 3960
gtccatcgtt cccggtcgtt gagaacacgg tggatttttt ccaagacccg atagaggtgg 4020
gtctgaacgt tgagatacat gggcatgagc ccgtctcggg ggtggaggta ggcccactgc 4080
agggcctcgt tttcaggggt ggtgttgtaa atgatccagt cgtaggcccc ccgctgggcg 4140
tggtgctgga agatgtcctt cagcagcaag ctgatggcaa cgggaagacc cttggtgtag 4200
gtgttgacaa agcggttgag ttgggagggg tgcatgcggg gactgatgag gtgcattttg 4260
gcctggatct tgaggttggc tatgttgccg cccagatcgc gcctgggatt catgttatgc 4320
aagaccacca gcaccgagta accggtgcag cgggggaatt tgtcgtgcag cttggaaggg 4380
aaagcgtgga agaatttgga gacccctcgg tgcccgccta ggttttccat gcactcatcc 4440
atgatgatgg cgatgggccc ccgggaggca gcctgggcaa aaacgttgcg ggggtccgtg 4500
acatcgtagt tgtggtcctg ggtgagttca tcataggaca ttttgacaaa gcgcgggcag 4560
agggtcccag actggggaat gatggttcca tccggtccgg gggcgtagtt gccctcgcag 4620
atttgcattt cccaggcttt gatttcagag ggagggatca tgtcaacctg gggggcgatg 4680
aaaaaaatgg tctctggggc gggggtgatg agctgggtgg aaagcaggtt gcgcaagagc 4740
tgtgacttgc cgcagccggt gggcccgtag atgacagcta tgacgggttg cagggtgtag 4800
tttagagagc tacaactgcc atcatccttc aaaagcgggg ccacactgtt taaaagttct 4860
ctaacatgta agttttcccg cactaagtcc tgcaggagac gtgaccctcc tagggagaga 4920
agttcaggaa gcgaagcaaa gtttttaagt ggcttgaggc catcggccaa gggcaagttc 4980
ctgagagttt gactgagcag ttccagccgg tcccagagct cggttacgtg ctctacggca 5040
tctcgatcca gcagacctcc tcgtttcggg ggttggggcg gctctggctg tagggaatga 5100
ggcggtgggc gtccagctgg gccatggtgc ggtccctcca tgggcgcagg gttctcttca 5160
gggtggtctc ggtcacggtg aatgggtggg ccccgggctg ggcgctggcc agggtgcgct 5220
tgaggctgag gcggctggtg gcgaaccgtt gcttttcgtc tccctgcaag tcagccaaat 5280
agcaacggac catgagctca tagtccaggc tctctgcggc atgtcctttg gcgcgaagct 5340
tgcctttgga aacgtgcccg cagtttgagc agagcaagca ttttagcgcg tagagttttg 5400
gcgccaagaa cacggattcc ggggaataag catccccacc gcagttggag caaacggttt 5460
cgcattccac cagccaggtc agctgaggat cttttgggtc aaaaaccaag cgcccgccgt 5520
tttttttgat gcgcttccta cctcgggtct ccatgaggcg gtgcccgcgt tcggtgacga 5580
agaggctgtc ggtgtctccg tagacggagg tcagggcgcg ctcctccagg ggggtcccgc 5640
ggtcctcggc gtagagaaac tcgcaccact ctgacataaa cgcccgggtc caggctagga 5700
cgaatgaggc gatgtgggaa gggtaccggt cgttatcgat gagggggtcg gttttttcca 5760
aggtgtgcag gcacatgtcc ccctcgtccg cttccaaaaa tgtgattggc ttgtaggtgt 5820
aagtcacgtg atcctgtcct tccgcggggg tataaaaggg ggcgtttccc ccctcctcgt 5880
cactctcttc cggttcgctg tcgccaaagg ccagctgttg gggtacgtaa acgcgggtga 5940
aggcgggcat gacctgtgcg ctgaggttgt cagtttctat atacgaggaa gatttgatgg 6000
cgagcgcccc cgtggagatg cccttgaggt gctcggggcc catttggtca gaaaacacaa 6060
tctgtcggtt atcaagcttg gtggcaaaag acccgtagag ggcgttggag agcaacttgg 6120
cgatggagcg ctgggtttgg tttttttccc ggtcggcttt ttccttggcc gcgatgttga 6180
gctggacgta ctccctggcc acgcacttcc agccgggaaa aacggccgtg cgctcgtccg 6240
gcaccagcct cacgctccat ccgcggttgt gcagggtgat gacgtcgatg ctggtggcca 6300
cctctccgcg caggggctcg ttggtccagc agaggcgacc gcccttgcga gagcagaagg 6360
ggggcagggg gtcaagcagg cgctcgtccg gggggtcggc gtcgatggta aagatggcgg 6420
gcagcaggtg tttgtcaaag taatcgatct gatgcccggg gcaacgcagg gcggtttccc 6480
agtcccgcac cgccaaggcg cgctcgtatg gactgagggg ggcgccccag ggcatgggat 6540
gcgtcagggc cgaggcgtac atgccgcaga tgtcatagac gtaaaggggc tcctccagga 6600
cgccgaggta ggtggggtag cagcgccccc cgcggatgct ggcccgtacg tagtcgtaga 6660
gctcgtgcga gggggccaga aggtggcggc tgaggtgagc gcgctggggc ttttcatctc 6720
ggaagaggat ctgcctgaag atggcgtggg agttggagga gatggtgggc cgctgaaaaa 6780
tgttgaagcg ggcgtcgggc agacccacgg cctcgccgat aaagtgggcg taggactctt 6840
gcagcttttc caccagggag gcggtgacca gcacgtccag agcgcagtag tccagggttt 6900
cccgcacgat gtcataatgc tcttcctttt tttccttcca gaggtctcgg ttgaagagat 6960
actcttcgcg gtctttccag tactcttgga gaggaaaccc gttttcgtct ccacggtaag 7020
agcccaacat gtaaaactgg ttgacggcct gatagggaca gcatcccttc tccacgggca 7080
gcgagtaggc cagggcggcc ttgcgcaggg aggtgtgagt cagggcaaag gtgtcgcgga 7140
ccataacttt tacaaactgg tacttaaagt cccggtcgtc gcacatgcct cgctcccagt 7200
ctgagtagtc tgtgcgcttt ttgtgcttgg ggttaggcag ggagtaggtg acgtcgttaa 7260
agaggatttt gccacatctg ggcataaagt tgcgagagat tctgaagggg ccgggcacct 7320
ccgagcggtt gttgatgact tgggcagcca ggagaatttc gtcgaagccg ttgatgttgt 7380
gccccacgac gtagaactct atgaaacgcg gagcgccgcg cagcaggggg cacttttcaa 7440
gttgctggaa agtaagttcc cgcggctcga cgccgtgttc cgtgcggctc cagtcctcca 7500
ccgggtttcg ctccacaaaa tcctgccaga tgtggtcgac tagcaagagc tgcagtcggt 7560
cgcgaaattc gcggaatttt ctgccgatgg cttgcttctg ggggttcaag caaaaaaagg 7620
tgtctgcgtg gtcgcgccag gcgtcccagc cgagctcgcg agccagattc agggccagca 7680
gcaccagagc cggctcaccg gtgattttca tgacgaggag aaagggcacc agctgttttc 7740
cgaacgcgcc catccaggtg taggtctcca cgtcgtaggt gagaaacaga cgttcggtcc 7800
gcgggtgcga tcccaggggg aaaaacttga tgggctgcca ccattgggag ctctgggcgt 7860
ggatgtgatg gaagtaaaag tcccggcggc gcgtggaaca ttcgtgctgg tttttgtaaa 7920
agcggccgca gtggtcgcag cgcgagacgg agtgaaggct gtgaatcagg tgaatcttgc 7980
gtcgctgagg gggccccaga gccaaaaagc ggagcgggaa cgaccgcgcg gccacttcgg 8040
cgtccgcagg caagatggat gagggttcca ccgttccccg cccgcggacc gaccagactt 8100
ccgccagctg cggcttcagt tcttgcacca gctctcgcag cgtttcgtcg ctgggcgaat 8160
cgtgaatacg gaagttgtcg ggtagaggcg ggaggcggtg gacttccagg aggtgtgtga 8220
gggccggcag gagatgcagg tggtacttga tttcccacgg atgacggtcg cgggcgtcca 8280
aggcgaagag atgaccgtgg ggccgcggcg ccaccagcgt tccgcggggg gtctttatcg 8340
gcggcgggga cgggctcccg gcggcagcgg cggctcggga cccgcgggca agtcgggcag 8400
cggcacgtcg gcgtggagct cgggcagggg ctggtgctgc gcgcggagct gactggcaaa 8460
ggctatcacc cggcgattga cgtcctggat ccggcggcgc tgcgtgaaga ccaccggacc 8520
cgtggtcttg aacctgaaag agagttcgac agaatcaatc tcggcatcgt taaccgcggc 8580
ctggcgcagg atttcggcca cgtccccgga gttgtcttga tacgcgattt ctgccatgaa 8640
ctggtcgatt tcctcttcct gcaagtctcc gtgaccggcg cgttcgacgg tggccgcgag 8700
atcgttggag atgcggccca tgagctggga aaaggcattg atgccgacct cgttccacac 8760
tcggctgtac accacctctc cgtgaacgtc gcgggcgcgc atcaccacct gggcgagatt 8820
gagttccacg tggcgggcga aaaccggata gtttcggagg cgctgataca gatagttgag 8880
ggtggtggcg gcgtgctcgg ccacaaaaaa atacatgatc cagcggcgga gggtcagctc 8940
gttgatgtcg cccagcgcct ccaggcgttc catggcctcg taaaagtcca cggcaaagtt 9000
gaaaaattgg ctgttcctgg ccgagaccgt gagctcttct tccaagagcc gaatgagatc 9060
cgccacggtg gccctgactt cgcgttcgaa agccccgggt gcctcctcca cctcttcctc 9120
ctcgacttct tcgaccgctt cgggcacctc ctcttcctcg accaccacct caggcggggc 9180
tcggcggcgc cggcggcgga cgggcaggcg gtcgacgaaa cgctcgatca tttcccccct 9240
ccgtcgacgc atggtctcgg tgacggcgcg accctgttcg cgaggacgca gggtgaaggc 9300
gccgccgccg agcggaggta acagggagat cggggggcgg tcgtggggga gactgacggc 9360
gctaactatg catctgatca atgtttgcgt agtgacctcg ggtcggagcg agctcagcgc 9420
ttgaaaatcc acgggatcgg aaaaccgttc caggaacgcg tctagccaat cacagtcgca 9480
aggtaagctg aggaccgtct cgggggcttg tctgttctgt cttcccgcgg tggtgctgct 9540
gatgaggtag ttgaagtagg cgctcttgag gcggcggatg gtggacagga gaaccacgtc 9600
tttgcgccca gcttgctgta tccgcaggcg gtcggccatg ccccacactt ctccttgaca 9660
gcggcggagg tccttgtagt attcttgcat cagcctttcc acgggcacct cgtcttcttc 9720
ttccgctcgg ccggacgaga gccgcgtcag gccgtacccg cgctgcccct gtggttggag 9780
cagggccagg tcggccacga cgcgctcggc cagcacggcc tgctggatgc gggtgagggt 9840
gtcctgaaag tcgtcgagat ccacaaagcg gtggtacgcg ccagtgttga tggtgtaggt 9900
gcagttgctc atgacggacc agtttacggt ctgggtgcca tggcccacgg tttccaggta 9960
gcggagacgc gagtaggccc gcgtctcgaa gatgtagtcg ttgcaggtcc gcagcaggta 10020
ctggtagccc accagcagat gcggcggcgg ctggcggtag aggggccacc gctgggtggc 10080
gggggcgttg ggggcgagat cttccaacat gaggcggtga tagccgtaga tgtagcgcga 10140
catccaagtg atgccgctgg ccgtggtgct ggcgcgggcg tagtcgcgaa cgcggttcca 10200
gatgtttcgc agcggctgga agtactcgat ggtggggcga ctctgccccg tgaggcgggc 10260
gcagtcggcg atgctctacg gggaaaaaga agggccagtg aacaaccgcc ttccgtagcc 10320
ggaggagaac gcaagggggt caaagaccac cgaggctcgg gttcgaaacc cgggtggcgg 10380
cccgaatacg gagggcggtt ttttgctttt ttctcagatg catcccgtgc tgcggcagat 10440
gcgtccgaac gcggggtccc agtccccggc ggtgcctgcg gccgtgacgg cggcttctac 10500
ggccacgtcg cgctccaccc cgcctaccac ggcccaggcg gcggtggctc tgcgcggcgc 10560
aggggaaccc gaagcagagg cggtgttgga cgtggaggag ggccaggggt tggctcggct 10620
gggggccctg agtcccgagc ggcacccgcg cgtggctctg aagcgcgacg cggcggaggc 10680
gtacgtgccg cggagcaatc tgtttcgcga ccgcagcggc gaggaggccg aggagatgcg 10740
agacttgcgt tttcgggcgg ggagggagtt gcgtcacggg ctggaccggc agagggttct 10800
gagagaggag gactttgagg cggacgagcg cacgggggtg agtcccgcgc gggctcacgt 10860
ggcggccgcc aacctggtga gcgcgtacga gcagacggtc aaggaggaga tgaacttcca 10920
gaagagcttc aatcatcacg tgcgcacgct gattgcgcgc gaagaggtgg ccatcggcct 10980
catgcatctg tgggattttg tggaggcgta cgttcagaac cccagcagca agccgctgac 11040
ggctcagctg ttcctcatcg tgcaacatag tcgagacaac gaaacgttca gggaggccat 11100
gctgaacatt gcagagcctg aggggcgctg gctcttggat ctcattaaca tcttgcagag 11160
tatcgtagtg caggagcgct cgctgagcct ggccgacaag gtggctgcca tcaactacag 11220
catgctgtcg ctgggcaaat tttacgcccg caagatctac aagtctccgt tcgtccccat 11280
agacaaggag gtgaagatag acagctttta catgcgcatg gcgctcaagg tgctgactct 11340
aagcgacgac ctgggggtgt accgcaacga ccgcatacac aaggcggtga gcgccagccg 11400
ccggcgcgag ctgagcgacc gcgagctttt gcacagcctg catcgggcgt tgactggtgc 11460
cggcagcgcc gaggcggccg agtactttga cgccggagcg gacttgcgct ggcagccatc 11520
ccgacgcgcg ctggaggcgg ctggcgtcgg ggagtacggg gtcgaggacg acgatgaagc 11580
ggacgacgag ttgggcattg acttgtagcc gtttttcgtt agatatgtcg gcgaacgagc 11640
cgtctgcggc cgccatggtg acggcggcgg gcgcgcccca ggacccggcc acgcgcgcgg 11700
cgctgcagag tcagccttcc ggagtgacgc ccgcggacga ctggtccgag gccatgcgtc 11760
gcatcctggc gctgacggcg cgcaaccccg aggcttttcg gcagcagccg caggcaaacc 11820
ggtttgcggc cattttggaa gcggtggtgc cctccagacc caaccccacc cacgaaaagg 11880
tgctggccat cgtcaacgcc ctggcggaga ccaaggccat ccgcccagac gaggccgggc 11940
aggtttacaa cgcgctgcta gaaagggtgg gacgctacaa cagctccaac gtgcagacca 12000
atctggaccg cttggtgacg gacgtgaagg aggccgtagc ccagcgagag cggtttttca 12060
aggaagccaa tctgggctcg ctggtggccc tcaacgcctt cctgagcacg ctgccggcga 12120
acgtgccccg cggtcaggag gactacgtga actttctgag cgccctccgc ctgatggtgg 12180
ccgaggtgcc gcagagcgag gtgtaccagt ctggccccaa ctactacttc cagacctccc 12240
ggcagggcct gcagacggta aacctgacgc aggcctttca gaacctgcag ggcctttggg 12300
gggtgcgcgc tccgctgggc gaccgcagca cggtgtccag cctgctgacc cccaatgccc 12360
ggctgctctt gcttctcatt gctccgttca ccgacagcgg ttccatcagc cgcgactctt 12420
acctgggaca cctgctcacc ctgtaccggg aggccatcgg gcaggcgcgg gtggacgagc 12480
agacgtacca ggaaatcacc agcgtgagcc gcgcgctggg gcaggaggac acgggcagct 12540
tggaggcgac tctgaacttc ctgctgacca accggcggca gcgcctacct ccccagtacg 12600
cgctgaacgc ggaggaggag cgcatcctgc gtttcgtgca gcagagcacc gcgctgtact 12660
tgatgcggga aggcgcctct cccagcgctt cgctggacat gacggcggcc aacatggagc 12720
catcgttcta cgccgccaac cgtcccttcg tcaaccggct aatggactat ttgcatcggg 12780
cggcggccct gaacccggaa tactttacta acgtcatcct gaacgaccgt tggctgccac 12840
ctcccggctt ctacacgggg gagttcgacc tcccggaggc caacgacggt ttcatgtggg 12900
acgacgtgga cagcgtgttc ctgcccggca agaaggaggc gggtgactct cagagccacc 12960
gcgcgagcct cgcagacctg ggggcgaccg ggcccgcgtc tccgctgcct cgcctgccga 13020
gcgccagcag cgccagcgtg gggcgggtga gccgtccgcg cctcagcggt gaggaggact 13080
ggtggaacga tccgctgctc cgtccggccc gcaacaaaaa cttccccaac aacgggatag 13140
aggatttggt agacaaaatg aaccgttgga agacgtatgc ccaggagcat cgggagtggc 13200
aggcgaggca acccatgggc cctgttctgc cgccctctcg gcgcccgcgc agggacgaag 13260
acgccgacga ttcagccgat gacagcagcg tgttggatct gggcgggagc gggaacccct 13320
ttgcccacct gcaacctcgc ggcgtgggtc ggcggtggcg ctaggaaaaa aaattattaa 13380
aagcacttac cagagccatg gtaagaagag caacaaaggt gtgtcctgct ttcttcccgg 13440
tagcaaa atg cgt cgg gcg gtg gca gtt ccc tcc gcg gca atg gcg tta 13489
Met Arg Arg Ala Val Ala Val Pro Ser Ala Ala Met Ala Leu
1 5 10
ggc ccg ccc cct tct tac gaa agc gtg atg gca gcg gcc acc ctg caa 13537
Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Ala Ala Ala Thr Leu Gln
15 20 25 30
gcg ccg ttg gag aat cct tac gtg ccg ccg cga tac ctg gag cct acg 13585
Ala Pro Leu Glu Asn Pro Tyr Val Pro Pro Arg Tyr Leu Glu Pro Thr
35 40 45
ggc ggg aga aac agc att cgt tac tcg gag ctg acg ccc ctg tac gac 13633
Gly Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Thr Pro Leu Tyr Asp
50 55 60
acc acc cgc ctg tac ctg gtg gac aac aag tca gca gat atc gcc acc 13681
Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Thr
65 70 75
ttg aac tac cag aac gac cac agc aac ttt ctc acg tcc gtg gtg cag 13729
Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Ser Val Val Gln
80 85 90
aac agc gac tac acg ccc gcc gaa gcg agc acg cag acc att aac ttg 13777
Asn Ser Asp Tyr Thr Pro Ala Glu Ala Ser Thr Gln Thr Ile Asn Leu
95 100 105 110
gac gac cgc tcg cgc tgg ggc ggg gac ttg aaa acc att ctg cac act 13825
Asp Asp Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr Ile Leu His Thr
115 120 125
aac atg ccc aac gtg aac gag ttc atg ttt acc aac tcg ttc agg gct 13873
Asn Met Pro Asn Val Asn Glu Phe Met Phe Thr Asn Ser Phe Arg Ala
130 135 140
aaa ctt atg gtg gcg cac gag gcc gac aag gac ccg gtt tat gag tgg 13921
Lys Leu Met Val Ala His Glu Ala Asp Lys Asp Pro Val Tyr Glu Trp
145 150 155
gtg cag ctg acg ctg ccg gag ggg aac ttt tca gag att atg acc ata 13969
Val Gln Leu Thr Leu Pro Glu Gly Asn Phe Ser Glu Ile Met Thr Ile
160 165 170
gac ctg atg aac aac gcc att atc gac cac tac ctg gcg gta gcc aga 14017
Asp Leu Met Asn Asn Ala Ile Ile Asp His Tyr Leu Ala Val Ala Arg
175 180 185 190
cag cag ggg gtg aaa gaa agc gag atc ggc gtc aag ttt gac acg cgc 14065
Gln Gln Gly Val Lys Glu Ser Glu Ile Gly Val Lys Phe Asp Thr Arg
195 200 205
aac ttt cgt ctg ggc tgg gac ccg gag acg ggg ctt gtg atg ccg ggg 14113
Asn Phe Arg Leu Gly Trp Asp Pro Glu Thr Gly Leu Val Met Pro Gly
210 215 220
gtg tac acg aac gaa gct ttc cat ccc gac gtg gtc ctc ttg ccg ggc 14161
Val Tyr Thr Asn Glu Ala Phe His Pro Asp Val Val Leu Leu Pro Gly
225 230 235
tgc ggg gtg gac ttt acc tac agc cgg tta aac aac ctg cta ggc ata 14209
Cys Gly Val Asp Phe Thr Tyr Ser Arg Leu Asn Asn Leu Leu Gly Ile
240 245 250
cgc aag aga atg ccc ttt cag gaa ggg ttt cag atc ctg tac gag gac 14257
Arg Lys Arg Met Pro Phe Gln Glu Gly Phe Gln Ile Leu Tyr Glu Asp
255 260 265 270
ctg gag ggc ggt aac atc ccg gcc ctg ctg gac gtg ccg gcg tac gag 14305
Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Pro Ala Tyr Glu
275 280 285
gag agc atc gcc aac gca agg gag gcg gcg atc agg ggc gat aat ttc 14353
Glu Ser Ile Ala Asn Ala Arg Glu Ala Ala Ile Arg Gly Asp Asn Phe
290 295 300
gcg gcg cag ccc cag gcg gct cca acc ata aaa ccc gtt ttg gaa gac 14401
Ala Ala Gln Pro Gln Ala Ala Pro Thr Ile Lys Pro Val Leu Glu Asp
305 310 315
tcc aaa ggg cgg agc tac aac gta ata gcc aac acc aac aac acg gct 14449
Ser Lys Gly Arg Ser Tyr Asn Val Ile Ala Asn Thr Asn Asn Thr Ala
320 325 330
tac agg agc tgg tat ctg gct tat aac tac ggc gac ccg gag aag ggg 14497
Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly
335 340 345 350
gtt agg gcc tgg acc ctg ctc acc act ccg gac gtg acg tgc ggt tca 14545
Val Arg Ala Trp Thr Leu Leu Thr Thr Pro Asp Val Thr Cys Gly Ser
355 360 365
gag cag gtc tac tgg tcg ctg cct gac atg tac gtg gac cct gtg acg 14593
Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Tyr Val Asp Pro Val Thr
370 375 380
ttt cgc tcc acg cag caa gtt agc aac tac cca gtg gtg gga gcg gag 14641
Phe Arg Ser Thr Gln Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu
385 390 395
ctt atg ccg att cac agc aag agc ttt tac aac gag cag gcc gtc tac 14689
Leu Met Pro Ile His Ser Lys Ser Phe Tyr Asn Glu Gln Ala Val Tyr
400 405 410
tca cag ctc att cgt cag acc acc gcc cta acg cac gtt ttc aac cgc 14737
Ser Gln Leu Ile Arg Gln Thr Thr Ala Leu Thr His Val Phe Asn Arg
415 420 425 430
ttc ccc gag aac caa atc cta gtg cga cct cca gcg ccc acc atc acc 14785
Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr
435 440 445
acc gtc agc gag aac gtg ccc gct cta acc gat cac ggg acg ctg cct 14833
Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro
450 455 460
ttg cag aac agc atc cgc gga gtt cag cga gtt acc atc acg gac gcc 14881
Leu Gln Asn Ser Ile Arg Gly Val Gln Arg Val Thr Ile Thr Asp Ala
465 470 475
cgt cgt cgg acc tgt ccc tac gtc tac aaa gcc ttg gga atc gtg gcc 14929
Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala
480 485 490
ccg cgc gtc ctg tcg agt cgc act ttc tag atgtccatcc tcatctctcc 14979
Pro Arg Val Leu Ser Ser Arg Thr Phe
495 500
cagcaacaat accggttggg gtctgggcgt gaccaaaatg tacggaggcg ccaaacgacg 15039
gtccccacaa catcccgtgc gagtgcgcgg gcactttaga gccccatggg ggtcgcacac 15099
gcgcgggcgc accggccgaa ccaccgtcga cgacgtgatc gatagcgtgg tggccgacgc 15159
ccgcaactac cagcccgctc gatccacggt ggacgaagtc atcgacggcg tggtggccga 15219
cgccagggcc tacgcccgca gaaagtctcg tctgcgccgc cgccgttcgc taaagcgccc 15279
cacggccgcc atgaaagccg ctcgctctct gctgcgtcgc gcacgtatcg tgggtcgccg 15339
cgccgccaga cgcgcagccg ccaacgccgc cgccggccga gtgcgccgcc gggccgccca 15399
gcaggccgcc gccgccatct ccagtctatc cgccccccga cgcgggaatg tgtactgggt 15459
cagggactcg gccaccggcg tgcgagttcc cgtgagaacc cgtcctcctc gtccctgaat 15519
aaaaagttct aagcccaatc ggtgttccgt tgtgtgttca gctcgtcatg accaaacgca 15579
agtttaaaga ggagctgctg caagcgctgg tccccgaaat ctatgcgccg gcgccggacg 15639
tgaaaccgcg tcgcgtgaaa cgcgtgaaga agcaggaaaa gctagagaca aaagaggagg 15699
cggtggcgtt gggagacggg gaggtggagt ttgtgcgctc gttcgcgccg cgtcggcgag 15759
tgaattggaa ggggcgcaag gtgcaacggg tgctgcgtcc cggcacggtg gtgtctttca 15819
ccccgggtga aaaatccgcc tggaagggca taaagcgcgt gtacgatgag gtgtacgggg 15879
acgaagacat tctggagcag gcgctggata gaagcgggga gtttgcttac ggcaagaggg 15939
cgaggacggg cgagatcgcc atcccgctgg acacttccaa ccccaccccc agtctgaaac 15999
ccgtgacgct gcaacaggtg ttgccggtga gcgccccctc gcgacgcggc ataaaacgcg 16059
agggcggcga gctgcagccc accatgcagc tcctggttcc caagaggcag aaactagagg 16119
acgtactgga catgataaaa atggagcccg acgtgcagcc cgatattaaa atccgtccca 16179
tcaaagaagt ggcgccggga atgggcgtgc agaccgtgga catccagatt cccatgacca 16239
gcgccgcaca ggcggtagag gccatgcaga ccgacgtggg gatgatgacg gacctgcccg 16299
cagctgctgc cgccgtggcc agcgccgcga cgcaaacgga agccggcatg cagaccgacc 16359
cgtggacgga ggcgcccgtg cagccggcca gaagacgcgt cagacggacg tacggccccg 16419
tttctggcat aatgccggag tacgcgctgc atccttccat catccccacc cccggctacc 16479
gggggcgcac ctaccgtccg cgacgcagca ccactcgccg ccgtcgccgc acggcacgag 16539
tcgccaccgc cagagtgaga cgcgtaacga cacgtcgcgg ccgccgcttg accctgcccg 16599
tggtgcgcta ccatcccagc attctttaaa aaaccgctcc tacgttgcag atgggcaagc 16659
ttacttgtcg actccgtatg gccgtgcccg gctaccgagg aagatcccgc cgacgacgga 16719
ctttgggagg cagcggtttg cgccgccgtc gggcggttca ccggcgcctc aagggaggca 16779
ttctgccggc cctgatcccc ataatcgccg cagccatcgg ggccattccc ggaatcgcca 16839
gcgtagcggt gcaggctagc cagcgccact gattttacta accctgtcgg tcgcgccgtc 16899
tctttcggca gactcaacgc ccagcatgga agacatcaat ttctcctctc tggccccgcg 16959
gcacggcacg cggccgtata tggggacgtg gagcgagatc ggcacgaacc agatgaacgg 17019
gggcgctttc aattggagcg gtgtgtggag cggcttgaaa aatttcggtt ccactctgaa 17079
aacttacggc aaccgggtgt ggaactccag cacggggcag atgctgaggg acaagctaaa 17139
ggacacgcag tttcagcaaa aggtggtgga cggcatcgct tcgggcctca acggcgccgt 17199
cgacctggcc aaccaggcca ttcaaaagga aattaacagc cgcctggagc cgcggccgca 17259
ggtggaggag aacctgcccc ctctggaggc gctgcccccc aagggagaga agcgcccgcg 17319
gcccgacatg gaggagacgc tagttactaa gagcgaggag ccgccatcat acgaggaggc 17379
ggtgggtagc tcgcagctgc cgtccctcac gctgaagccc accacctatc ccatgaccaa 17439
gcccatcgcc tccatggcgc gccccgtggg agtcgacccg cccatcgacg cggtggccac 17499
tttggacctg ccgcgccccg aacccggcaa ccgcgtgcct cccgtcccca tcgctccgcc 17559
ggtttctcgc cccgccatcc gccccgtcgc cgtggccact ccccgctatc cgagccgcaa 17619
cgccaactgg cagaccaccc tcaacagtat tgtcggactg ggggtgaagt ctctgaagcg 17679
ccgtcgctgt ttttaaagca caatttatta aacgagtagc cctgtcttaa tccatcgttg 17739
tatgtgtgcc tatatcacgc gttcagagcc tgaccgtccg tcaag atg gcc act ccg 17796
Met Ala Thr Pro
505
tcg atg atg ccg cag tgg tcg tac atg cac atc gcc ggg cag gac gcc 17844
Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ala Gly Gln Asp Ala
510 515 520
tcg gag tac ctg agc ccg ggt ctg gtg cag ttt gcc cgt gcg acg gaa 17892
Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Glu
525 530 535
acc tac ttc tca ctg ggc aac aag ttc agg aac ccc acc gtg gcg ccc 17940
Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro
540 545 550 555
acc cac gac gtc acc acc gat cgg tcc cag cga ctg aca atc cgc ttc 17988
Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr Ile Arg Phe
560 565 570
gtc ccc gtg gac aag gaa gac acc gct tac tcc tac aaa acc cgc ttc 18036
Val Pro Val Asp Lys Glu Asp Thr Ala Tyr Ser Tyr Lys Thr Arg Phe
575 580 585
acg ctg gcc gtg ggc gac aac cgg gtg cta gac atg gcc agt acc tac 18084
Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr
590 595 600
ttt gac atc cgc ggc gtg atc gac cgc gga cct agc ttc aag cct tac 18132
Phe Asp Ile Arg Gly Val Ile Asp Arg Gly Pro Ser Phe Lys Pro Tyr
605 610 615
tcc ggc acg gct tac aac tca ctg gct ccc aaa ggg gcg ccc aac aac 18180
Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Asn
620 625 630 635
agc caa tgg aac gcc aca gat aac ggg aac aag cca gtg tgt ttt gct 18228
Ser Gln Trp Asn Ala Thr Asp Asn Gly Asn Lys Pro Val Cys Phe Ala
640 645 650
cag gca gct ttt ata ggt caa agc att aca aaa gac gga gtg caa ata 18276
Gln Ala Ala Phe Ile Gly Gln Ser Ile Thr Lys Asp Gly Val Gln Ile
655 660 665
cag aac tca gaa aat caa cag gct gct gcc gac aaa act tac caa cca 18324
Gln Asn Ser Glu Asn Gln Gln Ala Ala Ala Asp Lys Thr Tyr Gln Pro
670 675 680
gag cct caa att gga gtt tcc acc tgg gat acc aac gtt acc agt aac 18372
Glu Pro Gln Ile Gly Val Ser Thr Trp Asp Thr Asn Val Thr Ser Asn
685 690 695
gct gcc gga cga gtg tta aaa gcc acc act ccc atg ctg cca tgt tac 18420
Ala Ala Gly Arg Val Leu Lys Ala Thr Thr Pro Met Leu Pro Cys Tyr
700 705 710 715
ggt tca tat gcc aat ccc act aat cca aac ggg ggt cag gca aaa aca 18468
Gly Ser Tyr Ala Asn Pro Thr Asn Pro Asn Gly Gly Gln Ala Lys Thr
720 725 730
gaa gga gac att tcg cta aac ttt ttc aca aca act gcg gca gca gac 18516
Glu Gly Asp Ile Ser Leu Asn Phe Phe Thr Thr Thr Ala Ala Ala Asp
735 740 745
aat aat ccc aaa gtg gtt ctt tac agc gaa gat gta aac ctt caa gcc 18564
Asn Asn Pro Lys Val Val Leu Tyr Ser Glu Asp Val Asn Leu Gln Ala
750 755 760
ccc gat act cac tta gta tat aag cca acg gtg gga gaa aac gtt atc 18612
Pro Asp Thr His Leu Val Tyr Lys Pro Thr Val Gly Glu Asn Val Ile
765 770 775
gcc gca gaa gcc ctg cta acg cag cag gcg tgt ccc aac aga gca aac 18660
Ala Ala Glu Ala Leu Leu Thr Gln Gln Ala Cys Pro Asn Arg Ala Asn
780 785 790 795
tac ata ggt ttc cga gat aac ttt atc ggt tta atg tat tat aac agc 18708
Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser
800 805 810
aca ggg aac atg gga gtt ctg gca ggt cag gcc tcg cag tta aac gca 18756
Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala
815 820 825
gtt gta gac ctg caa gat cga aac acg gaa ctg tcc tat cag cta atg 18804
Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Met
830 835 840
cta gat gct ctg ggt gac aga act cga tat ttc tca atg tgg aat cag 18852
Leu Asp Ala Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln
845 850 855
gcc gtg gac agc tac gat cca gac gtt agg att atc gag aac cat ggg 18900
Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly
860 865 870 875
gtg gaa gac gag ctg ccc aat tac tgt ttt cca ctc cca ggc atg ggt 18948
Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Pro Gly Met Gly
880 885 890
att ttt aac tcc tac aag ggg gta aaa cca caa aat ggc ggt aat ggt 18996
Ile Phe Asn Ser Tyr Lys Gly Val Lys Pro Gln Asn Gly Gly Asn Gly
895 900 905
aac tgg gaa gca aac ggg gac cta tca aat gcc aat gag atc gct tta 19044
Asn Trp Glu Ala Asn Gly Asp Leu Ser Asn Ala Asn Glu Ile Ala Leu
910 915 920
gga aac att ttt gcc atg gaa att aac ctc cac gca aac ctg tgg cgc 19092
Gly Asn Ile Phe Ala Met Glu Ile Asn Leu His Ala Asn Leu Trp Arg
925 930 935
agc ttc ttg tac agc aat gtg gcg ctg tac ctg cca gac agc tat aaa 19140
Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys
940 945 950 955
ttc act ccc gct aac atc act ctg ccc gcc aac caa aac acc tac gag 19188
Phe Thr Pro Ala Asn Ile Thr Leu Pro Ala Asn Gln Asn Thr Tyr Glu
960 965 970
tat atc aac ggg cgc gtc act tct cca acc ctg gtg gac acc ttt gtt 19236
Tyr Ile Asn Gly Arg Val Thr Ser Pro Thr Leu Val Asp Thr Phe Val
975 980 985
aac att gga gcc cga tgg tcg ccg gat ccc atg gac aac gtc aac ccc 19284
Asn Ile Gly Ala Arg Trp Ser Pro Asp Pro Met Asp Asn Val Asn Pro
990 995 1000
ttt aac cat cac cgg aac gcg ggc ctc cgt tac cgc tcc atg ctg 19329
Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu
1005 1010 1015
ctg gga aat gga cgc gtg gtg cct ttc cac ata caa gtg ccg caa 19374
Leu Gly Asn Gly Arg Val Val Pro Phe His Ile Gln Val Pro Gln
1020 1025 1030
aaa ttt ttc gcg att aag aac ctc ctg ctt ttg ccc ggc tcc tac 19419
Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr
1035 1040 1045
act tac gag tgg agc ttc aga aaa gac gtg aac atg att ctg cag 19464
Thr Tyr Glu Trp Ser Phe Arg Lys Asp Val Asn Met Ile Leu Gln
1050 1055 1060
agc acc ctg ggc aat gat ctt cga gtg gac ggg gcc agc gtc cgc 19509
Ser Thr Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg
1065 1070 1075
att gac agc gtc aac ttg tac gcc aac ttt ttc ccc atg gcg cac 19554
Ile Asp Ser Val Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His
1080 1085 1090
aac acc gct tct acc ttg gaa gcc atg ctg cga aac gac acc aac 19599
Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn
1095 1100 1105
gac cag tcg ttt aac gac tac ctc agc gcg gcc aac atg ctt tat 19644
Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr
1110 1115 1120
ccc att ccg gcc aac gcc acc aac gtt ccc att tcc att ccc tcc 19689
Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser
1125 1130 1135
cgc aac tgg gcg gcc ttc cgg gga tgg agc ttc acc cgc ctt aaa 19734
Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys
1140 1145 1150
gcc aag gaa acg cct tcc ttg ggc tcc ggc ttt gac ccc tac ttt 19779
Ala Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe
1155 1160 1165
gtg tac tca ggc acc att cct tac ctg gac ggc agc ttt tac ctc 19824
Val Tyr Ser Gly Thr Ile Pro Tyr Leu Asp Gly Ser Phe Tyr Leu
1170 1175 1180
aac cac act ttc aaa cgt ctg tcc atc atg ttc gat tct tcc gta 19869
Asn His Thr Phe Lys Arg Leu Ser Ile Met Phe Asp Ser Ser Val
1185 1190 1195
agt tgg ccg ggc aac gac cgc ctc ctg acg ccg aac gag ttc gaa 19914
Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu
1200 1205 1210
att aag cgc att gtg gac ggg gaa ggc tac aac gtg gct caa agt 19959
Ile Lys Arg Ile Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Ser
1215 1220 1225
aac atg acc aaa gac tgg ttt tta att caa atg ctc agc cac tac 20004
Asn Met Thr Lys Asp Trp Phe Leu Ile Gln Met Leu Ser His Tyr
1230 1235 1240
aac atc ggc tac caa ggc ttc tat gtt ccc gag ggc tac aag gat 20049
Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp
1245 1250 1255
cgg atg tat tct ttc ttc cga aac ttt cag ccc atg agc cgc cag 20094
Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln
1260 1265 1270
gtg ccg gat ccc acc gct gcc ggc tat caa gcc gtt ccc ctg ccc 20139
Val Pro Asp Pro Thr Ala Ala Gly Tyr Gln Ala Val Pro Leu Pro
1275 1280 1285
aga caa cac aac aac tcg ggc ttt gtg ggg tac atg ggc ccg acc 20184
Arg Gln His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr
1290 1295 1300
atg cgc gaa gga cag cca tac ccg gcc aac tac ccc tat ccc ctg 20229
Met Arg Glu Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu
1305 1310 1315
atc ggc gct acc gcc gtc ccc gcc att acc cag aaa aag ttt ttg 20274
Ile Gly Ala Thr Ala Val Pro Ala Ile Thr Gln Lys Lys Phe Leu
1320 1325 1330
tgc gac cgc gtc atg tgg cgc ata cct ttt tcc agc aac ttt atg 20319
Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met
1335 1340 1345
tca atg ggg gcc ctg acc gac ctc gga cag aac atg ctt tac gct 20364
Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala
1350 1355 1360
aac tcc gcc cat gcc ctg gat atg act ttt gag gtg gac ccc atg 20409
Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met
1365 1370 1375
aac gag ccc acg ttg ctg tac atg ctt ttt gag gtg ttc gac gtg 20454
Asn Glu Pro Thr Leu Leu Tyr Met Leu Phe Glu Val Phe Asp Val
1380 1385 1390
gtc aga gtg cac cag ccg cac cgc ggt att atc gag gcc gtg tac 20499
Val Arg Val His Gln Pro His Arg Gly Ile Ile Glu Ala Val Tyr
1395 1400 1405
ctg cgc acc ccc ttc tct gcg ggc aat gcc acc aca taa gccgctgaac 20548
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
1410 1415 1420
tagctggttt ttaccccaga tcccatgggc tccacggaag acgaactgcg ggccattgtg 20608
cgagacctgg gctgcggacc ctacttcctg ggcacctttg acaagcggtt tcccgggttc 20668
gtgtctcctc gcaaactcgc gtgcgcgatc gtgaataccg ccggccgaga gaccggagga 20728
gagcattggc tagctctggg ctggaacccc cgctcgtcca cgtttttcct gttcgacccc 20788
tttggctttt cagaccaacg cttgaagcag atctatgcat ttgaatatga gggtctactc 20848
aagcgaagcg cgctggcctc ctccgccgat cactgtctaa ccctggtaaa gagcactcag 20908
acggttcagg gccctcacag cgccgcctgt ggcctttttt gttgcatgtt tttgcacgcc 20968
tttgtgaact ggccggacac ccccatggaa aacaacccca ccatggacct cctgactggc 21028
gttcccaact ccatgctcca aagccccagc gtgcagacca ccctcctcca aaaccagaaa 21088
aatctgtacg cctttctgca caagcactct ccctactttc gccgccatcg ggaacaaata 21148
gaaaatgcaa ccgcgtttaa caaaactctg taacgtttaa taaatgaact ttttattgaa 21208
ctggaaaacg ggtttgtgat ttttaaaaat caaaggggtt gagctggaca tccatgtggg 21268
aggccggaag ggtggtgttc ttgtactggt acttgggcag ccacttaaac tctggaatca 21328
caaacttggg cagcggtatt tctgggaagt tgtcgtgcca cagctggcgg gtcagctgaa 21388
gtgcctgcag aacatcgggg gcggagatct tgaagtcgca gtttatctgg ttcacggcac 21448
gcgcgttgcg gtacatggga ttggcacact gaaacaccag caggctggga ttcttgatgc 21508
tagccagggc cacggcgtcg gtcacgtcac cggtgtcttc tatgttggac agcgaaaaag 21568
gcgtgacttt gcaaagctgg cgtcccgcgc gaggcacgca atctcccagg tagttgcact 21628
cacagcggat gggcagaaga agatgcttgt ggccgcgggt catgtaggga taggccgctg 21688
ccataaaagc ttcgatctgc ctgaaagcct gcttggcctt gtgcccttcg gtataaaaaa 21748
caccgcagga cttgttggaa aaggtattac tggcgcaagc ggcatcgtga aagcaagcgc 21808
gtgcgtcttc gtttcgtaac tgcaccacgc tgcggcccca ccggttctga atcaccttgg 21868
ccctgccggg gttttccttg agagcgcgct ggccggcttc gctgcccaca tccatttcca 21928
cgacatgctc cttgttaatc atggccagac cgtggaggca gcgcagctcc tcgtcatcgt 21988
cggtgcagtg atgctcccac acgacgcagc cagtgggctc ccacttgggc ttggaggcct 22048
cggcaatgcc agaatacagg agaacgtagt ggtgcagaaa acgtcccatc atggtgccaa 22108
aggttttctg gctgctgaag gtcatcgggc agtacctcca gtcctcgtta agccaagtgt 22168
tgcagatctt cctgaagacc gtgtactgat cgggcataaa gtggaactca ttgcgctcgg 22228
tcttgtcgat cttatacttt tccatcagac tatgcataat ctccatgccc ttttcccagg 22288
cgcaaacaat cttggtgcta cacgggttag gtatggccaa agtggttggc ctctgaggcg 22348
gcgcttgttc ttcctcttga gccctctccc gactgacggg ggttgaaaga gggtgcccct 22408
tggggaacgg cttgaacacg gtctggcccg aggcgtcccg aagaatctgc atcgggggat 22468
tgctggccgt catggcgatg atctgacccc ggggctcctc cacttcgtcc tcctcgggac 22528
tttcctcgtg cttttcgggg gacggtacgg gagtaggggg aagagcgcgg cgcgccttct 22588
tcttgggcgg cagttccgga gcctgctctt gacgactggc cattgtcttc tcctaggcaa 22648
gaaaaacaag atggaagact ctttctcctc ctcctcgtca acgtcagaaa gcgagtcttc 22708
caccttaagc gccgagaact cccagcgcat agaatccgat gtgggctacg agactccccc 22768
cgcgaacttt tcgccgcccc ccataaacac taacgggtgg acggactacc tggccctagg 22828
agacgtactg ctgaagcaca tcaggcggca gagcgttatc gtgcaagatg ctctcaccga 22888
gcgactcgcg gttccgctgg aagtggcgga acttagcgcc gcctacgagc gaaccctctt 22948
ctccccaaag actcccccca agaggcaggc taacggcacc tgcgagccta accctcgact 23008
caacttctac cctgcctttg ccgtgccaga ggtactggct acgtaccaca tttttttcca 23068
aaaccacaaa atccctctct cgtgccgcgc caaccgcacc aaagccgatc gcgtgctgcg 23128
actggaggaa ggggctcgca tacctgagat tgcgtgtctg gaggaagtcc caaaaatctt 23188
tgaaggtctg ggccgcgacg aaaagcgagc agcaaacgct ctggaagaga acgcagagag 23248
tcacaacagc gccttggtag aactcgaggg cgacaacgcc agactggccg tcctcaaacg 23308
gtccatagaa gtcacgcact tcgcctaccc cgccgttaac ctccctccaa aagttatgac 23368
agcggtcatg gactcgctgc tcataaagcg cgctcagccc ttagacccag agcacgaaaa 23428
caacagtgac gaaggaaaac cggtggtttc tgatgaggag ttgagcaagt ggctgtcctc 23488
caacgacccc gccacgttgg aggaacgaag aaaaaccatg atggccgtgg tgctagttac 23548
cgtgcaatta gaatgtctgc agaggttctt ttcccaccca gagaccctga gaaaagtgga 23608
ggaaacgctg cactacacat ttaggcacgg ctacgtgaag caagcctgca agatttccaa 23668
cgtagaactt agcaacctca tctcctacct ggggatcttg cacgaaaacc gcctcggaca 23728
aaacgtgctg cacagcacac tgaaaggaga agcccgccga gactatgtgc gagactgcgt 23788
gttcctagcg ctagtgtaca cctggcagag cggaatggga gtctggcagc agtgcctgga 23848
ggacgaaaac ctcaaagagc ttgaaaagct gctggtgcgc tccagaaggg cactgtggac 23908
cagttttgac gagcgcaccg ccgcgcgaga cctagctgat attatttttc ctcccaagct 23968
ggtgcagact ctccgggaag gactgccaga ttttatgagt caaagcatct tgcaaaactt 24028
ccgctctttc atcttggaac gctcgggaat cttgcccgcc actagctgcg ccctacccac 24088
agattttgtg cctctccact accgcgaatg cccaccgccg ctgtggccgt acacttactt 24148
gcttaaactg gccaactttc taatgttcca ctctgacctg gcagaagacg ttagcggcga 24208
ggggctgcta gaatgccact gccgctgcaa cctgtgcacc ccccaccgct ctctagtatg 24268
caacactccc ctgctcaatg agacccagat catcggtacc tttgaaatcc agggaccctc 24328
cgacgcggaa aacggcaagc aggggtctgg gctaaaactc acagccggac tgtggacctc 24388
cgcctacttg cgcaaatttg taccagaaga ctatcacgcc caccaaatta aattttacga 24448
aaaccaatca aaaccaccca aaagcgagtt aacggcttgc gtcattacgc agagcagcat 24508
agttgggcag ttgcaagcca ttaacaaagc gcggcaagag tttctcctaa aaaaaggaaa 24568
aggggtctac ttggaccccc agaccggcga ggaactcaac ggaccctcct cagtcgcagg 24628
ttgtgtgccc catgccgccc aaaaagaaca cctcgcagtg gaacatgcca gagacggagg 24688
aagaggagtg gagcagtgtg agcaacagcg aaacggagga agagccgtgg cccgaggggt 24748
gcaacgggga agaggacacg gagggacggc gaagtcttcg ccgaagaact ctcgccgctg 24808
cccccgaagt cccagccggc cgcctcggcc caagatcccg cacacacccg tagatgggat 24868
agcaagacca aaaagccggg taagagaaac gctcgccccc gccagggcta ccgctcgtgg 24928
agaaagcaca aaaactgcat cttatcgtgc ttgctccagt gcggcggaga cgtttcgttc 24988
acccgtagat acttgctttt taacaaaggg gtggccgtcc cccgtaacgt cctccactac 25048
taccgtcact cttacagctc cgaagcggac ggctaagaaa acgcagcagt tgccggcggg 25108
aggactgcgt ctcagcgccc gagaaccccc agccaccagg gagctccgaa accgcatatt 25168
tcccaccctc tacgctatct ttcagcaaag ccgggggcag cagcaagaac tgaaaataaa 25228
aaaccgcacg ctgaggtcgc ttacccgaag ctgcctctat cacaagagcg aagagcagct 25288
gcagcgaacc ctggaggacg cagaagcgct gttccagaag tactgcgcga ccaccctaaa 25348
taactaaaaa agcccgcgcg cgggacttca aaccgtctga cgtcaccagc cgcgcgccaa 25408
aatgagcaaa gagattccca cgccttacat gtggagttac cagccgcaga tgggattagc 25468
cgccggcgcc gcccaggatt actccacgaa aatgaactgg ctcagcgccg ggccccacat 25528
gatttcccgc gtaaacgaca ttcgcgccca ccgcaatcag ctattgttag aacaggctgc 25588
tctgaccgcc acgccccgta ataacctgaa ccctcccagc tggccagctg ccctggtgta 25648
ccaggaaacg cctccaccca ccagcgtact tttgccccgt gacgcccagg cggaagtcca 25708
gatgactaac gcgggcgcgc aattagcggg cggatcccgg tttcggtaca gagttcacgg 25768
cgccgcaccc tatagcccag gtataaagag gctgatcatt cgaggcagag gtgtccagct 25828
caacgacgag acagtgagct cttcgcttgg tctacgacca gacggagtgt tccagctcgc 25888
gggctcgggc cgctcttcgt tcacgcctcg ccaggcatac ctgactctgc agagctctgc 25948
ctctcagcct cgctcgggag gaatcggacc ccttcagttt gtggaggagt ttgtgccctc 26008
ggtctacttt cagcctttct ccggatcgcc cggccagtac ccggacgagt tcatccccaa 26068
cttcgacgcg gtgagtgact ctgtggacgg ttatgactga tgtcgagccc gcttcagtgc 26128
tagtggaaca agcgcggctc aatcacctgg ttcgttgccg ccgccgctgc tgcgtggctc 26188
gcgacttgag cttagctctc aagtttgtaa aaaacccgtc cgaaaccggg agcgctgtgc 26248
acgggttgga gctagtgggt cctgagaagg ccaccatcca cgttctcaga aactttgtgg 26308
aaaaacccat tttggttaaa cgagatcagg ggccttttgt aatcagctta ctctgcacct 26368
gtaaccatgt tgaccttcac gactatttta tggatcattt gtgcgctgaa ttcaataagt 26428
aaagcgaatt cttaccaaga ttatgatgtc catgactgtt cctcgccact atacgatgtt 26488
gtgccagtaa actctcttgt cgacatctat ctgaactgtt ccttttggtc cgcacagctt 26548
acttggtact acggtgacac cgtcctttct ggctcactgg gcagctcaca cggaataaca 26608
cttcacctct tttcgccgtt tcgatacgga aactacagct gtcgtgccgg tacctgcctc 26668
cacgttttca atcttcagcc ctgtccaccg accaaacttg tatttgtcga ctctaagcac 26728
ttacagctca actgcagcat tctaggcccc agtatcttgt ggacatacaa taaaatcagg 26788
ttggtggaat ttgtctacta cccacccagc gcccgcggtt ttggggaaat tcctttccag 26848
atctactaca actatcttgc cacacattat gcaagtcaac agcaactaaa cttgcaagca 26908
cccttcacgc caggagagta ctcctgtcac gtaggctcct gcacagaaac ttttattctc 26968
ttcaacagat cttctgccat tgaacgcttc actactaact actttagaaa ccaagttgtg 27028
cttttcactg acgaaacccc taacgtcacc ctggactgtg catgtttttc tcatgacacc 27088
gtaacttgga ctcttaacaa tactctctgg ctcgcgttcg ataaccaaag cttgattgtt 27148
aaaaattttg atttaacctt tactaaaccc tctcctcgcg aaatagttat ctttgctcct 27208
tttaatccaa aaactacctt agcctgtcag gttttgttta agccttgcca aacaaacttt 27268
aagtttgttt atttgcctcc gcaatctgtc aaactcatag aaaaatacaa caaagcgccc 27328
gtcttggctc ctaaaacctt ctaccactgg ctaacctaca cggggctgtt tgcactaatt 27388
gtttttttcc taattaacat ttttatatgt ttcttgcctt cctccttctt ttcgcgaaca 27448
ccgttgccgc agaaagacct ctccttatta ctgtagcgct tgctatacaa aaccaagagt 27508
ggtcaaccgt gctctcaatc tattttcaat ttttcatttt gtccttaata ctttctctta 27568
ttgtcgttaa caatgatctg gagcattggt ctcgcctttt tttggctgct tagtgcaaaa 27628
gccactattt ttcacaggta tgtggaagaa ggaactagca ccctctttac gatacctgaa 27688
acaattaagg cggctgatga agtttcttgg tacaaaggct cgctctcaga cggcaaccac 27748
tcattctcag gacagaccct ttgcatccaa gaaacttatt ttaaatcaga actacaatac 27808
agctgcataa aaaacttttt ccatctctac aacatctcaa aaccctatga gggtatttac 27868
aatgccaagg tttcagacaa ctccagcaca cggaactttt actttaatct gacagttatt 27928
aaagcaattt ccattcctat ctgtgagttt agctcccagt ttctttctga aacctactgt 27988
ttaattacta taaactgcac taaaaatcgc cttcacacca ccataatcta caatcacaca 28048
caatcacctt gggttttaaa cctaaaattt tctccacaca tgccttcgca atttctcacg 28108
caagttaccg tctctaacat aagcaagcag tttggctttt actatccttt ccacgaactg 28168
tgcgaaataa ttgaagccga atatgaacca gactacttta cttacattgc cattggtgta 28228
atcgttgttt gcctttgctt tgttattggg gggtgtgttt atttgtacat tcagagaaaa 28288
atattgctct cgctgtgctc ctgcggttac aaagcagaag aaagaattaa aatctctaca 28348
ctttattaat gttttccaga aatggcaaaa ctaacgctcc tacttttgct tctcacgccg 28408
gtgacgcttt ttaccatcac tttttctgcc gccgccacac tcgaacctca atgtttgcca 28468
ccggttgaag tctactttgt ctacgtgttg ctgtgctgcg ttagcgtttg cagtataaca 28528
tgttttacct ttgtttttct tcagtgcatt gactacttct gggtcagact ctactaccgc 28588
agacacgcgc ctcagtatca aaatcaacaa attgccagac tactcggtct gccatgattg 28648
tcttgtattt taccctgatt ttttttcacc ttacttgcgc ttgtgatttt cacttcactc 28708
aattttggaa aacgcaatgc ttcgacccgc gcctctccaa cgactggatg atggctcttg 28768
caattgccac gcttggggcg tttggacttt ttagtggttt tgctttgcat tacaaattta 28828
agactccatg gacacatggc tttctttcag attttccagt tacacctact ccgccgcctc 28888
ccccggccat cgacgtgcct caggttccct caccttctcc atctgtctgc agctactttc 28948
atctgtaatg gccgacctag aatttgacgg agtgcaatct gagcaaaggg ctatacactt 29008
ccaacgccag tcggaccgcg aacgcaaaaa cagagagctg caaaccatac aaaacaccca 29068
ccaatgtaaa cgcgggatat tttgtattgt aaaacaagct aagctccact acgagcttct 29128
atctggcaac gaccacgagc tccaatacgt ggtcgatcag cagcgtcaaa cctgtgtatt 29188
cttaattgga gtttccccca ttaaagttac tcaaaccaag ggtgaaacca agggaaccat 29248
aaggtgctca tgtcacctgt cagaatgcct ttacactcta gttaaaaccc tatgtggctt 29308
acatgattct atccccttta attaaataaa cttactttaa atctgcaatc acttcttcgt 29368
ccttgttttt gtcgccatcc agcagcacca ccttcccctc ttcccaactt tcatagcata 29428
ttttccgaaa agaggcgtac tttcgccaca ccttaaaggg aacgtttact tcgctttcaa 29488
gctctcccac gattttcatt gcagat atg aaa cgc gcc aaa gtg gaa gaa gga 29541
Met Lys Arg Ala Lys Val Glu Glu Gly
1425
ttt aac ccc gtt tat ccc tat gga tat tct act ccg act gac gtg 29586
Phe Asn Pro Val Tyr Pro Tyr Gly Tyr Ser Thr Pro Thr Asp Val
1430 1435 1440
gct cct ccc ttt gta gcc tct gac ggt ctt caa gaa aac cca cct 29631
Ala Pro Pro Phe Val Ala Ser Asp 6ly Leu Gln Glu Asn Pro Pro
1445 1450 1455
ggg gtc ttg tcc cta aaa ata tcc aaa cct tta act ttt aat gcc 29676
Gly Val Leu Ser Leu Lys Ile Ser Lys Pro Leu Thr Phe Asn Ala
1460 1465 1470
tcc aag gct cta agc ctg gct att ggt cca gga tta aaa att caa 29721
Ser Lys Ala Leu Ser Leu Ala Ile Gly Pro Gly Leu Lys Ile Gln
1475 1480 1485
gat ggt aaa cta gtg ggg gag gga caa gca att ctt gca aac ctg 29766
Asp Gly Lys Leu Val Gly Glu Gly Gln Ala Ile Leu Ala Asn Leu
1490 1495 1500
ccg ctt caa atc acc aac aac aca att tca cta cgt ttt ggg aac 29811
Pro Leu Gln Ile Thr Asn Asn Thr Ile Ser Leu Arg Phe Gly Asn
1505 1510 1515
aca ctt gcc ttg aat gac aat aat gaa ctc caa acc aca cta aaa 29856
Thr Leu Ala Leu Asn Asp Asn Asn Glu Leu Gln Thr Thr Leu Lys
1520 1525 1530
tct tca tcg ccc ctt aaa atc aca gac cag act ctg tcc ctt aac 29901
Ser Ser Ser Pro Leu Lys Ile Thr Asp Gln Thr Leu Ser Leu Asn
1535 1540 1545
ata ggg gac agc ctt gca att aaa gat gac aaa cta gaa agc gct 29946
Ile Gly Asp Ser Leu Ala Ile Lys Asp Asp Lys Leu Glu Ser Ala
1550 1555 1560
ctt caa gcg acc ctc cca ctc tcc att agc aac aac acc atc agc 29991
Leu Gln Ala Thr Leu Pro Leu Ser Ile Ser Asn Asn Thr Ile Ser
1565 1570 1575
ctc aac gtg ggc acc gga ctc acc ata aat gga aac gtt tta caa 30036
Leu Asn Val Gly Thr Gly Leu Thr Ile Asn Gly Asn Val Leu Gln
1580 1585 1590
gct gtt ccc tta aat gct cta agt ccc cta act att tcc aac aat 30081
Ala Val Pro Leu Asn Ala Leu Ser Pro Leu Thr Ile Ser Asn Asn
1595 1600 1605
aac atc agc ctg cgc tat ggc agt tcc ctg acg gtg ctt aac aat 30126
Asn Ile Ser Leu Arg Tyr Gly Ser Ser Leu Thr Val Leu Asn Asn
1610 1615 1620
gaa ctg caa agc aac ctc aca gtt cac tcc cct tta aaa ctc aac 30171
Glu Leu Gln Ser Asn Leu Thr Val His Ser Pro Leu Lys Leu Asn
1625 1630 1635
tcc aac aac tca att tct ctc aac act cta tct ccg ttt aga atc 30216
Ser Asn Asn Ser Ile Ser Leu Asn Thr Leu Ser Pro Phe Arg Ile
1640 1645 1650
gag aat ggt ttc ctc acg ctc tat ttg gga aca aaa tct ggc ttg 30261
Glu Asn Gly Phe Leu Thr Leu Tyr Leu Gly Thr Lys Ser Gly Leu
1655 1660 1665
cta gtt caa aac agt ggc tta aaa gtt caa gcg ggc tac ggc ctg 30306
Leu Val Gln Asn Ser Gly Leu Lys Val Gln Ala Gly Tyr Gly Leu
1670 1675 1680
caa gta aca gac acc aat gct ctc aca tta aga tat ctc gct cca 30351
Gln Val Thr Asp Thr Asn Ala Leu Thr Leu Arg Tyr Leu Ala Pro
1685 1690 1695
ctg acc att cca gac tcg ggc tca gaa caa ggc att ctt aaa gta 30396
Leu Thr Ile Pro Asp Ser Gly Ser Glu Gln Gly Ile Leu Lys Val
1700 1705 1710
aac act gga cag ggc cta agt gtg aac caa gct gga gcg ctt gaa 30441
Asn Thr Gly Gln Gly Leu Ser Val Asn Gln Ala Gly Ala Leu Glu
1715 1720 1725
aca tcc cta gga ggt gga tta aaa tat gct gat aac aaa ata acc 30486
Thr Ser Leu Gly Gly Gly Leu Lys Tyr Ala Asp Asn Lys Ile Thr
1730 1735 1740
ttt gat aca gga aac gga ctg aca tta tct gaa aat aaa ctt gca 30531
Phe Asp Thr Gly Asn Gly Leu Thr Leu Ser Glu Asn Lys Leu Ala
1745 1750 1755
gta gct gca ggt agt ggt cta act ttt aga gat ggt gcc ttg gta 30576
Val Ala Ala Gly Ser Gly Leu Thr Phe Arg Asp Gly Ala Leu Val
1760 1765 1770
gcc acg gga acc gca ttt acg caa aca ctg tgg act acg gct gat 30621
Ala Thr Gly Thr Ala Phe Thr Gln Thr Leu Trp Thr Thr Ala Asp
1775 1780 1785
ccg tct ccc aac tgc aca att ata cag gac cgc gac aca aaa ttt 30666
Pro Ser Pro Asn Cys Thr Ile Ile Gln Asp Arg Asp Thr Lys Phe
1790 1795 1800
act ttg gcg ctt acc att agt ggg agc caa gtg ctg ggg acg gtt 30711
Thr Leu Ala Leu Thr Ile Ser Gly Ser Gln Val Leu Gly Thr Val
1805 1810 1815
tcc att att gga gta aaa ggc ccc ctt tca agt agc ata ccg tca 30756
Ser Ile Ile Gly Val Lys Gly Pro Leu Ser Ser Ser Ile Pro Ser
1820 1825 1830
gct acc gtt aca gta caa ctt aac ttt gat tcc aac gga gcc cta 30801
Ala Thr Val Thr Val Gln Leu Asn Phe Asp Ser Asn Gly Ala Leu
1835 1840 1845
ttg agc tcc tct tca ctt aaa ggt tac tgg ggg tat cgc caa ggt 30846
Leu Ser Ser Ser Ser Leu Lys Gly Tyr Trp Gly Tyr Arg Gln Gly
1850 1855 1860
ccc tca att gac cct tac ccc ata att aat gcc tta aac ttt atg 30891
Pro Ser Ile Asp Pro Tyr Pro Ile Ile Asn Ala Leu Asn Phe Met
1865 1870 1875
cca aac tca ctg gct tat ccc ccg gga caa gaa atc caa gca aaa 30936
Pro Asn Ser Leu Ala Tyr Pro Pro Gly Gln Glu Ile Gln Ala Lys
1880 1885 1890
tgt aac atg tac gtt tct act ttt tta cga gga aat cca caa aga 30981
Cys Asn Met Tyr Val Ser Thr Phe Leu Arg Gly Asn Pro Gln Arg
1895 1900 1905
cca ata gtt tta aac atc act ttt aat aat caa acc agc ggg ttt 31026
Pro Ile Val Leu Asn Ile Thr Phe Asn Asn Gln Thr Ser Gly Phe
1910 1915 1920
tcc att aga ttt aca tgg aca aat tta acc aca gga gaa gca ttt 31071
Ser Ile Arg Phe Thr Trp Thr Asn Leu Thr Thr Gly Glu Ala Phe
1925 1930 1935
gca atg ccc cca tgc act ttt tcc tac att gct gaa caa caa taa 31116
Ala Met Pro Pro Cys Thr Phe Ser Tyr Ile Ala Glu Gln Gln
1940 1945 1950
actatgtaac cctcaccgtt aacccgcctc cgcccttcca ttttatttta taaaccaccc 31176
gatccacctt ttcagcagta aacaattgca tgtcagtagg ggcagtaaaa cttttgggag 31236
ttaaaatcca cacaggttct tcacaagcta agcgaaaatc agttacactt ataaaaccat 31296
cgctaacatc ggacaaagac aagcatgagt ccaaagcttc cggttctgga tcagattttt 31356
gttcattaac agcgggagaa acagcttctg gaggattttc catctccatc tccttcatca 31416
gttccaccat gtccaccgtg gtcatctggg acgagaacga cagttgtcat acacctcata 31476
agtcaccggt cgatgacgaa cgtacagatc tcgaagaatg tcctgtcgcc gcctttcggc 31536
agcactgggc cgaaggcgaa agcgcccatg tttaacaatg gccagcaccg cccgcttcat 31596
caggcgccta gttcttttag cgcaacagcg catgcgcagc tcgctaagac tggcgcaaga 31656
aacacagcac agaaccacca gattgttcat gatcccataa gcgtgctgac accagcccat 31716
actaacaaat tgtttcacta ttctagcatg aatgtcatat ctgatgttca agtaaattaa 31776
atggcgcccc cttatgtaaa cacttcccac gtacaacacc tcctttggca tctgataatt 31836
aaccacctcc cgataccaaa tacatctctg attaatagtc gccccgtaca ctacccgatt 31896
aaaccaagtt gccaacataa tcccccctgc catacactgc aaagaacctg gacggctaca 31956
atgacagtgc aaagtccaca cctcgttgcc atggataact gaggaacgcc ttaagtcaat 32016
agtggcacaa ctaatacaaa catgtaaata gtgtttcaac aagtgccact cgtatgaggt 32076
gagtatcatg tcccagggaa cgggccactc cataaacact gcaaaaccaa cacatcctac 32136
catcccccgc acggcactca catcgtgcat ggtgttcata tcacagtccg gaagctgagg 32196
acaaggaaaa gtctcgggag cattttcata gggcggtagt gggtactcct tgtaggggtt 32256
cagtcggcac cggtatctcc tcaccttctg ggccataaca cacaagttga gatctgattt 32316
caaggtactt tctgaatgaa aaccaagtgc tttcccaaca atgtatccga tgtcttcggt 32376
ccccgcgtcg gtagcgctcc ttgcagtaca cacggaacaa ccactcacgc aggcccagaa 32436
gacagttttc cgcggacggt gacaagttaa tccccctcag tctcagagcc aatatagttt 32496
cttccacagt agcataggcc aaacccaacc aggaaacaca agctggcacg tcccgttcaa 32556
cgggaggaca aggaagcaga ggcagaggca taggcaaagc aacagaattt ttattccaac 32616
tggtcacgta gcacttcaaa caccaggtca cgtaaatggc agcgatcttg ggtttcctga 32676
tggaacataa cagcaagatc aaacatgaga cgattctcaa ggtgattaac cacagctgga 32736
attaaatcct ccacgcgcac atttagaaac accagcaata caaaagcccg gttttctccg 32796
ggatctatca tagcagcaca gtcatcaatt agtcccaagt aattttcccg tttccaatct 32856
gttataattt gcagaataat gccctgtaaa tccaagccgg ccatggcgaa aagctcagat 32916
aatgcacttt ccacgtgcat tcgtaaacac accctcatct tgtcaatcca aaaagtcttc 32976
ttcttgagaa acctgtagta aattaagaat cgccaggtta ggctcgatgc ctacatcccg 33036
gagcttcatt ctcagcatgc actgcaaatg atccagcaga tcagaacagc aattagcagc 33096
cagctcatcc ccggtttcca gttccggagt tcccacggca attatcactc gaaacgtggg 33156
acaaatcgaa ataacatgag ctcccacgtg agcaaaagcc gtagggccag tgcaataatc 33216
acagaaccag cggaaaaaag attgcagctc atgtttcaaa aagctctgca gatcaaaatt 33276
cagctcatgc aaataacaca gtaaagtttg cggtatagta accgaaaacc acacgggtcg 33336
acgttcaaac atctcggctt acctaaaaaa gaagcacatt tttaaaccac agtcgcttcc 33396
tgaacaggag gaaatatggt gcggcgtaaa accagacgcg ccaccggatc tccggcagag 33456
ccctgataat acagccagct gtggttaaac agcaaaacct ttaattcggc aacggttgag 33516
gtctccacat aatcagcgcc cacaaaaatc ccatctcgaa cttgctcgcg tagggagcta 33576
aaatggccag tatagcccca tggcacccga acgctaatct gcaagtatat gagagccacc 33636
ccattcggcg ggatcacaaa atcagtcgga gaaaacaacg tatacacccc ggactgcaaa 33696
agctgttcag gcaaacgccc ctgcggtccc tctcggtaca ccagcaaagc ctcgggtaaa 33756
gcagccatgc caagcgctta ccgtgccaag agcgactcag acgaaaaagt gtactgaggc 33816
gctcagagca gcggctatat actctacctg tgacgtcaag aaccgaaagt caaaagttca 33876
cccggcgcgc ccgaaaaaac ccgcgaaaat ccacccaaaa agcccgcgaa aaacacttcc 33936
gtataaaatt tccgggttac cggcgcgtca ccgccgcgcg acacgcccgc cccgccccgc 33996
gctcctcccc gaaacccgcc gcgcccactt ccgcgttccc aagacaaagg tcgcgtaact 34056
ccgcccacct catttgcatg ttaactcggt cgccatcttg cggtgttata ttgatgatg 34115
<210>35
<211>503
<212>PRT
<213>猿猴腺病毒SV-39
<400>35
Met Arg Arg Ala Val Ala Val Pro Ser Ala Ala Met Ala Leu Gly Pro
1 5 10 15
Pro Pro Ser Tyr Glu Ser Val Met Ala Ala Ala Thr Leu Gln Ala Pro
20 25 30
Leu Glu Asn Pro Tyr Val Pro Pro Arg Tyr Leu Glu Pro Thr Gly Gly
35 40 45
Arg Asn Ser Ile Arg Tyr Ser Glu Leu Thr Pro Leu Tyr Asp Thr Thr
50 55 60
Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Thr Leu Asn
65 70 75 80
Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Ser Val Val Gln Asn Ser
85 90 95
Asp Tyr Thr Pro Ala Glu Ala Ser Thr Gln Thr Ile Asn Leu Asp Asp
100 105 110
Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr Ile Leu His Thr Asn Met
115 120 125
Pro Asn Val Asn Glu Phe Met Phe Thr Asn Ser Phe Arg Ala Lys Leu
130 135 140
Met Val Ala His Glu Ala Asp Lys Asp Pro Val Tyr Glu Trp Val Gln
145 150 155 160
Leu Thr Leu Pro Glu Gly Asn Phe Ser Glu Ile Met Thr Ile Asp Leu
165 170 175
Met Asn Asn Ala Ile Ile Asp His Tyr Leu Ala Val Ala Arg Gln Gln
180 185 190
Gly Val Lys Glu Ser Glu Ile Gly Val Lys Phe Asp Thr Arg Asn Phe
195 200 205
Arg Leu Gly Trp Asp Pro Glu Thr Gly Leu Val Met Pro Gly Val Tyr
210 215 220
Thr Asn Glu Ala Phe His Pro Asp Val Val Leu Leu Pro Gly Cys Gly
225 230 235 240
Val Asp Phe Thr Tyr Ser Arg Leu Asn Asn Leu Leu Gly Ile Arg Lys
245 250 255
Arg Met Pro Phe Gln Glu Gly Phe Gln Ile Leu Tyr Glu Asp Leu Glu
260 265 270
Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Pro Ala Tyr Glu Glu Ser
275 280 285
Ile Ala Asn Ala Arg Glu Ala Ala Ile Arg Gly Asp Asn Phe Ala Ala
290 295 300
Gln Pro Gln Ala Ala Pro Thr Ile Lys Pro Val Leu Glu Asp Ser Lys
305 310 315 320
Gly Arg Ser Tyr Asn Val Ile Ala Asn Thr Asn Asn Thr Ala Tyr Arg
325 330 335
Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg
340 345 350
Ala Trp Thr Leu Leu Thr Thr Pro Asp Val Thr Cys Gly Ser Glu Gln
355 360 365
Val Tyr Trp Ser Leu Pro Asp Met Tyr Val Asp Pro Val Thr Phe Arg
370 375 380
Ser Thr Gln Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Met
385 390 395 400
Pro Ile His Ser Lys Ser Phe Tyr Asn Glu Gln Ala Val Tyr Ser Gln
405 410 415
Leu Ile Arg Gln Thr Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro
420 425 430
Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val
435 440 445
Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Gln
450 455 460
Asn Ser Ile Arg Gly Val Gln Arg Val Thr Ile Thr Asp Ala Arg Arg
465 470 475 480
Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg
485 490 495
Val Leu Ser Ser Arg Thr Phe
500
<210>36
<211>917
<212>PRT
<213>猿猴腺病毒SV-39
<400>36
Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Glu Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Ile Arg Phe Val Pro Val Asp Lys Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80
Lys Thr Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Ile Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Asn Ser Gln Trp Asn Ala Thr Asp Asn Gly Asn Lys Pro
130 135 140
Val Cys Phe Ala Gln Ala Ala Phe Ile Gly Gln Ser Ile Thr Lys Asp
145 150 155 160
Gly Val Gln Ile Gln Asn Ser Glu Asn Gln Gln Ala Ala Ala Asp Lys
165 170 175
Thr Tyr Gln Pro Glu Pro Gln Ile Gly Val Ser Thr Trp Asp Thr Asn
180 185 190
Val Thr Ser Asn Ala Ala Gly Arg Val Leu Lys Ala Thr Thr Pro Met
195 200 205
Leu Pro Cys Tyr Gly Ser Tyr Ala Asn Pro Thr Asn Pro Asn Gly Gly
210 215 220
Gln Ala Lys Thr Glu Gly Asp Ile Ser Leu Asn Phe Phe Thr Thr Thr
225 230 235 240
Ala Ala Ala Asp Asn Asn Pro Lys Val Val Leu Tyr Ser Glu Asp Val
245 250 255
Asn Leu Gln Ala Pro Asp Thr His Leu Val Tyr Lys Pro Thr Val Gly
260 265 270
Glu Asn Val Ile Ala Ala Glu Ala Leu Leu Thr Gln Gln Ala Cys Pro
275 280 285
Asn Arg Ala Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met
290 295 300
Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser
305 310 315 320
Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser
325 330 335
Tyr Gln Leu Met Leu Asp Ala Leu Gly Asp Arg Thr Arg Tyr Phe Ser
340 345 350
Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile
355 360 365
Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu
370 375 380
Pro Gly Met Gly Ile Phe Asn Ser Tyr Lys Gly Val Lys Pro Gln Asn
385 390 395 400
Gly Gly Asn Gly Asn Trp Glu Ala Asn Gly Asp Leu Ser Asn Ala Asn
405 410 415
Glu Ile Ala Leu Gly Asn Ile Phe Ala Met Glu Ile Asn Leu His Ala
420 425 430
Asn Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro
435 440 445
Asp Ser Tyr Lys Phe Thr Pro Ala Asn Ile Thr Leu Pro Ala Asn Gln
450 455 460
Asn Thr Tyr Glu Tyr Ile Asn Gly Arg Val Thr Ser Pro Thr Leu Val
465 470 475 480
Asp Thr Phe Val Asn Ile Gly Ala Arg Trp Ser Pro Asp Pro Met Asp
485 490 495
Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg
500 505 510
Ser Met Leu Leu Gly Asn Gly Arg Val Val Pro Phe His Ile Gln Val
515 520 525
Pro Gln Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser
530 535 540
Tyr Thr Tyr Glu Trp Ser Phe Arg Lys Asp Val Asn Met Ile Leu Gln
545 550 555 560
Ser Thr Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg Ile
565 570 575
Asp Ser Val Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn Thr
580 585 590
Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser
595 600 605
Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala
610 615 620
Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala
625 630 635 640
Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Ala Lys Glu Thr Pro Ser
645 650 655
Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Thr Ile Pro
660 665 670
Tyr Leu Asp Gly Ser Phe Tyr Leu Asn His Thr Phe Lys Arg Leu Ser
675 680 685
Ile Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu
690 695 700
Thr Pro Asn Glu Phe Glu Ile Lys Arg Ile Val Asp Gly Glu Gly Tyr
705 710 715 720
Asn Val Ala Gln Ser Asn Met Thr Lys Asp Trp Phe Leu Ile Gln Met
725 730 735
Leu Ser His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly
740 745 750
Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser
755 760 765
Arg Gln Val Pro Asp Pro Thr Ala Ala Gly Tyr Gln Ala Val Pro Leu
770 775 780
Pro Arg Gln His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr
785 790 795 800
Met Arg Glu Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile
805 810 815
Gly Ala Thr Ala Val Pro Ala Ile Thr Gln Lys Lys Phe Leu Cys Asp
820 825 830
Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly
835 840 845
Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His
850 855 860
Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met Asn Glu Pro Thr Leu
865 870 875 880
Leu Tyr Met Leu Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro
885 890 895
His Arg Gly Ile Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala
900 905 910
Gly Asn Ala Thr Thr
915
<210>37
<211>533
<212>PRT
<213>猿猴腺病毒SV-39
<400>37
Met Lys Arg Ala Lys Val Glu Glu Gly Phe Asn Pro Val Tyr Pro Tyr
1 5 10 15
Gly Tyr Ser Thr Pro Thr Asp Val Ala Pro Pro Phe Val Ala Ser Asp
20 25 30
Gly Leu Gln Glu Asn Pro Pro Gly Val Leu Ser Leu Lys Ile Ser Lys
35 40 45
Pro Leu Thr Phe Asn Ala Ser Lys Ala Leu Ser Leu Ala Ile Gly Pro
50 55 60
Gly Leu Lys Ile Gln Asp Gly Lys Leu Val Gly Glu Gly Gln Ala Ile
65 70 75 80
Leu Ala Asn Leu Pro Leu Gln Ile Thr Asn Asn Thr Ile Ser Leu Arg
85 90 95
Phe Gly Asn Thr Leu Ala Leu Asn Asp Asn Asn Glu Leu Gln Thr Thr
100 105 110
Leu Lys Ser Ser Ser Pro Leu Lys Ile Thr Asp Gln Thr Leu Ser Leu
115 120 125
Asn Ile Gly Asp Ser Leu Ala Ile Lys Asp Asp Lys Leu Glu Ser Ala
130 135 140
Leu Gln Ala Thr Leu Pro Leu Ser Ile Ser Asn Asn Thr Ile Ser Leu
145 150 155 160
Asn Val Gly Thr Gly Leu Thr Ile Asn Gly Asn Val Leu Gln Ala Val
165 170 175
Pro Leu Asn Ala Leu Ser Pro Leu Thr Ile Ser Asn Asn Asn Ile Ser
180 185 190
Leu Arg Tyr Gly Ser Ser Leu Thr Val Leu Asn Asn Glu Leu Gln Ser
195 200 205
Asn Leu Thr Val His Ser Pro Leu Lys Leu Asn Ser Asn Asn Ser Ile
210 215 220
Ser Leu Asn Thr Leu Ser Pro Phe Arg Ile Glu Asn Gly Phe Leu Thr
225 230 235 240
Leu Tyr Leu Gly Thr Lys Ser Gly Leu Leu Val Gln Asn Ser Gly Leu
245 250 255
Lys Val Gln Ala Gly Tyr Gly Leu Gln Val Thr Asp Thr Asn Ala Leu
260 265 270
Thr Leu Arg Tyr Leu Ala Pro Leu Thr Ile Pro Asp Ser Gly Ser Glu
275 280 285
Gln Gly Ile Leu Lys Val Asn Thr Gly Gln Gly Leu Ser Val Asn Gln
290 295 300
Ala Gly Ala Leu Glu Thr Ser Leu Gly Gly Gly Leu Lys Tyr Ala Asp
305 310 315 320
Asn Lys Ile Thr Phe Asp Thr Gly Asn Gly Leu Thr Leu Ser Glu Asn
325 330 335
Lys Leu Ala Val Ala Ala Gly Ser Gly Leu Thr Phe Arg Asp Gly Ala
340 345 350
Leu Val Ala Thr Gly Thr Ala Phe Thr Gln Thr Leu Trp Thr Thr Ala
355 360 365
Asp Pro Ser Pro Asn Cys Thr Ile Ile Gln Asp Arg Asp Thr Lys Phe
370 375 380
Thr Leu Ala Leu Thr Ile Ser Gly Ser Gln Val Leu Gly Thr Val Ser
385 390 395 400
Ile Ile Gly Val Lys Gly Pro Leu Ser Ser Ser Ile Pro Ser Ala Thr
405 410 415
Val Thr Val Gln Leu Asn Phe Asp Ser Asn Gly Ala Leu Leu Ser Ser
420 425 430
Ser Ser Leu Lys Gly Tyr Trp Gly Tyr Arg Gln Gly Pro Ser Ile Asp
435 440 445
Pro Tyr Pro Ile Ile Asn Ala Leu Asn Phe Met Pro Asn Ser Leu Ala
450 455 460
Tyr Pro Pro Gly Gln Glu Ile Gln Ala Lys Cys Asn Met Tyr Val Ser
465 470 475 480
Thr Phe Leu Arg Gly Asn Pro Gln Arg Pro Ile Val Leu Asn Ile Thr
485 490 495
Phe Asn Asn Gln Thr Ser Gly Phe Ser Ile Arg Phe Thr Trp Thr Asn
500 505 510
Leu Thr Thr Gly Glu Ala Phe Ala Met Pro Pro Cys Thr Phe Ser Tyr
515 520 525
Ile Ala Glu Gln Gln
530
<210>38
<211>50
<212>DNA
<213>人工序列
<220>
<223>寡聚体SV25T
<400>38
aatttaaata cgtagcgcac tagtcgcgct aagcgcggat atcatttaaa 50
<210>39
<211>49
<212>DNA
<213>人工序列
<220>
<223>寡聚体SV25B
<400>39
tatttaaatg atatccgcgc ttaagcgcga ctagtgcgct acgtattta 49