用于生产丁烯基-多杀菌素杀虫剂的生物合成基因 发明概述
本发明提供新的丁烯基-多杀菌素生物合成基因、整合这些生物合成基因的载体以及用这些生物合成基因转化的刺糖多胞菌(Saccharopolyspora)株系,同时还提供使用这些基因去提高多杀菌素样的大环内酯类杀虫剂的产量的方法和利用这些基因或其片段去改变产多杀菌素的刺糖多胞菌株系所产生的代谢产物。
发明背景
天然生产的多杀菌素化合物由与12员环的大环内酯相融合的5,6,5-三环系统、中性糖(鼠李糖)和氨基糖(forosamine)组成(见Kirst等人,1991)。如果这个氨基糖不存在,这些化合物被称作假糖苷配基。如果这个核心糖不存在,这些化合物则被称为反转的(reversepseudoaglycone)。
A83543多杀菌素是由刺糖多胞菌NRRL18395菌株及其衍生物所产生。A83543多杀菌素家族的已知成员以及产生它们的株系已经在美国专利NO.5,362,634;5,202,242;5,840,861;5,539,089以及5,767,253上公开了。这些化合物以字母命名,多杀菌素A、B等。A83543多杀菌素A的结构在表1中给出。这些A83543多杀菌素化合物能够有效的控制蜘蛛类动物、线虫动物和昆虫[尤其是鳞翅目(Lepidoptera)和双翅目(Diptera)种类]。它们有着良好地环境和毒理学性质。
编码指导A83543多杀菌素生物合成的酶的DNA序列已经在美国专利NO.6,143,526中被公开。这些克隆基因和开放阅读框被命名为spnA、spnB、spnC、spnD、spnE、spnF、spnG、spnH、spnI、spnJ、spnK、spnL、spnM、spnN、spnO、spnP、spnQ、spnR、spnS、刺糖多胞菌gtt、刺糖多胞菌gdh、刺糖多胞菌epi和刺糖多胞菌kre。
除了鼠李糖生物合成的基因之外,这些多杀菌素生物合成基因,特别是spnA、spnB、spnC、spnD、spnE、spnF、spnG、spnH、spnI、spnJ、spnK、spnL、spnM、spnN、spnO、spnP、spnQ、spnR和spnS连续地排列在刺糖多胞菌染色体上大约74Kb的区域内。spnA、spnB、spnC、spnD和spnE等基因同负责聚酮化合物生物合成的基因具有相似性,若阻断spnA、spnD或spnE,则不能产生任何多杀菌素产物。A83543多杀菌素合成也涉及到这个内酯核的桥连-在大环内酯产生者中很少见的活性。spnF、spnJ、spnL和spnM基因被认为参与这个生物合成步骤。据报道,spnG、spnH、spnI和spnK基因参与鼠李糖的添加和修饰。而spnN、spnO、spnP、spnQ、spnR和spnS基因则参与forosamine糖的生物合成和添加。负责鼠李糖生物合成的那些基因并不是连续地分布在A83543多杀菌素生物合成基因其它位置上。刺糖多胞菌gtt和刺糖多胞菌kre在一个不同的片段上被克隆,而刺糖多胞菌gdh和刺糖多胞菌。epi则在其它不同片段上被克隆。
最近一种新的生物体-刺糖多胞菌LW107129(NRRL30141)及其衍生菌所产生的另一种多杀菌素-丁烯基-多杀菌素在美国专利申请NO.09/661,065(与WO 01/19840相应)和美国专利申请NO.60/277,601上被公开。在上述申请中有40多个该化合物家族的成员被定义。刺糖多胞菌LW107129(NRRL30141)产生的这种丁烯基-多杀菌素化合物同A83543多杀菌素系列中的化合物不同。这两类多杀菌素之间的主要区别是连在大环上C-21位置处的碳链取代物不同。天然的丁烯基-多杀菌素在C-21位置处被3-4个碳的碳链取代,优选为丁烯基,而天然的A83543多杀菌素在C-21位置处被1-2个碳的碳链取代,优选为乙基。
这些丁烯基-多杀菌素化合物用作反应物生产合成修饰的多杀菌素类化合物,后者在2001年3月21日提交的美国临时专利申请60/277,546“21-丁烯基及相关的多杀菌素的合成衍生物”中所公开。而优选,这些化合物及其合成衍生物用于控制蜘蛛类、线虫类以及昆虫[尤其是鳞翅目和双翅目种类]。
除了C-21上的丁烯基之外,丁烯基-多杀菌素同A83543多杀菌素系列相比还呈现出许多其它差异。表1总结了这些丁烯基-多杀菌素化合物的亚分类以及说明同A83543多杀菌素的不同之处的因素。表1给出了这些丁烯基-多杀菌素的命名,并且根据它们结构首字母缩写词称为“for-rham-I”,“for-rham-II”“for-rham-III”及其衍生物。在这些情况下,I、II和III指代被适当取代的大环内酯结构(I:R4=R5=H;II:R5=CH3,R4=H或OH;III:R5=H,R4=OH),“for”代表C-17位置上的糖(for=forosamine),而“rham”代表C-9位置上的糖(rham=3-O-甲基鼠李糖)。NRRL30141菌株产生的第二种大环内酯结构,具有通式(2),显示有14员环大环内酯环,在下文中被称作IV,而完全糖基化的化合物则被称为“for-rham-V。”式(1)和(2)的丁烯基-多杀菌素化合物用于治理蜘蛛类、线虫类和昆虫[尤其是鳞翅目和双翅目种类],它们对环境无害而且具有吸引人的毒理性质。
这些差别包括:C-21位上的广泛修饰、C-8位上的羟基化和其他的糖,包括中性糖在C-17位上被forosamine所取代。另外,具有与14员环大环稠合的5,6,5-三轮环系统,在C-17和C-9上各自连有forosamine和鼠李糖的化合物曾公开于上述专利申请中。
表1
化合物 名称 分子式 R3 * R4 R5 R8 R9 ** 1 A83543多杀菌素A (1) (3a) H H 乙基 (9a) 2 for-rham-I(丁烯基-多杀菌素) (1) (3a) H H 1-丁烯基 (9a) 3 2”-羟基-for-rham-I (1) (3a) H H 1-丁烯基 (9d) 4 for(3-O-去甲基 (desmethyl)-rham)-I(3-ODM) (1) (3c) H H 1-丁烯基 (9a) 5 for-rham-II (1) (3a) H CH3 1-丁烯基 (9a) 6 for-rham-III (1) (3a) OH H 1-丁烯基 (9a) 7 24,25-脱氢-for-rham-I (1) (3a) H H 1,3-丁二烯 (9a) 8 ami-rham-I (1) (3a) H H 1-丁烯基 (9e) 9 3”-O-甲基-谷氨酸-rham-I (1) (3a) H H 1-丁烯基 (9f) 10 ami-rham-III (1) (3a) OH H 1-丁烯基 (9e) 11 mole-rham-III (1) (3a) OH H 1-丁烯基 (9g) 12 24-去甲基-for-rham-I (1) (3a) H H 1-丙烯基 (9a) 13 rham-I (1) (3a) H H 1-丁烯基 H 14 24-羟基-rham-I (1) (3a) H H 3-羟基-1- 丁烯基 H 15 24-羟基-rham-III (1) (3a) OH H 3-羟基1- 丁烯基 H 16 22,23-二氢-rham-I (1) (3a) H H n-丁基 H 17 (4-N-去甲基-1”,4”-diepi- for)-rham-I (1) (3a) H H 1-丁基 (9h) 18 5”-epifor-rham-I (1) (3a) H H 1-丁烯基 (9i) 19 24,25-脱氢-for-rham-III (1) (3a) OH H 1,3-丁二烯 (9a) 20 24-去甲基-for--rham-III (1) (3a) OH H 1-丙烯基 (9a) 21 for--rham-IV (2) (3a) H H 乙基 (9a) 22 For-(4’-O-去甲基-rham)- (4-ODM) (1A) (3d) H H 1-丁烯基 (9a) 23 for-(3’4’-二-O-去甲基 -rham)-I (1A) (3e) H H 1-丁烯基 (9a)
*R3是具有下列(3a)-(3c)式之一的基团。
**R9是具有(9a)-(9i)式之一的基团。
表1中的化合物1-21由刺糖多胞菌LW107129(NRRL30141)产生,并已在美国专利申请09/661,065(相应于WO01/19840)中公开,32和33号化合物则已公开于2001年3月21日提交的美国临时专利申请60/277,601“大环内酯类杀虫剂”中。
尽管丁烯基-多杀菌素同A83543多杀菌素的结构存在差异,但可以推断出,它们的某些生物合成基因是相似的。但是,正如上面所详细论述的那样,刺糖多胞菌LW107129(NRRL30141)能够产生大量独特的丁烯基-多杀菌素因子和化合物,而这些在A83543多杀菌素中并未观察到。因此,这种生物也必须具有与刺糖多胞菌中的A83543多杀菌素生物合成酶不同的新的生物合成酶。特别是,相对于A83543多杀菌素来说,刺糖多胞菌LW107129(NRRL30141)的丁烯基-多杀菌素生物合成酶一定能通过2个碳原子(在C-21上连接丁烯基而不是乙基)来延伸聚酮化合物链。它们在C-17上必须能合成并连接上另外的氨基糖或中性糖,并且要在C-8和C-24位上发生羟基化。另外,相对于刺糖多胞菌,在刺糖多胞菌LW107129(NRRL30141)中发生的鼠李糖甲基化是不同的。在A83543多杀菌素上显示出鼠李糖甲基化特征改变的刺糖多胞菌阻断突变体(如美国专利5,202,242和5,840,861所公开)能够产生典型的A83543多杀菌素的单去甲基的(mono-desmethylated)鼠李糖衍生物。只有在西萘芬净这样的甲基化酶抑制剂存在时,才能观察到A83543多杀菌素的双去甲基(di-desmethyl)鼠李糖衍生物。甲基化酶抑制剂不存在时,鼠李糖甲基化改变的刺糖多胞菌LW107129(NRRL30141)突变体能够产生大量的丁烯基-多杀菌素双去甲基和三去甲基鼠李糖衍生物。
生产丁烯基-多杀菌素的一大障碍就是生产极少量的丁烯基-多杀菌素即需要很大的发酵体积。含有一或多个丁烯基-多杀菌素生物合成酶的基因的DNA克隆片段可复制基因来提高产率。通过能将大菌素转变成泰乐菌素的限制速度的甲基化转移酶的编码基因(Baltz等,1997)以及复制gtt和gdh基因(Baltz等,2000)可以分别在链霉菌属弗氏链霉菌(Streptomyces fradiae)和刺糖多胞菌的发酵物中提高这类化合物的产率。
克隆的丁烯基-多杀菌素生物合成基因也提供了生产具有不同杀虫剂活性谱的丁烯基-多杀菌素新衍生物的方法。采用重组DNA技术构建的刺糖多胞菌LW107129(NRRL30141)突变株(其中某些丁烯基-多杀菌素生物合成酶的编码基因已被阻断)能够合成特殊的中间物(或它们的天然衍生物)。利用这个策略可以有效地产生生产新的6-脱氧红霉素衍生物的红色糖多胞菌(Saccharopo1yspora erythraea)(Weber和McAlpine,1992)。丁烯基-多杀菌素生物合成基因在其它能够生产类似化合物的生物体,如刺糖多胞菌,中也能表达。以天然的丁烯基-多杀菌素启动子或异源启动子表达时这些基因产生兼有多杀菌素和丁烯基-多杀菌素独特结构性质的新杂合分子。
刺糖多胞菌LW107129(NRRL30141)或刺糖多胞菌的突变株也能合成新的中间体,这些突变株中参与丁烯基-多杀菌素生物合成的酶的编码基因的某些部分被在体外特异性突变的相同基因片段或来自于其它生物的相应基因片段所取代。这个杂合基因将会产生功能已经改变(或者缺乏某种活性或者进行新的酶促转化)的蛋白质。在突变株的发酵产物中将产生新的化学物质。利用这样的方法可构建能够产生新的脱水红霉素衍生物的红色糖多胞菌菌株(Donadio等,1993)。
丁烯基-多杀菌素的生物合成通过逐步缩合和修饰二碳和三碳羧酸前体,产生线性聚酮化合物来进行(图1A),这个聚酮化合物经环化和桥连可生成四环糖苷配基(图1B)。接下来形成假糖苷配基(含有三-氧-甲基化的鼠李糖),然后将二-氮-甲基化forosamine或者其他糖加上去完成这个生物合成过程(图1B)。其它的大环内酯类化合物,如抗生素红霉素、抗寄生虫的除虫霉素和免疫抑制剂纳巴霉素,都以相似的方式合成。在产生这些化合物的细菌中,抗生素是由I型聚酮化合物合成酶(PKS)中的几种大型且多功能的蛋白质催化而成(Donadio等,1991;Ikeda等,1999;Schwecke等,1995)。这些多肽形成由一个起始模块和几个延伸模块所组成的复合体。其中每一个都向正在增长的聚酮化合物链上增加一个特异性的乙酰CoA前体并以特异的方式去修饰这个β-酮基基团(图1A)。因此,聚酮化合物的结构由PKS中模块的组成和次序所决定的。模块包含多个结构域,而每个结构域执行特定的功能。起始区模块含有酰基转移酶(AT)结构域,用于从前体向酰基载体蛋白(ACP)结构域添加酰基基团。这个起始区模块也可能含有KSQ结构域,此结构域同β-酮基合成酶(KS)结构域高度相似,但其上活性位点的半胱氨酸被谷氨酰胺所取代(Bisang等,1999),因此,KSQ不再具有缩合活性。KSQ结构域保留脱羧酶活性并决定起始模块的前体特异性。延伸模块含有AT和ACP结构域以及完整的β-酮基合成酶(KS)结构域,后者通过脱羧缩合向已经存在的聚酮化合物链上增加新的酰基-ACP。其它的结构域也可能存在于每个延伸模块中以完成特定的β-酮基修饰:β-酮基还原酶(KR)结构域将β-酮基还原为羟基,脱水酶(DH)结构域去除羟基并留下双键,以及烯脂酰还原酶(ER)结构域还原上述双键并留下饱和碳。最后的延伸模块以硫酯酶(TE)结构域终止。此结构域以大环内酯形式从PKS上释放这个聚酮化合物。聚酮化合物合成酶主要由3-7个大的开放阅读框所编码(Donadio等,1991;Ikeda等,1999;Schwecke等,1995)。特异功能性聚酮化合物合成酶的组装要求这些蛋白质之间特异性蛋白质-蛋白质相互作用。
活化的大环内酯类抗生素由大环内酯经另外的修饰(如甲基化和还原状态的改变以及添加不寻常的糖)衍生而来。修饰、糖的合成与连接所需的大部分基因均聚集在PKS基因周围。在大环内酯类抗生素,如红霉素和泰乐霉素(Donadio等,1993;Merson-Davies和Cundliffe,1994)和胞外多糖,如沙门氏菌(Salmonella)和耶尔森氏菌(Yersinia)的O-抗原(Jiang等,1991;Trefzer等,1999)的产生者中,编码脱氧糖生物合成酶的基因是相似的。所有这些合成都涉及通过添加核苷二磷酸然后脱水,还原和/或差向立体异构化来活化葡萄糖。产生的脱氧糖可能会接受一或更多其它修饰,如脱氧、转氨和甲基化作用。然后这些糖被特异性糖基化转移酶的作用整合到大环内酯上。参与糖的合成和附着的基因可能紧密地聚集在一起-甚至作为单个的操纵子被转录-或者它们可能会被分散在不同位置(Ikeda等,1999;Shen等,2000;Aguirrzabalaga等,1998)。
这里所使用的术语定义如下:
a.a.-氨基酸。
AmR-阿泊拉霉素抗性赋予基因。
ACP-酰基载体蛋白结构域。
AT-酰基转移酶结构域。
阻断突变株-突变株,其突变阻断生物合成路径的特异的酶功能从而产生前体或别的(shunt)产物。
bp-碱基对。
bus-丁烯基-多杀菌素生物合成基因。
丁烯基-多杀菌素-结构上同A83543多杀菌素(表1)不同的一类发酵产物,在美国申请专利NO.091661,065和临时的美国申请专利NO.60/277,601中已公开。或者是一种由利用全部或大部分丁烯基-多杀菌素基因的微生物所产生的一种类似大环内酯的发酵产物。
丁烯基-多杀菌素基因-编码丁烯基-多杀菌素生物合成所需产品的DNA序列,特别是下文中所描述的基因busA、busB、busC、busD、busE、busF、busG、busH、busI、busJ、busK、busL、busM、busN、busO、busP、busQ、busR和busS,或者它们的功能等价物。
克隆-将DNA片段整合到重组DNA克隆载体并用这个重组DNA转化宿主细胞的过程。
密码偏倚性-使用特异密码子去针对特异的氨基酸的倾向性。对于刺糖多胞菌LW107129(NRRL30141)而言,这种倾向性指使用第三个碱基是胞嘧啶或鸟嘌呤的密码子。
互补-用克隆的基因使突变菌株恢复正常表型。
接合-遗传物质从一个细菌细胞转移到另一个细胞的过程。
cos-噬菌体λ的粘性末端序列。
粘粒-重组DNA克隆载体,其为不仅能在宿主细胞内以与质粒相同的方式复制,还能包装进噬菌体头部的质粒。
DH-脱水酶结构域
ER-烯酯酰还原酶结构域
基因-编码多肽的DNA序列
基因组文库-其中克隆有实质上代表特定生物体中全部DNA序列的DNA片段的一套重组DNA克隆载体。
同源性-序列之间的相似性程度
杂交-两个单链DNA分子退火形成双链DNA分子的过程,碱基可能完全也可能不完全配对。
体外包装-DNA经体外包装在衣壳蛋白内形成病毒样的颗粒,该颗粒能够通过感染的方式将其中的DNA引入宿主细胞中。
Kb-一千碱基对
KR-β-酮基还原酶结构域
KS-β-酮基合成酶结构域
突变形成-在DNA序列上产生变化。它们在体内或体外随机或有目的地产生。突变可能是沉默的,也可能会导致翻译产物的氨基酸序列的改变,结果此产物蛋白的性质发生改变并形成突变表型。
ORF-开放阅读框
ori-质粒上复制(oriR)或转移(oriT)的起始点。
%同源性-比较两个序列时,由BLAST程序所给出的同源性百分比。
%相似性-比较两个序列时,由BLAST程序所给出的相似性百分比。
PCR-聚合酶链式反应-特异扩增DNA某一区域的方法。
PKS-聚酮化合物合成酶
启动子-指导转录起始的DNA序列。
重组DNA克隆载体-任何能够自主复制或整合的工具,包括但并不限于质粒,一个或更多其它DNA分子能够被或已经被加到载体自身含有的DNA分子上。
重组DNA方法-用于创造、鉴定并修饰已克隆到重组DNA载体上的DNA片段的方法。
限制性片段--由一或多个限制性酶作用产生的线性DNA分子。
多杀菌素-也被称为A83543的发酵产物,或者由微生物利用全部或大部分A83543多杀菌素基因产生的类似A83543的大环内酯发酵产物。其典型分子特征为由5,6,5三环与12元的大环内酯环融合构成的稠环结构,在21碳原子位置上连有一个1~2个碳原子的碳链,同时其分子骨架上还连有一个中性糖(鼠李糖)和一个氨基糖(forosamine)。
多杀菌素基因--编码A83543生物合成所需产物的DNA序列。具体基因为spnA、spnB、spnC、spnD、spnE、spnF、spnG、spnH、spnI、spnJ、spnK、spnL、spnM、spnN、spnO、spnP、spnQ、spnR、spnS、刺糖多胞菌gtt,刺糖多胞菌gdh、刺糖多胞菌epi、和刺糖多胞菌kre及其功能等价物(如下文描述)。
spn为A83543生物合成基因。
亚克隆为带有插入DNA的克隆载体,该DNA来自于另一个与之大小相同或大于它的DNA。
TE为硫酯酶结构域。
转结合子为通过接合作用形成的重组菌株。
附图简述
图1中1A和1B阐明丁烯基-多杀菌基生物合成的示意图。
图2阐明HindII、EcoRV、和ScaI片段及开放阅读框在刺糖多胞菌LW107129(NRRL30141)的被克隆区中的排列顺序。
图3是粘粒Cosmid pOJ436的功能图及限制性位点。
图4阐明17-(4”-O-甲基夹竹桃糖)-丁烯基-多杀菌素[图1中的化合物(11)]的生物合成途径。
发明简述
本发明克隆了丁烯基-多杀菌素的生物合成基因和相关的ORFs,并测定了它们的DNA序列。它们在下文中分别指busA、busB、busC、busD、busE、busF、busG、busH、busI、busJ、busK、busL、busM、busN、busO、busP、busQ、busR、busS和ORF LI、ORF LII、ORF LIII、ORF LIV、ORF LVI、ORF LVII、ORF LVIII、ORF LIX、ORF RI、ORF RII和ORF RIII等基因。图1和下文的讨论证实了这些克隆基因在多杀菌素生物合成中的功能。
一方面,本发明提供含有编码丁烯基-多杀菌素生物合成酶DNA序列的分离的DNA分子。该酶的氨基酸序列由选自于SEQ ID NO:3-7和8-29所定义,或者由其中一或几个氨基酸发生替换而不改变所编码酶功能的上述氨基酸序列所决定。在优选的实施方案中,该DNA序列选自busA、busB、busC、busD、busE、ORF RI、ORF RII、ORF RIII、busF、busG、busH、busI、busJ、busK、busL、busM、busN、busO、busP、busQ、busR、busS、ORF LI、ORF LII、ORF LIII、ORF LIV、ORF LVI、ORF LVII、ORF LVIII和ORF LIX等基因,所述基因分别由SEQ ID NO:1中的碱基1-13032、13059-19505、19553-29053、29092-43890、43945-60636、62090-63937、65229-66602、68762-69676,和SEQ ID NO:2中的碱基114-938、1389-2558、2601-3350、3362-4546、4684-6300、6317-7507、7555-8403、8640-9569、9671-10666、10678-12135、12867-14177、14627-15967、16008-17141、17168-17914、18523-19932、19982-20488、20539-21033、21179-21922、22674-23453、23690-24886、26180-26923、27646-28473所描述。
另一方面,本发明提供含有编码丁烯基-多杀菌素PKS结构域的DNA序列的分离的DNA分子。该结构域选自KSi、ATi、ACPi、KSb、ATb、KRb、DHb、ACPb、KS1、AT1、KR1和ACP1等结构域。这些结构域各自由SEQ ID NO:3的氨基酸6-423、528-853、895-977、998-1413、1495-1836、1846-2028、2306-2158、2621-2710、2735-3160、3241-3604、3907-4086和4181-4262所描述。在优选的实施方案中,该DNA序列选自SEQ ID NO:1中的碱基16-1269、1582-2559、2683-2931、2992-4239、4483-5508、5538-6084、6916-7554、7861-8130、8203-9480、9721-10812、11719-12258和12541-12786。
另一方面,本发明提供含有编码多杀菌素PKS结构域的DNA序列的分离的DNA分子。该结构域选自KS2、AT2、DH2、ER2、KR2和ACP2,而这些结构域各自由SEQ ID NO:4的氨基酸1-421、534-964、990-1075、1336-1681、1685-1864和1953-2031所描述。在优选的实施方案中,该DNA序列选自SEQ ID NO:1中的碱基13059-14321、14658-15900、16026-16283、17064-18100、18111-18650和18915-19151。
另一方面,本发明提供含有编码多杀菌素PKS结构域的DNA序列的分离的DNA分子。该结构域选自KS3、AT3、KR3、ACP3、KS4、AT4、KR4和ACP4,而这些结构域各自由SEQ ID NO:5的氨基酸1-421、528-814、1157-1335、1422-1503、1526-1949、2063-2393、2697-2877、2969-3049所描述。在优选的实施方案中,该DNA序列选自SEQ ID NO:1中的碱基19553-20815、21143-22000、23021-23557、23816-24061、24128-25399、25739-26731、27641-28183和28457-28699。
另一方面,本发明提供含有编码多杀菌素PKS结构域的DNA序列的分离的DNA分子。该结构域选自KS5、AT5、DH5、KR5、ACP5、KS6、AT6、KR6、ACP6、KS7、AT7、KR7和ACP7,而这些结构域各自由SEQ IDNO:6的氨基酸1-422、537-864、891-1076、1382-1563、1643-1724、1746-2170、2281-2611、2914-3093、3186-3267、3289-3711、3823-4151、4342-4636和4723-4804所描述。在优选的实施方案中,该DNA序列选自SEQ ID NO:1中的碱基29092-30357、30700-31683、31762-32319、33235-33780、34018-34263、34327-35601、35932-36924、37831-38370、38647-38892、38956-40224、40560-41544、42115-42999和43258-43503。
另一方面,本发明提供含有编码多杀菌素PKS结构域的DNA序列的分离的DNA分子。该结构域选自KS8、AT8、DH8、KR8、ACP8、KS9、AT9、DH9、KR9、ACP9、KS10、AT10、DH10、KR10、ACP10和TE10,而这些结构域各自由SEQ ID NO:7的氨基酸1-424、530-848、885-1072、1371-1554、1650-1728、1751-2175、2289-2616、2642-2775、3131-3315、3396-3474、3508-3921、4036-4366、4389-4569、4876-5054、5148-5229和5278-5531所描述。在优选的实施方案中,该DNA序列选自SEQ ID NO:1中的碱基43945-45216、45532-46488、46597-47160、48055-48606、48892-49083、49195-50469、50809-51792、51868-52269、53335-53889、54130-54366、54466-55707、56050-57042、57109-57651、58570-59106、59386-59631和59776-60537。
另一方面,本发明提供含有编码多杀菌素PKS模块的DNA序列的分离的DNA分子。该模块选自SEQ ID NO:3中的碱基6-997、SEQ IDNO:3中的碱基998-2710、SEQ ID NO:3中的碱基2735-4262、SEQ ID NO:4中的碱基1-2031、SEQ ID NO:5中的碱基1-1503、SEQ ID NO:5中的碱基1526-3049、SEQ ID NO:6中的碱基1-1724、SEQ ID NO:6中的碱基1746-3267、SEQ ID NO:6中的碱基3289-4804、SEQ ID NO:7中的碱基1-1728、SEQ ID NO:7中的碱基1751-3474、SEQ ID NO:7中的碱基3508-5531。在优选的实施方案中,该DNA序列选自SEQ ID NO:1中的碱基16-2931、2992-8130、8203-12786、13059-19151、19553-24061、24128-28699、29092-34263、34327-38892、38956-43503、43945-49083、49195-54366和54466-60537。
另一方面,本发明提供重组DNA载体,其含有本发明上述的DNA序列。
另一方面,本发明提供用本发明上述的重组载体转化的宿主细胞。
另一方面,本发明提供提高多杀菌素生产菌生产多杀菌素能力的方法。具体步骤如下:
1)用重组DNA载体或其部分转化微生物,该微生物可借助生物合成途经生产丁烯基-多杀菌素或其前体。该载体或其部分含有本发明上述的DNA序列,该序列编码上述途经中限速酶活性的表达。
2)在适合于细胞的生长和分裂、适合上述DNA序列的表达以及多杀菌素产生的条件下,对用本发明的重组DNA载体转化的微生物进行培养。
另一方面,本发明提供了生产多杀菌素的微生物,该微生物含有可操作的丁烯基-多杀菌素生物合成基因,其中busA、busB、busC、busD、busE、busF、busG、busH、busI、busJ、busK、busL、busM、busN、busO、busP、busQ、busR和busS这些基因中至少有一个被复制。
另一方面,本发明提供了生产丁烯基-多杀菌素的微生物,该微生物在其基因组中含有丁烯基-多杀菌素生物合成基因。在这些基因中至少有一个被失活,剩下的基因可操作性产生那个失活基因的表达产物以外的丁烯基-多杀菌素。优选微生物是刺糖多胞菌LW107129(NRRL30141)或刺糖多胞菌突变菌株。更优选微生物是刺糖多胞菌LW107129(NRRL30141)的突变菌株。
本发明也提供在正常情况下不产生丁烯基-多杀菌素的生物体中表达丁烯基-多杀菌素生物合成基因。这些基因可以在天然的bus基因的启动子或与受体菌兼容的异源启动子控制下表达。优选生物体能产生多杀菌素类化合物,更优选的微生物是刺糖多胞菌或其衍生物。
本发明也提供了生产丁烯基-多杀菌素的微生物,其基因组中含有可操作的丁烯基-多杀菌素生物合成基因。其中所说的基因:
a)包含的PKS模块比SEQ ID NO:1中存在的至少多一个或至少少一个;
b)含有由于缺失、失活或增加KR、DH或ER结构域或由于替换AT结构域而与SEQ ID NO:1中相应模块不同的PKS模块。优选的微生物是刺糖多胞菌LW107129(NRRL30141)突变菌株。
本发明也提供通过培养本发明的新型微生物而产生的丁烯基-多杀菌素。
另一方面,本发明提供分离丁烯基-多杀菌素生物合成基因的方法。该方法包括构建丁烯基-多杀菌素产生菌的基因组文库,并使用SEQID NO:1或者SEQ ID NO:2的至少二十个碱基的标记的核酸片断作为杂交探针。
本领域的技术人员都应理解:在要求保护的氨基酸序列上进行替代而在本质上不改变蛋白质的基本功能是可行的。因此,本发明也包括通过如此变异得到的氨基酸序列和相应编码变体的DNA序列。优选,氨基酸序列是那些本质上具有相同功能并且同天然的氨基酸序列有98%以上同一性的氨基酸序列。
发明详述
作为丁烯基-多杀菌素基因使用和表征的先决条件,必须分离和鉴定编码涉及这一杀虫剂生物合成的相关酶基因。在随后的实施例1中描述的方法涉及基因组粘粒文库的构建和通过DNA杂交进行的后续筛选。
实施例1
a.刺糖多胞菌LW107129(NRRL30141)细胞总DNA的分离
刺糖多胞菌LW107129(NRRL30141)被接种于100mL生长培养基中(9.0g/L右旋糖,30g/L trypticase大豆汤,3.0g/L酵母提取物,2.0g/L MgSO4·7H2O)上述培养基被置于500mL锥形瓶中,30℃,150rpm振荡培养72小时;然后4℃,3000rpm离心10分钟以得到沉淀的细胞;去除上清液,沉淀用20mL TE缓冲液(10mM Tris/HCl,pH8.0;1mM EDTA,pH8.0)清洗,再以3000rpm离心,并将沉淀置于-20℃冻存。进行细胞总DNA分离时再把它解冻。
刺糖多胞菌LW107129(NRRL30141)细胞总DNA利用基因组DNA纯化试剂盒(Qiagen Inc.,Valencia,CA)提取。来自于100mL培养物的冷冻沉淀细胞在11mL B1缓冲液(50 mM Tris/HCl,pH8.0;50mM EDTA,pH8.0;0.5%Tween 20,0.5%Triton X-100)中被重新悬浮,该缓冲液含有11μL Qiagen公司产的RNA酶A溶液(100mg/mL)。然后向这一悬液中加入300μL溶菌酶贮存液(100mg/mL;Sigma Chemical Co.,St.Louis,MO)和500μL蛋白酶K贮存液(50mg/mL;Sigma Chemical Co.),涡旋混合并在37℃温育30分钟,然后向细胞裂解物中加入4mLB2缓冲液(3M盐酸胍;20%Tween 20),在管中轻柔颠倒混合,然后该细菌裂解物在50℃温育30分钟,细菌裂解物中的总DNA利用Qiagen Genomic-tip 500/Gtips按照厂商的推荐使用说明提取出来。所得的纯化的DNA溶解在5mLTE缓冲液中并在4℃贮存。
b.基因组粘粒文库的构建
从刺糖多胞菌LW107129(NRRL30141)分离出的细胞总DNA[按照Ausubel等编写的现代分子生物学实验方法(John Wiley and Sons,Inc.,New York,NY)3.1.3节所述操作]用Sau 3A I酶进行部分消化。利用小规模反应(80μL反应体系中含有40μg细胞总DNA)来选择合适的消化酶和细胞总DNA的比例,从而能够将部分消化的DNA片断最大程度集中在25~50Kb范围内。然后反应物在65℃加热15分钟以使Sau 3A I酶失去活性,用3%的琼脂糖电泳分析等分的反应物,来检测部分消化的DNA片断在所需范围内的相对丰度。一旦选择好合适的消化酶与细胞总DNA的比例,反应体系可加大以获得足够量的部分消化的细胞总DNA,该DNA片断在粘粒文库的构建中用作为插入的DNA片断。一般规模的反应是将400μg刺糖多胞菌LW107129(NRRL30141)细胞总DNA同9个单位的Sau3A I(Gibco BRL,Gaithersburg,MD)在1×的React4缓冲液中(厂商提供10×母液)以总体积800μl于37℃温育15分钟,然后65加热20分钟以使Sau 3A I酶失去活性,部分消化的细胞总DNA用相同体积已平衡的苯酚-氯仿(50∶50;v/v)溶液混合,轻柔颠倒使之充分混合,然后14000g离心15分钟,取出水相。用等体积的氯仿-异戊醇溶液(24∶1,v∶v)与水相混合,轻柔颠倒使两项混合,14000g离心15分钟,水相移至1个新管中并加入0.1体积3M乙酸钠(pH5.2),然后再加2倍体积冰冷的100%乙醇,轻柔颠倒使之混合,为了帮助DNA沉淀下来,样品要在-70℃过夜,DNA沉淀通过14000g离心20分钟而得到。DNA被重悬于50μL双蒸水中并在-20℃贮存。
用于构建粘粒文库的载体为含用于筛选的阿泊拉霉素抗性基因的pOJ436(图3),为了最大程度避免粘粒载体DNA同自身的重新连接,在1.2mL总体积的1×SAP缓冲液(厂商提供10×母液)中将酶切过的DNA同20个单位的虾碱性磷酸酶(Roche/BoehringerMannheim,Indianapolis,IN)在37℃温育2小时,以使Bam HI酶切的POJ436 DNA发生去磷酸化作用,Sau 3A I消化的基因组DNA被连接到POJ436去磷酸化的Bam HI位点上,并用使部分消化的DNA与载体DNA为5∶1的比率。对于这个反应来说,插入序列同载体DNA在20单位T4 DNA连接酶(New England Biolabs Inc.,Beverly,MA)的作用下在1×的T4 DNA连接酶缓冲液(厂商提供10×母液)体系中16℃反应过夜。然后使用Gigapack III Gold Packaging Extract(Stratagene,La Jolla,CA)将连接混合物包装,使用大肠杆菌菌株DH5α-MRC+细胞(Gibco BRL)按照厂商的推荐使用说明对重组噬菌体进行滴度测量。将宿主细胞培养物和重组噬菌体的等份(20~40μL)涂布在含有100mg/L阿泊拉霉素(SigmaChemical Co.)的LB琼脂平板上(10g/L细菌-胰蛋白胨,10g/L氯化钠,5g/L细菌-酵母提取物,15g/L细菌琼脂;Difco实验室),37℃过夜培养。为了建立用于冻存的粘粒文库主平板,用无菌的牙签将单个克隆分别挑至无菌96孔板的每一孔中的微平板上[含250μL TB培养基(Terrific Broth):12g/L细菌-胰蛋白胨,24g/L Bacto-yeastextract,0.4%v/v甘油,17mM KH2PO4,72mM K2HPO4,100mg/L阿泊拉霉素]。并在37℃无振荡条件下过夜培养。为了从主平板产生复制平板,将96孔板复制器用于温育含250μL TB培养基(含100mg/L阿泊拉霉素)的无菌96孔微板,在37℃无振荡条件下过夜培养。
无论是主平板还是复制平板,都要用多通道移液器向平板或培养物中加入7%(v/v)的二甲基亚砜溶液并混合,平板在-70℃贮存。
确定要选择的重组粘粒插入基因的平均大小。首先利用NucleoSpin核酸纯化试剂盒(CLONTECH Laboratories,Inc.,PaloAlto,CA)分离出粘粒DNA,然后用20单位的限制性酶EcoRI(New EnglandBiolabs)37℃消化回收的DNA 1小时,在1.0%的琼脂糖凝胶电泳上分析限制性DNA片断,该DNA片断经0.5%溴化乙锭(Sigma Chemical Co.)染色后在紫外灯下即可看到。其相对大小可通过同1Kb的DNA梯(GibcoBRL)比较而得知。构建的粘粒文库的插入片段大小范围为20~40Kb。
C.筛选粘粒文库并鉴定含有丁烯基-多杀菌素物合成基因的粘粒
用96孔板复制器接种每种大肠杆菌(E.coli)粘粒克隆的代表双份到Hybond N+(Amersham Pharmac ia Biotech,Piscataway,NJ)核酸结合膜上。将膜铺于含有100mg/L阿泊拉霉素的LB琼脂板上,于37℃温育过夜。按照厂商推荐方法处理膜。接种过的膜以菌落面向上置于用0.5N NaOH饱和的3MM滤纸(Whatman,Clifton,NJ)上1分钟,然后将其转到用变性液(1M Tris-HCl,PH7.6)饱和的第二张滤纸上1分钟使DNA变性。将滤膜转到已经在中和液(1M Tris-HCl,PH7.6/1.5M NaCl)中饱和的第三张滤纸上1分钟以中和该膜。最后在1MTris-HCl,pH7.6/1.5M NaCl的溶液中洗涤。用紫外交联装置(UV Stratalinker 1800,Stratagene)在1200μJ的条件下将DNA固定到膜上。
用基于来自于刺糖多胞菌的spn基因制备的3个放射性标记的DNA探针去筛选上述已制备好的重组细菌文库(Baltz等,2000,表2)。利用聚合酶链式反应(PCR)技术,用寡核苷酸对去扩增spn生物合成基因簇的特异核酸序列。寡核苷酸引物在394 DNA/RNA合成仪(AppliedBiosystem/PekinElmer,Foster City,CA)上合成(见表2)。PCR反应遵照厂商推荐的方法,利用AmpliTaq_DNA聚合酶试剂盒(PekinElmer/Roche,Branchburg,NJ)进行。DNA片段扩增在48个样品的DNA热循环仪(Pekin Elmer Centus)中完成,循环条件如下:1)94℃,1分钟;55℃,2分钟;72℃,3分钟;25个循环。2)72℃,10分钟;1个循环。通过0.1%琼脂糖凝胶电泳来检测扩增产物,而相应分子量的目的条带从凝胶上提取出来,遵循厂商推荐采用QiagenII凝胶提取试剂盒(Qiagen公司)进行。
表2刺糖多胞菌基因探针探针长度(bp)正向引物 反向引物 spnS 499 SEQ ID NO.33 SEQ ID NO.34 spnF 536 SEQ ID NO.35 SEQ ID NO.36 spnE(TE) 506 SEQ ID NO.37 SEQ ID NO.38
将上述膜置于65℃温育3小时,向300ml预杂交液中加入放射性标记探针,预杂交液成分组成为:6×SSC(52.59g/L NaCl,24.66g/L柠檬酸钠,用10N NaOH将pH调到7.0),0.1%十二烷基磺酸钠(SDS),10×Denhardt氏液(50mg/L Ficoll[Type 400,Pharmacia],5.0mg/L聚乙烯吡咯烷酮,5.0mg/L牛血清白蛋白),100μg/L变性鲑鱼精。
用于制备探针的DNA片段浓度都调到25ng,沸水浴变性10分钟。按照厂商推荐的随机引物法,用4μl High Prime反应混合物(Boehringer Mannheim)中,以50μCi[α32P]的dCTP(比活性3000Ci/mMol)随机引发标记。利用NucTrap Push Column(Stratagene)分离放射性标记的探针与未掺入的核苷酸。将探针于沸水加热10分钟变性,然后加到预杂交膜上。大约有2.0×107cpm探针被加到膜上来进行所有DNA杂交。所有探针杂交条件均为:65℃,水浴振荡,16小时。
将含有放射性标记探针spnF,spnS和spnE(TE)的杂交溶液倾倒在膜上。每组膜都要在中度严紧条件下清洗:1)15分钟,室温,300ml3×SSC/0.5%SDS;2)30分钟,65℃,振荡,300ml新鲜的3×SSC/0.5%SDS;3)30分钟,室温,300ml 1×SSC/0.5%SDS。用来自于刺糖多胞菌LW107129(NRRL30141)粘粒9D3序列的放射性标记的探针进行膜筛选,在严紧条件下清洗:1)30分钟,65℃,振荡,300ml新鲜的1×SSC/0.5%SDS洗液;2)30分钟,65℃,振荡,300ml新鲜的0.33×SSC/0.5%SDS洗液;3)30分钟,65℃,振荡,300ml新鲜的0.1×SSC/0.5%SDS溶液。使用手动控制的Geiger-Mueller计数机监测这些滤膜以确定背景同位素干扰是否最低。将膜置于3MM滤纸上并用塑料袋罩上,对X底片曝光。在冲洗前膜可在-70℃曝光24-72小时。
通过限制性内切酶消化分析和粘粒载体的末端测序进一步鉴定推定的阳性粘粒克隆。使用NucleoSpin核酸纯化试剂盒(CLONTECHLaboratories,Inc.,Palo Alto,CA)分离粘粒DNA,并用20单位的限制酶EcoRI(New Enfland BioLabs)在37℃消化1小时。在1.0%琼脂糖凝胶中进行限制性DNA电泳。用溴化乙锭(EB)染色后,可以在紫外光下观察到DNA片段,其相对大小可通过与1KbDNA梯相比较而得知。另外,根据Burgett和Rosteck(1994)的方法,通过荧光循环测序法可以测知来自于粘粒/载体接头的刺糖多胞菌LW107129(NRRL30141)核苷酸序列。由3μl(2μg纯化的粘粒DNA)模板、1μl通用引物(4pmole)或反向引物(4pmole)、8μl Big Dye_反应混合器、1μlDMSO和7ml H2O构成的测序反应采用377ABIPrismTM测序仪(Applied Biosystem,Inc)在下述热循环条件下进行:96℃,30秒;50℃,15秒;60℃,4分钟;25个循环。
经鉴定,与刺糖多胞菌探针spnF,spnS和spnE(TE)阳性杂交的有8个粘粒克隆。粘粒8H3是与spnS和spnE(TE)探针都能杂交的两个克隆之一。粘粒9D3是仅与探针spnF杂交的三个克隆之一。粘粒10C1是仅与spnF探针杂交的3个克隆之一。来自粘粒9D3(SEQ ID NO:1上的碱基297477-30163)粘粒/载体末端核苷酸测序结果的刺糖多胞菌LW107129(NRRL30141)序列的放射性标记的PCR片段与粘粒9F4杂交,由此从基因组文库鉴定出:粘粒9F4。在粘粒9D3 DNA序列(SEQ ID NO:39和SEQ ID NO:40)的基础上合成出了两个引物。按照上面所描述的方法使用这些引物从刺糖多胞菌LWl07129(NRRL30141)基因组DNA扩增出416bp的DNA片段并用于杂交。
通过对噬菌体M13(SeqWright,Houston,TX)中克隆的随机DNA片段进行荧光循环测序,可以测得粘粒8H3、9D3、9F4和10C1的完整序列。8H3和9D3的插入片段重叠,9D3和9F4的插入片段重叠,而9F4和10C1的插入片段重叠(见图2)。综合起来,这4个粘粒的插入片段跨越了特异序列上111个Kb(SEQ ID NO:1和2)。SEQ ID NO.1包括busA的起始密码子及到其3’端的全部DNA(见图2)。SEQ ID NO.2开始自busA起始密码子之前的碱基并包括到这个碱基5’端的全部DNA。表3给出了SEQ ID NO.1和SEQ ID NO.2在这4个插入片段各自中的部分。
表3插入片段插入片段大小(碱基对)SEQ ID NO:1上的碱基SEQ ID NO:2上的碱基粘粒8H3 40,364 1-3,826 1-36,538粘粒9D3 31,743 1-30,200 1-1,543粘粒9F4 36,935 17,437-54,372 无粘粒10C1 40,618 34,624-75,242 无
图2给出了4个插入片段同110Kb序列之间的关系。
PKS基因
SEQ ID NO.1包括一个大约60Kb的中心区域,此区域与编码已知的大环内酯生产者(Donadio等,1991;McDaniel和Katz,,2001;Dehoff等,1997)聚酮化合物合成酶的DNA具有显著的同源性。丁烯基-多杀菌素PKS DNA区域在ACP结构域末端包含5个与其它产大环内酯细菌的PKS开放阅读框(ORF)相似的带有框内终止密码子的开放阅读框(ORF)。这5个丁烯基-多杀菌素PKS基因呈首尾相对排列(见图2),而并不干涉非PKS区功能,例如在红霉素PKS基因AI和AII(Donadio等,1993)之间发现的插入成分。PKS基因被命名为busA,busB,busC,busD和busE。表4中给出了对应于5个多杀菌素PKS基因各自的核酸序列及其相应多肽:
表4 基因SEQ ID NO:1上的碱基相应多肽 busA 1-13,032 SEQ ID NO:3 busB 13,059-19,505 SEQ ID NO:4 busC 19,553-29,053 SEQ ID NO:5 busD 29.092-43,890 SEQ ID NO:6 busE 43,945-60,636 SEQ ID NO:7
busA编码起始区模块(SEQ ID NO:1上的碱基1-2931),延伸区模块b(SEQ ID NO:1上的碱基2992-8130)和延伸区模块1(SEQ ID NO:1上的碱基8205-13032)。表5给出了在起始区模块和延伸区模块b及1内的每一个功能结构域的核苷酸序列和相应氨基酸序列。
表5 busA结构域SEQ ID NO:1中的碱基SEQ ID NO:3中的氨基酸 KSi 16-1269 6-423 ATi 1582-2559 528-853 ACPi 2683-2931 895-977 KSb 2992-4239 998-1413 ATb 4483-5508 1495-1836 DHb 5538-6084 1846-2028 KRb 6916-7554 2306-2518 ACPb 7861-8130 2621-2710 KS1 8203-9480 2735-3160 AT1 9721-10812 3241-3604 KR1 11719-12258 3907-4086 ACP1 12541-12786 4181-4262
busB编码延伸区模块2(SEQ ID NO:1中的碱基13059-19505)。表6给出了在延伸区模块2范围内每一个功能结构域的核苷酸序列和相应氨基酸序列:
表6 busB结构域SEQ ID NO:1中的碱基SEQ ID NO.4中的氨基酸 KS2 13059-14321 1-421 AT2 14658-15900 534-964 DH2 16026-16283 990-1075 ER2 17064-18101 1336-1681 KR2 18111-18650 1685-1864 ACP2 18915-19151 1953-2031
busC编码延伸区模块3(SEQID NO:1中的碱基19553-24061)和延伸区模块4(SEQID NO:1中的碱基24128-29053)。表7给出了在延伸区模块3和4范围内每一个功能结构域的核酸序列和相应氨基酸序列:
表7 busC结构域SEQ ID NO:1中的碱基SEQ ID NO.5中的氨基酸 KS3 19553-20815 1-421 AT3 21134-22000 528-814 KR3 23021-23557 1157-1335 ACP3 23816-24061 1422-1503 KS4 24128-25399 1526-1949 AT4 25739-26731 2063-2393 KR4 27641-28183 2697-2877 ACP4 28457-28699 2969-3049
busD编码延伸区模块5(SEQ ID NO:1中的碱基29092-34263),延伸区模块6(SEQ ID NO:1中的碱基34327-38892)和延伸区模块7(SEQ ID NO:1中的碱基38956-43503)。表8给出了在延伸区模块5、6和7范围内每一个功能结构域的核酸序列和相应氨基酸序列:
表8 busD结构域SEQ ID NO:1中的碱基SEQ ID NO:6中的氨基酸 KS5 29092-30357 1-422 AT5 30700-31683 537-864 DH5 31762-32319 891-1076 KR5 33235-33780 1382-1563 ACP5 34018-34263 1643-1724 KS6 34327-35601 1746-2170 AT6 35932-36924 2281-2611 KR6 37831-38370 2914-3093 AC6b 38647-38892 3186-3267 KS7 38956-40224 3289-3711 AT7 40560-41544 3823-4151 KR7 42115-42999 4342-4636 ACP7 43258-43503 4723-4804
spnE编码延伸区模块8(SEQ ID NO:1中的碱基43945-49083),延伸区模块9(SEQ ID NO:1中的碱基49195-54366)和延伸区模块10(SEQ ID NO:1中的碱基54466-60707)。表9给出了在延伸区模块8、9和10范围内每一个功能结构域的核酸序列和相应氨基酸序列:
表9 busE结构域SEQ ID NO:1中的碱基SEQ ID NO:7中的氨基酸KS8 43945-45216 1-424 AT8 45532-46488 530-848 DH8 46597-47160 885-1072 KR8 48055-48606 1371-1554 ACP8 48892-49083 1650-1728 KS9 49195-50469 1751-2175 AT9 50809-51792 2289-2616 DH9 51868-52269 2642-2775 KR9 53335-53889 3131-3315 ACP9 54130-54366 3396-3474 KS10 54466-55707 3508-3921 AT10 56050-57042 4036-4366 DH10 57109-57651 4389-4569 KR10 58570-59106 4876-5054 ACP10 59386-59631 5148-5229 TE10 59776-60537 5278-5531
基于同其它聚酮合成酶结构域上保守氨基酸序列的相似性,在上述的表7-11中所鉴定的55个结构域的界限和功能根据同其他聚酮化合物合成酶预测,尤其是红霉素聚酮合成酶中的保守氨基酸的相似性,(Donadio等,1992)。与A83543多杀菌素PKS相同,busPKS在起始区模块的氨基酸末端有一个KSQ结构域。此结构域不能作为β-酮基合成酶发挥作用,因为在第172个氨基酸处,它包含谷氨酸残基,代替了β-酮基合成酶活性所需的半胱氨酸(Siggard-Andersen,1993)。据报道,具有使丙二酰-ACP脱羧功能的KSQ结构域是链的起始因子(Bisang等,1999)。其它的丁烯基-多杀菌素PKS结构域也具有功能。它们当中没有一个具有在红霉素和雷怕霉素PKS基因中发现的无活性结构域的序列特征(Donadio等,1991;Aparicio等,1996)。
尽管在大小上busB-E同spnB-E相当,但busA仍然比spnA大5,244bp。前者开头的4245bp和末尾的3,486bp同后者有很高的相似性。但是,碱基4246-9548与spnA基因没有对应部分。这5Kb的区域编码的是另一个带有5个功能结构域的模块:KSb,ATb,DHb,KRb和ACPb。这些区域与上述起始结构域一起共同负责生物合成丁烯基侧链,这是丁烯基-多杀菌素相对于A83543多杀菌素的特征基团。克隆的bus PKS基因busB、busC、busD和busE显示与A83543多杀菌素PKS基因spnB、spnC、spnD和spnE的相似性(表10)(Baltz等,2000)。
表10丁烯基-多杀菌素基因busORF长度bp(氨基酸)功能结构域在A83543多杀菌素PKS中的最佳匹配spn ORF长度bp(氨基酸)功能结构域ORF%同一性 (DNA)ORF同一性(a.a.) busA 13032(4344) spnA 7788(2595) 1-4245 4245(1415)KSQ-KSb 21111-25214 4245(1415)KSQ-KS1 92%91.2% 4246-9548 5301(1767)ATb-KS1 无* NA 9549-13032 3486(1162)AT1-ACP1 26407-28896 3486(1162)AT1-ACP1 91%87.6% busB 6450(2149)KS2-ACP2 spnB 6459(2152)KS2-ACP2 93%93.1% busC 9546(3167)KS3-ACP4 spnC 9513(3170)KS3-ACP4 94%93.5% busD 14805(4935)KS5-ACP7 spnD 14787(4928)KS5-ACP7 94%93.6% busE 16692(5564)KS8-ACP10 spnE 16767(5588)KS8-ACP10 94%90.6%
*与刺糖多胞菌PKS基因和与bus和spn基因的其它类似结构域的相似性程度相当。
在多杀菌素的生物合成过程中进行相似反应的蛋白质具有87-93%的氨基酸同一性而基因具有93-94%的DNA序列同一性。应该注意到,spn PKS酶SpnB-E与相似的busPKS酶必须维持底物特异性,因为尽管这些酶所完成的反应相同,但聚酮化合物底物不同。另外,5个PKS酶聚集成1个PKS需要特异的蛋白质-蛋白质相互作用。参与这种亚基间分子识别的残基是未知的,可能并不是刺糖多胞菌和刺糖多胞菌LW107129(NRRL30141)中保守的残基。
与PKS的基因负责额外的修饰
在PKS基因(克隆于粘粒8H3中)的DNA上游区存在22个开放阅读框(ORF)。每一个都至少含有100个密码子,并且以ATG或GTG开始,以TAA,TAG或TGA结束。并且具有密码偏倚性,即其DNA含有高百分比的鸟嘌呤和胞嘧啶残基的生物体内的蛋白编码区将会有偏倚性(Bibb等,1984)。这22个开放阅读框(ORF)在图2中以图表的方式给出。根据将在下文中被讨论的证据,开放阅读框中的14个已经被认为是丁烯基-多杀菌素生物合成基因,分别命名为:busF,、busG、busH、busI、busJ、busK、busL、busM、busN、busO、busP、busQ和busS(图2标记为从F~S)。下文表11中,这14个基因及在spnS基因下游发现的开放阅读框(粘粒8H3中ORF LI、ORF LII、ORF LIII、ORF LIV、ORFLVI、ORF LVII、ORF LVIII和ORF LIX)的相应的多肽氨基酸序列及DNA序列都已经鉴定,表11也给出了PKS基因的下游ORF RI、ORF RII、ORFRIII的DNA序列以及相应的氨基酸序列(在粘粒2C10中)。
表11 基因SEQ ID NO:2上的碱基多肽 busF 114-938(C) SEQ ID NO:8 busG 1389-2558 SEQ ID NO:9 busH 2601-3350 SEQ ID NO:10 busI 3362-4546(C) SEQ ID NO:11 busJ 4684-6300 SEQ ID NO:12 busK 6317-7507 SEQ ID NO:13 busL 7555-8403 SEQ ID NO:14 busM 8640-9569 SEQ ID NO:15 busN 9671-10666(C) SEQ ID NO:16 busO 10678-12135(C) SEQ ID NO:17 busP 12867-14177(C) SEQ ID NO:18 busQ 14627-15967 SEQ ID NO:19 busR 16008-17141 SEQ ID NO:20 busS 17168-17914 SEQ ID NO:21 ORF LI 18523-19932(C) SEQ ID NO:22 ORF LII 19982-20488(C) SEQ ID NO:23 ORF LIII 20539-21033(C) SEQ ID NO:24 ORF LIV 21179-21922 SEQ ID NO:25 ORF LVI 22674-23453(C) SEQ ID NO:26 ORF LVII 23690-24886(C) SEQ ID NO:27 ORF LVIII 26180-26923(C) SEQ ID NO:28 ORF LIX 27646-28473 SEQ ID NO:29 基因 SEQ ID NO:1上的碱基 多肽 ORF RI 62090-63937 SEQ ID NO:30 ORF RII 65229-66602(C) SEQ ID NO:31 ORF RIII 68762-69676(C) SEQ ID NO:32
(C)指示序列表中所给出的互补链
为了指定表11所鉴定的多肽功能,本实验给出了4个明显证据:
1.同已知功能的序列的相似性。
2.同A83543多杀菌素生物合成基因的相似性。
3.阻断目标基因的实验结果。
4.生物转化实验的结果。
将预测多肽的氨基酸序列同生物技术国家信息中心(NCBI,Washington,DC)资料库中保存的序列对比,使用BLAST运算法则来测定它们同已知蛋白的相关性,定期重复在NCBI资料库进行BLAST搜索,可以从新添加的类似物得到新的判断。表12给出了2001年2月18日来自于基本BLAST搜索的显著匹配蛋白:
表12 基因 显著蛋白匹配基因库登记号BLAST分值* 报导的功能 busF C-5-O-甲基转移酶aveD(除虫链霉 菌Streptomyces avemitilis)T44579 156 C-甲基化 busG 糖基转移酶urdGT1b(弗氏链霉菌 Streptomyces fradiae)AF164961 205糖基转移 busH 3”’macarocin O-甲基转移酶 tylF(弗氏链霉菌)AF147703 297糖甲基化 busI 2”’macarocin O-甲基转移酶 tylF(弗氏链霉菌)AAD12164 287糖甲基化 busJ 己糖氧化酶(Chondrus crispus)U89770 148己糖氧化酶 busK 2”’macarocin O-甲基转移酶 tylF(弗氏链霉菌)AAD12164 310糖甲基化 busL mitM甲基转移酶(淡紫灰链霉菌 Streptomyces lavenduale)AF127374 120 C-甲基化 busM Lip4分泌脂酶(白色假丝酵母菌 Candida albicans)B70543 94脂酶 busN 3-酮还原酶aknQ(加利利链霉菌 Streptomyces galilaeus)AF264025 284己糖3-酮还原酶 busO urds(弗氏链霉菌)AF269227 404己糖2,3-脱水作用 busP dnrH糖基转移酶(Streptomyces puecetius)U77891 290糖基转移 busQ urdQ 3,4-脱水酶(弗氏链霉菌)AF269227 480己糖脱水酶 busR spsC芽孢外被多糖结合蛋白(枯 草芽孢杆菌Bacillus Subtillus)P39623 185己糖转氨作用 busS desVI N,N-二甲基转移酶(委内瑞 拉链霉菌(Streptomyces venezuelae))AF079762 240氨基甲基化 ORF LI ngt N-糖基转移酶(气生菌落糖丝 菌Saccharothrix aerocologenies)AB023593 221糖基转移 ORF LIV urdR己糖4-酮还原酶(弗氏链霉 菌)AF080235 243己糖酮还原作用 ORF LVI fkbM,FK506 O-甲基转移酶U65940 100甲基转移 ORF LVII oleP,P450单加氧酶(抗生链霉菌 Streptomyces antibioticus)L37200 387单加氧酶 ORF LVIII 易位酶(鸟分枝杆菌 Mycobacterium avium)AF107207 180易位作用 ORF LIX mmcR(淡紫灰链霉菌)AF127374 124甲基转移 ORF RI 解离酶样蛋白 (Acidithiobacillus ferrooxidans)U73041 97易位作用 ORF RII 假定的蛋白yvmC(枯草芽孢杆菌)AF017113 120 ORF RIII 醇脱氢酶[天蓝色链霉菌 Streptomyces coelicolorA3(2)]AL133236 155醇脱氢酶
*BLAST分值越高,表示相似性越高(Altschul等,1990)。
直接对比bus开放阅读框和A83543多杀菌素生物合成基因(登录号为AY007564)。它们的DNA和蛋白质序列的高度相似性暗示了这些基因在多杀菌素生物合成中可能行使相似的功能。表13给出了bus和spn基因的相似性比较。
表13丁烯基-多杀菌素基因bus ORF长度bp(a.a.)A83543多杀菌素基因Spn ORF长度bp(a.a.)BLAST分值ORF同一性百分比(DNA)ORF同一性百分比(氨基酸)GenBank报告的功能busF 828(275)spnF 828(275)1247 94%91%C-甲基化busG 1173(390)spnG 1173(390)1844 95%90%糖的添加busH 753(250)spnH 753(250)1328 97%97%糖甲基化busI 1188(395)spnI 1188(395)1966 96%92%未知busJ 1620(539)spnJ 1620(539)2587 95%83%氧化-还原busK 1194(397)spnK 1194(397)2163 96%88%未知busL 852(283)spnL 852(283)2274 94%94%C-甲基化busM 933(310)spnM 963(320)1909 95%96%未知busN 999(332)spnN 999(332)1772 96%91%未知busO 1461(486)spnO 1461(486)2319 95%92%脱氧糖的合成busP 1314(437)spnP 1368(455)2004 94%89%糖的添加busQ 1344(447)spnQ 1389(462)2355 94%81%双脱氧糖的合成busR 1137(378)spnR 1158(385)1852 95%89%糖转氨作用busS 750(249)spnS 750(249)1255 96%93%氨基糖甲基化
尽管一些bus基因同spn基因的DNA和氨基酸序列具有高度同G性,但是值得注意的是相比于A83543多杀菌素一些bus基因产物能够明显催化丁烯基-多杀菌素生物合成过程中不同的反应。这些差异体现在从刺糖多胞菌LW107129(NRRL30141)中分离得到的不同的丁烯基-多杀菌素化合物,所有已公开的天然多杀菌素在C-17位上都被forosamine或特异的forosamine异构体所取代(Kirst等,1992)。另一方面,丁烯基-多杀菌素在C-17位上也更宽范围的forosamine异构体以及象amicetose、O-甲基葡萄糖和O-甲基夹竹桃糖等中性糖所取代。这些相对A83543多杀菌素C-17位上糖基化的多样性要求能够催化糖基化反应的糖基转移酶以及能够生产糖的生物合成酶,这些糖可能是由位于bus基因附近或染色体其他位置上的特异的合成酶基因催化合成,或者也可能由列出的丁烯基-多杀菌素生物合成基因的其它底物特异性合成。Amicetose能够被bus基因蔟以外的基因产生,或者它也有可能是forosamine生物合成的中间产物(图4)。甲基夹竹桃糖可以作为forosamine生物合成的副产物被生成,也可由鼠李糖O-甲基转移酶(busH、busI和busK)合成。这个糖可从NDP-4-酮-2,6-脱氧-D-葡萄糖(forosamine生物合成的中间产物)合成而来。因此,由公开的基因和其它刺糖多胞菌LW107129(NRRL30141)基因对这一前体进行的酮还原作用和O-甲基化作用可以生物合成含甲基夹竹桃糖的多杀菌素衍生物(图4)。
另外,在表13中列出的9个基因直接同丁烯基-多杀菌素糖苷配基或PSA基因(busF、busG、busH、busI、busJ、busK、busL、busM和busP)相互作用。这些基因的糖苷配基和PSA底物同A83543多杀菌素的糖苷配基和PSA有着明显的差异。因此,这些基因同表13中列出的与之相关的spn对应物有着明显不同的底物特异性。
几个由刺糖多胞菌LW107129(NRRL30141)产生的丁烯基-多杀菌素类似物在C-8或C-24位上被羟基化(表2)。与红霉素生物合成中在C-6位上进行羟基化相同,大环内酯类化合物也能由P-450单加氧酶催化作用下在合成后进行羟基化(Weber和McAlpine,1992)。ORF LVII同P-450单加氧酶有着很高的相似性,丁烯基-多杀菌素C-8或C-24位上的羟基化可能是由ORFLVII或刺糖多胞菌LW107129(NRRL30141)染色体其它位置编码的单加氧酶负责。另外,和白霉素一样,羟基化的前体,如甘醇酸酯和甘油,可在聚酮化合物合成过程中被整合进去(Omura等,1983)。据报道,在尼达霉素产生菌(nid AT6)中,专门负责添加甘醇酸酯的AT结构域同红霉素和雷怕霉素PKS基因中甲基-丙二酰辅酶A特异的AT结构域相似(Katz等,2000)。PKS模块7负责丁烯基-多杀菌素上C-8和C-9的添加。然而busAT7结构域中并没有同nidAT6完全相同的甲基-丙二酰辅酶A特异序列。相对于其它AT结构域和nidAT6来说,busAT7中存在着负责甘醇酸酯特异性的独特序列。与A83543多杀菌素相比,负责这些修饰的丁烯基-多杀菌素生物合成基因是独一无二的,因为刺糖多胞菌不能产生这样的羟基化的多杀菌素。
另外,相对于刺糖多胞菌,刺糖多胞菌LW107129(NRRL30141)中的鼠李糖甲基化的特异性被改变了。在美国专利5,202,242和5,840,861中公开的刺糖多胞菌突变株显示出A83543多杀菌素鼠李糖上甲基化的变化,该突变株一般产生A83543多杀菌素的单去甲基化的鼠李糖衍生物,而其双去甲基鼠李糖衍生物只有在西萘芬净这样的甲基转移酶抑制剂存在时才能被检测到。在无甲基转移酶抑制剂的情况下,携有鼠李糖甲基化改变的基因的刺糖多胞菌LW107129(NRRL30141)突变株能够产生大量丁烯基-多杀菌素的双和三去甲基鼠李糖衍生物。
作为补充性研究,将含有刺糖多胞菌LW107129(NRRL30141)bus DNA的粘粒结合其中丁烯基-多杀菌素生物合成发生改变的该菌株的突变株(详细资料见随后的实施例4),而后测试这一转化接合子将阻断突变体的产物转化为其它多杀菌素的能力。使用的突变株是30141.8,它产生3’-O-去甲基鼠李糖-丁烯基-多杀菌素(3-ODM)和相关的化合物,而30141.8/8H3转化接合子产生丁烯基-多杀菌素而不是3-ODM,所以,负责鼠李糖3’位置处甲基化的基因应位于粘粒8H3中。
在目标基因破坏实验中,通过PCR从粘粒DNA中扩增内部片段进而将其克隆进质粒。然后将质粒转化入刺糖多胞菌LW107129(NRRL30141)中,随后分离、发酵培养含雷怕霉素抗性基因的转化接合子。基因破坏实验的基础是当一个带有内部基因片段的质粒整合进来后,结果产生两个不完全的生物合成基因拷贝,从而消除酶的功能。分析发酵产物来确定哪种丁烯基-多杀菌素被积累了。busO基因的破坏导致丁烯基-多杀菌素PSA的累积,这暗示着busO基因为forosamine的合成和添加所必需(见实施例5)。利用forosamine生物合成基因不能合成的C-17位含有糖的化合物在busO突变株中也能累积。
现在将要结合BLAST搜索、基因破坏实验和生物转化研究几方面结论,通过基因基础详细讨论一个基因。
由于PKS上游的14个基因同刺糖多胞菌的spnF基因有很高的相似性,并且BLAST搜索结果显示这些基因同已知的编码丁烯基-多杀菌素生物合成所需的酶基因有着惊人的相似性。因此,可认为它们参与了丁烯基-多杀菌素的生物合成。
busF,busJ,busL,busM
基因busF,busJ,busL和busM同spnF,spnJ,spnL和spnM有很高的相似性。据报道,这些A83543多杀菌素基因参与了从推测出的PKS基因的单环内酯产物生成糖苷配基的过程。busF的基因产物同spnF的相比具有91%氨基酸同一性。同样地,busL的基因产物同spnL的相比具有94%氨基酸同一性。据报道,spnF和spnL基因产物均为甲基化转移酶,并且4个蛋白同已知参与C-C键形成的来自链霉菌的酶有很高的相似性。busJ蛋白同报道为氧化还原酶的spnJ蛋白有83%的氨基酸同源性。busJ和spnJ同dnrW都有高度相似性,已知后者在柔红霉素的生物合成过程中参与C-C键形成。busM基因产物同spnM基因产物96%同一性,而busM和spnM基因产物与来自于白色假丝酵母的一类新的分泌型脂酶高度相似。busF和busL基因产物的功能是作为甲基化转移酶,而busJ基因产物作为氧化酶,busM基因产物作为脂酶则同报道的spnF,spnJ,spnL和spnM基因产物在C-C桥形成过程中所起的作用相一致。
busG,busH,busI,busK
基因busG,busH,busI和busK同来自于刺糖多胞菌的基因spnG,spnH,spnI和spnK有很高的相似性。据报道,这些基因参与向A83543多杀菌素糖苷配基添加鼠李糖的反应以及接下来的甲基化反应。busG基因同spnG基因有90%相似,而同参与向聚酮化合物衍生的抗生素上添加糖基反应的几个基因也高度相似(表11)。busH,busI和busK基因产物分别同spnH(97%),spnI(92%)和spnK(88%)基因产物有很高的氨基酸相似性,spnH,spnI和spnK的基因产物被报道参与多杀菌素生物合成过程中鼠李糖的甲基化反应。BusH,busI和busK 3个基因同来自于链霉菌属弗氏链霉菌(Streptomyces fradiae)的基因tylE(busI和busK)和tylF(busH)有高度的氨基酸相似性,tylE(busI和busK)和tylF(busH)基因产物经实验证明为macrocin-O-甲基化转移酶(Bate和Cundliffe,1999)。
busN,busO,busP,busQ,busR和busS
基因busN,busO,busP,busQ和busS同来自于刺糖多胞菌的基因spnN,spnO,spnP和spnQ,spnR和spnS有很高的相似性(表12)。据报道,这些基因参与了forosamine糖的生物合成或添加。busP同其它的糖基化转移酶(表11)的相似性表明:busP编码这种丁烯基-多杀菌素forosamyl转移酶。busO和urdS 2,3-脱水酶(表11;Hoffmeister等人,2000)之间的高度相似性说明:busO参与了forosamine生物合成过程中的2’-脱氧步骤。busQ基因产物和urdQ3,4-脱水酶(弗氏链霉菌;Hoffmeister等人,2000)之间的相似性表明:busQ参与forosamine生物合成过程中的3’-脱水步骤。busR与一组被认为功能是脱氧糖转氨酶的蛋白有着高达40%的同一性(Thorson等人,1993),这表明:busR参与forosamine生物合成过程中的4’-胺化步骤。而busS同氨基甲基化酶之间的高度相似性则表明:busS参与了forosamine的4’-氨基基团的甲基化过程。因此,busN,busO,busP,busQ和busS这些基因都参与丁烯基-多杀菌素的forosamine部分的产生。
来自于刺糖多胞菌LW107129(NRRL30141)的19个基因在丁烯基-多杀菌素的生物合成过程中被赋予了功能:5个PKS基因负责产生大环内酯,4个基因负责将这个大环内酯修饰成糖苷配基,4个基因负责添加和修饰鼠李糖,还有6个基因用于合成和添加forosamine。图S.1A和1B中概述了推测出的这个生物合成途径。
用途
克隆的刺糖多胞菌丁烯基-多杀菌素DNA有很多用途。这些克隆的基因可用以提高丁烯基-多杀菌素的产量并生产新的丁烯基-多杀菌素。产量的提高通过将一或更多的丁烯基-多杀菌素生物合成基因的复制拷贝整合进入具体的丁烯基-多杀菌素生产菌株的基因组中而得以实现。在一种极端情况下——在由于缺乏所需要的酶而其生物合成途径被中断的特定的突变菌株中,可以通过整合所需的基因拷贝恢复多杀菌素的生产。
利用克隆DNA片段去破坏丁烯基-多杀菌素的生物合成步骤可以产生新的化合物。这种破坏可能会导致前体或“旁路(shunt)”产物(天然加工得到的前体衍生物)的累积。通过破坏基因的方法产生的被修饰多杀菌素可能本身是昆虫控制剂,也可能作为进一步化学修饰的底物并产生新的带有独特性质和活性谱的半合成多杀菌素。busQ基因的断裂会导致丁烯基-多杀菌素PSA的累积。丁烯基-多杀菌素PSA作为起始物质对于合成在C-17处含有新基团的多杀菌素类似物有用。
通过将一或更多克隆的bus基因或它们的一部分转移到异源宿主内也可以产生新的丁烯基-多杀菌素。这些基因可以提供受体细胞中不存在的酶功能。这样的基因可能提供另一种糖基、修饰已经存在的糖基或糖苷配基碳原子,并允许另一种糖基连到糖苷配基上或改变此糖苷配基本身的基本结构。将克隆的bus基因转移到异源宿主内所产生的化合物可能本身作为昆虫控制剂,也可能作为进一步化学修饰的底物以产生新的具有独特性质和活性谱的半合成多杀菌素。来自于粘粒8H3和9D3的刺糖多胞菌LW107129(NRRL30141)DNA可以被转移到刺糖多胞菌-A83543的产生者中,并且此转化结合子可产生新的多杀菌素。
诱变克隆的基因以及用突变基因去替换产丁烯基-多杀菌素的生物中未突变的相应部分也能产生新的丁烯基-多杀菌素。诱变可包括,例如:1)缺失或失活KR、DH或ER结构域使得一或更多个相应功能被阻断,并且导致该菌株产生具有内酯核的多杀菌素,此内酯核含有双键、羟基或并不存在于多杀菌素核A上的酮基(见Donadio等人,1993);2)取代AT结构域使得不同的羧酸整合到内酯核上(见Ruan等人,1997);3)向已存在的PKS模块上添加KR,DH或ER结构域使得该菌株能产生具有内酯核的多杀菌素,而此内酯核上含有多杀菌素A核上不存在的饱和键、羟基或双键(MacDaniel和Katz,2002);4)增加或删减完整的PKS模块使得这个环内酯上碳原子的数目增加或减少。
来自于丁烯基-多杀菌素基因簇区域的DNA可作为杂交探针去识别同源序列。因此,该克隆DNA可用于定位来自于刺糖多胞菌LW107129(NRRL30141)基因文库的另外的质粒,既包括这里所描述的区域,同时也含有过去未克隆的来自于刺糖多胞菌LW107129(NRRL30141)基因组邻近区域的DNA。另外,剌糖多胞菌LW107129(NRRL30141)基因同S.spinosa spn基因的比较结果有助于鉴定不同于非多杀菌素生产的生物合成基因(如红霉素、雷怕霉素、泰乐美等的生物合成基因)的保守序列区域。这些多杀菌素特异的基因探针和来自于这个被克隆区域的全部DNA都可用于鉴定其它生物体内非同一但相似的序列。正常情况下,杂交探针至少长约20个碱基,并被标记以便检测。
按照诸如美国专利NO.5,362,634或2001年3月21日提交的临时美国专利申请60/277,601“大环内酯类杀虫剂”所提到的传统方案,通过培养本发明提供的修饰菌株可生产多杀菌素。上述实施例是非限制性的而不应该作为本发明的限制。
实施例2
利用LC-MS分析发酵液中丁烯基-多杀菌素代谢产物
下述方法利用以电喷雾质谱(ESI)监测发酵液中分子式(1)和其它成分产生的高效液相(HPLC)分离。通过分析电喷雾加成(adduct)离子,该系统可用来测定纯化的物质的分子量。表15给出了相关数据。
加入与发酵液等体积的变性乙醇。振荡混合物1小时,离心,过滤(孔径:0.22μm)以除去大量的细胞碎片。微量离心1ml等分样品,然后用下述LC-MS系统去分析澄清的提取物。
HPLC系统:柱固定相:250×4.6mm柱,基质钝化的硅胶,5μmC8(Hypersil-C8-BDS)。流动相:10mM乙酸铵-甲醇-乙腈线性梯度如下:
表14时间(分钟)溶剂A的百分含量 溶剂B的百分含量 0 20 25 30 35 100 0 0 100 0 100 100 0 100 0
其中溶剂A:10mM乙酸铵及溶剂B:甲醇-乙腈(1∶1);
流速:1mL/min;样品经过紫外检测器后被质谱断裂,因此MS:废物比例为约5∶95;
检测:ESI正谱在高弧电压和低弧电压下获得;
LC特征保留时间和特征质谱离子峰在表15中给出。
表15中化合物编号(来自表1) LC留时间[M+H]+的m/za二级离子的m/zb 1 4 5 6 9 13 23.8 22.9 24.3 22.1 21.4 22.0 758.4 744.4 722.4 774.4 810.4[M+NH4] 617.3 142.0(forosamine) 142.0 142.0 142.0 189.0(三-氧-甲基 鼠李糖) 189.0
a是指在+ESI模式中,低电弧电压下得到的母离子的m/z;
b是指在高电弧电压下,+ESI中观察到的主要丰富片段及加成离子的m/z。
实施例3
通过发酵制备丁烯基-多杀菌素代谢物
通过在发酵培养基(配方如下)中培养所需的刺糖多胞菌,其选自菌株NRRL30141、NRRL30142或其衍生物,来生成分子式(1)中的代谢物。先将1.8mL冷冻的生长培养物融化,然后接种于25mL生长培养基中,于30℃,150rpm,在125ml锥形瓶中培养72~96小时。
表16 生长培养基 成分 数量(g) 右旋糖 9.0 trypticase大豆汤 30.0 酵母提取物 3.0 MgSO4·7H2O 2.0 去离子水 1000.0
摇瓶发酵
取12mL第一阶段成熟菌种,接种于盛有50mL发酵培养基的500mL有挡板的锥形发酵瓶中。
表17 发酵培养基(每升H2O) 成分 数量(g) 右旋糖 80.0 棉籽粉 32.0 大豆粉 8.0 玉米浸渍粉 8.0 碳酸钙 5.0 酵母提取物 2.0
将发酵液在30℃,200rpm(50mm搅拌(stroke))条件下培养7~12天。然后将成熟的发酵液用合适的溶剂抽提,用色谱分离法回收代谢产物(见实施例1中的公开部分)。
实施例4
用粘粒8H3对菌株NRRL30421中鼠李糖甲基化缺陷的互补
菌株NRRL30421是刺糖多胞菌NRRL30141的突变种,它不能使丁烯基-多杀菌素上的鼠李糖完全甲基化,只能积累化合物4及在3’位置缺少O-甲基化的其它丁烯基-多杀菌素(3’-ODM)。这一甲基化缺陷被认为是busH、busI或busK基因编码的O-甲基转移酶之一突变的结果,而所有上述基因都存在于粘粒8H3中(见图2)。
通过接合转移将大肠杆菌ATCC 47055中的粘粒8H3(见图3)转入菌株NRRL30421中(Matsushima等,1994),然后将两株用粘粒8H3转化的单菌落用实施例2中的方法发酵,再用实施例1中的方法分析化合物1和化合物4的生产。
表18菌株(基因型) 化合物4(μg/mL) 化合物1(μg/mL) 化合物1∶4的比率 NRRL30421(3’ODM*) 1.0 0.7 0.7 NRRL30421(3’-ODM*)/8H3-42 0.5 8.9 17.8 NRRL30421(3’-ODM*)/8H3-45 0.1 3.0 30.0 NRRL30141 0.4 9.7 24.3
*在鼠李糖3’位置防止甲基化的突变菌株。
菌株NRRL30421主要生产化合物4,而含有粘粒8H3的NRRL30421菌株则主要生产化合物1(表18)。含有粘粒8H3的NRRL30421菌株中化合物1和4的产量与未突变的NRRL30141中的大致相同(表18)。这表明用粘粒8H3转化能够克服菌株NRRL30421鼠李糖甲基化缺陷,并可恢复提高的化合物1的产量。
实施例5
丁烯基-多杀菌素前体和由busO基因破坏所产生的旁路产物(Shunt
Product)的累积
通过整合busO基因内部片段而使busO基因失去活性。用寡核苷酸链对(一个对应于SEQ ID NO:2中的碱基11882~11861,另一个对应于SEQ ID NO:2中的碱基10970~10993)去扩增912bp大小的片段(位于1457bp的busO基因中),该片段对应SEQ ID NO:2中的碱基10970~11882。用含有该片段的质粒转化刺糖多胞菌LW107129(NRRL30141)会导致busO基因的部分复制,从而产生在质粒两侧的及抗生素抗性基因的截短的基因两个拷贝。
在FailSafeTMPCR仪(Epicenter)中用SEQ ID NO:33和34作为引物扩增912bp的busO基因内部片段,然后按厂商(Invitrogen)的说明把扩增的片段克隆入pCRII中,产生的质粒用EcoRI消化,随后busO基因内部片段被克隆到pOJ260的EcoRI位点内。产生的新质粒通过接合转移(Matsushima等,1994)的方式从大肠杆菌ATCC 47055转移到刺糖多胞菌NRRL30121的衍生菌株中。然后将6个独立的阿泊拉霉素抗性接合后体分别按实施例2中的方法发酵培养,并按实施例1中的方法分析化合物1和其他的多杀菌素衍生物的生产。
亲代菌株NRRL30141产生高水平的化合物1和低水平的假糖苷配基(Pseudoaglycone)PSA;化合物13)以及少量的化合物9(表19)。在6个busO突变菌株中均未检测到化合物1,这意味着busO基因是丁烯基-多杀菌素完全生物合成所必需的基因。另外,PSA在所有6个busO突变菌株中的表达水平均增加(可由Forosamine供应不足预测得知),C-17位上连有非Forosamine糖的化合物9在busO突变菌株中的表达水平也增加。
表19 菌株(基因型) 化合物1* 化合物13 化合物9 NRRL30141 366.3 1.0 0.4 NRRL30141 busO 65 nd 13.8 1.7 NRRL30141 busO 67 nd 12.3 3.7 NRRL30141 busO 68 nd 6.7 3.8 NRRL30141 busO 70 nd 9.3 1.3 NRRL30141 busO 71 nd 12.3 2.4 NRRL30141 busO 72 nd 5.4 1.6
*表中报导的数量是同NRRL30141中化合物13相比得出的结论。
nd表示未被检测到
参考文献
1.Altschul,S.F.,W.Gish,W.Miller,E.W.Myers,and D.J.Lipman(1990).Basic localalignment search tool.J.Molec.Biol.215:403-10。
2.Aparicio,J.F.,I.Molnar,T.Schwecke,A.Konig,S.F.Haydock,L.E.Khaw,J.Staunton & J.F.Leadlay(1996).″Organization of the biosynthetic gene cluster forrapamycin in Streptomyces hygroscopicus:anialysis of the enzymatic domains in themodular polyketide synthase,″Gene 169:9-16.
3.Ausebel F.,R. Brent,R.Kingston,D.Moore,J.Smith,J.Seidman,and K.Struhl,eds.(1987).Current Protocols in Molecular Biology.(John Wiley and Sons,New York).
4.Baltz,R.H.,M.C.Broughton,K.P.Crawford,K.Madduri,D.J.Merlo,P.J.Treadway,J.R.Turner and C.Waldron(2000)Biosynthetic Genes for SpinosynInsecticide Production.US Patent 6,143,526.
5.Bate,N.,& E.Cundeliffe(1999)The mycinose-biosynthetic genes of Streptomycesfradiae,J.Ind.Microbiol.Biotechnol.23:118-122.
6.Bibb,M.J.,P.R.Findlay & M.W.Johnson(1984).″The relationship between basecomposition and codon usage in bacterial genes and its use for the simple and reliableidentification of protein-coding sequences,″Gene 30:157-166.
7.Bierman,M.,R.Logan,K.O′Brien,E.T.Seno,R.N.Rao & B.E.Schoner(1992).″Plasmid cloning vectors for the conjugal transfer of DNA from Escherichia coli toStreptomyces spp,″Gene 116:43-49.
8.Broughton,M.C.,M.L.B.Huber,L.C.Creemer,H.A.Kirst & J.A.Turner(1991).″Biosynthesis of the macrolide insecticidal compound A83543 by Saccharopolysporaspinosa,″Ann.Mtg.Amer.Soc.Microbiol.
9.Bsang,C.,P.F.Long,J.Cortes,J.Westcott,J.Crosby,A.-L.Matharu,R.J.Cox,T.J.Simpson,J.Staunton and P.F.Leadlay(1999)“A chain initiation factor common toboth modular and aromatic polyketide synthases.”Nature 401:502-505.
10.Burgett,S.G.and P.R.J.Rosteck(1994)“Use of dimethyl sulfoxide to improvefluorescent,Taq cycle sequencing.”in:Automated DNA Sequencing and Analysis.M.Adams,C.Fields and J.C.Venter,eds.NY,Academic Press:pp.211-215.
11.Dehoff,B.S.,S.A.Kuhstoss,P.R.Rosteck & K.L.Sutton(1997).″Polyketide synthasegenes.″EPA 0791655.
12.Donadio,S.,J.B.McAlpine,P.S.Sheldon,M.Jackson & L.Katz(1993).″Anerythromycin analog produced by reprogramming of polyketide synthesis,″Proc.Natl.Acad.Sci.USA 90:7119-7123.
13.Donadio,S.& L.Katz(1992).″Organization of the enzymatic domains in themultifunctional polyketide synthase involved in erythromycin formation inSaccharopolyspora erythrae,″Gene 111:51-60.
14.Donadio,S.,M.J.Staver,J.B.McAlpine,S.J.Swanson & L.Katz(1991).″Modularorganization of genes required for complex polyketide biosynthesis,″Science 252:675-679.
15.Hoffmeister,D.,K.Ichinose,S.Dormann,B.Foust,A.Trefzer,G.Drager,A.Kirschining,C.Fischer,E.Kunzel,D.W.Bearden,J.Rhor and A.Bechthold(2000)The NDP-sugar co-substrate concentration and the enzyme expression level influencethe substrate specificity of glycosyltransferases:cloning and characterization of thedeoxysugar biosynthesis genes of the urdamycin biosynthetic gene cluster.Chemistry& Biology 7:821-831.
16.Ikeda,H.,T.Nomoniya,M.Usami,T.Ohta and S.Omura(1999)Organization of thebiosynthetic gene cluster of the polyketide anthelmintic macrolide avermectin inStreptomyces avermitilis.Proc.Nat.Acad.Sci.USA 96:9509-9514.
17.Jiang,X.M.,B.Neal,F.Santiago,S.J.Lee,L.K.Romana & P.R.Reeves(1991).″Structure and sequence of the rfb(O antigen)gene cluster of Salnonella serovartyphimurium(strain LT2),″Mol.Microbil.5:695-713.
18.Katz,L.,D.L.Stassi,R.G.Summers,Jr.,X.Ruan,A.Pereda-Lopez and S.J.Kakavs.(2000)Polyketide derivatives and recombinant methods for making same.US Patent6,060,234.
19.Kirst,H.A.,K.H.Michel,J.W.Martin,L.C.Creemer,E.H.Chino,R.C.Yao,W.M.Nakatsukasa,L.D.Boeck,J.L.Occolowitz,J.W.Paschal,J.B.Deeter,N.D.Jonesand G.D.Thompson.(1991)A83543A-D,unique fermentation-derived tetracyclicmacrolides.Tetrahedron Lett.32:4839-4842.
20.Liu,H.W.& J.S.Thorson(1994).″Pathways and mechanisms in the biogenesis ofnovel deoxysugars by bacteria,″Ann Rev Microbiol 48:223-256.
21.Matsushima,P.,M.C.Broughton,J.R.Turner & R.H.Baltz(1994).″Conjugaltransfer of cosmid DNA from Escherichia coli to Saccharopolyspora spinosa:effectsof chromosomal insertion on macrolide A83543 production,″Gene 146:39-45.
22.McDaniel,R.& L.Katz(2001)Genetic engineering of novel macrolide antibiotics.In:Dev.Novel Antimicrob.Agents:Emerging Strategies;K. Lohner,Ed.;pp.45-60;Horizon Scientific Press,Wymondham,UK.
23.Merson-Davies,L.A.and E.Cundeliffe(1994)Analysis of five tylosin biosyntheticgenes from the tylIBA region of the Streptomyces fradiae genome.Mol Microbiol.13:349-355.
24.Omura,S.,K.Tsuzuki,A.Nakagawa,and G.Lukacs(1983)Biosynthetic origin ofcarbons 3 and 4 of leucomycin aglycone.J.Antibiot.36:611-613.
25.Ruan,X.,A.A Pereda,D.L.Stassi,D.Zeidner,R.G.Summers,M.Jackson,A.Shivakumar,S.Kakavas,M.J.Stavier,S.Donadio and L.Katz(1997).″Acyltransferase Domain Substitutions in Erythromycin Polyketide Synthase YieldNovel Erythromycin Derivatives,″J.Bacteriology 179,6416-6425.
26.Sambrook,J.E.F.Fritch,and T.Maniatis(1989)Molecular Cloning a LaboratoryManual,Second Edition.(Cold Spring Harbor Press,Cold Spring Harbor,NY)
27.Schwecke,T.,J.F.Aparicio,I.Molnar,A.Konig,L.E.Khaw,S.F.Haydock,M.Oliynyk,P.Caffrey,J.Cortes,J.B.Lester,G.A.Bohm,J.Staunton and P.F.Leadlay(1995)The biosynthetic gene cluster for the polyketide immunosuppressant rapamycin.Proc.Nat.Acad.Sci.USA 92:7839-7843.
28.Shen,B.,W.Liu,S.D.Christianson and S.Standage(2000)Gene cluster of theproduction of the enediyne antitunor antibiotic C-1027.WO App.00/40596
29.Siggard-Andersen,M.(1993).″Conserved residues in condensing enzyme domains offatty acid synthases and related sequences,″ Protein Seq.Data Anal.5:325-335.
30.Simon,R.,U.Preifer & A.Puhler(1983).″A broad host range mobilization system forin vivo genetic engineering:transposon mutagenesis in Gram negative bacteria,″Bio/Technology 1:784-791.
31.Strobel,R.J.& W.M.Nakatsukasa(1993).″Response surface methods for optimizingSaccharopolyspora spinosa,a novel macrolide producer,″J.Ind.Microbiol.11:121-127.
32.Thorson,J.S.,S.F.Lo & H.Liu(1993).″Biosynthesis of 3,6-dideoxyhexoses:newmechanistic reflections upon 2,6-dideoxy,4,6-dideoxy,and amino sugar construction,″J.Am.Chem.Soc.115:6993-6994.
33.Trefzer,A.,J.A.Salas and A.Bechthold(1999)Genes and enzymes involved indeoxysugar biosynthesis.Nat.Prod.Rep.16:283-299.
34.Weber,J.M.& J.B.McAlpine(1992).″Erythromycin derivatives,″U.S.Patent5,141,926.
35.Wohlert,S.-E.,N.Lomovskaya,K.Kulowski,L.Fonstein,J.L.Occi,K.M.Gerwain,D.J.MacNeil and C.R.Hutchinson(2000)Biosynthesis of the avermectin deoxysugarL-oleandrose and novel avermectins.Genetics and Molecular Biology of IndustrialMicroorganisms Conference,Bloomington,IN,USA.
序列表
<110>Hahn,Donald
Jackson,Jim
Bullard,Brian
Gustafson,Gary
Waldron,Clive
Mitchell,Jon
<120>用于生产丁烯基-多杀菌素杀虫剂的生物合成基因
<130>51609
<140>
<141>
<150>us 60/280,175
<151>2001-03-01
<160>40
<170>PatentIn Ver.2.0
<210>1
<211>75236
<212>DNA
<213>刺糖多胞菌 NRRL30141
<400>1
atgagcgaag ccgggaacct gatcgccgtc gtcggattct cctgccgcct accccaggca 60
cctgacccgg cttctttctg gcggttgctg cgcaccggaa cggacgccat caccaccgtc 120
ccggaagggc ggtggggcga cccgttgccc ggccgggatg cgcccaaggg cccggaatgg 180
ggcggcttcc tggctgatgt cgactgcttc gatcccgagt tcttcgggat ctcgccgcga 240
gaagcggccg ccatggaccc ccagcagagg ctggctctgg agctcgcctg ggaggctctc 300
gaagacgccg gtatccccgc cggcgagctg cgcggcactg ccgcgggggt gttcatgggg 360
gcgatctctg acgactacgc cgccctgctt cgcaagagcc cgccggaagt ggctgcgcag 420
taccgtctca ccggcaccca tcgaagtctg atcgccaacc gcgtgtccta cgtgctcggc 480
ctgcgcgggc caagcctgac ggtggattca ggtcagtcct cgtccctggt cggcgtgcat 540
ctcgccagcg agagcctgcg acgtggcgag tgcgcgatcg ctctcgccgg cggcgtgaac 600
ctcaacctgg ctgccgagag caacagagcc ctgatggact tcggcgcgct ctccccggac 660
ggtcgctgct tcaccttcga tgcgcgggcg aacggttacg tccgcggcga aggcggcggc 720
ctcgtcgtgc tgaagaaggc cgatcaggct cgcgccgatg gcgaccggat ctactgcctc 780
atccgcggca gcgcggtcaa caacgacggg ggcggtgctg ggctcacggc tccggcggca 840
gacgcccagg cggagttgct gcgacaggca taccggaacg cgggtgtcga cccggccgcc 900
gtgcagtacg tcgagctcca cggcagcgcg accagagtcg gggaccccgt cgaagcagca 960
gccctcggat ctgtcctggg tgtggcaaga cggcccggcg acaagctgcg tgtggggtcg 1020
gcgaagacca acgtcggcca tctggaagca gcggcgggcg tcaccgggtt gctgaagacc 1080
gcactcagca tctggcaccg cgaactgccg ccgagtcttc acttcaccgc ccccaacccg 1140
gaaatcccgc tggacgaact gaatctacgc gtccagcgtg atctgcggcc gtggccggag 1200
agcgagggcc cgctgctggc cggcgtcagc gccttcggaa tgggaggcac gaactgccac 1260
ctggtgctct ccgattcgtc ccaggtggag cgaaggcgta gtggacccgc tgaggcgacc 1320
atgccttggg tcttgtcggc cagaacaccg gtcgcattgc gtgcgcaggc ggcgcgcttg 1380
cacacgcacc tcaatactgc cggtcaaagt ccattggacg tcggctactc actggcgacc 1440
actcgatccg cgctaccgca ccgagccgcg ctggtcgcgg acgacgtacc gaaactgctc 1500
gccgggttga aggccctcgc tgacggcgac gacgcgccca cgctgtgcac gggcacgact 1560
tccggcgagc gggcaacagt cttcgtcttt cccggacagg gcagccagtg gatcgggatg 1620
ggtaggcagc tgctccaaac ctccgaggtt ttcgcggcgt ccatggcgga ctgcgcggat 1680
gcgttggcgc cgcacctgga ttggtccctg ctggatgtgc tgcgtaacgc ggccggcgct 1740
tcgcagcttg atcgcgacga tgtcgtccag cccgcactgt tcgccgtcat ggtctcgctg 1800
gcagagctct ggcgttcgtg gggcgtgcgt ccggaggcgg tcgtcgggca ctcgcagggg 1860
gagatcgcgg cggcctgcgt cgccggggcc ctctccgtcc gcgatgccgc aagggtagtg 1920
gcggtgcgca gcaggcttct ggcggcgctg gcgggcagag gcgcgatggc gtcgttgcag 1980
catcccgttg aagaggtgcg acaaatcctg ttgccatggc gcgatcggat cggcgtggcg 2040
ggggtgaacg gaccgtcgtc gactctggtg tcgggggacc gggaggcgat ggcggaactg 2100
ctggccgagt gcgcgcgccg agagctccgg atgcgccgga ttccagttga atacgcctcc 2160
cattcgccgc acatcgagga tgtccgcgac gagctgctgg cgctgttggc gtcgatcgaa 2220
cccaggacag ggaacatccc ggtctattcg acgacgaccg gggaactgct ggaccggccg 2280
atggacgccg actactggta ccgcaacctt cgtcaaccgg tgctgttcga agcggcggtc 2340
gaggccctgt tgaagcgggg gcacaacgca ttcatcgaga tcagcccgca cccggtgctg 2400
actgcgagca tccaggaaac cgccgcgcga gcggggcggg aggtagtggc gctcgggaca 2460
ctccgccgcg gcgaaggtgg cctgcggcag gcgctgacgt cgctggccaa agcacacgtc 2520
cacggagtgg ccgcgaactg gcacgcggtc ttcgccggca ccggggcgca gcgggtcgac 2580
ctgccgacgt acgcctttca acgacagcgc tactggctgg acacgaaacc ttccgacctc 2640
gccatgcccg agggcgatgt gtcgacagcg ttgcgggaaa aactgcgctc ctcgccgggg 2700
gcggacgtgg actcagcgac cctcacaatt atccgggcac aggcagccgt ggtactcggc 2760
cactccgatc cgaaagagat ggactcggat cggacattca aagacctggg cttcgattcc 2820
tcgaccgtgg tcgagctgtg cgaccgcctc aacgccgcca ccggactgcg cctcgcgccg 2880
agcgtggttt tcgactgtcc gacgccctac aagctcgccc gccaggtacg gacgttgttg 2940
ttggacgagc cagtccccac gacgtcaccc cgaacggaga ccgaagcgga cgagcctatc 3000
gccgtgatcg ggatgggctg tcggtttccg ggtggcgtgt cctcgcccga ggagttgtgg 3060
cagctggtcg ctgctggacg ggacgtcgtg tcagagttcc cggctgaccg aggttgggac 3120
ccggagcgtg cggggacttc gcacgtgcgc gccggcggat tcctgcatgg cgccacggat 3180
ttcgatcccg ggttcttcgg gatttccccg cgcgaggcgt tggcgatgga tccgcagcag 3240
cgcttgctgc tggaaatcgc ctgggaggcg atcgaacgag gcgggatcaa cccgcagacc 3300
ctgcacggaa gtcaaaccgg cgtcttcgtc ggcgcaacct ccctggatta cgggccacgc 3360
ctgcacgaag cgtccgacga ggcggccggc tacgtgctca ccggcagcac cacgagtgtg 3420
gcgtcgggtc gggttgcgta ttcgtttggt cttgagggtc ctgcggtgac ggtggatacg 3480
gcgtgttcat cgtcgttggt ggcgttgcat ctggcgtgcc agtcgttgcg ttcgggtgag 3540
tgtgatttgg cgttggccgg tggtgtgacg gtgatggcca cgccggggat gttcgtggag 3600
ttttcgcgtc agcggggctt ggcacccgac ggtcgctgca agtcgttcgc ggaggccgcg 3660
gatggcaccg gctggtccga gggtgccggc ctggttctac tggagcggtt gtcggatgcc 3720
cggcggaatg ggcatgacgt tttggcggtg gttcgtggca gcgcggttaa ccaggacggc 3780
gcgtcgaacg gactgactgc tccgaatggc ccgtcgcagc ggcgggtgat cacccaagca 3840
ctcgccaacg cgaagttgtc ggtgtccgat gtggacgcag tggaggcgca cgggacgggc 3900
acccggcttg gtgatccgat cgaggcgcag gcgctgatcg ccacttacgg gcagggacgg 3960
ggtccggaac ggccgttgtg gttggggtcg gtcaagtcca acatcggtca tacgcaagcg 4020
gcggccggtg ttgccggtgt catcaagatg gtcatggcga tgcggtatgg ggagctgccc 4080
gccacgttgc acgtggacga gccctcctcg caggtggact ggtctgctgg gatggttcag 4140
gttctgaccg agcacgtgcc ttggcccgac aacagccgtc ctcgtcgggt gggggtgtcg 4200
tcgttcggga tcagcggcac caatgcgcac gtcatcctcg aacagtctcc gacagcgtca 4260
agtgagttcg tggagcacag cggacctgat tcggaatctg ctgtggatgt tccggtggtt 4320
ccgtgggtgg tgtcgggcaa aacgccggaa gcgctcagtg ctcaggcgga caacttggtg 4380
tcctatctgg atgatcgccc taatgtttcc gcgctgaatg tggcatattc gctggcttcc 4440
gaacgagccg cactggatga gcgggcggtg gtgctggggg cggatcgtga agcgttgttg 4500
tctggactga aagcactggc tgccggtcac gaggatcctg gtgtggcgtc gggatccctg 4560
gtttctggtg gggttgggtt tgtgttctcc ggtcagggtg gtcagtggtc ggggatgggc 4620
cgggggcttt accgggcgtt tccggtgttc gctgctgcct ttgacgaagc ttgtgccgaa 4680
ctggatgcac atctgggcca ggaagtgggg gttcgggatg tggcgttcgg ttccgatgcg 4740
cagttgctgg agcggacgtt gtgggcgcag tcgggtttgt tcgcgctgca ggtaggtttg 4800
ctgaggctgt tgggttcatg gggtgttcgg ccgggtgcgg tgctggggca ttcggtgggc 4860
gagttggcag cggcgcacgc ggcgggtgtg ttgtcgttgc cggatgcagc tcggttggtg 4920
gcgggtcgtg cccggttgat gcaggcgatg ccggatggcg gtggcatgct cgcggtggct 4980
acaagtgaga cccaggtcga acctatgctg gatggagtgc gggaccggat cgggatcgcg 5040
gcgatcaacg ctccggaatc ggtcgtgctc tccggtgacc gcgaactact cgccgaagtc 5100
gctgatcagc tgaacgatca agggtgccgg acacgatggt tgcaggtgtc tcacgctttc 5160
cattcgtatc ggatggaacc gatgctcgac gagttcgccc agatcgcagg cagcgtggat 5220
ttccggcgtt gcgaactgcc tatcatctcg accctgacag gaaacctcga tgacgtcggc 5280
gtgatggcta cgccggagta ttgggtgcgt caggtgcgtg agcccgtccg cttcgccgat 5340
ggtgtccagt cgctcgtcga gcaagatgtg gctactgttg tcgagcttgg ccctgatgcg 5400
attctgtcgg ctctgattcc tgattgtcat tcctggggtg atcagactgt gccgattccg 5460
ttgctgcgca aggaccgcgc tgaacccgaa actgtggtcg ccgcggtggc gcgggcgcac 5520
acgcgtggtg ttcaggtcga ttggtcggcg tttttcgctg gtaccggggc tgggcgggtc 5580
gagttgccga cgtatgcctt ccagcggcag cggtattggc tggagtcatc ggtttccggt 5640
gatgtgacag gtatcggtct ggctggggcg gagcatccgt tgctgggggc cgtggttgtg 5700
ttggccgacg gtgatgggat ggtgttgacc ggtcggttgt cggtggggac gcatcggtgg 5760
ctggccgagc atcgtgtgct gggggaggtc gtggttcccg gcacggctat cctggagatg 5820
gtcttgcatg cgggggcgcg ggttggttgt ggccgggtgg aggagctcac cctggaagca 5880
ccgctggtgg tgcccgaacg cgatgccatc gaaatccagc tgctggtgaa cgcgcccgac 5940
gacaagggtc ggcggtccgt gtcgctgcat tcccgcccgg ccggtgggtc tgggggtggg 6000
ggttggacgc ggcacgccac gggcgaactc gtcgtcgccg gcacgggtgg tggggcggtt 6060
actggttggt cgactgaggg tgccgagccg gttgctctcg gtgagtttta tgtcgttcag 6120
gcggggaacg ggttcgagta tgggccgttg ttccaggggc ttcgggcggc gtggcgtcgt 6180
ggtggcgagg ttctcgcgga ggtcgccctg ccggcagcgg ctggtgcgat ggcggggttc 6240
ttgatcaatc cggcgttgct ggatgccgcc ttgcaggcgt ccgcgctggg tgaccgtccg 6300
gcggagggtg gtgcgtggct gccgttctct tttaccgggg tagaactttc cggtcagggt 6360
gggacgatca gcagggcacg ggtggagtct acgcgacccg atgcggtgtc ggtggctgtg 6420
atggatgagg gtgggcggtt gctcgcctcg atcgattctc tccggttgcg gccggtgtcg 6480
tcggtgcggt tggcgaatcg ggacgttgtc ggtgacgcgc tgttcgaggt gacttgggag 6540
ccggtggcga cgcggtcgac ggtatcgggt cgctgggcgt tgcttggtga tgctgtcggc 6600
ggcatggccg gtctcattgg gctcgcacca ggttccgtcg atcgttgtgc gggtctggct 6660
gagctcgcgg ggaaccttga ttccggtgcg ctggttgctg atgtcgtggt ttattgcgcc 6720
ggtgaacagg cggatcccga cgccggcgtg gcggcactcg cggagacccg ggagatgctg 6780
gccctggtcc agtcgtggtt ggccgaggag cggttggccg ggtcacgtct ggtggtggtg 6840
acgtgtggcg cggtgacgac ggctgcgggt gacggcgcat caaagctggc gcatgcgccg 6900
ttgtgggggt tgttgcgttc agcgcagtcg gagaacccgg gccggtttgt gctggtcgat 6960
gtggacggta ccgccgagtc gtggcgcgcg ttgccgagtg cggtggggtc gatgcaaccg 7020
cagttggccg tgcgtaaggg tgtggtgaca gtgccgcgtg tggcgtcggt tccggggccg 7080
gtcgaggtgc ccgcggtggt ggccggtccc gaccggacgg tgctgatttc cggtggcacg 7140
ggtctgttgg gtggcgtggt ggcacgccac ctggtggccg agcgcggtgt tcgtcgagtg 7200
gtgttgacgg gccgtcgtgg ctgggatgct cccggaatca ccgagttggt gggtgagctg 7260
gagggtttcg gtgcggtggt cgatgtggtg gcgtgcgacg ttgcggatcg tgctggtctg 7320
gaggggttgc tggcggcggt cccggcggag tttccgctgt gtggtgtggt gcatgccgcg 7380
ggtgtgctgg ctgacggggt gatcgagtcg ttgacaccgg aggacgtggg ggcggtgttc 7440
ggtccgaagg cggcgggggc gtggaacctg cacgagctga ctcgggatat ggacttgtcg 7500
tttttcgcgt tgttctcctc gctgtccggg gtgaccggcg ccgcgggtca gggtaattat 7560
gcggcggcga acacgttcct ggacgcattg gcgcattacc ggcgggcgca gggattgcct 7620
gcggtgtcgt tggcgtgggg cttgtgggag cagtcgagcg ggatgaccgg gcggctcagt 7680
gatgtcgacc ggagcaggat cgcccgctcc agtccaccgt tgtccaccaa ggatggtttg 7740
cggctgttcg atgccgggct ggcgttggat cgggcagcgg tggttccggc gaggttggac 7800
agggccttcc tggccgagca ggcccggtcg ggaacgctac ccgcgatgct gacggcactg 7860
gtacctacca tcacctctat caggcgcagt agtggcaccg acctcgcgga cgaggacgcc 7920
ttgcttgggg tggtgcggga gcacgccgcg agggtgctgg ggtattcggg tgcggccgag 7980
gtcggggtcg agcgtgcttt ccgggatctg ggctttgatt cgttgtctgg tgtggagttg 8040
cgtaatcggc tggccggggt gctgggagcc cggctgccgg caaccgccgt attcgactac 8100
ccgacgccgc gggcgttggc ccggttcctg caccaggaac tggcaggcga ggtcgggacg 8160
acgccggcgc cggtgacgac cacgaccgcg agcgtcgaag acgatctcgt cgcgatagtc 8220
gggatggggt gtcgttatcc gggtggggtg tcctcaccgg aggagctttg gcgtttggtg 8280
gccgggggcg tggatgcggt cgcggacttc ccggacgatc gcggctggga tctggccgga 8340
ttgttcgatc cagatcccga tcgtttcggg acttcgtatg tgcgtgaggg cgggttcctg 8400
cgggacgcgg cggagttcga tgccgcgttt ttcgggattt ctccgcgtga ggcactggcg 8460
atggacccgc agcaacggtt gctgctggag ctgtcctggg aggccgttga acgcgctggg 8520
atcgatccgg ggtcgctgcg cgggagccgg acgggtgtgt tcgcggggct gatgtatcac 8580
gactacgccg gacggttcgc ggccggagtg ccggagggct tcgaaggcta tctcggtaat 8640
ggcagcgcgg gcagtgtggc ctcgggccgg gtcgcgtatt cgttcggttt cgagggtcct 8700
gcggtgacgg tggacacggc gtgttcgtca tcgctggtgg cgttgcacct ggcaggtcaa 8760
tcactgcgtt ccggtgagtg tgatctcgcc cttgccggtg gcgtgacggt gatggccacc 8820
ccggcgacgt ttgtggagtt ctcccgtcag cggggtctgg caccggatgg gcgctgcaag 8880
tcgttcgcgg aggccgcgga cgggaccggc tggggcgagg gtgctggcct agtgctgttg 8940
gagaggttgt cggatgcccg tcgtaatggg catcgggtgt tggcggtggt tcgtgggtcg 9000
gcggtgaatc aggacggcgc gtcaaacgga ctgaccgcgc cgaatggtcc ctcgcagcaa 9060
agggtgatca cccaagcact cacgagtgcg gggttgtccg tgtccgatgt ggatgctgtg 9120
gaggcgcacg ggaccgggac caggcttggt gatccgatcg aggcacaggc attgatcgcc 9180
acctatggcc gtgatcgtga tcctgaccgg ccgttgtggt tggggtcgat gaagtccaac 9240
atcggtcaca cacaggcagc ggcgggtgtt gccggtgtga tcaagatggt gatggcgatg 9300
cgccacgggg agctgccgcg cacattgcac gtcggcgagc ccacgtcgga ggtggattgg 9360
tcggcaggtt cggtccagct cctcacggag aacacgccct ggcccgacag cggccatcct 9420
cgtcgggcgg gagtgtcgtc gttcgggatc agcggcacca acgcacacgt catcctcgaa 9480
cagtctccga cagcgtcaag tgagttcgtg gagcacagcg gacctgattc ggaatctgct 9540
gtgaatgtcc ctgtggttcc gtgggtggtg tcgggcaaaa cacccgaagc gctcagtgct 9600
caggcggaca ccttggtgtc ctatctggac gatcgatctg atgtctcctc gcgggatgtt 9660
gggtattcgc tggcgatgac gcgttcggcg ctggatgagc gggcggtggt gctggggtcg 9720
gaccgtgaaa cgttgttgtc cgggttgaaa gcactggctg ccggtcatga ggccactggg 9780
gtggttacgg gatctgtggg ttctggcggc cggcccggtt ttgtgttcgc cggtcagggt 9840
ggtcagtggt tggggatggg ccgggggctt taccgggcgt ttccggtgtt cgctgatgcc 9900
tttgacgaag cttgtgccgg actggatgcg catctggggc agaaagtggg ggttcgggat 9960
gtggtgttcg gttccgacgc gcagttgctc gatcggacgt tgtgggcgca gtcgggtttg 10020
ttcgcgttgc aggttggttt gctgaagttg ttgggttcgt ggggtgttcg gcctgttgta 10080
gtgctgggcc attcggtcgg ggagctagca gcggcgttcg ccgccggtgt gctgtcgatg 10140
gcggaggcgg ctcggttggt ggccggtcgt gcccggttga tgcaggcgtt gccgtctggc 10200
ggtgccatgc tcgcggtggc gacaagtgag acccaggtcg aacctttgct ggatggagtg 10260
cgagaccgga tcgatatcgc ggcgatcaac gctccggaat cgatcgtgct ctccggtgac 10320
cgcgaactac tcaccgaagc cgctgatcag ctgcacgatc aagggtgccg gacacggtgg 10380
ttgcaggtgt cacacgcctt ccattcgccc cagatggatc cgatgctgga cgagttcgcc 10440
gacatcgcac gaaccgtgga tttccggggt tccgaactgc cggtcgtgtc gacgctgact 10500
ggtgcgctcg atgacagcgg cctgatggct acaccagagt attgggtgcg tcaggtgcga 10560
gagcccgtcc gcttcgccga cggggttcgg gcgctcgtcg agcacgatgt ggccactgtt 10620
gtcgagctcg gcccggacgg ggcgttgtca gcgctgatcc aggaatgtgc agccgaattc 10680
gatcagtcca gaagggtggc cgcggttccg gcgatgcgcc ggagccagga cgaggcgcag 10740
aaggtgatga cggccctggc gcaggtccat gtgcgtggtg gtgcggtgga ctggcggtca 10800
gttttcgctg gtacggggtc gaagcaggtc gagctgccga cgtatgcctt ccaacgacag 10860
cggtactggc tgaatgcggt gcatgaatct tctgccggcg acatgggtcg gcgtattgaa 10920
acggaattct ggagcgctgt cgagcacgaa gatgtgacat cgcttgcaaa catattgggt 10980
attgtggacg acggcgctgc cgtggattcc ttgcgaaacg cccttccggt gttggccggc 11040
tggcagagaa cccgtaatga cgagtcgatt atggatcggc agtgttaccg aatcggctgg 11100
aggcaggtag ccggactccc gccaagggga accgtcttcg gcacttggct ggtcttcgca 11160
ccccatggct ggtccggcga accgcaggtg gcgaactgcg ttgcggcatt gcgggcaagc 11220
ggtgcctcgg tggtgttggt ggaagctgat cccgacccgg tcgtcttcgg cgaccgggta 11280
cggaccctgt gttcggactc tccggatctt gttggcgttt tgtcaatgct gtgcttggaa 11340
gaatcggcga ttccgggatt ttctgcggtg tcacggggtt ttgcgttgac cgtggagttg 11400
gtgcgggctt tggcggccgc tggtgcggat gcccggttgt ggttgctgac gtgtggtggc 11460
gtgtcggtgg gggatgtacc ggttcgtcca gagcaggcat tggtgtgggg gttggggcgt 11520
gttgcggggt tggagcatcc ggactggtgg ggcggcttga tcgatattcc ggtcttgttc 11580
gacgaagatg ctcaagagcg cttgtcgatt gtgctggcag gtctcggtga ggaagaggtc 11640
gcgatccgtt ctgacggcgt gttcgcacgt cggttggtac gccatggtgt ctcggctggt 11700
gtgaagaagg cgtggcgccc ccggggatct gtactggtga cgggcggcac gggtggtttg 11760
ggggcgcacg ctgctcgctg gttggccgac gccggagccg aacatgtggt gatggtgagt 11820
cgacgcggag agcaggcacc gagtgcggag aaattgcgga cggaactgga ggatctgggt 11880
actcgggtgt cgatcctgtc atgcgatgtg accgatcgcg aagcactggc cgaagtgttg 11940
aaagcccttc cggctgaata tccattgact gcggtagtgc atacggcagg cgtgatcgag 12000
actggtgatg cggcgtcaat gagcttggct gatttcgatg acgtgttgtc cgcaaaggtg 12060
gctggtgccg cgaatctgga tgccttgctg gccgatgtgg aattggacgc gttcgtcttg 12120
ttctcatcgg tgtcgggagt ttggggcgct gggggacagg gggcttacgc ggcggcgaat 12180
gcctatctgg atgcgctggc ggaagagcgt cggtcgcgag ggttggtcgc gaccgcggtg 12240
gcgtgggggc cgtgggccgg cgaggggatg gccgccggcg aaacaggaga ccagctgcgt 12300
cgatacggcc tttccccaat gggtccgcag tacgccatcg ccggaattcg gcgggcagtg 12360
gaacaggacg aaatttccct ggtagtggcc gacgtcgatt gggcacgttt cagcgcggga 12420
ttcctggcgg ctaggccgcg gccactgctg aacgaactga ccgaggtcaa ggaactcctc 12480
gtcaatgctc agtccgaggt gggagtcgtt gccgaggcgt cggtggcatg gcggcagcga 12540
ttggccgcag caccgaggcc ggcacaggaa cagctgatcc tggagctggt acgcggcgaa 12600
acggctctgg tactgggaca tcccggagca gaggccgttg caccggaacg agctttcaag 12660
gacagcggat tcgactcgca ggccgcggtc gaactccgcg ttcggctcaa tcgagccacc 12720
ggcctccagt tgccatcgac aattatcttc agccatccca cgcctgcaga actggctgcg 12780
gagctgcggg cgaggctcct ccccgagtcc gcaggagtag acatttccga ggaggacgag 12840
gcgcgaatca gagcggcact gacgtcgatc ccgttcgcgg ccttgcgcga ggcagacttg 12900
gtgaatcgcc tgctcgccct tgccggacac ccagtcgact ccggcagctc cccggacgat 12960
gcggtcgcga cctcgatcga tgcgatggat gtagccgacc tcgtcgaagc agcgctgggc 13020
gaacgcgagt cctgagaccg cagacctggg agatcacggt gaccaccagt tacgaagaag 13080
ttgtcgaggc actgcgagca tcgctcaagg agaacgaacg cctccggcgc ggccgggatc 13140
gattcgccgc ggagaagggc gatcccatcg cgatcgtggc gatgagttgc cgttaccccg 13200
gtcaggtctc ctcgccggag gacttgtggc aactggccgc cggcggtgtg gacgcgatct 13260
ccgaagtccc gggggatcgc ggttgggacc tagccggcgt gttcgatccg gactccgatc 13320
gtcctggcac atcgtatgcc tgtgcgggcg gtttccttca gggcgtgtcg gagttcgatg 13380
cgggcttctt cgggatttct ccgcgtgagg cgttggcgat ggacccgcag cagcggttgc 13440
tgctggaagt cgcgtgggag gtcttcgaga gggctgggct ggagcagcgg tcgacacgtg 13500
gttcccgcgt tggcgtgttc gtcggtacca atggccagga ctacgcgtcg tggttgcgga 13560
cgccgccgtc tgaggtggca ggtcatgtgc tgacgggcgg cgcggcagcg attctttcgg 13620
gtcgggttgc gtattcgttc gggttcgagg gtcctgccgt gacggtggat acggcgtgtt 13680
cgtcgtcgtt ggtggcgttg cacctggcgg gtcaagcact gcgcgctggt gagtgcgacc 13740
tcgcccttgc cggtggcgta acggtgatgt cgacgccgaa ggcgttcctg gagttctccc 13800
gccaacgtgg tctcgcggct gacgggcggt gcaagtcgtt cgcggcggcg gcggatggta 13860
ctgggtgggg cgagggtgcc ggactgttgt tgctggagcg gctgtccgac gctcgtcgaa 13920
acggacaccg ggtgttggca gtggtgcgag gtagcgctgt gaaccaggac ggtgcctcca 13980
acgggctgac cgcaccgaac ggttcttccc aggcgcgggt gatcacccag gcgttggcaa 14040
gtgcggggtt gtcggtgtct gatgtggacg cagtggaggc gcatggcacg ggcacgcggc 14100
ttggtgatcc gattgaggcg caggctctga tcgccaccta tggccgtgat cgtgatcctg 14160
ctcggccgtt atggttgggt tcggtcaagt cgaacatcgg tcatacgcag gcggcggcgg 14220
gtgtggccgg cgtgatcaaa atggtgatgg cgatgcggca cgggcagctg ccgcgcacgt 14280
tgcacgtgga cgcgccgtcg ccggaggtgg attggtcggc agggacggtc caactcctta 14340
cggagaacat gctttggccc gagagcggtc gtgttcgccg ggcgggggtg tcgtcgttcg 14400
ggatcagcgg caccaacgcg cacgtcatcc ttgaacagcc cacgggcgag acgcgtcagt 14460
cagcggggcc ggattcgggc tctgtcgtgg atgttccggt ggtgccgtgg atggtatcgg 14520
gcaaaacacc ggatgcgctc ggcgcccagg cggacacatt gatgtcctat ctggatgatc 14580
gtgttgacgt cccttcgctg gatattgcgt attcgctggc gatgacgcgt tcggcgctgg 14640
atgagcgggc ggtggtcctg ggtccggacc gcgaaacgtt gttgtccggg ttgaaagcgc 14700
tgtctgccgg gcatgaggct tctggggtgg ttacgggatc tgtggggact gggggacgca 14760
tcgggtttgt gttttccggt cagggtggtc agtggctggg gatgggccgg gggctctata 14820
gggcttttcc ggtgttcgct gctgcctttg acgaagcttg tgccgagctg gaggcacatc 14880
tgggccagga ggttggggtt cgggatgtcg tgttcggttc ggatgcgcag ttgctgaatc 14940
ggacgttgtg ggcgcagtcg ggtttgttcg cgttgcaggt cggtttgctg aagttgctgg 15000
attcgtgggg tgttcggccg agtgcggtgc tgggccattc ggtgggtgag ttggcggcgg 15060
cgttcgcggc gggtgtgttg tcgttgtcgg atgcggctcg gttggtggcg ggtcgtgccc 15120
ggttgatgca ggcgttgccg tcaggcggtg ggatgttggc ggtggctgct ggcgaggagc 15180
aactgcggcc gttgttggcc gatcacggtg atcgtgtggg gctcgctgcg gtcaacgttg 15240
cggagtcggt ggtgctctcc ggtgatcggg atgtgctcga tgacattgcc gggcggctgg 15300
acgggcaagg ggttcggaca cggtggttgc gggtttcgca tgcgtttcat tcgtatcgga 15360
tggacccgat gctggacgag ttcgccgaaa tcgcacgagc cgtggactac cggcgttgcg 15420
aactgccgat cgtgtcgacc ctgacgggaa aactcgatga cgctggcagg atgagcggtc 15480
ccgactactg ggtgcgtcag gtgcgcgagc ccgtccgctt cgccgacggt gcccaggcac 15540
tcgtcgagca cgacgtggcc accatagtcg agatcggtcc ggacggggcg ttgtcggcgc 15600
tgatccagga atgtgtggcc gcatccgatc agtccagaag ggtggccgcg gtcccggcga 15660
tgcgcaggaa ccgggacgaa gcacagaact tgacaacagc cctggcgcag gtccatgtgc 15720
gtggtggtgc ggtggactgg cggtcgtttt tcgccggtac gggggcgaag caagtcgagc 15780
tgcccaccta tgccttccag cggcagcggt actggctgga gccatcggat tccggtgatg 15840
tgacaggtgc cggtctggcc ggggcggagc atccactgtt gggtgctgtg gtgccggtcg 15900
cgggtggtga tgaggtgttg ctgaccggca ggatttcggt ggggacccat ccgtggctgg 15960
ccgaacaccg ggtgctgggc gaagtgatcg tcccgggcac cgcgttgctg gagatcgcct 16020
tgcatgcggg ggaacgtctt ggttgtgaac gggtggaaga actcaccctg gaagcaccgc 16080
tggtccttcc ggagcgcggg gcgatgcagg ttcagctgcg agtgggtgcg cccgagaatt 16140
ccggacgcag gccgatggtg ctctactcgc gccccgaagg ggcggcggac catgactgga 16200
cacggcacgc cacgggccgg ttggcgccag gcggcggaga ggcggccgga gacctggccg 16260
actggccggc tcctggtgcg ctgccggtcg acctcgacga gttctatcgg gacctcgctg 16320
agcatggcct ggagtacggc ccgatcttcc aagggctcaa ggcggcctgg cggcaagggg 16380
acgaggtgta cgccgaagcc gcgctgccag gaacagaaga ctccggtttc ggggtgcatc 16440
cggcattgct ggacgcggct ctgcacgcaa cggctgtccg ggacatggat ggcgcatggt 16500
tgccattcca gtgggaaggt gtgtgcctgc acgccagggc cgcgtcggct ttgcgggtcc 16560
gcgtggtccc ggctggtgac gatgccaaat ccctgctggt gtgcgatggc accggtcgac 16620
cggtgatctc ggtggaccgg ctcgtgtttc ggtcggctgc ggccgggcgg accggtgcgc 16680
gccgacaggc ccatcgagct cggttgtacc ggttgggctg gccaacggtt caactgccga 16740
catccgctca gcccccgtcc tgcgtgcttc tcggcacctc ggaagtgtcc tctgacatgc 16800
aggtgtatcc ggacctccgg tcgttgacgg ccgcgttgga tgccggtgcc gaaccacccg 16860
gcgtcgtcat cgcacccacg ccccccggcg gtggacaaac agcggatgtc cgggagtcga 16920
ctcggcatgc actcgacctg gtacaaggct ggcttgccga tcagcgactc aacgattccc 16980
gattgttcct ggtgacacgg ggagcagtgg ccgtggagcc cggcgaaccc gtgaccgatc 17040
tggcgcaggc cgcgctctgg ggactgttgc gttcgacgca gaccgaacac cctgatcgct 17100
tcgtcctcgt cgatgtggct gagcccgcgc aactcctccc cgccctgccg ggggtgctgg 17160
cctgcggcga gcctcagctc gcactgcgac gtggcggcgc acacgcgccc agactggctg 17220
gactgggcgg cgatgacgtc ctgcccgtgc cggacagcat ggggtggcga ttggaggcca 17280
cgagcccagg aactctggat ggcttggcat tgctggacga accggcggcc acggcatcgc 17340
tgggtgacgg gcaggtcagg attgcgatgc gcgctgccgg ggtgaacttc cgggatgcgc 17400
tcatcgcgct cggcatgtat cccggtgcgg cttcgctggg cggtgagggg gccggggtcg 17460
tggtggagac cggccccggc gtcaccggcc tggcacccgg cgaccgggtg atggggatga 17520
tcccgaaggc gttcgggccg ctcgcagtcg ccgaccatcg catggtgacg aggattcccg 17580
ctggttggag cttcgcgcag gccgcatcgg tgccgatcgt ctttctcacc gcctactacg 17640
cgctggttga tctcgccggg ttgagaccag gggagtcgct gctggttcat tcggccgccg 17700
gtggggtggg catggccgcg atccaactcg ccaggcacct cggtgcagag gtgtacgcca 17760
ccgcaagcga ggacaagtgg caagccgtgg agctgacacg agaacgcctc gcttcgtcgc 17820
ggacgtgcga tttcgagaag cagttcctcg gggcgaccgg cggacgcggc gtcgacgtcg 17880
tgctcaactc cctcgccggg gacttcgccg atgcgtccct gcgaatgctg ccgcgcggtg 17940
gccgtttcct ggagttgggg aagacggatg ttcgtgaccc cgtcgaggtc gccgatgcgc 18000
atccgggcgt gtcctaccag gcgttcgaca ccgtagaggc cggcccgcag cgaatcggcg 18060
agatgcttga cgagctggtg gagctgttcg agggaggcgt gctggagccc ctgcctgtca 18120
cggcttggga cgttcggcag gcgcccgagg cgctacgaca cctgagccaa gcgcggcatg 18180
tgggcaagct ggtgctcacc atgcctccgg cgtgggacac cgccggcacg gttctggtta 18240
ccggcggaac gggagcactt ggagcagagg tcgcccggca cctcgtgatc gagcacgggg 18300
tgcgcaacct ggtgctcgtc agcagacgcg gtcccgcagc cagtggcgct gctgagctcg 18360
tggcgcaact gacggcctac ggtgccgagg tttccctgca ggcgtgtgat gtcgccgatc 18420
gtgagacctt ggcgaaggtg cttgccggca tcccggacga gcatacgttg accgccgtgg 18480
tgcacgcggc tggtgttctc gacgacggag tggccgaatc gctcacagcg caacggctgg 18540
accacgttct gcgcccgaag gtcgatggcg cgcgcaatct gcacgagctg atcgcacccg 18600
acgtggccct cgtgctgttc tcgtcggtgt cgggcgtgct cggcagcggt gggcagggta 18660
attacgcggc ggccaactcc ttcctcgacg cattggcgca gcaaaggcag tcgcgcggcc 18720
tacctacgag atcgttggcc tggggtccct gggcggaaca tggcatggca agcaccttgc 18780
gcgaagccga gcaggataga ttggcgctat ctgggctgct gccgatctcg accgaggagg 18840
ggttgtccca gttcgacgcc gcgtgcggcg gcgcgcatac cgtggtggct ccggttcgaa 18900
tcggccgctc gtccgacggg aacccgatca agtttcccgt cctgcgaggc ttggtcgagc 18960
cgcatcgcgt caacaaggcg accgcggatg atgccgagag catccggaaa cggttgggac 19020
gcttgccgga tgcagaacaa caccggattc tgctggacct cgtccgcacg cacgtggcgg 19080
cagtgctcgg attcgccggt ccccaggaga tcaccgcgga cggcacgttc aaggcgctgg 19140
gcttcgactc gttgaccgtg gtcgagttgc gcaaccggat caacggggca accgggctgc 19200
gactgcccgc caccctggtg ttcaactacc cgacgccgga tgcgctcgcc gcacacctcg 19260
tcaccgcgct ttccgcagac cgccttgccg ggacgttcga ggaactcgac aggtgggcgg 19320
cgaacctgcc cgcgctggcc agggatgagg ccacgcgggc gcagatcacc acccggctgc 19380
aggcgatctt gcagagcctg gcggacgtgt ctggcggaac cggcggcggc tccgtgccgg 19440
accggctcag atcggccacg gacgaagagc ttttccaact cctcgacaac gatctcgaac 19500
ttccctgatg cctcagccgg tgccttcgca gcttcctgga gggaaacgcc ccatgtcgaa 19560
cgaagagaag ctccgggagt acttgcggcg tgcgctcgtg gatctgcacc aggcgcgcga 19620
gcggcttgac gaggcggagt cgggggagca ggaacccatc gcgatcgtgg cgatgggctg 19680
tcggtacccg ggtggggtgc acgacccgga aggtctgtgg aaactggtcg cctccggtgg 19740
cgacgccatc ggtgaatttc ccgctgaccg tggttggcac ctcgacgagc tctacgatcc 19800
cgacccggat cagcccggaa cctgctacac ccggcacggc ggcttcctcc acgaggccgg 19860
cgagttcgac gcggggttct tcgacatcag cccccgtgag gcgctcgcca tggacccgca 19920
gcagcggctg ctgttggaaa tctcctggga gaccgtcgaa tccgctggga tggacccgag 19980
gtccttgcgg gggagccgca ccggggtgtt cgcgggattg atgtacgagg gctatgacac 20040
cggcgcccac ccggaaggtg tcgaaggcta tctcggaacc ggcaatgcgg ggagcgtcgc 20100
ctctggtcgg gttgcgtatt cgttcgggtt cgagggccca gcggtgacgg tggacacggc 20160
gtgctcgtcg tcgttggtgg ccctgcattt ggcgtgtcag tcgttgcggc agggcgagtg 20220
tgatttggcg ctggccggtg gagtgacggt gatggccacg ccggcgacgt tcgtggagtt 20280
ctcccgtcag cgtggtctcg caccggatgg gcggtgcaag tcgttcgcgg ctgctgcgga 20340
tggaaccggt tggggtgagg gtgccggctt ggtgttgctg gagcggctgt cggacgccag 20400
gcgcaacggg catcgggtac tggcggttgt tcgtggtagc gcggtgaatc aggacggtgc 20460
gtcgaacgga ttgacggccc ccaacgggct ggcccaggag cgggtcattc agcaggcgct 20520
cacgagtgcg gggctgtcgg tgtccgatgt ggacgttgtg gaggcgcatg ggacgggtac 20580
gcggcttggt gatccgatcg aggcgcaggc tctgatcgcc acctatggac aggatcggga 20640
ccgggatcgg ccgctgtggt tggggtcggt caagtccaac atcggtcata cgcaggcggc 20700
cgcgggcgtt gctggtgtga tcaagatggt catggcgatg cggcgcgggg agctgccgcg 20760
cacgttgcac gtggacgagc cgaattcgca cgtggactgg tcggctggtg cggtccggct 20820
cctcaccgag aacatccggt ggccagggac gggtacgcgc cgagttggcg tgtcgtcgtt 20880
cggggtaagc ggtaccaacg cacacgtcat cctcgaacac gacccgctcg ccttgaccga 20940
gaacgagaac gcagcggtgt ccccagcacc tgggatcgtg ccttgggcgt tgtccgggcg 21000
gtcgtcgacg gcgctgcgag cccaggccga acggctgagc gagctgtgcg agcagaccga 21060
tcccgacccc gtcgacgtcg gtttctcact ggccaccacg cgcacggcct gggagcaccg 21120
agcggtggtg cttggtggtg atagcgctac attgcgttcc ggacttggcg ttgtcgctag 21180
cggcgaaccg gcagtcgatg tcgttcaggg gagcgtcctg ggcggcgagg tcgtcttcgt 21240
ctttcccggt cagggttggc aatgggccgg tatggcagtc gacttgctgg acgcttcgcc 21300
gacgttcgcg cggcacatgg acgagtgcgc caccgcgctg cggaagtacg tggactggtc 21360
gttggtcgac gtgctgcgcg gagcggagaa cgccccaccg ctcgaccggg tggacgtcct 21420
gcaaccggtg tccttcgcgg tgatggtgtc gctcgccgag gtgtggcgtt cctacggggt 21480
gcggccggcg gccgtcgtcg gccacagtca aggcgagatc gccgcggcct gcgcagccgg 21540
ggtgctgcca ctggaggatg cggccaggct tgtcgccttg cgcagcagag cgttgaaggc 21600
actatcgggg cgaggtggca tggcgtcgct ggcttgctct gcggatgagg ccgcggcgtt 21660
gttcgcggga ttgggcggtc gtctggaaat tgcggcgatc aacggcccgc gatcggtcgt 21720
ggtgtccggc gatctggaag cggtggaaga actgctggca gagtgcgctg aaagggacat 21780
gcgtgcacgc cgtatccccg tcgactacgc ctcgcattcg gcacacgtgg aggtggttcg 21840
gagcccggtg ctggcggctg cggccggcgt gcggcaccgg gacggccagg tgccgtggtg 21900
gtcgacggtg atcggcgact ggttggatcc ggccgggctg gacggcgagt actggtaccg 21960
gaacctccgg cagccggtcc gattcgaaca cgccgtgcag ggcctggttg agcggggatt 22020
cggcctgttc atcgaaatga gtgcgcatcc ggtgctgacc atggcggtcg aggaaaccag 22080
tgccgagtcg gagtccgccg tggccgcggt aggtaccttg cgacgtgact cgggcggccg 22140
ccggaggttg ttacagtcgc tggccgaggc gtacgtgcgc ggcgccaccg tggactgggc 22200
cgtggcgttc gggggcgtgg gtcgacggct ggacctgccg acctacccgt tccagcgccg 22260
gcggtactgg ctggacaggg gagctgcctc cgaggaggct cgtgcgtttt cggacccggc 22320
ggcggactgg ttctggcaag ccgtggagcg ccaagacctg aaaggcgtgg ccgacgccct 22380
cgatctcgac gccgacgcac cgctgagcgc aacacttccg gccctgtccg tctggcaccg 22440
tcaggaacga gaaaaggtct tggtggacgg ttggcggtac cgagtcgact gggtaccggt 22500
ggccccgcag ccgatccgga gaacacggga aacctggctc ctggtcgttc ccgcgggcgg 22560
cattgaagaa gcgctggtcg aacggttgac ggatgcgttg aacacgcgag ggatcagcac 22620
cctgcgcctc gacgtgccac cgacggcgac aagtggggaa ctcgcgaccg gcctccgcgc 22680
cgcagttggc ggtgacccgg tgaagggaat cctgtcgctc actgcgttgg acgagcgaac 22740
acaccccgaa cgcaaggccg tccccagcgg gattgccttg ctgttgaacc tggtcaaggc 22800
gctcggtgaa ggcgacctca gagttcctct gtggacgatc acgcgtggtg cggtcaaggc 22860
agaccccgca gatcggctgc tgcgcccgat gcaggctcaa gcatggggtc tggggcgagt 22920
agccgcactc gaacaccccg agcgctgggg tgggctgatc gacctcccgg aatcgctgga 22980
cggcgacgtc ctcacaaggt tgggcgaagc gctcatcaac ggcttggcgg aggaccaact 23040
ggcgattcgc cagtcgggcg tgctggcccg gcgcctggta ccggccccgg cgaatcagcc 23100
cgctggacgt aagtggcgac cccgaggtag cgcgctgatc acgggcgggc tcggcgcggt 23160
gggcgcgcag gtggcgaggt ggttggccga aagcggagcc gagcgaatcg tgctcaccag 23220
tcgacggggc aaagaagcgc cgggcgccgc agagctggaa gccgaactcc gggcccttgg 23280
agcgcaagtg tccatcgtgg cttgtgacgt gaccgatcgt gccgagatgt ccgcactgct 23340
ggccgagttc ggcgtcaccg cggtgttcca cgcggccgga gtcggccggc tgctgccgct 23400
ggcggaaacc gaacagaacg acctggccga aatatgcacg gccaaggttc acggcgctca 23460
ggtgttggac gagctgtgcg acagcaccga tctcgatgcc ttcgtcctgt tctcctcggg 23520
tgccggggtc tggggcggtg gcggtcaggg cgcctatggc gcggcgaacg cattcttgga 23580
cacactcgcc gaacaacgcc gagcacgcgg tctgccggca accgcgatct cctggggcag 23640
ctggggcggc ggcatggccg acggcgcagc gggcgaactc ctgcggcgac ggggaatacg 23700
tccgatgccg gcggcgtcgg ccatcctggc tctgcaggaa gtactcgacc aggatgagac 23760
gtgcgtgtcg atcgctgatg tggactggga ccgattcgtg cccacgttcg ccgcgacccg 23820
cgccacccgg ttgctggacg aactgcccgc ggtgagaaag gcgatgtccg cgaacgggcc 23880
ggcagaacca ggcggctcgc cgttcgcccg caatctcgcg gagctgccgg aagcccaacg 23940
acgccacgaa ctggtagatc tggtcagcgc ccaggtggca gccgtgctcg ggcacggcag 24000
tcgcgaggaa gtccagcctg aacgggcgtt ccgcgcgctc gggttcgact ccctcatggc 24060
ggtggacctg cgcaatcgtt tgaccaccgc caccgggttg cgcctgccga ccacaactgt 24120
cttcgactac ccgaatccgg ccgcattggc cgctcacctg ctcgaggagc tggtgggcga 24180
tgtcgcgtcg gccgcggtga ccactgccat cgcgccgtcg actgacgaac cggtcgcgat 24240
cgtcgcgatg agctgccggt tccctggcgg cgcgcactcg ccggaagacc tgtggcggct 24300
ggtcgcctcc ggcgcggagg tgatcggcga gttcccctcc gaccggggtt gggatgcgga 24360
aagcctctac gatccggacg cttccaaacc tggaaccacg tatgcgcgga tggcgggatt 24420
cctttacgac gccggtgagt tcgatgccgg cttgttcggc atcagcccac gcgaggcgtt 24480
ggcgatggat ccgcagcagc ggttggtgct cgaaatcgcc tgggaagccc tcgaacgggc 24540
cggaatcgat ccgttgtcct tgaagggcag tggggtcggc acgtacatcg gtgctggaag 24600
ccgcgggtac gcgacggatg tgcggcagtt tcccgaggag gcggagggct acctcctgac 24660
gggtacctcc gccagtgtgc tgtcgggtcg ggtcgcgtac tcgtttggtt tcgagggtcc 24720
tgcggtgacg gtggacacgg cttgttcgtc gtcgttggtg gcgttgcact tggcgtgcca 24780
gtcgttgcgt tcgggtgagt gtgatctggc gttggccggt ggtgtgaccg taatgtcgac 24840
gccggagatg ttcgtggagt tctcccgtca gcgtggtttg gcgccggatg gccggtgcaa 24900
gtcgttcgcg gagagcgcgg acggcaccgg ctggggcgaa ggcgcgggcc tgttgttgct 24960
ggagcggttg tcggacgccc accggaatgg gcatcgggtg ttggcggtgg ttcgtgggtc 25020
tgcggtgaat caggacggcg catcgaacgg actggcggcg ccgaatggtc cgtcgcagca 25080
gcgggtgatc aaacaggcac tcgcgaatgc ggggctttcg gcgtctgatg tggacgccgt 25140
ggaggcccat ggaaccggga ccaggctggg tgatccgatc gaggcgcagg ccttgatcgc 25200
aacgtatggg cagggccggg agcgggatcg gccgttgtgg ttggggtcgg tcaagtcgaa 25260
catcggtcac acgcaggcgg cggcgggtgt tgccggtgtg atcaagatgg tgatgtccat 25320
gcggaacgac gagctgcccg ccacgctgca cgtgggtgcg cccacgtcgc aggtcgactg 25380
gtcggcgggg gcggtccggc tccttaccga acaggtacct tggccggagt ctgatcgcgt 25440
tcgtcgggtg ggggtgtctt cgttcgggat cagcggcacc aatgcacacg tgatcctcga 25500
acaatctacg aatgcgccag atagtcccgc ggccacggac aaatcaggat ccggatctac 25560
cgtggatatt ccggttgttc cctggttggt gtcgggacag acatcggatt ccctgcgggg 25620
acaggctgaa cgagtcttgt cccaggttga gtcccggccg gagcagcgtc cgctggatgt 25680
ggcctactcg cttgcttctg gccgagccgc gctggatgaa cgcgctgtcg tgctgggtgc 25740
ggaccggaat gagctggtag ctggattggt ggcgttggcc gccggtcatg aggcttccgg 25800
ggtgatcacc ggaactcgtg cttctgctcg gttcgggttc gtgttctcgg ggcagggcgg 25860
tcagtggttg gggatgggcc gggagctcta ctcgaagttt ccggtgttcg ctgctgcgtt 25920
tgatgaggct tgcgccgagt tggacgcaca tctgagtgaa gacctccggg tccgagatgt 25980
ggtcttcggt tccgatgcgc agctgctgga tcagacgttg tgggcgcagt cgggactgtt 26040
cgcgctgcaa gtcggcctct tggggctgct gggttcgtgg ggcgtccggc cggatgtggt 26100
gatggggcat tcggtcgggg agttggccgc cgcgtttgcg gctggagtgt tgtcgttgcg 26160
ggatgcggct cggttggtgg ccgcacgcgc ccggttgatg caagccctgc cctctgacgg 26220
cgcgatgctc gcggtggccg ctggtgaaga cctgattcgg ccattgctgg ctggtcggga 26280
ggcatccgtg aacgtcgccg cgctcaatgc ccccggttcg gtggtgttgt cgggtgatcg 26340
ggatgtgctg gccgacatcg ccggccggct gaacgagctc ggagtccgga cgagacggtt 26400
gcgggtctcc catgcttttc attcgcaccg gatggacccg atgttgggcg agttcgccca 26460
gatcgcggag tctgcggagt tcggtaggcc aacgacaccg cttgtgtcga cgttgacggg 26520
tgagctcgac agagctgggg aaatgagcac gccagggtat tgggtgcgtc aggtgcgtga 26580
acccgtccgt ttcgccgacg gtgtccgggc cctggcagcg cagggcgtag acacggttgt 26640
tgagctcggc ccggacggag cgctgtccgc actggttcag gagtgtgcca ccgggtttga 26700
tcgggtcggg cggatttcgc ctgttcccct gatgcgcagg gagcgggacg agacccgttc 26760
ggtgatgaca gccctggcgc atcttcacac ccgtggcggt gagttggact ggcaggcgtt 26820
tttctccggc accggggcca ggcaggtcga gttgcccacg tatgccttcc aacgacggca 26880
ctactggatc gaatccagtg cgcggacagc acgcgaccgc gcagacatcg gcgaggtggc 26940
tgaacagttc tggaccgcgg ttgaacaagg cgatctggaa gcattggtct ccgcactgga 27000
gcttggggcg gacgacgaca catgcgcatc tttgagcgat gtactgccgg cgctgtcatc 27060
ctggcgaagc ggactccgca accgttcgct cgtcgattcc tgccggtacc gaatcaattg 27120
gcattcctct cgggaagcac cggccccgaa gatttccggt acctggctgt tggtcgtgcc 27180
cggcgatgcg gatgacggct tggccacggc tttgacgagt tcactggtcg aaggtggcgc 27240
cgaggtcgtc cggatcgacc tgtccgaaga ggacctgcac cgcgaggacc tcgcacagcg 27300
gctggccaat gcgctgacgg atgtcggtcg actcggtggc gtgctgtcgc tgttggggct 27360
cgatgactcg gctgttggag aattctcctg cttgacaagg ggtttcgcgt tgactgtgca 27420
gctggtgcgg gccttgcgca acgccgagct cgaggcgcct ttgtgggcgg tgacgcgcgg 27480
cggcgtctcg ttggaagacg taagtgtgtc tcctgagcag gccttgattt gggggctgct 27540
gcgtgttgcg ggcctggagc atccggagtt ctggggtggc ttgatcgacc tgccatcgga 27600
ttgggacgac cgattgggtg cgcggttggt gggtgtgttg gcggatggtg gcgaggatca 27660
agttgccatt cgtcgtggtg gtgtgttcgt gcggcggttg gaacgcgccg gtgcgtcggg 27720
tgccgggtcg gtgtggcgtc ctcgggggac ggtgttggtg acgggtggta cgggcggttt 27780
gggggcgcat gttgctcggt ggttggcggg tgccggggct gagcatgtgg tgttgaccag 27840
ccgtcgtggc gcggaggctc cgggcgctgg ggaattgcga gcggagctgg aggcgctggg 27900
tgctcgggtg tcgattgtgc cctgcgatgt ggctgatcgt gacgccgtgg ctggagtgtt 27960
ggcagggatc ggcggggagt gtccgctgac tgcggtggtg cacgccgctg gggtcggcga 28020
ggcgggcggc gtggtggaga tggccttggc ggactttgca gaggtgttgt cggcgaaggt 28080
gcggggtgcg gcgaatctgg acgagttgct ggccgactcg gagttggatg cgtttgtgtt 28140
gttctcctcg gtgtcgggtg tgtggggtgc cgggggacaa ggtgcgtatg cggctgcgaa 28200
cgcctacttg gatgcgttgg ccgagcagcg tcgggcgagt gggttggccg ggaccgcggt 28260
tgcgtggggg ccgtgggcgg gtgacggcat ggccgcgggc gaaaccggcg cacagctgca 28320
tcgcatgggc ctggtgtcga tggaaccgag agcggctctg ctggcacttc agggcgcact 28380
ggaccgcgat gagacctccc tcgtcgtggc cgatgtcgac tgggcacggt tcgccccagc 28440
cttcacctcg gcacgtcggc gcccgctgct ggacaccatc gacgaggccc gagccgcatt 28500
ggaaaccacc agcgaaaaag cgggaacagg caaacccgtt gagctcaagc atcgcctggc 28560
cgggttgtca cggaaggaac gtgacgatgc ggtattggat ctggtgcggg cggaaacggc 28620
agctgtgctg ggacgcgacg atgccacggc cctggcgccg tcgcggccgt tccaggaact 28680
cggattcgac tccttgatgg cggtggagct gcgcaaccgg ctgaacaccg ccaccgggat 28740
ccagctgccc gccagcacga tcttcgacta ccccaatgcc gagtcgctgt cgcgtcacct 28800
ctgcgccggg cttttcccaa cagagacaac tgtggactcg gcccttgccg agctcgatcg 28860
aatcgagcag cagctctcga tgttcaccga ggaagcgcgg gcacgggacc gaatcgcgac 28920
acgactgcga gccctccacg cgaagtggaa cagcgcatct gaggcaccga ccggtgccga 28980
tgtcctgaac acactcgatt cggcaacgca cgacgagatc ttcgagttca tcgacaacga 29040
gctcgacctg tcctgagcag ttcctgcgga acgtccagtc gccgaaaccg ggtggaaatc 29100
acaatggcca atgaagaaaa gctcttcggc tatctgaaga aggtaactgc cgacctgcat 29160
cagacccggc agcgcctgct cgcagccgag agccggagtc aggagccgat cgtctccgcg 29220
agctgccggc tgcccggcgg cgtcgactct cccgaagcgc tttggcaact cgtgcgcact 29280
ggcactgacg ccatctcgga gttccccgcc gaccggggct gggatctcga ccggttgtac 29340
gatcctgacc cggaccacca gggaacctcg tacacgcggg ccggcggttt cctcgcagat 29400
gcgggcgatt tcgaccccgc catgttcggg atctcgccgc gtgaggcgtt ggcgatggac 29460
ccgcagcaac ggctgttgct ggagctgacc tgggaggccc tcgaacgggc gggaatagac 29520
ccgacatcgc tgcgcggcag caagaccggt gtcttcggcg gtgtcacgcc ccaggagtac 29580
gggccgccct tgccggagat gagccggaac tctggcggtt ttggactcac cgggcggatg 29640
gtgagtgtgg cgtcgggacg ggttgcgtat tcgtttggtt ttgagggtcc tgcggtgacg 29700
gtggatacgg cgtgttcgtc gtcgttggtg gcactgcatt tggcgtgtca gtcgttgcgt 29760
tccggcgaat gtgatctagc gttggccggc ggtgtgacgg tgatggccac gccggcgacg 29820
ttcgtggagt tctcccgtca gcgtggtttg gcgccggatg ggcggtgtaa gtcgtttgcg 29880
gctgctgcgg atggcaccgg gtggggtgag ggtgccggtc tagtgttgtt ggagcgcttg 29940
tcggatgccc ggcgcaatgg gcacaaggtt ctggcggtgg tccgtggtag cgcggtgaac 30000
caggacggcg cgtcgaatgg tttgacggcg ccgaatggtc cgtcgcagca gcgggtgatc 30060
acccaggcgt tgtcaaatgc agggttgtcg gtgtccgatg tggatgcggt cgaggcgcat 30120
gggacgggca cgcggcttgg tgatccgatc gaggcacagg ccctgatcgc cacgtacggg 30180
cagggccggg agaaggatcg gccgttgtgg ttggggtcgg tcaagtccaa catcggtcac 30240
acgcaggcgg ccgctggcgt tgccggcgtc atcaagatgg tcttggcgat gcggcacggg 30300
cagcttcccg ccacgttgca tgtggatgat cccacgtcgg cggtggactg gtcggcgggt 30360
tcggtccggc ttctcacgga gaacacgccc tggccggaca gtggtcgtcc ttgtcgggtg 30420
ggagtgtcgt cgttcgggat cagcggcacc aatgcacatg tcattctcga acaatctcca 30480
gtcgagcagg gcgaaccgac cgggccggtc gaaggcgagc gggaaccgga ggcagccatc 30540
cccgtggtgc cgtggatggt gtcgggtaag acaccggagg ccgcgcgggc ccaggccgaa 30600
cgggtgcttt cgcatatcga ggaccggccg gagctgtcgc cggtggatgt ggcgtattcg 30660
ctaggcatga cgcgtgcggc gctggatgaa cgcgcagtga tgttgggctc ggaccgtgac 30720
acgctcctga ccgggttgag ggcgttcgcc gacggttgcg acgtgcccga agtggtgtcg 30780
ggatctgtgg ggaatggggg ccgcgtcggg tttgtgttcg ccggccaggg tgggcagtgg 30840
ccggggatgg gccgggggct ctactcggtg tttccgggtt tcgccgatgc gtttgacgag 30900
gcttgcgctg agttggatac acacctgggc caggaactgg gggttcggga tgtggtgttc 30960
ggttcggatg cgcggctggt ggatcggacg gtgtgggcgc agtcggggtt gttcgcgttg 31020
caggttggtt tgttgcggct gctgggttcg tggggtgttc ggcctgatgt ggtgttgggg 31080
cattcggtgg gtgagctggc tgcggtgcac gcggcgggtg tgttgtcgtt gccggaggcg 31140
gcgcggttgg tggcgggtcg tgcccggttg atgcaggcat tgccttctgg tggtgccatg 31200
ttggcggtgg ccgcgagtga ggcccaggtc gaaccgttgc tggatcgggt gcggggccgg 31260
gtcgagatcg cggcgatcaa cggtccggga tcggttgtgc tctctggcga ccgcgagctg 31320
ctcaccgaga tcgccgatcg gttgcacgat caggggtgtc ggacgcgatg gttgcgggtg 31380
tcgcacgctt tccattcgcc ccacatggag ccgatgctgg aagagttcgc ccagatcgcc 31440
cgaagccgtg agtatcaagc acccgaactg ccgatcatct cgaccctgac cggtgagctg 31500
gacggtggtc gagtgatggg cactcccgag tactgggtgc gtcaggtgcg tgagcccgtc 31560
cgtttcgccg agggtgtcca ggcgcttgtc ggtcagggtg ccgacacgat tgtcgaattc 31620
ggtccggacg gggcgttgtc gacgttggtc gaggagtgtt tggcggaatc cgggcgggtg 31680
gccgggatcc cgctgatgcg caaggaccgc gacgaggcgc gaaccgtgct ggccgctttg 31740
gcgcagatcc acacccgtgg tggtgaggtg gaatggcagt cgtttttcgc cggcaccggg 31800
gcgaagcaag tcgagttgcc cacctacgct ttccagcggc agcgctactg gctggcatcc 31860
accggcggtg cgggtgacgt gaccgccgcg ggattggccg aggcggacca tccgttgctc 31920
ggtgcggtcg ttgcgttggc agacggcgaa ggtgtggtgc tgaccggtcg gctgacagcg 31980
gattcgcatc cgtggttgtc cgatcaccgg gtgctgggcg aaatcgtcgt ccccggcacc 32040
gcaatcgtcg agctggcgtg gcacgtcggc gagcgcctcg gttgtggccg ggtggaagaa 32100
ctggctttgg aagcgcccct gatcctgccg gatcatggag cggtccaggt tcaggtgctg 32160
gtgggaccgc ccggggaatc cggagcccgg tcggtggcgc tctactcccg ccctggagat 32220
gcgaccgaat ccgagtggaa gaagcacgcg acgggggtgc tgctgccacc cgtggccgcc 32280
gagaatcatg agctgcccgc ctggcccccg gagaatgcga ctgaaatcga tgccgacgag 32340
gtctacgaat tcctcgaagg gcacggtttc gcgtacggac cggcctttag atgtctgcgc 32400
ggtgcctggc gacgaggcgg ggaggtgttc gccgaagtcg cgttgccgga tggcatgcag 32460
gtgggggtgg atcgattcgg cgtccacccc gcgttgttgg acgcggttct gcatgccgcc 32520
gcggccgaga cgtccgtggt ccagagcgaa gcgcgggtgc cgttctcgtg gcgtggggtg 32580
gaacttcgcg ctaccgaaac cgcggtggtg cgggcacgca tctcgttgac cgcggatgac 32640
gagctgtcgt tggtcgcagt ggacccggtt ggcggattcg tggcctcggt cgattcgctg 32700
gtgacacgac cgatctcccg gcagcaggtg aggtctggcg cgatcggtga ttgcctgttc 32760
gaagtggagt ggcaccggag agcgttgttg gaaacagccg ccgacgacgg ccttgccatc 32820
gtcggtgacg gtgccagttg gccggaatcg gtgcgcgcaa ccgcacggtt cgcgaccctg 32880
gatgagctcc gttcggcggc ggactcggat gttcccgccc cgggtccggt gttggtcgca 32940
gctatgtcgg ccgaagaggt cgaaagtgaa tccctgccgt cgcgcgccca ggagtcgacc 33000
tccgatctgc tggctctcgt gcagtcgtgg cttgccgatg agcagttcgc cgaatcccag 33060
ctcgtggttg tcacgcgtgc agcggtgtcg gccgactcgg atacggacgt cgccgacctg 33120
gtgagtgcgt cgtcgtgggg gttgttgcgt tcagcccagt cggagaaccc gggtcgcttc 33180
gtactggtgg acgtggacgg cacaccagag tcgtggcagg cgttgccgac cgccgtgcga 33240
gcgggagaac cgcagctggc acttcggcgc ggcgtggcgc tggtgcctcg gttggcgcga 33300
ctcaaggcgc acggggaggg ctcctccccg cgactcgaca cggacgggac agtcctcatc 33360
accggtggca ctggtgcgtt gggtggagtg gttgcccgtc acctggtggc ggagcacggg 33420
atccggcgtt tggttttggc aggccggcgt ggctggaacg cgcctggagt ccacgatttg 33480
gtggatgagc tggcgcgctc gggcgctgtg gttgacgtgg tggcttgcga tgtgggtaac 33540
cggacagatc tggagcaggc gctggccgcc attccggtcg accgcccgtt gcgggggatc 33600
gtgcataccg ctggggtgtt ggccgacgga gtgctcgggt ccttgtcggc ggcggatgtg 33660
gacacggtgt tcgccccgaa ggtggcgggg gcgtggcatc tgcatgagtt gacccgcgag 33720
ctggatctgt cgttcttcgt tcttttctcg tccttctcgg ggattgcggg tgccgcgggg 33780
caggccaact acgcggcggc gaacacgttc ctggatgcat tggcaggtta tcgccgcgcg 33840
cgtggactac ccgggttgtc gttggcatgg ggactgtggg cgcaacccgg cggtatgacg 33900
agtggcttgg acgcggcgtc ggtggagcgg ttggcgcgga cgggcatagc agaacattcc 33960
acggaggatg gactccgcct gttcgatgcc gcgattgcga aggacagggc ttgcgtcgtt 34020
cccgctcgat tggacagggc gctgctggtc gagcacgcac ggtcgcacgc gattccagca 34080
ctgatgaccg cgttggctcc tgctcgtggc ggtgtggcga ggagagcaac caactctcag 34140
gccgcggatg aggacgcgct gttgggtttg gtgcgggacc acgtctcggc ggtactgggc 34200
tattcgggtg cggtcgaggt tgggggcgac cgtgctttcc gtgatctagg ttttgattcg 34260
ttgtctggag tggagttgcg gaaccgcctg gccggggtgc tgggggtgcg gttgccggcg 34320
actgcggtgt tcgattaccc gacgccgcgg gcgctggcgc gtttcttgca tcaggaattg 34380
gcaggcgagg tcgggtcgat gtcgacgccg gtgaccaggg cagcgagcgt cgaagaggat 34440
cttattgcga ttgtcgggat ggggtgtcgt tttccgggtg gggtgtcgtc gccggaggag 34500
ctttggcggt tggtggccgg gggcgtggat gcggtggctg ggttcccgga cgatcgcggc 34560
tgggatctgg cggggttgtt cgatccggat cccgatcatc tcggcacttc gtacgtatgt 34620
gagggcgggt ttctgcggga cgcggcggag ttcgatgccg acatgttcgg cgtcagcccg 34680
cgtgaggcgt tggcgatgga tccgcagcag cggttgctgc tggaggtcgc ttgggaaacc 34740
ctggagcggg ctgggatcga tccgttctcg ttgcacggca gccggaccgg tgtgttcgcg 34800
ggcttgatgt accacgacta cggggcccga ttcatcacca gagcaccgga gggcttcgaa 34860
gggcacctcg ggacgggtaa tgcggggagc gtgctgtcgg gtcgggttgc gtactcgttt 34920
ggttttgagg gtcctgcggt gacggtggat actgcgtgtt cgtcgtcgtt ggtggcgttg 34980
cacctggcgg gtcaagcact gcgggccggt gagtgcgaac tcgcccttgc cggtggcgtc 35040
acggtgatgt cgacgccgac gacgttcgtg gagttctccc gtcaacgggg actggctccg 35100
gatgggcggt gcaagtcgtt cgcggcggcc gcggatggca ccggttgggg agaaggcgcg 35160
ggcctggtgt tgctggagag gttgtcggat gcccggcgca acggacacaa ggtcctggcg 35220
gtggttcgtg gtagcgcggt gaaccaggac ggcgcgtcga atggtttgac cgcgccaaat 35280
ggcccgtcac agcaaagggt gatcacccag gcactcacga gtgccgggct gtccctgtcc 35340
gacgtggatg ctgtggaggc gcatgggacg ggcacgcggc taggtgatcc gatcgaggca 35400
caggcgttga tcgctacgta tggccgagat cgtgatcccg gtcggccgct gtggttgggg 35460
tcggtgaagt cgaatattgg tcatacccag gcggcagcgg gtgtggctgg tgtgatcaag 35520
atggtgatgg cgatgcggca tggggagctg ccgcgcacgt tgcacgtgga cgagccctcc 35580
gcgcaggtgg actggtctgc gggcacggtc caactcctca cggagaacac gccctggccc 35640
gacagcggtc gtcttcgtcg ggccggcgtg tcatcgttcg ggatcagcgg caccaacgcg 35700
cacctgatcc ttgaacaacc tccgcgagag acgcatcgcg caacagagcc ggattcgagt 35760
tctgtcctcg atgttccggt ggtgccgtgg atggtgtcgg gcaaaacacc cgaagcgcta 35820
tccgcccagg cagatgcact gatgtcctac ttgaacaatc gcgttgatgt ttctccacga 35880
gatatcgggt attcacttgc ggtgacccgt ccggcgttgg accaccgggc tgtcgtgctg 35940
ggtgcggatc gtgaagcgtt gctgccgggg ttgaaagcgc tggctgccag tcatgacgcc 36000
gctgaggtga tcacaggcac tcgtgccgct gggccggtcg gattcgtgtt ctccggtcaa 36060
ggtggtcagt ggcccgggat gggaagcggg ctctactcgg cgtttccggt gttcgccgac 36120
gcgtttgatg aagcctgcgg cgagctggat gcgcatctcg ggcagaaagc acgggttcga 36180
gacgtgatgt ccggttcgga taagcaactt ctggatcaga ctttgtgggc gcagtcgggc 36240
ctgtttgcgt tgcaagtcgg gctctgggag ttgttgggtt cgtggggtgt ccgacccggt 36300
gtggtgctgg gccattcggt cggtgagctg gcggcggcgt ttgcggctgg agtgttggcg 36360
ttgccggatg cggctcggtt ggtggcaggc cgtgcccggt tgatgcaagc cctgccacct 36420
ggcggtgcca tgctcgcggc ggctgctgga gagaaggagc tgcggccgtt gttggccgac 36480
cgggctgatc gtgtggggat cgccgcggtc aacgcacccg agtcggtggt gctctccggt 36540
gatcgggatg cgctcgatga catcgccggc cgactggacg ggcaaggggt ccggtcgagg 36600
tggttgcggg tttcgcatgc gtttcattcg catcggatgg atccgatgct ggaggagttc 36660
gccgaaatcg cacggagcgt ggactaccgg tcgccagggc tgccggctgt gtcgacgttg 36720
acgggtgagc tcgatgaggt cggcatgatg gctacgccgg agtattgggt gcgtcaggtg 36780
cgagaacccg tccgcttcgc cgacggtgtt gctgctctcg cggctcacgg tgtgagcagc 36840
atcgtcgagg tcggtccgga cggggtgttg tcggcgctgg tgcaggagtg tgcggccgga 36900
tccgatcagg gcggacgggt ggccgcggtt ccactcatgc gcagcaattg cgacgaggcg 36960
caaaaggtga taacggcctt ggcgcaggtc catgcgcgtg gtgctgaggt ggactggcgg 37020
tcgtttttcg ccggtaccgg ggcaaagcag gtcgagctgc ccacgtatgc cttccaacga 37080
cagcggtact ggcttgactc gccatccgaa ccggtcgggc aatccgccga tctcgcgccc 37140
cagtcgggct tctgggaact cgtcgagcag gaagatgtca gcgcgcttag cgccgccctg 37200
aatataaccg gcgatcccga cgtgcaggcg tccctggaat cggtggttcc ggtcctctcc 37260
tcctggcatc gccggatccg caacgaatcc ctggtgcacc agtggcggta ccgcatttcc 37320
tggcatgagc gggcagatct gccagaccgg tcgttgtcgg ggacatggct cgtcgtcgtg 37380
ccggagggtt ggtctacgag tcagcaagtt ctgcgtttcc gcgagatgtt cgaggaacgg 37440
ggttgcgcgg cggttttgtt cgagctcgcc gggcacgacg aggaagccct ggtgcaacga 37500
ttccgctcgt tgcctgtcgc gtcaggggga ataagcggcg tgctgtcctt gctggcgctg 37560
gatgaatcgc cgtcctcgtc gaacgctgcc ttgccgaatg gtgcgctgaa ctcattggta 37620
ctgctgcgag ctctgcggac cgcggatgtg ccggcgccat tgtggttggc gacgtgtggt 37680
ggggtggcgg taggggatgt gccggtgaat ccggggcagg cgctgatgtg gggactgggc 37740
cgcgtcgtcg gcctggaaaa tccggactgg tggggcggcc tggtcgacgt gccggacttg 37800
ctcgataagg acgctcaaga acgcttgtcg gtcgtgttgg ctggtcttgg cgaggacgag 37860
atcgcggtgc gccccgatgg cgtgttcgtg cggcggttgg aacgcgctga tttgccggat 37920
atggggtcgg catggcgtcc tcggggcacc gtgttggtga cgggtggtac gggcggtttg 37980
ggggcgcatg ttgctcggtg gctggcgggt gccggggccg agcatgtggt gttgaccagc 38040
cgtcgtggcg cggaggctcc gggcgctgga gatttgcgag cggagctgga ggcgctgggc 38100
gctcgggtgt cgatcagatc ctgcgatgtg gcagatcgtg acgctttggc cgaagtgttg 38160
gcgaccattc cggatgattg cccgctgacc gcggtgatgc atgcggcggg ggtcgttgaa 38220
gtcggcgacg tggcgtcgat gtgtctgacc gacttcattg gggtgctgtc ggcgaaggtg 38280
ggtggtgcgg cgaatctcga tgagttgctc gccgacgtcg agctggatgc cttcgtgctg 38340
ttctcctcgg tatcgggtgt gtggggtgct ggggggcagg gcgcttatgc ggcggcgaac 38400
gcctacttgg atgcgttggc gcagcagcgt cgggcaaggg gcttggccgg gactgcggtt 38460
gcgtgggggc cgtgggccgg tgacggcatg gccgcaggtg aaggcggcgc acagctgcgc 38520
cgtaccggcc tggtgccaat ggctgcggat cgcgcgttgc tggcacttca gggtgcattg 38580
gatcgagacg agacatccct ggtcgtagcc gatatggcat gggagaggtt cgccccggtg 38640
ttcgccatgt cccgtcggcg tccgctgctc gacgagctgc ccgaagcaca gcaggcgttg 38700
gcggatgcgg agaacaccac gggtgcggcg gactcggccg gcccgctgca gcggatcgtg 38760
ggcatggcag ccgccgaacg ccgccgggcg atgatggaac tggtgctggc ggagacctcg 38820
attgtgttgg ggcacaacgg gtcggatgca gtgagtcccg accgggcgtt ccaggagctc 38880
ggattcgatt cgctgatggc cgtcgaactg cgcaacaggc tgggcgaggc aacaggattg 38940
agtctgccga ccacgttgat cttcgattat ccgagcccat ccgctctggc ggagcagctg 39000
gtcggcgagc tggtgggagc gcagcccgcg accaccgtcg tggccggggc cgatccagtg 39060
gatgatccgg ttgtcgtggt cgcgatggga tgccggtatc cgggcgatgt ctgctcgcct 39120
gaggagctgt ggcagctggt ttccgcggga cgtgatgcgg tttcgacgtt ccccaccgat 39180
cggggttggg actgcgacgc gttgttcgac ccggatccgg atcgggcagg ccgtacctac 39240
gtgcgagaag gtgccttcct gaccggtgct gatcggttcg atgcggggtt cttcggcatc 39300
agccctcgcg aggcgcgagc aatggatccg cagcagaggt tgttgctcga ggtggcgtgg 39360
gaggttttcg aacgagcggg gatcgctccg ctgtcgttgc ggggcagcag gaccggtgtg 39420
ttcgcgggca ccaatggaca ggaccacggt gcgaaagtgg ctgccgcgcc ggaggcggcg 39480
ggtcacctcc tgaccggaaa cgccgcgagt gtcatggccg gccggatttc ctacacgttc 39540
ggcctcgagg gtcctgcggt ggcggtggat accgcgtgtt cgtcgtcatt ggtggcgttg 39600
catttggcgt gccagtcgct gcgttcgggt gagtgtgata tggcgttggc gggtggtgtg 39660
acggtgatgt cgacacccct ggcgttcctc gaattctctc gtcagcgcgg tttggcgccc 39720
gatggccggt gtaagtcgtt tgcggctgcg gcggatggca ccgggtgggg tgagggcgcc 39780
ggcctggttc tgctggagcg gttgtcggat gcgcgtcgta atggtcaccg ggtgttagcc 39840
gtggttcgcg ggtctgcggt gaatcaggat ggtgcgtcga atggcttgac ggcgccgaat 39900
ggcccgtcgc agcagcgggt gatccggcag gccctcgcga atgcgggact gtcggcgtcc 39960
gatgtggatg tcgtggaggc gcacgggacc ggcaccgggc tcggggatcc gatcgaggcg 40020
caggcactga tcgcggcata tgggcaggga cgggatcctg aacgggccct gtggttgggg 40080
tcgatcaagt ccaacatcgg ccacacgcag gcagcggccg gtgtggctgg ggtcatcaag 40140
atggtgcagg ccatgcggca tggggagttg cctgccacgt tgcacgtgga caaacccact 40200
ccgcaggtcg actggtctgc cggggccgtt cggctcctca ccgggaacac gccctggccc 40260
gagagcggcc gtcctcgtcg agctggggtg tcgtcgttcg ggatcagcgg caccaacgca 40320
cacctcatcc tcgaacaacc gccgtcggaa ccagcggaga tcgaccgttc gaatcggcgg 40380
gtcactgcgc atccggcggt gatcccgtgg atgttgtcgg ccaggagtct cacagcgctg 40440
caggcccagg cggctgcgct gcagggccgg ctggaccggg tgcctggcgc ttctccgctg 40500
gatttggggt attcactcgc gaccactcgt tctgtgctgg acgagcgcgc cgtcgtgtgg 40560
ggtgccgatc gggagaccct gttgtcgagg ctggcagcgc tggccgatgg ccggactgcg 40620
ccgggggtgg tcaccggcgc tgcgaattcc ggtggccgca tcggattcgt tttttccggt 40680
cagggcagtc agtggctggg gatgggaaag gcgttgtgcg cggctttccc ggcgttcgca 40740
gacgccttcg aggaagcctg cgacgcgctg ggcgcgcact tgggcgcgca cttgggcgcg 40800
gacttgggcg tggacgtccg gggcgtgctg ttcggtgctg atgagcaggt gctcgaccgg 40860
acgttgtggg cgcagccggg gatcttcgcg gttcaggtcg gcctcctggg attgctgagg 40920
tcgtggggcg tgcggccaga cgcggtgctg gggcactcgg tcggcgagtt ggctgcggcg 40980
cacgcggctg gtgtgttgtc cttgccggac gcggcacggt tggttgcggc ccgggccagc 41040
ctgatgcagg cattgcccac cggcggcgca atgctcgcgg tcgccaccag cgaggcggcg 41100
gtcgaaccgc tgcttgccgg gatgtgcgat cgggtcagca tcgctgcgat caacggcccg 41160
gagtcggtag tgctctccgg cgaccgcgac gtgctcgcag aggtcgccgg cgaactcgat 41220
gcccgagggc ttaggaccaa atggttgcgg gtctcccacg ctttccactc gcaccggatg 41280
caaccgattc tggacgagta cgccgaaacc gccgggtgcg tcgagttcgg tgaaccggtg 41340
gtgccgatcg tctccgccgc gaccggtgcg ctggacaccg ccggactgat gtgcgcagcc 41400
ggctactggg tgcgccaggt gcgtgatccc gtccgcttcg gagacggtgt ccaagcgctc 41460
gtggaccaag gcgtggacac gatcgtcgag ttcggcccgg acggggcgtt gtcggccttg 41520
gtccagcagt gcttggccgg gtccgaccag gccgggaggg tggcggcgat cccgctgatg 41580
cgcagggacc gcgatgaggt cgagaccgcg gtggctgccc tggcgcacgt gcatgtccgc 41640
ggcggtgcgg tggactggtc ggcttgcttc gccggcacgg gcgctcgcac cgtcgagttg 41700
cccacctacg ccttccagcg gcagcggtac tggctggccg ggcaagcgga cgggcgtggc 41760
ggcgatgtgg ttgccgaccc ggtcaacgcg cgcttctggg agttggtcga gcgcgccgat 41820
ccggaaccgt tggtggatga gctctgcatc gaccgggacc agcccttcag ggaggtgctg 41880
cccgtgctgg cttcctggcg cgagaaacaa cgccagaagg ccgtcacgga ttcttggcgc 41940
taccaggtgc ggtggaggtc cgtcgaggtg cagtccgcag ccagcctccg gggcgtgtgg 42000
ctggtggtgc ttccagctga cggactccga gatcaaccgg cggccgtcat cgacgcgctg 42060
atcgcgcgcg gcgccgaggt cgcggtcctg gaattgaccg agcaggactt ccaacgcggt 42120
gcgcttgtgg acaaggtgcg cgccgtcatt gccgaccgca ccgaggtgac gggtgtgctg 42180
tctctgttgg caatggacgg aatgccctgc gcagagcatc cgcacctgtc ccgtggtgtc 42240
gccgctaccg tgatcctgac gcaggtgttg ggcgatgcgg gcgtttccgc cccgctgtgg 42300
ctggccacga ctggtggcgt cgaggtcggg accgaggacg gtccggccga tccggaccac 42360
ggcttgatct gggggctcgg cagggtcgtc ggccttgaac atccgcagcg gtggggtggc 42420
ctgatcgacc ttccggcgac actggacgag acgtcccgga acgggttggt ggccgcgctc 42480
gccgggacgg cggccgaaga tcagctcgcc gtgcgttcat ccgggttgtt cgttcgcaga 42540
gtggtgcgcg cagcgcagaa ttcccgttca gggacatggc gtagccgggg aacggtcctc 42600
atcacgggcg gaacaggcgc gctcggtgcc gaggtcgcac gatggctggc ccggcggggt 42660
gctgagcatc tggtgttgat cagtcgccgc ggtccggaag ctcccggcgc cgcggacctg 42720
caggccgagc tgaccgagct cggcgtgaaa gtcacagtcg tggcctgtga tgtgacggac 42780
ggcgacgaac tgagggcggt gctggcggcc gttccgacgg agcatccgct gtcggcggta 42840
gtgcacaccg ccggcgtcgg gacgcctgcg aacctggccg agacgacctt ggcgcagttc 42900
gccgacgtgt tgtcggccaa ggtcgtcggc gcggcgaacc tggaccggct gcttggtggg 42960
caaccgttgg acgccttcgt gctgttctcc tcgatctcgg gggtttgggg agccggcggc 43020
caaggagcct attcggccgc caatgcgtat ctcgatgccc ttgccgagcg ccgacgggct 43080
tgcggtcggc cggcgacgtg cgtcgcctgg ggtccgtggg ccggtgcggg catggccgtt 43140
caggaaggca acgaggcgca tctccgccga aggggcctgg taccgatgga accgcagtcg 43200
gccctctccg cgctgcaaca ggccctgtcc cgacgagaaa ccgccatcac cgtcgcagat 43260
gtggactggg aacgattcgc cgccactttc accgcggccc gcccgcggcc actattggat 43320
gagatcgtgg atctacggcc caacaccgag actgcggaga agcacggtgc cggcgagctg 43380
gggcagcagc tggccgcact gccggccgct gagcgcggac atctgctgct ggaggtggtg 43440
ctggcggaaa ccgccaacac cctggggcac gattcggcgg aggctgtgca acccgatcgg 43500
accttcgccg aactgggctt cgattcgctt accgcggtag agctgcgcaa caggttgaac 43560
gcggtgaccg ggcttcgcct gccgccgacg ctggttttcg accacccgac accgctggcg 43620
gtgtccgaac agttggttcc ggcgttggtc gcggagccgg gcgatggcat cgagtcgttg 43680
ctcgcggagc tcgacaggct ggataccacg ttggcgcaac gaccttcgat cccaccggaa 43740
gaccaggcca aggtggcgga gcgcttgcag gcactcatcg ccaagtggga cggggcgcgt 43800
gatggcacgg ccaaagtgac gtcaccccaa tcgctgacgg cggccacgga cgacgaaatc 43860
ttcgacctca tcgaccggaa gttccggcgc tgaccgcctt cttcctcgcc tcagctcccc 43920
tgatcactgg aacggtgtat ttcgatggcc aatgaagaaa agctccgcga gtacctcaag 43980
cgtgtcgtcg tcgaactgga ggaggcgcac gaacgcctgc acgagttgga gcgccaggag 44040
cacgacccca tcgcgatcgt gtcgatggga tgccgttatc ccggtggcgt ctccactccg 44100
gaggagctgt ggcgactggt cgtcgacgga ggagacgcga tcgcgaactt ccccgaagac 44160
cgtggctgga acctgggcga gctgttcgat cctgatccgg gtcgagccgg gacctcctac 44220
gtccgcgagg gtggtttcct gcgcggagtc gcggacttcg atgccgggct cttcgggatc 44280
agtccgcgcg aggcgcaggc gatggacccg caacagcggt tgctgctgga gatctcgtgg 44340
gaagtgctcg agcgcgccgg tatcgacccg ttttccttgc ggggcaccaa gaccagtgtg 44400
tttgcgggcc tgatttacca cgactacgcg tcgcggttca gcaagacccc agccgagttc 44460
gagggttact tcgccaccgg gaacgcgggc agcgtcgcat ccggccgggt ggcttacacc 44520
ttcggattgg agggcccggc ggtcaccgtg gacaccgcct gctcgtcgtc cctggtggcg 44580
ttgcacctgg cctgccagtc cctgcggctg ggcgaatgcg acctggccct ggccggtggc 44640
atttcggtga tggccacgcc gggagccttc gtcgagttca gccggcaacg cgcactcgcc 44700
tcggatggcc ggtgcaagcc cttcgcggat gccgcggacg gcacgggctg gggcgagggc 44760
gccggaatgc tgctgctgga acggctgtcg gacgcacggc gaaacggcca cccggtgctg 44820
gcggcggtag tcggttccgc gatcaaccag gacgggatgt ccaacggcct gaccgcgccc 44880
agcggtcccg cacagcagcg agtgatccgc caggccctga cgaacgccgg gttgtcgccc 44940
gccgaggtcg atgtggtcga ggcgcacggt acgggcacgg ccttgggcga cccgatcgag 45000
gcgcgggccc tgatcgccac ctacggggcg aaccggtcgg cggatcaccc gctgctgctg 45060
ggttccctca agtcgaacat cggccacacc caggctgccg ccggtgtggc cggggtgatc 45120
aagtcggtca tggccatcag gcaccgggag atgccccgca gcctgcacat cgaccagccc 45180
tcgcggcacg tggactggtc ggcgggcgcg gtgcggctgc tcacggacag cgttgactgg 45240
gcggatcccg gccggccgcg ccgagcaggg gtgtcctcgt tcggcatgag cggtaccaac 45300
gcacacctga tcgtcgagga agtatccgac gagccggtct cgggcagtac cgagccgacc 45360
ggggcacttc cctggccgct gtccggcaag acggagaccg cattgcgcga gcaggctgcc 45420
gagctgctct ccgccgtgac cgcgcacccg gagccgggtc tggggaacgt cgggtactcg 45480
ctggccaccg gtcgcgctgc gatggagcac cgggctgtcg tggttgccga ggatcgggac 45540
tccttcgtcg ccggactgac ggcgttggct gcgggcgttc cggcagccaa cgtggtgcaa 45600
ggggcggccg actgcaaagg aaaggtcgcg ttcgtgttcc ccggccaggg ctcgcattgg 45660
caggggatgg cgagggaact gttcgaatcc tcgccggtgt tccggcggaa gctggaggaa 45720
tgcgcggcgg ctacggcccc ctacgtggac tggtcgctgc tcggcgtcct tcgcggtgat 45780
cccgatgcac ccgcactgga tcgcgacgac gtgattcagt tcgcgctgtt cgccatgatg 45840
gtgtcgctgg cagaactgtg gcgttcgtgc ggagtggagc ccgccgcggt ggtcggtcac 45900
tcccagggcg agatcgccgc cgcccatgtg gcgggggctt tgtccttgac ggatgcggtg 45960
cgcatcgtcg ctgcccgctg caatgcggtg tcggtgcttg cagggaaagg aggcatgctc 46020
gcgatcgcct tgccggaaag cgcagtggtg aagcgaatcg caggcctgcc agagttgacc 46080
gttgcagcgg tcaacggacc cggctccact gtcgtttccg gcgaaccgtc cgctctggag 46140
cgtttgcaga ccgaactgtc cgcggagaac gtgcaggctc ggcgggtgcg aattgattac 46200
gcctcgcact cggcgcagat cgcacaggtc cagggccggc ttctggaccg gctgggcgag 46260
gtcgggtccg aacctgctga gatcgctttc tactcgacgg tgaccggcga gcggacggac 46320
accggccggc ttgacgcgga ctactggtac cagaaccttc ggcagcccgt ccggttccag 46380
cagaccgtcg cccggatggc agatcagggc tatcggttct tcgtcgaggt gagcccgcac 46440
ccgctgctca ccgcgggaat ccaggaaacg ctggaagccg cggacgcgga cgcgggcggg 46500
gtggtggtcg gttcgctgcg gggtggcgag ggcggctccc ggcgctggct gacttcgctg 46560
gccgagtgcc aggtgcgcgg actaccggtg aattgggaac aggtattcct cgacaccgga 46620
gcccgacgcg tgccgctgcc gacatacccg ttccagcggc agcggtactg gttggagtcc 46680
gccgagtacg acgcgggcga tctcggttcg gtgggcttgc gctccgcgga gcatcccctg 46740
ctcggggctg cggtgacact ggccgatgcg ggcgggttcc tgctgaccgg caagctgtcg 46800
gtcaagaccc agccctggtt ggccgaccac gcggtccgtg gggcgatcct gctgcccggc 46860
accgccttcg tggaaatgct gatacgcgcc gcggaccagg tcgggtgcga tctgatcgag 46920
gagttgtccc tgacgactcc gctggttctg cccgcgaccg gtgcggtgca ggtgcagatc 46980
gcggttggcg gtccggacga ggccgggcgc cgctcggtcc gcgtgcattc ctgtcgggac 47040
gactccgtgc cgcaggactc gtggacctgc cacgcgaccg gcacgttgac caccagtgag 47100
caccgggacg ccggccaggc ccgcgatggg atttggccgc cgaacgatgc tgtcgcggtt 47160
ccactggaca gcttttacgc gcgcgcagct gagcggggct tcgatttcgg tccggcgttc 47220
caggggttgc aggcggtttg gaaacgcgga gacgagatct tcgccgaggt cggcctgccc 47280
gcagcacagc gcgaggacgc cggcaggttc ggagtccacc cggctctgct ggatgcggca 47340
ctgcaggcgc tgggcgcagc cgaggaggat ccggacgagg gatggctccc cttcgcgtgg 47400
caaggtgtgt ccctcaaggc gaccggcgcg ctttcgcttc gggtgcacat cgtcccggcg 47460
ggtgcgaacg cggtgtcggt gttcacgacc gacgcgacgg gccaagccgt gctctccatc 47520
gactcgctgg tgctgcgcaa gatttcggac gagcagttgg cagcggtccg tgcgatggac 47580
cacgagtccc tgttccgggt cgactggagg cgaatctcgc ccggcgctgc caagccggtc 47640
tcctgggcag tgatcggcaa tgacgaactc gctcgagcct gcggctcggc acttggcacg 47700
gaactccacc ccgacctgac cgggttggct gacccgcccc cggacgtggt ggtggtgcca 47760
tgcggtgcgt ttcaccagga cttggaggtt gcttccgagg cacgtgccgc aacgcaacgc 47820
gtgcttgacc tgatccaggg ttggttggcg gcggagcgat tcgccggatc tcgcctggtg 47880
gtggtgacgt gtggtgcggt gtcgaccggg cccgccgagg gtgtttccga cctggtgcat 47940
gctgcgtcgt ggggcttgtt gcgttcagcg cagtcggaga acccgaatcg attcgtgttg 48000
gtcgatgtgg acgcaaccgc cgagtcatgg cgcgcgctcg cggcggcggt gcgttccgga 48060
gaaccgcagc tagcgctgcg cgccggcgaa gtccgagtgc ctcgcctgac acgatgtgtt 48120
gccgccgagg acagccggat cccagtgcct ggtgcggatg ggacggtgtt gatttccggc 48180
ggtacgggcc tgctgggcgg gttggtagcc cggcatttgg tggcggagcg cggtgtccgc 48240
cgcctggtgc tggcagggcg acgcggctgg agcgcccccg gggtcaccga attggtggat 48300
gagctggtgg gcctgggagc tgtggtcgag gtggcgagct gtgatgtcgg ggaccgggcc 48360
cagctggacc ggctgctgac gacgatctcg gcagagttcc cgctgcgcgg agtggtgcat 48420
gcggccgggg cactggccga cggggtcgtc gagtcgttga caccagagca cgtggcaaag 48480
gtgttcgggc cgaaggtcgc cggtgcgtgg cacctgcacg agctgacccg tgaactggat 48540
ctctcgttct tcgtgctctt ctcctcgttc tccggggtgg tgggggctgc gggtcaagga 48600
aactacgcgg cggcgaacgc gttcctggac ggcctggctc agcaccggcg gacggcggga 48660
ctgcctgcgg tgtcgctggc ttggggcttg tgggagccga ccagcgggat gaccggagcg 48720
ctcgatgcgg cggaccgcag ccgcatttcg cgcaccaatc cgccgatgtc cgcggaggac 48780
gggttgcggc tgttcgagat ggcgtttcat gttccgggcg aatcgcttct ggtcccggtc 48840
cacatcgacc tgaacgccct gcgcgccgat gcggccgacg gcggtgtgcc tgcgttgttg 48900
cacgacctgg tgcccgcgcc cgtgcggcgg agcgcggtca acgagtcgga ggatgtcacc 48960
ggtctggtcg gtcggctgcg gaggcttccg gacctggatc aggaaaccct gctgttgggt 49020
ttggtgcggg agcatgtttc ggctgtgctg gggtattcgg gtgcggtcga ggttggggtc 49080
gagcgtgctt tccgggattt gggttttgat tcgttgtccg gtgtggagtt gcggaaccgg 49140
cttggcgggg tgctgggcgt tcggttgccg gctactgcgg tgttcgacta tccgacaccg 49200
cgggccttgg ttcggttctt gcgcgacaaa ctgattggtg gcgtggaggc acgcaattcg 49260
gcaccggcgg ttgtggaggc ggccagtggt gacgacccgg ttgtgatcgt ggggatgggg 49320
tgtcgttttc cgggtggggt gtcctcgccg gaggagcttt ggcgtttggt ggccgggggc 49380
ttggatgcgg tggcggagtt ccccgacgat cgtggttggg atcaggcggg gttgttcgat 49440
ccggatcccg atcgtctcgg gacttcgtat gtgtgtgagg gcggcttcct gcgagatgcg 49500
gcggagttcg atgccggttt cttcgggatt tccccgcgtg aggcgttggc gatggatccg 49560
cagcagcggt tgttgctgga gatcgcttgg gagaccttgg agcgggcggg gattgatccg 49620
ctttcgttgc gagggagtcg gaccggcgtg ttcgcggggc tgatgcacca cgactacggc 49680
gcgcggttcg tcaccagggc gccggagggt ttcgagggtt atctaggtaa tggcagcgcg 49740
ggcggcgtct tttcgggtcg ggtcgcgtat tcgtttggtt tcgagggtcc tgcggtgacg 49800
gtggatacgg cgtgttcgtc gtcgttggtg tccatgcacc tggcgggtca agcactgcgg 49860
tctggtgagt gtgatctggc tcttgcgggt ggtgtgacgg tgatggccac gccggggatg 49920
ttcgtggagt tttcgcgcca gaggggtttg gcggcggacg ggcggtgtaa gtcgtttgcg 49980
gctgctgcgg atggcaccgg ctggggcgaa ggcgcgggcc tggtgttgtt ggagcggctg 50040
tcggatgccc ggcgcaacgg gcacgcggtt ctggcggtcg tgcggggtag cgcggtgaat 50100
caggatggtg cgtcgaatgg tttgacggcg ccgaatggtc cgtcgcagca gcgggtgatc 50160
acgcaggcgt tggcgagtgc gggtctgtcg gtgtctgatg tggacgctgt ggaggcgcat 50220
gggactggga ccaggcttgg tgatccgatt gaggcgcagg cactgattgc cacttacggg 50280
caggagcggg atagggatcg gccgttgtgg ttggggtcgg tgaagtcgaa tattggtcat 50340
acgcaggcgg cagcgggtgt tgctggtgtg atcaagatgg tgatggcgat gcggcacgag 50400
cagctgcccg ccacgttgca tgtggatgaa cctacgccgg aagtggattg gtcggcgggg 50460
gaggtccagc tccttacgga gaacacgccc tggcccgaca gcggccatcc tcgtcgggcg 50520
ggagtgtcgt cgttcggcat cagcggcacc aacgcacatg tcatcctcga acaagcctcg 50580
aatacaccag acgagattgc gcagagcaac ggtcccgaat cggaatctac cgtggacatc 50640
ccagcggtcc cgttgatcgt gtcgggcaga acaccggaag cgctcagcgc tcaggcgagc 50700
gcattgatgt cctatttgga taatcgtccc gatatttcat cccttgatgc cgcgttttcg 50760
ttggcttctt cccgggccgc gttggaggag cgggcggtcg tgctgggagc ggaccgtgaa 50820
gcgctgttgt ctgggttgga agcgctggct gccggtcgcg acgcttctgg ggtggtgtcg 50880
ggatccctga tctctggcgg ggttgggttt gtgttttccg gtcagggtgg tcagtggctg 50940
gggatgggaa gagggctcta ctcggcgttt ccggtgttcg ctgacgcgtt tgacgaagct 51000
tgtgccggac tggatgcgca tctggggcag caggtggggg ttcgggatgt ggtgttcggt 51060
tccgacgggt ccttgctgga tcggacgttg tgggcgcagt cgggtttgtt cgcgttgcag 51120
gttggtttgc tgaggctgct gggctcgtgg ggtgttcggc ctggtgtggt gatggggcat 51180
tcggtgggtg agtttgcggc ggcgtttgcg gcgggtgtgt tgtcgttgcc ggatgcggct 51240
cggttggtgg cgggtcgtgc ccggttgatg caggcgttgc cggatggcgg tgccatgttg 51300
gcggtggctg ctggcgagga gcagctgcgg ccgttgttgg ccgctcgggg tgaaggggtg 51360
gggatcgccg cggtcaacgc ttctgagtcg gtggtgctct ccggcgatcg ggaggtgctt 51420
gaggacattg ccggcgggct ggatgggcaa ggggttcggt ggcggtggtt gcgggtttcg 51480
catgcgtttc attcgtatcg gatggacccg atgctgcagg agttcacaga tatcgcaggc 51540
agcgtggact accggcgttg cgacctgccg gtcgtgtcga cgttgacggg tgagctcgac 51600
accgctggca tgctggctac accagggtat tgggtgcgtc aggtgcgtga gcccgtccgc 51660
ttcgccgacg gggttcgggc gctcgcgcag cagggggtcg gcacgatctt cgagcttggc 51720
cctgatgcga ttctgtcggc tctgattcct gattgtcatt cctggggtga tcagactgtg 51780
ccgattccgt tgctgcgcaa ggaccgcgct gaacccgaaa ctgtggtcgc cgcggtggcg 51840
cgggcgcaca cgcgtggtgt tcaggtcgat tggtcggcgt ttttcgctgg taccggggct 51900
gggcgggtcg agttgccgac gtatgccttc cagcggcagc ggtattggct ggagtcatcg 51960
gtttccggtg atgtgacagg tatcggtctg gctggggcgg agcatccgtt gctgggggcc 52020
gtggttgtgt tggccgacgg tgatgggatg gtgttgaccg gtcggttgtc ggtggggacg 52080
catcggtggc tggccgagca tcgtgtgctg ggggaggtcg tggttcccgg cacggctatc 52140
ctggagatgg tcttgcatgc gggggcgcgg gttggttgtg gccgggtgga ggagctcacc 52200
ctggaagcac cgctggtggt gcccgaacgc gatgccatcg aaatccagct gctggtgaac 52260
gcgcccgacg acaagggtcg gcggtccgtg tcgctgcatt cccgcccggc cggtgggtct 52320
gggggtgggg gttggacgcg gcacgccacg ggcgaactcg tcgtcgccgg cacgggtggt 52380
ggggcggtta ctggttggtc gactgagggt gccgagccgg ttgctctcgg tgagttttat 52440
gtcgttcagg cggggaacgg gttcgagtat gggccgttgt tccaggggct tcgggcggcg 52500
tggcgtcgtg gtggcgaggt tctcgcggag gtcgccctgc cggcagcggc tggtgcgatg 52560
gcggggttct tgatcaatcc ggcgttgctg gatgccgcct tgcaggcgtc cgcgctgggt 52620
gaccgtccgg cggagggtgg tgcgtggctg ccgttctctt ttaccggggt agaactttcc 52680
ggtcagggtg ggacgatcag cagggcacgg gtggagtcta cgcgacccga tgcggtgtcg 52740
gtggctgtga tggatgaggg tgggcggttg ctcgcctcga tcgattctct ccggttgcgg 52800
ccggtgtcgt cggtgcggtt ggcgaatcgg gacgttgtcg gtgacgcgct gttcgaggtg 52860
acttgggagc cggtggcgac gcggtcgacg gtatcgggtc gctgggcgtt gcttggtgat 52920
gctgtcggcg gcatggccgg tctcattggg ctcgcaccag gttccgtcga tcgttgtgcg 52980
ggtctggctg agctcgcggg gaaccttgat tccggtgcgc tggttgctga tgtcgtggtt 53040
tattgcgccg gtgaacaggc ggatcccgac gccggcgtgg cggcactcgc ggagacccgg 53100
gagatgctgg ccctggtcca gtcgtggttg gccgaggagc ggttggccgg gtcacgtctg 53160
gtggtggtga cgtgtggcgc ggtgacgacg gctgcgggtg acggcgcatc aaagctggcg 53220
catgcgccgt tgtgggggtt gttgcgttca gcgcagtcgg agaacccggg ccggtttgtg 53280
ctggtcgatg tggacggtac cgccgagtcg tggcgcgcgt tgccgagtgc ggtggggtcg 53340
atgcaaccgc agttggccgt gcgtaagggt gtggtgacag tgccgcgtgt ggcgtcggtt 53400
ccggggccgg tcgaggtgcc cgcggtggtg gccggtcccg accggacggt gctgatttcc 53460
ggtggcacgg gtctgttggg tggcgtggtg gcacgccacc tggtggccga gcgcggtgtt 53520
cgtcgagtgg tgttgacggg ccgtcgtggc tgggatgctc ccggaatcac cgagttggtg 53580
ggtgagctgg agggtttcgg tgcggtggtc gatgtggtgg cgtgcgacgt tgcggatcgt 53640
gctggtctgg aggggttgct ggcggcggtc ccggcggagt ttccgctgtg tggtgtggtg 53700
catgccgcgg gtgtgctggc tgacggggtg atcgagtcgt tgacaccgga ggacgtgggg 53760
gcggtgttcg gtccgaaggc ggcgggggcg tggaacctgc acgagctgac tcgggatatg 53820
gacttgtcgt ttttcgcgtt gttctcctcg ctgtccgggg tgaccggcgc cgcgggtcag 53880
ggtaattatg cggcggcgaa cacgttcctg gacgcattgg cgcattaccg gcgggcgcag 53940
ggattgcctg cggtgtcgtt ggcgtggggc ttgtgggagc agtcgagcgg gatgaccggg 54000
cggctcagtg atgtcgaccg gagcaggatc gcccgctcca gtccaccgtt gtccaccaag 54060
gatggtttgc ggctgttcga tgccgggctg gcgttggatc gggcagcggt ggttccggcg 54120
aggttggaca gggccttcct ggccgagcag gcccggtcgg gaacgctacc cgcgatgctg 54180
acggcactgg tacctaccat cacctctatc aggcgcagta gtggcaccga cctcgcggac 54240
gaggacgcct tgcttggggt ggtgcgggag cacgccgcga gggtgctggg gtattcgggt 54300
gcggccgagg tcggggtcga gcgtgctttc cgggatctgg gctttgattc gttgtctggt 54360
gtggagttgc gtaatcggct ggccggggtg ctgggagccc ggctgccggc aaccgccgta 54420
ttcgactacc cgacgccgcg ggcgttggcc cggttcctgc accaggaact ggcaggcgag 54480
gtcgggacga cgccggcgcc ggtgacgacc acgaccgcga gcgtcgaaga cgatctcgtc 54540
gcgatagtcg ggatggggtg tcgttatccg ggtggggtgt cctcaccgga ggagctttgg 54600
cgtttggtgg ccgggggcgt ggatgcggtc gcggacttcc cggacgatcg cggctgggat 54660
ctggccggat tgttcgatcc agatcccgat cgtttcggga cttcgtatgt gcgtgagggc 54720
gggttcctgc gggacgcggc ggagttcgat gccgcgtttt tcgggatttc tccgcgtgag 54780
gcactggcga tggacccgca gcaacggttg ctgctggagc tgtcctggga ggccgttgaa 54840
cgcgctggga tcgatccggg gtcgctgcgc gggagccgga cgggtgtgtt cgcggggctg 54900
atgtatcacg actacgccgg acggttcgcg gccggagtgc cggagggctt cgaaggctat 54960
ctcggtaatg gcagcgcggg cagtgtggcc tcgggccggg tcgcgtattc gttcggtttc 55020
gagggtcctg cggtgacggt ggacacggcg tgttcgtcat cgctggtggc gttgcacctg 55080
gcaggtcaat cactgcgttc cggtgagtgt gatctcgccc ttgccggtgg cgtgacggtg 55140
atggccaccc cggcgacgtt tgtggagttc tcccgtcagc ggggtctggc accggatggg 55200
cgctgcaagt cgttcgcgga ggccgcggac gggaccggct ggggcgaggg tgctggccta 55260
gtgctgttgg agaggttgtc ggatgcccgt cgtaatgggc atcgggtgtt ggcggtggtt 55320
cgtgggtcgg cggtgaatca ggacggcgcg tcaaacggac tgaccgcgcc gaatggtccc 55380
tcgcagcaaa gggtgatcac ccaagcactc acgagtgcgg ggttgtccgt gtccgatgtg 55440
gatgctgtgg aggcgcacgg gaccgggacc aggcttggtg atccgatcga ggcacaggca 55500
ttgatcgcca cctatggccg tgatcgtgat cctgaccggc cgttgtggtt ggggtcgatg 55560
aagtccaaca tcggtcacac acaggcagcg gcgggtgttg ccggtgtgat caagatggtg 55620
atggcgatgc gccacgggga gctgccgcgc acattgcacg tcggcgagcc cacgtcggag 55680
gtggattggt cggcaggttc ggtccagctc ctcacggaga acacgccctg gcccgacagc 55740
ggccatcctc gtcgggcggg agtgtcgtcg ttcgggatca gcggcaccaa cgcacacgtc 55800
atcctcgaac agtctccgac agcgtcaagt gagttcgtgg agcacagcgg acctgattcg 55860
gaatctgctg tgaatgtccc tgtggttccg tgggtggtgt cgggcaaaac acccgaagcg 55920
ctcagtgctc aggcggacac cttggtgtcc tatctggacg atcgatctga tgtctcctcg 55980
cgggatgttg ggtattcgct ggcgatgacg cgttcggcgc tggatgagcg ggcggtggtg 56040
ctggggtcgg accgtgaaac gttgttgtcc gggttgaaag cactggctgc cggtcatgag 56100
gccactgggg tggttacggg atctgtgggt tctggcggcc ggcccggttt tgtgttcgcc 56160
ggtcagggtg gtcagtggtt ggggatgggc cgggggcttt accgggcgtt tccggtgttc 56220
gctgatgcct ttgacgaagc ttgtgccgga ctggatgcgc atctggggca ggaagtgggg 56280
gttcgggatg tggtgttcgg ttccgacgcg cagttgctcg atcggacgtt gtgggcgcag 56340
tcgggtttgt tcgcgttgca ggttggtttg ctgaagttgt tgggttcgtg gggtgttcgg 56400
cctgttgtag tgctgggcca ttcggtcggg gagctagcag cggcgttcgc cgccggtgtg 56460
ctgtcgatgg cggaggcggc tcggttggtg gccggtcgtg cccggttgat gcaggcgttg 56520
ccgtctggcg gtgccatgtt ggcggtggcc gcgaccgagg accgaatcag cccgctgctg 56580
gatggggtgc gggatcgtgt tggtgtcgca gcggttaatg ctccggggtc ggcggtgctt 56640
tccggtcacc gggatgtgct tgaggacgtt gttggccggt tggatgggct gggtgttcgg 56700
tggcgatggt tgcgggtttc gcatgcgttc cattcgtatc ggatggatcc gatgctggat 56760
gagttcgccg acatcgcacg gagcgtggat taccggtctc cagggctgcc gattgtctcg 56820
acgctgaccg gaaacctcga tgacgtgggc gtgatggcta cgccggagta ttgggtgcgt 56880
caggtgcgag agcccgttcg cttcgccgac ggtgtccagg cgcttgtgaa ccagggcgtc 56940
gacacgattg tggaactcgg tccggacggg gtgttgtcga gcttggttca tgagtgtgtg 57000
tcggagtccg ggcgggtgac ggggattccg ttggtgcgga aggaccgtga tgaggtccca 57060
acggtgctgg ccgctttggc gcagatccac actcgtggtg gcgcggtgga ctgggggtcg 57120
tttttcgctg gtacgggggc aaagcaggtc gaactgccca cgtatgcctt tcagcgacga 57180
cggtactggc tggagccatc ggattccggc gatgtgacag gtgctggcct taccggggcg 57240
gagcatccgc tgttgggggc cgtggtgccg gtcgcgggtg cggatgaggt gctgctgacc 57300
ggcaggctgt cggtggggac gcatccgtgg ctggccgacc atcgcgtgct gggcgaagtc 57360
gtcgtccccg gcaccgcgtt gctggagatg gcgtggcggg ccggtagcca ggtcggttgt 57420
gaacgtgtgg aggagctcac cttggaagca ccgctggttc tgccggagcg gggtgctgcg 57480
gcggtgcagt tggcggtggg ggctccggac gaggccggcc ggcgcagttt gcagctctat 57540
tcccgaggcg ctgacgaaga cggcgactgg cggcggattg cctccgggct gttggcccag 57600
gccagtgtgg tgccgccagc ggattcgact gcatggccgc cggacggtgc tgtgcaggtc 57660
gatctggcgg agttctacga gcgcctcgcc gagcgcggct tgacttatgg cccggtgttc 57720
caagggctcc gcgccgcatg gcggtacggc gacgatatct tcgccgagct tgccgtgtca 57780
ccagacgccg ctggtttcgg catccacccg gcgctgctgg acgctgcact gcacgcgatg 57840
gcgcttggtg cttcgcccga ctcggaagct cgtctaccgt tttcctggag tggcgcccag 57900
ttgtaccgcg ctggaggagc agcgcttcgg gtacggctct cgccgctggg caccggtgca 57960
gtctcattga cgctgatgga tgccgcaggg ggacaagtcg ctgcggtgga atcgctttcg 58020
acgcgaccgg tctccgccga ccagatcggt gccggtcgcg gcgatcacga gcggctgctg 58080
cacgtcgagt gggtaaggcc ggctgaatcg gcggggatgt ccctgacctc ctgcgcggtg 58140
gtcggtttgg acgaaccgga gtggcacgct gccctgaagg ccactggtgt ccaggtcgag 58200
tcccatgcgg atctggcttc gttggccacc gaggttgcca agcggggatc ggctcctggt 58260
gcggtgatcg tcccgtgccc gcgaccccag gcgatggagg agctgccgac cgccgcgcga 58320
agggcgacgc aacaggcgat ggcgttgctg caggaatggc ttgccgatga ccggttcgtc 58380
agtacgcgcc tgatcctgct gacgcatcgg gcggtcgccg cagttgctgg agaagacgtg 58440
ttcgacctgg tacacgcgcc gctgtggggc ttggtccgca gcgcgcaggc ggagcacccg 58500
gaccgattcg ccttgatcga tgtggacgag gcggaagcat cgcgggcagc actcgccgaa 58560
gcgctgactg caggagaagc gcagctcgcg gtgcggtcgg gagttgtgct ggtgccccgc 58620
ctcggccagg tgaaggcgag cggaggtgaa gcgttcaggt gggatgaagg caccgtgttg 58680
gttaccggcg gaaccggcgg gctaggggcc ctgctcgcac gccatctggt cagcgcacac 58740
ggtgtgcggc acctgttgct cgcaagtcgt cgcggtctgg cggcgccagg agcggatgag 58800
ctggtggccg agctggagca gtccggcgcc gatgtcgcgg tcgtcgcgtg cgacgcggca 58860
gatcgggact cgcttgcgcg gctggtggcg tcggtgcccg cggaaaaccc gttgagggcg 58920
gtggtgcacg cggccggtgt gctggatgac ggtgtgctga tgtcgatgtc gccggagcgc 58980
ttggacgcgg tgttgcggtc caaagtggat gccgcgtggt acctgcacga gctgactcgg 59040
gaactcggtc tgtcggcgtt cgtgttgttc tcctcggtcg cgggcctgct cggtggtgcg 59100
gggcagagta attacgctgc cggcaacgcg ttcctggatg ccttggcgca ttgccggcag 59160
gctcaggggc tgcccgcgct gtcgctggcc tccgggctgt gggcgagtat cgatggaatg 59220
gcgggtgacc tcgctgcggc ggacgtggag cggctgtcgc gggcaggcat tgccccgctt 59280
tcggcaccgg gagggctggc cctgttcgac gctgccattc gctcggacga accgttgctg 59340
gcgccggtgc gattggatgt cgaagcactg cgtgtgcagg cccgatccgc ggagacccgg 59400
attccggaaa tgctgcatgg catggcaatg gggccaagcc gccgcacttc gttcagctcc 59460
agggttgagc cgttgcaaga acggttggcc ggtttgtcag aggacgaacg tcggcagcaa 59520
gtgctccagc gcgtccgcgc cgatatcgcg gtggtactgg ggcacggcaa gtcgaacgac 59580
gtggacaccg agaagccctt ggccgagctg ggtttcgact cgctgacggc catcgaactc 59640
cgcaaccgcc tcgctaccgc caccggactg cggctacccg caacgctggc cttcgaccac 59700
ggcaccgcgg cagcactcgc ctggcacgtg tgcgcgcagc tgggtaccgc gaccgtgccg 59760
gcaccgaggc gaactgacga caacgactcc gcggagcccg tgaggtcgct cttccaacag 59820
gcgtatgcgg ctggtcggat acttgacggg atggatttgg tgaaggtcgc tgcccagttg 59880
cgaccggtgt tcggttcgcc tggcgagctg gaatccctgc cgaaacctgt ccagctttcc 59940
cgtggcccca aagagcctgc cttggtgtgc atgccggcgc tgatcgggat gccgcccgcg 60000
cagcagtacg cgcggatcgc cgccggcttc cgcgatgtgc gggacgtttc ggtggtcccg 60060
atgcctggat tcgttgcggg agaaccgctg ccgtccgcca tcgaagtggc ggttcggacg 60120
caggcggagg cggtgctgca ggagttcgcc ggtgactcgt tcgtgctggt cgggcattcc 60180
tctgggggct ggctggcgca tgaggtagcc ggtgtgctgg agcgtcgcgg ggtcctcccg 60240
gccggggtcg tactgctgga cacctacatc ccgggtgaga tcacgccgag gttctccgcg 60300
gcgatggccc accggacgta tgagaagctc gcgaccttca cggacatgca ggacatcgct 60360
atcaccgcga tgggcgggta cttccggatg ttcaccgagt ggaccccaac accgatcggt 60420
actccgacgc tgttcgtgcg gaccgaagac tgcgtcgcag accctgaagg gcggccgtgg 60480
accgatgact cctggcggcc ggggtggact ctcgcggatg ccacggtcca ggtgccgggc 60540
gaccacttct cgatgatgga cgagcactcc gggtccaccg cacaggcagt cgcgagttgg 60600
cttgagaaac tcagccagcg caccgctcgg caacgttgac gtacaccgtt cagggtgtcg 60660
gttccgtgtc catgttggct tgcgggagca ggagcaattc tgaagcgagg gatgtagcgt 60720
aggagcagca tggaacgggc tgtgaacgga agcgttttgc cctgcttttt tcggtttcgc 60780
aggtcactgt ctcttgtgga cttgactggg ggtcaagggg tcgcaggttc aaatcctgtc 60840
agcccgacgt tggcggaagc ccgctgaccg gcacgtaagt gcaggtcagc gggcttttta 60900
tgctgttgtg gatcttgggt gatcgtcccg aagtgtgccg tctcgaattt cggcgatttg 60960
tgcgaggatt tcgggagttg ttttctggaa gcggaatcgc gaggcggaaa cgctgctgat 61020
cgtgctgacg ggtggccgat ccgcatccgg aacaaggaga tcgactggtc ggggacaacg 61080
gcgctgccgg gaggcctcga agactggctc accgaggcgc cgaactatgt ctacctgcgg 61140
cgggccaagg cgcggccgac cgtcagggac aaggcattcc ggagcaaggt agcggatctc 61200
gtctccggtg tgaccgggaa ggatcccgag cggattgacg gccagtaccg gcgccagcgg 61260
cgccagcggg gggcctggcc gcggccgcgg tgcttcggcg ttgctggtgc tcgtcgtcat 61320
ggcgactgcg ctcattgtcc ggcaacgcga gcagactgac cagcaacggc gcgtcgccgt 61380
cggccgggag ctgatcaccg cagccgagga cctgcgggac aatgatccga ggctctcgtt 61440
gctgtccagc ctcgggagac ctgagctcgc cccaccccgg aagcccgtgc cgggctcgtg 61500
aataccctga tgcggacccg tttcgcaggt accccggcga gtttcaacgc gtacgaactc 61560
aacctcgtca cggcggccac cggcaaccgg gccgccacca gggcgcggag ccgctcggtc 61620
gagaccagcc cagatttcat ccacgaggtg gtcctgtggg gacaccggcg gtggtggggc 61680
gtggcgacgg ctcggggcgc tgcccgagtt cgtcggtgac ggcatctcgc tggcgctgag 61740
cgcggacggg aacaccctgg ccatcgggga caccgacgaa ggggtggtgc tatgggacat 61800
caccgacccg ggaaatcccc gcaggctggc cagcgcaccg ggggaggtac gtgtcctcgc 61860
tcgtgttcgg ccgaaacggg gaggcgcttg ccgtctcggg catcgacggg gtggcgctgt 61920
gggacatccg cggtgtacgg gaacgggaaa ccttggagcg ccgcgccacc ttgcccgggt 61980
tggcgcaggc gggcggggtg ctcctcagcc cagatggcct gcagctcgcc accacccggg 62040
acacgcgccg cgagaccgct ccggacgtta acgaccacca gacgaccctg tgggatgtct 62100
ccgatctcac gcggcctacc cggccagcgc cgatcagggg gcacttctcc gcgaccctgt 62160
tcgacgcggc gttcagcccg gacgggcaca cgatcgccac cgccggcgta cacggcgaca 62220
tcgtgctggt cgacattacc gacccggcca atcccaagga gctaaccgtc ctctagggac 62280
actccggcaa gtggatgacc tcggtggcgt tcagcccgga cgggggaaag ttggtcacca 62340
gcggcgaaga cgacaccgct gtcctgtggt acctcggcga ccggcagcac ccacagcagc 62400
tggcgacgct ggacggccac gccggcaggg tgtccgcagc ggcgttcagc tcgaacgggg 62460
cggcggtctt caccgcggat tcctccgcaa ccggcggtcc cgcggcgcgg tgcgccaatg 62520
gcgggcggcc gaccgtgcgc ggccggtggc gaccggcctc ctgaacggcc acgaactgag 62580
actgaccgcg gtgtcgacgg ttgacgactc cgcgctcgcc atcgcgagta agccccggtc 62640
tcctatccct gacccacggt gcaaagtagc caccacgtct tctgcctcat ccccgttgcc 62700
cgggctcgga ccggtcgtta ggtcgaattt cggcccgacc cggctcgggt tcgtgctgat 62760
gctgaagttc ttcgagttgg agggccggtt tcctcagttc gtggaggagt tcccgcaggc 62820
tgcggtcgac tacgtggccg gcgtggtcaa ggtgcccgcg gaggacttgg cgaaatacgg 62880
yctgtcgtcc cgctcggcga aggggcaccg tacacagatc cgcgagaccc tcgggtacyg 62940
gcccgcgacc cgcgccgacg aggaacggct gaccgcctgg ctcgccgatg aggtctgccc 63000
ggtcgagatg gtggaggacc ggctgcgcga ggccctgctt gtgcagtgtc gcagcgacca 63060
tgtcgagccc ccgggccgcg tcgagcggat cgtggccgca gcacgggcgc gggcggaccg 63120
cgtcttctgc gcgcagaccg tcgcgcgcct gggcgaggcg tgcgctggcc gcctgctgac 63180
cctggtggcg gagggcaacg aggagggtac ggcgctgctg gcctcgctga agcgggaccc 63240
gggcgcggtg gggctggact cgctgctggc ggagatcacg aagctgactg ccgtgcggcg 63300
gttgggtctg ccggaagggc tgttcgcgga ctgctcggag aagctggtgg ccgcgtgggc 63360
gggcgcgggc gatcaagatg tatccctcgg acttccggga cgctggcaag gatgtgcgga 63420
ccatgctgct ggcggcgctg tgcgcgtccc ggcaggcgga gatcaccgat gccctggtgg 63480
agctgctggt cgctctggtt cacataagat caatgctcgt gccgagcggc gggtggagcg 63540
gcagctgacg gcggagctga agaaggtacg gggcaaggag ggcatcctct tccagctcgc 63600
tgatgcgtcg gtcgggcagc ctgaggggac cgtgcgcagg gtgctgtttc cggtggtcgg 63660
ggagaagacg ctgcgcgacc tggtcgcgga ggcgaacgag aaggcgttca aggccagggt 63720
ccgtaccacg ctccggtcgt cgtacagctc gtactacccg gcagatgctg ccgtcactgc 63780
tgcggacgct cggcttcagg tgcaacaaca ccgcctaccg gccggtgatg gacgcgctcg 63840
tgctgctgga gaagtacgcc gacgtcgacg gcaagacccg cttctacgac gtcggcgacg 63900
tggtgccgat ggacggccta gtccgcaagg actggcgtga ggcggtcgtc gatgacaagg 63960
gcaggaccga gcgcatcccc tatgagctgt gcgtgctggt ggccctgcgg gatgcgatcc 64020
gccgccgcga gatctatgtc gggggcggga cgcggtggcg caacccggag gacgacctgc 64080
ccggcgactt cgagtcggcc ggcaccgtgc actacgccgc gattcgccag cccgaggacc 64140
cggggggagt tcgtcgccgg cctgaagcgg cggatgacgc agggcctgga ccggctgtct 64200
gcggcgctcg cggacggctc ggagggtggg gtgaaggtca ccacccgcaa gggcgagccc 64260
tggatcaggg tgccgaagct ggagccgctg gacgagccca cctgcctggc ggccctcaag 64320
gacgaggttg tacggcggtg gggcgtgctc gacctcctgg atgtgctgaa gaacgccgac 64380
ttcctcaccg gcttcaccga tgagttctcc tcggtcgccg cctatgagcg catcgaccgt 64440
gccaccctcc agcggcgtct gctgctcgcg ctgttcgcct gggcaccaac atgggcatcc 64500
gcgcgatcgt ggcgaccggc gagcacggcg agagcgaggc cgcgctgcgg cacgtgcgtc 64560
ggcacttcat caccgtcgac aacctgcgcg ccgcggtgac gaagctggtg aacgccacct 64620
tcgccgctcg ggacgcggca tggtgggggc agggcaccgc gtgcgcgtcg gattcgaaga 64680
agttcgggtc ctggtcctcg aacttcatga ccgagtacca cgcccgctac ggcgcaacgg 64740
cgtgatgttt tactggcacg tcgagaagaa gaacgtctgc atctattccc agctcaagag 64800
ctgttcctcg tccgcaggca tcccgcaggc agggttctcg acgagatcct caaccgcgcc 64860
accgcctggg ccacccgcgc cgcgtgcggc gaggataccg gcgttccgct ggtcgtgctc 64920
cacaaccagg tcaaggacag gactgaccga cgataacgat actgtggaca cgagcgcggc 64980
tcctgacacg atccttcggc gtaaccacct gaaagatcaa cagatagctg cgctcacccc 65040
ttcaccgggg ccaccaagac taaaccggac aaaccggagc ttccgcccaa aagatcaaac 65100
cgccggggtc ctgcggcatg accaagttcc tttctgagac aaaggacagt gtccattcgt 65160
gcagagggct tgtttgatct ctgggcggaa ccaagtatag gaattagcta gacgacgaac 65220
cgattttact gcgggaatgg gatcccgcgt tctgcgcgcg gctcgcgggc gcatattcgg 65280
aaacgctgca gcggagaatc gagagcgtcc agatttgcgt acaagattcg ccgtaattgt 65340
ctaagagtgc tggaagcaac agtttccaaa caaaaggtgc acacaacgtg aatacaccca 65400
tgacggctgg aagcgtgaac gtggatcccc agcttatgat gcggcatgaa cctgcgaccg 65460
aggagagcgc ccggatcttc cagcggcgtc gacacgccct tgttggcaca gctcttttca 65520
acagctactt caaacaagca cgactggacg agctcctgcg gtggacggtg accgagttcg 65580
agtcgctgca tgtgttcctc cctgacacga ccaccgcttt cacgatgcag gcccgcggct 65640
atccggcggt ggaagcagag cggaagacgc gacgcgaggc acgaaggctg cgcgggaaga 65700
tactcgggtc gcttgccatg ctcggcgtcg cgtcccccga gcagatgatc atcgacttcg 65760
cctggctcaa cgagaacgag gagtacctgc gcacgaggga cgaggttgcg gaggtcttcg 65820
ggtcggatga agacttcagg aaggcctgtc tggccgagtc tgaatcggtc gcgaggagca 65880
ggcggagaac cgacggcagt ctcacggagg aagagctgaa catggcggca cggtacttgc 65940
tggccgagat cccgcttttc gtaaacgctc ccgccatact tggcatgccc gagactgtgt 66000
tctgctacca ccgcatcgaa ccgttcgaac gtgacctgta ccgcggcgcg ttcgcggtga 66060
aagcagtacc tcggcaaggc tgggtcgtgg tcgagtccgc ttcttcggaa atggtggacg 66120
ccggtgccgc ggccgggtcc ctgccgcgac cacgtaccgc aggtgtgctg tgaaccgcgc 66180
cgatacggcc gattgcgaac cccggcggtc cggcgagcaa gtacgccgga agtccagcct 66240
gtggtaggcg cggcttcggc tggtctgcac ggcgaccaaa ccggctggtc ctcctggaat 66300
ctcgatcgtc ccgccgaccg gcagcgggac gatcgagggg agcagcaaga cccgtagtgc 66360
ttcaagaagg gttcgctggc tgtgagctgg ggtttcgtcg ggccgtcagg ctgcttggcg 66420
gtactcgttg atgagtcctg ccacggcttg tcggcgttcg atccgagctg tcggcagcgg 66480
tatgacgcgc gcgccacgta gggtgcctgc tggtcgtgtg cttggtgggg ccgatgggtg 66540
ttgaagtggc aggcgtattc ctggcggatc ttctccgcgt ggcctctgtc gaagatcagc 66600
actcggttgg tgcactcctc gcgaacggag cgtatgaatc gttgcgcgtg cgggttgcgg 66660
ttggggctgc gcggcgggat cttcgtgacg gtgatgcctt cgctggcgaa gattgtgtcg 66720
aaggcggcgg tgaacttcgc gtcgcggtca cggatgaggt gcgtaaatgt ggcggcatgt 66780
caccgagctg ccacagcagc tgccgggcgt gttgagtggc ccaggcggcg gtggggtggg 66840
cggtgaccct gcggatattc caggccgccg agtgagtgac gacgcgcttg atcgccgttc 66900
ttgtccagca cctgactccg tcctgagcct cactcggcga atgctcgcga tccattgagc 66960
gcgagggttt ggcggagtcg gtcggggttc atgagcccgg cggtgggtcg cccttcgcgt 67020
ttcgtatcca ggagagtggc ttccacgtcg tcgagggccc gccacgaagc tacggcggct 67080
gctcgaacac gccgacctca ccgatgcgca gcaacgcgcc catcctcggt gctggcgagg 67140
cgtggccatc gtgccgctca gagtgttaag ccgccgacat cgctgtcgtt tcggatgact 67200
ttgatgaacc cagtcagggc cccgttggca tcgcgctgcg ctgtgaccac cacgtgcgcc 67260
cagtaccggg tgccgttctt acgtacccgc cagccctcat cgacagaaaa gcccgcctcg 67320
gctgcctgat ccagctcctt ctgcggatac ccggctgcga cacgatccgg cgggtagaag 67380
accgaaacgt gccgaccgat gatctctctg gccgtgtagc ccttcatgcg ttcggcgtcg 67440
gtgttccagc tcctgacaac gccgtccgca tccagggcga agatcgcata ccggcggttt 67500
gctgacgtgg cagtcccctc aatgtcaaga cccactgcac aggcccgccc gtcgccggat 67560
acgaaggtcg cgataaaccc gaaagtcccg gcggcgtgcg ggacgctcca tgttgagact 67620
gcacccggga acgcgaccgc ttcaccgggc tttcgtcctg atgtcgcttg tccgtaagcc 67680
gtcccccacg tcaatcccag cgcctcgcct ttggccgatg ctcaagccga cccgaccacg 67740
caccgaacac gattacctcg ctctgactcg cccacgcggt gacataatgg acatcaggtg 67800
attggcggta acgccatgca aagtcgagac acctccgacc agaagcagcc ttggggtagc 67860
cccgggacgc gcgatagctt caccaatctt gacgcggcac ttcagggcca tccagcgatc 67920
gagcaggcca agggcatgtt gaagctggcc tacgggatcg atgaccacga cgcactcgaa 67980
ctgctcgcaa gcatgtcgca cgacacccgt acggacatgc gtgaggtggc ccagacgctg 68040
gtggagcggg tcacgggcgt agcgaacgcg ccgaccgtga agaccgtgag cgcccgtgtc 68100
cttgaggagt gggagcgact caaagagcgc taacccagcc ttgcaaaggc gatcaagggg 68160
catcgtctac ctcggccgct gatcgaacta cgttggtcga cgtgccgcgt ggtttcggtc 68220
aattgccgtt gttcttcgtt ggtgttggtc ctcgtgcgct ccggatgtgc tccatgcgcg 68280
tccggggcgt ctgcacagct tcgacctggc cgaaatccat acgatctggt ttcg5acctg 68340
gcctcccatt caggctccag cgcgttacgg acccgcgcct ggtggcatcg atccatcatc 68400
gcgagcacag tcgacgatcg tcactggctt cgccaggcgc agctgctcga acaccttctt 68460
gacgccgcat tccttcgcca caacatctcg caccgtgcca agcagtgccg actgcacggc 68520
gggaagaatg gcgacactcg cgttcagtgc tggacagtcc ccggccccaa cggcgccgag 68580
atcctcgacg ccctggggcc ggccgcgacg gtcctccttg atcaccgcac tcgcctgatc 68640
accgactacc cgtgaaccat catcaccgtc agaccgttgt gggagacggt cgaggatttc 68700
gcggaggagc tggcggaggc ggcttgagcc caagtccccc tgctcaccct cttcggcgtc 68760
acccgccggg agcgatcatg accacctggt ggtctccggg ccgtccggtg agggcgtcca 68820
tcatcgctga gaccggcccg gcgtccaggg ggacgacgtc ggtgatgatc cgctcgaccg 68880
ggatacggcc gcctccgaga agctccaagg ccgcctcgaa ggcgcctctg ttctgggcga 68940
agctgcccac gatccggatc tccttggtga tgagggcgac cgagtccagt tctgagggtc 69000
gctcgttcac tccgagtgcg acgatcgttc cccgtgtccg cacgcgtcgg gccgcgtccg 69060
tgagcgcggt caccgagccc acgcactcga acaccacgtc tgcgtccggg ccccgctcat 69120
cgacgtccac ggtggtgatc ccggtcgcgg tcagacgtgc gcgccgtacg gggtgcggtt 69180
ccaccacccg cacgtcctcg actccccgta gccgcagcac gtgtgccagc aggaagccga 69240
ccgtgccgcc tccgaggacc gtgacagtat ccgctgccga aattccggag cggtcgaccg 69300
cgtggaccgc gcaggacagc ggttccgtca gcgccgcgtg ctccagcgaa acaccgtccg 69360
gcaccgcgta caccgtgtgc tcggggacga cgacggcttc ggcgtagccg ccggggcgtg 69420
tgcccaggcc gagggaactc agccgccagg gctgtacagc gcacaggtgg ttgtcgccta 69480
cccggcagtc gtcgcagtcc ccgcagcccg ctttcggcca caccaccacg gcctgcccag 69540
ccgtcagccg ctccccgccc ggtgcggcga cgtgcccgga gatttcgtgt ccgagaaccg 69600
cgtcggccgg gacgaggtgc ggcattgccc gcaggtgcag gtcggaaccg cagatggcgc 69660
agaacgctac gtccacgcgt acccagccgg gctccggctc gcgttcctcg gcctccgcga 69720
gccgcagccc gcgctcctcg gtgatgacca gtgccctcaa ttttcctcca cagttttctc 69780
agcggccagg aacgcccgga tccgggcgac caggatgtcc ggctgctcct cggccgggaa 69840
atgcccgcag gaggcgatct gcgccccctc aagccgggcg gcgtagtccc gccacaggtc 69900
gagtacgggc agtgtcccca ccagcccttc ggcgccccac gccacgtgca cgggcattgt 69960
caggcggcgg cccgcagccg cgtcggcctc gtcagcttcc agatcctcat ggcgagcggc 70020
gcggtagtcg tcgaagctcg cgcgaagcgc cccgggctgt gcgtacgctt ccgcgtagtg 70080
ccgcaggtcc gcctcagtga acgcctcgcg ccggaatgct ccggcggcca gcatgaaccg 70140
gacgtactcc ccggcttttc cttcggtgag gaactcgggg aggtcgggga cgaggtggaa 70200
gagccagtgc cagtacgcgg ccccgacccg tgcgtccatg tgccgccaca tttcccgcgt 70260
cggtacgacg ctgagcagta tcaggcggct gatctcgtgt ggccggtcca gtccccaccg 70320
gtgggccacg cgtgcgcccc ggtcgtgccc gaccacaaca gcctgcgtgt atcccagctc 70380
gtgcatgagt ccgctcatgt ccgcggccat ggtccgcttg tcgtagccgc cccggggccg 70440
gctggacgcc ccgtatccgc gcagatccgg cgcgaccacc gtatggtccc gcgcgaggac 70500
gggcagaacg cggcgccagg cgtagctggt ctgcggccat ccgtggagga acaggatcag 70560
cggaccctct ccagcgcgcc gtacatggaa ccgcagaccg cccaccgtcc gcgtttcttc 70620
ccgcgcatgc atcgccacgg cggtttacct cccctgcgct gggaggggaa ttcctacggc 70680
cccctgcgct gggaggggaa ttcctacggc ccgctgcgct gggaggggaa tttctacggc 70740
ccgcgctggg gacgtgaatt ccgtcaggta gccgtcgtcg accatggggg cggtgacgag 70800
ttcgggcagg ggcgcgagca actcggccag gaggccgatc ttttccctgt cggccgacac 70860
aggccggtgc ggttcctgca gccggtgtgc ggcgaaatcg gctaccaggt cgtccatgat 70920
ctgctgcggc acctgcgggt cggcgtcgtg ctcgctcagg atgaggtcga gcatgcgggc 70980
gtgaacgagc gtgacggcct cggcatgggt gaacgatccg cccggacggc tgcgccacgc 71040
ctcggtcagt tccttctcga cccgctcggg cagtgtccca ggcgggcgtg aggcgaaccg 71100
gtctgaacgc agagcagtgc gcgccacttg gtaggccatc acggagttgt cgcccgcgaa 71160
ggtcacattg acctcgaagt cggtccgcag ggtgacgatc tggttgtggc agtggaaccc 71220
ctgcgacccg cacatctcgc ggcacgcggc caggacgtcc aggaccagcc aggagccgca 71280
ttttccggtc gcggtgagca ggtgcatgtt tttacggccg gggtcggtgt gccagtcgcg 71340
ttcgacgccc ctgacgacgg cgcgttccag caggcgcagc gcgaggcagc gcagttgaac 71400
cggatacagg cggtccagga acagcggctc gtccaggagg accctgcggg taggcgtggg 71460
gcgggtttcg cggtgaccgg cgaaccgcca ggtgaggtga gctgcgagtt cggaggctcg 71520
tgctccggcg ctgagcggga agatacgttc ctggacgaag gtctcgatgg atttcacgaa 71580
gcgggcgccg gcatcgggaa gcgtgctgga aaaacgtccg cccgcgtcga tgcgcgagta 71640
ccgccccatc agcgcctcgc gggggagccg gaccccggtg aagcgggcac cgccgacctg 71700
gttggcctgg atacctccct tcgggtcaca cgacaggatc gtgatacccg gcaggggagg 71760
accgttctcc tcgccgcgca gcgggacacg gaaccagtgg tggccgacgt cttcgccgtc 71820
gatcacaagg cgggccagga ccatgccgac ggtcgcggcg tgtttgatgt ttccgatcca 71880
gaacttgcag gcggcgtccg tgggggtgtc gagcgtgaaa ctctgctcct cacgattcca 71940
ggagaccgtc gtgcggatct cccgcaggtt ggtaccgccg ccgatctccg tgcagcagaa 72000
cgcgtagacc tggtgcatcc gcgtgatttc gtcgtggtac cgggcgacct gctcaggggt 72060
gccgtggttg aacagcgcgc tccccgcgat gaggtgatcg gtgatggtcg aggtgagcgc 72120
gaaatcgaag gcgcccgtca ggcccatcat ctcgcacatg gaccggaacg cgtgctcccg 72180
ggactggccc atccacatgt cgttgtcgac gagcccctcc cggaagatcg ctttcatccg 72240
gtcgatggtc aggtccatgt actcccgcgg ggacagatcg tcacgcagtc gcggatcgaa 72300
caactcctgc cgcagcaggt cacgcatcgt ccgccggtag gcggaggcgt cctgttccac 72360
gtccagggca gtcaaccgct ccgcctccgt ggcatgggct tgggtaacga gatgttgccc 72420
ataagggact gcagttgcgg ccatgtgatc gaaatcctct cgattcgtgg gcatcgcgca 72480
ttttcaaagg ggaccgtgcc agctgggcag ggcccatcca gcggccgttc acccacggtg 72540
cgggaaaggc atacaggccg gaagggcaga tattgtcgac tacgagtatc ggggtgccct 72600
ccgggagaat tccctgcacc gcccgctcgg cctgtttgta cggaaggaac ttctcgtgcg 72660
gcgaggccac gcacgggtcc gcctccggtc gcagttgccg gatctcgctc cgcgcttcag 72720
cgcggccagc gggcagacgg tccaccatgc cgtcgaggac catgtcggcg gcgagggccg 72780
cgtccagttc gtgattggct tcggtagatg cagcgaggaa gagcgcggac cgggaatgcg 72840
gtgccggctc cgtcgtcgta ctgaggcatt gcatgagaca acgacgcagc cagaacacct 72900
gaagatcgcc gcgttcaatt gaggtttgcc cattaatcgg gccggtcgcc gaagtggcgg 72960
gcacttcccc gcggtaggcg gcgcagaccg catcgatctc ttcggcgagt tccgcggcga 73020
tacactgtcc tcgcgaatag tgcgcgaagg gatccgtaaa ataggcccct gcgccgggaa 73080
ccatcagggc ggtgttcatg atcgcctttc atgcgccgca tccaaccaga tatagcgttt 73140
gtgtctaagg tttctcgccg tccatgctgc gctggttcgc gatctcatgt cgatatgaag 73200
gcaagcaggt atcgcagtgg gtgtcgcagg tcattgcgcc tggtgtccgt gatggcattt 73260
gccggtattg ataatgactt ctcgagtatc cctcaagctc atacgatcgg cggatggttt 73320
gcccgtctac ccatctcgtg ttgcccgtct gttccctggt tcgatgagta ccggcgctgc 73380
tgctcgaccc ggccaagcgg gacgccggtc gtggactgtg aactcagcga gatcgaagat 73440
gtgctgcgcg aacaccaccg ggttgatgat gctctggtcc tgcagggtgc gaagatcgaa 73500
gcgcacctcg tggtgaccga accagtgcag ccagccaact ctgtcgacct ggtgcaccct 73560
cctcctcggc acgtgcgaac cgtatgttta cgaggcggat ccgcagttga cgcgcgatga 73620
gagcgatgct ttggtgaatc acctattcgc ggtttccgag agtccgcagc gcgtttgaag 73680
gtaccggtgg gttccgggcg cggtcgaggt gtgggacaat cgggcgactc aacgcaataa 73740
cacactggcc gcaccacgca gcagttgtct accgggctgg ccattgctgt tgccacggtg 73800
gcactgcggc tcggtgaacc gttgggtgag ctggtgttcg gtgcgcgtag cggggctacc 73860
tacaccgtag ccttcatcct gcttgggctg actgcgctga tcgccactgc cggggcgttg 73920
agcctgcacc cgaatgccgg caacgccgtg cgcacgctgg gcacgcgggc gagcgggcga 73980
gcgcgacctc caccacggaa ccgggctgag actgccagca gttggacgtt gcaggccagc 74040
atgcaacgtc atgacaccaa cgacataagc aggtcacagc tgcagaggcg atcctctcgt 74100
tagcaacgag agattcgggc aacccccgag ggcactttcg ggtagaccga gagattcgat 74160
cttcttatcc ttgactcccg ccgcatagct ttccactgcg accgccccga cggtgcggga 74220
aatcatgtcg atcaacatca acggcggaat ccacagtgga cgttgtttct gccgcgtctc 74280
ggatgccacg ccgcgggtgg gaacgtgcag gtacgtgccg gagaagagga gaacgaccgt 74340
ggctgaggta gtggaatcac cctcacccaa ctcgttggaa gacgtacgcc cgctaccgca 74400
tcctcgatcc acgcgccagg gtcaagtcga acaactcccc gagttgactc cggcgttgtt 74460
gctggagtta actacggggc tctggagttt caaggcttcg ccgccgctgt cgagctggag 74520
ctgttcacca agctctccga cctcggctcg gccacagtcg aaacagcgtc cgaggccctc 74580
ggaccgcccg atcggcccac cgatctgcta ctggccgcgt gtgcgtctcc caactgaggc 74640
cctgcgggct ggtcgtacat aagaaggggc tcccgcagcc cagggagtga ctgtaactcc 74700
cgcagggggg ccgccacccg caccgccacg caaaacgagt tccgctaaca ggatctaagc 74760
gttgtgtcca gccgagagcg aaggaagaga catgccggca atccgattgg cagcgctggg 74820
tgacagcttc gtcgagggcc gtggtgatcc tttaaaacct cagcaatccc aggaggccga 74880
cctgaaccgc gaagatcccc ggctgcgccc acaacgtccg gtcgagcacc tgctcatcag 74940
caccgaacag cacgccccgg acgtccacgc ccaagtccgc gcccaagtgc gcgcccaagt 75000
gcgcgcccag cgcgtcgcag gcttcctcga aggcgtctgc gaacgccggg aaagccgcgc 75060
acaacgcctt tcccatcccc agccactgac tgccctgacc ggaaaaaacg aatccgatgc 75120
ggccaccgga attcgcagcg ccggtgacca cccccggcgc agtccggcca tcggccagcg 75180
ctgccagcct cgacaacagg gtctgatgaa ctacttgcag ttgacctcga atctct 75236
<210>2
<211>36538
<212>DNA
<213>Saccharopolyspora sp.NRRL30141
<400>2
tcggtactgc ccctccgatt cctggacaca ccatcagcaa gttcggcgtc ttccggcgcc 60
tccaccggcg attccggagc gcctgcatat catcgagttg cggcacacct tcagccgacc 120
ggcttccgcg ccgtcaaaat cgcatagccc atgtcgtcgg cgtatttctc gtaatcgcag 180
accgcggcgg cccagtcggc gacagccggc ccgtaccttt ccgcgatccc gtgctggtgc 240
gcagcgagcg cttcggcgaa ctgcggcatg aagtaccggg tccgcgacga cacgtcgtca 300
caagcgagga tttcgaaacc cgctgcacac agcgattcca gaagttgctc agccaggcag 360
atccggaggc cggtcggcca catgtcccag gacaccggga tcccgctgcc tatttctcgt 420
ttgacgacct cggtgactcc gaggatgcca ccgggtttga gcactcgaac gatttcccgg 480
atggcacggt ccggttccga catctccaac agcgactgga tggcccaggc agcgtcgaac 540
gcattgtccg ggtacggcag ggacatggcg tcgacgcacg agaagtccac ctggtggctt 600
agtccgcgtt cgcgggcgca atcaacggcg atggcagctt gcacctggct gaccgtgatg 660
ccggtgatcc ggatcgcgtt gtcgcgcgcg acgcgcagcg ctggttgtcc ggtgccgcac 720
cccacatcca gcagtcgatt gccgccatcg agcgcggtcc gttcggcgac aagatcggtg 780
agccggtcgg cggcctgctg ccaggaagtc cgcccgtcgt tctcccagta gccgtggtgg 840
atggcgcagg ggccgcccgc gaccgaattc agcaacgggg tgaccaggtc atacatctgc 900
ccgacctgct gcgatgttgg cacgccacct ggcaacaccg gtatgcctga tccctgcaac 960
gttcaccttc tcgaaatttt cttcaccgag cagacacaca gagagaaaag aaggcaaact 1020
agccgtcagt taattcgcgg ttaccgccgc atgcggcggt aacaactaat aaccatccgg 1080
ccaggaggcg aacagccagc gcaaattttc cccatgaatc cccgcagagc caacggtgat 1140
cagcgtatag gtctgtccca ccctgatcga gcgaagctca agcatttcac tcagacgggt 1200
gaaaccaggt gcggcaagct aaagatcctc gatgggggcg tgacactcct gcggcgacac 1260
actattattg ccgccggaag tccgttggac gtcaacggcc atcgacatac gcagcgcatt 1320
ttcctactcc tgaccaacag gagtaggggt tgctgcgcca atcgggggaa gagaccgggg 1380
tccgaagtat gcgcgtactc gtcgttccct tgccctatcc gacgcatctc atggcaatgg 1440
tgccgctgtg ctgggcgctg cgagcatccg ggcacgaggt tctggtcgcc gcgccaccgg 1500
agctgcaggc gaccgcgcat ggcgccggtc tcaccacggc cgagatccgc gggaacgaca 1560
agacccgcga cacgggtagc accacgcggc tgcgctttcc caatccggcg ttcggtcagc 1620
gcgacaccga gaccggccgg caactgtggg aacagaccgc gtcctatgtc gtgcagagct 1680
cgctcgatca gctccccgaa taccttcgac tggccgaggc ctggcgaccg tcagtgctgt 1740
tggtcgacgt ctgcgcgctg atcggccggg tgctcggcgg attgctcgac ctgccggtcg 1800
tgctgcaccg ctggggagtc gaccccaccg caggcccctt cagcgatcga gcccacgagt 1860
tgctcgaccc ggtgtgccgc caccacggac tggccggact gccgactcca gagctcatac 1920
tcgatccctg cccgcctagc ctgcaagcaa gcgacgcgcc gcgaggcgtt ccggtccagt 1980
acgtgccgta caacgggagc ggcgaactcc cggcctgggg cgcggcgcgc acctcagcac 2040
ggcgggtctg catctgcatg ggccgcatgg tgctgaacgc caccggaccg gctccgctgc 2100
tgcgcgcggt agcggctgcc accgggctgc ccggcgtcga ggctgtgatc gccgttcccc 2160
ctgagcaccg ggcacttctc accgacctac cggacaacgc acggatcgcc gaatcggtcc 2220
cgctcaacct gttcctgcgt acctgcgagc tggtcatctg cgcgggcggc tcgggaacgg 2280
cgttcaccgc gacccgactc ggcatcccgc aactcgtgct tccccagtac ttcgaccagt 2340
tcgactacgc gcgcaacctc accgctgccg gggcgggcat ctgcttgccg gatgagcagg 2400
cccagtccga ccacgaacag ttcaccggct ccatcgcaac agtgctcggc gacaccggct 2460
tcgcggctgc cgcaaccaaa ctcagcgacg agatcacggc catgcccaat cccgccgagc 2520
tggtgcggac gctggagagc tccgcggcca tcggtgcctg acgaactgct cacccgagaa 2580
cagacggatc cggagaaccg atgccctccc agaacgcgtt gtacctggac ctgctcaaga 2640
aggtactcac caacacgatc tacggtgatc ggccgcatac gaacgtctgg caggacaaca 2700
ccgactacag gcaggccgct cgggccaaag gcacggactg gccgactgtc gcgcacacga 2760
tgatcggtct ggagcggctg gacaacctcc agcactgcgt ggaagccgtg ctcgcagacg 2820
gtgttcccgg ggatttcgcc gagaccggtg tctggcgggg cggcgcatgc atcttcatgc 2880
gcgcggttct ccaggcattc ggagataccg gacgtaccgt ctgggtggtg gattctttcc 2940
agggaatgcc ggaaagctct gcgcaagacc acgagtcgga ccaggctatg gcgctgcacg 3000
agtacaacga cgtgcttggc gtcccgcttg agaccgtccg gcagaacttc gcccgctacg 3060
ggctgctcga cgaacaggtc aggttcctcc ctggctggtt ccgggacacc ttgcccaccg 3120
cccccatcca ggaactcgct gtgctgcgac tcgacggcga cctctacgaa tccacaatgg 3180
actctttgcg gaacctgtac ccgaagctct cgccgggcgg attcgtcatc atcgacgact 3240
acgtcctgcc gtcctgccag gacgcggtga aggggttccg cgcggaactc gggatcacgg 3300
aacccatcca cgacatcgac ggcacgggcg cctactggcg ccgcagctgg tgaacggctc 3360
agctgtcctc gacgccgagc gcttgccggg gcacgaaccc cggcgcggca ggctcagcgt 3420
tgagcccttt ttccacgaat accaggttgt ggtagaagtg cagggccgcc acgttccgtt 3480
ccgtgtagca gggctcggtc ccgcgccgcg attcgcgctc ctgataatgc aggccgtcga 3540
tcagttcttt gagcatgtcg atcgaggtgc gctgggccgc gggttccgta tcgcggccgc 3600
cgtagccggg ccagtacgac gtctggagat cttcgatgac gtacaaacca cccgggcgga 3660
cgtgcggaaa cagggcatgg aaggacttct tgacgtggtc gttgacatgg ctgccgtcgt 3720
cgatgacgat gtcgaacggg ccgatcttcc ccgccatgtc tgccaggaat tccgcatcgc 3780
tctggtcacc tcgcagcttt cgcactcggt gtccttcgtt cccggctttc tcgaaaatgt 3840
ccaggccgta cacgagacct cgccggaagt accgctgcca catgcgcagc gaagcgccac 3900
cgagttcggg tgcgtggtaa ccaccgattc ctatttccag cacgcgcacc gggacatcct 3960
ggaatcggga gaagtggtgc tcgtagtgtt cggtgtacca gtgcaggtcc gcccatttgt 4020
cggatccgta gcggaccgcc agctcaccga ggtccgagca cctggtggcg gctgcggcca 4080
gcaccgcgtg caccgcggtg gacaccttgt tccggaacgc cagcagtcgc tgcgcgccgg 4140
ccaggccctg gtcgggggag aactgcgtca tgctgtccga ccaacggact tcacggctgt 4200
tgtgccgcct gccgtcaacc gggccgaaga gcccttcgag cagatccacc gcgtcgaacc 4260
ggagaacagc aggtgcttcg gcgaccgccg cgagccgcag gcctgcgtga tcgagctcaa 4320
cggttctacg gaccagctga gcgccagaag tgatctccag gccgattcgc accgactcgt 4380
ccaacgacag cggatcgcaa cgcacgacga gttcgtcgac gatggcgtcg gccaccgcct 4440
ccagtccagc cacctgcact gcttcctgga gcctctccgt gcccgcaccc gccgcgagca 4500
gcaagtgctc caccaccgac cagggggcaa ctgcgatctc acccatggaa agtcatcacc 4560
ttttcggttc ctgcgcatag gacgccatgc gcaccgcgat accgatccac aaagtagagc 4620
cggcggaggc gaccagcttt cacgtatgag ccgaaacaca cagataatca cccggacgcg 4680
tggatgatct cggccgaggg cgaacaaagt ggaccagtca gcaaaggagg ggcggtgccc 4740
gatttccatg acccagcaac catgaatcgc cgaaccccag gaacagagat caccgtcgag 4800
cccggcgatc ctcgttatcc ggacctcgtc gtcgggcaca acccccgttt caccggaaaa 4860
cccgaacgca tccacatcgc cggctccacc gaagacgtcg tgcacgctgt cgccgaagcc 4920
gtgcgcaccg gcaggcgggt cggggtgcgc agcggcgggc actgcttcga gaatctcgtt 4980
gcggacccgg cgatccgggt gctcgtcgac ctctccgagc tcaaccgcgt gtacttcgac 5040
agcacgcgcg gggcattcgc gatcgaggcg ggcgccgcgc tcgggcaggt ataccgaacc 5100
ctgttcaaga actggggcgt gacgatcccg accggcgcat gtcccggggt gggcgcaggc 5160
gggcacatcc ccggcggggg atacggcccg ctgtcgcgcc gattcggttc ggtcgtcgac 5220
taccttcaag gcgtcgaggt cgtcgtggtc gaccgggccg gtgaagtgca cattgtcgag 5280
gtcgaccgga attccattgg tgccggtcac gacttgtggt gggcgcacac cggtggtggt 5340
ggcggcaact tcggggtcgt caccaggttc tggctccgag cgccggacgt ggtcagcacc 5400
gacccctcgg agctcctgcc acggccgccc gcgacggtgc tgctccgatc gttccactgg 5460
ccgtggtgcg aactgacaga gcagtcattc gccctcctgc tacggaactt cggcacttgg 5520
tacgagcagc acagcgcgcc ggaatccacg caactcgggt tgttcagcac gctcgtctgc 5580
gcacaccgcc aagccggcta cgtcacgctg aacatccatc tggacggcac ggatccgaac 5640
gcggaacgca ccttggccga acacctatcg gcgatcaacg accaggtcgg cgtgactcca 5700
gccgaagggc tgcgggaaac cctgccgtgg ttgcgatcga cccaggtgtc cggatcgctc 5760
gccgaaggcg gcgagccgag cgggcagcgg accaaggtca aggccgccta cttgcgcacc 5820
gggctgtccg aagcgcaact agccacggtt taccggcggc tgaccgactc cggatacgac 5880
aaccccgcag cagcgctgtt gctgctcggt tacggcggta gggcgaatgc cgtggcgccg 5940
tcggccacag cgctcgctca gcgcgactcg gttctcaaag cgctgttcgt cacgaactgg 6000
tcggagcccg ccgaggacga gcggcatctg acctggattc gtggtttcta ccgcgagatg 6060
tacgccgaaa ccggcggagt tccggtgcca ggtacccgtg tcgacggctc ctacatcaac 6120
tacccggaca ccgacctggc cgatccattg tggaacacct ccggagttgc ctggcacgac 6180
ctgtactaca aggacaacta cccgcggctg caacgggcca aagcgcggtg ggacccacag 6240
aacatcttcc agcacggcct gtcgatcaaa ccgccggaac ggctttcacc cggtcagcca 6300
tgaggagtcc gtcacgatgt ccgcaacgca cgagatcgaa accgtggaac gcatcatcct 6360
cgccgccgga tccagtgcgg cgagtctggc cgaactgacc accgaactcg gactggccag 6420
gatcgcaccc gtgctgatcg aggagatcct cttccgcgcg gaaccggccc ccgacatcga 6480
accgaccgag gtcgcggtcc agatcaccca cggggtcgag accgttgact tcgtcctgaa 6540
gctacagtcc ggtgagctca tcaaggccga gcaacgaccg gtcggagacg tcccgctgcg 6600
gatcggttac gagctcaccg atctcatcgc cgagttgttc ggcccaggag ctccgagggc 6660
cgtcggtgcc aggagcacca acttcctccg aaccaccaca tccggttcga tacccggccc 6720
gtccgaactg tccgatggct tccaagccat ctccgcagtg gtcgccggct gcgggcaccg 6780
acgtcccgac ctcgaccagc tcgcctccca ctaccgcacg gacaagtggg gcggtctgca 6840
ctggttcacc ccgctgtacg agcgacatct cggcgagttt cgtgatcgcc cggtgcgcat 6900
cctggagatc ggtgtcggtg gctacaactt cgacggtggc ggcggcgagt ccctgaaaat 6960
gtggaagcgc tacttccacc gcggcctcgt gttcgggatg gacgtcttcg acaagtcctt 7020
cctcgaccag cagcggctat acaccgtccg cgccgaccag agcaagcccg aggagttggc 7080
cgccgtcgac gacgagtacg gaccgttcga catcatcatc gacgacggca gccacatcaa 7140
cggacatgtg cgcacgtccc tggaaacgct gtttccccgg ttgcgcagcg gtggcgtata 7200
cgtgatcgag gatctgtgga cgacctatgc tcccggattc ggcgggcagg cgcagtcccc 7260
ggccgcgccc ggcaccacgg tcagcctgct caagaacctg ctggaaggcg ttcaacacga 7320
ggagcagccg catgcgggct cgtacgagcc gagctacctg gaacgcaatg tggtcggcct 7380
ccacgtctac cacaacatcg cgttcctgga gaaaggcgtc aacgccgagg gcgccgttcc 7440
tgcttgggtg ccgaggagtc tagacgacat tttgcacctg gccgacgtga acagcgcgga 7500
ggacaagtga acagcaaagg gtcgaacgca caggcctttc caagcgcgga tcaggtggag 7560
tccatcttcg acgcgttggc gcaagggcgt gccctgcacc acggatactg ggcgggcggg 7620
tatcgggagg atgccggggc cacaccttgg tcggacgctg ccgaccacct gaccgacctg 7680
ttcatcgaca aggccgcgct ccgccccgga gcgcacctgt tcgacctggg ctgtggcaat 7740
gggcagcccg tagtccgcgc ggcacgcacc aaaggcgttc gagtcaccgg aatcaccgtg 7800
aacgccgaac atctcgccgc cgctaccagg ctcgccaacg agaccggact ggccgacagt 7860
cttcggttcg atctagtcga cggcgcccgg ctgccctacc cggaaggttc ctttcacgcc 7920
gcatgggcga tgcagtccgt ggtacagatc gtcgaccagg ctgccgcgat ccgcgaggtc 7980
caccgaatcc tggaacccgg cggccagttc gtcctcgggg acatcatcac tcgtgctcga 8040
ctcccggaag agtacgcggc ggtttggacc ggcacgaccg cccatacctt gaacagcctc 8100
accgcgctgg taagcgaagc cgggttcgag attctcgaag tcaccgacct cacggcgcag 8160
accagatgca tggtctcctg gtatgtcgac gagttgctcc gggaactcga tgagctcgcc 8220
ggcgtcgagc ctgcggctgt cggcacctac cagcaacgct acttgggaga catcgcggcg 8280
aagcacggac cgggaccagc gcagctgatc gccgcggtcg cggaataccg gaaacatccg 8340
gattacgcca gaaacgagga aagcatgggt ttcatgctcc tgcaggcgcg aaagaagcag 8400
tcctgatggc ctccgagcac gccagcctgg tcggcgacga tctgcgggca cccgcggacg 8460
atcccttcta ccgaccgccg acgccgctgc cgccgggtgc cccgggcacg ctcatcaggg 8520
cccggcccgt cacggcactg cgcagcacgg gcgaacccgt cgcggccaag gtctggcaaa 8580
tcctctaccg gtccaactcc gccattggca ggccgaacgc cgtctccggc accgttctgg 8640
tgccgaacat cccgtggccg ggcgaagatc gccccatcat cactttcgca gtgggcaccc 8700
acggcctcgg cagccaagtt gccccgtcct acctgctccg aaccggaacc gagccggaga 8760
ccgagctgat cgccgtggcg ctcgaccgcg ggtgggccgt ggtcatcacc gactacgagg 8820
gcctcggtac tccgggaacg cacacctaca ccgtcggcag gccgcaggga cacgccatgc 8880
tcgatgccgc ccgcgctgcg cagcggctac cgggctcggg cctggggacc gactgcccgg 8940
tcggcatctg gggctatgcg cagggtgggc aagcgtcggc cttcgccggc gaactgcacc 9000
ccacctacgc ccctgaattg ccaatccgcg ctgcggccgc aggtgcggtg ccgatcgatc 9060
tgctggacat cctccaccga aatgacgggg tgttcaccgg gccagtgctg gccggcctgg 9120
tcgggcatgc cgccgcctac cccgatctgc cattcgacga gctgctcacc gacgcgggtc 9180
gtatcgccgt tgatcaagtg cgcgagctcg gcgcaccgga gctcgtcacc cgcttcctcg 9240
gccgcgagct gagcgatttc cttgatactt ccggcctttt cgagcaccct cgatggcgag 9300
cacgactggt ggagagcgtc gcaggtagga acggcggccc ggtggtcccc acgctcgtct 9360
accacagtac ggacgacgag atcgttccgt tcgcattcgg cgagcgactc cgggacagct 9420
accgcgcagc gggtacgccg gtgcggtggc atccgctctc cggattggct cacttccccg 9480
ctgccctggc cagctcgcga gtggtcgtct cctggttcga cgagcacttc tccgggccgt 9540
ccgcgatcag cggtccgcga gatgacgggt gagcggatgg cggtgagcct ctccagcggc 9600
ttcaccgccg gcttccggat gcccggtcag gcttcgaggc gaactactac ccgcggatgc 9660
cggacggcta cgtggacccg caccggcagg cctgaccgat cgcctccacc agcgcggcct 9720
gctggatcat cgattcgccc gaatctccgg ccaccgcagg ttcgtccacg cctgcctctg 9780
ctctgatgtc gcgtgcaaag gcggtgaccg ccttgcgaac ctgatcttcc gctggcaagg 9840
acaactcgtc gacaacgccc ttccgctcga ttcggatcac ggcctgccac tcggcgggcg 9900
gagtgaacgc ccgatcgatg acgattcgtc cacgactccc ccacagctcg tacgcgctgc 9960
ggtagtggtg cacgaaaccg tatccgaggt gggcaacggc gccaccttcc gattggagca 10020
gcacgctgcc cgacaagtcg acgcccgact catgagcctc gtgcgagctt gcgccggcaa 10080
ccgtgagcgg accgaggaga aagagccgag cggcacgggc gggatagaca ccgatgtcca 10140
gcaacgcccc gccaccgagt tcggtgcgat agcggatgtc cgtgtcggaa agcggcggaa 10200
tcccgaacac ggcggtgaac tcccggagct caccgatctc ctcggattgc agcaggtcgc 10260
ggaccacgtc gtgccggccg tggtggagga acaggtaatt ctcccgcagc agcaggtgct 10320
tccgccgggc cagcccgacc aggcgagcgg tttcggacgc cgtcgtcgtc agcggtttct 10380
cggcaagcac gtgtttgcct gcctcaagcg ccttgccgat ccactctgca tgcatgccag 10440
gaggcaacgg cacgtagacg gcatcgatgt ccggccgctc caggagccgc tggtaaccca 10500
gcaccgcctc gcattcgaat cgtgctgcga accgttcggc cttcgccgga tgacggctcg 10560
ccaccgccac cacctctgtt tcggccacgt cgcacatcgc gggcagcatc cgtcgccaag 10620
cgaaggaagc acacccgagc acaccgatgc gcaccggctt tcgcatcgag ctggtcatcg 10680
ccccaacgcc cacaagctat gcagggaggc aaccaagctg cgcgcctgga tgttcaagga 10740
gtgggtgctc cggagcagct cgcccaactg gcccaaggtc atccaccgga agtcgctcgg 10800
aggtcgtgcc gcgaagtcct catgcacctc gatgatccgg tacctgttct gcgcctggta 10860
gaaccgaccg ccttcttcag acaggatcga ttcgtaccgc acggtttcgg gatcggcggt 10920
gagcacgtcg tccacgaacg gcggccagtc gttgcgcgga gtgctttggt agttggccac 10980
actgcactgg accgtgggag cgatttccgc agtcgacttg taaccagcct ctacccgagc 11040
gcggaccaaa ccgtgcagca ctcctccgat ccgtttgacc aacagtgcga tctcacctgg 11100
ttctcgcggt tcgatcatcg gctgagtcca gctggagacc tcacgattgg tcgcggacac 11160
cgacactgcg atcaccgaga agtacttgcc gtcctggtgg gcgatctcgg tgtcggtgcg 11220
ataccacttg tcgaccctac tgagcggaac gcgcgttgcc cgcaagctgt agcgggcctt 11280
ggcttcctcg aaccaaccga ccgcctcggt aatactcgcc gaatcgatgc cgtgcgagag 11340
cgacctggcc accgcctgcc ggaagggctc cgccgaggcg gctagtccgg gcccggtggc 11400
ggaatcgtgg aacgggatgc aagacagcac cgtccgggtg tccatgttga cgatgttgtc 11460
ctgacgaagg agatccagca cctggccgag ggtcaaccag cagaagtcgg gcaggactgg 11520
cacttcctcg tcgacttcca ccaccatgtt acggttacgt ttccggtaga accaggcccc 11580
ctgttcagac tggagcacgt ctaccagcac gcggctgcgg ccccgcccga gcaagtagtc 11640
cacatagggt ggaacgctgc cacgatgtgc ctgcgtgtag ttgctccgag ttgcctggac 11700
cgtcggcgag agctgcagga cgttgacgtt gccgggttcc atcttggctg acatgaggca 11760
gtgcagcacg ccgtcgatct ccttgacgag aatgccgagg atacctactt cagcctggtt 11820
gatgatcggt tgatgccagc aggtcgccgc gccatagttg gtctcgacct gcaggccttc 11880
taccgtgaaa aatctgccat cagcatgaac caggttctca gtgctggcat cgaatttcca 11940
tttcgacagg cggtcgaacg ggatgcgagt ggtctcgaag ctgttctcgc ccaaccggtc 12000
ggccagccag cagtggaaac gcgtggtcgg aaacctgcca ttgcaagcgc tcagcgcaga 12060
gtcgacgaac cgccgcgtgt tgttgctgct gagcggcgca gcagcacttg cctcagcttc 12120
ggcaaaactg ctcataccca atccctcact gccgtgacga tgtgcacgcc gactcccagg 12180
ggcatcgcgg tctggtccag cgacctggcg aggatcaccg aagcgatccg cccgtaatga 12240
ccgcgagcac ctcgtcctgg cgttcggtca ttgaagaaat gcccgccgga gaagacacgg 12300
acatcggcct cagcctcggt gtgctctcgc caggcttccg cctcctccag ggtgaccttc 12360
gggtcggcat ctcccaccag cacatcgaac cgcggcgcga gcgctcggga cagcgggaag 12420
tagaagctgg ccgaaccacc tgcgtgcgga gcacgaccaa acgtgttcca gccccgggcg 12480
ccgggtggaa gcgacggatc caaatctcgc tgtgggttgc caagtcggtc acaattctcc 12540
ctcggacgac ggccgaactg gcgtgaacac ccaccccgcc ctggtagcgc accgactttc 12600
agagtcatca atcggtgttc tcgacaccag cggagcagga agtgagtggc atggcgatgc 12660
aagcggtcag gacagccgct gcggacgcgg catgacgtac ctttccaact tgtagtgcgc 12720
tgttgcttag cgggcgtcac gatcttcgtc ggaaagcggt tggagcgcaa acatctgggg 12780
agatcaacac tccattgggg gattccggac gagaatctga tttcgggact cgttcggcgt 12840
ccatgttgcc ggccatacgt tgctcacaga tggccgtcag acttcccagc gcggcggttc 12900
ttaccttcgg caaccagctt ctccagacgt gcaacgatgt cgtgcgggct gggattttcc 12960
tcgatttcac cgcggatcag cgccgcgttg gcggcgaacg acggctcgtc gagcaggcgg 13020
gccagctgac gtcgcacgtc gtcttcggta aacgtcgcgc ggtcgaggac cagaccggct 13080
ccccggtcgg cgaggagctc tgccctacga gattcgtccc agaaggtccc aggaagaatc 13140
aactgcggta cgccgttgac cgtggcggtt tcctgcgtcg tcgttgagcc atggtggatg 13200
atcgctgaac acgactccag cagttcgttg agcggtacgt attcgtggac ccggacgttc 13260
gggggcaact cccccatctc ccgtacttca ccgccagaca aggtggcgat cacctcgacg 13320
tcgagcccgg ccgcgccgcg caacaacgtt tccaccattg cctgttcctg ggcttctccc 13380
tcccactgct cggccaccct gctctgctgc cgcttggtca gcccgcgggt gacgcagaca 13440
cgcggcttgg tcggtggttc gcgcaaccac tccggcacca ccgccggacc gttgtacggc 13500
acgaagcgca tcgagatgta gtccaagtcc acaggcagtc gcatccagga tgaaaccgga 13560
tctatggtcg cttggcccgt cacgatctct tcatcgaacg tggcaccgaa cttggagagc 13620
ttcgctccga gccacgcccc gagcgggtcg acgcgctgct caggcggctt cgattccagg 13680
tattcgagga aaccggaccg cagccacccc gacacatcga gggcgacgag catccgtacg 13740
tgtcgtacgc cgagcgcttg cgccacaact ggccccgaac acaccatggc gtcccacaca 13800
acgagatccg gctgccattt ctcggcgaac cccatgagat cgtccagcga tcggtcatcc 13860
acaaggtgga gttgctccac ggcgtccatg tctctgccag agttgattga cagcagctcg 13920
tcgaagagtt ccggacgccg gccctcgtcg aacgcgaccc cgttgccgag aacgagtttg 13980
ttcctggccg ccaaggagat gaggtcgagc tcgtcgccga cgggaaccgc ggtgagtccc 14040
gctccggtga ccatcgacac catattcggg cagatggcga cacggacctc gtgccccgcc 14100
gcacgcaacg cccacgccaa cggcaccagg ttgaagaagt gcgaactcgc cggtagcggg 14160
gtgaacagaa cacgcatgca ctctccgatc gcaattgaac acccgggaaa acatggcaag 14220
aatcacagaa acatgtgata tacccccggg aaacgccgct cccctaagct acatcttcct 14280
cggtgcatcc aggcttaggc cttccaggtg atggtagcga tcttgacaag cgcaagcagg 14340
tcgttcccgc tagcctggac tctgtcgagt cgggtgtgcc gggtagatcg aggagcactg 14400
agtcaatgag cgcttctctg tgctccgctg tcctcatgtc ccgcaccgtg tcgaaccagg 14460
acaggaagga gtcatgctcc gggacagcac cattgtccca ctgggacccc ataacgcgat 14520
tcgccacggg aatcgctcac ctcctgaagg tcaaggcgcg aggactgatc gtcgcctgca 14580
tgcagagccg gaaaaagccg gagcgttggg gaaagggcgc gccagagtga cttcgtgtga 14640
cgacacttgc gctaccgcta ctgagatgac gccggatgcc aaggaccgga tattggcatc 14700
cgtgcgcgat taccaccgcg agcagaaatc ttcgatcttc gtagctggat cgacaccgat 14760
ccgaccatcg ggcgccgtgc tcgacgagga cgaccgggtg gcgctggtgg aagccgcgct 14820
ggagcttcgg atcgccgcag gcgggaatgc tcggcgattc gagagcgagt tcgcccgctt 14880
cttcggcctc cgcaaggctc acctcaccaa ctccggttcg tcggcaaatc ttctggcgtt 14940
gagttcgctt acctccccca acctcggcga ggcacgacta cggcccggcg acgaagtgat 15000
cactgcggcg gtcgggttcc ccacgaccat caatccagcg gtccaaaacg gactcgtccc 15060
ggtattcgtc gacgtggaac tgggcaccta caacgcaacg ccggaccgca tcaaggccgc 15120
cgtctcggaa cggacgcgag ccatcatgct ggcgcacacc ttgggcaacc cctttgccgc 15180
tgacgaaatc gcagagatcg cacgagaaca cgagctgttc ctcatcgaag acaactgcga 15240
tgcggtggga tccacctacc ggggacggct gaccggaacc ttcggcgacc tgacaacggt 15300
cagcttctat cctgcccatc acatcaccag cggtgagggt ggctgcgtgt tgaccggcag 15360
tctggagttg gctcgcatca tcgagtcgct gcgtgactgg ggacgggatt gctggtgcga 15420
gcccggcgtg gacaacacct gccgcaagag gttcgattac cagctcggta ctctcccagc 15480
cggctacgac cacaagtaca cgttctccca cgtcggttac aacctcaaga ccaccgacct 15540
gcaggccgcg cttgcgctga gccagctgag caagatttcc gaattcggat cggcacgccg 15600
ccgtaactgg cgacggttgc gcgaaggtct gtccggggtg ccgggcctgc tgctgccggt 15660
gcccacgccg cacagcgacc cgagctggtt cgggtttgcg atcactgtca gtgcagacgc 15720
cgggttcacc cgtgccgccc tggtgaactt cctggaatcc cgcaacatcg gcacccgact 15780
gctgttcggc ggtaacatca cccggcaccc ggccttccag catgtgcggt accggattgc 15840
cgacgcgctc accaacagcg acatcgtcac cgaccgaacc ttctgggtcg gcgtataccc 15900
aggcataacc gaccaaatga tcgactacgt cgccgaatcg atcgctgaat tcgtggccaa 15960
gaattcctag catccagcat ggctgcatct cggaggattt cagcaacgtg atcaacctgc 16020
accagccgac cctcggcgcc gaagaactcg acgcgatcgc ggaggtgttc gccagcaact 16080
ggatcgggct cgggccgcgc acccggacgt tcgaggccga cttcgcccac cacctgggcg 16140
tggatcccga ccagatcgtg ttcgtcaact cggggactgc cgcgctgttc cttaccgtgc 16200
aggtgctcga cctcggccca ggcgacgacg tggtacttcc ttcgataagc ttcgtagcgg 16260
cggccaacgc catcgcatcc tccggtgccc gcccggtgtt ctgcgacgtc gacccccgga 16320
cgttgaaccc cactctggat gatgtggcga aggccataac gccaacgacc aaggccgtgt 16380
tgttgctcca ctatggaggt tcgccgggcg aagtcaccga gatcgccggt ttctgccgtg 16440
aaaagggcct cgtgctcatc gaggacaccg cctgcgcggt ggcatcgtcc gtgcacggca 16500
ccgcctgcgg aacctttggt gacctggcca cttggagttt cgatgcgatg aagatcctgg 16560
tcaccgggga tgggggcatg ttctacgcgg cggaccgcga gctggcgcac cgtgcaagac 16620
gactcgccta ccacggtctt gagcagatga gcggattcga ttcggccaag tcttccaacc 16680
gctggtggga tatttgcgtc gaagacatcg gccaccggct gatcgggaac gacatgacgg 16740
cagcgcttgg cagcgtgcag ctgcgcaaac tgccagattt cgtcagcagg cgccgggaaa 16800
tcgctacgca gtacgaccgg ttgctttccg atgtgccggg tgtccacctg ccgccgacgc 16860
taccggatgg gcacgtctcg tcacactact tctactgggt ccagctcgct ccggagatcc 16920
gcgaccgggt agcgcaacaa atgctggaac gcggcatcta cacgagcttc cgctacccgc 16980
ccctgcacaa ggtccccatc taccgcgccg actgcaagct gccttctgcg gagcacgcct 17040
gccgcagaac actcctgcta ccactgcacc cgagccttga cgacgccgag gtgcgcacgg 17100
tggctgacga gttccgcaag gccgtcgagc aacacatcag ctgaagatca ccacgtcgaa 17160
agtgaggatg tcgcgcgtga gcggcacatt cgaagaactc tcctcggtat acagcccaga 17220
ccatgccgac atctacgacg cgatccactc cgcgcgtggc cgggactggg caaccgaggc 17280
cgaggaaata atccagctca tacgcaccag gctgcccgaa gcacagtccc tactcgacat 17340
cgcctgtggg accggggcgc acctagagcg gttccgtacc gaatacgcga aggtcgcggg 17400
gcttgaactg tccgatgcga tgcgggagat cgcgatcaga cgagtccctg aggtaccgat 17460
tcacactggt gacatccgcg atttcgacct cggcgagcca ttcgacgtcg tcacctgcct 17520
gtgctttacc gcagcttaca tgcggaccgt tgacgaactg cgacgcgtga cgcggaacat 17580
ggcccggcac ctggcccctg gcggagtcgc ggtcatcgaa ccctggtggt ttcccgacaa 17640
gttcatcgac gggttcgtca ccggagccgt cgctcaccac ggcgagcggg tgatcagccg 17700
gctatcgcac tcggtcctgg agggccgtac gagccgaatg accgttcgct acacagtcgc 17760
cgaacccgcc gggatccggg atttcacaga gttcgaaatc ctctcgctgt tcaccgagga 17820
cgagtacacc gccgcgctcg aggacgcagg aatccgcgcg gaataccttc ctggagggcc 17880
gaacggccga ggcctgttcg tcggaacccg caactgagcc cggaacaaag acgcaaggcc 17940
ctggctgggc aggccccaga actcactcga gcggtgagcc gacgtgaccc cgggatcact 18000
tcgatcgtcc ccgatgccgg caagttactg accgccgtga tctaattacc aacaccgtcg 18060
gacgagttct ggtacctgtt tcggctggtg cagcgaaccg ggaacggggg cgtgttcgcc 18120
aggggcatca gtgcgattca tagatcgagg agggccgcac cgccgtcgat caagtgcgca 18180
acctcggtcc accgggactc gtcacgcact tcctcgacag agagctgagc gagtttccca 18240
ccgttgcgga cctcttcgag caacctcgat ggcgggcacg actcgaggaa agttccgccg 18300
gcaggagtgg cccggtagcc ccaacgctcg tctaccacag cacggacgac gagatcgttc 18360
cgttcgcttt tcgagaacgg ctccgggaca gctaccgcgc ggcgggaacc ccggtgcggt 18420
ggcatccact gtccgggctg gcacacttcc ccgccgccct ggccggctcg caagtcgtca 18480
tcgcctggtt cgacgagcac ttctccgagc cgcccgcgat cagcgggagg cgatgacagg 18540
tcgacgagtg gtggcgagcc tttccaacaa cttcacggcc gtgccgcatg cgtcgtactc 18600
cgcatactca accgccagcc gctgcgcggc ctgccgatac cgaggatctg tcgacatcac 18660
cttcaccgcc ccagcaacct gctgcggcga cggcctccga gtccgcaaat caacccccgc 18720
gccactccac gcaacccgag cacacacatc cgttttgtcc tcactacggc cggcaacgac 18780
caacggcaga ccgtgcgaca acgcctgctg aacggtcccg aacccaccgt tggtcaccac 18840
cgccgccaac ttcggcatca actctcggta aggcaagaaa gacgccaccc gcgcattgtc 18900
cggaacataa cccagatcaa cgccttcgcg accggtcgtc gccaccacca gaacttgatc 18960
gccggccaga cctcgcagcg ctggacggat cagatcatcc gcatccacag ccatcgttcc 19020
ctgtgtcacc aacaccaccg gacgatcgcc gtccaactcc ccccaccagg acggcaatcc 19080
cacccccatc ggcgaatcag gttccaaccg tccaatgaag tgcatctgct gtggcagggc 19140
tcgcggatac tccaaggatc gcgttccagc ctgcatgaac agatacgggg attcgctgac 19200
ctctcggctg accggaaccc cgatggaatt ccagaacgcg ttgatcttct tcatccccgg 19260
atcgtgcacg agcgcattta tcaaccggtt cccgattctg ttccgcagcc ggtggaaggg 19320
gctggtgccg aacttccatc cagtaccgat gggcggcacc gctggatcgg gcagcagaat 19380
cggcatctgc gagatcgtgg cccacagaac gccggtaaca gcgtggacca gtttagctgg 19440
tccccatgac gcatcggcaa gcagcacatc agcccgggtt cggtccacta ctgccacgag 19500
atcgcgatac tggccttcgt acgccggaac ccagtggttg tccatcaacc agcgtgcccg 19560
gcgacgcgcg gacatctgga tgctttccgg aaacttctgc tccagctcgc ggccgtcgat 19620
gaagcgcccc tcgaccggcg cggcgaaatc cgctccggag cgttcaaccg cagcacggta 19680
gttctcgccc gtgtaccacg tgacctgatg gtcgcgctcc accaaggctc gggaaacagg 19740
aaccagcggg ccgatatggg cgtgatccgc ataagtagca aacacgaagt gcgccatgat 19800
ccagtcccaa ctccccaacc accgaggccc gcaatacacg ccaccgtaac agaattgcac 19860
gaccgccggg tgccgagcga ctcgggggta aagattcagt tggattgccg tgagtggtct 19920
gcgtgatggc atggcgtgat cgcccgatgg gttcggtgcc ggtcggggtt ggcgcgggtc 19980
aggctggggt ggggtccggt ggcatccagg ggtgtccgcg aatggcctcg cggagggctt 20040
cgatcatgtt gcggccgtgc ttgccggcgg tggagaggga tccgcggatt cggtagcggt 20100
ctttggtgcg tttctcgatg gtcagccgcc cggagatgtt ctgttgcacc ttggcgggcc 20160
gtaggtcgcg ttcggcttgg ttgggtgtgc gcggccccac tcgcagcggg gtttcgtcgc 20220
agcagacagc gtaggccagg gtgtggatcc gcttgtcgac ctcggtcaac acgcccgcgg 20280
cgcgggtaag cacgccgtgc acgaaaccca cgctgggcac ggcgccggtc agcgatgcca 20340
acagctcgac gcagcggtgt acgggaatga agtgcacgac catgaggtac acagcgaaag 20400
cctgcaggtt cgggccgtag cccaccgcgc cgggacgggc gccttccggg cgggcagcgg 20460
tgtgcaccct gccgcagccg cagcgcaccg cgtgctagtc gtactgggtg acctttcacc 20520
gacaccgggg agatctcatg ctgctggtag cgatccacca cccccagatc ccgtgccccg 20580
ggccaggtca ctgccgcact cgcatacgcc cccgggaaac cgatccttgt gatctcccgg 20640
gagatcggtc caggccagat tcgcccccgg cgccccgggc tgcttgccct tccgcttcac 20700
cgcgccgccg cgtttcgcct tcgccggagg cggtgtcctg cccgggccgt cgtctttgga 20760
cggagcagat gacgaattct tgctgttacg agacagcgca tgctccagct tcgccagtct 20820
ctcgcccagt gcctcgttga cctccgccaa ctcggccatc tgcgcggcca tcgctgtgat 20880
ctgccggtct cgcaccgcaa tctgctcgcc cagcaccgcg atccgcgcgc cctgctcacc 20940
cacgagctcg atcaactcgg cgacaccgac acagaaacca caccagctcc agcagacgca 21000
gagtgccacg ccagcacccc tgtcactaca cacagtgcac cggctgaatg tttacactcg 21060
gggagttagg accgcgggcg aattcgcaca tagcccagca ccgatgtcag tcgatcaagt 21120
tatgttccgg gccgcctccg ctctgcccga acccggtcca gtaccgtaac aacacatcat 21180
ggagataatc ggcagaggat tcatcgcccg caacttgctg cggatctccg ggcggcacgc 21240
agacgcggtc gcattggcgg ctggcgtgtc gaacaccagc tgccgctccg aggacgagta 21300
tcagcgggaa gccgccctcg tgtaccggac catcgaacgc tgccacgcta tcggccgcaa 21360
actactgttc ttctccaccg cgtcagcatc gatgtacgga gcgctcacct caccagggtt 21420
tgaggacggt ccggtgtacc cgccgaccac ctatggccgt cacaagctgg ccatggaagc 21480
ggtgatcaag gcatccggag tggactttct catcctgcgg ctggcctacg tcattggagc 21540
ccaccagcgc ggacaccaac tgctcccgtc cttggtgacc cagctcaggt ccggctcggt 21600
cacggtgcac cgaggcgcgc atcgcgatgt aatcgcggcg gacgacgtgg tgaccatcgt 21660
cgacgacctg ctcaccaagg cggtcgcggg gacggtggtc aacatcggct cggggttccc 21720
cgtcccggcc gagaagatcg tggcacattt ggagtatcgg ctgggaacgg cagctgcacg 21780
gcagtggatc gaccatccta ccgaatacca gatctcgttg acccggctga acacgctggt 21840
cccacgaatc gccgagttgg gcttcgggcc ggactattac cggcaggtgc tggaccacta 21900
cttggacctg tacccacagg cctgatcgat cgtcgtgacg agcacggcct gccggatcac 21960
cgactcatcc gaatctccag ccaccgcagg gtcatccacg cctgtcccgg ctctgatgtc 22020
gcgggcggaa gcggtgaccg cgttgcggtc ctgatcttcc gccggcaggg acaactcgtc 22080
gacgacgccc ttgcgctaga tccggatcac gggttgccac gtggcgggcg gattgaatgc 22140
ccgatcgatg acaatgccca ccgccgtatt ggccaccgcg cacatcgcgg gcagcatccg 22200
ccgccaggcg aaggaagccc acccgagaac gccgatgcac cggctttcgc accgaactga 22260
tcatctcccc gtcgttgacg ttgcccggtt gcatcttggc tgacatgagg cagtgcagca 22320
ccccgtcgat ctccttgacg agaatgccca cttcgacctg attgtgatcg gttgatgcca 22380
gcaggtcgcc gcgccgtagt cggtctcgac ctgaagacct tcgaccgtga agaatcgacc 22440
ttcggcgtga accaggttct cggtggccgc ttcggatttc cagttcacta gggcgctgaa 22500
cgggatgcgc gtggtctcga agctgttctc gcccaggcga tcggtcagcc agcagttgaa 22560
ctcagcattc ggcatcagcc gattgcaggt actcagcgct gagttgatga accgctttcc 22620
gccgagcagc gcatcggtac tcgccccagc ttcaacagag ctgcccctgt ctagtccctc 22680
actgccacaa cgatgtgcac gctggctccc aggaatattg gggactgcga gacttccacc 22740
tggtaatcaa gcttgcggag ccgatcgagc acctggcgca gccgaccgca gatgtcgtgc 22800
acttccacga ctatccgacg aatacggggc cacatctcgt catcgatccc gttcagcacg 22860
tcgagctccc cgcgttcgac atcgatcttg agcagatcga gcacatcaag ccggtgctgc 22920
cttgcgatct ctgtcaacgt ggtcacccgc acgtcgagtt cttccttggt gcggaccagc 22980
ccctgcatcg attcccccgc cagctcgggg ctgccgacgt tcgacatcac cgtgtcgatg 23040
ttgcgccgct catccgccgc atcgaggtgc agcgtcgaca aagatggccc tgcggggtaa 23100
tagacgaagc gggacgttcc gggctccgca cccaccgcca ggtcgaacgt cacacctcgc 23160
ggtacgtggc gggcgaagtt ttcccgcagg caggcgaagg tcgttggtgc cggttcgtaa 23220
gcgagtattc gtgctgctgg aattcgatcg gcgaagtaca tcgacgcaag gccgacatgc 23280
gcgccgacat ccacgatcac cgaatcagca cccaagccgc gcagaccgcg ggcgtaagcc 23340
gaatcgttcg cgatgtcctg ccaaatggca agtacttcaa gcgtgttggc gcatgcgact 23400
agacgcccat caggaagggc ctgcgtcgcc ctgtcgtcaa ccgtgtcgaa catcggtaac 23460
gccccgtcct cagcgattcg gaactcaacg gagttccctt cccagaacaa ttcccgagct 23520
atattcccga tatctcacgc ggccggcaat ccgaagatca cccaatcacc cttcgcaaca 23580
atcgaagggg tgagacggaa gtttccgacg gatgacccgt ggtgtagcag caccccgcgt 23640
gctgcagtcc actcgacgca gtcgtccccg acgggatgat ccgccatcac cacgtgatcg 23700
ggagggcctc cggcccgcgg gtcattcggc ctcgtttcca gacgacctca tccaccggaa 23760
ccgcgaagga cagctgcgga aaccgtttga tcagcgtgcc gatcgctacc tgcagctcca 23820
tcctcgccaa ttgggcgccg atgcagtagt gcgggccgtg ccccagcgcc atgtgcgagt 23880
tgtgctcccg agccaggtcg agttcgtccg gcccgtcgaa gacggcactg tcacgattcg 23940
ccgaggctat ctcgaagaac actgcgtcac cccggcgaat cgacactccg cccagttcga 24000
gatcctcagt cgcgatgcgc gggaacccag gtgtggcacc gagcggcgtg tagcgcagca 24060
actcctccac cgcgcgcggc accagctctg gatcggcgat cagcttgtca agctggtccg 24120
gatgggtgag caggttgaag gtgaagtttg cgatgtggtt agcggtggtc tcgaacccgg 24180
cgatcagcag acccgcgccg gtgacgacaa tctcctcctc gctcagctgg gctccttccg 24240
ctctggcctg gaccagcacg ctcagcaggt cctcagtcgg catcttcttg cgctgctgga 24300
ccagttctcc gatatacgcg cgaatttgat cgcggctttc ccgaatctcc tcaggactgt 24360
tcgatgtgat cgccaacgca atgtccgacc agacccggaa gcgctcgcgg tcggccaccg 24420
gaatgcccag caagtcgcag atcaccttga tgggcagcgg cagggccagg gctgacacca 24480
ggtcgccagg cggcccgtcc gcggccatcc gatcaagtag gtggtcaacg agctgctggg 24540
tgcgggggcg aagctgttcg actcgacgtg cggtgaacgc cttaccgacc agtttgcgca 24600
gccgcgtatg ctccggcggg tccatcgttc caagcgaatg ttctcgcaag atcagcggga 24660
acccccgcgg tacatccctg ttcaggatcg ccgcagcgct gaatcgcgga tcgccgagca 24720
ctgtcttgat gtccgcatac ctggtgacga gccaaccgtc accaccgtat ggaagccgga 24780
tcttgctcac cggttcgccc tcacgcaaaa cggcgtagcg atcatccagc aacagccggt 24840
cgatctcgcc gaagggatag gccattggct catcaccatt ggtcatcaac ggtcccctct 24900
ctcagacctg gccccgttcc ggggacgcgc ttcgcggagc cagcggtgga ctccacctcg 24960
attgcgctac ttcctcttgc tagtcagcag tttcccgatc acgaggatcc aacgactcgc 25020
tccgcggaac tggctcccag tgacgatcac ggaagcaatc cactggtgat gaccgcgagc 25080
acctcatacc ggggttcggc gaggtagaaa tgcccacccg gcaacacgtg gaggtccatc 25140
tcagcttcgg tgtgctcgcg ccaggattcc gcgtcatccg gagtgacctt cgggtcggca 25200
tcccccacga gcacggtgat cgaacaactc accttcgatc ccagcggaca acggtaggtt 25260
tcaaccgctc ggtaatcatt gcggacggcc gtcaggacca tgcgcagaag gtcctcgtca 25320
ccgagcaccc aggaggtggt cccgcacagc tgagcgctcg ggatagtggg aagtagaagc 25380
tagccgagcc gcccgcgtac ggcagcacga ccaaagagac cctggcccgg ggtgcgggat 25440
ggaaacgttg gatccacaga tcgtcgccgg ttgctaggtc ggtcacacga ttctccctag 25500
gacggcagcg gaactagcgt cgatcgctga gccagctgcc ggtagcgaac caactttcgg 25560
agccatccat caccattcgc tgatgtcacg atcttcttgg gaggccggtc ggcgcgcaaa 25620
cctgctgcga gatcagcact ccatcgggcg acttccgatg ggaaaatcca atctcggaac 25680
tcaaccgttg actgccgtga ttcaagctgg gcctcttgct ggaagtaact gttgcaggaa 25740
atacggaagc atagagcatc cggagcatcc ggagcatccg gagcatccgg agcatccgga 25800
gcatccggag catccggagc atccggccag atctcgaatt gctcgcggtc gacctccgga 25860
actcccgaca gcagcgagtt gatcaccttt atgggcaaat gtagggccag atcaaggaat 25920
tgtcaacacc tgtggatcaa gattgttcag gcgacggtgg gtggggcgtg gtcgtcgggg 25980
cgttcgacga gtttgccgcc ctcgaagcgt gctctggcgc gcaccagcgc gacgagttcg 26040
ggtgcgttga ccgcgggcca gcgagcctgg gctgactcga tcagcttgaa cgccacggcc 26100
agcaactcgt cgacatcgtc ggtgatcttg gcgactgcct tggggaactt cgccccgtag 26160
gcggcctcaa acgccctcac cgtgtccaac acgtgccggc tgtcctcggc attccagatc 26220
tcgcccaggg ccttcttcgc gccgggatgc gccgatttcg gcagcgcggc gagcacattg 26280
ccgatcttgt ggaaccagtg gcgctgctcg cgcgtatcag ggaaagcctc gcggagcgcg 26340
ccccagaacc ccagtgcacc gccgccgatg gccagtaccg gggcacgcat accgcggcgc 26400
ttgcagtcac gcagtaggtc agcccagccc aggactcggt ggattcccga tagccgtcgg 26460
ccagcgcgac gagctccttg cggccgtcgg cacggactcc gatcacgacc agcagagata 26520
gtttgtgttc ttccaggcgg atgttgacgt gaatgccgtc tgcccgcaga tacacaaagt 26580
ctacttcgga caaaccacgt tcgttgaaag cacggtgttc ggtcctccac tgctcggtca 26640
gtttcgtgat caacgtagcg gataacccct tgctgctgcc aaggaattgg cccagcgcgg 26700
gcacgaagtc tccgctggag agcccgtgta ggtacagcag cggcagcatt tcggtgatct 26760
tcggggtctt gcgcgcccac ggcggcagga tcgccgagga aaaccgccga cgcgcgccag 26820
tgtccgggtc ggtgcgcttg tcgttgactc gtggcgcagt gacctccacc gcgccagcgc 26880
tggtcaacac ctcacgaggc tggtcatggc cgttgcggac caccaggcga tggccgcact 26940
catcgcgctg atcagcgaac tacgcgatgt aggcatccac ctccgcctgc aacgcctcgg 27000
ccagcatccg gcgggcaccc tcacggacaa tctcatcgat caacgacgcc aaagctgcgg 27060
caggacggcc gtcgtcacgg gaatcagggt cggggactac gctgagcatc gggtcgtacc 27120
ttcccgaccg acgcggcaac gtcggccatg cttggaacct tgcatccgat cactgggaag 27180
gtacgcccct cctcagccga tccaaggttc cgagcattcc tcgaccagat ccgaaagaag 27240
gtcgacttgc gcaacggcgg cacgcaggac caactcggac tccgcctcaat cctgaccag 27300
tgcacgggca ctcgatcgga tggactgctc ctgtggctgc gagattttgg atatctttga 27360
cagaagagca ctttcgcgga tataggctga tccgatggct cgatcgaaac ccccacacgt 27420
cccgacagga aggcttggcc tttgaagact cacgacgccg cgtccggaac gacggccacc 27480
gtacaactcc agcgaatcat cggagggcac ttggcctacc acgtccttgg cgccgccgct 27540
cgattggaca tagccgacca tctgcgcgaa ggcccgctca cggcttccga gttgagcgac 27600
ctcatcggcg gcgacgatcc cgaaatagtt gacaaattcc tgcgagtggc cgaaacgatc 27660
ggtctggtgc gcaggaccag ctccggtcag ctggcggaga ccgaactgct cgctctgctg 27720
cggcgcgacg gggggcggta tcggagtacc gtcctggccc tcacagcccc gggtttcaac 27780
cgccccagcg agatgatgca ccgcgcagtg ctcagcggca gggcgcatac cgcgcaggtg 27840
ctgggcactg atctgtgggg ttactacggc accaaccccg aagaagccaa atggttcggc 27900
ggcgccatga ccgacctgac caacttggtc gcagacctgg tgctggcccg gtacgaattc 27960
tccggacgcg gcacgatcat ggatgtcggc ggcagccatg gcatattcct gtcccggatc 28020
ctgcacgccc aaccggacgc gaagggtgtg ctgttcgacc gcatggaggt ggtcgaagaa 28080
gcccgcaatc acctagatca ggacatccga acccgcatcc agatcgtcgg cggaaacttc 28140
ttcgagggag ttcccgaagg cggggatctc tacatcctga agagcgtgct gtgtgactgg 28200
gacgaccaga gctgcctgca aatactctcc cgcatccgga acgccgccat gcccggtgct 28260
tcgctactga tcgtcgactg gttgtaccct gacgagtccg accccggctt ggacgcgatc 28320
tacctccagc aggcgatctc ggtcaacggc cgggtccgca accaggaaca gttcgaatct 28380
ctcctaaagg caacaggttt cgcggtcacg agggtcgaac gaaccactcc ggagaactgg 28440
atcccggcga caatcatcga agcgatccgc cggtgatgac cgcgagcacc tcctcctggt 28500
gttcggcagg cagaagtggc ctgcgggcaa gaccgggagg tgcactccat tgtctcgtct 28560
atgaaccttg atcgggttat tgatcattag ctgttgggtg tcgggcaggc ttgctcggga 28620
aggtcaagac tggtggaggt ggtggatgcc tgtcggggct tcctcccacg gtagttgatc 28680
gctggcttcg gttcctcgct tcgtgctgat gagagtgcac gggtcgatgt ccggggtgca 28740
ggcggttcca gcgtgtcggt ctaccgatgc tctggtgttg gccgtttcgg gtagagccgg 28800
gtcatcgcgt ggatcgtgtc ggtcatgcgt gcgaccagtt cggcgggagc gagcacctgg 28860
acgtccgggc cgaggcgggg gtgtcaggtg ggttgtgtac aagatccgat agttgatgga 28920
gtacgcagat ggccacgact gatcagcagc aggaagggaa tccgttcggg cggccgccgg 28980
gcagccggga tcggcgcggg acgcggtcaa cgagatggtc gacgcgggcc tgcttgacgg 29040
gatgatggac gccatcgatc gggatgggct ggcgttgacc gggcagggtg ggttcctgcc 29100
tgagctggtc aaggccgtgctggagcgcgg tctacggacc gagatctcgg aacatctcgg 29160
ctatgacaag ggcgatccgg ccgggcgggg cagcccgaac tcccgcaacg gcaccagcgc 29220
caagaccctc tcgaccgagg tcggcgacgt ggacttggac gtgccccgcg accgcaacgg 29280
cagtttcgag ccgcggctgg tcccgaaggg ctttcgccgg gctggcggcc tggacgagat 29340
catcatcatc tcgctgtacg cgggcgggat gaccgtccgc gacatccagc accacgtgca 29400
gcgcacctac ggaaccgagc tctcgcacga gacgatctga accgtcgcgg tttcggtaga 29460
ggcgttgatg tgccccggcg ccggcggagt caggttactc gatcatcgtc atctcagtgc 29520
cccgactgga ggttgcctcg catgccccgg agatacccac cggagttccg ccgtgggaat 29580
ggatcaccgg cggcgaaagg ctgatgacca acggactgtc ctttcccctt gctggaaacc 29640
ctcaacgcct acctctcgaa ccaacgctca tggcagcgca ccagcgcgac actccacgtc 29700
caccggcaaa ccgtcctcta ccgcatacgc aagatcgaag agctcaccca ccacgacctc 29760
agcgaaacaa gcgacattgc cgaactgtgg ctggcgctgc gcgcactcga actcatctcc 29820
cagtgagtgc ctgctgatct cggggaagcg aacgcggtat ttagccgaag tcgcgtctgc 29880
tgaccgcgca aaccccgtgc cccgcggagc ggtgaacgtg cgaggtctcg gctactcttc 29940
tggagttccc aaagccgctg cgggcaagtt ccgcgacatg accggagctt tcgtacccgc 30000
tgcccgtgcg ttacggctgg gccaacgccg cccaggtcct gcaccggacg acccgcgtcc 30060
acgcccaggc cacccgtcgg gtcgtggaga ccacccagtt cctgcaggac gtacacgtac 30120
aggccgacgg tgggctgcca acccccgagg gccacgggat gcggtcggcg cagaagatcc 30180
ggctgctcca cgccacccat ccgatacttc ctgcggacca gcgacgactg ggactcggca 30240
gtgctcggcc tggccgcgaa ccaggaagat ctggccgccg ccatgggcac cttctccgtg 30300
tgcctgcccc gggggctggt ggcgctcggc gtggacctgc ccgaccacga ccgcgacgac 30360
tgcttccacg tctggtcggt cgtcggccac ctactcggcg tcgacccgca acttatgccc 30420
gccgggatcg acgaaggggt cgctctgatg gagcggatct ggggccgcca gaccgcggaa 30480
tccgacgccg ggaaggtctt caccgccgca ctggtgtagt ccgtgcgtaa tgtgctcggt 30540
cccgcgctgc acggcacccc atcgaggcga tgatcccgcg cttctgcggc gacgagttcg 30600
ccgacctgct cgccgtcgac cccgcggacc ggaccgccct cgctgccggc cgcgcctcaa 30660
ccgtcaacac ggtctacggc aagaccggcg accacagcga actcgccgcg tcgatcgcca 30720
gcaggattgg cgagcttctc ctcgacgccc gccctccaca cggccaaccg cagcaaccgc 30780
tacgactgga ccattcccaa ctgcgaacaa cagcaccgcg aaggctcgcg atgaacacca 30840
cagctattga tcgggaactt ataggtgtga tcttcgttgt tctggcatga tcgctggtgt 30900
gcgggacgcg cggaggttgt cgcctgaggc gcaggaggat ttgcggcgca gggtggtcgc 30960
tgctgttcat ggtgggatga gtcaggtcga ggcggcccgg gtgttcgcgg tggccccgca 31020
gtcggtgtcc agatgggtgc aggcgtggcg gaaacgtggc tcgaagggtc tcaccgggcg 31080
tcgccggggt cgcaagcccg gcgagcagaa agcgttgagt gcccgccggc agcgcaagct 31140
gcggtatgcg gtggccgagc acaccccggc cacgttcggg ctgaccggcc tggtgtggac 31200
ccgcaagaca gtggccgagc tgatccgggt gcgccacggc atcgtgttga acctgcgcac 31260
cgtcggcaac tacctgcgtt cctggggatt gtcgccgcag aaaccgatcc gcaaggccta 31320
cgaacaggac cccgagtccg tacgccgatg gctggaggag gactacctgg ccatcgccgc 31380
ccgcgcccgc cgcgagggcg cactgatcct gtggctggac cagaccggga tccgctccga 31440
cgccaccgta gcccgcacct gggcaccggc gggccagaca ccggtggtgg gcaaaacggg 31500
caaacgattc agcgtgaacg cgatgtgcgc gatcgggaac aaaggcgagc tgtacttcac 31560
cgtctacacc ggctcgttca acggcaaggt gttcctgtcg ttcctggacc ggctgacccg 31620
ccatctggac cgcaaggtcc acctgatcgt tgacggacac cccgtccacc gccgcaagac 31680
catccagcaa tggatcacca agcacgctga ggcgatcgcg atgcacttcc tgccgggata 31740
cagccccgaa ctcaaccccg acgagctact caatgccgac ctcaaacgca ccgtttccac 31800
cagcacagcc cccaaaaccc gcgccgagtt gaaacaagcg gtccgctcct tcctccaccg 31860
gctccagaag ctgcccgacc gagttcgctc ctacttcggc aaacccgaag ttcgctacgc 31920
cgcctaacat cacacatttg ccacccggat caataacatc ccttttccag gtcgtagtgg 31980
accgaccccg tcaggcaagt gcgagttcct tcggccctag gcacacgtcc aactctgcca 32040
gcgtctcgtc gttcacgtcg accgggcgca gcgtgtgcgg cagcgagcgg acggcgtcag 32100
cggcgatgaa cgcgtcctgg gtagtggtcg gcgatccgtc gcatggccga accggtgccc 32160
gcaggcccgg gcaaccattc gacgcctcac cataaaccgt ctggtgacgc tcaaaggggc 32220
gacgcgacag aacccagatc gcaccgccgc aacaccgcgg tccgtgccag cagccgcatg 32280
ttcgcccggg accagccgta tccaggccgt gcagcgtcgg gtggtcagga actctgaccg 32340
cggagtactc ctcccgtgtt gtcaacggaa gagctgtcgc ctggtcagcg cggatcgttc 32400
acttagatgg cggtttctgt taccacaagt gaagatttgc aggcttcgag gatgcgcgat 32460
gccggcagcg gatcacactt cttgacctga ccgcctcgag atcgtttctc tattctcagg 32520
ccgtgcggct gagccgctga ccgggcatgt agatcgaaag gcgggctatg cacgcttact 32580
actgtgcgaa gtgcaagaac gaacaaacag atcctgcgga tgcgctgaag ttcgcggcct 32640
cgatcggctt ggaattgtgg ctgccgaagg atgaggtgac gttcgacttt ccccacggcg 32700
cgcagcagtg ccagacggcg atcgagaaag cggaatgtgt gatctgccag cctccgatag 32760
gcaacgactg ctcctgggaa ctcggatacg ccatcgggat cggcaaacca gtctacgtca 32820
tcggaacgct ggccgagcag gactggatga ccaagctcgg cgtcactcac gtggacccgg 32880
cgtcgttggc tgccgagaaa gaatgaggtc ctgccgtgac tcagcacagt cggcggccca 32940
ccggcaaagc agtggggcgc cgcagagtag cggcgatcga ctgcgggacg aactcgatcc 33000
ggatgctggt tgccgacctc ggtgccgacg ggcatttgac ggaagtcacg aaacggttcg 33060
acatcgtccg tttgggtcag agagtcgacg aacacggctc gatctcccgc gaatccttcg 33120
aacgagcgcg cgccgtgttg gccgaatacg caggaaccat caccacggcc agtgtggagc 33180
gcgtgcgcat gtgcaccaca tggatctccc gacgagcgtc caacaacgac gaattccgca 33240
cactggtcca agagacactc ggctgcgcac cggaagacat caccagcgac gaggaagcac 33300
ggctcgcttt cgccggggct accagcggtc ttccgcaggc gagctacctg gtcgccgaca 33360
tcggtggcgg gtccacacag ctcgcgctgg gaacagcagg tttcgtggac agatccgtct 33420
ccgtcggcct gggctgcgta cgcctgaccg aacgccacct gcggtcggat cctccagcct 33480
cgggcgaact cgcggcggta caggacgaga tcacggcact ggccgaccag gcactggccg 33540
aacttcccga tttgtcgggc actcggcttc tcggtgtctc cgaagcagtc ggcacagttg 33600
ccgccatcgc gcttcctgga cgaaggctcc atcacgcacg aatgacctac gaccaggtcg 33660
attacgtgac agagcgggtg ctgggaatgt ccttcgccca gcgacaagcg ctgcccggaa 33720
tccacacagg tatggccgat gtactacccg tcagcgcgct cagcgtgcgt acggtgatgc 33780
aatgtgcccg ggccaaggag ctgatcatca gcgagcacga catcctgcat ggaatcgctt 33840
actccctgag ataagcggga tcagatccca atgccgaaca gcatcacggc gagcgggcaa 33900
tcgaatccac cgacgcaccg agtcccgaac accctgccag cccggcaggg tgttcggcgg 33960
cgggtgtcag gtctgccgcc acgggcttcc ggcgcctctg atggacgcct agccgcaaac 34020
ctttgcgtag tcggtttcct cgaccgcttc catcctcata caccacgaca ggcgcgaaaa 34080
cgtatgccgc gaaggtctcg catgcctcgc gtatgtcggc ctcggcgtgg agttaggcaa 34140
acgagttgcc aaggcagagg atcacgtcag aggtgacacc atgcgaaggt gcccagacat 34200
ggtgccgaag atgccgcgct actcgcgagc atcttcggca ccttcgcctt ccagcaccca 34260
ctcccacgcg ccgctggcga aaccggcttt tcatcgaacc agcacgaaca ccagcaggct 34320
cagaccacag cccagcacga acacggcata gaggatgtcg acaccgtgcg ggggatgctc 34380
ggctatgccg gccacgatcg ccgccgccgg caaccaggcg agcagcagca cacatgtcag 34440
cctgttgcgg cggacccgca tcccggagcg cgacgaccac ggtcgcggtg accacccacc 34500
ggtcagccag ggcggcagcc gccggacgaa caccaggacc agcgtgcagg cgaagcccag 34560
agccagaccg atcacggtcg tcgacgtcca gccaccagcg aggtggatcg cggcgagtac 34620
agcgaagacc ccaaggaacc accagcgcgg acgccaggca acgtcctcgg ccgccacctc 34680
ggccggctcg ggaacaggcg acttttcggc cgctgattca gctccgtcgg aaaccaccac 34740
accacccttg cccaaacaga cctgccgata gcgtatcgag cgcgatctga gcactctcgc 34800
gccgaagggc accacagcag ggagtccccg cgccccactg cgctggctgg ccggcgactt 34860
cgggttgcgt gatctcgtcg ccgacagcgg aaacacagca gccatcgccg catatgaaca 34920
ggccactgct caggcagcag aggcaaaact agatggcaac gctgcgcgac gcagcgacgt 34980
catgtcccgg gacaccgtca caagcagcga cacgctgtcg agtgcgggat caaccgcctc 35040
aaacggaacc gagccgtggc caccagatac gacaagcttg ccgtccgcta cgagccaccg 35100
tcaccatcgc agggatcaac gagtggctcc catgactttc gaaacaggcc caagcgcccg 35160
ccgcctggtc caagccccgg cgcgcgacag ccacacccga actcgcgttg tcgcggttgg 35220
gaatacccac aaccggctct ccgcgcgcgg ggcgccccag caacggcgcc ccgacacggt 35280
gctgcgtgga acggacgaga gtgcgctccg acgccagccc cgagcacctg tgcggttcga 35340
gcccgcggac gaactgcggc tctccgtcct gtcctcgctg atccccgccg ccgctgggcg 35400
caggtcttcc cagacacgcc gagcacactg atggcctggc accgcacgct tgtggcccgc 35460
cggtgggact actccgatcg gcgccgacct ggacgaccgc ccacaaggcc ggccatcaag 35520
aagctcgtgc tacgcctcgc ccgcgaaaac agtcagtggg gacaccgccg gatccggggc 35580
gaactggccc ggctcggaca cccgatcgcc gcctccaccg tctgggaaat cctgcacgta 35640
ggcggcatcg atccggcccc gcgccacagc ggcccgacct ggcgtgagtt cctgtccgca 35700
caagccagcc gtctgatcgc ctgcgacgtc ctgagcatcg acaccaccgg cctacaacgc 35760
ccatactccc tggtcttcct agaacaccga acgcggcgcc tgcacatcac cggcgtcacc 35820
gcacacccga cggcgcctgg gttactcagc aagcgcgcga cgtcgccacc gacctcggca 35880
cgcgtatgga ctcgctgcgt ttccgcaccg gagaccgcaa acagcaagta caccgacgat 35940
atcgagatca tcaagacacc ggcgcgggcg ccacgagcga atgtgcactg cgagagagcg 36000
atcggcagcc tccatcgaga agtcctcgac cactcccctc atcatcggta agacccatgc 36060
acgccgcgtt ctcaccgagt accaagagca ctacaacaaa caccgccccc accgggcccg 36120
caactagaac tagtgtcctg cgccggagat tcgttgacaa tatgatcgga gtgactccaa 36180
gatctcgtca gcgctcttcg tccagatgaa cggccgggga tcgttgttcc actcgtcgat 36240
ccagttgcgg atatcggcct cgagtgcctg gacgctggtg tgcacgccgc gttgcaggag 36300
tttggtggtc aattcgccga accagcgttc gacctggttg atccaggacg atccggcggt 36360
gcgatagcgc gccgatcggg tccttcgcgg ggaactccgg ggattccgtg ggccaccacg 36420
ccacgaccgg cgcgtccggc agcagcagcg gcaccaccga gccggcgccc tcgtcggcca 36480
gcggcccgta cagccgcagc acgatgacct ccgaggcccc agcgtcgccg ccgatgcg 36538
<210>3
<211>4344
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>3
Met Ser Glu Ala Gly Asn Leu Ile Ala Val Val Gly Phe Ser Cys Arg
1 5 10 15
Leu Pro Gln Ala Pro Asp Pro Ala Ser Phe Trp Arg Leu Leu Arg Thr
20 25 30
Gly Thr Asp Ala Ile Thr Thr Val Pro Glu Gly Arg Trp Gly Asp Pro
35 40 45
Leu Pro Gly Arg Asp Ala Pro Lys Gly Pro Glu Trp Gly Gly Phe Leu
50 55 60
Ala Asp Val Asp Cys Phe Asp Pro Glu Phe Phe Gly Ile Ser Pro Arg
65 70 75 80
Glu Ala Ala Ala Met Asp Pro Gln Gln Arg Leu Ala Leu Glu Leu Ala
85 90 95
Trp Glu Ala Leu Glu Asp Ala Gly Ile Pro Ala Gly Glu Leu Arg Gly
100 105 110
Thr Ala Ala Gly Val Phe Met Gly Ala Ile Ser Asp Asp Tyr Ala Ala
115 120 125
Leu Leu Arg Lys Ser Pro Pro Glu Val Ala Ala Gln Tyr Arg Leu Thr
130 135 140
Gly Thr His Arg Ser Leu Ile Ala Asn Arg Val Ser Tyr Val Leu Gly
145 150 155 160
Leu Arg Gly Pro Ser Leu Thr Val Asp Ser Gly Gln Ser Ser Ser Leu
165 170 175
Val Gly Val His Leu Ala Ser Glu Ser Leu Arg Arg Gly Glu Cys Ala
180 185 190
Ile Ala Leu Ala Gly Gly Val Asn Leu Asn Leu Ala Ala Glu Ser Asn
195 200 205
Arg Ala Leu Met Asp Phe Gly Ala Leu Ser Pro Asp Gly Arg Cys Phe
210 215 220
Thr Phe Asp Ala Arg Ala Asn Gly Tyr Val Arg Gly Glu Gly Gly Gly
225 230 235 240
Leu Val Val Leu Lys Lys Ala Asp Gln Ala Arg Ala Asp Gly Asp Arg
245 250 255
Ile Tyr Cys Leu Ile Arg Gly Ser Ala Val Asn Asn Asp Gly Gly Gly
260 265 270
Ala Gly Leu Thr Ala Pro Ala Ala Asp Ala Gln Ala Glu Leu Leu Arg
275 280 285
Gln Ala Tyr Arg Asn Ala Gly Val Asp Pro Ala Ala Val Gln Tyr Val
290 295 300
Glu Leu His Gly Ser Ala Thr Arg Val Gly Asp Pro Val Glu Ala Ala
305 310 315 320
Ala Leu Gly Ser Val Leu Gly Val Ala Arg Arg Pro Gly Asp Lys Leu
325 330 335
Arg Val Gly Ser Ala Lys Thr Asn Val Gly His Leu Glu Ala Ala Ala
340 345 350
Gly Val Thr Gly Leu Leu Lys Thr Ala Leu Ser Ile Trp His Arg Glu
355 360 365
Leu Pro Pro Ser Leu His Phe Thr Ala Pro Asn Pro Glu Ile Pro Leu
370 375 380
Asp Glu Leu Asn Leu Arg Val Gln Arg Asp Leu Arg Pro Trp Pro Glu
385 390 395 400
Ser Glu Gly Pro Leu Leu Ala Gly Val Ser Ala Phe Gly Met Gly Gly
405 410 415
Thr Asn Cys His Leu Val Leu Ser Asp Ser Ser Gln Val Glu Arg Arg
420 425 430
Arg Ser Gly Pro Ala Glu Ala Thr Met Pro Trp Val Leu Ser Ala Arg
435 440 445
Thr Pro Val Ala Leu Arg Ala Gln Ala Ala Arg Leu His Thr His Leu
450 455 460
Asn Thr Ala Gly Gln Ser Pro Leu Asp Val Gly Tyr Ser Leu Ala Thr
465 470 475 480
Thr Arg Ser Ala Leu Pro His Arg Ala Ala Leu Val Ala Asp Asp Val
485 490 495
Pro Lys Leu Leu Ala Gly Leu Lys Ala Leu Ala Asp Gly Asp Asp Ala
500 505 510
Pro Thr Leu Cys Thr Gly Thr Thr Ser Gly Glu Arg Ala Thr Val Phe
515 520 525
Val Phe Pro Gly Gln Gly Ser Gln Trp Ile Gly Met Gly Arg Gln Leu
530 535 540
Leu Gln Thr Ser Glu Val Phe Ala Ala Ser Met Ala Asp Cys Ala Asp
545 550 555 560
Ala Leu Ala Pro His Leu Asp Trp Ser Leu Leu Asp Val Leu Arg Asn
565 570 575
Ala Ala Gly Ala Ser Gln Leu Asp Arg Asp Asp Val Val Gln Pro Ala
580 585 590
Leu Phe Ala Val Met Val Ser Leu Ala Glu Leu Trp Arg Ser Trp Gly
595 600 605
Val Arg Pro Glu Ala Val Val Gly His Ser Gln Gly Glu Ile Ala Ala
610 615 620
Ala Cys Val Ala Gly Ala Leu Ser Val Arg Asp Ala Ala Arg Val Val
625 630 635 640
Ala Val Arg Ser Arg Leu Leu Ala Ala Leu Ala Gly Arg Gly Ala Met
645 650 655
Ala Ser Leu Gln His Pro Val Glu Glu Val Arg Gln Ile Leu Leu Pro
660 665 670
Trp Arg Asp Arg Ile Gly Val Ala Gly Val Asn Gly Pro Ser Ser Thr
675 680 685
Leu Val Ser Gly Asp Arg Glu Ala Met Ala Glu Leu Leu Ala Glu Cys
690 695 700
Ala Arg Arg Glu Leu Arg Met Arg Arg Ile Pro Val Glu Tyr Ala Ser
705 710 715 720
His Ser Pro His Ile Glu Asp Val Arg Asp Glu Leu Leu Ala Leu Leu
725 730 735
Ala Ser Ile Glu Pro Arg Thr Gly Asn Ile Pro Val Tyr Ser Thr Thr
740 745 750
Thr Gly Glu Leu Leu Asp Arg Pro Met Asp Ala Asp Tyr Trp Tyr Arg
755 760 765
Asn Leu Arg Gln Pro Val Leu Phe Glu Ala Ala Val Glu Ala Leu Leu
770 775 780
Lys Arg Gly His Asn Ala Phe Ile Glu Ile Ser Pro His Pro Val Leu
785 790 795 800
Thr Ala Ser Ile Gln Glu Thr Ala Ala Arg Ala Gly Arg Glu Val Val
805 810 815
Ala Leu Gly Thr Leu Arg Arg Gly Glu Gly Gly Leu Arg Gln Ala Leu
820 825 830
Thr Ser Leu Ala Lys Ala His Val His Gly Val Ala Ala Asn Trp His
835 840 845
Ala Val Phe Ala Gly Thr Gly Ala Gln Arg Val Asp Leu Pro Thr Tyr
850 855 860
Ala Phe Gln Arg Gln Arg Tyr Trp Leu Asp Thr Lys Pro Ser Asp Leu
865 870 875 880
Ala Met Pro Glu Gly Asp Val Ser Thr Ala Leu Arg Glu Lys Leu Arg
885 890 895
Ser Ser Pro Gly Ala Asp Val Asp Ser Ala Thr Leu Thr Ile Ile Arg
900 905 910
Ala Gln Ala Ala Val Val Leu Gly His Ser Asp Pro Lys Glu Met Asp
915 920 925
Ser Asp Arg Thr Phe Lys Asp Leu Gly Phe Asp Ser Ser Thr Val Val
930 935 940
Glu Leu Cys Asp Arg Leu Asn Ala Ala Thr Gly Leu Arg Leu Ala Pro
945 950 955 960
Ser Val Val Phe Asp Cys Pro Thr Pro Tyr Lys Leu Ala Arg Gln Val
965 970 975
Arg Thr Leu Leu Leu Asp Glu Pro Val Pro Thr Thr Ser Pro Arg Thr
980 985 990
Glu Thr Glu Ala Asp Glu Pro Ile Ala Val Ile Gly Met Gly Cys Arg
995 1000 1005
Phe Pro Gly Gly Val Ser Ser Pro Glu Glu Leu Trp Gln Leu Val
1010 1015 1020
Ala Ala Gly Arg Asp Val Val Ser Glu Phe Pro Ala Asp Arg Gly
1025 1030 1035
Trp Asp Pro Glu Arg Ala Gly Thr Ser His Val Arg Ala Gly Gly
1040 1045 1050
Phe Leu His Gly Ala Thr Asp Phe Asp Pro Gly Phe Phe Gly Ile
1055 1060 1065
Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu
1070 1075 1080
Leu Glu Ile Ala Trp Glu Ala Ile Glu Arg Gly Gly Ile Asn Pro
1085 1090 1095
Gln Thr Leu His Gly Ser Gln Thr Gly Val Phe Val Gly Ala Thr
1100 1105 1110
Ser Leu Asp Tyr Gly Pro Arg Leu His Glu Ala Ser Asp Glu Ala
1115 1120 1125
Ala Gly Tyr Val Leu Thr Gly Ser Thr Thr Ser Val Ala Ser Gly
1130 1135 1140
Arg Val Ala Tyr Ser Phe Gly Leu Glu Gly Pro Ala Val Thr Val
1145 1150 1155
Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Cys
1160 1165 1170
Gln Ser Leu Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly
1175 1180 1185
Val Thr Val Met Ala Thr Pro Gly Met Phe Val Glu Phe Ser Arg
1190 1195 1200
Gln Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Glu
1205 1210 1215
Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly Ala Gly Leu Val Leu
1220 1225 1230
Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Asp Val Leu
1235 1240 1245
Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn
1250 1255 1260
GlV Leu Thr Ala Pro Asn Gly Pro Ser Gln Arg Arg Val Ile Thr
1265 1270 1275
Gln Ala Leu Ala Asn Ala Lys Leu Ser Val Ser Asp Val Asp Ala
1280 1285 1290
Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu
1295 1300 1305
Ala Gln Ala Leu Ile Ala Thr Tyr Gly Gln Gly Arg Gly Pro Glu
1310 1315 1320
Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr
1325 1330 1335
Gln Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Val Met Ala
1340 1345 1350
Met Arg Tyr Gly Glu Leu Pro Ala Thr Leu His Val Asp Glu Pro
1355 1360 1365
Ser Ser Gln Val Asp Trp Ser Ala Gly Met Val Gln Val Leu Thr
1370 1375 1380
Glu His Val Pro Trp Pro Asp Asn Ser Arg Pro Arg Arg Val Gly
1385 1390 1395
Val Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu
1400 1405 1410
Glu Gln Ser Pro Thr Ala Ser Ser Glu Phe Val Glu His Ser Gly
1415 1420 1425
Pro Asp Ser Glu Ser Ala Val Asp Val Pro Val Val Pro Trp Val
1430 1435 1440
Val Ser Gly Lys Thr Pro Glu Ala Leu Ser Ala Gln Ala Asp Asn
1445 1450 1455
Leu Val Ser Tyr Leu Asp Asp Arg Pro Asn Val Ser Ala Leu Asn
1460 1465 1470
Val Ala Tyr Ser Leu Ala Ser Glu Arg Ala Ala Leu Asp Glu Arg
1475 1480 1485
Ala Val Val Leu Gly Ala Asp Arg Glu Ala Leu Leu Ser Gly Leu
1490 1495 1500
Lys Ala Leu Ala Ala Gly His Glu Asp Pro Gly Val Ala Ser Gly
1505 1510 1515
Ser Leu Val Ser Gly Gly Val Gly Phe Val Phe Ser Gly Gln Gly
1520 1525 1530
Gly Gln Trp Ser Gly Met Gly Arg Gly Leu Tyr Arg Ala Phe Pro
1535 1540 1545
Val Phe Ala Ala Ala Phe Asp Glu Ala Cys Ala Glu Leu Asp Ala
1550 1555 1560
His Leu Gly Gln Glu Val Gly Val Arg Asp Val Ala Phe Gly Ser
1565 1570 1575
Asp Ala Gln Leu Leu Glu Arg Thr Leu Trp Ala Gln Ser Gly Leu
1580 1585 1590
Phe Ala Leu Gln Val Gly Leu Leu Arg Leu Leu Gly Ser Trp Gly
1595 1600 1605
Val Arg Pro Gly Ala Val Leu Gly His Ser Val Gly Glu Leu Ala
1610 1615 1620
Ala Ala His Ala Ala Gly Val Leu Ser Leu Pro Asp Ala Ala Arg
1625 1630 1635
Leu Val Ala Gly Arg Ala Arg Leu Met Gln Ala Met Pro Asp Gly
1640 1645 1650
Gly Gly Met Leu Ala Val Ala Thr Ser Glu Thr Gln Val Glu Pro
1655 1660 1665
Met Leu Asp Gly Val Arg Asp Arg Ile Gly Ile Ala Ala Ile Asn
1670 1675 1680
Ala Pro Glu Ser Val Val Leu Ser Gly Asp Arg Glu Leu Leu Ala
1685 1690 1195
Glu Val Ala Asp Gln Leu Asn Asp Gln Gly Cys Arg Thr Arg Trp
1700 1705 1710
Leu Gln Val Ser His Ala Phe His Ser Tyr Arg Met Glu Pro Met
1715 1720 1725
Leu Asp Glu Phe Ala Gln Ile Ala Gly Ser Val Asp Phe Arg Arg
1730 1735 1740
Cys Glu Leu Pro Ile Ile Ser Thr Leu Thr Gly Asn Leu Asp Asp
1745 1750 1755
Val Gly Val Met Ala Thr Pro Glu Tyr Trp Val Arg Gln Val Arg
1760 1765 1770
Glu Pro Val Arg Phe Ala Asp Gly Val Gln Ser Leu Val Glu Gln
1775 1780 1785
Asp Val Ala Thr Val Val Glu Leu Gly Pro Asp Ala Ile Leu Ser
1790 1795 1800
Ala Leu Ile Pro Asp Cys His Ser Trp Gly Asp Gln Thr Val Pro
1805 1810 1815
Ile Pro Leu Leu Arg Lys Asp Arg Ala Glu Pro Glu Thr Val Val
1820 1825 1830
Ala Ala Val Ala Arg Ala His Thr Arg Gly Val Gln Val Asp Trp
1835 1840 1845
Ser Ala Phe Phe Ala Gly Thr Gly Ala Gly Arg Val Glu Leu Pro
1850 1855 1860
Thr Tyr Ala Phe Gln Arg Gln Arg Tyr Trp Leu Glu Ser Ser Val
1865 1870 1875
Ser Gly Asp Val Thr Gly Ile Gly Leu Ala Gly Ala Glu His Pro
1880 1885 1890
Leu Leu Gly Ala Val Val Val Leu Ala Asp Gly Asp Gly Met Val
1895 1900 1905
Leu Thr Gly Arg Leu Ser Val Gly Thr His Arg Trp Leu Ala Glu
1910 1915 1920
His Arg Val Leu Gly Glu Val Val Val Pro Gly Thr Ala Ile Leu
1925 1930 1935
Glu Met Val Leu His Ala Gly Ala Arg Val Gly Cys Gly Arg Val
1940 1945 1950
Glu Glu Leu Thr Leu Glu Ala Pro Leu Val Val Pro Glu Arg Asp
1955 1960 1965
Ala Ile Glu Ile Gln Leu Leu Val Asn Ala Pro Asp Asp Lys Gly
1970 1975 1980
Arg Arg Ser Val Ser Leu His Ser Arg Pro Ala Gly Gly Ser Gly
1985 1990 1995
Gly Gly Gly Trp Thr Arg His Ala Thr Gly Glu Leu Val Val Ala
2000 2005 2010
Gly Thr Gly Gly Gly Ala Val Thr Gly Trp Ser Thr Glu Gly Ala
2015 2020 2025
Glu Pro Val Ala Leu Gly Glu Phe Tyr Val Val Gln Ala Gly Asn
2030 2035 2040
Gly Phe Glu Tyr Gly Pro Leu Phe Gln Gly Leu Arg Ala Ala Trp
2045 2050 2055
Arg Arg Gly Gly Glu Val Leu Ala Glu Val Ala Leu Pro Ala Ala
2060 2065 2070
Ala Gly Ala Met Ala Gly Phe Leu Ile Asn Pro Ala Leu Leu Asp
2075 2080 2085
Ala Ala Leu Gln Ala Ser Ala Leu Gly Asp Arg Pro Ala Glu Gly
2090 2095 2100
Gly Ala Trp Leu Pro Phe Ser Phe Thr Gly Val Glu Leu Ser Gly
2105 2110 2115
Gln Gly Gly Thr Ile Ser Arg Ala Arg Val Glu Ser Thr Arg Pro
2120 2125 2130
Asp Ala Val Ser Val Ala Val Met Asp Glu Gly Gly Arg Leu Leu
2135 2140 2145
Ala Ser Ile Asp Ser Leu Arg Leu Arg Pro Val Ser Ser Val Arg
2150 2155 2160
Leu Ala Asn Arg Asp Val Val Gly Asp Ala Leu Phe Glu Val Thr
2165 2170 2175
Trp Glu Pro Val Ala Thr Arg Ser Thr Val Ser Gly Arg Trp Ala
2180 2185 2190
Leu Leu Gly Asp Ala Val Gly Gly Met Ala Gly Leu Ile Gly Leu
2195 2200 2205
Ala Pro Gly Ser Val Asp Arg Cys Ala Gly Leu Ala Glu Leu Ala
2210 2215 2220
Gly Asn Leu Asp Ser Gly Ala Leu Val Ala Asp Val Val Val Tyr
2225 2230 2235
Cys Ala Gly Glu Gln Ala Asp Pro Asp Ala Gly Val Ala Ala Leu
2240 2245 2250
Ala Glu Thr Arg Glu Met Leu Ala Leu Val Gln Ser Trp Leu Ala
2255 2260 2265
Glu Glu Arg Leu Ala Gly Ser Arg Leu Val Val Val Thr Cys Gly
2270 2275 2280
Ala Val Thr Thr Ala Ala Gly Asp Gly Ala Ser Lys Leu Ala His
2285 2290 2295
Ala Pro Leu Trp Gly Leu Leu Arg Ser Ala Gln Ser Glu Asn Pro
2300 2305 2310
Gly Arg Phe Val Leu Val Asp Val Asp Gly Thr Ala Glu Ser Trp
2315 2320 2325
Arg Ala Leu Pro Ser Ala Val Gly Ser Met Gln Pro Gln Leu Ala
2330 2335 2340
Val Arg Lys Gly Val Val Thr Val Pro Arg Val Ala Ser Val Pro
2345 2350 2355
Gly Pro Val Glu Val Pro Ala Val Val Ala Gly Pro Asp Arg Thr
2360 2365 2370
Val Leu Ile Ser Gly Gly Thr Gly Leu Leu Gly Gly Val Val Ala
2375 2380 2385
Arg His Leu Val Ala Glu Arg Gly Val Arg Arg Val Val Leu Thr
2390 2395 2400
Gly Arg Arg Gly Trp Asp Ala Pro Gly Ile Thr Glu Leu Val Gly
2405 2410 2415
Glu Leu Glu Gly Phe Gly Ala Val Val Asp Val Val Ala Cys Asp
2420 2425 2430
Val Ala Asp Arg Ala Gly Leu Glu Gly Leu Leu Ala Ala Val Pro
2435 2440 2445
Ala Glu Phe Pro Leu Cys Gly Val Val His Ala Ala Gly Val Leu
2450 2455 2460
Ala Asp Gly Val Ile Glu Ser Leu Thr Pro Glu Asp Val Gly Ala
2465 2470 2475
Val Phe Gly Pro Lys Ala Ala Gly Ala Trp Asn Leu His Glu Leu
2480 2485 2490
Thr Arg Asp Met Asp Leu Ser Phe Phe Ala Leu Phe Ser Ser Leu
2495 2500 2505
Ser Gly Val Thr Gly Ala Ala Gly Gln Gly Asn Tyr Ala Ala Ala
2510 2515 2520
Asn Thr Phe Leu Asp Ala Leu Ala His Tyr Arg Arg Ala Gln Gly
2525 2530 2535
Leu Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Glu Gln Ser Ser
2540 2545 2550
Gly Met Thr Gly Arg Leu Ser Asp Val Asp Arg Ser Arg Ile Ala
2555 2560 2565
Arg Ser Ser Pro Pro Leu Ser Thr Lys Asp Gly Leu Arg Leu Phe
2570 2575 2580
Asp Ala Gly Leu Ala Leu Asp Arg Ala Ala Val Val Pro Ala Arg
2585 2590 2595
Leu Asp Arg Ala Phe Leu Ala Glu Gln Ala Arg Ser Gly Thr Leu
2600 2605 2610
Pro Ala Met Leu Thr Ala Leu Val Pro Thr Ile Thr Ser Ile Arg
2615 2620 2625
Arg Ser Ser Gly Thr Asp Leu Ala Asp Glu Asp Ala Leu Leu Gly
2630 2635 2640
Val Val Arg Glu His Ala Ala Arg Val Leu Gly Tyr Ser Gly Ala
2645 2650 2655
Ala Glu Val Gly Val Glu Arg Ala Phe Arg Asp Leu Gly Phe Asp
2660 2665 2670
Ser Leu Ser Gly Val Glu Leu Arg Asn Arg Leu Ala Gly Val Leu
2675 2680 2685
Gly Ala Arg Leu Pro Ala Thr Ala Val Phe Asp Tyr Pro Thr Pro
2690 2695 2700
Arg Ala Leu Ala Arg Phe Leu His Gln Glu Leu Ala Gly Glu Val
2705 2710 2715
Gly Thr Thr Pro Ala Pro Val Thr Thr Thr Thr Ala Ser Val Glu
2720 2725 2730
Asp Asp Leu Val Ala Ile Val Gly Met Gly Cys Arg Tyr Pro Gly
2735 2740 2745
Gly Val Ser Ser Pro Glu Glu Leu Trp Arg Leu Val Ala Gly Gly
2750 2755 2760
Val Asp Ala Val Ala Asp Phe Pro Asp Asp Arg Gly Trp Asp Leu
2765 2770 2775
Ala Gly Leu Phe Asp Pro Asp Pro Asp Arg Phe Gly Thr Ser Tyr
2780 2785 2790
Val Arg Glu Gly Gly Phe Leu Arg Asp Ala Ala Glu Phe Asp Ala
2795 2800 2805
Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala Met Asp Pro
2810 2815 2820
Gln Gln Arg Leu Leu Leu Glu Leu Ser Trp Glu Ala Val Glu Arg
2825 2830 2835
Ala Gly Ile Asp Pro Gly Ser Leu Arg Gly Ser Arg Thr Gly Val
2840 2845 2850
Phe Ala Gly Leu Met Tyr His Asp Tyr Ala Gly Arg Phe Ala Ala
2855 2860 2865
Gly Val Pro Glu Gly Phe Glu Gly Tyr Leu Gly Asn Gly Ser Ala
2870 2875 2880
Gly Ser Val Ala Ser Gly Arg Val Ala Tyr Ser Phe Gly Phe Glu
2885 2890 2895
Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val
2900 2905 2910
Ala Leu His Leu Ala Gly Gln Ser Leu Arg Ser Gly Glu Cys Asp
2915 2920 2925
Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Ala Thr
2930 2935 2940
Phe Val Glu Phe Ser Arg Gln Arg Gly Leu Ala Pro Asp Gly Arg
2945 2950 2955
Cys Lys Ser Phe Ala Glu Ala Ala Asp Gly Thr Gly Trp Gly Glu
2960 2965 2970
Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg
2975 2980 2985
Asn Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn
2990 2995 3000
Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser
3005 3010 3015
Gln Gln Arg Val Ile Thr Gln Ala Leu Thr Ser Ala Gly Leu Ser
3020 3025 3030
Val Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg
3035 3040 3045
Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Ile Ala Thr Tyr Gly
3050 3055 3060
Arg Asp Arg Asp Pro Asp Arg Pro Leu Trp Leu Gly Ser Met Lys
3065 3070 3075
Ser Asn Ile Gly His Thr Gln Ala Ala Ala Gly Val Ala Gly Val
3080 3085 3090
Ile Lys Met Val Met Ala Met Arg His Gly Glu Lsa Pro Arg Thr
3095 3100 3105
Leu His Val Gly Glu Pro Thr Ser Glu Val Asp Trp Ser Ala Gly
3110 3115 3120
Ser Val Gln Leu Leu Thr Glu Asn Thr Pro Trp Pro Asp Ser Gly
3125 3130 3135
His Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ile Ser Gly Thr
3140 3145 3150
Asn Ala His Val Ile Leu Glu Gln Ser Pro Thr Ala Ser Ser Glu
3155 3160 3165
Phe Val Glu His Ser Gly Pro Asp Ser Glu Ser Ala Val Asn Val
3170 3175 3180
Pro Val Val Pro Trp Val Val Ser Gly Lys Thr Pro Glu Ala Leu
3185 3190 3195
Ser Ala Gln Ala Asp Thr Leu Val Ser Tyr Leu Asp Asp Arg Ser
3200 3205 3210
Asp Val Ser Ser Arg Asp Val Gly Tyr Ser Leu Ala Met Thr Arg
3215 3220 3225
Ser Ala Leu Asp Glu Arg Ala Val Val Leu Gly Ser Asp Arg Glu
3230 3235 3240
Thr Leu Leu Ser Gly Leu Lys Ala Leu Ala Ala Gly His Glu Ala
3245 3250 3255
Thr Gly Val Val Thr Gly Ser Val Gly Ser Gly Gly Arg Pro Gly
3260 3265 3270
Phe Val Phe Ala Gly Gln Gly Gly Gln Trp Leu Gly Met Gly Arg
3275 3280 3285
Gly Leu Tyr Arg Ala Phe Pro Val Phe Ala Asp Ala Phe Asp Glu
3290 3295 3300
Ala Cys Ala Gly Leu Asp Ala His Leu Gly Gln Lys Val Gly Val
3305 3310 3315
Arg Asp Val Val Phe Gly Ser Asp Ala Gln Leu Leu Asp Arg Thr
3320 3325 3330
Leu Trp Ala Gln Ser Gly Leu Phe Ala Leu Gln Val Gly Leu Leu
3335 3340 3345
Lys Leu Leu Gly Ser Trp Gly Val Arg Pro Val Val Val Leu Gly
3350 3355 3360
His Ser Val Gly Glu Leu Ala Ala Ala Phe Ala Ala Gly Val Leu
3365 3370 3375
Ser Met Ala Glu Ala Ala Arg Leu Val Ala Gly Arg Ala Arg Leu
3380 3385 3390
Met Gln Ala Leu Pro Ser Gly Gly Ala Met Leu Ala Val Ala Thr
3395 3400 3405
Ser Glu Thr Gln Val Glu Pro Leu Leu Asp Gly Val Arg Asp Arg
34l0 3415 3420
Ile Asp Ile Ala Ala Ile Asn Ala Pro Glu Ser Ile Val Lau Ser
3425 3430 3435
Gly Asp Arg Glu Leu Leu Thr Glu Ala Ala Asp Gln Leu His Asp
3440 3445 3450
Gln Gly Cys Arg Thr Arg Trp Leu G1n Val Ser His Ala Phe His
3455 3460 3465
Ser Pro Gln Met Asp Pro Met Leu Asp Glu Phe Ala Asp Ile Ala
3470 3475 3480
Arg Thr Val Asp Phe Arg Gly Ser Glu Leu Pro Val Val Ser Thr
3485 3490 3495
Leu Thr Gly Ala Leu Asp Asp Ser Gly Leu Met Ala Thr Pro Glu
3500 3505 3510
Tyr Trp Val Arg Gln Val Arg Glu Pro Val Arg Phe Ala Asp Gly
3515 3520 3525
Val Arg Ala Leu Val Glu His Asp Val Ala Thr Val Val Glu Leu
3530 3535 3540
Gly Pro Asp Gly Ala Leu Ser Ala LeuIle Gln Glu Cys Ala Ala
3545 3550 3555
Glu Phe Asp Gln Ser Arg Arg Val Ala Ala Val Pro Ala Met Arg
3560 3565 3570
Arg Ser Gln Asp Glu Ala Gln Lys Val Met Thr Ala Leu Ala Gln
3575 3580 3585
Val His Val Arg Gly Gly Ala Val Asp Trp Arg Ser Val Phe Ala
3590 3595 3600
Gly Thr Gly Ser Lys Gln Val Glu Leu Pro Thr Tyr Ala Phe Gln
3605 3610 3615
Arg Gln Arg Tyr Trp Leu Asn Ala Val His Glu Ser Ser Ala Gly
3620 3625 3630
Asp Met Gly Arg Arg Ile Glu Thr Glu Phe Trp Ser Ala Val Glu
3635 3640 3645
His Glu Asp Val Thr Ser Leu Ala Asn Ile Leu Gly Ile Val Asp
3650 3655 3660
Asp Gly Ala Ala Val Asp Ser Leu Arg Asn Ala Leu Pro Val Leu
3665 3670 3675
Ala Gly Trp Gln Arg Thr Arg Asn Asp Glu Ser Ile Met Asp Arg
3680 3685 3690
Gln Cys Tyr Arg Ile Gly Trp Arg Gln Val Ala Gly Leu Pro Pro
3695 3700 3705
Arg Gly Thr Val Phe Gly Thr Trp Leu Val Phe Ala Pro His Gly
37l0 37l5 3720
Trp Ser Gly Glu Pro Gln Val Ala Asn Cys Val Ala Ala Leu Arg
3725 3730 3735
Ala Ser Gly Ala Ser Val Val Ler Val Glu Ala Asp Pro Asp pro
3740 3745 3750
Val Val Phe Gly Asp Arg Val Arg Thr Leu Cys Ser Asp Ser Pro
3755 3760 3765
Asp Leu Val Gly Val Leu Ser Met Leu Cys Leu Glu Glu Ser Ala
3770 3775 3780
Ile Pro Gly Phe Ser Ala Val Ser Arg Gly Phe Ala Leu Thr Val
3785 3790 3795
Glu Leu Val Arg Ala Leu Ala Ala Ala Gly Ala Asp Ala Arg Leu
3800 3805 38l0
Trp Leu Leu Thr Cys Gly Gly Val Ser Val Gly Asp Val Pro Val
38l5 3820 3825
Arg Pro Glu Gln Ala Leu Val Trp Gly Leu Gly Arg Val Ala Gly
3830 3835 3840
Leu Glu His Pro Asp Trp Trp Gly Gly Leu Ile Asp Ile Pro Val
3845 3850 3855
Leu Phe Asp Glu Asp Ala Gln Glu Arg Leu Ser Ile Val Leu Ala
3860 3865 3870
Gly Leu Gly Glu Glu Glu Val Ala Ile Arg Ser Asp Gly Val Phe
3875 3880 3885
Ala Arg Arg Leu Val Arg His Gly Val Ser Ala Gly Val Lys Lys
3890 3895 3900
Ala Trp Arg Pro Arg Gly Ser Val Leu Val Thr Gly Gly Thr Gly
3905 39l0 39l5
Gly Leu Gly Ala His Ala Ala Arg Trp Leu Ala Asp Ala Gly Ala
3920 3925 3930
Glu His Val Val Met Val Ser Arg Arg Gly Glu Gln Ala Pro Ser
3935 3940 3945
Ala Glu Lys Leu Arg Thr Glu Leu Glu Asp Leu Gly Thr Arg Val
3950 3955 3960
Ser Ile Leu Ser Cys Vap Val Thr Asp Arg Glu Ala Leu Ala Glu
3965 3970 3975
Val Leu Lys Ala Leu Pro Ala Glu Tyr Pro Leu Thr Ala Val Val
3980 3985 3990
His Thr Ala Gly Val Ile Glu Thr Gly Asp Ala Ala Ser Met Ser
3995 4000 4005
Leu Ala Asp Phe Asp Asp Val Leu Ser Ala Lys Val Ala Gly Ala
40l0 40l5 4020
Ala Asn Leu Asp Ala Leu Leu Ala Asp Val Glu Leu Asp Ala Phe
4025 4030 4035
Val Leu Phe Ser Ser Val Ser Gly Val Trp Gly Ala Gly Gly Gln
4040 4045 4050
Gly Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu
4055 4060 4065
Glu Arg Arg Ser Arg Gly Leu Val Ala Thr Ala Val Ala Trp Gly
4070 4075 4080
Pro Trp Ala Gly Glu Gly Met Ala Ala Gly Glu Thr Gly Asp Gln
4085 4090 4095
Leu Arg Arg Tyr Gly Leu Ser Pro Met Gly Pro Gln Tyr Ala Ile
4100 4105 4110
Ala Gly Ile Arg Arg Ala Val Glu Gln Asp GluIle Ser Leu Val
4115 4120 4125
Val Ala Asp Val Asp Trp Ala Arg Phe Ser Ala Gly Phe Leu Ala
4130 4135 4140
Ala Arg Pro Arg Pro LeuLeu Asn Glu Leu Thr Glu Val Lys Glu
4145 4150 4155
Leu Leu Val Asn Ala Gln Ser Glu Val Gly Val Val Ala Glu Ala
4160 4165 4170
Ser Val Ala Trp Arg Gln Arg Leu Ala Ala Ala Pro Arg Pro Ala
4175 4180 4185
Gln Glu Gln Leu Ile Leu Glu Leu Val Arg Gly Glu Thr Ala Leu
4190 4195 4200
Val Leu Gly His Pro Gly Ala Glu Ala Val Ala Pro Glu Arg Ala
4205 4210 4215
Phe Lys Asp Ser Gly Phe Asp Ser Gln Ala Ala Val Glu Leu Arg
4220 4225 4230
Val Arg Leu Asn Arg Ala Thr Gly Leu Gln Leu Pro Ser Thr Ile
4235 4240 4245
Ile Phe Ser His Pro Thr Pro Ala Glu Leu Ala Ala Glu Leu Arg
4250 4255 4260
Ala Arg Leu Leu Pro Glu Ser Ala Gly Val Asp Ile Ser Glu Glu
4265 4270 4275
Asp Glu Ala Arg Ile Arg Ala Ala Leu Thr Ser Ile Pro Phe Ala
4280 4285 4290
Ala Leu Arg Glu Ala Asp Leu Val Asn Arg Leu Leu Ala Leu Ala
4295 4300 4305
Gly His Pro Val Asp Ser Gly Ser Ser Pro Asp Asp Ala Val Ala
4310 4315 4320
Thr Ser Ile Asp Ala Met Asp Val Ala Asp Leu Val Glu Ala Ala
4325 4330 4335
Leu Gly Glu Arg Glu Ser
4340
<210>4
<211>2149
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>4
Val Thr Thr Ser Tyr Glu Glu Val Val Glu Ala Leu Arg Ala Ser Leu
1 5 10 15
Lys Glu Asn Glu Arg Leu Arg Arg Gly Arg Asp Arg Phe Ala Ala Glu
20 25 30
Lys Gly Asp Pro Ile Ala Ile Val Ala Met Ser Cys Arg Tyr Pro Gly
35 40 45
Gln Val Ser Ser Pro Glu Asp Leu Trp Gln Leu Ala Ala Gly Gly Val
50 55 60
Asp Ala Ile Ser Glu Val Pro Gly Asp Arg Gly Trp Asp Leu Ala Gly
65 70 75 80
Val Phe Asp Pro Asp Ser Asp Arg Pro Gly Thr Ser Tyr Ala Cys Ala
85 90 95
Gly Gly Phe Leu Gln Gly Val Ser Glu Phe Asp Ala Gly Phe Phe Gly
100 105 110
Ile Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu
115 120 125
Leu Glu Val Ala Trp Glu Val Phe Glu Arg Ala Gly Leu Glu Gln Arg
130 135 140
Ser Thr Arg Gly Ser Arg Val Gly Val Phe Val Gly Thr Asn Gly Gln
145 150 155 160
Asp Tyr Ala Ser Trp Leu Arg Thr Pro Pro Ser Glu Val Ala Gly His
165 170 175
Val Leu Thr Gly Gly Ala Ala Ala Ile Leu Ser Gly Arg Val Ala Tyr
180 185 190
Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser
195 200 205
Ser Ser Leu Val Ala Leu His Leu Ala Gly Gln Ala Leu Arg Ala Gly
210 215 220
Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ser Thr Pro
225 230 235 240
Lys Ala Phe Leu Glu Phe Ser Arg Gln Arg Gly Leu Ala Ala Asp Gly
245 250 255
Arg Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu
260 265 270
Gly Ala Gly Leu Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn
275 280 285
Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp
290 295 300
Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Ser Ser Gln Ala Arg
305 310 315 320
Val Ile Thr Gln Ala Leu Ala Ser Ala Gly Leu Ser Val Ser Asp Val
325 330 335
Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile
340 345 350
Glu Ala Gln Ala Leu Ile Ala Thr Tyr Gly Arg Asp Arg Asp Pro Ala
355 360 365
Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln
370 375 380
Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Val Met Ala Met Arg
385 390 395 400
His Gly Gln Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Pro Glu
405 410 415
Val Asp Trp Ser Ala Gly Thr Val Gln Leu Leu Thr Glu Asn Met Leu
420 425 430
Trp Pro Glu Ser Gly Arg Val Arg Arg Ala Gly Val Ser Ser Phe Gly
435 440 445
Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Pro Thr Gly Glu
450 455 460
Thr Arg Gln Ser Ala Gly Pro Asp Ser Gly Ser Val Val Asp Val Pro
465 470 475 480
Val Val Pro Trp Met Val Ser Gly Lys Thr Pro Asp Ala Leu Gly Ala
485 490 495
Gln Ala Asp Thr Leu Met Ser Tyr Lau Asp Asp Arg Val Asp Val Pro
500 505 510
Ser Leu Asp Ile Ala Tyr Ser Leu Ala Met Thr Arg Ser Ala Leu Asp
515 520 525
Glu Arg Ala Val Val Lau Gly Pro Asp Arg Glu Thr Leu Leu Ser Gly
530 535 540
Leu Lys Ala Leu Ser Ala Gly His Glu Ala Ser Gly Val Val Thr Gly
545 550 555 560
Ser Val Gly Thr Gly Gly Arg Ile Gly Phe Val Phe Ser Gly Gln Gly
565 570 575
Gly Gln Trp Leu Gly Met Gly Arg Gly Leu Tyr Arg Ala Phe Pro Val
580 585 590
Phe Ala Ala Ala Phe Asp Glu Ala Cys Ala Glu Leu Glu Ala His Leu
595 600 605
Gly Gln Glu Val Gly Val Arg Asp Val Val Phe Gly Ser Asp Ala Gln
610 615 620
Leu Leu Asn Arg Thr Leu Trp Ala Gln Ser Gly Leu Phe Ala Leu Gln
625 630 635 640
Val Gly Leu Leu Lys Leu Leu Asp Ser Trp Gly Val Arg Pro Ser Ala
645 650 655
Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Ala Phe Ala Ala Gly
660 665 670
Val Leu Ser Leu Ser Asp Ala Ala Arg Leu Val Ala Gly Arg Ala Arg
675 680 685
Leu Met Gln Ala Leu Pro Ser Gly Gly Gly Met Leu Ala Val Ala Ala
690 695 700
Gly Glu Glu Gln Leu Arg Pro Leu Leu Ala Asp His Gly Asp Arg Val
705 710 715 720
Gly Leu Ala Ala Val Asn Val Ala Glu Ser Val Val Leu Ser Gly Asp
725 730 735
Arg Asp Val Leu Asp Asp Ile Ala Gly Arg Leu Asp Gly Gln Gly Val
740 745 750
Arg Thr Arg Trp Leu Arg Val Ser His Ala Phe His Ser Tyr Arg Met
755 760 765
Asp Pro Met Leu Asp Glu Phe Ala Glu Ile Ala Arg Ala Val Asp Tyr
770 775 780
Arg Arg Cys Glu Leu Pro Ile Val Ser Thr Leu Thr Gly Lys Leu Asp
785 790 795 800
Asp Ala Gly Arg Met Ser Gly Pro Asp Tyr Trp Val Arg Gln Val Arg
805 810 815
Glu Pro Val Arg Phe Ala Asp Gly Ala Gln Ala Leu Val Glu His Asp
820 825 830
Val Ala Thr Ile Val Glu Ile Gly Pro Asp Gly Ala Leu Ser Ala Leu
835 840 845
Ile Gln Glu Cys Val Ala Ala Ser Asp GAn Ser Arg Arg Val Ala Ala
850 855 860
Val Pro Ala Met Arg Arg Asn Arg Asp Glu Ala Gln Asn Leu Thr Thr
865 870 875 880
Ala Leu Ala Gln Val His Val Arg Gly Gly Ala Val Asp Trp Arg Ser
885 890 895
Phe Phe Ala Gly Thr Gly Ala Lys Gln Val Glu Leu Pro Thr Tyr Ala
900 905 910
Phe Gln Arg Gln Arg Tyr Trp Leu Glu Pro Ser Asp Ser Gly Asp Val
915 920 925
Thr Gly Ala Gly Leu Ala Gly Ala Glu His Pro Leu Leu Gly Ala Val
930 935 940
Val Pro Val Ala Gly Gly Asp Glu Val Leu Leu Thr Gly Arg Ile Ser
945 950 955 960
Val Gly Thr His Pro Tro Leu Ala Glu His Arg Val Leu Gly Glu Val
965 970 975
Ile Val Pro Gly Thr Ala Leu Leu Glu Ile Ala Leu His Ala Gly Glu
980 985 990
Arg Leu Gly Cys Glu Arg Val Glu Glu Leu Thr Leu Glu Ala Pro Leu
995 1000 1005
Val Leu Pro Glu Arg Gly Ala Met Gln Val Gln Leu Arg Val Gly
1010 1015 1020
Ala Pro Glu Asn Ser Gly Arg Arg Pro Met Val Leu Tyr Ser Arg
1025 1030 1035
Pro Glu Gly Ala Ala Asp His Asp Trp Thr Arg His Ala Thr Gly
1040 1045 1050
Arg Leu Ala Pro Gly Gly Gly Glu Ala Ala Gly Asp Leu Ala Asp
1055 1060 1065
Trp Pro Ala Pro Gly Ala Leu Pro Val Asp Leu Asp Glu Phe Tyr
1070 1075 1080
Arg Asp Leu Ala Glu His Gly Leu Glu Tyr Gly Pro Ile Phe Gln
1085 1090 1095
Gly Leu Lys Ala Ala Trp Arg Gln Gly Asp Glu Val Tyr Ala Glu
1100 1105 1110
Ala Ala Leu Pro Gly Thr Glu Asp Ser Gly Phe Gly Val His Pro
1115 1120 1125
Ala Leu Leu Asp Ala Ala Leu His Ala Thr Ala Val Arg Asp Met
1130 1135 1140
Asp Gly Ala Trp Leu Pro Phe Gln Trp Glu Gly Val Cys Leu His
1145 1150 1155
Ala Arg Ala Ala Ser Ala Leu Arg Val Arg Val Val Pro Ala Gly
1160 1165 1170
Asp Asp Ala Lys Ser Leu Leu Val Cys Asp Gly Thr Gly Arg Pro
1175 1180 1185
Val Ile Ser Val Agp Arg Leu Val Phe Arg Ser Ala Ala Ala Gly
1190 1195 1200
Arg Thr Gly Ala Arg Arg Gln Ala His Arg Ala Arg Leu Tyr Arg
1205 1210 1215
Leu Gly Trp Pro Thr Val Gln Leu Pro Thr Ser Ala Gln Pro Pro
1220 1225 1230
Ser Cys Val Leu Leu Gly Thr Ser Glu Val Ser Ser Asp Met Gln
1235 1240 1245
Val Tyr Pro Asp Leu Arg Ser Leu Thr Ala Ala Leu Asp Ala Gly
1250 1255 1260
Ala Glu Pro Pro Gly Val Val Ile Ala Pro Thr Pro Pro Gly Gly
1265 1270 1275
Gly Gln Thr Ala Asp Val Arg Glu Ser Thr Arg His Ala Leu Asp
1280 1285 1290
Leu Val Gln Gly Trp Leu Ala Asp Gln Arg Leu Asn Asp Ser Arg
1295 1300 1305
Leu Phe Leu Val Thr Arg Gly Ala Val Ala Val Glu Pro Gly Glu
1310 1315 1320
Pro Val Thr Asp Leu Ala Gln Ala Ala Leu Trp Gly Leu Leu Arg
1325 1330 1335
Ser Thr Gln Thr Glu His Pro Asp Arg Phe Val Leu Val Asp Val
1340 1345 1350
Ala Glu Pro Ala Gln Leu Leu Pro Ala Leu Pro Gly Val Leu Ala
1355 1360 1365
Cys Gly Glu Pro Gln Leu Ala Leu Arg Arg Gly Gly Ala His Ala
1370 1375 1380
Pro Arg Leu Ala Gly Leu Gly Gly Asp Asp Val Leu Pro Val Pro
1385 1390 1395
Asp Ser Met Gly Trp Arg Leu Glu Ala Thr Ser Pro Gly Thr Leu
1400 1405 1410
Asp Gly Leu Ala Leu Leu Asp Glu Pro Ala Ala Thr Ala Ser Leu
1415 1420 1425
Gly Asp Gly Gln Val Arg Ile Ala Met Arg Ala Ala Gly Val Asn
1430 1435 1440
Phe Arg Asp Ala Leu Ile Ala Leu Gly Met Tyr Pro Gly Ala Ala
1445 1450 1455
Ser Leu Gly Gly Glu Gly Ala Gly Val Val Val Glu Thr Gly Pro
1460 1465 1470
Gly Val Thr Gly Leu Ala Pro Gly Asp Arg Val Met Gly Met Ile
1475 1480 1485
Pro Lys Ala Phe Gly Pro Leu Ala Val Ala Asp His Arg Met Val
1490 1495 1500
Thr Arg Ile Pro Ala Gly Trp Ser Phe Ala Gln Ala Ala Ser Val
1505 1510 1515
Pro Ile Val Phe Leu Thr Ala Tyr Tyr Ala Leu Val Asp Leu Ala
1520 1525 I530
Gly Leu Arg Pro Gly Glu Ser Leu Leu Val His Ser Ala Ala Gly
1535 1540 1545
Gly Val Gly Met Ala Ala Ile Gln Leu Ala Arg His Leu Gly Ala
1550 1555 1560
Glu Val Tyr Ala Thr Ala Ser Glu Asp Lys Trp Gln Ala Val Glu
1565 1570 1575
Leu Thr Arg Glu Arg Leu Ala Ser Ser Arg Thr Cys Asp Phe Glu
1580 1585 1590
Lys Gln Phe Leu Gly Ala Thr Gly Gly Arg Gly Val Asp Val Val
1595 1600 1605
Leu Asn Ser Leu Ala Gly Asp Phe Ala Asp Ala Ser Leu Arg Met
1610 1615 1620
Leu Pro Arg Gly Gly Arg Phe Leu Glu Leu Gly Lys Thr Asp Val
1625 1630 1635
Arg Asp Pro Val Glu Val Ala Asp Ala His Pro Gly Val Ser Tyr
1640 1645 1650
Gln Ala Phe Asp Thr Val Glu Ala Gly Pro Gln Arg Ile Gly Glu
1655 1660 1665
Met Leu Asp Glu Leu Val Glu Leu Phe Glu Gly Gly Val Leu Glu
1670 1675 1680
Pro Leu Pro Val Thr Ala Trp Asp Val Arg Gln Ala Pro Glu Ala
1685 1690 1695
Leu Arg His Leu Ser Gln Ala Arg His Val Gly Lys Leu Val Leu
1700 1705 1710
Thr Met Pro Pro Ala Trp Asp Thr Ala Gly Thr Val Leu Val Thr
1715 1720 1725
Gly Gly Thr Gly Ala Leu Gly Ala Glu Val Ala Arg His Leu Val
1730 1735 1740
Ile Glu His Gly Val Arg Asn Leu Val Leu Val Ser Arg Arg Gly
1745 1750 1755
Pro Ala Ala Ser Gly Ala Ala Glu Leu Val Ala Gln Leu Thr Ala
1760 1765 1770
Tyr Gly Ala Glu Val Ser Leu Gln Ala Cys Asp Val Ala Asp Arg
1775 1780 1785
Glu Thr Leu Ala Lys Val Leu Ala Gly Ile Pro Asp Glu His Thr
1790 1795 1800
Leu Thr Ala Val Val His Ala Ala Gly Val Leu Asp Asp Gly Val
1805 1810 1815
Ala Glu Ser Leu Thr Ala Gln Arg Leu Asp His Val Leu Arg Pro
1820 1825 1830
Lys Val Asp Gly Ala Arg Asn Leu His Glu Leu Ile Ala Pro Asp
1835 1840 1845
Val Ala Leu Val Leu Phe Ser Ser Val Ser Gly Val Leu Gly Ser
1850 1855 1860
Gly Gly Gln Gly Asn Tyr Ala Ala Ala Asn Ser Phe Leu Asp Ala
1865 1870 1875
Leu Ala Gln Gln Arg Gln Ser Arg Gly Leu Pro Thr Arg Ser Leu
1880 1885 1890
Ala Trp Gly Pro Trp Ala Glu His Gly Met Ala Ser Thr Leu Arg
1895 1900 1905
Glu Ala Glu Gln Asp Arg Leu Ala Leu Ser Gly Leu Leu Pro Ile
1910 1915 1920
Ser Thr Glu Glu Gly Leu Ser Gln Phe Asp Ala Ala Cys Gly Gly
1925 1930 1935
Ala His Thr Val Val Ala Pro Val Arg Ile Gly Arg Ser Ser Asp
1940 1945 1950
Gly Asn Pro Ile Lys Phe Pro Val Leu Arg Gly Leu Val Glu Pro
1955 1960 1965
His Arg Val Asn Lys Ala Thr Ala Asp Asp Ala Glu Ser Ile Arg
1970 1975 1980
Lys Arg Leu Gly Arg Leu Pro Asp Ala Glu Gln His Arg Ile Leu
1985 1990 1995
Leu Asp Leu Val Arg Thr His Val Ala Ala Val Leu Gly Phe Ala
2000 2005 2010
Gly Pro Gln Glu Ile Thr Ala Asp Gly Thr Phe Lys Ala Leu Gly
2015 2020 2025
Phe Asp Ser Leu Thr Val Val Glu Leu Arg Asn Arg Ile Asn Gly
2030 2035 2040
Ala Thr Gly Leu Arg Leu Pro Ala Thr Leu Val Phe Asn Tyr Pro
2045 2050 2055
Thr Pro Asp Ala Leu Ala Ala His Leu Val Thr Ala Leu Ser Ala
2060 2065 2070
Asp Arg Leu Ala Gly Thr Phe Glu Glu Leu Asp Arg Trp Ala Ala
2075 2080 2085
Asn Leu Pro Ala Leu Ala Arg Asp Glu Ala Thr Arg Ala Gln Ile
2090 2095 2100
Thr Thr Arg Leu Gln Ala Ile Leu Gln Ser Leu Ala Asp Val Ser
2105 2110 2115
Gly Gly Thr Gly Gly Gly Ser Val Pro Asp Arg Leu Arg Ser Ala
2120 2125 2130
Thr Asp Glu Glu Leu Phe Gln Leu Leu Asp Asn Asp Leu Glu Leu
2135 2140 2145
Pro
<210>5
<211>3167
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>5
Met Ser Asn Glu Glu Lys Leu Arg Glu Tyr Leu Arg Arg Ala Leu Val
1 5 10 15
Asp Leu His Gln Ala Arg Glu Arg Leu Asp Glu Ala Glu Ser Gly Glu
20 25 30
Gln Glu Pro Ile Ala Ile Val Ala Met Gly Cys Arg Tyr Pro Gly Gly
35 40 45
Val His Asp Pro Glu Gly Leu Trp Lys Leu Val Ala Ser Gly Gly Asp
50 55 60
Ala Ile Gly Glu Phe Pro Ala Asp Arg Gly Trp His Leu Asp Glu Leu
65 70 75 80
Tyr Asp Pro Asp Pro Asp Gln Pro Gly Thr Cys Tyr Thr Arg His Gly
85 90 95
Gly Phe Leu His Glu Ala Gly Glu Phe Asp Ala Gly Phe Phe Asp Ile
100 105 110
Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu
115 120 125
Glu Ile Ser Trp Glu Thr Val Glu Ser Ala Gly Met Asp Pro Arg Ser
130 135 140
Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met Tyr Glu Gly
145 150 155 160
Tyr Asp Thr Gly Ala His Pro Glu Gly Val Glu Gly Tyr Leu Gly Thr
165 170 175
Gly Asn Ala Gly Ser Val Ala Ser Gly Arg Val Ala Tyr Ser Phe Gly
180 185 190
Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu
195 200 205
Val Ala Leu His Leu Ala Cys Gln Ser Leu Arg Gln Gly Glu Cys Asp
210 215 220
Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Ala Thr Phe
225 230 235 240
Val Glu Phe Ser Arg Gln Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys
245 250 255
Ser phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly
260 265 270
Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Arg
275 280 285
Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser
290 295 300
Asn Gly Leu Thr Ala Pro Asn Gly Leu Ala Gln Glu Arg Val Ile Gln
305 310 315 320
Gln Ala Leu Thr Ser Ala Gly Leu Ser Val Ser Asp Val Asp Val Val
325 330 335
Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu Ala Gln
340 345 350
Ala Leu Ile Ala Thr Tyr Gly Gln Asp Arg Asp Arg Asp Arg Pro Leu
355 360 365
Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln Ala Ala Ala
370 375 380
Gly Val Ala Gly Val Ile Lys Met Val Met Ala Met Arg Arg Gly Glu
385 390 395 400
Leu Pro Arg Thr Leu His Val Asp Glu Pro Asn Ser His Val Asp Trp
405 410 415
Ser Ala Gly Ala Val Arg Leu Leu Thr Glu Asn Ile Arg Trp Pro Gly
420 425 430
Thr Gly Thr Arg Arg Val Gly Val Ser Ser Phe Gly Val Ser Gly Thr
435 440 445
Asn Ala His Val Ile Leu Glu His Asp Pro Leu Ala Leu Thr Glu Asn
450 455 460
Glu Asn Ala Ala Val Ser Pro Ala Pro Gly Ile Val Pro Trp Ala Leu
465 470 475 480
Ser Gly Arg Ser Ser Thr Ala Leu Arg Ala Gln Ala Glu Arg Leu Ser
485 490 495
Glu Leu Cys Glu Gln Thr Asp Pro Asp Pro Val Asp Val Gly Phe Ser
500 505 510
Leu Ala Thr Thr Arg Thr Ala Trp Glu His Arg Ala Val Val Leu Gly
515 520 525
Gly Asp Ser Ala Thr Leu Arg Ser Gly Leu Gly Val Val Ala Ser Gly
530 535 540
Glu Pro Ala Val Asp Val Val Gln Gly Ser Val Leu Gly Gly Glu Val
545 550 555 560
Val Phe Val Phe Pro Gly Gln Gly Trp Gln Trp Ala Gly Met Ala Val
565 570 575
Asp Leu Leu Asp Ala Ser Pro Thr Phe Ala Arg His Met Asp Glu Cys
580 585 590
Ala Thr Ala Leu Arg Lys Tyr Val Asp Trp Ser Leu Val Asp Val Leu
595 600 605
Arg Gly Ala Glu Asn Ala Pro Pro Leu Asp Arg Val Asp Val Leu Gln
610 615 620
Pro Val Ser Phe Ala Val Met Val Ser Leu Ala Glu Val Trp Arg Ser
625 630 635 640
Tyr Gly Val Arg Pro Ala Ala Val Val Gly His Ser Gln Gly Glu Ile
645 650 655
Ala Ala Ala Cys Ala Ala Gly Val Leu Pro Leu Glu Asp Ala Ala Arg
660 665 670
Leu Val Ala Leu Arg Ser Arg Ala Leu Lys Ala Leu Ser Gly Arg Gly
675 680 685
Gly Met Ala Ser Leu Ala Cys Ser Ala Asp Glu Ala Ala Ala Leu Phe
690 695 700
Ala Gly Leu Gly Gly Arg Leu Glu Ile Ala Ala Ile Asn Gly Pro Arg
705 710 715 720
Ser Val Val Val Ser Gly Asp Leu Glu Ala Val Glu Glu Leu Leu Ala
725 730 735
Glu Cys Ala Glu Arg Asp Met Arg Ala Arg Arg Ile Pro Val Asp Tyr
740 745 750
Ala Ser His Ser Ala His Val Glu Val Val Arg Ser Pro Val Leu Ala
755 760 765
Ala Ala Ala Gly Val Arg His Arg Asp Gly Gln Val Pro Trp Trp Ser
770 775 780
Thr Val Ile Gly Asp Trp Leu Asp Pro Ala Gly Leu Asp Gly Glu Tyr
785 790 795 800
Trp Tyr Arg Asn Leu Arg Gln Pro Val Arg Phe Glu His Ala Val Gln
805 810 815
Gly Leu Val Glu Arg Gly Phe Gly Leu Phe Ile Glu Met Ser Ala His
820 825 830
Pro Val Leu Thr Met Ala Val Glu Glu Thr Ser Ala Glu Ser Glu Ser
835 840 845
Ala Val Ala Ala Val Gly Thr Leu Arg Arg Asp Ser Gly Gly Arg Arg
850 855 860
Arg Leu Leu Gln Ser Leu Ala Glu Ala Tyr Val Arg Gly Ala Thr Val
865 870 875 880
Asp Trp Ala Val Ala Phe Gly Gly Val Gly Arg Arg Leu Asp Leu Pro
885 890 895
Thr Tyr Pro Phe Gln Arg Arg Arg Tyr Trp Leu Asp Arg Gly Ala Ala
900 905 910
Ser Glu Glu Ala Arg Ala Phe Ser Asp Pro Ala Ala Asp Trp Phe Trp
915 920 925
Gln Ala Val Glu Arg Gln Asp Leu Lys Gly Val Ala Asp Ala Leu Asp
930 935 940
Leu Asp Ala Asp Ala Pro Leu Ser Ala Thr Leu Pro Ala Leu Ser Val
945 950 955 960
Trp His Arg Gln Glu Arg Glu Lys Val Leu Val Asp Gly Trp Arg Tyr
965 970 975
Arg Val Asp Trp Val Pro Val Ala Pro Gln Pro Ile Arg Arg Thr Arg
980 985 990
Glu Thr Trp Leu Leu Val Val Pro Ala Gly Gly Ile Glu Glu Ala Leu
995 1000 1005
Val Glu Arg Leu Thr Asp Ala Leu Asn Thr Arg Gly Ile Ser Thr
1010 1015 1020
Leu Arg Leu Asp Val Pro Pro Thr Ala Thr Ser Gly Glu Leu Ala
1025 1030 1035
Thr Gly Leu Arg Ala Ala Val Gly Gly Asp Pro Val Lys Gly Ile
1040 1045 1050
Leu Ser Lau Thr Ala Leu Asp Glu Arg Thr His Pro Glu Arg Lys
1055 1060 1065
Ala Val Pro Ser Gly Ile Ala Leu Leu Leu Asn Leu Val Lys Ala
1070 1075 1080
Leu Gly Glu Gly Asp Leu Arg Val Pro Leu Trp Thr Ile Thr Arg
1085 1090 1095
Gly Ala Val Lys Ala Asp Pro Ala Asp Arg Leu Leu Arg Pro Met
1100 1105 1110
Gln Ala Gln Ala Trp Gly Leu Gly Arg Val Ala Ala Leu Glu His
1115 1120 1125
Pro Glu Arg Trp Gly Gly Leu Ile Asp Leu Pro Glu Ser Leu Asp
1130 1135 1140
Gly Asp Val Leu Thr Arg Leu Gly Glu Ala Leu Ile Asn Gly Leu
1145 1150 1155
Ala Glu Asp Gln Leu Ala Ile Arg Gln Ser Gly Val Leu Ala Arg
1160 1165 1170
Arg Leu Val Pro Ala Pro Ala Asn Gln Pro Ala Gly Arg Lys Trp
1175 1180 1185
Arg Pro Arg Gly Ser Ala Leu Ile Thr Gly Gly Leu Gly Ala Val
1190 1195 1200
Gly Ala Gln Val Ala Arg Trp Leu Ala Glu Ser Gly Ala Glu Arg
1205 1210 1215
Ile Val Leu Thr Ser Arg Arg Gly Lys Glu Ala Pro Gly Ala Ala
1220 1225 1230
Glu Leu Glu Ala Glu Leu Arg Ala Leu Gly Ala Gln Val Ser Ile
1235 1240 1245
Val Ala Cys Asp Val Thr Asp Arg Ala Glu Met Ser Ala Leu Leu
1250 1255 1260
Ala Glu Phe Gly Val Thr Ala Val Phe His Ala Ala Gly Val Gly
1265 1270 1275
Arg Leu Leu Pro Leu Ala Glu Thr Glu Gln Asn Asp Leu Ala Glu
1280 1285 1290
Ile Cys Thr Ala Lys Val His Gly Ala Gln Val Leu Asp Glu Leu
1295 1300 1305
Cys Asp Ser Thr Asp Leu Asp Ala Phe Val Leu Phe Ser Ser Gly
1310 1315 1320
Ala Gly Val Trp Gly Gly Gly Gly Gln Gly Ala Tyr Gly Ala Ala
1325 1330 1335
Asn Ala Phe Leu Asp Thr Leu Ala Glu Gln Arg Arg Ala Arg Gly
1340 1345 1350
Leu Pro Ala Thr Ala Ile Ser Trp Gly Ser Trp Gly Gly Gly Met
1355 1360 1365
Ala Asp Gly Ala Ala Gly Glu Leu Leu Arg Arg Arg Gly Ile Arg
1370 1375 1380
Pro Met Pro Ala Ala Ser Ala Ile Leu Ala Leu Gln Glu Val Leu
1385 1390 1395
Asp Gln Asp Glu Thr Cys Val Ser Ile Ala Asp Val Asp Trp Asp
1400 1405 1410
Arg Phe Val Pro Thr Phe Ala Ala Thr Arg Ala Thr Arg Leu Leu
1415 1420 1425
Asp Glu Leu Pro Ala Val Arg Lys Ala Met Ser Ala Asn Gly Pro
1430 1435 1440
Ala Glu Pro Gly Gly Ser Pro Phe Ala Arg Asn Leu Ala Glu Leu
1445 1450 1455
Pro Glu Ala Gln Arg Arg His Glu Leu Val Asp Leu Val Ser Ala
1460 1465 1470
Gln Val Ala Ala Val Leu Gly His Gly Ser Arg Glu Glu Val Gln
1475 1480 1485
Pro Glu Arg Ala Phe Arg Ala Leu Gly Phe Asp Ser Leu Met Ala
1490 1495 1500
Val Asp Leu Arg Asn Arg Leu Thr Thr Ala Thr Gly Leu Arg Leu
1505 1510 1515
Pro Thr Thr Thr Val Phe Asp Tyr Pro Asn Pro Ala Ala Leu Ala
1520 1525 1530
Ala His Leu Leu Glu Glu Leu Val Gly Asp Val Ala Ser Ala Ala
1535 1540 1545
Val Thr Thr Ala Ile Ala Pro Ser Thr Asp Glu Pro Val Ala Ile
1550 1555 1560
Val Ala Met Ser Cys Arg Phe Pro Gly Gly Ala His Ser Pro Glu
1565 1570 1575
Asp Leu Trp Arg Leu Val Ala Ser Gly Ala Glu Val Ile Gly Glu
1580 1585 1590
Phe Pro Ser Asp Arg Gly Trp Asp Ala Glu Ser Leu Tyr Asp Pro
1595 1600 1605
Asp Ala Ser Lys Pro Gly Thr Thr Tyr Ala Arg Met Ala Gly Phe
1610 1615 1620
Leu Tyr Asp Ala Gly Glu Phe Asp Ala Gly Leu Phe Gly Ile Ser
1625 1630 1635
Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Val Leu
1640 1645 1650
Glu Ile Ala Trp Glu Ala Leu Glu Arg Ala Gly Ile Asp Pro Leu
1655 1660 1665
Ser Leu Lys Gly Ser Gly Val Gly Thr Tyr Ile Gly Ala Gly Ser
1670 1675 1680
Arg Gly Tyr Ala Thr Asp Val Arg Gln Phe Pro Glu Glu Ala Glu
1685 1690 1695
Gly Tyr Leu Leu Thr Gly Thr Ser Ala Ser Val Leu Ser Gly Arg
1700 1705 1710
Val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp
1715 1720 1725
Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Cys Gln
1730 1735 1740
Ser Leu Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val
1745 1750 1755
Thr Val Met Ser Thr Pro Glu Met Phe Val Glu Phe Ser Arg Gln
1760 1765 1770
Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Glu Ser
1775 1780 1785
Ala Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly Leu Leu Leu Leu
1790 1795 1800
Glu Arg Leu Ser Asp Ala His Arg Asn Gly His Arg Val Leu Ala
1805 1810 1815
Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly
1820 1825 1830
Leu Ala Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Lys Gln
1835 1840 1845
Ala Leu Ala Asn Ala Gly Leu Ser Ala Ser Asp Val Asp Ala Val
1850 1855 1860
Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu Ala
1865 1870 1875
Gln Ala Leu Ile Ala Thr Tyr Gly Gln Gly Arg Glu Arg Asp Arg
1880 1885 1890
Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln
1895 1900 1905
Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Val Met Ser Met
1910 1915 1920
Arg Asn Asp Glu Leu Pro Ala Thr Leu His Val Gly Ala Pro Thr
1925 1930 1935
Ser Gln Val Asp Trp Ser Ala Gly Ala Val Arg Leu Leu Thr Glu
1940 1945 1950
Gln Val Pro Trp Pro Glu Ser Asp Arg Val Arg Arg Val Gly Val
1955 1960 1965
Ser Ser Phe Gly Ile Ser Gly Thr Ash Ala His Val Ile Leu Glu
1970 1975 1980
Gln Ser Thr Asn Ala Pro Asp Ser Pro Ala Ala Thr Asp Lys Ser
1985 1990 1995
Gly Ser Gly Ser Thr Val Asp Ile Pro Val Val Pro Trp Leu Val
2000 2005 2010
Ser Gly Gln Thr Ser Asp Ser Leu Arg Gly Gln Ala Glu Arg Val
2015 2020 2025
Leu Ser Gln Val Glu Ser Arg Pro Glu Gln Arg Pro Leu Asp Val
2030 2035 2040
Ala Tyr Ser Leu Ala Ser Gly Arg Ala Ala Leu Asp Glu Arg Ala
2045 2050 2055
Val Val Leu Gly Ala Asp Arg Asn Glu Leu Val Ala Gly Leu Val
2060 2065 2070
Ala Leu Ala Ala Gly His Glu Ala Ser Gly Val Ile Thr Gly Thr
2075 2080 2085
Arg Ala Ser Ala Arg Phe Gly Phe Val Phe Ser Gly Gln Gly Gly
2090 2095 2100
Gln Trp Leu Gly Met Gly Arg Glu Leu Tyr Ser Lys Phe Pro Val
2105 2110 2115
Phe Ala Ala Ala Phe Asp Glu Ala Cys Ala Glu Leu Asp Ala His
2120 2125 2130
Leu Ser Glu Asp Leu Arg Val Arg Asp Val Val Phe Gly Ser Asp
2135 2140 2145
Ala Gln Leu Leu Asp Gln Thr Leu Trp Ala Gln Ser Gly Leu Phe
2150 2155 2160
Ala Leu Gln Val Gly Leu Leu Gly Leu Leu Gly Ser Trp Gly Val
2165 2170 2175
Arg Pro Asp Val Val Met Gly His Ser Val Gly Glu Leu Ala Ala
2180 2185 2190
Ala Phe Ala Ala Gly Val Leu Ser Leu Arg Asp Ala Ala Arg Leu
2195 2200 2205
Val Ala Ala Arg Ala Arg Leu Met Gln Ala Leu Pro Ser Asp Gly
2210 2215 2220
Ala Met Lau Ala Val Ala Ala Gly Glu Asp Leu Ile Arg Pro Leu
2225 2230 2235
Leu Ala Gly Arg Glu Ala Ser Val Asn Val Ala Ala Leu Asn Ala
2240 2245 2250
Pro Gly Ser Val Val Leu Ser Gly Asp Arg Asp Val Leu Ala Asp
2255 2260 2265
Ile Ala Gly Arg Leu Asn Glu Leu Gly Val Arg Thr Arg Arg Leu
2270 2275 2280
Arg Val Ser His Ala Phe His Ser His Arg Met Asp Pro Met Leu
2285 2290 2295
Gly Glu Phe Ala Gln Ile Ala Glu Ser Ala Glu Phe Gly Arg Pro
2300 2305 2310
Thr Thr Pro Leu Val Ser Thr Leu Thr Gly Glu Leu Asp Arg Ala
2315 2320 2325
Gly Glu Met Ser Thr Pro Gly Tyr Trp Val Arg Gln Val Arg Glu
2330 2335 2340
Pro Val Arg Phe Ala Asp Gly Val Arg Ala Leu Ala Ala Gln Gly
2345 2350 2355
Val Asp Thr Val Val Glu Leu Gly Pro Asp Gly Ala Leu Ser Ala
2360 2365 2370
Leu Val Gln Glu Cys Ala Thr Gly Phe Asp Arg Val Gly Arg Ile
2375 2380 2385
Ser Pro Val Pro Leu Met Arg Arg Glu Arg Asp Glu Thr Arg Ser
2390 2395 2400
Val Met Thr Ala Leu Ala His Leu His Thr Arg Gly Gly Glu Leu
2405 2410 2415
Asp Trp Gln Ala Phe Phe Ser Gly Thr Gly Ala Arg Gln Val Glu
2420 2425 2430
Leu Pro Thr Tyr Ala Phe Gln Arg Arg His Tyr Trp Ile Glu Ser
2435 2440 2445
Ser Ala Arg Thr Ala Arg Asp Arg Ala Asp Ile Gly Glu Val Ala
2450 2455 2460
Glu Gln Phe Trp Thr Ala Val Glu Gln Gly Asp Leu Glu Ala Leu
2465 2470 2475
Val Ser Ala Leu Glu Leu Gly Ala Asp Asp Asp Thr Cyg Ala Ser
2480 2485 2490
Leu Ser Asp Val Leu Pro Ala Leu Ser Ser Trp Arg Ser Gly Leu
2495 2500 2505
Arg Asn Arg Ser Leu Val Asp Ser Cys Arg Tyr Arg Ile Asn Trp
2510 2515 2520
His Ser Ser Arg Glu Ala Pro Ala pro Lys Ile Ser Gly Thr Trp
2525 2530 2535
Leu Leu Val Val Pro Gly Asp Ala Asp Asp Gly Leu Ala Thr Ala
2540 2545 2550
Leu Thr Ser Ser Leu Val Glu Gly Gly Ala Glu Val Val Arg Ile
2555 2560 2565
Asp Leu Ser Glu Glu Asp Leu His Arg Glu Asp Leu Ala Gln Arg
2570 2575 2580
Leu Ala Asn Ala Leu Thr Asp Val Gly Arg Leu Gly Gly Val Leu
2585 2590 2595
Ser Leu Leu Gly Leu Asp Asp Ser Ala Val Gly Glu Phe Ser Cys
2600 2605 2610
Leu Thr Arg Gly Phe Ala Leu Thr Val Gln Leu Val Arg Ala Leu
2615 2620 2625
Arg Asn Ala Glu Leu Glu Ala Pro Leu Trp Ala Val Thr Arg Gly
2630 2635 2640
Gly Val Ser Leu Glu Asp Val Ser Val Ser Pro Glu Gln Ala Leu
2645 2650 2655
Ile Trp Gly Leu Leu Arg Val Ala Gly Leu Glu His Pro Glu Phe
2660 2665 2670
Trp Gly Gly Leu Ile Asp Leu Pro Ser Asp Trp Asp Asp Arg Leu
2675 2680 2685
Gly Ala Arg Leu Val Gly Val Leu Ala Asp Gly Gly Glu Asp Gln
2690 2695 2700
Val Ala Ile Arg Arg Gly Gly Val Phe Val Arg Arg Leu Glu Arg
2705 2710 2715
Ala Gly Ala Ser Gly Ala Gly Ser Val Trp Arg Pro Arg Gly Thr
2720 2725 2730
Val Leu Val Thr Gly Gly Thr Gly Gly Leu Gly Ala His Val Ala
2735 2740 2745
Arg Trp Leu Ala Gly Ala Gly Ala Glu His Val Val Leu Thr Ser
2750 2755 2760
Arg Arg Gly Ala Glu Ala Pro Gly Ala Gly Glu Leu Arg Ala Glu
2765 2770 2775
Leu Glu Ala Leu Gly Ala Arg Val Ser Ile Val Pro Cys Asp Val
2780 2785 2790
Ala Asp Arg Asp Ala Val Ala Gly Val Leu Ala Gly Ile Gly Gly
2795 2800 2805
Glu Cys Pro Leu Thr Ala Val Val His Ala Ala Gly Val Gly Glu
2810 2815 2820
Ala Gly Gly Val Val Glu Met Ala Leu Ala Asp Phe Ala Glu Val
2825 2830 2835
Leu Ser Ala Lys Val Arg Gly Ala Ala Asn Leu Asp Glu Lau Leu
2840 2845 2850
Ala Asp Ser Glu Leu Asp Ala Phe Val Leu Phe Ser Ser Val Ser
2855 2860 2865
Gly Val Trp Gly Ala Gly Gly Gln Gly Ala Tyr Ala Ala Ala Asn
2870 2875 2880
Ala Tyr Leu Asp Ala Leu Ala Glu Gln Arg Arg Ala Ser Gly Leu
2885 2890 2895
Ala Gly Thr Ala Val Ala Trp Gly Pro Trp Ala Gly Asp Gly Met
2900 2905 2910
Ala Ala Gly Glu Thr Gly Ala Gln Leu His Arg Met Gly Leu Val
2915 2920 2925
Ser Met Glu Pro Arg Ala Ala Leu Leu Ala Leu Gln Gly Ala Leu
2930 2935 2940
Asp Arg Asp Glu Thr Ser Leu Val Val Ala Asp Val Asp Trp Ala
2945 2950 2955
Arg phe Ala Pro Ala Phe Thr Ser Ala Arg Arg Arg Pro Leu Leu
2960 2965 2970
Asp Thr Ile Asp Glu Ala Arg Ala Ala Leu Glu Thr Thr Ser Glu
2975 2980 2985
Lys Ala Gly Thr Gly Lys Pro Val Glu Leu Lys His Arg Leu Ala
2990 2995 3000
Gly Leu Ser Arg Lys Glu Arg Asp Asp Ala Val Leu Asp Leu Val
3005 3010 3015
Arg Ala Glu Thr Ala Ala Val Leu Gly Arg Asp Asp Ala Thr Ala
3020 3025 3030
Leu Ala Pro Ser Arg Pro Phe Gln Glu Leu Gly Phe Asp Ser Leu
3035 3040 3045
Met Ala Val Glu Leu Arg Asn Arg Leu Asn Thr Ala Thr Gly Ile
3050 3055 3060
Gln Lau Pro Ala Ser Thr Ile Phe Asp Tyr Pro Asn Ala Glu Ser
3065 3070 3075
Leu Ser Arg His Leu Cys Ala Gly Leu Phe Pro Thr Glu Thr Thr
3080 3085 3090
Val Asp Ser Ala Leu Ala Glu Leu Asp Arg Ile Glu Gln Gln Leu
3095 3100 3105
Ser Met Phe Thr Glu Glu Ala Arg Ala Arg Asp Arg Ile Ala Thr
3110 3115 3120
Arg Leu Arg Ala Leu His Ala Lys Trp Asn Ser Ala Ser Glu Ala
3125 3130 3135
Pro Thr Gly Ala Asp Val Leu Asn Thr Leu Asp Ser Ala Thr His
3140 3145 3150
Asp Glu Ile Phe Glu Phe Ile Asp Asn Glu Leu Asp Leu Ser
3155 3160 3165
<210>6
<211>4933
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>6
Val Glu Ile Thr Met Ala Asn Glu Glu Lys Leu Phe Gly Tyr Leu Lys
1 5 10 15
Lys Val Thr Ala Asp Leu His Gln Thr Arg Gln Arg Leu Leu Ala Ala
20 25 30
Glu Ser Arg Ser Gln Glu Pro Ile Val Ser Ala Ser Cys Arg Leu Pro
35 40 45
Gly Gly Val Asp Ser Pro Glu Ala Leu Trp Gln Leu Val Arg Thr Gly
50 55 60
Thr Asp Ala Ile Ser Glu Phe Pro Ala Asp Arg Gly Trp Asp Leu Asp
65 70 75 80
Arg Leu Tyr Asp Pro Asp Pro Asp His Gln Gly Thr Ser Tyr Thr Arg
85 90 95
Ala Gly Gly Phe Leu Ala Asp Ala Gly Asp Phe Asp Pro Ala Met Phe
100 105 110
Gly Ile Ser pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu
115 120 125
Leu Leu Glu Leu Thr Trp Glu Ala Leu Glu Arg Ala Gly Ile Asp Pro
130 135 140
Thr Ser Leu Arg Gly Ser Lys Thr Gly Val Phe Gly Gly Val Thr Pro
145 150 155 160
Gln Glu Tyr Gly Pro Pro Leu Pro Glu Met Ser Arg Asn Ser Gly Gly
165 170 175
Phe Gly Leu Thr Gly Arg Met Val Ser Val Ala Ser Gly Arg Val Ala
180 185 190
Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys
195 200 205
Ser Ser Ser Leu Val Ala Leu His Leu Ala Cys Gln Ser Leu Arg Ser
210 215 220
Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr
225 230 235 240
Pro Ala Thr Phe Val Glu Phe Ser Arg Gln Arg Gly Leu Ala Pro Asp
245 250 255
Gly Arg Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly
260 265 270
Glu Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg
275 280 285
Asn Gly His Lys Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln
290 295 300
Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln
305 310 315 320
Arg Val Ile Thr Gln Ala Leu Ser Asn Ala Gly Leu Ser Val Ser Asp
325 330 335
Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro
340 345 350
Ile Glu Ala Gln Ala Leu Ile Ala Thr Tyr Gly Gln Gly Arg Glu Lys
355 360 365
Asp Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr
370 375 380
Gln Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Val Leu Ala Met
385 390 395 400
Arg His Gly Gln Leu Pro Ala Thr Leu His Val Asp Asp Pro Thr Ser
405 4l0 4l5
Ala Val Asp Trp Ser Ala Gly Ser Val Arg Leu Leu Thr Glu Asr Thr
420 425 430
Pro Trp Pro Asp Ser Gly Arg Pro Cys Arg Val Gly Val Ser Ser Phe
435 440 445
Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ser Pro Val
450 455 460
Glu Gln Gly Glu Pro Thr Gly Pro Val Glu Gly Glu Arg Glu Pro Glu
465 470 475 480
Ala Ala Ile Pro Val Val Pro Trp Met Val Ser Gly Lys Thr Pro Glu
485 490 495
Ala Ala Arg Ala Gln Ala Glu Arg Val Leu Ser His Ile Glu Asp Arg
500 505 510
Pro Glu Leu Ser Pro Val Asp Val Ala Tyr Ser Leu Gly Met Thr Arg
515 520 525
Ala Ala Leu Asp Glu Arg Ala Val Met Leu Gly Ser Asp Arg Asp Thr
530 535 540
Leu Leu Thr Gly Leu Arg Ala Phe Ala Asp Gly Cys Asp Val Pro Glu
545 550 555 560
Val Val Ser Gly Ser Val Gly Asn Gly Gly Arg Val Gly Phe Val Phe
565 570 575
Ala Gly Gln Gly Gly Gln Trp Pro Gly Met Gly Arg Gly Leu Tyr Ser
580 585 590
Val Phe Pro Gly Phe Ala Asp Ala Phe Asp Glu Ala Cys Ala Glu Leu
595 600 605
Asp Thr His Leu Gly Gln Glu Leu Gly Val Arg Asp Val Val Phe Gly
610 615 620
Ser Asp Ala Arg Leu Val Asp Arg Thr Val Trp Ala Gln Ser Gly Leu
625 630 635 640
Phe Ala Leu Gln Val Gly Leu Leu Arg Leu Leu Gly Ser Trp Gly Val
645 650 655
Arg Pro Asp Val Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Val
660 665 670
His Ala Ala Gly Val Leu Ser Leu Pro Glu Ala Ala Arg Leu Val Ala
675 680 685
Gly Arg Ala Arg Leu Met Gln Ala Leu Pro Ser Gly Gly Ala Met Leu
690 695 700
Ala Val Ala Ala Ser Glu Ala Gln Val Glu Pro Leu Leu Asp Arg Val
705 710 715 720
Arg Gly Arg Val Glu Ile Ala Ala Ile Asn Gly Pro Gly Ser Val Val
725 730 735
Leu Ser Gly Asp Arg Glu Leu Leu Thr Glu Ile Ala Asp Arg Leu His
740 745 750
Asp Gln Gly Cys Arg Thr Arg Trp Leu Arg Val Ser His Ala Phe His
755 760 765
Ser Pro His Met Glu Pro Met Leu Glu Glu Phe Ala Gln Ile Ala Arg
770 775 780
Ser Arg Glu Tyr Gln Ala Pro Glu Leu Pro Ile Ile Ser Thr Leu Thr
785 790 795 800
Gly Glu Leu Asp Gly Gly Arg Val Met Gly Thr Pro Glu Tyr Trp Val
805 810 815
Arg Gln Val Arg Glu Pro Val Arg Phe Ala Glu Gly Val Gln Ala Leu
820 825 830
Val Gly Gln Gly Ala Asp Thr Ile Val Glu Phe Gly Pro Asp Gly Ala
835 840 845
Leu Ser Thr Leu Val Glu Glu Cys Leu Ala Glu Ser Gly Arg Val Ala
850 855 860
Gly Ile Pro Leu Met Arg Lys Asp Arg Asp Glu Ala Arg Thr Val Leu
865 870 875 880
Ala Ala Leu Ala Gln Ile His Thr Arg Gly Gly Glu Val Glu Trp Gln
885 890 895
Ser Phe Phe Ala Gly Thr Gly Ala Lys Gln Val Glu Leu Pro Thr Tyr
900 905 910
Ala Phe Gln Arg Gln Arg Tyr Trp Leu Ala Ser Thr Gly Gly Ala Gly
915 920 925
Asp Val Thr Ala Ala Gly Leu Ala Glu Ala Asp His Pro Leu Leu Gly
930 935 940
Ala Val Val Ala Leu Ala Asp Gly Glu Gly Val Val Leu Thr Gly Arg
945 950 955 960
Leu Thr Ala Asp Ser His Pro Trp Leu Ser Asp His Arg Val Leu Gly
965 970 975
Glu Ile Val Val Pro Gly Thr Ala Ile Val Glu Leu Ala Trp His Val
980 985 990
Gly Glu Arg Leu Gly Cys Gly Arg Val Glu Glu Leu Ala Leu Glu Ala
995 1000 1005
Pro Leu Ile Leu Pro Asp His Gly Ala Val Gln Val Gln Val Leu
1010 1015 1020
Val Gly Pro Pro Gly Glu Ser Gly Ala Arg Ser Val Ala Leu Tyr
1025 1030 1035
Ser Arg Pro Gly Asp Ala Thr Glu Ser Glu Trp Lys Lys His AIa
1040 1045 1050
Thr Gly Val Leu Leu Pro Pro Val Ala Ala Glu Asn His Glu Leu
1055 1060 1065
Pro Ala Trp Pro Pro Glu Asn Ala Thr Glu Ile Asp Ala Asp Glu
1070 1075 1080
Val Tyr Glu Phe Leu Glu Gly His Gly Phe Ala Tyr Gly Pro Ala
1085 1090 1095
Phe Arg Cys Leu Arg Gly Ala Trp Arg Arg Gly Gly Glu Val Phe
1100 1105 1110
Ala Glu Val Ala Leu Pro Asp Gly Met Gln Val Gly Val Asp Arg
1115 1120 1125
Phe Gly Val His Pro Ala Leu Leu Asp Ala Val Leu His Ala Ala
1130 1135 1140
Ala Ala Glu Thr Ser Val Val Gln Ser Glu Ala Arg Val Pro Phe
1145 1150 1155
Ser TIp Arg Gly Val Glu Leu Arg Ala Thr Glu Thr Ala Val Val
1160 1165 1170
Arg Ala Arg Ile Ser Leu Thr Ala Asp Asp Glu Leu Ser Leu Val
1175 1180 1185
Ala Val Asp Pro Val Gly Gly Phe Val Ala Ser Val Asp Ser Leu
1190 1195 1200
Val Thr Arg Pro Ile Ser Arg Gln Gln Val Arg Ser Gly A1a Ile
1205 1210 1215
Gly Asp Cys Leu Phe Glu Val Glu Trp His Arg Arg Ala Leu Leu
1220 1225 1230
Glu Thr Ala Ala Asp Asp Gly Leu Ala Ile Val Gly Asp Gly Ala
1235 1240 1245
Ser Trp Pro Glu Ser Val Arg Ala Thr Ala Arg Phe Ala Thr Leu
1250 1255 1260
Asp Glu Leu Arg Ser Ala Ala Asp Ser Asp Val Pro Ala Pro Gly
1265 1270 1275
Pro Val Leu Val Ala Ala Met Ser Ala Glu Glu Val Glu Ser Glu
1280 1285 1290
Ser Leu Pro Ser Arg Ala Gln Glu Ser Thr Ser Asp Leu Leu Ala
1295 1300 1305
Leu Val Gln Ser Trp Leu Ala Asp Glu Gln Phe Ala Glu Ser Gln
1310 1315 1320
Leu Val Val Val Thr Arg Ala Ala Val Ser Ala Asp Ser Asp Thr
1325 1330 1335
Asp Val Ala Asp Leu Val Ser Ala Ser Ser Trp Gly Leu Leu Arg
1340 1345 1350
Ser Ala Gln Ser Glu Asn Pro Gly Arg Phe Val Leu Val Asp Val
1355 1360 1365
Asp Gly Thr Pro Glu Ser Trp Gln Ala Leu Pro Thr Ala Val Arg
1370 1375 1380
Ala Gly Glu Pro Gln Leu Ala Leu Arg Arg Gly Val Ala Leu Val
1385 1390 1395
Pro Arg Leu Ala Arg Leu Lys Ala His Gly Glu Gly Ser Ser Pro
1400 1405 1410
Arg Leu Asp Thr Asp Gly Thr Val Leu Ile Thr Gly Gly Thr Gly
1415 1420 1425
Ala Leu Gly Gly Val Val Ala Arg His Leu Val Ala Glu His Gly
1430 1435 1440
Ile Arg Arg Leu Val Leu Ala Gly Arg Arg Gly Trp Asn Ala Pro
1445 1450 1455
Gly Val His Asp Leu Val Asp Glu Leu Ala Arg Ser Gly Ala Val
1460 1465 1470
Val Asp Val Val Ala Cys Asp Val Gly Asn Arg Thr Asp Leu Glu
1475 1480 1485
Gln Ala Leu Ala Ala Ile Pro Val Asp Arg Pro Leu Arg Gly Ile
1490 1495 1500
Val His Thr Ala Gly Val Leu Ala Asp Gly Val Leu Gly Ser Leu
1505 1510 1515
Ser Ala Ala Asp Val Asp Thr Val Phe Ala Pro Lys Val Ala Gly
1520 1525 1530
Ala Trp His Leu His Glu Leu Thr Arg Glu Leu Asp Leu Ser Phe
1535 1540 1545
Phe Val Leu Phe Ser Ser Phe Ser Gly Ile Ala Gly Ala Ala Gly
1550 1555 1560
Gln Ala Asn Tyr Ala Ala Ala Asn Thr Phe Leu Asp Ala Leu Ala
1565 1570 1575
Gly Tyr Arg Arg Ala Arg Gly Leu Pro Gly Leu Ser Leu Ala Trp
1580 1585 1590
Gly Lau Trp Ala Gln Pro Gly Gly Met Thr Ser Gly Leu Asp Ala
1595 1600 1605
Ala Ser Val Glu Arg Leu Ala Arg Thr Gly Ile Ala Glu His Ser
1610 1615 1620
Thr Glu Asp Gly Leu Arg Leu Phe Asp Ala Ala Ile Ala Lys Asp
1625 1630 1635
Arg Ala Cys Val Val Pro Ala Arg Leu Asp Arg Ala Leu Leu Val
1640 1645 1650
Glu His Ala Arg Ser His Ala Ile Pro Ala Leu Met Thr Ala Leu
1655 1660 1665
Ala Pro Ala Arg Gly Gly Val Ala Arg Arg Ala Thr Asn Ser Gln
1670 1675 1680
Ala Ala Asp Glu Asp Ala Leu Leu Gly Leu Val Arg Asp His Val
1685 1690 1695
Ser Ala Val Leu Gly Tyr Ser Gly Ala Val Glu Val Gly Gly Asp
1700 1705 1710
Arg Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly Val Glu
1715 1720 1725
Leu Arg Asn Arg Leu Ala Gly Val Leu Gly Val Arg Leu Pro Ala
1730 1735 1740
Thr Ala Val Phe Asp Tyr Pro Thr Pro Arg Ala Leu Ala Arg Phe
1745 1750 1755
Leu His Gln Glu Leu Ala Gly Glu Val Gly Ser Met Ser Thr Pro
1760 1765 1770
Val Thr Arg Ala Ala Ser Val Glu Glu Asp Leu Ile Ala Ile Val
1775 1780 1785
Gly Met Gly Cys Arg Phe Pro Gly Gly Val Ser Ser Pro Glu Glu
1790 1795 1800
Leu Trp Arg Leu Val Ala Gly Gly Val Asp Ala Val Ala Gly Phe
1805 1810 1815
Pro Asp Asp Arg Gly Trp Asp Leu Ala Gly Leu Phe Asp Pro Asp
1820 1825 1830
Pro Asp His Leu Gly Thr Ser Tyr Val Cys Glu Gly Gly Phe Leu
1835 1840 1845
Arg Asp Ala Ala Glu Phe Asp Ala Asp Met Phe Gly Val Ser Pro
1850 1855 1860
Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu
1865 1870 1875
Val Ala Trp Glu Thr Leu Glu Arg Ala Gly Ile Asp Pro Phe Ser
1880 1885 1890
Leu His Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met Tyr His
1895 1900 1905
Asp Tyr Gly Ala Arg Phe Ile Thr Arg Ala Pro Glu Gly Phe Glu
1910 1915 1920
Gly His Leu Gly Thr Gly Asn Ala Gly Ser Val Leu Ser Gly Arg
1925 1930 1935
Val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp
1940 1945 1950
Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Gly Gln
1955 1960 1965
Ala Leu Arg Ala Gly Glu Cys Glu Leu Ala Leu Ala Gly Gly Val
1970 1975 1980
Thr Val Met Ser Thr Pro Thr Thr Phe Val Glu Phe Ser Arg Gln
1985 1990 1995
Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala
2000 2005 2010
Ala Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly Leu Val Leu Leu
2015 2020 2025
Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Lys Val Leu Ala
2030 2035 2040
Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly
2045 2050 2055
Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Thr Gln
2060 2065 2070
Ala Leu Thr Ser Ala Gly Leu Ser Leu Ser Asp Val Asp Ala Val
2075 2080 2085
Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu Ala
2090 2095 2100
Gln Ala Leu Ile Ala Thr Tyr Gly Arg Asp Arg Asp Pro Gly Arg
2105 2110 2115
Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln
2120 2125 2130
Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Val Met Ala Met
2135 2140 2145
Arg His Gly Glu Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser
2150 2155 2160
Ala Gln Val Asp Trp Ser Ala Gly Thr Val Gln Leu Leu Thr Glu
2165 2170 2175
Asn Thr Pro Trp Pro Asp Ssr Gly Arg Leu Arg Arg Ala Gly Val
2180 2185 2190
Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His Leu Ile Leu Glu
2195 2200 2205
Gln Pro Pro Arg Glu Thr His Arg Ala Thr Glu Pro Asp Ser Ser
2210 2215 2220
Ser Val Leu Asp Val Pro Val Val Pro Trp Met Val Ser Gly Lys
2225 2230 2235
Thr Pro Glu Ala Leu Ser Ala Gln Ala Asp Ala Leu Met Ser Tyr
2240 2245 2250
Leu Asn Asn Arg Val Asp Val Ser Pro Arg Asp Ile Gly Tyr Ser
2255 2260 2265
Leu Ala Val Thr Arg Pro Ala Leu Asp His Arg Ala Val Val Leu
2270 2275 2280
Gly Ala Asp Arg Glu Ala Leu Leu Pro Gly Leu Lys Ala Leu Ala
2285 2290 2295
Ala Ser His Asp Ala Ala Glu Val Ile Thr Gly Thr Arg Ala Ala
2300 2305 2310
Gly Pro Val Gly Phe Val Phe Ser Gly Gln Gly Gly Gln Trp Pro
2315 2320 2325
Gly Met Gly Ser Gly Leu Tyr Ser Ala Phe Pro Val Phe Ala Asp
2330 2335 2340
Ala Phe Asp Glu Ala Cys Gly Glu Leu Asp Ala His Leu Gly Gln
2345 2350 2355
Lys Ala Arg Val Arg Asp Val Met Ser Gly Ser Asp Lys Gln Leu
2360 2365 2370
Lau Asp Gln Thr Leu Trp Ala Gln Ser Gly Leu Phe Ala Leu Gln
2375 2380 2385
Val Gly Leu Trp Glu Leu Leu Gly Ser Trp Gly Val Arg Pro Gly
2390 2395 2400
Val Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Ala Phe Ala
2405 2410 2415
Ala Gly Val Leu Ala Leu Pro Asp Ala Ala Arg Leu Val Ala Gly
2420 2425 2430
Arg Ala Arg Leu Met Gln Ala Leu Pro Pro Gly Gly Ala Met Leu
2435 2440 2445
Ala Ala Ala Ala Gly Glu Lys Glu Leu Arg Pro Leu Leu Ala Asp
2450 2455 2460
Arg Ala Asp Arg Val Gly Ile Ala Ala Val Asn Ala Pro Glu Ser
2465 2470 2475
Val Val Leu Ser Gly Asp Arg Asp Ala Leu Asp Asp Ile Ala Gly
2480 2485 2490
Arg Leu Asp Gly Gln Gly Val Arg Ser Arg Trp Leu Arg Val Ser
2495 2500 2505
His Ala Phe His Ser His Arg Met Asp Pro Met Leu Glu Glu Phe
2510 2515 2520
Ala Glu Ile Ala Arg Ser Val Asp Tyr Arg Ser Pro Gly Leu Pro
2525 2530 2535
Ala Val Ser Thr Leu Thr Gly Glu Leu Asp Glu Val Gly Met Met
2540 2545 2550
Ala Thr Pro Glu Tyr Trp Val Arg Gln Val Arg Glu Pro Val Arg
2555 2560 2565
Phe Ala Asp Gly Val Ala Ala Leu Ala Ala His Gly Val Ser Ser
2570 2575 2580
Ile Val Glu Val Gly Pro Asp Gly Val Leu Ser Ala Leu Val Gln
2585 2590 2595
Glu Cys Ala Ala Gly Ser Asp Gln Gly Gly Arg Val Ala Ala Val
2600 2605 2610
Pro Leu Met Arg Ser Asn Cys Asp Glu Ala Gln Lys Val Ile Thr
2615 2620 2625
Ala Leu Ala Gln Val His Ala Arg Gly Ala Glu Val Asp Trp Arg
2630 2635 2640
Ser Phe Phe Ala Gly Thr Gly Ala Lys Gln Val Glu Leu Pro Thr
2645 2650 2655
Tyr Ala Phe Gln Arg Gln Arg Tyr Trp Leu Asp Ser Pro Ser Glu
2660 2665 2670
Pro Val Gly Gln Ser Ala Asp Leu Ala Pro Gln Ser Gly Phe Trp
2675 2680 2685
Glu Leu Val Glu Gln Glu Asp Val Ser Ala Leu Ser Ala Ala Leu
2690 2695 2700
Asn Ile Thr Gly Asp Pro Asp Val Gln Ala Ser Leu Glu Ser Val
2705 2710 2715
Val Pro Val Leu Ser Ser Trp His Arg Arg Ile Arg Asn Glu Ser
2720 2725 2730
Leu Val His Gln Trp Arg Tyr Arg Ile Ser Trp His Glu Arg Ala
2735 2740 2745
Asp Leu Pro Asp Arg Ser Leu Ser Gly Thr Trp Leu Val Val Val
2750 2755 2760
Pro Glu Gly Trp Ser Thr Ser Gln Gln Val Leu Arg Phe Arg Glu
2765 2770 2775
Met Phe Glu Glu Arg Gly Cys Ala Ala Val Leu Phe Glu Leu Ala
2780 2785 2790
Gly His Asp Glu Glu Ala Leu Val Gln Arg Phe Arg Ser Leu Pro
2795 2800 2805
Val Ala Ser Gly Gly Ile Ser Gly Val Leu Ser Leu Leu Ala Leu
2810 2815 2820
Asp Glu Ser Pro Ser Ser Ser Asn Ala Ala Leu Pro Asn Gly Ala
2825 2830 2835
Leu Asn Ser Leu Val Leu Leu Arg Ala Leu Arg Thr Ala Asp Val
2840 2845 2850
Pro Ala Pro Leu Trp Leu Ala Thr Cys Gly Gly Val Ala Val Gly
2855 2860 2865
Asp Val Pro Val Asn Pro Gly Gln Ala Leu Met Trp Gly Leu Gly
2870 2875 2880
Arg Val Val Gly Leu Glu Asn Pro Asp Trp Trp Gly Gly Leu Val
2885 2890 2895
Asp Val Pro Asp Leu Leu Asp Lys Asp Ala Gln Glu Arg Leu Ser
2900 2905 2910
Val Val Leu Ala Gly Leu Gly Glu Asp Glu Ile Ala Val Arg Pro
2915 2920 2925
Asp Gly Val Phe Val Arg Arg Leu Glu Arg Ala Asp Leu Pro Asp
2930 2935 2940
Met Gly Ser Ala Trp Arg Pro Arg Gly Thr Val Leu Val Thr Gly
2945 2950 2955
Gly Thr Gly Gly Leu Gly Ala His Val Ala Arg Trp Leu Ala Gly
2960 2965 2970
Ala Gly Ala Glu His Val Val Leu Thr Ser Arg Arg Gly Ala Glu
2975 2980 2985
Ala Pro Gly Ala Gly Asp Leu Arg Ala Glu Leu Glu Ala Leu Gly
2990 2995 3000
Ala Arg Val Ser Ile Arg Ser Cys Asp Val Ala Asp Arg Asp Ala
3005 3010 3015
Leu Ala Glu Val Leu Ala Thr Ile Pro Asp Asp Cys Pro Leu Thr
3020 3025 3030
Ala Val Met His Ala Ala Gly Val Val Glu Val Gly Asp Val Ala
3035 3040 3045
Ser Met Cys Leu Thr Asp Phe Ile Gly Val Leu Ser Ala Lys Val
3050 3055 3060
Gly Gly Ala Ala Asn Leu Asp Glu Leu Leu Ala Asp Val Glu Leu
3065 3070 3075
Asp Ala Phe Val Leu Phe Ser Ser Val Ser Gly Val Trp Gly Ala
3080 3085 3090
Gly Gly Gln Gly Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala
3095 3100 3105
Leu Ala Gln Gln Arg Arg Ala Arg Gly Leu Ala Gly Thr Ala Val
3110 3115 3120
Ala Trp Gly Pro Trp Ala Gly Asp Gly Met Ala Ala Gly Glu Gly
3125 3130 3135
Gly Ala Gln Leu Arg Arg Thr Gly Leu Val Pro Met Ala Ala Asp
3140 3145 3150
Arg Ala Leu Leu Ala Leu Gln Gly Ala Leu Asp Arg Asp Glu Thr
3155 3160 3165
Ser Leu Val Val Ala Asp Met Ala Trp Glu Arg Phe Ala Pro Val
3170 3175 3180
Phe Ala Met Ser Arg Arg Arg Pro Leu Leu Asp Glu Leu Pro Glu
3185 3190 3195
Ala Gln Gln Ala Leu Ala Asp Ala Glu Asn Thr Thr Gly Ala Ala
3200 3205 3210
Asp Ser Ala Gly Pro Leu Gln Arg Ile Val Gly Met Ala Ala Ala
3215 3220 3225
Glu Arg Arg Arg Ala Met Met Glu Leu Val Leu Ala Glu Thr Ser
3230 3235 3240
Ile Val Leu Gly His Asn Gly Ser Asp Ala Val Ser Pro Asp Arg
3245 3250 3255
Ala Phe Gln Glu Leu Gly Phe Asp Ser Leu Met Ala Val Glu Leu
3260 3265 3270
Arg Asn Arg Leu Gly Glu Ala Thr Gly Leu Ser Leu Pro Thr Thr
3275 3280 3285
Leu Ile Phe Asp Tyr Pro Ser Pro Ser Ala Leu Ala Glu Gln Leu
3290 3295 3300
Val Gly Glu Leu Val Gly Ala Gln Pro Ala Thr Thr Val Val Ala
3305 3310 3315
Gly Ala Asp Pro Val Asp Asp Pro Val Val Val Val Ala Met Gly
3320 3325 3330
Cys Arg Tyr Pro Gly Asp Val Cys Ser Pro Glu Glu Leu Trp Gln
3335 3340 3345
Leu Val Ser Ala Gly Arg Asp Ala Val Ser Thr Phe Pro Thr Asp
3350 3355 3360
Arg Gly Trp Asp Cys Asp Ala Leu Phe Asp Pro Asp Pro Asp Arg
3365 3370 3375
Ala Gly Arg Thr Tyr Val Arg Glu Gly Ala Phe Leu Thr Gly Ala
3380 3385 3390
Asp Arg Phe Asp Ala Gly Phe Phe Gly Ile Ser Pro Arg Glu Ala
3395 3400 3405
Arg Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Val Ala Trp
3410 3415 3420
Glu Val Phe Glu Arg Ala Gly Ile Ala Pro Leu Ser Leu Arg Gly
3425 3430 3435
Ser Arg Thr Gly Val Phe Ala Gly Thr Asn Gly Gln Asp His Gly
3440 3445 3450
Ala Lys Val Ala Ala Ala Pro Glu Ala Ala Gly His Leu Leu Thr
3455 3460 3465
Gly Asn Ala Ala Ser Val Met Ala Gly Arg Ile Ser Tyr Thr Phe
3470 3475 3480
Gly Leu Glu Gly Pro Ala Val Ala Val Asp Thr Ala Cys Ser Ser
3485 3490 3495
Ser Leu Val Ala Leu His Leu Ala Cys Gln Ser Leu Arg Ser Gly
3500 3505 3510
Glu Cys Asp Met Ala Leu Ala Gly Gly Val Thr Val Met Ser Thr
3515 3520 3525
Pro Leu Ala Phe Leu Glu Phe Ser Arg Gln Arg Gly Leu Ala Pro
3530 3535 3540
Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly
3545 3550 3555
Trp Gly Glu Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp
3560 3565 3570
Ala Arg Arg Asn Gly His Arg Val Leu Ala Val Val Arg Gly Ser
3575 3580 3585
Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn
3590 3595 3600
Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Asn Ala
3605 3610 3615
Gly Leu Ser Ala Ser Asp Val Asp Val Val Glu Ala His Gly Thr
3620 3625 3630
Gly Thr Gly Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Ile Ala
3635 3640 3645
Ala Tyr Gly Gln Gly Arg Asp Pro Glu Arg Ala Leu Trp Leu Gly
3650 3655 3660
Ser Ile Lys Ser Asn Ile Gly His Thr Gln Ala Ala Ala Gly Val
3665 3670 3675
Ala Gly Val Ile Lys Met Val Gln Ala Met Arg His Gly Glu Leu
3680 3685 3690
Pro Ala Thr Leu His Val Asp Lys Pro Thr Pro Gln Val Asp Trp
3695 3700 3705
Ser Ala Gly Ala Val Arg Leu Leu Thr Gly Asn Thr Pro Trp Pro
3710 3715 3720
Glu Ser Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ile
3725 3730 3735
Ser Gly Thr Asn Ala His Leu Ile Leu Glu Gln Pro Pro Ser Glu
3740 3745 3750
Pro Ala Glu Ile Asp Arg Ser Asn Arg Arg Val Thr Ala His Pro
3755 3760 3765
Ala Val Ile Pro Trp Met Leu Ser Ala Arg Ser Leu Thr Ala Leu
3770 3775 3780
Gln Ala Gln Ala Ala Ala Leu Gln Gly Arg Leu Asp Arg Val Pro
3785 3790 3795
Gly Ala Ser Pro Leu Asp Leu Gly Tyr Ser Leu Ala Thr Thr Arg
3800 3805 3810
Ser Val Leu Asp Glu Arg Ala Val Val Trp Gly Ala Asp Arg Glu
3815 3820 3825
Thr Leu Leu Ser Arg Leu Ala Ala Leu Ala Asp Gly Arg Thr Ala
3830 3835 3840
Pro Gly Val Val Thr Gly Ala Ala Asn Ser Gly Gly Arg Ile Gly
3845 3850 3855
Phe Val Phe Ser Gly Gln Gly Ser Gln Trp Leu Gly Met Gly Lys
3860 3865 3870
Ala Leu Cys Ala Ala Phe Pro Ala Phe Ala Asp Ala Phe Glu Glu
3875 3880 3885
Ala Cys Asp Ala Leu Gly Ala His Leu Gly Ala His Leu Gly Ala
3890 3895 3900
Asp Leu Gly Val Asp Val Arg Gly Val Leu Phe Gly Ala Asp Glu
3905 3910 3915
Gln Val Leu Asp Arg Thr Leu Trp Ala Gln Pro Gly Ile Phe Ala
3920 3925 3930
Val Gln Val Gly Leu Leu Gly Leu Leu Arg Ser Trp Gly Val Arg
3935 3940 3945
Pro Asp Ala Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Ala
3950 3955 3960
His Ala Ala Gly Val Leu Ser Leu Pro Asp Ala Ala Arg Leu Val
3965 3970 3975
Ala Ala Arg Ala Ser Leu Met Gln Ala Leu Pro Thr Gly Gly Ala
3980 3985 3990
Met Leu Ala Val Ala Thr Ser Glu Ala Ala Val Glu Pro Leu Leu
3995 4000 4005
Ala Gly Met Cys Asp Arg Val Ser Ile Ala Ala Ile Asn Gly Pro
4010 4015 4020
Glu Ser Val Val Leu Ser Gly Asp Arg Asp Val Leu Ala Glu Val
4025 4030 4035
Ala Gly Glu Leu Asp Ala Arg Gly Leu Arg Thr Lys Trp Leu Arg
4040 4045 4050
Val Ser His Ala Phe His Ser His Arg Met Gln Pro Ile Leu Asp
4055 4060 4065
Glu Tyr Ala Glu Thr Ala Gly Cys Val Glu Phe Gly Glu Pro Val
4070 4075 4080
Val Pro Ile Val Ser Ala Ala Thr Gly Ala Leu Asp Thr Ala Gly
4085 4090 4095
Leu Met Cys Ala Ala Gly Tyr Trp Val Arg Gln Val Arg Asp Pro
4100 4105 4110
Val Arg Phe Gly Asp Gly Val Gln Ala Leu Val Asp Gln Gly Val
4115 4120 4125
Asp Thr Ile Val Glu Phe Gly Pro Asp Gly Ala Leu Ser Ala Leu
4130 4135 4140
Val Gln Gln Cys Leu Ala Gly Ser Asp Gln Ala Gly Arg Val Ala
4145 4150 4155
Ala Ile Pro Leu Met Arg Arg Asp Arg Asp Glu Val Glu Thr Ala
4160 4165 4170
Val Ala Ala Leu Ala His Val His Val Arg Gly Gly Ala Val Asp
4175 4180 4185
Trp Ser Ala Cys Phe Ala Gly Thr Gly Ala Arg Thr Val Glu Leu
4l90 4l95 4200
Pro Thr Tyr Ala Phe Gln Arg Gln Arg Tyr Trp Leu Ala Gly Gln
4205 4210 4215
Ala Asp Gly Arg Gly Gly Asp Val Val Ala Asp Pro Val Asn Ala
4220 4225 4230
Arg Phe Trp Glu Leu Val Glu Arg Ala Asp Pro Glu Pro Leu Val
4235 4240 4245
Asp Glu Leu Cys Ile Asp Arg Asp Gln Pro Phe Arg Glu Val Leu
4250 4255 4260
Pro Val Leu Ala Ser Trp Arg Glu Lys Gln Arg Gln Lys Ala Val
4265 4270 4275
Thr Asp Ser Trp Arg Tyr Gln Val Arg Trp Arg Ser Val Glu Val
4280 4285 4290
Gln Ser Ala Ala Ser Leu Arg Gly Val Trp Leu Val Val Leu Pro
4295 4300 4305
Ala Asp Gly Leu Arg Asp Gln Pro Ala Ala Val Ile Asp Ala Leu
4310 4315 4320
Ile Ala Arg Gly Ala Glu Val Ala Val Leu Glu Leu Thr Glu Gln
4325 4330 4335
Asp Phe Gln Arg Gly Ala Leu Val Asp Lys Val Arg Ala Val Ile
4340 4345 4350
Ala Asp Arg Thr Glu Val Thr Gly Val Leu Ser Leu Leu Ala Met
4355 4360 4365
Asp Gly Met Pro Cys Ala Glu His Pro His Leu Ser Arg Gly Val
4370 4375 4380
Ala Ala Thr Val Ile Leu Thr Gln Val Leu Gly Asp Ala Gly Val
4385 4390 4395
Ser Ala Pro Leu Trp Leu Ala Thr Thr Gly Gly Val Glu Val Gly
4400 4405 4410
Thr Glu Asp Gly Pro Ala Asp Pro Asp His Gly Leu Ile Trp Gly
4415 4420 4425
Leu Gly Arg Val Val Gly Leu Glu His Pro Gln Arg Trp Gly Gly
4430 4435 4440
Leu Ile Asp Leu Pro Ala Thr Leu Asp Glu Thr Ser Arg Asn Gly
4445 4450 4455
Leu Val Ala Ala Leu Ala Gly Thr Ala Ala Glu Asp Gln Leu Ala
4460 4465 4470
Val Arg Ser Ser Gly Leu Phe Val Arg Arg Val Val Arg Ala Ala
4475 4480 4485
Gln Asn Ser Arg Ser Gly Thr Trp Arg Ser Arg Gly Thr Val Leu
4490 4495 4500
Ile Thr Gly Gly Thr Gly Ala Leu Gly Ala Glu Val Ala Arg Trp
4505 4510 4515
Leu Ala Arg Arg Gly Ala Glu His Leu Val Leu Ile Ser Arg Arg
4520 4525 4530
Gly Pro Glu Ala Pro Gly Ala Ala Asp Leu Gln Ala Glu Leu Thr
4535 4540 4545
Glu Leu Gly Val Lys Val Thr Val Val Ala Cys Asp Val Thr Asp
4550 4555 4560
Gly Asp Glu Leu Arg Ala Val Leu Ala Ala Val Pro Thr Glu His
4565 4570 4575
Pro Leu Ser Ala Val Val His Thr Ala Gly Val Gly Thr Pro Ala
4580 4585 4590
Asn Leu Ala Glu Thr Thr Leu Ala Gln Phe Ala Asp Val Leu Ser
4595 4600 4605
Ala Lys Val Val Gly Ala Ala Asn Leu Asp Arg Leu Leu Gly Gly
4610 4615 4620
Gln Pro Leu Asp Ala Phe Val Leu Phe Ser Ser Ile Ser Gly Val
4625 4630 4635
Trp Gly Ala Gly Gly Gln Gly Ala Tyr Ser Ala Ala Asn Ala Tyr
4640 4645 4650
Leu Asp Ala Leu Ala Glu Arg Arg Arg Ala Cys Gly Arg Pro Ala
4655 4660 4665
Thr Cys Val Ala Trp Gly Pro Trp Ala Gly Ala Gly Met Ala Val
4670 4675 4680
Gln Glu Gly Asn Glu Ala His Leu Arg Arg Arg Gly Leu Val Pro
4685 4690 4695
Met Glu Pro Gln Ser Ala Leu Ser Ala Leu Gln Gln Ala Leu Ser
4700 4705 4710
Arg Arg Glu Thr Ala Ile Thr Val Ala Asp Val Asp Trp Glu Arg
4715 4720 4725
Phe Ala Ala Thr Phe Thr Ala Ala Arg Pro Arg Pro Leu Leu Asp
4730 4735 4740
Glu Ile Val Asp Leu Arg Pro Asn Thr Glu Thr Ala Glu Lys His
4745 4750 4755
Gly Ala Gly Glu Leu Gly Gln Gln Leu Ala Ala Leu Pro Ala Ala
4760 4765 4770
Glu Arg Gly His Leu Leu Leu Glu Val Val Leu Ala Glu Thr Ala
4775 4780 4785
Asn Thr Leu Gly His Asp Ser Ala Glu Ala Val Gln Pro Asp Arg
4790 4795 4800
Thr Phe Ala Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu
4805 4810 4815
Arg Asn Arg Lau Asn Ala Val Thr Gly Leu Arg Leu Pro Pro Thr
4820 4825 4830
Leu Val Phe Asp His Pro Thr Pro Leu Ala Val Ser Glu Gln Leu
483 5 4840 4845
Val Pro Ala Leu Val Ala Glu Pro Gly Asp Gly Ile Glu Ser Leu
4850 4855 4860
Leu Ala Glu Leu Asp Arg Leu Asp Thr Thr Leu Ala Gln Arg Pro
4865 4870 4875
Ser Ile Pro Pro Glu Asp Gln Ala Lys Val Ala Glu Arg Leu Gln
4880 4885 4890
Ala Leu Ile Ala Lys Trp Asp Gly Ala Arg Asp Gly Thr Ala Lys
4895 4900 4905
Val Thr Ser Pro Gln Ser Leu Thr Ala Ala Thr Asp Asp Glu Ile
4910 4915 4920
Phe Asp Leu Ile Asp Arg Lys Phe Arg Arg
4925 4930
<210>7
<211>5564
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>7
Met Ala Asr Glu Glu Lys Leu Arg Glu Tyr Leu Lys Arg Val Val Val
1 5 10 15
Glu Leu Glu Glu Ala His Glu Arg Leu His Glu Leu Glu Arg Gln Glu
20 25 30
His Asp Pro Ile Ala Ile Val Ser Met Gly Cys Arg Tyr Pro Gly Gly
35 40 45
Val Ser Thr Pro Glu Glu Leu Trp Arg Leu Val Val Asp Gly Gly Asp
50 55 60
Ala Ile Ala Asn Phe Pro Glu Asp Arg Gly Trp Asn Leu Gly Glu Leu
65 70 75 80
Phe Asp Pro Asp Pro Gly Arg Ala Gly Thr Ser Tyr Val Arg Glu Gly
85 90 95
Gly Phe Leu Arg Gly Val Ala Asp Phe Asp Ala Gly Leu Phe Gly Ile
100 105 110
Ser Pro Arg Glu Ala Gln Ala Met Asp Pro Gln Gln Arg Leu Leu Leu
115 120 125
Glu Ile Ser Trp Glu Val Leu Glu Arg Ala Gly Ile Asp Pro Phe Ser
130 135 140
Leu Arg Gly Thr Lys Thr Ser Val Phe Ala Gly Leu Ile Tyr His Asp
145 150 155 160
Tyr Ala Ser Arg Phe Ser Lys Thr Pro Ala Glu Phe Glu Gly Tyr Phe
165 170 175
Ala Thr Gly Asn Ala Gly Ser Val Ala Ser Gly Arg Val Ala Tyr Thr
180 185 190
Phe Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser
195 200 205
Ser Leu Val Ala Leu His Leu Ala Cys Gln Ser Leu Arg Leu Gly Glu
210 215 220
Cys Asp Leu Ala Leu Ala Gly Gly Ile Ser Val Met Ala Thr Pro Gly
225 230 235 240
Ala Phe Val Glu Phe Ser Arg Gln Arg Ala Leu Ala Ser Asp Gly Arg
245 250 255
Cys Lys Pro Phe Ala Asp Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly
260 265 270
Ala Gly Met Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asr Gly
275 280 285
His Pro Val Leu Ala Ala Val Val Gly Ser Ala Ile Asn Gln Asp Gly
290 295 300
Met Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gln Gln Arg Val
305 310 315 320
Ile Arg Gln Ala Leu Thr Asn Ala Gly Leu Ser Pro Ala Glu Val Asp
325 330 335
Val Val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro Ile Glu
340 345 350
Ala Arg Ala Leu Ile Ala Thr Tyr Gly Ala Asn Arg Ser Ala Asp His
355 360 365
Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn Ile Gly His Thr Gln Ala
370 375 380
Ala Ala Gly Val Ala Gly Val Ile Lys Ser Val Met Ala Ile Arg His
385 390 395 400
Arg Glu Met Pro Arg Ser Leu His Ile Asp Gln Pro Ser Arg His Val
405 410 415
Asp Trp Ser Ala Gly Ala Val Arg Leu Leu Thr Asp Ser Val Asp Trp
420 425 430
Ala Asp Pro Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Met
435 440 445
Ser Gly Thr Asn Ala His Leu Ile Val Glu Glu Val Ser Asp Glu Pro
450 455 460
Val Ser Gly Ser Thr Glu Pro Thr Gly Ala Leu Pro Trp Pro Leu Ser
465 470 475 480
Gly Lys Thr Glu Thr Ala Leu Arg Glu Gln Ala Ala Glu Leu Leu Ser
485 490 495
Ala Val Thr Ala His Pro Glu Pro Gly Leu Gly Asn Val Gly Tyr Ser
500 505 510
Leu Ala Thr Gly Arg Ala Ala Met Glu His Arg Ala Val Val Val Ala
515 520 525
Glu Asp Arg Asp Ser Phe Val Ala Gly Leu Thr Ala Leu Ala Ala Gly
530 535 540
Val Pro Ala Ala Asn Val Val Gln Gly Ala Ala Asp Cys Lys Gly Lys
545 550 555 560
Val Ala Phe Val Phe Pro Gly Gln Gly Ser His Trp Gln Gly Met Ala
565 570 575
Arg Glu Leu Phe Glu Ser Ser Pro Val Phe Arg Arg Lys Leu Glu GIu
580 585 590
Cys Ala Ala Ala Thr Ala Pro Tyr Val Asp Trp Ser Leu Leu Gly Val
595 600 605
Leu Arg Gly Asp Pro Asp Ala Pro Ala Leu Asp Arg Asp Asp Val Ile
610 615 620
Gln Phe Ala Leu Phe Ala Met Met Val Ser Leu Ala Glu Leu Trp Arg
625 630 635 640
Ser Cys Gly Val Glu Pro Ala Ala Val Val Gly His Ser Gln Gly Glu
645 650 655
Ile Ala Ala Ala His Val Ala Gly Ala Leu Ser Leu Thr Asp Ala Val
660 665 670
Arg Ile Val Ala Ala Arg Cys Asn Ala Val ser Val Leu Ala Gly Lys
675 680 685
Gly Gly Met Leu Ala Ile Ala Leu Pro Glu Ser Ala Val Val Lys Arg
690 695 700
Ile Ala Gly Leu Pro Glu Leu Thr Val Ala Ala Val Asn Gly Pro Gly
705 710 715 720
Ser Thr Val Val Ser Gly Glu Pro Ser Ala Leu Glu Arg Leu Gln Thr
725 730 735
Glu Leu Ser Ala Glu Asn Val Gln Ala Arg Arg Val Arg Ile Asp Tyr
740 745 750
Ala Ser His Ser Ala Gln Ile Ala Gln Val Gln Gly Arg Leu Leu Asp
755 760 765
Arg Leu Gly Glu Val Gly Ser Glu Pro Ala Glu Ile Ala Phe Tyr Ser
770 775 780
Thr Val Thr Gly Glu Arg Thr Asp Thr Gly Arg Leu Asp Ala Asp Tyr
785 790 795 800
Trp Tyr Gln Asn Leu Arg Gln Pro Val Arg Phe Gln Gln Thr Val Ala
805 810 815
Arg Met Ala Asp Gln Gly Tyr Arg Phe Phe Val Glu Val Ser Pro His
820 825 830
Pro Leu Leu Thr Ala Gly Ile Gln Glu Thr Leu Glu Ala Ala Asp Ala
835 840 845
Asp Ala Gly Gly Val Val Val Gly Ser Leu Arg Gly Gly Glu Gly Gly
850 855 860
Ser Arg Arg Trp Leu Thr Ser Leu Ala Glu Cys Gln Val Arg Gly Leu
865 870 875 880
Pro Val Asn Trp Glu Gln Val Phe Leu Asp Thr Gly Ala Arg Arg Val
885 890 895
Pro Leu Pro Thr Tyr Pro Phe Gln Arg Gln Arg Tyr Trp Leu Glu Ser
900 905 910
Ala Glu Tyr Asp Ala Gly Asp Leu Gly Ser Val Gly Leu Arg Ser Ala
915 920 925
Glu His Pro Leu Leu Gly Ala Ala Val Thr Leu Ala Asp Ala Gly Gly
930 935 940
Phe Leu Leu Thr Gly Lys Leu Ser Val Lys Thr Gln Pro Trp Leu Ala
945 950 955 960
Asp His Ala Val Arg Gly Ala Ile Leu Leu Pro Gly Thr Ala Phe Val
965 970 975
Glu Met Leu Ile Arg Ala Ala Asp Gln Val Gly Cys Asp Leu Ile Glu
980 985 990
Glu Leu Ser Leu Thr Thr Pro Leu Val Leu Pro Ala Thr Gly Ala Val
995 1000 1005
Gln Val Gln Ile Ala Val Gly Gly Pro Asp Glu Ala Gly Arg Arg
1010 1015 1020
Ser Val Arg Val His Ser Cys Arg Asp Asp Ser Val Pro Gln Asp
1025 1030 1035
Ser Trp Thr Cys His Ala Thr Gly Thr Leu Thr Thr Ser Glu His
1040 1045 1050
Arg Asp Ala Gly Gln Ala Arg Asp Gly Ile Trp Pro Pro Asn Asp
1055 1060 1065
Ala Val Ala Val Pro Leu Asp Ser Phe Tyr Ala Arg Ala Ala Glu
1070 1075 1080
Arg Gly Phe Asp Phe Gly Pro Ala Phe Gln Gly Leu Gln Ala Val
1085 1090 1095
Trp Lys Arg Gly Asp Glu Ile Phe Ala Glu Val Gly Leu Pro Ala
1100 1105 1110
Ala Gln Arg Glu Asp Ala Gly Arg Phe Gly Val His Pro Ala Leu
1115 1120 1125
Leu Asp Ala Ala Leu Gln Ala Leu Gly Ala Ala Glu Glu Asp Pro
1130 1135 1140
Asp Glu Gly Trp Leu Pro Phe Ala Trp Gln Gly Val Ser Leu Lys
1145 1150 1155
Ala Thr Gly Ala Leu Ser Leu Arg Val His Ile Val Pro Ala Gly
1160 1165 1170
Ala Asn Ala Val Ser Val Phe Thr Thr Asp Ala Thr Gly Gln Ala
1175 1180 1185
Val Leu Ser Ile Asp Ssr Leu Val Leu Arg Lys Ile Ser Asp Glu
1190 1195 1200
Gln Leu Ala Ala Val Arg Ala Met Asp His Glu Ser Leu Phe Arg
1205 1210 1215
Val Asp Trp Arg Arg Ile Ser Pro Gly Ala Ala Lys Pro Val Ser
1220 1225 1230
Trp Ala Val Ile Gly Asn Asp Glu Leu Ala Arg Ala Cys Gly Ser
1235 1240 1245
Ala Leu Gly Thr Glu Leu His Pro Asp Leu Thr Gly Leu Ala Asp
1250 1255 1260
Pro Pro Pro Asp Val Val Val Val Pro Cys Gly Ala Phe His Gln
1265 1270 1275
Asp Leu Glu Val Ala Ser Glu Ala Arg Ala Ala Thr Gln Arg Val
1280 1285 1290
Leu Asp Leu Ile Gln Gly Trp Leu Ala Ala Glu Arg Phe Ala Gly
1295 1300 1305
Ser Arg Leu Val Val Val Thr Cys Gly Ala Val Ser Thr Gly Pro
1310 1315 1320
Ala Glu Gly Val Ser Asp Leu Val His Ala Ala Ser Trp Gly Leu
1325 1330 1335
Leu Arg Ser Ala Gln Ser Glu Asn Pro Asn Arg Phe Val Leu Val
1340 1345 1350
Asp Val Asp Ala Thr Ala Glu Ser Trp Arg Ala Leu Ala Ala Ala
1355 1360 1365
Val Arg Ser Gly Glu Pro Gln Leu Ala Leu Arg Ala Gly Glu Val
1370 1375 1380
Arg Val Pro Arg Leu Thr Arg Cys Val Ala Ala Glu Asp Ser Arg
1385 1390 1395
Ile Pro Val Pro Gly Ala Asp Gly Thr Val Leu Ile Ser Gly Gly
1400 1405 1410
Thr Gly Leu Leu Gly Gly Leu Val Ala Arg His Leu Val Ala Glu
1415 1420 1425
Arg Gly Val Arg Arg Leu Val Leu Ala Gly Arg Arg Gly Trp Ser
1430 1435 1440
Ala Pro Gly Val Thr Glu Leu Val Asp Glu Leu Val Gly Leu Gly
1445 1450 1455
Ala Val Val Glu Val Ala Ser Cys Asp Val Gly Asp Arg Ala Gln
1460 1465 1470
Leu Asp Arg Leu Leu Thr Thr Ile Ser Ala Glu Phe Pro Leu Arg
1475 1480 1485
Gly Val Val His Ala Ala Gly Ala Leu Ala Asp Gly Val Val Glu
1490 1495 1500
Ser Leu Thr Pro Glu His Val Ala Lys Val Phe Gly Pro Lys Val
1505 1510 1515
Ala Gly Ala Trp His Leu His Glu Leu Thr Arg Glu Leu Asp Leu
1520 1525 1530
Ser Phe Phe Val Leu Phe Ser Ser Phe Ser Gly Val Val Gly Ala
1535 1540 1545
Ala Gly Gln Gly Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp Gly
1550 1555 1560
Leu Ala Gln His Arg Arg Thr Ala Gly Leu Pro Ala Val Ser Leu
1565 1570 1575
Ala Trp Gly Leu Trp Glu Pro Thr Ser Gly Met Thr Gly Ala Leu
1580 1585 1590
Asp Ala Ala Asp Arg Ser Arg Ile Ser Arg Thr Asn Pro Pro Met
1595 1600 1605
Ser Ala Glu Asp Gly Leu Arg Leu Phe Glu Met Ala Phe His Val
1610 1615 1620
Pro Gly Glu Ser Leu Leu Val Pro Val His Ile Asp Leu Asn Ala
1625 1630 1635
Leu Arg Ala Asp Ala Ala Asp Gly Gly Val Pro Ala Leu Leu His
1640 1645 1650
Asp Leu Val Pro Ala Pro Val Arg Arg Ser Ala Val Asn Glu Ser
1655 1660 1665
Glu Asp Val Thr Gly Leu Val Gly Arg Leu Arg Arg Leu Pro Asp
1670 1675 1680
Leu Asp Gln Glu Thr Leu Leu Leu Gly Leu Val Arg Glu His Val
1685 1690 1695
Ser Ala Val Leu Gly Tyr Ser Gly Ala Val Glu Val Gly Val Glu
1700 1705 1710
Arg Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly Val Glu
1715 1720 1725
Leu Arg Asn Arg Leu Gly Gly Val Leu Gly Val Arg Leu Pro Ala
1730 1735 1740
Thr Ala Val Phe Asp Tyr Pro Thr Pro Arg Ala Leu Val Arg Phe
1745 1750 1755
Leu Arg Asp Lys Leu Ile Gly Gly Val Glu Ala Arg Asn Ser Ala
1760 1765 1770
Pro Ala Val Val Glu Ala Ala Ser Gly Asp Asp Pro Val Val Ile
1775 1780 1785
Val Gly Met Gly Cys Arg Phe Pro Gly Gly Val Ser Ser Pro Glu
1790 1795 1800
Glu Leu Trp Arg Leu Val Ala Gly Gly Leu Asp Ala Val Ala Glu
1805 1810 1815
Phe Pro Asp Asp Arg Gly Trp Asp Gln Ala Gly Leu Phe Asp Pro
1820 1825 183 0
Asp Pro Asp Arg Leu Gly Thr Ser Tyr Val Cys Glu Gly Gly Phe
1835 1840 1845
Leu Arg Asp Ala Ala Glu Phe Asp Ala Gly Phe Phe Gly Ile Ser
1850 1855 1860
Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu
1865 1870 1875
Glu Ile Ala Trp Glu Thr Leu Glu Arg Ala Gly Ile Asp Pro Leu
1880 1885 1890
Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met His
1895 1900 1905
His Asp Tyr Gly Ala Arg Phe Val Thr Arg Ala Pro Glu Gly Phe
1910 1915 1920
Glu Gly Tyr Leu Gly Asn Gly Ser Ala Gly Gly Val Phe Ser Gly
1925 1930 1935
Arg Val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val
1940 1945 1950
Asp Thr Ala Cys Ser Ser Ser Leu Val Ser Met His Leu Ala Gly
1955 1960 1965
Gln Ala Leu Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly
1970 1975 1980
Val Thr Val Met Ala Thr Pro Gly Met Phe Val Glu Phe Ser Arg
1985 1990 1995
Gln Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ser Phe Ala Ala
2000 2005 2010
Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly Leu Val Leu
2015 2020 2025
Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Ala Val Leu
2030 2035 2040
Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn
2045 2050 2055
Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Thr
2060 2065 2070
Gln Ala Leu Ala Ser Ala Gly Leu Ser Val Ser Asp Val Asp Ala
2075 2080 2085
Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu
2090 2095 2100
Ala Gln Ala Leu Ile Ala Thr Tyr Gly Gln Glu Arg Asp Arg Asp
2105 2110 2115
Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr
2120 2125 2130
Gln Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Val Met Ala
2135 2140 2145
Met Arg His Glu Gln Leu Pro Ala Thr Leu His Val Asp Glu Pro
2150 2155 2160
Thr Pro Glu Val Asp Trp Ser Ala Gly Glu Val Gln Leu Leu Thr
2165 2170 2175
Glu Asn Thr Pro Trp Pro Asp Ser Gly His Pro Arg Arg Ala Gly
2180 2185 2190
Val Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu
2195 2200 2205
Glu Gln Ala Ser Asn Thr Pro Asp Glu Ile Ala Gln Ser Asn Gly
2210 2215 2220
Pro Glu Ser Glu Ser Thr Val Asp Ile Pro Ala Val Pro LeuIle
2225 2230 2235
Val Ser Gly Arg Thr Pro Glu Ala Leu Ser Ala Gln Ala Ser Ala
2240 2245 2250
Leu Met Ser Tyr Leu Asp Asn Arg Pro Asp Ile Ser Ser Leu Asp
2255 2260 2265
Ala Ala Phe Ser Leu Ala Ser Ser Arg Ala Ala Leu Glu Glu Arg
2270 2275 2280
Ala Val Val Leu Gly Ala Asp Arg Glu Ala Leu Leu Ser Gly Leu
2285 2290 2295
Glu Ala Leu Ala Ala Gly Arg Asp Ala Ser Gly Val Val Ser Gly
2300 2305 2310
Ser Leu Ile Ser Gly Gly Val Gly Phe Val Phe Ser Gly Gln Gly
2315 2320 2325
Gly Gln Trp Leu Gly Met Gly Arg Gly Leu Tyr Ser Ala Phe Pro
2330 2335 2340
Val Phe Ala Asp Ala Phe Asp Glu Ala Cys Ala Gly Leu Asp Ala
2345 2350 2355
His Leu Gly Gln Gln Val Gly Val Arg Asp Val Val Phe Gly Ser
2360 2365 2370
Asp Gly Ser Leu Leu Asp Arg Thr Leu Trp Ala Gln Ser Gly Leu
2375 2380 2385
Phe Ala Leu Gln Val Gly Leu Leu Arg Leu Leu Gly Ser Trp Gly
2390 2395 2400
Val Arg Pro Gly Val Val Met Gly His Ser Val Gly Glu Phe Ala
2405 2410 2415
Ala Ala Phe Ala Ala Gly Val Leu Ser Leu Pro Asp Ala Ala Arg
2420 2425 2430
Leu Val Ala Gly Arg Ala Arg Leu Met Gln Ala Leu Pro Asp Gly
2435 2440 2445
Gly Ala Met Leu Ala Val Ala Ala Gly Glu Glu Gln Leu Arg Pro
2450 2455 2460
Leu Leu Ala Ala Arg Gly Glu Gly Val Gly Ile Ala Ala Val Asn
2465 2470 2475
Ala Ser Glu Ser Val Val Leu Ser Gly Asp Arg Glu Val Leu Glu
2480 2485 2490
Asp Ile Ala Gly Gly Leu Asp Gly Gln Gly Val Arg Trp Arg Trp
2495 2500 2505
Leu Arg Val Ser His Ala Phe His Ser Tyr Arg Met Asp Pro Met
2510 2515 2520
Leu Gln Glu Phe Thr Asp Ile Ala Gly Ser Val Asp Tyr Arg Arg
2525 2530 2535
Cys Asp Leu Pro Val Val Ser Thr Leu Thr Gly Glu Leu Asp Thr
2540 2545 2550
Ala Gly Met Leu Ala Thr Pro Gly Tyr Trp Val Arg Gln Val Arg
2555 2560 2565
Glu Pro Val Arg Phe Ala Asp Gly Val Arg Ala Leu Ala Gln Gln
2570 2575 2580
Gly Val Gly Thr Ile Phe Glu Leu Gly Pro Asp Ala Ile Leu Ser
2585 2590 2595
Ala Leu Ile Pro Asp Cys His Ser Trp Gly Asp Gln Thr Val Pro
2600 2605 2610
Ile Pro Leu Leu Arg Lys Asp Arg Ala Glu Pro Glu Thr Val Val
2615 2620 2625
Ala Ala Val Ala Arg Ala His Thr Arg Gly Val Gln Val Asp Trp
2630 2635 2640
Ser Ala Phe Phe Ala Gly Thr Gly Ala Gly Arg Val Glu Leu Pro
2645 2650 2655
Thr Tyr Ala Phe Gln Arg Gln Arg Tyr Trp Leu Glu Ser Ser Val
2660 2665 2670
Ser Gly Asp Val Thr Gly Ile Gly Leu Ala Gly Ala Glu His Pro
2675 2680 2685
Leu Lau Gly Ala Val Val Val Leu Ala Asp Gly Asp Gly Met Val
2690 2695 2700
Leu Thr Gly Arg Leu Ser Val Gly Thr His Arg Trp Leu Ala Glu
2705 2710 2715
His Arg Val Leu Gly Glu Val Val Val Pro Gly Thr Ala Ile Leu
2720 2725 2730
Glu Met Val Leu His Ala Gly Ala Arg Val Gly Cys Gly Arg Val
2735 2740 2745
Glu Glu Leu Thr Leu Glu Ala Pro Leu Val Val Pro Glu Arg Asp
2750 2755 2760
Ala Ile Glu Ile Gln Leu Leu Val Asn Ala Pro Asp Asp Lys Gly
2765 2770 2775
Arg Arg Ser Val Ser Leu His Ser Arg Pro Ala Gly Gly Ser Gly
2780 2785 2790
Gly Gly Gly Trp Thr Arg His Ala Thr Gly Glu Leu Val Val Ala
2795 2800 2805
Gly Thr Gly Gly Gly Ala Val Thr Gly Trp Ser Thr Glu Gly Ala
2810 2815 2820
Glu Pro Val Ala Leu Gly Glu Phe Tyr Val Val Gln Ala Gly Asn
2825 2830 2835
Gly Phe Glu Tyr Gly Pro Leu Phe Gln Gly Leu Arg Ala Ala Trp
2840 2845 2850
Arg Arg Gly Gly Glu Val Leu Ala Glu Val Ala Leu pro Ala Ala
2855 2860 2865
Ala Gly Ala Met Ala Gly Phe Leu Ile Asn Pro Ala Leu Leu Asp
2870 2875 2880
Ala Ala Leu Gln Ala Ser Ala Leu Gly Asp Arg Pro Ala Glu Gly
2885 2890 2895
Gly Ala Trp Leu Pro Phe Ser Phe Thr Gly Val Glu Leu Ser Gly
2900 2905 2910
Gln Gly Gly Thr Ile Ser Arg Ala Arg Val Glu Ser Thr Arg Pro
2915 2920 2925
Asp Ala Val Ser Val Ala Val Met Asp Glu Gly Gly Arg Leu Leu
2930 2935 2940
Ala Ser Ile Asp Ser Leu Arg Leu Arg Pro Val Ser Ser Val Arg
2945 2950 2955
Leu Ala Asn Arg Asp Val Val Gly Asp Ala Leu Phe Glu Val Thr
2960 2965 2970
Trp Glu Pro Val Ala Thr Arg Ser Thr Val Ser Gly Arg Trp Ala
2975 2980 2985
Leu Leu Gly Asp Ala Val Gly Gly Met Ala Gly Leu Ile Gly Leu
2990 2995 3000
Ala Pro Gly Ser Val Asp Arg Cys Ala Gly Leu Ala Glu Leu Ala
3005 3010 3015
Gly Asn Leu Asp Ser Gly Ala Leu Val Ala Asp Val Val Val Tyr
3020 3025 3030
Cys Ala Gly Glu Gln Ala Asp Pro Asp Ala Gly Val Ala Ala Leu
3035 3040 3045
Ala Glu Thr Arg Glu Met Leu Ala Leu Val Gln Ser Trp Leu Ala
3050 3055 3060
Glu Glu Arg Leu Ala Gly Ser Arg Leu Val Val Val Thr Cys Gly
3065 3070 3075
Ala Val Thr Thr Ala Ala Gly Asp Gly Ala Ser Lys Leu Ala His
3080 3085 3090
Ala Pro Leu Trp Gly Leu Leu Arg Ser Ala Gln Ser Glu Asn Pro
3095 3100 3105
Gly Arg Phe Val Leu Val Asp Val Asp Gly Thr Ala Glu Ser Trp
3110 3115 3120
Arg Ala Leu Pro Ser Ala Val Gly Ser Met Gln Pro Gln Leu Ala
3125 3130 3135
Val Arg Lys Gly Val Val Thr Val Pro Arg Val Ala Ser Val Pro
3140 3145 3150
Gly Pro Val Glu Val Pro Ala Val Val Ala Gly Pro Asp Arg Thr
3155 3160 3165
Val Leu Ile Ser Gly Gly Thr Gly Leu Leu Gly Gly Val Val Ala
3170 3175 3180
Arg His Leu Val Ala Glu Arg Gly Val Arg Arg Val Val Leu Thr
3185 3190 3195
Gly Arg Arg Gly Trp Asp Ala Pro Gly Ile Thr Glu Leu Val Gly
3200 3205 3210
Glu Leu Glu Gly Phe Gly Ala Val Val Asp Val Val Ala Cys Asp
3215 3220 3225
Val Ala Asp Arg Ala Gly Leu Glu Gly Leu Leu Ala Ala Val Pro
3230 3235 3240
Ala Glu Phe Pro Lsu Cys Gly Val Val His Ala Ala Gly Val Leu
3245 3250 3255
Ala Asp Gly Val Ile Glu Ser Leu Thr Pro Glu Asp Val Gly Ala
3260 3265 3270
Val Phe Gly Pro Lys Ala Ala Gly Ala Trp Asn Leu His Glu Leu
3275 3280 3285
Thr Arg Asp Met Asp Leu Ser Phe Phe Ala Leu Phe Ser Ser Leu
3290 3295 3300
Ser Gly Val Thr Gly Ala Ala Gly Gln Gly Asn Tyr Ala Ala Ala
3305 3310 3315
Asn Thr Phe Leu Asp Ala Leu Ala His Tyr Arg Arg Ala Gln Gly
3320 3325 3330
Leu Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Glu Gln Ser Ser
3335 3340 3345
Gly Met Thr Gly Arg Leu Ser Asp Val Asp Arg Ser Arg Ile Ala
3350 3355 3360
Arg Ser Ser Pro Pro Leu Ser Thr Lys Asp Gly Leu Arg Leu Phe
3365 3370 3375
Asp Ala Gly Leu Ala Leu Asp Arg Ala Ala Val Val Pro Ala Arg
3380 3385 3390
Leu Asp Arg Ala Phe Leu Ala Glu Gln Ala Arg Ser Gly Thr Leu
3395 3400 3405
Pro Ala Met Leu Thr Ala Leu Val Pro Thr Ile Thr Ser Ile Arg
3410 3415 3420
Arg Ser Ser Gly Thr Asp Leu Ala Asp Glu Asp Ala Leu Leu Gly
3425 3430 3435
Val Val Arg Glu His Ala Ala Arg Val Leu Gly Tyr Ser Gly Ala
3440 3445 3450
Ala Glu Val Gly Val Glu Arg Ala Phe Arg Asp Leu Gly Phe Asp
3455 3460 3465
Ser Leu Ser Gly Val Glu Leu Arg Asn Arg Leu Ala Gly Val Leu
3470 3475 3480
Gly Ala Arg Leu Pro Ala Thr Ala Val Phe Asp Tyr Pro Thr Pro
3485 3490 3495
Arg Ala Leu Ala Arg Phe Leu His Gln Glu Leu Ala Gly Glu Val
3500 3505 3510
Gly Thr Thr Pro Ala Pro Val Thr Thr Thr Thr Ala Ser Val Glu
3515 3520 3525
Asp Asp Leu Val Ala Ile Val Gly Met Gly Cys Arg Tyr Pro Gly
3530 3535 3540
Gly Val Ser Ser Pro Glu Glu Leu Trp Arg Leu Val Ala Gly Gly
3545 3550 3555
Val Asp Ala Val Ala Asp Phe Pro Asp Asp Arg Gly Trp Asp Leu
3560 3565 3570
Ala Gly Leu Phe Asp Pro Asp Pro Asp Arg Phe Gly Thr Ser Tyr
3575 3580 3585
Val Arg Glu Gly Gly Phe Leu Arg Asp Ala Ala Glu Phe Asp Ala
3590 3595 3600
Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala Met Asp Pro
3605 3610 3615
Gln Gln Arg Leu Leu Leu Glu Leu Ser Trp Glu Ala Val Glu Arg
3620 3625 3630
Ala Gly Ile Asp Pro Gly Ser Leu Arg Gly Ser Arg Thr Gly Val
3635 3640 3645
Phe Ala Gly Leu Met Tyr His Asp Tyr Ala Gly Arg Phe Ala Ala
3650 3655 3660
Gly Val Pro Glu Gly Phe Glu Gly Tyr Leu Gly Asn Gly Ser Ala
3665 3670 3675
Gly Ser Val Ala Ser Gly Arg Val Ala Tyr Ser Phe Gly Phe Glu
3680 3685 3690
Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val
3695 3700 3705
Ala Leu His Leu Ala Gly Gln Ser Leu Arg Ser Gly Glu Cys Asp
3710 3715 3720
Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Ala Thr
3725 3730 3735
Phe Val Glu Phe Ser Arg Gln Arg Gly Leu Ala Pro Asp Gly Arg
3740 3745 3750
Cys Lys Ser Phe Ala Glu Ala Ala Asp Gly Thr Gly Trp Gly Glu
3755 3760 3765
Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg
3770 3775 3780
Asn Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn
3785 3790 3795
Gln Asp Gly Ala Ser Ash Gly Leu Thr Ala Pro Asn Gly Pro Ser
3800 3805 3810
Gln Gln Arg Val Ile Thr Gln Ala Leu Thr Ser Ala Gly Leu Ser
38l5 3820 3825
Val Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg
3830 3835 3840
Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Ile Ala Thr Tyr Gly
3845 3850 3855
Arg Asp Arg Asp Pro Asp Arg Pro Leu Trp Leu Gly Ser Met Lys
3860 3865 3870
Ser Asn Ile Gly His Thr Gln Ala Ala Ala Gly Val Ala Gly Val
3875 3880 3885
Ile Lys Met Val Met Ala Met Arg His Gly Glu Leu Pro Arg Thr
3890 3895 3900
Leu His Val Gly Glu Pro Thr Ser Glu Val Asp Trp Ser Ala Gly
3905 3910 3915
Ser Val Gln Leu Leu Thr Glu Asn Thr Pro Trp Pro Asp Ser Gly
3920 3925 3930
His Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ile Ser Gly Thr
3935 3940 3945
Asn Ala His Val Ile Leu Glu Gln Ser Pro Thr Ala Ser Ser Glu
3950 3955 3960
Phe Val Glu His Ser Gly Pro Asp Ser Glu Ser Ala Val Asn Val
3965 3970 3975
Pro Val Val Pro Trp Val Val Ser Gly Lys Thr Pro Glu Ala Leu
3980 3985 3990
Ser Ala Gln Ala Asp Thr Leu Val Ser Tyr Leu Asp Asp Arg Ser
3995 4000 4005
Asp Val Ser Ser Arg Asp Val Gly Tyr Ser Leu Ala Met Thr Arg
4010 4015 4020
Ser Ala Leu Asp Glu Arg Ala Val Val Leu Gly Ser Asp Arg Glu
4025 4030 4035
Thr Leu Leu Ser Gly Leu Lys Ala Leu Ala Ala Gly His Glu Ala
4040 4045 4050
Thr Gly Val Val Thr Gly Ser Val Gly Ser Gly Gly Arg Pro Gly
4055 4060 4065
Phe Val Phe Ala Gly Gln Gly Gly Gln Trp Leu Gly Met Gly Arg
4070 4075 4080
Gly Leu Tyr Arg Ala Phe Pro Val Phe Ala Asp Ala Phe Asp Glu
4085 4090 4095
Ala Cys Ala Gly Leu Asp Ala His Leu Gly Gln Glu Val Gly Val
4100 4105 4110
Arg Asp Val Val Phe Gly Ser Asp Ala Gln Leu Leu Asp Arg Thr
4115 4120 4125
Leu Trp Ala Gln Ser Gly Leu Phe Ala Leu Gln Val Gly Leu Leu
4130 4135 4140
Lys Leu Leu Gly Ser Trp Gly Val Arg Pro Val Val Val Leu Gly
4145 4150 4155
His Ser Val Gly Glu Leu Ala Ala Ala Phe Ala Ala Gly Val Leu
4160 4165 4170
Ser Met Ala Glu Ala Ala Arg Leu Val Ala Gly Arg Ala Arg Leu
4175 4180 4185
Met Gln Ala Leu Pro Ser Gly Gly Ala Met Leu Ala Val Ala Ala
4190 4195 4200
Thr Glu Asp Arg Ile Ser Pro Leu Leu Asp Gly Val Arg Asp Arg
4205 4210 4215
Val Gly Val Ala Ala Val Asn Ala Pro Gly Ser Ala Val Leu Ser
4220 4225 4230
Gly His Arg Asp Val Leu Glu Asp Val Val Gly Arg Leu Asp Gly
4235 4240 4245
Leu Gly Val Arg Trp Arg Trp Leu Arg Val Ser His Ala Phe His
4250 4255 4260
Ser Tyr Arg Met Asp Pro Met Leu Asp Glu Phe Ala Asp Ile Ala
4265 4270 4275
Arg Ser Val Asp Tyr Arg Ser Pro Gly Leu Pro Ile Val Ser Thr
4280 4285 4290
Leu Thr Gly Asn Leu Asp Asp Val Gly Val Met Ala Thr Pro Glu
4295 4300 4305
Tyr Trp Val Arg Gln Val Arg Glu Pro Val Arg Phe Ala Asp Gly
4310 4315 4320
Val Gln Ala Leu Val Asn Gln Gly Val Asp Thr Ile Val Glu Leu
4325 4330 4335
Gly Pro Asp Gly Val Leu Ser Ser Leu Val His Glu Cys Val Ser
4340 4345 4350
Glu Ser Gly Arg Val Thr Gly Ile Pro Leu Val Arg Lys Asp Arg
4355 4360 4365
Asp Glu Val Pro Thr Val Leu Ala Ala Leu Ala Gln Ile His Thr
4370 4375 4380
Arg Gly Gly Ala Val Asp Trp Gly Ser Phe Phe Ala Gly Thr Gly
4385 4390 4395
Ala Lys Gln Val Glu Leu Pro Thr Tyr Ala Phe Gln Arg Arg Arg
4400 4405 4410
Tyr Trp Leu Glu Pro Ser Asp Ser Gly Asp Val Thr Gly Ala Gly
4415 4420 4425
Leu Thr Gly Ala Glu His Pro Leu Leu Gly Ala Val Val pro Val
4430 4435 4440
Ala Gly Ala Asp Glu Val Leu Leu Thr Gly Arg Leu Ser Val Gly
4445 4450 4455
Thr His Pro Trp Leu Ala Asp His Arg Val Leu Gly Glu Val Val
4460 4465 4470
Val Pro Gly Thr Ala Leu Leu Glu Met Ala Trp Arg Ala Gly Ser
4475 4480 4485
Gln Val Gly Cys Glu Arg Val Glu Glu Leu Thr Leu Glu Ala Pro
4490 4495 4500
Leu Val Leu Pro Glu Arg Gly Ala Ala Ala Val Gln Leu Ala Val
4505 45l0 4515
Gly Ala Pro Asp Glu Ala Gly Arg Arg Ser Leu Gln Leu Tyr Ser
4520 4525 4530
Arg Gly Ala Asp Glu Asp Gly Asp Trp Arg Arg Ile Ala Ser Gly
4535 4540 4545
Leu Leu Ala Gln Ala Ser Val Val Pro Pro Ala Asp Ser Thr Ala
4550 4555 4560
Trp Pro Pro Asp Gly Ala Val Gln Val Asp Leu Ala Glu Phe Tyr
4565 4570 4575
Glu Arg Leu Ala Glu Arg Gly Leu Thr Tyr Gly Pro Val Phe Gln
4580 4585 4590
Gly Leu Arg Ala Ala Trp Arg Tyr Gly Asp Asp Ile Phe Ala Glu
4595 4600 4605
Leu Ala Val Ser Pro Asp Ala Ala Gly Phe Gly Ile His Pro Ala
4610 46l5 4620
Leu Leu Asp Ala Ala Leu His Ala Met Ala Leu Gly Ala Ser Pro
4625 4630 4635
Asp Ser Glu Ala Arg Leu Pro Phe Ser Trp Ser Gly Ala Gln Leu
4640 4645 4650
Tyr Arg Ala Gly Gly Ala Ala Leu Arg Val Arg Leu Ser Pro Leu
4655 4660 4665
Gly Thr Gly Ala Val Ser Leu Thr Leu Met Asp Ala Ala Gly Gly
4670 4675 4680
Gln Val Ala Ala Val Glu Ser Leu Ser Thr Arg Pro Val Ser Ala
4685 4690 4695
Asp Gln Ile Gly Ala Gly Arg Gly Asp His Glu Arg Leu Leu His
4700 4705 4710
Val Glu Trp Val Arg Pro Ala Glu Ser Ala Gly Met Ser Leu Thr
4715 4720 4725
Ser Cys Ala Val Val Gly Leu Asp Glu Pro Glu Trp His Ala Ala
4730 4735 4740
Leu Lys Ala Thr Gly Val Gln Val Glu Ser His Ala Asp Leu Ala
4745 4750 4755
Ser Leu Ala Thr Glu Val Ala Lys Arg Gly Ser Ala Pro Gly Ala
4760 4765 4770
Val Ile Val Pro Cys Pro Arg Pro Gln Ala Met Glu Glu Leu Pro
4775 4780 4785
Thr Ala Ala Arg Arg Ala Thr Gln Gln Ala Met Ala Leu Leu Gln
4790 4795 4800
Glu Trp Leu Ala Asp Asp Arg Phe Val Ser Thr Arg Leu Ile Leu
4805 4810 4815
Leu Thr His Arg Ala Val Ala Ala Val Ala Gly Glu Asp Val Phe
4820 4825 4830
Asp Leu Val His Ala Pro Leu Trp Gly Leu Val Arg Ser Ala Gln
4835 4840 4845
Ala Glu His Pro Asp Arg Phe Ala Leu Ile Asp Val Asp Glu Ala
4850 4855 4860
Glu Ala Ser Arg Ala Ala Leu Ala Glu Ala Leu Thr Ala Gly Glu
4865 4870 4875
Ala Gln Leu Ala Val Arg Ser Gly Val Val Leu Val Pro Arg Leu
4880 4885 4890
Gly Gln Val Lys Ala Ser Gly Gly Glu Ala Phe Arg Trp Asp Glu
4895 4900 4905
Gly Thr Val Leu Val Thr Gly Gly Thr Gly Gly Leu Gly Ala Leu
4910 4915 4920
Leu Ala Arg His Leu Val Ser Ala His Gly Val Arg His Leu Leu
4925 4930 4935
Leu Ala Ser Arg Arg Gly Leu Ala Ala Pro Gly Ala Asp Glu Leu
4940 4945 4950
Val Ala Glu Leu Glu Gln Ser Gly Ala Asp Val Ala Val Val Ala
4955 4960 4965
Cys Asp Ala Ala Asp Arg Asp Ser Leu Ala Arg Leu Val Ala Ser
4970 4975 4980
Val Pro Ala Glu Asn Pro Leu Arg Ala Val Val His Ala Ala Gly
4985 4990 4995
Val Leu Asp Asp Gly Val Leu Met Ser Met Ser Pro Glu Arg Leu
5000 5005 5010
Asp Ala Val Leu Arg Ser Lys Val Asp Ala Ala Trp Tyr Leu His
5015 5020 5025
Glu Leu Thr Arg Glu Leu Gly Leu Ser Ala Phe Val Leu Phe Ser
5030 5035 5040
Ser Val Ala Gly Leu Leu Gly Gly Ala Gly Gln Ser Asn Tyr Ala
5045 5050 5055
Ala Gly Asn Ala Phe Leu Asp Ala Leu Ala His Cys Arg Gln Ala
5060 5065 5070
Gln Gly Leu Pro Ala Leu Ser Leu Ala Ser Gly Leu Trp Ala Ser
5075 5080 5085
Ile Asp Gly Met Ala Gly Asp Leu Ala Ala Ala Asp Val Glu Arg
5090 5095 5l00
Leu Ser Arg Ala Gly Ile Ala Pro Leu Ser Ala Pro Gly Gly Leu
5105 5110 5115
Ala Leu Phe Asp Ala Ala Ile Arg Ser Asp Glu Pro Leu Leu Ala
5120 5125 5130
Pro Val Arg Leu Asp Val Glu Ala Leu Arg Val Gln Ala Arg Ser
5135 5140 5145
Ala Glu Thr Arg Ile Pro Glu Met Leu His Gly Met Ala Met Gly
5150 5155 5160
Pro Ser Arg Arg Thr Ser Phe Ser Ser Arg Val Glu Pro Leu Gln
5165 5170 5175
Glu Arg Leu Ala Gly Leu Ser Glu Asp Glu Arg Arg Gln Gln Val
5180 5185 5190
Leu Gln Arg Val Arg Ala Asp Ile Ala Val Val Leu Gly His Gly
5195 5200 5205
Lys Ser Asn Asp Val Asp Thr Glu Lys Pro Leu Ala Glu Leu Gly
5210 5215 5220
Phe Asp Ser Leu Thr Ala Ile Glu Leu Arg Asn Arg Leu Ala Thr
5225 5230 5235
Ala Thr Gly Leu Arg Leu Pro Ala Thr Leu Ala Phe Asp His Gly
5240 5245 5250
Thr Ala Ala Ala Leu Ala Trp His Val Cys Ala Gln Leu Gly Thr
5255 5260 5265
Ala Thr Val Pro Ala Pro Arg Arg Thr Asp Asp Asn Asp Ser Ala
5270 5275 5280
Glu Pro Val Arg Ser Leu Phe Gln Gln Ala Tyr Ala Ala Gly Arg
5285 5290 5295
Ile Leu Asp Gly Met Asp Leu Val Lys Val Ala Ala Gln Leu Arg
5300 5305 5310
Pro Val Phe Gly Ser Pro Gly Glu Leu Glu Ser Leu Pro Lys Pro
5315 5320 5325
Val Gln Leu Ser Arg Gly Pro Lys Glu Pro Ala Leu Val Cys Met
5330 5335 5340
Pro Ala Leu Ile Gly Met Pro Pro Ala Gln Gln Tyr Ala Arg Ile
5345 5350 5355
Ala Ala Gly Phe Arg Asp Val Arg Asp Val Ser Val Val Pro Met
5360 5365 5370
pro Gly Phe Val Ala Gly Glu Pro Leu Pro Ser Ala Ile Glu Val
5375 5380 5385
Ala Val Arg Thr Gln Ala Glu Ala Val Leu Gln Glu Phe Ala Gly
5390 5395 5400
Asp Ser Phe Val Leu Val Gly His Ser Ser Gly Gly Trp Leu Ala
5405 5410 5415
His Glu Val Ala Gly Val Leu Glu Arg Arg Gly Val Leu Pro Ala
5420 5425 5430
Gly Val Val Leu Leu Asp Thr Tyr Ile Pro Gly Glu Ile Thr Pro
5435 5440 5445
Arg Phe Ser Ala Ala Met Ala His Arg Thr Tyr Glu Lys Leu Ala
5450 5455 5460
Thr Phe Thr Asp Met Gln Asp Ile Ala Ile Thr Ala Met Gly Gly
5465 5470 5475
Tyr Phe Arg Met Phe Thr Glu Trp Thr Pro Thr Pro Ile Gly Thr
5480 5485 5490
Pro Thr Leu Phe Val Arg Thr Glu Asp Cys Val Ala Asp Pro Glu
5495 5500 5505
Gly Arg Pro Trp Thr Asp Asp Ser Trp Arg Pro Gly Trp Thr Leu
55l0 5515 5520
Ala Asp Ala Thr Val Gln Val Pro Gly Asp His Phe Ser Met Met
5525 5530 5535
Asp Glu His Ser Gly Ser Thr Ala Gln Ala Val Ala Ser Trp Leu
5540 5545 5550
Glu Lys Leu Ser Gln Arg Thr Ala Arg Gln Arg
5555 5560
<210>8
<211>275
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>8
Val Leu Pro Gly Gly Val Pro Thr Ser Gln Gln Val Gly Gln Met Tyr
1 5 10 15
Asp Leu Val Thr Pro Leu Leu Asn Ser Val Ala Gly Gly Pro Cys Ala
20 25 30
Ile His His Gly Tyr Trp Glu Asn Asp Gly Arg Thr Ser Trp Gln Gln
35 40 45
Ala Ala Asp Arg Leu Thr Asp Leu Val Ala Glu Arg Thr Ala Leu Asp
50 55 60
Gly Gly Asn Arg Leu Leu Asp Val Gly Cys Gly Thr Gly Gln Pro Ala
65 70 75 80
Leu Arg Val Ala Arg Asp Asn Ala Ile Arg Ile Thr Gly IIe Thr Val
85 90 95
Ser Gln Val Gln Ala Ala Ile Ala Val Asp Cys Ala Arg Glu Arg Gly
100 105 110
Leu Ser His Gln Val Asp Phe Ser Cys Val Asp Ala Met Ssr Leu Pro
115 120 125
Tyr Pro Asp Asn Ala Phe Asp Ala Ala Trp Ala Ile Gln Ser Leu Leu
130 135 140
Glu Met Ser Glu Pro Asp Arg Ala Ile Arg Glu Ile Val Arg Val Leu
145 150 155 160
Lys Pro Gly Gly Ile Leu Gly Val Thr Glu Val Val Lys Arg Glu Ile
165 170 175
Gly Ser Gly Ile Pro Val Ser Trp Asp Met Trp Pro Thr Gly Leu Arg
180 185 190
Ile Cys Leu Ala Glu Gln Leu Leu Glu Ser Leu Cys Ala Ala Gly Phe
195 200 205
Glu Ile Leu Ala Cys Asp Asp Val Ser Ser Arg Thr Arg Tyr Phe Met
210 215 220
Pro Gln Phe Ala Glu Ala Leu Ala Ala His Gln His Gly Ile Ala Glu
225 230 235 240
Arg Tyr Gly Pro Ala Val Ala Asp Trp Ala Ala Ala Val Cys Asp Tyr
245 250 255
Glu Lys Tyr Ala Asp Asp Met Gly Tyr Ala Ile Leu Thr Ala Arg Lys
260 265 270
Pro Val Gly
275
<210>9
<2ll>390
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>9
Met Arg Val Leu Val Val Pro Leu Pro Tyr Pro Thr His Leu Met Ala
1 5 10 15
Met Val Pro Leu Cys Trp Ala Leu Arg Ala Ser Gly His Glu Val Leu
20 25 30
Val Ala Ala Pro Pro Glu Leu Gln Ala Thr Ala His Gly Ala Gly Leu
35 40 45
Thr Thr Ala Glu Ile Arg Gly Asn Asp Lys Thr Arg Asp Thr Gly Ser
50 55 60
Thr Thr Arg Leu Arg Phe Pro Asn Pro Ala Phe Gly Gln Arg Asp Thr
65 70 75 80
Glu Thr Gly Arg Gln Leu Trp Glu Gln Thr Ala Ser Tyr Val Val Gln
85 90 95
Ser Ser Leu Asp Gln Leu Pr0 Glu Tyr Leu Arg Leu Ala Glu Ala Trp
100 105 110
Arg Pro Ser Val Leu Leu Val Asp Val Cys Ala Leu Ile Gly Arg Val
115 120 125
Leu Gly Gly Leu Leu Asp Leu Pro Val Val Leu His Arg Trp Gly Val
130 135 140
Asp Pro Thr Ala Gly Pro Phe Ser Asp Arg Ala His Glu Leu Leu Asp
145 150 155 160
Pro Val Cys Arg His His Gly Leu Ala Gly Leu Pro Thr Pro Glu Leu
165 170 175
Ile Leu Asp Pro Cys Pro Pro Ser Leu Gln Ala Ser Asp Ala Pro Arg
180 185 190
Gly Val Pro Val Gln Tyr Val Pro Tyr Asn Gly Ser Gly Glu Leu Pro
195 200 205
Ala Trp Gly Ala Ala Arg Thr Ser Ala Arg Arg Val Cys Ile Cys Met
210 215 220
Gly Arg Met Val Leu Asn Ala Thr Gly Pro Ala Pro Leu Leu Arg Ala
225 230 235 240
Val Ala Ala Ala Thr Gly Leu Pro Gly Val Glu Ala Val Ile Ala Val
245 250 255
Pro Pro Glu His Arg Ala Leu Leu Thr Asp Leu Pro Asp Asn Ala Arg
260 265 270
Ile Ala Glu Ser Val Pro Leu Asn Leu Phe Leu Arg Thr Cys Glu Leu
275 280 285
Val Ile Cys Ala Gly Gly Ser Gly Thr Ala Phe Thr Ala Thr Arg Leu
290 295 300
Gly Ile Pro Gln Leu Val Leu Pro Gln Tyr Phe Asp Gln Phe Asp Tyr
305 310 315 320
Ala Arg Asn Leu Thr Ala Ala Gly Ala Gly Ile Cys Leu Pro Asp Glu
325 330 335
Gln Ala Gln Ser Asp His Glu Gln Phe Thr Gly Ser Ile Ala Thr Val
340 345 350
Leu GLy Asp Thr Gly Phe Ala Ala Ala Ala Thr Lys Leu Ser Asp Glu
355 360 365
Ile Thr Ala Met Pro Asn Pro Ala Glu Leu Val Arg Thr Leu Glu Ser
370 375 380
Ser Ala AlaIle Gly Ala
385 390
<210>10
<211>250
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>10
Met Pro Ser Gln Asn Ala Leu Tyr Leu Asp Leu Leu Lys Lys Val Leu
1 5 10 15
Thr Asn Thr Ile Tyr Gly Asp Arg Pro His Thr Asn Val Trp Gln Asp
20 25 30
Asn Thr Asp Tyr Arg Gln Ala Ala Arg Ala Lys Gly Thr Asp Trp Pro
35 40 45
Thr Val Ala His Thr Met Ile Gly Leu Glu Arg Leu Asp Asn Leu Gln
50 55 60
His Cys Val Glu Ala Val Leu Ala Asp Gly Val Pro Gly Asp Phe Ala
65 70 75 80
Glu Thr Gly Val Trp Arg Gly Gly Ala Cys Ile Phe Met Arg Ala Val
85 90 95
Leu Gln Ala Phe Gly Asp Thr Gly Arg Thr Val Trp Val Val Asp Ser
100 105 110
Phe Gln Gly Met Pro Glu Ser Ser Ala Gln Asp His Glu Ser Asp Gln
115 120 125
Ala Met Ala Leu His Glu Tyr Asn Asp Val Leu Gly Val Pro Leu Glu
130 135 140
Thr Val Arg Gln Asn Phe Ala Arg Tyr Gly Leu Leu Asp Glu Gln Val
145 150 155 160
Arg Phe Leu Pro Gly Trp Phe Arg Asp Thr Leu Pro Thr Ala Pro Ile
165 170 175
Gln Glu Leu Ala Val Leu Arg Leu Asp Gly Asp Leu Tyr Glu Ser Thr
180 185 190
Met Asp Ser Leu Arg Asn Leu Tyr Pro Lys Leu Ser Pro Gly Gly Phe
195 200 205
Val Ile Ile Asp Asp Tyr Val Leu Pro Ser Cys Gln Asp Ala Val Lys
2l0 215 220
Gly Phe Arg Ala Glu Leu Gly Ile Thr Glu Pro Ile His Asp Ile Asp
225 230 235 240
Gly Thr Gly Ala Tyr Trp Arg Arg Ser Trp
245 250
<210>11
<211>395
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>11
Met Gly Glu Ile Ala Val Ala Pro Trp Ser Val Val Glu His Leu Leu
1 5 10 15
Leu Ala Ala Gly Ala Gly Thr Glu Arg Leu Gln Glu Ala Val Gln Val
20 25 30
Ala Gly Leu Glu Ala Val Ala Asp Ala Ile Val Asp Glu Leu Val Val
35 40 45
Arg Cys Asp Pro Leu Ser Leu Asp Glu Ser Val Arg Ile Gly Leu Glu
50 55 60
Ile Thr Ser Gly Ala Gln Leu Val Arg Arg Thr Val Glu Leu Asp His
65 70 75 80
Ala Gly Leu Arg Leu Ala Ala Val Ala Glu Ala Pro Ala Val Leu Arg
85 90 95
Phe Asp Ala Val Asp Leu Leu Glu GIy Leu Phe Gly Pro Val Asp Gly
100 105 110
Arg Arg His Asn Ser Arg Glu Val Arg Trp Ser Asp Ser Met Thr Gln
115 120 125
Phe Ser Pro Asp Gln Gly Leu Ala Gly Ala Gln Arg Leu Leu Ala Phe
130 135 140
Arg Asn Lys Val Ser Thr Ala Val His Ala Val Leu Ala Ala Ala Ala
145 150 155 160
Thr Arg Cys Ser Asp Leu Gly Glu Leu Ala Val Arg Tyr Gly Ser Asp
165 170 175
Lys Trp Ala Asp Leu His Trp Tyr Thr Glu His Tyr Glu His His Phe
180 185 190
Ser Arg Phe Gln Asp Val Pro Val Arg Val Leu Glu Ile Gly Ile Gly
195 200 205
Gly Tyr His Ala Pro Glu Leu Gly Gly Ala Ser Leu Arg Met Trp Gln
210 215 220
Arg Tyr Phe Arg Arg Gly Leu Val Tyr Gly Leu Asp Ile Phe Glu Lys
225 230 235 240
Ala Gly Asn Glu Gly His Arg Val Arg Lys Leu Arg Gly Asp Gln Ser
245 250 255
Asp Ala Glu Phe Leu Ala Asp Met Ala Gly Lys Ile Gly Pro Phe Asp
260 265 270
Ile Val Ile Asp Asp Gly Ser His Val Asn Asp His Val Lys Lys Ser
275 280 285
Phe His Ala Leu Phe Pro His Val Arg Pro Gly Gly Leu Tyr Val Ile
290 295 300
Glu Asp Leu Gln Thr Ser Tyr Trp Pro Gly Tyr Gly Gly Arg Asp Thr
305 310 315 320
Glu Pro Ala Ala Gln Arg Thr Ser Ile Asp Met Leu Lys Glu Leu Ile
325 330 335
Asp Gly Leu His Tyr Gln Glu Arg Glu Ser Arg Arg Gly Thr Glu Pro
340 345 350
Cys Tyr Thr Glu Arg Asn Val Ala Ala Leu His Phe Tyr His Asn Leu
355 360 365
Val Phe Val Glu Lys Gly Leu Asn Ala Glu Pro Ala Ala Pro Gly Phe
370 375 380
Val Pro Arg Gln Ala Leu Gly Val Glu Asp Ser
385 390 395
<210>12
<211>539
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>12
Met Ile Ser Ala Glu Gly Glu Gln Ser Gly Pro Val Ser Lys Gly Gly
1 5 10 15
Ala Val Pro Asp Phe His Asp Pro Ala Thr Met Asn Arg Arg Thr Pro
20 25 30
Gly Thr Glu Ile Thr Val Glu Pro Gly Asp Pro Arg Tyr Pro Asp Leu
35 40 45
Val Val Gly His Asn Pro Arg Phe Thr Gly Lys Pro Glu Arg Ile His
50 55 60
Ile Ala Gly Ser Thr Glu Asp Val Val His Ala Val Ala Glu Ala Val
65 70 75 80
Arg Thr Gly Arg Arg Val Gly Val Arg Ser Gly Gly His Cys Phe Glu
85 90 95
Asn Leu Val Ala Asp Pro Ala Ile Arg Val Leu Val Asp Leu Ser Glu
100 105 110
Leu Asr Arg Val Tyr Phe Asp Ser Thr Arg Gly Ala Phe Ala Ile Glu
115 120 125
Ala Gly Ala Ala Leu Gly Gln Val Tyr Arg Thr Leu Phe Lys Asn Trp
130 135 140
Gly Val Thr Ile Pro Thr Gly Ala Cys Pro Gly Val Gly Ala Gly Gly
145 150 155 160
His Ile Pro Gly Gly Gly Tyr Gly Pro Leu Ser Arg Arg Phe Gly Ser
165 170 175
Val Val Asp Tyr Leu Gln Gly Val Glu Val Val Val Val Asp Arg Ala
180 185 190
Gly Glu Val His Ile Val Glu Val Asp Arg Asn Ser Ile Gly Ala Gly
195 200 205
His Asp Leu Trp Trp Ala His Thr Gly Gly Gly Gly Gly Asn Phe Gly
210 215 220
Val Val Thr Arg Phe Trp Leu Arg Ala Pro Asp Val Val Ser Thr Asp
225 230 235 240
Pro Ser Glu Leu Leu Pro Arg Pro Pro Ala Thr Val Leu Leu Arg Ser
245 250 255
Phe His Trp Pro Trp Cys Glu Leu Thr Glu Gln Ser Phe Ala Leu Leu
260 265 270
Leu Arg Asn Phe Gly Thr Trp Tyr Glu Gln His Ser Ala Pro Glu Ser
275 280 285
Thr Gln Leu Gly Leu Phe Ser Thr Leu Val Cys Ala His Arg Gln Ala
290 295 300
Gly Tyr Val Thr Leu Asn Ile His Leu Asp Gly Thr Asp Pro Asn Ala
305 310 315 320
Glu Arg Thr Leu Ala Glu His Leu Ser Ala Ile Asn Asp Gln Val Gly
325 330 335
Val Thr Pro Ala Glu Gly Leu Arg Glu Thr Leu Pro Trp Leu Arg Ser
340 345 350
Thr Gln Val Ser Gly Ser Leu Ala Glu Gly Gly Glu Pro Ser Gly Gln
355 360 365
Arg Thr Lys Val Lys Ala Ala Tyr Leu Arg Thr Gly Leu Ser Glu Ala
370 375 380
Gln Leu Ala Thr Val Tyr Arg Arg Leu Thr Asp Ser Gly Tyr Asp Asn
385 390 395 400
Pro Ala Ala Ala Leu Leu Leu Leu Gly Tyr Gly Gly Arg Ala Asn Ala
405 410 415
Val Ala Pro Ser Ala Thr Ala Leu Ala Gln Arg Asp Ser Val Leu Lys
420 425 430
Ala Leu Phe Val Thr Asn Trp Ser Glu Pro Ala Glu Asp Glu Arg His
435 440 445
Leu Thr Trp Ile Arg Gly Phe Tyr Arg Glu Met Tyr Ala Glu Thr Gly
450 455 460
Gly Val Pro Val Pro Gly Thr Arg Val Asp Gly Ser Tyr Ile Asr Tyr
465 470 475 480
Pro Asp Thr Asp Leu Ala Asp Pro Leu Trp Asn Thr Ser Gly Val Ala
485 490 495
Trp His Asp Leu Tyr Tyr Lys Asp Asn Tyr Pro Arg Leu Gln Arg Ala
500 505 510
Lys Ala Arg Trp Asp Pro Gln Asn Ile Phe Gln His Gly Leu Ser Ile
515 520 525
Lys Pro Pro Glu Arg Leu Ser Pro Gly Gln Pro
530 535
<210>13
<211>397
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>13
Met Ser Ala Thr His Glu Ile Glu Thr Val Glu Arg Ile Ile Leu Ala
1 5 10 15
Ala Gly Ser Ser Ala Ala Ser Leu Ala Glu Leu Thr Thr Glu Leu Gly
20 25 30
Leu Ala Arg Ile Ala Pro Val Leu Ile Glu Glu Ile Leu Phe Arg Ala
35 40 45
Glu Pro Ala Pro Asp Ile Glu Pro Thr Glu Val Ala Val Gln Ile Thr
50 55 60
His Gly Val Glu Thr Val Asp Phe Val Leu Lys Leu Gln Ser Gly Glu
65 70 75 80
Leu Ile Lys Ala Glu Gln Arg Pro Val Gly Asp Val Pro Leu Arg Ile
85 90 95
Gly Tyr Glu Leu Thr Asp Leu Ile Ala Glu Leu Phe Gly Pro Gly Ala
100 105 110
Pro Arg Ala Val Gly Ala Arg Ser Thr Asi Phe Leu Arg Thr Thr Thr
115 120 125
Ser Gly Ser Ile Pro Gly Pro Ser Glu Leu Ser Asp Gly Phe Gln Ala
130 135 140
Ile Ser Ala Val Val Ala Gly Cys Gly His Arg Arg Pro Asp Leu Asp
145 150 155 160
Gln Leu Ala Ser His Tyr Arg Thr Asp Lys Trp Gly Gly Leu His Trp
165 170 175
Phe Thr Pro Leu Tyr Glu Arg His Leu Gly Glu Phe Arg Asp Arg Pro
180 185 190
Val Arg Ile Leu Glu Ile Gly Val Gly Gly Tyr Asn Phe Asp Gly Gly
195 200 205
Gly Gly Glu Ser Leu Lys Met Trp Lys Arg Tyr Phe His Arg Gly Leu
210 215 220
Val Phe Gly Met Asp Val Phe Asp Lys Ser Phe Leu Asp Gln Gln Arg
225 230 235 240
Leu Tyr Thr Val Arg Ala Asp Gln Ser Lys Pro Glu Glu Leu Ala Ala
245 250 255
Val Asp Asp Glu Tyr Gly Pro Phe Asp Ile Ile Ile Asp Asp Gly Ser
260 265 270
His Ile Asn Gly His Val Arg Thr Ser Leu Glu Thr Leu Phe Pro Arg
275 280 285
Leu Arg Ser Gly Gly Val Tyr Val Ile Glu Asp Leu Trp Thr Thr Tyr
290 295 300
Ala Pro Gly Phe Gly Gly Gln Ala Gln Ser Pro Ala Ala Pro Gly Thr
305 310 315 320
Thr Val Ser Leu Leu Lys Asn Leu Leu Glu Gly Val Gln His Glu Glu
325 330 335
Gln Pro His Ala Gly Ser Tyr Glu Pro Ser Tyr Leu Glu Arg Asn Val
340 345 350
Val Gly Leu His Val Tyr His Asn Ile Ala Phe Leu Glu Lys Gly Val
355 360 365
Asn Ala Glu Gly Ala Val Pro Ala Trp Val Pro Arg Ser Leu Asp Asp
370 375 380
Ile Leu His Leu Ala Asp Val Asn Ser Ala Glu Asp Lys
385 390 395
<210>14
<211>283
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>14
Val Glu Ser Ile Phe Asp Ala Leu Ala Gln Gly Arg Ala Leu His His
1 5 10 15
Gly Tyr Trp Ala Gly Gly Tyr Arg Glu Asp Ala Gly Ala Thr Pro Trp
20 25 30
Ser Asp Ala Ala Asp His Leu Thr Asp Leu Phe Ile Asp Lys Ala Ala
35 40 45
Leu Arg Pro Gly Ala His Leu Phe Asp Leu Gly Cys Gly Asn Gly Gln
50 55 60
Pro Val Val Arg Ala Ala Arg Thr Lys Gly Val Arg Val Thr Gly Ile
65 70 75 80
Thr Val Asn Ala Glu His Leu Ala Ala Ala Thr Arg Leu Ala Asn Glu
85 90 95
Thr Gly Leu Ala Asp Ser Leu Arg Phe ASp Leu Val Asp Gly Ala Arg
100 105 110
Leu Pro Tyr Pro Glu Gly Ser Phe His Ala Ala Trp Ala Met Gln Ser
115 120 125
Val Val Gln Ile Val Asp Gln Ala Ala Ala Ile Arg Glu Val His Arg
130 135 140
Ile Leu Glu Pro Gly Gly Gln Phe Val Leu Gly Asp Ile Ile Thr Arg
145 150 155 160
Ala Arg Leu Pro Glu Glu Tyr Ala Ala Val Trp Thr Gly Thr Thr Ala
165 170 175
His Thr Leu Asn Ser Leu Thr Ala Leu Val Ser Glu Ala Gly Phe Glu
180 185 190
Ile Leu Glu Val Thr Asp Leu Thr Ala Gln Thr Arg Cys Met Val Ser
195 200 205
Trp Tyr Val Asp Glu Leu Leu Arg Glu Leu Asp Glu Leu Ala Gly Val
210 215 220
Glu Pro Ala Ala Val Gly Thr Tyr Gln Gln Arg Tyr Leu Gly Asp Ile
225 230 235 240
Ala Ala Lys His Gly Pro Gly Pro Ala Gln Leu Ile Ala Ala Val Ala
245 250 255
Glu Tyr Arg Lys His Pro Asp Tyr Ala Arg Asn Glu Glu Ser Met Gly
260 265 270
Phe Met Leu Leu Gln Ala Arg Lys Lys Gln Ser
275 280
<210>15
<211>310
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>15
Val Pro Asn Ile Pro Trp Pro Gly Glu Asp Arg Pro Ile Ile Thr Phe
1 5 10 15
Ala Val Gly Thr His Gly Leu Gly Ser Gln Val Ala Pro Ser Tyr Leu
20 25 30
Leu Arg Thr Gly Thr Glu Pro Glu Thr Glu Leu Ile Ala Val Ala Leu
35 40 45
Asp Arg Gly Trp Ala Val Val Ile Thr Asp Tyr Glu Gly Leu Gly Thr
50 55 60
Pro Gly Thr His Thr Tyr Thr Val Gly Arg Pro Gln Gly His Ala Met
65 70 75 80
Leu Asp Ala Ala Arg Ala Ala Gln Arg Leu Pro Gly Ser Gly Leu Gly
85 90 95
Thr Asp Cys Pro Val Gly Ile Trp Gly Tyr Ala Gln Gly Gly Gln Ala
100 105 110
Ser Ala Phe Ala Gly Glu Leu His Pro Thr Tyr Ala Pro Glu Leu Pro
115 120 125
Ile Arg Ala Ala Ala Ala Gly Ala Val Pro I1e Asp Leu Leu Asp Ile
130 135 140
Leu His Arg Asn Asp Gly Val Phe Thr G1y Pro Val Leu A1a Gly Leu
145 150 155 160
Val Gly His Ala Ala Ala Tyr Pro Asp Leu Pro Phe Asp Glu Leu Leu
165 170 175
Thr Asp Ala Gly Arg I1e Ala Val Asp Gln Val Arg Glu Leu G1y Ala
180 185 190
Pro Glu Leu Val Thr Arg Phe Leu Gly Arg Glu Leu Ser Asp Phe Leu
195 200 205
Asp Thr Ser Gly Leu Phe Glu His Pro Arg Trp Arg Ala Arg Leu Val
210 215 220
Glu Ser Val Ala Gly Arg Asn Gly Gly Pro Val Val Pro Thr Leu Val
225 230 235 240
Tyr His Ser Thr Asp Asp Glu Ile Val Pro Phe Ala Phe Gly Glu Arg
245 250 255
Leu Arg Asp Ser Tyr Arg Ala Ala Gly Thr Pro Val Arg Trp His Pro
260 265 270
Leu Ser Gly Leu Ala His Phe Pro Ala Ala Leu Ala Ser Ser Arg Val
275 280 285
Val Val Ser Trp Phe Asp Glu His Phe Ser Gly Pro Ser Ala Ile Ser
290 295 300
Gly Pro Arg Asp Asp Gly
305 310
<210>16
<211>332
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>16
Met Arg Lys Pro Val Arg 1le Gly Val Leu Gly Cys Ala Ser Phe Ala
1 5 10 15
Trp Arg Arg Met Leu Pro Ala Met Cys Asp Val Ala Glu Thr Glu Val
20 25 30
Val Ala Val Ala Ser Arg His Pro Ala Lys Ala Glu Arg Phe Ala Ala
35 40 45
Arg Phe Glu Cys Glu Ala Val Leu Gly Tyr Gln Arg Leu Leu Glu Arg
50 55 60
Pro Asp Ile Asp Ala Val Tyr Val Pro Leu Pro Pro Gly Met His Ala
65 70 75 80
Glu Trp Ile Gly Lys Ala Leu Glu Ala Gly Lys His Val Leu Ala Glu
85 90 95
Lys Pro Leu Thr Thr Thr Ala Ser Glu Thr Ala Arg Leu Val Gly Leu
100 105 1l0
Ala Arg Arg Lys His Leu Leu Leu Arg Glu Asn Tyr Leu Phe Leu His
115 120 125
His Gly Arg His Asp Val Val Arg Asp Leu Leu Gln Ser Glu Glu Ile
130 135 140
Gly Glu Leu Arg Glu Phe Thr Ala Val Phe Gly Ile Pro Pro Leu Ser
145 150 155 160
Asp Thr Asp Ile Arg Tyr Arg Thr Glu Leu Gly Gly Gly Ala Leu Leu
165 170 175
Asp Ile Gly Val Tyr Pro Ala Arg Ala Ala Arg Leu Phe Leu Leu Gly
180 185 190
Pro Leu Thr Val Ala Gly Ala Ser Ser His Glu Ala His Glu Ser Gly
195 200 205
Val Asp Leu Ser Gly Ser Val Leu Leu Gln Ser Glu Gly Gly Ala Val
210 215 220
Ala His Leu Gly Tyr Gly Phe Val His His Tyr Arg Ser Ala Tyr Glu
225 230 235 240
Leu Trp Gly Ser Arg Gly Arg Ile Val Ile Asp Arg Ala Phe Thr Pro
245 250 255
Pro Ala Glu Trp Gln Ala Val Ile Arg Ile Glu Arg Lys Gly Val Val
260 265 270
Asp Glu Leu Ser Leu Pro Ala Glu Asp Gln Val Arg Lys Ala Val Thr
275 280 285
Ala Phe Ala Arg Asp Ile Arg Ala Glu Ala Gly Val Asp Glu Pro Ala
290 295 300
Val Ala Gly Asp Ser Gly Glu Ser Met Ile Gln Gln Ala Ala Leu Val
305 310 315 320
Glu Ala Ile Gly Gln Ala Cys Arg Cys Gly Ser Thr
325 330
<210>17
<21l>486
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>17
Met Ser Ser Phe Ala Glu Ala Glu Ala Ser Ala Ala Ala Pro Leu Ser
1 5 10 15
Ser Asn Asn Thr Arg Arg Phe Val Asp Ser Ala Leu Ser Ala Cys Asn
20 25 30
Gly Arg Phe Pro Thr Thr Arg Phe His Cys Trp Leu Ala Asp Arg Leu
35 40 45
Gly Glu Asn Ser Phe Glu Thr Thr Arg Ile Pro Phe Asp Arg Leu Ser
50 55 60
Lys Trp Lys Phe Asp Ala Ser Thr Glu Asn Leu Val His Ala Asp Gly
65 70 75 80
Arg Phe Phe Thr Val Glu Gly Leu Gln Val Glu Thr Asn Tyr Gly Ala
85 90 95
Ala Thr Cys Trp His Gln Pro Ile Ile Asn Gln Ala Glu Val Gly Ile
100 105 110
Leu Gly Ile Leu Val Lys Glu Ile Asp Gly Val Leu His Cys Leu Met
115 120 125
Ser Ala Lys Met Glu Pro Gly Asn Val Asn Val Leu Gln Leu Ser Pro
130 135 140
Thr Val Gln Ala Thr Arg Ser Asn Tyr Thr Gln Ala His Arg Gly Ser
145 150 155 160
Val Pro Pro Tyr Val Asp Tyr Phe Leu Gly Arg Gly Arg Ser Arg Val
165 170 175
Leu Val Asp Val Leu Gln Ser Glu Gln Gly Ala Trp Phe Tyr Arg Lys
180 185 190
Arg Asn Arg Asn Met Val Val Glu Val Asp Glu Glu Val Pro Val Leu
195 200 205
Pro Asp Phe Cys Trp Leu Thr Leu Gly Gln Val Leu Asp Leu Leu Arg
210 215 220
Gln Asp Asn Ile Val Asn Met Asp Thr Arg Thr Val Leu Ser Cys Ile
225 230 235 240
Pro Phe His Asp Ser Ala Thr Gly Pro Gly Leu Ala Ala Ser Ala Glu
245 250 255
Pro Phe Arg Gln Ala Val Ala Arg Ser Leu Ser His GLy Ile Asp Ser
260 265 270
Ala Ser Ile Thr Glu Ala Val Gly Trp Phe Glu Glu Ala Lys Ala Arg
275 280 285
Tyr Ser Leu Arg Ala Thr Arg Val Pro Leu Ser Arg Val Asp Lys Trp
290 295 300
Tyr Arg Thr Asp Thr Glu Ile Ala His Gln Asp Gly Lys Tyr Phe Ser
305 310 315 320
Val Ile Ala Val Ser Val Ser Ala Thr Asn Arg Glu Val Ser Ser Trp
325 330 335
Thr Gln Pro Met Ile Glu Pro Arg Glu Pro Gly Glu Ile Ala Leu Leu
340 345 350
Val Lys Arg Ile Gly Gly Val Leu His Gly Leu Val Arg Ala Arg Val
355 360 365
Glu Ala Gly Tyr Lys Ser Thr Ala Glu Ile Ala Pro Thr Val Gln Cys
370 375 380
Ser Val Ala Asu Tyr Gln Ser Thr Pro Arg Asn Asp Trp Pro Pro Phe
385 390 395 400
Val Asp Asp Val Leu Thr Ala Asp Pro Glu Thr Val Arg Tyr Glu Ser
405 410 415
Ile Leu Ser Glu Glu Gly Gly Arg Phe Tyr Gln Ala Gln Asn Arg Tyr
420 425 430
Arg Ile Ile Glu Val His Glu Asp Phe Ala Ala Arg Pro Pro Ser Asp
435 440 445
Phe Arg Trp Met Thr Leu Gly Gln Leu Gly Glu Leu Leu Arg Ser Thr
450 455 460
His Ser Leu Asn Ile Gln Ala Arg Ser Leu Val Ala Ser Leu His Ser
465 470 475 480
Leu Trp Ala Leu Gly Arg
485
<210>18
<211>437
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>18
Met Arg Val Leu Phe Thr Pro Leu Pro Ala Ser Ser His Phe Phe Asn
1 5 10 15
Leu Val Pro Leu Ala Trp Ala Leu Arg Ala Ala Gly His Glu Val Arg
20 25 30
Val Ala Ile Cys Pro Asn Met Val Ser Met Val Thr Gly Ala Gly Leu
35 40 45
Thr Ala Val Pro Val Gly Asp Glu Leu Asp Leu Ile Ser Leu Ala Ala
50 55 60
Arg Asn Lys Leu Val Leu Gly Asn Gly Val Ala Phe Asp Glu Gly Arg
65 70 75 80
Arg Pro Glu Leu Phe Asp Glu Leu Leu Ser Ile Asn Ser Gly Arg Asp
85 90 95
Met Asp Ala Val Glu Gln Leu His Leu Val Asp Asp Arg Ser Leu Asp
100 105 110
Asp Leu Met Gly Phe Ala Glu Lys Trp Gln Pro Asp Leu Val Val Trp
115 120 125
Asp Ala Met Val Cys Ser Gly Pro Val Val Ala Gln Ala Leu Gly Val
130 135 140
Arg His Val Arg Met Leu Val Ala Leu Asp Val Ser Gly Trp Leu Arg
145 150 155 160
Ser Gly Phe Leu Glu Tyr Leu Glu Ser Lys Pro Pro Glu Gln Arg Val
165 170 175
Asp Pro Leu Gly Ala Trp Leu Gly Ala Lys Leu Ser Lys Phe Gly Ala
180 185 190
Thr Phe Asp Glu Glu Ile Val Thr Gly Gln Ala Thr Ile Asp Pro Val
195 200 205
Ser Ser Trp Met Arg Leu Pro Val Asp Leu Asp Tyr Ile Ser Met Arg
210 215 220
Phe Val Pro Tyr Asn Gly Pro Ala Val Val Pro Glu Trp Leu Arg Glu
225 230 235 240
Pro Pro Thr Lys Pro Arg Val Cys Val Thr Arg Gly Leu Thr Lys Arg
245 250 255
Gln Gln Ser Arg Val Ala Glu Gln Trp Glu Gly Glu Ala Gln Glu Gln
260 265 270
Ala Met Val Glu Thr Leu Leu Arg Gly Ala Ala Gly Leu Asp Val Glu
275 280 285
Val Ile Ala Thr Leu Ser Gly Gly Glu Val Arg Glu Met Gly Glu Leu
290 295 300
Pro Pro Asn Val Arg Val His Glu Tyr Val Pro Leu Asn Glu Leu Leu
305 3l0 315 320
Glu Ser Cys Ser Ala Ile Ile His His Gly Ser Thr Thr Thr Gln Glu
325 330 335
Thr Ala Thr Val Asn Gly Val Pro Gln Leu Ile Leu Pro Gly Thr Phe
340 345 350
Trp Asp Glu Ser Arg Arg Ala Glu Leu Leu Ala Asp Arg Gly Ala Gly
355 360 365
Leu Val Leu Asp Arg Ala Thr Phe Thr Glu Asp Asp Val Arg Arg Gln
370 375 380
Leu Ala Arg Leu Leu Asp Glu Pro Ser Phe Ala Ala Asn Ala Ala Leu
385 390 395 400
Ile Arg Gly Glu Ile Glu Glu Asn Pro Ser Pro His Asp Ile Val Ala
405 410 415
Arg Leu Glu Lys Leu Val Ala Glu Gly Lys Asn Arg Arg Ala Gly Lys
420 425 430
Ser Asp Gly His Leu
435
<210>19
<211>447
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>19
Val Thr Ser Cys Asp Asp Thr Cys Ala Thr Ala Thr Glu Met Thr Pro
1 5 10 15
Asp Ala Lys Asp Arg Ile Leu Ala Ser Val Arg Asp Tyr His Arg Glu
20 25 30
Gln Lys Ser Ser Ile Phe Val Ala Gly Ser Thr Pro Ile Arg Pro Ser
35 40 45
Gly Ala Val Leu Asp Glu Asp Asp Arg Val Ala Leu Val Glu Ala Ala
50 55 60
Leu Glu Leu Arg Ile Ala Ala Gly Gly Asn Ala Arg Arg Phe Glu Ser
65 70 75 80
Glu Phe Ala Arg Phe Phe Gly Leu Arg Lyg Ala His Leu Thr Asn Ser
85 90 95
Gly Ser Ser Ala Asn Leu Leu Ala Leu Ser Ser Leu Thr Ser Pro Asn
100 105 110
Leu Gly Glu Ala Arg Leu Arg Pro Gly Asp Glu Val Ile Thr Ala Ala
115 120 125
Val Gly Phe Pro Thr Thr Ile Asn Pro Ala Val Gln Asn Gly Leu Val
130 135 140
Pro Val Phe Val Asp Val Glu Leu Gly Thr Tyr Asn Ala Thr Pro Asp
145 150 155 160
Arg Ile Lys Ala Ala Val Ser Glu Arg Thr Arg Ala Ile Met Leu Ala
165 170 175
His Thr Leu Gly Asn Pro Phe Ala Ala Asp Glu Ile Ala Glu Ile Ala
180 185 190
Arg Glu His Glu Leu Phe Leu Ile Glu Asp Asn Cys Asp Ala Val Gly
195 200 205
Ser Thr Tyr Arg Gly Arg Leu Thr Gly Thr Phe Gly Asp Leu Thr Thr
210 215 220
Val Ser Phe Tyr Pro Ala His His Ile Thr Ser Gly Glu Gly Gly Cys
225 230 235 240
Val Leu Thr Gly Ser Leu Glu Leu Ala Arg Ile Ile Glu Ser Leu Arg
245 250 255
Asp Trp Gly Arg Asp Cys Trp Cys Glu Pro Gly Val Asp Asn Thr Cys
260 265 270
Arg Lys Arg Phe Asp Tyr Gln Leu Gly Thr Leu Pro Ala Gly Tyr Asp
275 280 285
His Lys Tyr Thr Phe Ser His Val Gly Tyr Asn Leu Lys Thr Thr Asp
290 295 300
Leu Gln Ala Ala Leu Ala Leu Ser Gln Leu Ser Lys Ile Ser Glu Phe
305 310 315 320
Gly Ser Ala Arg Arg Arg Asn Trp Arg Arg Leu Arg Glu Gly Leu Ser
325 330 335
Gly Val Pro Gly Leu Leu Leu Pro Val Pro Thr Pro His Ser Asp Pro
340 345 350
Ser Trp Phe Gly Phe Ala Ile Thr Val Ser Ala Asp Ala Gly Phe Thr
355 360 365
Arg Ala Ala Leu Val Asn Phe Leu Glu Ser Arg Asn Ile Gly Thr Arg
370 375 380
Leu Leu Phe Gly Gly Asn Ile Thr Arg His Pro Ala Phe Gln His Val
385 390 395 400
Arg Tyr Arg Ile Ala Asp Ala Leu Thr Asn Ser Asp Ile Val Thr Asp
405 410 415
Arg Thr Phe Trp Val Gly Val Tyr Pro Gly Ile Thr Asp Gln Met Ile
420 425 430
Asp Tyr Val Ala Glu Ser Ile Ala Glu Phe Val Ala Lys Asn Ser
435 440 445
<210>20
<211>378
<212>PRT
<313>刺糖多胞菌 NRRL30141
<400>20
Val Ile Asn Leu His Gln Pro Thr Leu Gly Ala Glu Glu Leu Asp Ala
1 5 10 15
Ile Ala Glu Val Phe Ala Ser Asn Trp Ile Gly Leu Gly Pro Arg Thr
20 25 30
Arg Thr Phe Glu Ala Asp Phe Ala His His Leu Gly Val Asp Pro Asp
35 40 45
Gln Ile Val Phe Val Agn Ser Gly Thr Ala Ala Leu Phe Leu Thr Val
50 55 60
Gln Val Leu Asp Leu Gly Pro Gly Asp Asp Val Val Leu Pro Ser Ile
65 70 75 80
Ser Phe Val Ala Ala Ala Asn Ala Ile Ala Ser Ser Gly Ala Arg Pro
85 90 95
Val Phe Cys Asp Val Asp Pro Arg Thr Leu Asn Pro Thr Leu Asp Asp
100 105 110
Val Ala Lys Ala Ile Thr Pro Thr Thr Lys Ala Val Leu Leu Leu His
115 120 125
Tyr Gly Gly Ser Pro Gly Glu Val Thr Glu Ile Ala Gly Phe Cys Arg
130 135 140
Glu Lys Gly Leu Val Leu Ile Glu Asp Thr Ala Cys Ala Val Ala Ser
145 150 155 160
Ser Val His Gly Thr Ala Cys Gly Thr Phe Gly Asp Leu Ala Thr Trp
165 170 175
Ser Phe Asp Ala Met Lys Ile Leu Val Thr Gly Asp Gly Gly Met Phe
180 185 190
Tyr Ala Ala Asp Arg Glu Leu Ala His Arg Ala Arg Arg Leu Ala Tyr
195 200 205
His Gly Leu Glu Gln Met Ser Gly Phe Asp Ser Ala Lys Ser Ser Asn
210 215 220
Arg Trp Trp Asp Ile Cys Val Glu Asp Ile Gly His Arg Leu Ile Gly
225 230 235 240
Asn Asp Met Thr Ala Ala Leu Gly Ser Val Gln Leu Arg Lys Leu Pro
245 250 255
Asp Phe Val Ser Arg Arg Arg Glu Ile Ala Thr Gln Tyr Asp Arg Leu
260 265 270
Leu Ser Asp Val Pro Gly Val His Leu Pro Pro Thr Leu Pro Asp Gly
275 280 285
His Val Ser Ser His Tyr Phe Tyr Trp Val Gln Leu Ala Pro Glu Ile
290 295 300
Arg Asp Arg Val Ala Gln Gln Met Leu Glu Arg Gly Ile Tyr Thr Ser
305 310 315 320
Phe Arg Tyr Pro Pro Leu His Lys Val Pro Ile Tyr Arg Ala Asp Cys
325 330 335
Lys Leu Pro Ser Ala Glu His Ala Cys Arg Arg Thr Leu Leu Leu Pro
340 345 350
Leu His Pro Ser Leu Asp Asp Ala Glu Val Arg Thr Val Ala Asp Glu
355 360 365
Phe Arg Lys Ala Val Glu Gln His Ile Ser
370 375
<210>21
<211>249
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>21
Met Ser Arg Val Ser Gly Thr Phe Glu Glu Leu Ser Ser Val Tyr Ser
1 5 10 15
Pro Asp His Ala Asp Ile Tyr Asp Ala Ile His Ser Ala Arg Gly Arg
20 25 30
Asp Trp Ala Thr Glu Ala Glu Glu Ile Ile Gln Leu Ile Arg Thr Arg
35 40 45
Leu Pro Glu Ala Gln Ser Leu Leu Asp Ile Ala Cys Gly Thr Gly Ala
50 55 60
His Leu Glu Arg Phe Arg Thr Glu Tyr Ala Lys Val Ala Gly Leu Glu
65 70 75 80
Leu Ser Asp Ala Met Arg Glu Ile Ala Ile Arg Arg Val Pro Glu Val
85 90 95
Pro Ile His Thr Gly Asp Ile Arg Asp Phe Asp Leu Gly Glu Pro Phe
100 105 110
Asp Val Val Thr Cys Leu Cys Phe Thr Ala Ala Tyr Met Arg Thr Val
115 120 125
Asp Glu Leu Arg Arg Val Thr Arg Asn Met Ala Arg His Leu Ala Pro
130 135 140
Gly Gly Val Ala Val Ile Glu Pro Trp Trp Phe Pro Asp Lys Phe Ile
145 150 155 160
Asp Gly Phe Val Thr Gly Ala Val Ala His His Gly Glu Arg Val Ile
165 170 175
Ser Arg Leu Ser His Ser Val Leu Glu Gly Arg Thr Ser Arg Met Thr
180 185 190
Val Arg Tyr Thr Val Ala Glu Pro Ala Gly Ile Arg Asp Phe Thr Glu
195 200 205
Phe Glu Ile Leu Ser Leu Phe Thr Glu Asp Glu Tyr Thr Ala Ala Leu
210 215 220
Glu Asp Ala Gly Ile Arg Ala Glu Tyr Leu Pro Gly Gly Pro Asn Gly
225 230 235 240
Arg Gly Leu Phe Val Gly Thr Arg Asn
245
<210>22
<211>470
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>22
Met Pro Ser Arg Arg Pro Leu Thr Ala Ile Gln Leu Asn Leu Tyr Pro
1 5 10 15
Arg Val Ala Arg His Pro Ala Val Val Gln Phe Cys Tyr Gly Gly Val
20 25 30
Tyr Cys Gly Pro Arg Trp Leu Gly Ser Trp Asp Trp Ile Met Ala His
35 40 45
Phe Val Phe Ala Thr Tyr Ala Asp His Ala His Ile Gly Pro Leu Val
50 55 60
Pro Val Ser Arg Ala Leu Val Glu Arg Asp His Gln Val Thr Trp Tyr
65 70 75 80
Thr Gly Glu Asn Tyr Arg Ala Ala Val Glu Arg Ser Gly Ala Asp Phe
85 90 95
Ala Ala Pro Val Glu Gly Arg Phe Ile Asp Gly Arg Glu Leu Glu Gln
100 105 110
Lys Phe Pro Glu Ser Ile Gln Met Ser Ala Arg Arg Arg Ala Arg Trp
115 120 125
Leu Met Asp Asn His Trp Val Pro Ala Tyr Glu Gly Gln Tyr Arg Asp
130 135 140
Leu Val Ala Val Val Asp Arg Thr Arg Ala Asp Val Leu Leu Ala Asp
145 150 155 160
Ala Ser Trp Gly Pro Ala Lys Leu Val His Ala Val Thr Gly Val Leu
165 170 175
Trp Ala Thr Ile Ser Gln Met Pro Ile Leu Leu Pro Asp Pro Ala Val
180 185 190
Pro Pro Ile Gly Thr Gly Trp Lys Phe Gly Thr Ser Pro Phe His Arg
195 200 205
Leu Arg Asn Arg Ile Gly Asn Arg Leu Ile Asn Ala Leu Val His Asp
210 215 220
Pro Gly Met Lys Lys Ile Asn Ala Phe Trp Asn Ser Ile Gly Val Pro
225 230 235 240
Val Ser Arg Glu Val Ser Glu Ser Pro Tyr Leu Phe Met Gln Ala Gly
245 250 255
Thr Arg Ser Leu Glu Iyr Pro Arg Ala Leu Pro Gln Gln Met His Phe
260 265 270
Ile Gly Arg Leu Glu Pro Asp Ser Pro Met Gly Val Gly Leu Pro Ser
275 280 285
Trp Trp Gly Glu Leu Asp Gly Asp Arg Pro Val Val Leu Val Thr Gln
290 295 300
Gly Thr Met Ala Val Asp Ala Asp Asp Leu Ile Arg Pro Ala Leu Arg
305 310 315 320
Gly Leu Ala Gly Asp Gln Val Leu Val Val Ala Thr Thr Gly Arg Glu
325 330 335
Gly Val Asp Leu Gly Tyr Val Pro Asp Asn Ala Arg Val Ala Ser Phe
340 345 350
Leu Pro Tyr Arg Glu Leu Met Pro Lys Leu Ala Ala Val Val Thr Asn
355 360 365
Gly Gly Phe Gly Thr Val Gln Gln Ala Leu Ser His Gly Leu Pro Leu
370 375 380
Val Val Ala Gly Arg Ser Glu Asp Lys Thr Asp Val Cys Ala Arg Val
385 390 395 400
Ala Trp Ser Gly Ala Gly Val Asp Leu Arg Thr Arg Arg Pro Ser Pro
405 410 415
Gln Gln Val Ala Gly Ala Val Lys Val Met Ser Thr Asp Pro Arg Tyr
420 425 430
Arg Gln Ala Ala Gln Arg Leu Ala Val Glu Tyr Ala Glu Tyr Asp Ala
435 440 445
Cys Gly Thr Ala Val Lys Leu Leu Glu Arg Leu Ala Thr Thr Arg Arg
450 455 460
Pro Val Ile Ala Ser Arg
465 470
<210>23
<211>169
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>23
Val Arg Cys Gly Cys Gly Arg Val His Thr Ala Ala Arg Pro Glu Gly
1 5 10 15
Ala Arg Pro Gly Ala Val Gly Tyr Gly Pro Asn Leu Gln Ala Phe Ala
20 25 30
Val Tyr Leu Met Val Val His Phe Ile Pro Val His Arg Cys Val Glu
35 40 45
Leu Leu Ala Ser Leu Thr Gly Ala Val Pro Ser Val Gly Phe Val His
50 55 60
Gly Val Leu Thr Arg Ala Ala Gly Val Leu Thr Glu Val Asp Lys Arg
65 70 75 80
Ile His Thr Leu Ala Tyr Ala Val Cys Cys Asp Glu Thr Pro Leu Arg
85 90 95
Val Gly Pro Arg Thr Pro Asn Gln Ala Glu Arg Asp Leu Arg Pro Ala
100 105 110
Lys Val Gln Gln Asn Ile Ser Gly Arg Leu Thr Ile Glu Lys Arg Thr
115 120 125
Lys Asp Arg Tyr Arg Ile Arg Gly Ser Leu Ser Thr Ala Gly Lys His
130 135 140
Gly Arg Asn Met Ile Glu Ala Leu Arg Glu Ala Ile Arg Gly His Pro
145 150 155 160
Trp Met Pro Pro Asp Pro Thr Pro Ala
165
<210>24
<211>165
<212>PRT
<213>Saccharopolyspora sp.NRRL30141
<400>24
Val Cys Ser Asp Arg Gly Ala Gly Val Ala Leu Cyg Val Cys Trp Ser
1 5 10 15
Trp Cys Gly Phe Cys Val Gly Val Ala Glu Leu Ile Glu Leu Val Gly
20 25 30
Glu Gln Gly Ala Arg Ile Ala Val Leu Gly Glu Gln Ile Ala Val Arg
35 40 45
Asp Arg Gln Ile Thr Ala Met Ala Ala Gln Met Ala Glu Leu Ala Glu
50 55 60
Val Asn Glu Ala Leu Gly Glu Arg Leu Ala Lys Leu Glu His Ala Leu
65 70 75 80
Ser Arg Asn Ser Lys Asn Ser Ser Ser Ala Pro Ser Lys Asp Asp Gly
85 90 95
Pro Gly Arg Thr Pro Pro Pro Ala Lys Ala Lys Arg Gly Gly Ala Val
100 105 110
Lys Arg Lys Gly Lys Gln Pro Gly Ala Pro Gly Ala Asn Leu Ala Trp
115 120 125
Thr Asp Leu Pro Gly Asp His Lys Asp Arg Phe Pro Gly Gly Val Cys
130 135 140
Glu Cys Gly Ser Asp Leu Ala Arg Gly Thr Gly Ser Gly Gly Gly Gly
145 150 155 160
Ser Leu Pro Ala Ala
165
<210>25
<211>248
<212>PRT
<213>刺糖多胞菌 NRRL3014l
<400>25
Met Glu Ile Ile Gly Arg Gly Phe Ile Ala Arg Asn Leu Leu Arg Ile
1 5 10 15
Ser Gly Arg His Ala Asp Ala Val Ala Leu Ala Ala Gly Val Ser Asn
20 25 30
Thr Ser Cys Arg Ser Glu Asp Glu Tyr Gln Arg Glu Ala Ala Leu Val
35 40 45
Tyr Arg Thr Ile Glu Arg Cys His Ala Ile Gly Arg Lys Leu Leu Phe
50 55 60
Phe Ser Thr Ala Ser Ala Ser Met Tyr Gly Ala Leu Thr Ser Pro Gly
65 70 75 80
Phe Glu Asp Gly Pro Val Tyr Pro Pro Thr Thr Tyr Gly Arg His Lys
85 90 95
Leu Ala Met Glu Ala Val Ile Lys Ala Ser Gly Val Asp Phe Leu Ile
100 105 110
Leu Arg Leu Ala Tyr Val Ile Gly Ala His Gln Arg Gly His Gln Leu
115 120 125
Leu Pro Ser Leu Val Thr Gln Leu Arg Ser Gly Ser Val Thr Val His
130 135 140
Arg Gly Ala His Arg Asp Val Ile Ala Ala Asp Asp Val Val Thr Ile
145 150 155 160
Val Asp Asp Leu Leu Thr Lys Ala Val Ala Gly Thr Val Val Asn Ile
165 170 175
Gly Ser Gly Phe Pro Val Pro Ala Glu Lys Ile Val Ala His Leu Glu
180 185 190
Tyr Arg Leu Gly Thr Ala Ala Ala Arg Gln Trp Ile Asp His Pro Thr
195 200 205
Glu Tyr Gln Ile Ser Leu Thr Arg Leu Asn Thr Leu Val Pro Arg Ile
210 215 220
Ala Glu Leu Gly Phe Gly Pro Asp Tyr Tyr Arg Gln Val Leu Asp His
225 230 235 240
Tyr Leu Asp Leu Tyr Pro Gln Ala
245
<210>26
<211>260
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>26
Met Phe Asp Thr Val Asp Asp Arg Ala Thr Gln Ala Leu Pro Asp Gly
1 5 10 15
Arg Leu Val Ala Cys Ala Asn Thr Leu Glu Val Leu Ala Ile Trp Gln
20 25 30
Asp Ile Ala Asn Asp Ser Ala Tyr Ala Arg Gly Leu Arg Gly Leu Gly
35 40 45
Ala Asp Ser Val Ile Val Asp Val Gly Ala His Val Gly Leu Ala Ser
50 55 60
Met Tyr Phe Ala Asp Arg Ile Pro Ala Ala Arg Ile Leu Ala Tyr Glu
65 70 75 80
Pro Ala Pro Thr Thr Phe Ala Cys Leu Arg Glu Asn Phe Ala Arg His
85 90 95
Val Pro Arg Gly Val Thr Phe AsP Leu Ala Val Gly Ala Glu Pro Gly
100 105 110
Thr Ser Arg Phe Val Tyr Tyr Pro Ala Gly Pro Ser Leu Ser Thr Leu
115 120 125
His Leu Asp Ala Ala Asp Glu Arg Arg Asn Ile Asp Thr Val Met Ser
130 135 140
Asn Val Gly Ser Pro Glu Leu Ala Gly Glu Ssr Met Gln Gly Leu Val
145 150 155 160
Arg Thr Lys Glu Glu Leu Asp Val Arg Val Thr Thr Leu Thr Glu Ile
165 170 175
Ala Arg Gln His Arg Leu Asp Val Leu Asp Leu Leu Lys Ile Asp Val
180 185 190
Glu Arg Gly Glu Leu Asp Val Leu Asn Gly Ile Asp Asp Glu Met Trp
195 200 205
Pro Arg Ile Arg Arg Ile Val Val Glu Val His Asp Ile Cys Gly Arg
2l0 215 220
Leu Arg Gln Val Leu Asp Arg Leu Arg Lys Leu Asp Tyr Gln Val Glu
225 230 235 240
Val Ser Gln Ser Pro Ile Phe Leu Gly Ala Ser Val His Ile Val Val
245 250 255
Ala Val Arg Asp
260
<2l0>27
<211>399
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>27
Met Thr Asn Gly Asp Glu Pro Met Ala Tyr Pro Phe Gly Glu Ile Asp
1 5 10 15
Arg Leu Leu Leu Asp Asp Arg Tyr Ala Val Leu Arg Glu Gly Glu Pro
20 25 30
Val Ser Lys Ile Arg Leu Pro Tyr Gly Gly Asp Gly Trp Leu Val Thr
35 40 45
Arg Tyr Ala Asp Ile Lys Thr Val Leu Gly Asp Pro Arg Phe Ser Ala
50 55 60
Ala Ala Ile Leu Asn Arg Asp Val Pro Arg Gly Phe Pro Leu Ile Leu
65 70 75 80
Arg Glu His Ser Leu Gly Thr Met Asp Pro Pro Glu His Thr Arg Leu
85 90 95
Arg Lys Leu Val Gly Lys Ala Phe Thr Ala Arg Arg Val Glu Gln Leu
100 105 110
Arg Pro Arg Thr Gln Gln Leu Val Asp His Leu Leu Asp Arg Met Ala
115 120 125
Ala Asp Gly Pro Pro Gly Asp Leu Val ser Ala Leu Ala Leu Pro Leu
130 135 140
Pro Ile Lys Val Ile Cys Asp Leu Leu Gly Ile Pro Val Ala Asp Arg
145 150 155 160
Glu Arg Phe Arg Val Trp ser Asp Ile Ala Leu Ala Ile Thr Ser Asn
165 170 175
Ser Pro Glu Glu Ile Arg Glu Ser Arg Asp Gln Ile Arg Ala Tyr Ile
180 185 190
Gly Glu Leu Val Gln Gln Arg Lys Lys Met Pro Thr Glu Asp Leu Leu
195 200 205
Ser Val Leu Val Gln Ala Arg Ala Glu Gly Ala Gln Leu Ser Glu Glu
210 215 220
Glu Ile Val Val Thr Gly Ala Gly Leu Leu Ile Ala Gly Phe Glu Thr
225 230 235 240
Thr Ala Asn His Ile Ala Asn Phe Thr Phe Asn Leu Leu Thr His Pro
245 250 255
Asp Gln Leu Asp Lys Leu Ile Ala Asp Pro Glu Leu Val Pro Arg Ala
260 265 270
Val Glu Glu Leu Leu Arg Tyr Thr Pro Leu Gly Ala Thr Pro Gly Phe
275 280 285
Pro Arg Ile Ala Thr Glu Asp Leu Glu Leu Gly Gly Val Ser Ile Arg
290 295 300
Arg Gly Asp Ala Val Phe Phe Glu Ile Ala Ser Ala Asn Arg Asp Ser
305 310 315 320
Ala Val Phe Asp Gly Pro Asp Glu Leu Asp Leu Ala Arg Glu His Asn
325 330 335
Ser His Met Ala Leu Gly His Gly Pro His Tyr Cys Ile Gly Ala Gln
340 345 350
Leu Ala Arg Met Glu Leu Gln Val Ala Ile Gly Thr Leu Ile Lys Arg
355 360 365
Phe Pro Gln Leu Ser Phe Ala Val Pro Val Asp Glu Val Val Trp Lys
370 375 380
Arg Gly Arg Met Thr Arg Gly Pro Glu Ala Leu Pro Ile Thr Trp
385 390 395
<210>28
<211>248
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>28
Val Val Arg Asn Gly His Asp Gln Pro Arg Glu Val Leu Thr Ser Ala
1 5 10 15
Gly Ala Val Glu Val Thr Ala Pro Arg Val Asn Asp Lys Arg Thr Asp
20 25 30
Pro Asp Thr Gly Ala Arg Arg Arg Phe Ser Ser Ala Ile Leu Pro Pro
35 40 45
Trp Ala Arg Lys Thr Pro Lys Ile Thr Glu Met Leu Pro Leu Leu Tyr
50 55 60
Leu His Gly Leu Ser Ser Gly Asp Phe Val Pro Ala Leu Gly Gln Phe
65 70 75 80
Leu Gly Ser Ser Lys Gly Leu Ser Ala Thr LeuIle Thr Lys Leu Thr
85 90 95
Glu Gln Trp Arg Thr Glu His Arg Ala Phe Asn Glu Arg Gly Leu Ser
100 105 110
Glu Val Asp Phe Val Tyr Leu Arg Ala Asp Gly Ile His Val Asn Ile
115 120 125
Arg Leu Glu Glu His Lys Leu Ser Leu Leu Val Val Ile Gly Val Arg
130 135 140
Ala Asp Gly Arg Lys Glu Leu Val Ala Leu Ala Asp Gly Tyr Arg Glu
145 150 155 160
Ser Thr Glu Ser Trp Ala Gly Leu Thr Tyr Cys Val Thr Ala Ser Ala
165 170 175
Ala Val Cys Val Pro Arg Tyr Trp Pro Ser Ala Thr Val His Trp Gly
180 185 190
Ser Gly Ala Arg Ser Ala Arg Leu Ser Leu Ile Arg Ala Ser Ser Ala
195 200 205
Thr Gly Ser Thr Arg Ser Ala Met Cys Ser Pro Arg Cys Arg Asn Arg
210 215 220
Arg Ile Pro Ala Arg Arg Arg Pro Trp Pro Arg Ser Gly Met Pro Arg
225 230 235 240
Thr Ala Gly Thr Cys Trp Thr Arg
245
<210>29
<211>276
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>29
Val Ala Glu Thr Ile Gly Leu Val Arg Arg Thr Ser Ser Gly Gln Leu
l 5 10 15
Ala Glu Thr Glu Leu Leu Ala Leu Leu Arg Arg Asp Gly Gly Arg Tyr
20 25 30
Arg Ser Thr Val Leu Ala Leu Thr Ala Pro Gly Phe Asn Arg Pro Ser
35 40 45
Glu Met Met His Arg Ala Val Leu Ser Gly Arg Ala His Thr Ala Gln
50 55 60
Val Leu Gly Thr Asp Leu Trp Gly Tyr Tyr Gly Thr Asn Pro Glu Glu
65 70 75 80
Ala Lys Trp Phe Gly Gly Ala Met Thr Asp Leu Thr Asn Leu Val Ala
85 90 95
Asp Leu Val Leu Ala Arg Tyr Glu Phe Ser Gly Arg Gly Thr Ile Met
100 105 110
Asp Val Gly Gly Ser His Gly Ile Phe Leu Ser Arg Ile Leu His Ala
115 120 125
Gln Pro Asp Ala Lys Gly Val Leu Phe Asp Arg Met Glu Val Val Glu
130 135 140
Glu Ala Arg Asn His Leu Asp Gln Asp Ile Arg Thr Arg Ile Gln Ile
145 150 155 160
Val Gly Gly Asr Phe Phe Glu Gly Val Pro Glu Gly Gly Asp Leu Tyr
165 170 175
Ile Leu Lys Ser Val Leu Cys Asp Trp Asp Asp Gln Ser Cys Leu Gln
180 185 190
Ile Leu Ser Arg Ile Arg Asn Ala Ala Met Pro Gly Ala Ser Lau Leu
195 200 205
Ile Val Asp Trp Leu Tyr Pro Asp Glu Ser Asp Pro Gly Leu Asp Ala
210 215 220
Ile Tyr Leu Gln Gln Ala Ile Ser Val Asn Gly Arg Val Arg Asn Gln
225 230 235 240
Glu Gln Phe Glu Ser Leu Leu Lys Ala Thr Gly Phe Ala Val Thr Arg
245 250 255
Val Glu Arg Thr Thr Pro Glu Asn Trp Ile Pro Ala Thr Ile Ile Glu
260 265 270
Ala Ile Arg Arg
275
<210>30
<211>616
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>30
Val Gly Cys Leu Arg Ser His Ala Ala Tyr Pro Ala Ser Ala Asp Gln
1 5 10 15
Gly Ala Leu Leu Arg Asp Pro Val Arg Arg Gly Val Gln Pro Gly Arg
20 25 30
Ala His Asp Arg His Arg Arg Arg Thr Arg Arg His Arg Ala Gly Arg
35 40 45
His Tyr Arg Pro Gly Gln Ser Gln Gly Ala Asn Arg Pro Leu Gly Thr
50 55 60
Leu Arg Gln Val Asp Asp Leu Gly Gly Val Gln Pro Gly Arg Gly Lys
65 70 75 80
Val Gly His Gln Arg Arg Arg Arg His Arg Cys Pro Val Val Pro Arg
85 90 95
Arg Pro Ala Ala Pro Thr Ala Ala Gly Asp Ala Gly Arg Pro Arg Arg
100 105 110
Gln GLy Val Arg Ser Gly Val Gln Leu Glu Arg Gly Gly Gly Leu His
115 120 125
Arg Gly Phe Leu Arg Asn Arg Arg Ser Arg Gly Ala Val Arg Gln Trp
l30 135 140
Arg Ala Ala Asp Arg Ala Arg Pro Val Ala Thr Gly Leu Leu Asn Gly
145 150 155 160
His Glu Leu Arg Leu Thr Ala Val Ser Thr Val Asp Asp Ser Ala Leu
165 170 175
Ala Ile Ala Ser Lys Pro Arg Ser Pro Ile Pro Asp Pro Arg Cys Lys
180 185 190
Val Ala Thr Thr Ser Ser Ala Ser Ser Pro Leu Pro Gly Leu Gly Pro
195 200 205
Val Val Arg Ser Asn Phe Gly Pro Thr Arg Leu Gly Phe Val Leu Met
210 215 220
Leu Lys Phe Phe Glu Leu Glu Gly Arg Phe Pro Gln Phe Val Glu Glu
225 230 235 240
Phe Pro Gln Ala Ala Val Asp Tyr Val Ala Gly Val Val Lys Val Pro
245 250 255
Ala Glu Asp Leu Ala Lys Tyr Xaa Leu Ser Ser Arg Ser Ala Lys Gly
260 265 270
His Arg Thr Gln Ile Arg Glu Thr Leu Gly Tyr Xaa Pro Ala Thr Arg
275 280 285
Ala Asp Glu Glu Arg Leu Thr Ala Trp Leu Ala Asp Glu Val Cys Pro
290 295 300
Val Glu Met Val Glu Asp Arg Leu Arg Glu Ala Leu Leu Val Gln Cys
305 310 315 320
Arg Ser Asp His Val Glu Pro Pro Gly Arg Val Glu Arg Ile Val Ala
325 330 335
Ala Ala Arg Ala Arg Ala Asp Arg Val Phe Cys Ala Gln Thr Val Ala
340 345 350
Arg Leu Gly Glu Ala Cys Ala Gly Arg Leu Leu Thr Leu Val Ala Glu
355 360 365
Gly Asn Glu Glu Gly Thr Ala Leu Leu Ala Ser Leu Lys Arg Asp Pro
370 375 380
Gly Ala Val Gly Leu Asp Ser Leu Leu Ala Glu Ile Thr Lys Leu Thr
385 390 395 400
Ala Val Arg Arg Leu Gly Leu Pro Glu Gly Leu Phe Ala Asp Cys Ser
405 410 415
Glu Lys Leu Val Ala Ala Trp Ala Gly Ala Gly Asp Gln Asp Val Ser
420 425 430
Leu Gly Leu Pro Gly Arg Trp Gln Gly Cys Ala Asp His Ala Ala Gly
435 440 445
Gly Ala Val Arg Val Pro Ala Gly Gly Asp His Arg Cys Pro Gly Gly
450 455 460
Ala Ala Gly Arg Ser Gly Ser His Lys Ile Asn Ala Arg Ala Glu Arg
465 470 475 480
Arg Val Glu Arg Gln Leu Thr Ala Glu Leu Lys Lys Val Arg Gly Lys
485 490 495
Glu Gly Ile Leu Phe Gln Leu Ala Asp Ala Ser Val Gly Gln Pro Glu
500 505 510
Gly Thr Val Arg Arg Val Leu Phe Pro Val Val Gly Glu Lys Thr Leu
515 520 525
Arg Asp Leu Val Ala Glu Ala Asn Glu Lys Ala Phe Lys Ala Arg Val
530 535 540
Arg Thr Thr Leu Arg Ser Ser Tyr Ser Ser Tyr Tyr Pro Ala Asp Ala
545 550 555 560
Ala Val Thr Ala Ala Asp Ala Arg Leu Gln Val Gln Gln His Arg Leu
565 570 575
Pro Ala Gly Asp Gly Arg Ala Arg Ala Ala Gly Glu Val Arg Arg Arg
580 585 590
Arg Arg Gln Asp Pro Leu Leu Arg Arg Arg Arg Arg Gly Ala Asp Gly
595 600 605
Arg Pro Ser Pro Gln Gly Leu Ala
610 615
<210>31
<211>458
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>31
Val Leu Ile Phe Asp Arg Gly His Ala Glu Lys Ile Arg Gln Glu Tyr
1 5 10 15
Ala Cys His Phe Asn Thr His Arg Pro His Gln Ala His Asp Gln Gln
20 25 30
Ala Pro Tyr Val Ala Arg Ala Ser Tyr Arg Cys Arg Gln Leu Gly Ser
35 40 45
Asn Ala Asp Lys Pro Trp Gln Asp Ser Ser Thr Ser Thr Ala Lys Gln
50 55 60
Pro Asp Gly Pro Thr Lys Pro Gln Leu Thr Ala Ser Glu Pro Phe Leu
65 70 75 80
Lys His Tyr Gly Ser Cys Cys Ser Pro Arg Ser Ser Arg Cys Arg Ser
85 90 95
Ala Gly Arg Ser Arg Phe Gln Glu Asp Gln Pro Val Trp Ser Pro Cys
100 105 110
Arg Pro Ala Glu Ala Ala Pro Thr Thr Gly Trp Thr Ser Gly Val Leu
115 120 125
Ala Arg Arg Thr Ala Gly Val Arg Asn Arg Pro Tyr Arg Arg Gly Ser
130 135 140
Gln His Thr Cys Gly Thr Trp Ser Arg Gln Gly Pro Gly Arg Gly Thr
145 150 155 160
Gly Val His His Phe Arg Arg Ser Gly Leu Asp His Asp Pro Ala Leu
165 170 175
Pro Arg Tyr Cys Phe His Arg Glu Arg Ala Ala Val Gln Val Thr Phe
180 185 190
Glu Arg Phe Asp Ala Val Val Ala Glu His Ser Leu Gly His Ala Lys
195 200 205
Tyr Gly Gly Ser Val Tyr Glu Lys Arg Asp Leu G1y Gln Gln Val Pro
210 215 220
Cys Arg His Val Gln Leu Phe Leu Arg Glu Thr Ala Val Gly Ser Pro
225 230 235 240
Pro Ala Pro Arg Asp Arg Phe Arg Leu Gly Gln Thr Gly Leu Pro Glu
245 250 255
Val Phe Ile Arg Pro Glu Asp Leu Arg Asn Leu Val Pro Arg Ala Gln
260 265 270
Val Leu Leu Val Leu Val Glu Pro Gly Glu Val Asp Asp His Leu Leu
275 280 285
Gly Gly Arg Asp Ala Glu His Gly Lys Arg Pro Glu Tyr Leu Pro Ala
290 295 300
Gln Pro Ser Cys Leu Ala Ser Arg Leu Pro Leu Cys Phe His Arg Arg
305 310 315 320
Ile Ala Ala Gly Leu His Arg Glu Ser Gly Gly Arg Val Arg Glu Glu
325 330 335
His Met Gln Arg Leu Glu Leu Gly His Arg Pro Pro Gln Glu Leu Val
340 345 350
Gln Ser Cys Leu Phe Glu Val Ala Val Glu Lys Ser Cys Ala Asn Lys
355 360 365
Gly Val Ser Thr Pro Leu Glu Asp Pro Gly Ala Leu Leu Gly Arg Arg
370 375 380
Phe Met Pro His His Lys Leu Gly Ile His Val His Ala Ser Ser Arg
385 390 395 400
His Gly Cys Ile His Val Val Cys Thr Phe Cys Leu Glu Thr Val Ala
405 410 415
Ser Ser Thr Leu Arg Gln Leu Arg Arg Ile Leu Tyr Ala Asn Leu Asp
420 425 430
Ala Leu Asp Ser Pro Leu Gln Arg Phe Arg Ile Cys Ala Arg Glu Pro
435 440 445
Arg Ala Glu Arg Gly Ile Pro Phe Pro Gln
450 455
<210>32
<211>305
<212>PRT
<213>刺糖多胞菌 NRRL30141
<400>32
Val Asp Val Ala Phe Cys Ala Ile Cys Gly Ser Asp Leu His Leu Arg
1 5 10 15
Ala Met Pro His Leu Val Pro Ala Asp Ala Val Leu Gly His Glu Ile
20 25 30
Ser Gly His Val Ala Ala Pro Gly Gly Glu Arg Leu Thr Ala Gly Gln
35 40 45
Ala Val Val Val Trp Pro Lys Ala Gly Cys Gly Asp Cys Asp Asp Cys
50 55 60
Arg Val Gly Asp Asn His Leu Cys Ala Val Gln Pro Trp Arg Leu Ser
65 70 75 80
Ser Leu Gly Leu Gly Thr Arg Pro Gly Gly Tyr Ala Glu Ala Val Val
85 90 95
Val Pro Glu His Thr Val Tyr Ala Val Pro Asp Gly Val Ser Leu Glu
100 105 110
His Ala Ala Leu Thr Glu Pro Leu Ser Cys Ala Val His Ala Val Asp
115 120 125
Arg Ser Gly Ile Ser Ala Ala Asp Thr Val Thr Val Leu Gly Gly Gly
130 135 140
Thr Val Gly Phe Leu Leu Ala His Val Leu Arg Leu Arg Gly Val Glu
145 150 155 160
Asp Val Arg Val Val Glu Pro His Pro Val Arg Arg Ala Arg Leu Thr
165 170 175
Ala Thr Gly Ile Thr Thr Val Asp Val Asp Glu Arg Gly Pro Asp Ala
180 185 190
Asp Val Val Phe Glu Cys Val Gly Ser Val Thr Ala Leu Thr Asp Ala
195 200 205
Ala Arg Arg Val Arg Thr Arg Gly Thr Ile Val Ala Leu Gly Val Asn
210 215 220
Glu Arg Pro Ser Glu Leu Asp Ser Val Ala Leu Ile Thr Lys Glu Ile
225 230 235 240
Arg Ile Val Gly Ser Phe Ala Gln Asn Arg Gly Ala Phe Glu Ala Ala
245 250 255
Leu Glu Leu Leu Gly Gly Gly Arg Ile Pro Val Glu Arg Ile Ile Thr
260 265 270
Asp Val Val Pro Leu Asp Ala Gly Pro Val Ser Ala Met Met Asp Ala
275 280 285
Leu Thr Gly Arg Pro Gly Asp His Gln Val Val Met Ile Ala Pro Gly
290 295 300
Gly
305
<210>33
<211>20
<212>DNA
<213>刺糖多胞菌
<400>33
gtgccgaata cgcgaaggtc 20
<210>34
<211>20
<212>DNA
<213>刺糖多胞菌
<400>34
tccaggaagg tattccgcgc 20
<210>35
<211>20
<212>DNA
<213>刺糖多胞菌
<400>35
gcgacaacgc gatccagatc 20
<210>36
<211>22
<212>DNA
<213>刺糖多胞菌
<400>36
ccatgtcgtg ggcatatttc tc 22
<210>37
<211>21
<212>DNA
<213>刺糖多胞菌
<400>37
tcccgatgcc tggattcatt g 21
<210>38
<211>22
<212>DNA
<213>刺糖多胞菌
<400>38
cgtccatcat cgagaagtgg tc 22
<210>39
<211>16
<212>DNA
<213>刺糖多胞菌 NRRL 30141
<400>39
cgtacgtggc gatcag 16
<210>40
<211>21
<212>DNA
<213>刺糖多胞菌 NRRL 30141
<400>40
gtccaagttt cggttgcgtt c 21