对相关申请的交叉引用
本申请要求2014年3月21日提交的新加坡申请号10201400876T的优先权权益,该新加坡申请的内容在此以引用的方式整体并入本文用于所有目的。
技术领域
本发明属于癌症生物标志物,特别是作为癌症的预后生物标志物的融合基因的领域。
背景技术
癌症是特征在于一群细胞已经丧失它的正常控制机制,从而导致生长失去调控的一类疾病。癌细胞也被称作恶性细胞并且可以由任何器官内的任何组织产生。随着癌细胞生长和增殖,它们形成肿瘤,所述肿瘤侵袭并且破坏正常的邻近组织。来自原发部位的癌细胞还可以扩散到全身。
癌症的一个实例是胃癌(GC)。大部分的GC在晚期被诊断出,这限制了当前的治疗策略,远处疾病或转移性疾病的总体5年存活率是约3%。
在分子水平上,GC是异质的并且目前,唯一的治疗靶标是扩增的受体酪氨酸蛋白激酶ERBB2。
虽然最近的全基因组和外显子组测序研究已经鉴定出反复突变的基因,但是GC中的基因组重排仍尚未被非常详细地研究。基因组重排可能通过扩增、缺失以及基因破坏对基因功能造成显著的影响,并且可能产生具有新功能的融合基因。
因此,需要鉴定出可以用于可靠地确定患有诸如胃癌的癌症的患者的预后,以允许鉴定出高风险的癌症患者和低风险的癌症患者而允许不同的治疗方法的预后因子和标志物。
发明内容
在一个方面,提供了一种对患者是否患有癌症或是否处于增加的患癌症的风险进行确定或作出预后的方法,所述方法包括测试从患者获得的样品中一种或多种癌症相关融合基因或其衍生的蛋白质的存在,其中在所述样品中所述一种或多种癌症相关融合基因的存在指示所述患者患有癌症或处于增加的癌症风险,其中所述癌症相关融合基因选自由以下各项组成的组:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.:113或115)、MLL3-PRKAG2(SEQ ID NO.:121、123或125)以及DUS2L-PSKH1(SEQ ID NO.:131或133),或其中所述癌症相关融合基因选自由以下各项组成的组与CLDN18-ARHGAP26(SEQ ID NO:107)的组合:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.:113或115)、MLL3-PRKAG2(SEQ ID NO.:121、123或125)以及DUS2L-PSKH1(SEQ ID NO.:131或133)。
在一个方面,提供了一种确定患者是否患有癌症或是否处于增加的患癌症的风险的方法,所述方法包括测试从患者获得的样品中一种或多种癌症相关融合基因或其衍生的蛋白质的存在,其中在所述样品中所述一种或多种癌症相关融合基因的存在指示所述患者患有癌症或处于增加的癌症风险,其中所述癌症相关融合基因选自由以下各项组成的组:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.:113或115)、MLL3-PRKAG2(SEQ ID NO.:121、123或125)、DUS2L-PSKH1(SEQ ID NO.:131或133)以及CLDN18-ARHGAP26(SEQ ID NO:107)。
在一个方面,提供了一种确定患者是否患有癌症或是否处于增加的患癌症的风险的方法,其中所述方法包括在从患者获得的样品中检测选自由以下各项组成的组的一种或多种癌症相关融合基因:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.:113或115)、MLL3-PRKAG2(SEQ ID NO.:121、123或125)以及DUS2L-PSKH1(SEQ ID NO.:131或133);或检测选自由以下各项组成的组的一种或多种癌症相关融合基因与CLDN18-ARHGAP26(SEQ ID NO:107)的组合:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.:113或115)、MLL3-PRKAG2(SEQ ID NO.:121、123或125)以及DUS2L-PSKH(SEQ ID NO.:131或133),其中在所述样品中所述一种或多种癌症相关融合基因的存在指示所述患者患有癌症或处于增加的患癌症的风险。
在一个方面,提供了一种确定患者是否患有癌症或是否处于增加的患癌症的风险的方法,其中所述方法包括在从患者获得的样品中检测选自由以下各项组成的组的一种或多种癌症相关融合基因:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.:113或115)、MLL3-PRKAG2(SEQ ID NO.:121、123或125)、DUS2L-PSKH1(SEQ ID NO.:131或133)以及CLDN18-ARHGAP26(SEQ ID NO:107),其中在所述样品中所述一种或多种癌症相关融合基因的存在指示所述患者患有癌症或处于增加的患癌症的风险。
在一个方面,提供了一种表达载体,所述表达载体包含编码以下各项中的任一种的核酸序列:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.:113或115)、MLL3-PRKAG2(SEQ ID NO.:121、123或125)、DUS2L-PSKH1(SEQ ID NO.:131或133)或CLDN18-ARHGAP26(SEQ ID NO:107)。
在一个方面,提供了一种用如本文所公开的表达载体转化的细胞。
在一个方面,提供了一种用于产生多肽的方法,所述方法包括在适用于多肽表达的条件下培养如本文所公开的转化的细胞以及从所述细胞收集一定量的所述多肽。
在一个方面,提供了癌症相关融合基因用于在患者中对癌症进行确定或作出预后的用途,其中在从所述患者获得的样品中所述一种或多种癌症相关融合基因的存在指示所述患者患有癌症或处于增加的患癌症的风险,其中所述癌症相关融合基因选自由以下各项组成的组:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.:113或115)、MLL3-PRKAG2(SEQ ID NO.:121、123或125)以及DUS2L-PSKH1(SEQ ID NO.:131或133),或其中所述癌症相关融合基因选自由以下各项组成的组与CLDN18-ARHGAP26(SEQ ID NO:107)的组合:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.:113或115)、MLL3-PRKAG2(SEQ ID NO.:121、123或125)以及DUS2L-PSKH1(SEQ ID NO.:131或133)。
在一个方面,提供了癌症相关融合基因用于确定患者是否患有癌症或是否处于增加的癌症风险的用途,其中在从所述患者获得的样品中所述一种或多种癌症相关融合基因的存在指示所述患者患有癌症或处于增加的患癌症的风险,其中所述癌症相关融合基因选自由以下各项组成的组:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.:113或115)、MLL3-PRKAG2(SEQ ID NO.:121、123或125)以及DUS2L-PSKH1(SEQ ID NO.:131或133),或其中所述癌症相关融合基因选自由以下各项组成的组与CLDN18-ARHGAP26(SEQ ID NO:107)的组合:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.:113或115)、MLL3-PRKAG2(SEQ ID NO.:121、123或125)以及DUS2L-PSKH1(SEQ ID NO.:131或133)。
在一个方面,提供了一种用于如本文所公开的方法中的试剂盒,所述试剂盒包括:
a)选自由以下各项组成的组的第一引物:SEQ ID NO.1、SEQ ID NO.3、SEQ ID NO.5、SEQ ID NO.7以及SEQ ID NO.9;
b)选自由以下各项组成的组的第二引物:SEQ ID NO.2、SEQ ID NO.4、SEQ ID NO.6、SEQ ID NO.8以及SEQ ID NO.10;
任选地连同使用说明书。
附图说明
在结合非限制性实施例和附图考虑时,参考详细说明将更好地理解本发明,在附图中:
图1:在GC中通过DNA-PET鉴定的体细胞SV的特征。(A)示出了用于GC患者125的SV过滤程序。通过Circos在被布置成圆圈的整个人类基因组上对SV进行作图,所述圆圈在外环中具有拷贝数改变,之后是缺失、串联重复、倒位/非配对倒位、以及在内环中具有染色体间分离易位。将在患者125的血液中所鉴定出的SV(右上)从在患者125的胃肿瘤中所鉴定出的SV(左上)中扣除,从而产生对所述肿瘤具有特异性的体细胞获得性SV(底部)。(B)15例GC的体细胞SV和生殖系SV的分布。(C)在15例GC中体细胞SV和生殖系SV的比例。SV计数示于顶部。(D)与生殖系SV相比较,GC中体细胞SV的组成。SV计数示于顶部。(E)GC的体细胞SV组成与对于胰腺癌、乳腺癌以及前列腺癌所报道的体细胞SV的比较。SV被减到四个类别以允许进行比较。
图2:体细胞SV的断裂点特征提供了对机制的见解。(A-C)GC中体细胞SV的断裂点位置的表征。重复区和基因的坐标是从UCSC基因组浏览器下载的并且开放染色质区域是从DNA元件百科全书(Encyclopedia of DNA Elements,ENCODE)编译的。(D)涉及重排的基因可能具有源自于SV断裂点中的一个的小DNA片段的插入。箭头表示基因组片段。指示了断裂点坐标并且微同源性示于断裂点对的上方。(E)体细胞串联重复和染色质相互作用的重叠的实例。染色体4和放大的基因座的坐标示于顶部。示出了GC肿瘤100的体细胞59kb串联重复的PET标测坐标,上游标测区位于左侧并且下游标测区位于右侧。括号中的数字指示连接这两个区域的非冗余PET读段的数目(簇大小)。底部:在细胞系MCF-7中通过ChIA-PET所鉴定的染色质相互作用显示出两个断裂点区域之间由拱形指示的相互作用。
图3:在15例GC中所鉴定出的SV与通过ChIA-PET测序所鉴定出的染色质相互作用之间的相关性。(A)在乳腺癌(BC,n=1,935)和GC(n=1,945)中通过DNA-PET所鉴定出的体细胞SV以及GC患者(n=1,667)中的生殖系SV与乳腺癌细胞系MCF-7(n=87,253)中与RNA聚合酶II结合的远距离的染色质相互作用的重叠。绝对数示于条柱上方。与ChIA-PET相互作用重叠的SV的分数是相对于每一个数据集的SV总数(例如GC SV)计算的。所有的SV/染色质相互作用重叠均显著高于随机期望(P<0.001,基于置换)。(B)在慢性骨髓性白血病(CML,n=189)和GC(n=1,945)中通过DNA-PET所鉴定出的体细胞SV以及GC患者(n=1,667)中的生殖系SV与CML细胞系K562(n=154,130)中与RNA聚合酶II结合的远距离的染色质相互作用的重叠。所有的SV/染色质相互作用重叠均显著高于随机期望(P<0.001,基于置换)。(C、E以及G)示出了在GC患者的配对正常组织中所鉴定出的1,667个非冗余生殖系SV与MCF-7的通过ChIA-PET所鉴定出的87,253个RNA聚合酶II染色质相互作用之间的重叠特征。(D、F以及H)示出了在15例GC中所鉴定出的1,945个体细胞SV与和C、E以及G中相同的MCF-7染色质相互作用之间的重叠特征。(C)和(D)图示了SV与染色质相互作用之间重叠的比例的维恩图(Venn diagram),所述维恩图显示出小的重叠,然而,所述重叠显著大于随机期望(P<0.001,基于置换)。(E)和(F)分别与染色质相互作用位点重叠(共同)或不重叠(独特)的SV的簇大小分布的比较。(G)和(H)示出了SV与染色质相互作用位点之间距离的分布。
图4:GC中复现的CLDN18-ARHGAP26框内融合在HGC27中具有促增殖作用。(A)RefSeq基因跟踪(顶部)、通过DNA-PET测序所测定的肿瘤136的拷贝数(中间)、以及在肿瘤136中在CLDN18和ARHGAP26中具有断裂点的体细胞平衡易位的PET标测(底部)。融合的外显子的数目以红色示出。DNA-PET簇的标测区由红色的和灰色的箭头尖示出,簇大小处于括号中,虚线处于方括号中的Sanger测序验证的断裂点坐标处。肿瘤07K611T的基因组断裂点的位置(chr3:139,237,526和chr5:142,309,897)由垂直箭头指示。(B)通过对肿瘤136进行FISH来验证基因组重排。(C)带有CLDN18-ARHGAP26融合的两例胃癌的肿瘤/正常对的RT-PCR。β-肌动蛋白的RT-PCR用作阳性对照。N:正常胃组织;T:胃肿瘤;M:标志物。(D)CLDN18的外显子5的编码区中的隐蔽剪接位点使得开放阅读框延伸到ARHGAP26中。融合转录物的序列以粗体突出显示并且由垂线连接。(E)CLDN18-ARHGAP26的蛋白质结构域表意图。(F)肿瘤136的CLDN18-ARHGAP26的RT-PCR的Sanger测序色谱图。CLDN18与ARHGAP26之间的融合点由垂直虚线指示。(G)HGC27亲代细胞和具有空载体和CLDN18-ARHGAP26表达载体的稳定细胞系中的CLDN18-ARHGAP26融合转录物的qRT-PCR。(H)稳定表达CLDN18-ARHGAP26的HGC27细胞的增殖测定。测定是按一式四份进行的。误差棒表示标准差。OD450:在450nm的光密度。关于对MLL3-PRKAG2、DUS2L-PSKH1、CLEC16A-EMP2以及SNX2-PRDM6的表征,参见图5至8和实施例12。
图5:GC中复现的MLL3-PRKAG2框内融合在TMK1中具有促增殖作用。(A)从UCSC下载的RefSeq基因跟踪(顶部)、对TMK1进行的DNA-PET测序的物理覆盖度(中间)、以及在MLL3和PRKAG2中具有断裂点的体细胞缺失的PET标测(底部)。(B)如从Ensembl(www.ensembl.org)所下载的MLL3和PRKAG2的基因结构。在转录物水平上外显子-外显子的融合由对角线指示,外显子编号分别示于基因的上方和下方。沿着对角线的数字指示了对每一种融合观测的次数。(C)带有MLL3-PRKAG2融合的三例胃癌的肿瘤/正常对的RT-PCR。β-肌动蛋白的RT-PCR用作阳性对照。M:标志物;N:正常胃组织;T:胃肿瘤。(D)TMK1的MLL3-PRKAG2融合的RT-PCR的Sanger测序色谱图。MLL3与PRKAG2之间的融合点由垂直虚线指示。(E)在具有对融合点具有特异性的siRNA A和siRNA B的TMK1细胞中在基因敲低后内源性MLL3和PRKAG2以及融合转录物的定量RT-PCR(qRT-PCR)。实验是按一式三份进行的。误差棒表示三次重复测定的标准差。(F)具有靶向MLL3-PRKAG2融合的siRNA-A的TMK1细胞的增殖测定。FGFR4是在基因敲低之后阴性增殖作用的阳性对照。测定是按一式四份进行的。误差棒表示标准差。OD450:在450nm的光密度,即WST-1测定的比色读数。
图6:对复现的框内融合基因DUS2L-PSKH1的鉴定以及在融合基因敲低之后对TMK1进行的增殖分析。(A)染色体表意图(顶部),放大区域(底部)由垂直方框突出显示。放大的基因组视图在顶部示出了基因组坐标,在下方示出了UCSC基因跟踪。基于癌症体细胞突变目录(Catalogue Of Somatic Mutations In Cancer,COSMIC)中的多个条目,基因GFOD2、RANBP10、NUTF2、NRN1L、DPEP2/3、DDX28、DUS2L、以及NFATC3牵涉到癌症。TMK1的拷贝数和SV跟踪示于基因跟踪下方,物理覆盖度被示为平滑线或不平滑线并且PET标测对于5′标测区被示为左箭头并且对于3′标测区被示为右箭头。TMK1的基于串联重复的重建基因组结构示于底部。(B)带有DUS2L-PSKH1基因融合的两例胃癌的肿瘤/正常对的RT-PCR。β-肌动蛋白的RT-PCR用作阳性对照。M:标志物;N:正常胃组织;T:胃肿瘤。(C)TMK1的DUS2L-PSKH1融合的RT-PCR的Sanger测序色谱图。DUS2L与PSKH1之间的融合点由垂直虚线指示。(D)使用靶向DUS2L-PSKH1转录物的融合点的四种siRNA敲低TMK1中融合基因的表达。实验是按一式三份进行的。两次实验中的一个代表。误差棒表示三次重复测定的标准差。(E)使用针对DUS2L-PSKH1的siRNAA和siRNA C来比较融合基因的敲低对增殖特性的影响。将TMK1细胞用siRNA瞬时转染并且通过使用WST-1试剂进行比色测定来估计增殖。FGFR4用作阳性对照。实验是按一式三份进行的。误差棒表示三次重复测定的标准差。注意siRNAA和siRNA C的不一致的结果。两次实验中的一个代表。
图7:对复现的框内融合基因CLEC16A-EMP2的鉴定以及对稳定表达CLEC16A-EMP2的HGC27进行的增殖分析。(A)在肿瘤133中通过DNA-PET所鉴定的引起CLEC16A和EMP2融合的非配对倒位。牵涉到癌症的EMP2、TEKT5、NUBP1、FAM18A、CIITA以及CLEC16A的染色体表意图、基因跟踪、拷贝数以及SV表示如对于图6所述。(B)肿瘤06/0159的融合CLEC16A-EMP2的Sanger测序色谱图。CLEC16A与EMP2之间的融合点由垂直虚线指示。(C)带有CLEC16A-EMP2基因融合的两例胃癌的肿瘤/正常对的RT-PCR。β-肌动蛋白的RT-PCR用作阳性对照。M:标志物;N:正常胃组织;T:胃肿瘤。(D)对稳定表达CLEC16A-EMP2融合基因的HGC27细胞进行的qPCR分析。倍数变化是相对于亲代细胞系和用空载体稳定转染的细胞计算的。误差棒表示三次重复测定的标准差。(E)对稳定表达CLEC16A-EMP2的HGC27细胞进行的增殖测定。测定是按一式四份进行的。误差棒表示标准差。OD450:在450nm的光密度,即WST-1测定的比色读数。
图8:对复现的框内融合基因SNX2-PRDM6的鉴定以及对稳定表达SNX2-PRDM6的HGC27进行的增殖分析。(A)在肿瘤125中通过DNA-PET所鉴定的引起SNX2和PRDM6融合的缺失。染色体表意图、基因跟踪、拷贝数以及SV表示如对于图6所述。(B)肿瘤160和配对的正常组织针对SNX2-PRDM6基因融合的RT-PCR。β-肌动蛋白的RT-PCR用作阳性对照。M:标志物;N:正常胃组织;T:胃肿瘤。(C)肿瘤125的融合SNX2-PRDM6的Sanger测序色谱图。SNX2与PRDM6之间的融合点由垂直虚线指示。(D)对稳定表达SNX2-PRDM6融合基因的HGC27细胞进行的qPCR分析。倍数变化是相对于亲代细胞系和用空载体稳定转染的细胞计算的。误差棒表示三次重复测定的标准偏差。(E)对稳定表达SNX2-PRDM6的HGC27细胞进行的增殖测定。测定是按一式四份进行的。误差棒表示标准差。OD450:在450nm的光密度,即WST-1测定的比色读数。
图9:对过表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26的细胞系的表征。(A)针对CLDN18和ARHGAP26的抗体检测到CLDN18-ARHGAP26融合蛋白。用针对CLDN18和ARHGAP26的抗体对表达CLDN18-ARHGAP26的MDCK细胞进行免疫染色。(B和C)CLDN18在HeLa细胞中的强制表达恢复到上皮形态,如通过使用DAPI和针对N-钙粘蛋白(B)、β-连环蛋白(C)以及HA的抗体对稳定表达CLDN18和CLDN18-ARHGAP26融合基因的HeLa细胞进行免疫荧光分析所观测到。(D)针对N-钙粘蛋白、β-连环蛋白以及PAK1水平对未转染的HeLa以及表达CLDN18和CLDN18ΔP的稳定细胞进行的q-PCR分析。(E)经由对稳定表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26的MDCK中的紧密连接蛋白进行q-PCR分析所观测到的在表达CLDN18-ARHGAP26的MDCK细胞中紧密连接蛋白的补偿效应。倍数变化是相对于未转染的MDCK细胞计算的。(F)将稳定表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26融合的MDCK细胞固定并且用针对ZO-1、HA或GFP的抗体进行免疫染色。
图10:表达CLDN18-ARHGAP26融合的患者样品和MDCK细胞表现出上皮表型的丧失和癌症进展的增进。在正常样品和胃肿瘤患者样品中(A)CLDN18和(B)ARHGAP26的表达。对分别用针对E-钙粘蛋白的抗体和DAPI以及针对CLDN18和ARHGAP26的抗体染色的人类正常胃切片(顶部)和肿瘤胃切片(底部)的免疫荧光分析。(C)表达CLDN18-ARHGAP26融合的MDCK细胞表现出梭形和突起的形态。在亚汇合水平获得的MDCK细胞中表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26的稳定细胞系的相衬图像。(D)细胞聚集测定。将未转染的MDCK和表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26融合基因的稳定细胞系以悬滴形式接种并且在第二天获得相衬图像。(E)分别稳定表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26的MDCK细胞中的EMT标志物的qPCR。(F)和(G)通过用针对N-钙粘蛋白、β-连环蛋白(F)、Akt、pAkt、以及PAK1(G)的抗体进行免疫印迹法对未转染的HeLa和表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26融合基因的稳定细胞进行的蛋白质印迹分析。肌动蛋白用作上样对照。
图11:CLDN18-ARHGAP26表达使得细胞-ECM粘附减少。(A)顶部:细胞-ECM粘附测定。将表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26融合基因的MDCK稳定细胞系接种到未处理的板上并且在接种后两小时之时获得相衬图像。未转染的MDCK细胞用作对照。底部:对粘附到未处理的表面、I型胶原处理的表面以及纤维连接蛋白处理的表面的细胞的定量。将2×104个细胞接种到这些表面上,用PBS洗涤三次并且在PFA中固定10分钟。对每个视野的细胞数计数3次-4次。粘附的细胞的比例是相对于未转染的MDCK细胞(100%)定量的。(B)将表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26融合基因的MDCK稳定细胞系固定并且用针对激活的FAK和HA或GFP的抗体进行免疫染色。(C)在表达CLDN18-ARHGAP26的MDCK细胞中的游离边缘中不存在桩蛋白。将表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26融合基因的MDCK稳定细胞系固定并且用针对桩蛋白和HA或GFP的抗体进行免疫染色。(D)对未转染的MDCK和表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26融合基因的稳定细胞系中的粘着斑分子水平的蛋白质印迹分析。GAPDH用作上样对照。(E)表达CLDN18-ARHGAP26的MDCK中粘着斑分子的水平降低。针对粘着斑分子对表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26的MDCK稳定细胞系进行的qPCR分析。倍数变化是相对于未转染的MDCK细胞计算的。(F)对未转染的MDCK以及表达CLDN18、ARHGAP26和CLDN18-ARHGAP26的稳定细胞进行的蛋白质印迹分析。针对整合素β1和β5对印迹进行探测,并且微管蛋白用作上样对照。(G)表达CLDN18-ARHGAP26融合的MDCK中的整合素亚基水平降低。对MDCK-CLDN18、MDCK-ARHGAP26以及MDCK-CLDN18-ARHGAP26稳定细胞进行的整合素亚基qPCR分析。倍数变化是相对于未转染的MDCK细胞计算的。(H)将表达CLDN18、具有失活的C末端PDZ结合基序的CLDN18(CLDN18ΔP)、ARHGAP26、CLDN18-ARHGAP26的MDCK稳定细胞系和未转染的MDCK细胞接种到Transwell插入物中并且经过48小时的时间测量TER值。空的Transwell插入物用作阴性对照。(I)未转染的MDCK以及表达CLDN18、ARHGAP26和CLDN18-ARHGAP26的稳定细胞在汇合水平下的相衬图像。
图12:CLDN18-ARHGAP26对增殖、侵袭以及伤口闭合有细胞背景特异性作用。(A)表达CLDN18-ARHGAP26融合的MDCK细胞中的细胞增殖速率延缓。将表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26的MDCK稳定细胞系以800个细胞按一式四份接种到24孔板中。未转染的MDCK细胞用作对照。(B)伤口愈合测定。将表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26的MDCK稳定细胞系接种到μ-Dish(细胞定位格子培养皿)中的Ibidi培养插入物上,并且在第二天,将插入物剥离以产生伤口并且监测闭合。在接种前,将M-Dish板用1型胶原处理。在实验开始时以及每隔一段时间获得相衬图像。(C)将稳定表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26融合基因的HeLa细胞接种到基质胶侵袭室上。未转染的HeLa细胞用作对照。在基础培养基处添加5%FBS作为化学吸引剂并且孵育24小时。将细胞固定,洗涤并且用结晶紫染色以获得相衬图像(左侧)以及定量(右侧)侵袭基质胶的细胞数。(D)将稳定表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26的HeLa细胞和HGC27细胞接种到软琼脂上,孵育一个月并且成像(左侧)和计数(右侧)。用载体稳定转染的亲代细胞系用作对照。
图13:CLDN18和ARHGAP26调节上皮表型。(A)表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26的MDCK细胞的肌动蛋白细胞骨架染色。针对表达CLDN18和CLDN18-ARHGAP26的细胞将细胞使用HA以及与Alexa 594荧光染料缀合的鬼笔环肽进行免疫染色。箭头指示表达ARHGAP26和CLDN18-ARHGAP26的MDCK细胞中应力纤维的清除。(B)对未转染的MDCK以及表达CLDN18、ARHGAP26和CLDN18-ARHGAP26的细胞中的总RhoA进行的蛋白质印迹分析。将细胞用RhoA抗体和GAPDH进行免疫染色。(C)在表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26的MDCK细胞中进行的活性RhoA免疫荧光分析。将MDCK稳定细胞用针对活性RhoA的抗体和DAPI染色。(D)表达ARHGAP26和CLDN18-ARHGAP26的MDCK稳定细胞中的GAP活性降低。在沉降测定(G-LISA,细胞骨架)中分析GAP活性。在用Rho家族效应蛋白的RDB结构域包被的96孔板中测定内源性活性GTP结合的RhoA的量。用RhoA一抗和与HRP缀合的二抗来测定结合到板上的来自不同的稳定细胞系的细胞裂解物的Rho的GTP形式。发光值是相对于未转染的MDCK细胞计算的。(E)在37℃将表达CLDN18、ARHGAP26以及CLDN18-ARHGAP26的活HeLa细胞与和Alexa 594缀合的CTxB一起孵育15分钟,之后洗涤和固定。将细胞用HA抗体或GFP抗体和DAPI进行免疫染色。
定义
本文所用的以下词语和术语应当具有所示的含义:
如本文所用的术语“预后”或其语法上的变体指的是对临床病况或疾病的可能病程和结果的预测。患者的预后通常是通过评价疾病的因素或症状而作出的,所述因素或症状指示了所述疾病的有利的或不利的病程或结果。术语“预后”并非指的是以100%准确度预测病况的病程或结果的能力。相反,术语“预后”指的是增加的将发生某一病程或结果的概率;也就是说,在表现出给定的病况的患者中,当与没有表现出所述病况的那些个体相比时,该种病程或结果更有可能发生。举例来说,可以99%、98%、97%、96%、95%、94%、93%、92%、91%、90%、89%、88%、87%、86%、85%、84%、83%、82%、81%、80%、75%、70%、65%、60%、55%以及50%准确度对病况的病程或结果进行预测。
预后的一个实例是测试样品中标志物的存在,其中所述标志物的存在指示了有利的或不利的疾病结果。预后的另一个实例是测试样品中标志物的存在,其中所述标志物的存在指示了患者是一种类型的治疗的候选者。
如本文所用的术语“辨证治疗计划”指的是对患者或疾病亚型具有特异性的定制治疗计划。举例来说,患者样品中癌症标志物的存在指示了患者是辨证治疗计划的候选者,其中所述辨证治疗计划是靶向癌症治疗。
如本文所用的术语“样品”或“生物样品”指的是已经从受试者获得、取出或分离的细胞、组织或流体。样品的实例是肿瘤组织活检。样品可以是冷冻的新鲜组织、石蜡包埋的组织或福尔马林固定的石蜡包埋(FFPE)组织。样品的另一个实例是细胞系。流体样品的实例包括但不限于血液、血清、唾液、尿液、脑脊髓液以及骨髓液。
与基因、融合基因或其衍生的蛋白质产物有关的术语“测试……的存在”指的是针对样品中基因、融合基因或其衍生的蛋白质的存在或不存在进行筛选。与基因、融合基因或其衍生的蛋白质产物有关的术语“测试……的存在”还指的是对样品中基因、融合基因或其衍生的蛋白质产物的表达进行定量。应当了解的是,对表达进行定量包括对样品中基因、融合基因或蛋白质产物的绝对表达进行定量。
如本文所用的术语“融合基因”指的是由两个或更多个分开的基因形成的杂合基因。编码序列、非编码序列或这两者的全长或片段可以融合。融合可以通过染色体重排过程中的一个或多个来进行,所述过程包括但不限于染色体易位、倒位、重复或缺失。所述两个或更多个基因可以处于同一染色体、不同染色体或这两者的组合上。两个或更多个融合基因可以是框内融合或框外融合的。
应当了解的是,融合基因可以获得原始的未融合的基因中的一个的功能,或丧失原始的未融合的基因中的一个的功能或这两方面兼有。还将了解的是,融合基因可以获得未融合的基因中的任一个中不存在的功能。为了说明,由基因A和基因B融合的融合基因可以仅获得基因A的一种或多种功能并且丧失基因B的一种或多种功能。可选地,由基因A和基因B融合的融合基因可以获得在基因A或基因B中没有发现的功能。
因此将了解的是,带有融合基因的细胞可以具有在没有所述融合基因的细胞中没有发现的特性。
如本文所用的术语“癌症相关融合基因”指的是与癌症有关的融合基因。应当了解的是,一种或多种融合基因可能与癌症相关。举例来说,患者样品中一种或多种癌症相关融合基因的存在可以指示所述受试者患有癌症或所述受试者有增加的癌症风险。在患者样品中检测到一种或多种癌症相关融合基因还可以指示所述受试者适合靶向癌症治疗计划。癌症相关融合基因的实例包括但不限于CLEC16A-EMP2、SNX2-PRDM6、MLL3-PRKAG2、DUS2L-PSKH1以及CLDN18-ARHGAP26。应当了解的是,融合基因可以被单独或组合检测。不受理论所束缚,应当了解的是,相对于单一癌症相关融合基因的存在,多于一种癌症相关融合基因的组合的存在与更差的预后或疾病结果有关。因而,应当了解的是,多于一种癌症相关融合基因的组合的存在是疾病结果或预后的预测指标。举例来说,融合基因可以选自由以下各项组成的组:CLEC16A-EMP2、SNX2-PRDM6、MLL3-PRKAG2以及DUS2L-PSKH1与CLDN18-ARHGAP26的组合。应当了解的是,可以在样品中检测到0种、1种、2种、3种、4种、5种或更多种融合基因。举例来说,可以在样品中检测到CLEC16A-EMP2,或可以在样品中检测到CLEC16A-EMP2与CLDN18-ARHGAP26的组合。在一个实例中,CLDN18-ARHGAP26显示出CLDN18功能的丧失以及ARHGAP26功能的获得。
应当了解的是,在不同受试者的融合基因的核苷酸序列和氨基酸序列之间可能存在变异。这些遗传变异可能是由于突变、多态性或剪接变体导致的。还将了解的是,遗传变异可能导致受试者或样品中的表型变化或可能在表型上没有变化。
衍生自融合基因的蛋白质可以是功能性的或非功能性的。衍生自融合基因的蛋白质可以是延长的或截短的。如本文所用的“功能蛋白”指的是具有生物活性的多肽。应当了解的是,衍生自融合基因的功能蛋白的生物活性或生物特性可以与衍生自原始的未融合的基因中的一个的功能蛋白相同。还应当了解的是,衍生自融合基因的功能蛋白的生物活性或生物特性可以与未融合的基因的生物活性或特性不同。
如本文所用的“截短蛋白”指的是与全长的未截短的蛋白质相比具有减少数目的氨基酸的蛋白质或多肽。
如本文所用的“延长的蛋白质”指的是与全长的未截短的蛋白质相比具有增加数目的氨基酸的蛋白质。
还将了解的是,融合基因可以赋予细胞以不同的生物特性。举例来说,融合基因可以产生具有提高的迁移率、促转移特征或细胞形状变化的细胞。融合基因还可以产生丧失上皮表型、具有受损的上皮屏障特性和受损的伤口愈合特性的细胞。
本领域技术人员将了解的是,可以通过多种方法来检测融合基因的存在。实例包括但不限于聚合酶链反应(PCR)、定量PCR、微阵列、RT-PCR、DNA印迹、RNA印迹、荧光原位杂交(FISH)以及DNA测序。DNA测序包括但不限于DNA配对末端标签(DNA-PET)测序和下一代测序SOLiDTM测序。
本领域技术人员还将了解的是,多种检测剂可以用于检测融合基因。检测剂的实例包括但不限于与融合基因杂交的引物、探针以及互补核酸序列。
术语“引物”在本文用于意指能够在例如PCR技术中用作引物的任何单链寡核苷酸序列。因此,根据本公开的“引物”指的是能够用作起始点来合成与所要拷贝的核酸链基本上相同(对于正向引物)或基本上是所要拷贝的核酸链的反向互补序列(对于反向引物)的引物延伸产物的单链寡核苷酸序列。引物可以适用于例如PCR技术中。
如本文所用的术语“探针”指的是与靶序列杂交的任何核酸片段。可以将探针用放射性同位素、荧光标签、抗体或化学标记来标记以有助于检测探针。
如本文所用的“杂交”意指引物、探针或寡核苷酸与靶核酸分子在标准严格条件下形成非共价相互作用。杂交引物或寡核苷酸可以含有对形成非共价相互作用没有干扰的非杂交核苷酸,例如5′尾区或限制性内切酶识别位点以有助于克隆。
此外,如本文所用,任何“杂交”均是在严格条件下进行的。术语“严格条件”意指允许引物与等位基因扩增内的核苷酸序列特异性结合,但不与任何其它核苷酸序列特异性结合的任何杂交条件。举例来说,在“严格”杂交条件下探针与核酸靶区域的特异性杂交包括诸如3×SSC、0.1%SDS、在50℃的条件。改变温度、探针长度以及盐浓度的参数以使得可以实现特异性杂交在本领域技术人员的能力范围之内。杂交条件和洗涤条件是本领域公知的。
本领域技术人员将了解的是,可以通过多种方法来检测融合蛋白。检测融合蛋白的方法的实例包括但不限于免疫组织化学(IHC)、免疫荧光标记、蛋白质印迹、ELISA以及SDS-PAGE。
本领域技术人员还将了解的是,存在多种检测剂来定量融合蛋白的表达。检测剂的实例包括但不限于与融合蛋白特异性结合的抗体和配体。
如上所述,在从患者获得的样品中检测到一种或多种融合基因指示了癌症或增加的癌症风险。
如本文所用的“增加的癌症风险”意指受试者尚未被诊断为患有癌症,但是相对于没有所述一种或多种融合基因的对照或参照,具有增加的患有癌症的概率。
如本文所用的术语“参照”、“对照”或“标准”指的是用于进行比较以确定预后的样品或受试者。“参照”、“对照”或“标准”的实例包括从同一受试者获得的非癌样品、从非转移性肿瘤获得的样品、从未患癌症的受试者获得的样品或从患有不同的癌症亚型的受试者获得的样品。如本文所用的术语“参照”、“对照”或“标准”还可以指的是患者群组中基因或蛋白质的平均表达水平。如本文所用的术语“参照”、“对照”或“标准”还可以指的是一个细胞系或多个细胞系中融合基因或融合蛋白的存在或不存在。如本文所用的术语“参照”、“对照”或“标准”还可以指的是未患癌症或患有不同类型的癌症的受试者。参照或对照的实例是没有癌症相关融合基因中的任一种或多种的患者。
如本文所用的“癌症”指的是上皮癌。上皮癌的实例包括但不限于胃癌、肺癌、乳腺癌、泌尿生殖系统癌症、结肠癌、前列腺癌以及宫颈癌。
融合多肽可以通过将融合基因插入到表达载体中来获得。如本文所用的“表达载体”指的是用于将特定基因引入到靶细胞中的质粒。表达载体可以是瞬时表达载体或稳定表达载体。
应当了解的是,可以将细胞用表达载体转化。用于转化细胞的方法将是本领域技术人员所了解的。举例来说,可以通过电穿孔、热休克、化学转染或病毒转染来将细胞转化。
本文说明性描述的发明可以在不存在本文没有具体公开的任何一个或多个要素、一个或多个限制条件的情况下被适当地实施。因此,举例来说,术语“包含”、“包括”、“含有”等应当被宽泛地并且不加限制地解读。此外,本文所用的术语和措辞已经被用作描述性术语而非限制性术语,并且并不意图在使用这些术语和措辞时排除所示的以及所述的特征或其部分的任何等效形式,但应当认识到的是,各种修改在要求保护的本发明的范围内是可能的。因此,应当了解的是,尽管已经通过优选的实施方案和任选的特征具体地公开了本发明,但是本文所公开的其中所体现的发明的改动方案和变化方案可以依靠本领域技术人员获得,并且这些改动方案和变化方案被认为在本发明的范围内。
本发明已经在本文中被广泛地并且一般地描述。落入一般性公开内容内的较缩小种类和亚类分组中的每一个也形成本发明的一部分。这包括以从所述类中去除任何主题的附带条件或负面限制条件来一般性地说明本发明,不论所排除的内容是否在本文被具体地叙述。
其它实施方案在以下权利要求书和非限制性实施例的范围内。此外,在本发明的特征或方面以马库什组(Markush group)来描述的情况下,本领域技术人员将认识到的是,本发明因此同样以马库什组的任何单个成员或成员的亚组来描述。
具体实施方式
现在将公开对患者是否患有癌症或是否处于增加的患有癌症的风险进行确定或作出预后的方法的示例性的非限制性实施方案。
所述方法包括测试从患者获得的样品中一种或多种癌症相关融合基因或其衍生的蛋白质的存在,其中所述样品中所述一种或多种癌症相关融合基因的存在指示所述患者患有癌症或处于增加的癌症风险,其中所述癌症相关融合基因选自由以下各项组成的组:CLEC16A-EMP2、SNX2-PRDM6、MLL3-PRKAG2以及DUS2L-PSKH1,或其中所述癌症相关融合基因选自由以下各项组成的组:CLEC16A-EMP2、SNX2-PRDM6、MLL3-PRKAG2以及DUS2L-PSKH1与CLDN18-ARHGAP26的组合。
在一个实施方案中,所述癌症相关融合基因是CLEC16A-EMP2、SNX2-PRDM6、MLL3-PRKAG2、DUS2L-PSKH1或CLDN18-ARHGAP26。在一个优选的实施方案中,所述癌症相关融合基因是CLEC16A-EMP2。在一个实施方案中,所述融合基因中的2种、3种或4种选自由以下各项组成的组:CLEC16A-EMP2、SNX2-PRDM6、MLL3-PRKAG2以及DUS2L-PSKH1与CLDN18-ARHGAP26的组合。
在一个实施方案中,CLEC16A-EMP2与CLDN18-ARHGAP26组合。在一个实施方案中,SNX2-PRDM6与CLDN18-ARHGAP26组合。在一个实施方案中,MLL3-PRKAG2与CLDN18-ARHGAP26组合。在一个实施方案中,DUS2L-PSKH1与CLDN18-ARHGAP26组合。在一个优选的实施方案中,CLEC16A-EMP2与CLDN18-ARHGAP26组合。在一个优选的实施方案中,MLL3-PRKAG2与CLDN18-ARHGAP26组合。
本文所公开的方法适用于对癌症进行确定或作出预后。所述癌症可以是癌瘤、肉瘤、白血病、淋巴瘤、骨髓瘤或中枢神经系统癌症。
在一个实施方案中,所述癌症是上皮癌或癌瘤。上皮癌优选地选自由以下各项组成的组:皮肤癌、肺癌、胃癌、乳腺癌、泌尿生殖系统癌症、结肠癌、前列腺癌、宫颈癌、皮肤癌、卵巢癌、肝癌以及肾癌。在一个优选的实施方案中,所述癌症是胃癌。
如本文所述的方法适用于新鲜组织、冷冻组织、石蜡保存的组织和/或乙醇保存的组织的样品。所述样品可以是生物样品。生物样品的非限制性实例包括全血或其组分(例如血浆、血清)、尿液、唾液、淋巴、胆汁液、痰液、泪液、脑脊髓液、支气管肺泡灌洗液、滑液、精液、腹水瘤液、乳汁以及脓液。在一个实施方案中,所述样品是从血液、羊水或颊部涂片获得的。在一个优选的实施方案中,所述样品是组织活检。
如本文所考虑的生物样品包括组织样品、培养的生物材料,包括衍生自培养细胞的样品,如从培养细胞收集的培养基或细胞沉淀物。因此,生物样品可以指的是由整个生物体或它的组织、细胞或组成部分的子集制备的裂解物、匀浆或提取物,或其级分或部分。生物样品还可以在使用之前,例如通过将一种或多种组分纯化、稀释和/或离心来改良。
公知的提取程序和纯化程序可用于从样品中分离核酸。核酸可以在从样品中提取后被直接使用或更优选地,在多核苷酸扩增步骤(例如PCR)之后被使用。扩增的多核苷酸是‘衍生’自样品的。
优选地,在扩增之前使核酸序列变性。在一个实施方案中,变性包括热处理。优选地,在选自由以下各项组成的组的范围的温度进行热处理:约70℃-110℃;约75℃-105℃;约80℃-100℃;以及约85℃-95℃。优选地,在94℃进行变性步骤。
在另一个实施方案中,将变性步骤进行选自由以下各项组成的组的一段时间:约1分钟-30分钟;约2分钟-25分钟;以及约3分钟-10分钟。优选地,将变性步骤进行3分钟。
在一个优选的实施方案中,扩增步骤包括聚合酶链反应(PCR)。优选地,PCR包括在94℃持续20秒、在58℃持续30秒以及在68℃持续10分钟的15个循环;以及在94℃持续20秒、在55℃持续30秒以及在68℃持续10分钟的20个循环;以及在68℃持续15分钟的最终延伸步骤。
可以通过毛细管电泳、解链曲线分析、在DNA芯片上或下一代测序来分析一个或多个另外的扩增子。
根据本公开的引物可以另外包含可检测的标记,从而使得探针能够被检测。可以使用的标记的实例包括:荧光标志物或报告基因染料,例如6-羧基荧光素(6FAMTM)、NEDTM(Applera公司)、HEXTM或VICTM(应用生物系统公司(Applied Biosystems));TAMRATM标记(美国加利福尼亚州的应用生物系统公司(Applied Biosystems,CA,USA));化学发光标志物,例如钌探针。
可选地,标记可以选自由以下各项组成的组:电致发光标签、磁性标签、亲和标签或结合标签、核苷酸序列标签、位置特异性标签、和/或具有特定的物理特性,如不同的大小、质量、回转、离子强度、介电特性、极化或阻抗的标签。
可获得公知的提取程序和纯化程序用于从样品中分离蛋白质。蛋白质可以在从样品中提取之后被直接使用。蛋白质提取可以通过物理细胞破坏或基于洗涤剂的细胞裂解来实现。可以通过蛋白质印迹、考马斯染色、Bradford测定以及BCA测定来分析提取的蛋白质。
本文所公开的方法适用于确定患者是否是辨证治疗计划的候选者。辨证治疗计划可以包括选自由以下各项组成的组的一种或多种类型的治疗:化学治疗、免疫治疗、放射治疗、靶向治疗以及移植。辨证治疗计划还可以包括一种或多种治疗的组合。辨证治疗计划可以包括同时或相继施用的一种或多种治疗。在一个优选的实施方案中,辨证治疗是靶向治疗。在另一个优选的实施方案中,辨证治疗是靶向治疗与化学治疗的组合。在一个实施方案中,辨证治疗计划是曲妥珠单抗(transtuzumab)或雷莫芦单抗(ramucirumab)。在另一个实施方案中,辨证治疗计划是曲妥珠单抗或雷莫芦单抗与化学治疗的组合。
本文所公开的方法适用于对个人是否处于癌症风险进行确定或作出预后。如先前所述,相对于没有一种或多种融合基因的对照或参照,处于癌症风险的个人有增加的患癌症的概率。在一个实施方案中,个人或患者有增加了10%、15%、20%、25%、30%、35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、95%或99%的癌症风险。
一种或多种融合基因的核苷酸序列可以与选自由以下各项组成的组的序列至少70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%相同:CLEC16A-EMP2(SEQ ID NO.:97、99或101)、SNX2-PRDM6(SEQ ID NO.115)、MLL3PRKAG2(SEQ ID NO.:121、123或125)、DUS2L-PSKH1(SEQ ID NO.:131或133)以及CLDN18-ARHGAP26(SEQ ID NO:107)。在一个实施例中,CLEC16A-EMP2的核苷酸序列与SEQ ID NO.:9770%相同。在另一个实施例中,CLDN18-ARHGAP26的核苷酸序列与SEQ ID NO:107 95%相同。在又另一个实施例中,在癌症相关融合基因是CLEC16A-EMP2与CLDN18-ARHGAP26的组合的情况下,CLEC16A-EMP2与SEQ ID NO.97 80%相同,并且CLDN18-ARHGAP26与SEQ ID NO.10785%相同。
还提供了一种表达载体,所述表达载体包含本文所公开的融合基因中的任一种的编码序列。在一个实施方案中,所述表达载体是哺乳动物表达载体。合适的表达载体包括但不限于pMXs-Puro、pVSVG、pEGFP以及pCMVmyc。
还提供了一种用如本文所公开的表达载体转化的细胞。转化可以通过电穿孔、热休克、化学转染或病毒转染来实现。在一个实施方案中,将细胞通过化学转染来转化。在另一个实施方案中,化学转染是通过脂质体2000(Lipofectamine 2000)来实现的。在另一个实施方案中,转化是通过病毒转染来实现的。在又另一个实施方案中,病毒转染是慢病毒转染或逆转录病毒转染。
还提供了一种用于产生多肽的方法,所述方法包括在加湿室中在5%CO2和37℃将转化的细胞在含有10%牛血清、2mM谷氨酰胺、1%非必需氨基酸以及1%青霉素/链霉素的伊格氏最低必需培养基(Eagle′s Minimum Essential Medium)或杜氏改良伊格氏培养基(Dulbecco′s Modified Eagle′s Medium)或RPMI中培养以使多肽表达以及从所述细胞收集一定量的所述多肽。改变培养条件的参数以优化多肽的产生和提取在本领域技术人员的能力范围之内。
还公开了癌症相关融合基因用于在患者中对癌症进行确定或作出预后的用途,其中在从所述患者获得的样品中所述一种或多种癌症相关融合基因的存在指示所述患者患有癌症或处于增加的患癌症的风险。
实验部分
将通过参考具体实施例更详细地进一步描述本发明的非限制性实施例以及比较实施例,所述实施例不应当被视为以任何方式限制本发明的范围。
材料和方法
临床肿瘤样品
患者样品和临床信息是从已经在新加坡的国立大学医院(National University Hospital,Singapore)和新加坡的陈笃生医院(Tan Tock Seng Hospital,Singapore)接受胃癌手术的患者获得的。从所有受试者获得知情同意书并且所述研究是由新加坡国立大学的机构审查委员会(Institutional Review Board)(参考代码05-145)以及国立健保集团特定领域审查委员会(National Healthcare Group Domain Specific Review Board)(参考代码2005/00440)批准的。
从样品中提取DNA/RNA
使用Allprep DNA/RNA小型试剂盒(快而精公司(Qiagen))从组织样品中提取基因组DNA和总RNA。使用血液和细胞培养物DNA试剂盒(Blood&Cell Culture DNA kit)(快而精公司)从血液样品中提取基因组DNA。
引物和寡核苷酸
用于这一研究中的引物和寡核苷酸描述于表1中。
表1:用于这一研究中的引物。
抗体和试剂
一级和二级市售抗体和试剂描述于表2中。
表2:一级和二级市售抗体和试剂。
针对融合基因的存在进行的RT-PCR筛选
使用SuperScript III试剂盒(英杰公司(Invitrogen)),根据制造商的建议将1μg的总RNA逆转录成cDNA。以如下方案使用JumpStart RED AccuTaq LA DNA聚合酶试剂盒(西格玛公司):
循环条件如下:94℃持续3分钟;(94℃持续20秒、58℃持续30秒、68℃持续10分钟)×15个循环;(94℃持续20秒、55℃持续30秒、68℃持续10分钟)×20个循环;68℃持续15分钟。
细胞培养条件和转染
根据标准条件培养MDCK II细胞系、HeLa细胞系、HGC27细胞系以及TMK1细胞系。使用JetPrimePolyPlus转染试剂盒,根据制造商的说明书进行瞬时转染和稳定转染实验。通过G418选择来产生稳定的转染子。
DNA-PET文库构建、测序、标测以及数据分析
通过改进的生物信息过滤来进行基因组DNA的10kb片段的DNA-PET文库的构建、测序、标测以及数据分析。使用Bioscope(生命科技公司)将短读段与NCBI人类参考基因组版本36.3(hg18)比对。先前已经描述了TMK1以及肿瘤17、26、28和38的DNA-PET数据(NCBI基因表达综合数据库(Gene Expression Omnibus,GEO)登录号GSE26954)以及肿瘤82和92的DNA-PET数据(NCBI GEO登录号GSE30833)。八个另外的肿瘤/正常对的SOLiD测序数据可以在NCBI的序列读段归档(Sequence Read Archive,SRA)处在BioProject ID PRJNA234469处被访问。用于鉴定CLDN18-ARHGAP26的复现的基因组断裂点、过滤癌症基因组中的生殖系结构变异(SV)以及断裂点分布分析的程序如下所述。
对于15个GC样品中的10个,配对的正常样品是可获得的并且使用对应的DNA-PET数据从在肿瘤中被鉴定出的SV中过滤生殖系SV。为此,在界定了SV的不一致配对末端标签(dPET)序列簇的延伸标测坐标中搜索与配对的正常样品的dPET簇的重叠。此外并且特别是对于没有配对正常样品的肿瘤(肿瘤17、26、28和38)以及TMK1,使用配对正常样品和16名无关的非癌症个体的所有SV进行过滤。此外,进行模拟,其中随机地从参考序列中选择呈代表性文库的距离分布的配对序列标签并且将所述标签标测并且通过管道处理。所得的dPET簇代表标测伪影并且用于SV过滤。此外,当较大的SV与在癌症基因组中所鉴定出的SV重叠≥80%时,将dPET簇与基因组变体的数据库(http://dgv.tcag.ca/dgv/app/home)(对非癌症个体的配对末端测序研究)中的SV相比较。通过标准管道进行的数据处理由于异常插入序列大小分布而对于患者82的血液样品产生大量小的缺失,并且去除小于12kb的所有缺失。
MCF-7 RNA聚合酶II ChIA-PET和GC DNA-PET比较
为了研究所述研究的生殖系SV和体细胞SV的两个伴侣位点是否对于核中彼此接近的基因座是富集的,测试SV与通过对乳腺癌细胞系MCF-7进行ChIA-PET测序所获得的全基因组染色质相互作用数据集的重叠,基本原理是一些染色质相互作用在不同的细胞类型间可能是保守的。
驱动融合基因预测
通过如先前所述的计算机模拟分析来预测潜在的驱动融合基因。计算机模拟分析是网络融合中心(network fusion centrality)法,其中使用转录物网络内基因产物的位置来预测它对于网络起作用的重要性。设定阈值0.37来鉴定潜在的融合驱动基因。
通过RT-PCR进行的框内融合基因确认和筛选
使用用于RT-PCR的SuperScript III第一链合成系统(英杰公司),根据制造商的说明书将一微克的总RNA逆转录成cDNA。使用JumpStartTM REDAccuTaq LA DNA聚合酶(西格玛-奥德里奇公司)进行PCR。
GC融合基因构建体和逆转录病毒转染
通过PCR,使用含有HF缓冲液的2×Phusion主混合物(赛默科技公司)和以下引物从肿瘤样品中扩增GC融合基因CLEC16A-EMP2、CLDN18-ARHGAP26、SNX2-PRDM6以及DUS2L-PSKH1。
使用以下正向引物和反向引物,在框内用pMXs-Puro的FLAG肽来构建CLEC16A-EMP2融合体的开放阅读框:正向引物5′GGCGCGGATCCGCCGCCACCATGTTTGGCCGCTCGCGGAG-3′(SEQ ID NO.11)(BamHI、kozak序列以及起始密码子,之后是CLEC16A的第一编码核苷酸)和反向引物5′-TGATAGCGGCCGCTCATCAAGCGTAATCTGGAACATCGTATGGGTACTCGAGTTTGCGCTTCCTCAGTATCAG-3′(SEQ ID NO.:12)(NotI、终止密码子、HA标签以及XhoI,之后是EMP2的编码序列的3′末端)。
类似地,用以下正向引物和反向引物构建CLDN18-ARHGAP26融合体的开放阅读框:正向引物5′GGCGCGGATCCGCCGCCACCATGGCCGTGACTGCCTGTCA-3′(SEQ ID NO.:13)(BamHI、kozak、起始密码子、CLDN18)和反向引物5′-GATAGCGGCCGCTCATCAAGCGTAATCTGGAACATCGTATGGGTACTCGAGGAGGAACTCCACGTAATTCTCA-3′(SEQ ID NO.:14)(NotI、终止密码子、HA标签、XhoI、ARHGAP26)。
使用以下正向引物和反向引物构建SNX2-PRDM6融合体的开放阅读框:正向引物5′-GGCGCTTAATTAAGCCGCCACCATGGCGGCCGAGAGGGAACC-3′(SEQ ID NO.:15)(PacI、kozak、起始密码子、SNX2)和反向引物5′-TGATAGCGGCCGCTCATCAAGCGTAATCTGGAACATCGTATGGGTACTCGAGATCCACTTCGATTGATTCTGG-3′(SEQ ID NO.:16)(NotI、终止密码子、HA标签、XhoI、PRDM6)。
使用以下正向引物和反向引物构建DUS2L-PSKH1融合体的开放阅读框:正向引物5′-GGCGCGGATCCGCCGCCACCATGATTTTGAATAGCCTCTC-3′(SEQ ID NO.:17)(BamHI、kozak、起始密码子、DUS2L)和反向引物5′-TGATAGCGGCCGCTCATCAAGCGTAATCTGGAACATCGTATGGGTACTCGAGGCCATTGTATTGCTGCTGGTAG-3′(SEQ ID NO.:18)(NotI、终止密码子、HA标签、XhoI、PSKH1)。
通过gBlock法(集成DNA技术公司(Integrated DNA Technologies,Inc))用pMXs-Puro的FLAG肽来合成MLL3-PRKAG2。将PCR产物或MLL3-PRKAG2克隆到pMXs-Puro逆转录病毒载体(细胞生物实验室公司(Cell biolabs),RTV-012)中。使用脂质体2000将含有融合基因的pMXs-Puro逆转录病毒载体与pVSVG(假型化构建体)一起共转染到GP2-293细胞中以产生病毒。然后将HGC27细胞和HeLa细胞这两者用含有空载体或融合基因的病毒上清液感染。在嘌呤霉素二盐酸盐(西格玛公司,P9620)的选择压力下获得和维持稳定的转染子。
CLDN18质粒和ARHGAP26质粒的构建
从IMAGE联盟(IMAGE consortium)(http://www.imageconsortium.org/)获得人类CLDN18 cDNA,并且将它与N末端HA标签一起克隆到pcDNA3载体中。CLDN18的编码PDZ结合基序的最后三个氨基酸(DYV)突变成丙氨酸并且被称为CLDN18ΔP。pEGFP载体中的人类ARHGAP26(GRAF1亚型2)cDNA和pCMVmyc是由Richard Lundmark博士(瑞典默奥的默奥大学(邮编:90187)的医药生物化学和生物物理学系(Medical Biochemistry and Biophysics,UniVersity,901 87Sweden))友情提供的。
ARHGAP26同种型的细节如下:
转录物:ARHGAP26-008ENST00000378004(http://www.ensembl.org)(SEQ ID NO.:135)
ATGGGGCTCCCAGCGCTCGAGTTCAGCGACTGCTGCCTCGATAGTCCGCACTTCCGAGAGACGCTCAAGTCGCACGAAGCAGAGCTGGACAAGACCAACAAATTCATCAAGGAGCTCATCAAGGACGGGAAGTCACTCATAAGCGCGCTCAAGAATTTGTCTTCAGCGAAGCGGAAGTTTGCAGATTCCTTAAATGAATTTAAATTTCAGTGCATAGGAGATGCAGAAACAGATGATGAGATGTGTATAGCAAGATCTTTGCAGGAGTTTGCCACTGTCCTCAGGAATCTTGAAGATGAACGGATACGGATGATTGAGAATGCCAGCGAGGTGCTCATCACTCCCTTGGAGAAGTTTCGAAAGGAACAGATCGGGGCTGCCAAGGAAGCCAAAAAGAAGTATGACAAAGAGACAGAAAAGTATTGTGGCATCTTAGAAAAACACTTGAATTTGTCTTCCAAAAAGAAAGAATCTCAGCTTCAGGAGGCAGACAGCCAAGTGGACCTGGTCCGGCAGCATTTCTATGAAGTATCCCTGGAATATGTCTTCAAGGTGCAGGAAGTCCAAGAGAGAAAGATGTTTGAGTTTGTGGAGCCTCTGCTGGCCTTCCTGCAAGGACTCTTCACTTTCTATCACCATGGTTACGAACTGGCCAAGGATTTCGGGGACTTCAAGACACAGTTAACCATTAGCATACAGAACACAAGAAATCGCTTTGAAGGCACTAGATCAGAAGTGGAATCACTGATGAAAAAGATGAAGGAGAATCCCCTTGAGCACAAGACCATCAGTCCCTACACCATGGAGGGATACCTCTACGTGCAGGAGAAACGTCACTTTGGAACTTCTTGGGTGAAGCACTACTGTACATATCAACGGGATTCCAAACAAATCACCATGGTACCATTTGACCAAAAGTCAGGAGGAAAAGGGGGAGAAGATGAATCAGTTATCCTCAAATCCTGCACACGGCGGAAAACAGACTCCATTGAGAAGAGGTTTTGCTTTGATGTGGAAGCAGTAGACAGGCCAGGGGTTATCACCATGCAAGCTTTGTCGGAAGAGGACCGGAGGCTCTGGATGGAAGCCATGGATGGCCGGGAACCTGTCTACAACTCGAACAAAGACAGCCAGAGTGAAGGGACTGCGCAGTTGGACAGCATTGGCTTCAGCATAATCAGGAAATGCATCCATGCTGTGGAAACCAGAGGGATCAACGAGCAAGGGCTGTATCGAATTGTGGGTGTCAACTCCAGAGTGCAGAAGTTGCTGAGTGTCCTGATGGACCCCAAGACTGCTTCTGAGACAGAAACAGATATCTGTGCTGAATGGGAGATAAAGACCATCACTAGTGCTCTGAAGACCTACCTAAGAATGCTTCCAGGACCACTCATGATGTACCAGTTTCAAAGAAGTTTCATCAAAGCAGCAAAACTGGAGAACCAGGAGTCTCGGGTCTCTGAAATCCACAGCCTTGTTCATCGGCTCCCAGAGAAAAATCGGCAGATGTTACAGCTGCTCATGAACCACTTGGCAAATGTTGCTAACAACCACAAGCAGAATTTGATGACGGTGGCAAACCTTGGTGTGGTGTTTGGACCCACTCTGCTGAGGCCTCAGGAAGAAACAGTAGCAGCCATCATGGACATCAAATTTCAGAACATTGTCATTGAGATCCTAATAGAAAACCACGAAAAGATATTTAACACCGTGCCCGATATGCCTCTCACCAATGCCCAGCTGCACCTGTCTCGGAAGAAGAGCAGTGACTCCAAGCCCCCGTCCTGCAGCGAGAGGCCCCTGACGCTCTTCCACACCGTTCAGTCAACAGAGAAACAGGAACAAAGGAACAGCATCATCAACTCCAGTTTGGAATCTGTCTCATCAAATCCAAACAGCATCCTTAATTCCAGCAGCAGCTTACAGCCCAACATGAACTCCAGTGACCCAGACCTGGCTGTGGTCAAACCCACCCGGCCCAACTCACTCCCCCCGAATCCAAGCCCAACTTCACCCCTCTCGCCATCTTGGCCCATGTTCTCGGCGCCATCCAGCCCTATGCCCACCTCATCCACGTCCAGCGACTCATCCCCCGTCAGCACACCGTTCCGGAAGGCAAAAGCCTTGTATGCCTGCAAAGCTGAACATGACTCAGAACTTTCGTTCACAGCAGGCACGGTCTTCGATAACGTTCACCCATCTCAGGAGCCTGGCTGGTTGGAGGGGACTCTGAACGGAAAGACTGGCCTCATCCCTGAGAATTACGTGGAGTTCCTC
在框内随后是HA标签,之后是终止密码子。人类流感血凝素(HA)标签具有以下核苷酸序列之一:5′TAC CCA TAC GAT GTT CCA GAT TAC GCT 3′或5′TAT CCA TAT GAT GTT CCA GAT TAT GCT 3′。还将了解的是,终止密码子可以选自以下中的任一个:TAG、TAA或TGA。
融合基因复现率显著性检验
使用随机化框架来评估融合基因的观测频率的统计显著性。限定了SV谱,所述SV谱模拟通过DNA-PET测序的样品中所鉴定出的SV的类型、数目以及大小分布。使用SV谱模拟15例GC测试数据集的SV并且评估85个GC样品的模拟验证集上的复现SV的频率。假设N=10,000是随机模拟的次数并且es是测试数据集中存在的SVs在验证数据集中的频率,P值(es)被定义为p/N,其中p是模拟次数,其中在频率ek≥es的情况下,SVk存在。
细胞聚集、细胞粘附以及伤口愈合测定
对于细胞聚集测定,将20μl的1.2×106个/毫升的细胞以悬滴形式接种到组织培养皿上,并且在第二天使用Nikon Eclipse TE2000-S获得相衬图像。
对于细胞粘附测定,24孔板未经处理或将24孔板用1mg/ml的纤维连接蛋白和10μg/ml的大鼠1型胶原处理2小时并且用0.1%BSA封闭。将2.5×104个/毫升的细胞接种并且在37℃孵育2小时。
详细地说,将24孔板用1mg/ml的纤维连接蛋白和10μg/ml的大鼠1型胶原处理2小时。随后对板进行洗涤并且通过将表面用0.1%牛血清白蛋白(BSA)处理20分钟来阻止非特异性结合。将表面再次用PBS洗涤并且将2.5×104个/毫升的细胞接种并且在37℃孵育2小时。还将细胞接种到未处理的24孔板中作为对照。使用相差显微术使细胞成像。对于粘附到表面的细胞的定量,将细胞轻轻地用PBS洗涤三次并且在PFA中固定并且计数。
对于伤口愈合测定,将70μL的7×105个细胞/毫升接种到μ-Dish 35mm(Ibidi公司)中的培养插入物上。在第二天,将插入物剥离以产生伤口并且用Nikon Eclispe TE2000使迁移成像直到伤口闭合为止。
细胞增殖测定
将800个细胞对于每一种条件按一式四份接种到24孔板中,并且根据制造商的说明书(细胞增殖试剂WST-1;罗氏公司(Roche))获取读数,持续7天。使用Infinite M200 Quad4单色器(Tecan公司),在450nm,使用650nm的参考波长测量吸光度。
细胞侵袭迁移测定
根据制造商的说明书(康宁公司(Corning))将0.5ml的于RPMI无血清培养基中的1×105个稳定转染的HeLa细胞和MDCK细胞接种到Biocoat基质胶侵袭室中,将于培养基中5%的FBS作为化学吸引剂添加到基质胶侵袭室的孔中,持续24小时。确切地说,根据制造商的说明书(康宁公司)将0.5ml的于RPMI无血清培养基中的1×105个用CLDN18、ARHGAP26以及CLDN18-ARHGAP26稳定转染的HeLa细胞和MDCK细胞接种到Biocoat基质胶侵袭室中。将于培养基中5%的FBS作为化学吸引剂添加到基质胶侵袭室的孔中,持续24小时。在第二天,将细胞在3.7%PFA中固定10分钟并且将插入物用PBS洗涤。将0.1%的结晶紫添加到插入物中,持续10分钟并且用水洗涤两次。使用棉签去除任何非侵袭细胞并且再次洗涤。使用Nikon Eclipse TE2000-S使侵袭细胞的数目成像并且计数。
跨上皮电阻(TER)分析
将2×105个稳定转染的MDCK细胞接种到12mm Transwell插入物(康宁公司)上以获得极化单层。在第二天,将插入物放置在CellZcope(nanoAnalytics公司)中以测量TER。
软琼脂集落形成测定
将HeLa和HGC27稳定细胞系的5000个细胞添加到2ml软琼脂(0.35%诺布尔琼脂(Noble agar)和2×FBS培养基)中并且接种到凝固的基层(含2×FBS培养基的0.7%诺布尔琼脂)上,对于每一次实验设置三次重复测定。在2周-4周后,对集落进行计数。
融合基因
在这一研究中使用5种融合基因,如下表3中所详述。
表3:融合基因
有关这五种复现的融合基因的细节如下所述。
所有的基因组坐标是基于2009年2月人类参考序列(GRCh37或hg19;http://genome.ucsc.edu/)。转录物ID是基于Ensembl基因组数据库(http://www.ensembl.org/)。加黄色阴影的是如在最初的筛选中所发现的5′融合伴侣基因的编码部分,并且加绿色阴影的是3′融合伴侣基因。
融合基因#1:CLEC16A-EMP2
CLEC16A
基因组PCR确认的断裂点-chr16:11073471
外显子9中由RT-PCR确认的RNA融合点-chr16:11073239
EMP2
基因组PCR确认的断裂点-chr16:10666428
外显子2(5′UTR)中由RT-PCR确认的RNA融合点-chr16:10641534
转录物:CLEC16A-001 ENST00000409790
cDNA序列(SEQ ID NO.93),融合基因的编码部分加阴影。
AACTGCATTTCCCAGCGCCCCACGCGGCGGCGGCCGTAAAGCGCGGCGGTCGAACGGCCGGTTCCGGCTGAATGTCAGTGCTGGGCTGTGGGCCGGGGAGGAAGGCGGCTCGCGGTTCCTCCACCGCCTCCGCCGCCGCATCCTCCGCTTGTGCTACCGCCGCGGGCGCTGGGCCGCTCTGCTGGTCCGGCATGAGACCGTGAGACGAGAGACGGGTCGGGGCCGCCGACATGTTTGGCCGCTCGCGGAGCTGGGTGGGCGGGGGCCATGGCAAGACTTCCCGCAACATCCACTCCTTGGACCACCTCAAGTATCTGTACCACGTTTTGACCAAAAACACCACAGTCACAGAACAGAACCGGAACCTGCTAGTGGAGACCATCCGTTCCATCACTGAGATCCTGATCTGGGGAGATCAAAATGACAGCTCTGTATTTGACTTCTTCCTGGAGAAGAATATGTTTGTTTTCTTCTTGAACATCTTGCGGCAAAAGTCGGGCCGTTACGTGTGCGTTCAGCTGCTGCAGACCTTGAACATCCTCTTTGAGAACATCAGTCACGAGACCTCACTTTATTATTTGCTCTCAAATAACTACGTAAATTCTATCATCGTTCATAAATTTGACTTTTCTGATGAGGAGATTATGGCCTATTATATATCGTTCCTGAAAACACTTTCGTTAAAACTCAACAACCACACTGTCCATTTCTTTTATAATGAGCACACCAATGACTTTGCCCTGTACACAGAAGCCATCAAGTTTTTCAACCACCCTGAAAGCATGGTTAGAATTGCTGTAAGAACCATAACTTTGAATGTCTATAAAGTGTCATTGGATAACCAGGCCATGCTGCACTACATCCGAGATAAAACTGCTGTTCCTTACTTCTCCAATTTGGTCTGGTTCATTGGGAGCCATGTGATCGAACTCGATGACTGCGTGCAGACTGATGAGGAGCATCGGAATCGGGGTAAACTGAGTGATCTGGTGGCAGAGCACCTAGACCACCTGCACTATCTCAATGACATCCTGATCATCAACTGTGAGTTCCTCAACGATGTGCTCACTGACCACCTGCTCAACAGGCTCTTCCTGCCCCTCTACGTGTACTCACTGGAGAACCAGGACAAGGGAGGAGAACGGCCGAAAATTAGCCTGCCGGTGTCTCTTTATCTTCTGTCACAGGTCTTCTTAATTATACATCATGCACCGCTGGTGAACTCGTTAGCTGAAGTCATTCTGAATGGTGATCTGTCTGAGATGTACGCTAAGACTGAACAGGATATTCAGAGAAGTTCTGCCAAGCCCAGCATTCGGTGCTTCATTAAACCCACCGAGACACTCGAGCGGTCCCTTGAGATGAACAAGCACAAGGGCAAGAGGCGGGTGCAAAAGAGACCCAACTACAAAAACGTTGGGGAAGAAGAAGATGAGGAGAAAGGGCCCACCGAGGATGCCCAAGAAGACGCCGAGAAGGCTAAAGGTACAGAGGGTGGTTCAAAAGGCATCAAGACGAGTGGGGAGAGTGAAGAGATCGAGATGGTGATCATGGAGCGTAGCAAGCTCTCAGAGCTGGCCGCCAGCACCTCCGTGCAGGAGCAGAACACCACGGACGAGGAGAAAAGCGCCGCCGCCACCTGCTCTGAGAGCACGCAATGGAGCAGACCCTTCCTGGATATGGTGTACCACGCGCTGGACAGCCCGGATGATGATTACCATGCCCTGTTCGTGCTCTGCCTCCTCTATGCCATGTCTCATAATAAAGGCATGGATCCTGAAAAATTAGAGCGAATCCAGCTCCCCGTGCCAAATGCGGCCGAGAAGACCACCTACAACCACCCGCTAGCTGAAAGACTCATCAGGATCATGAACAACGCTGCCCAGCCAGATGGGAAGATCCGGCTGGCGACGCTGGAGCTGAGCTGCCTGCTTCTGAAGCAGCAAGTCCTGATGAGTGCTGGCTGCATCATGAAGGACGTGCACCTGGCCTGCCTGGAGGGTGCGAGAGAAGAAAGTGTTCACCTTGTACGACATTTTTATAAGGGAGAAGACATTTTTTTGGACATGTTTGAAGATGAGTATAGGAGCATGACAATGAAGCCCATGAACGTGGAATATCTCATGATGGACGCCTCCATCCTGCTGCCCCCAACAGGCACGCCACTGACGGGCATTGACTTCGTGAAGCGGCTGCCGTGTGGCGATGTGGAGAAGACCCGGCGGGCCATCCGGGTGTTCTTCATGCTGCGTTCCCTGTCACTGCAATTGCGAGGGGAGCCTGAGACACAGTTGCCGCTGACTCGGGAGGAGGACCTGATCAAGACTGATGATGTCCTGGATCTGAATAACAGCGACTTGATTGCATGTACAGTGATCACCAAGGATGGCGGCATGGTCCAGCGATTCCTGGCTGTGGATATTTACCAGATGAGTTTGGTGGAGCCTGATGTGTCCAGGCTTGGCTGGGGAGTGGTCAAGTTTGCAGGCCTATTGCAGGACATGCAGGTGACTGGCGTGGAGGACGACAGCCGTGCCCTGAACATCACCATCCACAAGCCTGCGTCCAGCCCCCATTCCAAGCCCTTCCCCATCCTCCAGGCCACCTTCATCTTCTCAGACCACATCCGCTGCATCATCGCCAAGCAGCGCCTGGCCAAAGGCCGCATCCAGGCAAGGCGCATGAAGATGCAGAGAATAGCTGCCCTCCTGGACCTCCCAATCCAGCCCACCACTGAAGTCCTGGGGTTTGGACTCGGCTCCTCCACCTCCACTCAGCACCTGCCTTTCCGCTTCTACGACCAGGGGCGCCGGGGCAGCAGCGACCCCACAGTGCAGCGCTCCGTGTTTGCATCGGTGGACAAGGTGCCAGGCTTCGCCGTGGCCCAGTGCATAAACCAGCACAGCTCCCCGTCCCTGTCCTCACAGTCGCCACCCTCCGCCAGCGGGAGCCCCAGCGGCAGCGGGAGCACCAGCCACTGCGACTCTGGAGGCACCAGCTCGTCCTCCACCCCCTCCACAGCCCAGAGTCCAGCAGATGCCCCCATGAGTCCAGAACTGCCTAAGCCTCACCTTCCTGACCAGTTGGTAATCGTCAACGAAACGGAAGCAGACTCTAAGCCCAGCAAGAACGTGGCCAGGAGCGCAGCCGTGGAGACAGCCAGCCTGTCCCCCAGCCTCGTCCCTGCCCGGCAGCCCACCATTTCCCTGCTCTGCGAGGACACGGCTGACACGCTGAGCGTCGAATCGCTGACCCTTGTCCCCCCAGTTGACCCCCACAGCCTCCGCAGCCTCACCGGCATGCCCCCGCTGTCCACGCCGGCTGCCGCCTGCACAGAGCCCGTGGGCGAAGAGGCTGCATGTGCTGAGCCTGTGGGCACCGCTGAGGACTGAGTCAGTGCCGGGGCCTCCCTTTGTGTGTGTGGCCCCGCTGGTAGGGACCCCAGTGCCGCTGACTGGCAAGACACACTGGGAGCACCCACCATTCTGTGCGGCCCCCAGCAGCCATCTCAACCACCTATCCCTGCGCTCCCTTGAATGGGAAGAAGCCCCACGTTGTCCTTGAATTCCTTTTTCACTTTGCATCTCTTCACGTGCAGGCTGGGACCAGCGGAGACACCGCGGCGAATGCAGATGACTGCACCGGCCACTCAGGGAGCTGCCTGGGCTCCGTGTCTCTGAGCCCCGGGTGGCAGGACCCACCGGCACCTCTTTCTTCCTCTGTCATATGGCTCCTCTGTCACCAGCCCCAGTGTGCACAGAAGAATTGGACCAGGTCACTGTACGTAGAAATTTGTAGAAAAGCAGACTTAGATAAACATCTCCTTTGGATATTTATTTCCGCTTTTGGCAGCAGGTGAACATTTATTTTTAAAACTTCTATTTAAAAGAAGTCCAAAAACATCAACACTAAGGTTTGATGTCATGTGAAAAGTGTAATAATAACAGTTAAGATTTCATGATCATTTTCACTGGACCTTTCCTGATATTTTGTTTCAGAGTTCTTAGTGTGGCTTTTTCCATTTATTTAAGTGATTCTTTGTTACTCACTAACTCTGCAAGCCTGTGGAATAATGAAGTACCTTCCTGGAAAGTTTGGATTATTTTTTAAACAAAAACAAGGGAGATACATGTATTCTCAGGTACACACAGAGCTGAGAGGGCTGAATGGTTTTCTGCTATAGCAGCCGAGAGGCCTCCCATCATGGAAAGATTTCTCCAGGAAAAGGAGGAATGTAGCCAGCTCCCCACTCAGGACGCTTCCTCATTTCTCTTCACCAAAACCAAACAGAGACAGCTTCCAGCACCTTCTTCAGTGTTACCATCTCTAAGAAGGAACCAGTTGGGACCGTGAAGACTCCCGACCCTGTGGCCATGATGGAAATCAAAGGAAGACACCCTCTACGTCACCTGCCCTCGACTGTGTGTGCCCACATGTGCCGAGAGATGGCCCAGAGCCAGTTCCCCTCCAGCTGCAAGGGCATGGTGTCCCCAGAGCTCTGAGTCTGTCACTCTCCCTCTGCTACTGCTGCTGATCTGAATATGGAAACCCCATGGTTCCCTTCCCCATTCGGACTGGGTGTGTACAAGCAAGGACCCAGATGCATCAGACACAGCCCCCAAGATGTTCCTTTCTACTCGGCCAGCTCGGGAGCCAGACACAGCACTCACAGCCCAGGCCGTGATCCACCCTCCCCAAGTCCACCAGGGCCAGCGGCCCCTCACCTCTCTGGTCACTGGTGAGACCTTCCACAACTTTCCTCCAGACCTGCCAGCAGATGTGCCCACCAGGGGCATTAGGTATCCGCCGGAGCCTGGCCATAGGGTAGTCTCGGGAGCCGCGCTGAGATCTTTTGCCACCTGCATTTTAGAAGAACATGGTCTCTGTCTCCTCGGCCCAGCCAGCTGTCCCGGCAAGGCCTGCCGAGGGCAGTTTTCAACCTCATGAAGGAAACACAGTCCTGCCAAGGAGGGGGAGTGGCGCCCATGGGGACAGGCCTCAGTCCTTAGAAGCCCTCTGGGTAGCTGTGCCCACCCAGCCTTCATGGCTGCAGGTACAAGGACCTTTGCTTCCATAGAGAAAACGCACAGCTCAGAAAGGGGGCCACATGGGCAGAAACCCAAAGGAAGGACAAACCACGACCACCGTGGCCATCTGCAGAATCCCTGGAAGAGAAGGAAGGCAGGGTGGAGCGGGGGGAAGACCATCATGGAGAGAAGGACCACAGCATCAGGAGACGGGACACGCCACACCCAGCAGGCAGCCTGTGTGTTGCTTAATTTTTTAAGAGCAAGAGGGGTAGAGAGGATCAAGCTGGCCCTGGCTGGAGATGGCTAGCCCCTGAGACATGCACTTCTGGTTTTGAAATGACTCTGTCTGTGGGGCAGCAGAAACTAGAGAAGGCAAGTGGCTGCCCCACCCCAAGGCGTGACCAGGAGGAACAGCCTGCAGCTCACTCCATGCCACACGGGTGGGCCACCAGCCTGCTGTCAGAAGTCTCTGGGCTCCAACTGGTCTTGTAACCACTGAGCACTGAAGGAGAGAGGTCTTGGTCAGGGCTGGACAGCATGCCCGGGAGGACCAGCAGAGGATTAAAGGTGACTGGGAGGACCAGCGGAGGATAAAAGACACTGCTCAGGGCAGGGCTTCTACCCTGCATCCCTGGCCAAGAAAAGGGCAGTCCCCATGTGGGCTTGCAGGGTCACTCTCAGGGGCCTCTTTCAGCTGGGGCTGGCAACTTGCGTCTGGGGGACACCTCCAGGTGTGTGGGGTGAGGATTTCCTATAACCAGGGCTCCCAGAAGCTTTGCTTATGTAAGGAGGTCTGGGAGCCAGCCCATTGGAGGCCACCAGCCATTTTGGCTTCAAAGGACCCCACCTCACCCAGGTCTCAGCGGCAGTGGGCACAGCTATGTCTTCAGGAGCTCCCGTCAAACCTCATAGCTGGGGCGCTCCCAGACAGGCCAGTCCAGACAGGACACGCTGGGCCCCTGGCATCCAGAGGAAGAGCCAGGAGTGTGGGAAGGCCCACAGTGGGGGCTGTGGCTTCTGACACTCAGGTCATAGCCTCAGAGGTCTGAGGTCAGCCCCCACAGACCCATCCGGCCCGCCCCCCAAGTCCCTGCAGAGAGCACTTAGAGTTATGGCCCAGGCCCTGGTCCACCCTTCCCCTGTGCACCTCCGGCTGGGTTTGCCAAGTCAGGGAGCAGGGCTGGCCGCAGGAACTCCCAAACCTTGGCTTTGAATATTGTTGTGGAGGTGTGCTCGTCCCTTTCTGGACGTGCAAGGTACCTGTCCCAGCAGGTCAGATGGGGCCAGCTGAGGCGCTCCCCCAGGCAGGAAGGGCCAGCCTTCACCATCGCGTGGGATTGGGAGGAGGGGCCTCCGTGAGCAGCCCCTCCTCTGCCGCTGTCCCAGCCCAGTCCCTCTCCCGGAGCCTTGGCAGCCTCCCACAACCCAGACACTTGCGTTCACAAGCAACCTAAGGGGCAGGTGAAGAAGCGCAGCCCTGCCAGACGCGCTAGATTCCTCTAAGGTCTCTGAGATGCACCGTTTTTTAAAAAGGCGTGGGGTGAACTGATTTTGATCTTCTTGTCTAGATGCAATAAATAAATCTGAAGCATTTAATGTAGTCATCTTGACATTGGGCCTACACTGTACGAGTTCCTTATGTTTCCTTGAGCTAAAAATATGTAAATAATTTTTGTCCCAGTGAGAACCGAGGGTTAGAAAACCTCGATGCCTCTGAGCCTCGGGACCGCTCTAGGGAAGTACCTGCTTTCGCCAGCATGACTCATGCTTCGTGGGTACTGAACACGAGGGTGGAAATGAAAACTGGAACTTCCTTGTAAATTTAAACTTGGCAATAAAAGAGAAAAAAAGTTACCAAGAA
转录物:CLEC16A-001 ENST00000409790
蛋白质序列(SEQ ID NO.:94),融合基因的编码部分加阴影。
MFGRSRSWVGGGHGKTSRNIHSLDHLKYLYHVLTKNTTVTEQNRNLLVETIRSITEILIWGDQNDSSVFDFFLEKNMFVFFLNILRQKSGRYVCVQLLQTLNILFENISHETSLYYLLSNNYVNSIIVHKFDFSDEEIMAYYISFLKTLSLKLNNHTVHFFYNEHTNDFALYTEAIKFFNHPESMVRIAVRTITLNVYKVSLDNQAMLHYIRDKTAVPYFSNLVWFIGSHVIELDDCVQTDEEHRNRGKLSDLVAEHLDHLHYLNDILIINCEFLNDVLTDHLLNRLFLPLYVYSLENQDKGGERPKISLPVSLYLLSQVFLIIHHAPLVNSLAEVILNGDLSEMYAKTEQDIQRSSAKPSIRCFIKPTETLERSLEMNKHKGKRRVQKRPNYKNVGEEEDEEKGPTEDAQEDAEKAKGTEGGSKGIKTSGESEEIEMVIMERSKLSELAASTSVQEQNTTDEEKSAAATCSESTQWSRPFLDMVYHALDSPDDDYHALFVLCLLYAMSHNKGMDPEKLERIQLPVPNAAEKTTYNHPLAERLIRIMNNAAQPDGKIRLATLELSCLLLKQQVLMSAGCIMKDVHLACLEGAREESVHLVRHFYKGEDIFLDMFEDEYRSMTMKPMNVEYLMMDASILLPPTGTPLTGIDFVKRLPCGDVEKTRRAIRVFFMLRSLSLQLRGEPETQLPLTREEDLIKTDDVLDLNNSDLIACTVITKDGGMVQRFLAVDIYQMSLVEPDVSRLGWGVVKFAGLLQDMQVTGVEDDSRALNITIHKPASSPHSKPFPILQATFIFSDHIRCIIAKQRLAKGRIQARRMKMQRIAALLDLPIQPTTEVLGFGLGSSTSTQHLPFRFYDQGRRGSSDPTVQRSVFASVDKVPGFAVAQCINQHSSPSLSSQSPPSASGSPSGSGSTSHCDSGGTSSSSTPSTAQSPADAPMSPELPKPHLPDQLVIVNETEADSKPSKNVARSAAVETASLSPSLVPARQPTISLLCEDTADTLSVESLTLVPPVDPHSLRSLTGMPPLSTPAAACTEPVGEEAACAEPVGTAED
转录物:EMP2-001 ENST00000359543
cDNA序列(SEQ ID NO.:95),融合基因的编码部分加阴影。
GGCGGGATCGGGGAAGGAGGGGCCCCGCCGCCTAGAGGGTGGAGGGAGGGCGCGCAGTCCCAGCCCAGAGCTTCAAAACAGCCCGGCGGCCTCGCCTCGCACCCCCAGCCAGTCCGTCGATCCAGCTGCCAGCGCAGCCGCCAGCGCCGGCACATCCCGCTCTGGGCTTTAAACGTGACCCCTCGCCTCGACTCGCCCTGCCCTGTGAAAATGTTGGTGCTTCTTGCTTTCATCATCGCCTTCCACATCACCTCTGCAGCCTTGCTGTTCATTGCCACCGTCGACAATGCCTGGTGGGTAGGAGATGAGTTTTTTGCAGATGTCTGGAGAATATGTACCAACAACACGAATTGCACAGTCATCAATGACAGCTTTCAAGAGTACTCCACGCTGCAGGCGGTCCAGGCCACCATGATCCTCTCCACCATTCTCTGCTGCATCGCCTTCTTCATCTTCGTGCTCCAGCTCTTCCGCCTGAAGCAGGGAGAGAGGTTTGTCCTAACCTCCATCATCCAGCTAATGTCATGTCTGTGTGTCATGATTGCGGCCTCCATTTATACAGACAGGCGTGAAGACATTCACGACAAAAACGCGAAATTCTATCCCGTGACCAGAGAAGGCAGCTACGGCTACTCCTACATCCTGGCGTGGGTGGCCTTCGCCTGCACCTTCATCAGCGGCATGATGTACCTGATACTGAGGAAGCGCAAATAGAGTTCCGGAGCTGGGTTGCTTCTGCTGCAGTACAGAATCCACATTCAGATAACCATTTTGTATATAATCATTATTTTTTGAGGTTTTTCTAGCAAACGTATTGTTTCCTTTAAAAGCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAAAAGAAAAAAAAAATCCAAAAGAGAGAAGAGTTTTTGCATTCTTGAGATCAGAGAATAGACTATGAAGGCTGGTATTCAGAACTGCTGCCCACTCAAAAGTCTCAACAAGACACAAGCAAAAATCCAGCAATGCTCAAATCCAAAAGCACTCGGCAGGACATTTCTTAACCATGGGGCTGTGATGGGAGGAGAGGAGAGGCTGGGAAAGCCGGGTCTCTGGGGACGTGCTTCCTATGGGTTTCAGCTGGCCCAAGCCCCTCCCGAATCTCTCTGCTAGTGGTGGGTGGAAGAGGGTGAGGTGGGGTATAGGAGAAGAATGACAGCTTCCTGAGAGGTTTCACCCAAGTTCCAAGTGAGAAGCAGGTGTAGTCCCTGGCATTCTGTCTGTATCCAAACCAGAGCCCAGCCATCCCTCCGGTATCGGGGTGGGTCAGAAAAAGTCTCACCTCAATTTGCCGACAGTGTCACCTGCTTGCCTTAGGAATGGTCATCCTTAACCTGCGTGCCAGATTTAGACTCGTCTTTAGGCAAAACCTACAGCGCCCCCCCCCTCACCCCAGACCTACAGAATCAGAGTCTTCAAGGGATGGGGCCAGGGAATCTGCATTTCTAACGCGCTCCCTGGGCAACGCTTCAGATGCGTTGAAGTTGGGGACCACGGTGCCTGGGCCAGGTCAGCAGAGCTGCCTCGTAAATGCTGGGGTATCGTCATGTGGAGATGGGGAGGTGAATGCAACCCCCACAGCAGGCCAAAACCTTGGCCTCCATCGCCACAGCTGTCTACATCTAGGGCCCCAAAACTCCATTCCTGAGCCATGTGAACTCATAGACACCTTCAGGGTGTGGGGTACAGCCTCCTTCCCATCTTATCCCAGAAGGCCTCTCCCTTCTTGTCCAGCCCTTCATGCTACACCTGGCTGGCCTCTCACCCCTATTTCTAGAGCCTCAGAGGACCCATCCACCATTCATTCATTCATTCATTCATTCATTCATTCATTCATTCATCAACATAAATCATAACTTGCATGCATGTGCCAGGCACAGGGGATACCCTCTAGAGACAATCTCCTCCTAGGGCTCATGGCCTAGTGGAGGAGACAGATTAAAACTTAATTAGAAAAACTGGCTGGGTACAGTGGCTCATGCTTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAAAATGGTAAAACCTGTCTCTACTAAAAATACAAAAATGAGCTGGGCGTGGTGGTGCATGCCTGTAATCCCAGCTATCAGGTGGCTGAGGCAGGAGAATCACTTGAAATGGGAGGTGGAGGTTGCAGTGAGCCGAGACCGTGCCACTGCACTCCAGCCTGGGTGACAGAGTGAGACTCCATCTCAAAAAAAGAAAAAAAAGAAAAGAAACTAATTACACACTGTGATGGAGGCTGCAAAGAACACCACTAAGAATTCAAAATCAGCTGGGTGCGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCAGGTGGATCACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACCGAAAATACAACAAAATTAGCCCGGTGTGGTGGCAGGTGCCTGTAATCCCAGCTACTTAGGAGGCTGAGGCAGGAGAATCGCTTGAAACTGGGAGGCGGAGGTCGCAGTGAGCCGAGATTCACCACTGCACTCCAGCCCAGGCGACAGTCTGAGACTCCGTCTCAAAAATAAAACGATTCAAAATCGAGGCCTGTGGCATGGTAGGGAGGCTGCTTTACGCGTGCCTATTATTAAATGCTCCTGGAGGCATTTAGGTATTTAGATCAGTCTAAATATAGCTCCATTCAGTTCGTGCAGATGACAGTTATTGGGCAGTACCTGTCTGTGTAACACCCAGAAAACATGTCTGTGGAGGGGCCCATGGTCCCGACAGTAAATGCGGTGAGAGGGTCCCATAGAGCTGGAGTTTTCAAGCTTTAGGGGTTCCCGTGCTGCTTGGGACAGGCTGATTCAGAGGGTCTGGGTGAATGATTTCCAGGTGATTTTAAGACTGTGCTGAGAAATAGGGCTTTTGGGGCCTTGTCCTTCAGGATCAAAGCATGATGCTGTGTGGCAATGCAGACCACCCAGGAACCATCCCAGGAGATAAGCTCTTTGCACCTCATTGTCTTTTTCTGCTTATGTTGGAGCAGGATGCTGGGGGCTGTCCTGGGATGGGGTGTGGGACCTCGTGCTATTTAAATACTTTTGCACTTGACCTTCTGCTGAGTGGAGTGGTGGTTTGCCATCAGCTCAGTTCCAGTGGAGCTGAAGAGACATCTGGTTTGAGTAGTTTTAGGGCCACCATGGATATCTCTTCAATGCAGGATTGGCTCTTTCCATCTGCTCTTTCATTCATTTGTTTTTGACAGATAGTATTAAATGTTTACCATGTTCCAGGCACTGTGTGAGGCTCTGAAAATACAGGGGTGAGCAAATCCAGATATCCTCCCTGCCATCATGAAGTTTGGAGTCTATGAGATAGGACCCCCTCCCTATGGAGAAGCCACCAATGCAGTACAGGGTGACCTGGGGCCAGAGACAGGACAAATGTCACCTCCTGCCTCCATGAGATACTCTCACTAGTCATATTGTGGGCAAGAATGTGGCTTACACCCCTAGGGTTAACAGGATGCTACCCAAGCTCATGGAGGAAGTTGAATCTTAAGTTCCCTTGAAACTTTCTACCTTGGTGGCTTTTCTATAATTTTCTTTTTTCTTTTTCTTTTTTTTTTTTTTTTTTGAGACTGAGTTTTGCTCTTGTTGCCCAGGCTGGAGTGCAGTGGCACCATCTTGGCTCACCGCAACCTCTGCCTCCTGGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATGTCCCACCATGCCCAGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCTCCATGTTGGTCAGGCTGGTTTCGAACTCCCAACCTCAGGTGATCCGCCCACCTCAGCCTTCCAAAGTGCTGGGATTACAGGCATGAGCCACTGCGTCTGGCCTTCTATAATTTTCTGGTAGTCACGATGGAAACAAACAAAACACCTTAGAACCAGAGATCGACCCCCTCAAGCAATACATCAATTCCCTTCACAAGAAACGTCGGGGCTACATGAGTATCTGTGTTGAATGCGGTCTGAAATGATCCTATGGATTTTCCCGGCTGGTTGCCACTGCTGTACAACATTCAGTGCCCACATCCACCTGTGCCATTAAGCTTTTTTGAGACATGAGAGATGCCTCTTCCCTGCTGTATGACATGCATTTGGGAAGTTGGAAAGAAATGACAAAATCAGGGAGAAAACATCCAAGCTTCTTACCTGTAGATAGAATCAGCCCTCACTTGGTGCTTATTACCAGTTATTCAAGAACAATAACAACAACAAAATTAGTAGACATCCAAGAAGCACATATTAGGACCAAAGATAGCATCAACTGTATTTGAAGGAACTGTAGTTTGCGCATTTTATGACATTTTTATAAAGTACTGTAATTCTTTCATTGAGGGGCTATGTGATGGAGACAGACTAACTCATTTTGTTATTTGCATTAAAATTATTTTGGGTCTCTGTTCAAATGAGTTTGGAGAATGCTTGACTTGTTGGTCTGTGTGAATGTGTATATATATATACCTGAATACAGGAACATCGGAGACCTATTCACTCCCACACACTCTGCTATAGTTTGCGTGCTTTTGTGGACACCCCTCATGAACAGGCTGGCGCTCTAGGACGCTCTGTGTTCACTGATGATGAAGAAACCTAGAACTCCAAGCCTGTTTGTAAACACACTAAACACAGTGGCCTAGATAGAAACTGTATCGTAGTTTAAAATCTGCCTCGCGGGATGTTACTAAACTCGCTAATAGTTTAAAGGTTACTTACAATAGAGCAAGTTGGACAATTTTGTGGTGTTGGGGAAATGTTAGGGCAAGGCCTAGAGGTTCATTTTGAATCTTGGTTTGTGACTTTAGGGTAGTTAGAAACTTTCTACTTAATGTACCTTTAAAATAGTCCATTTTCTATGTTTTGTATAATCTGAAACTGTACATGGAAAATAAAGTTTAAAACCAGATTGCCCAGAGCAAGACTCTAATGTTCCCAACGGTGATGACATCTAGGGCAGAATGCTGCCATTTTGAGGGGCAGGGGGTCAGCTGATTTCTCATCAAGATAATAATGTATGGTTTTTACACTAAGCAACTGATAAATGGACAATTTATCACTGGA
转录物:EMP2-001 ENST00000359543
cDNA序列
GGCGGGATCGGGGAAGGAGGGGCCCCGCCGCCTAGAGGGTGGAGGGAGGGCGCGCAGTCC............................................................CAGCCCAGAGCTTCAAAACAGCCCGGCGGCCTCGCCTCGCACCCCCAGCCAGTCCGTCGA............................................................EMP2断裂点TCCAGCTGCCAGCGCAGCCGCCAGCGCCGGCACATCCCGCTCTGGGCTTTAAACGTGACC............................................................CCTCGCCTCGACTCGCCCTGCCCTGTGAAAATGTTGGTGCTTCTTGCTTTCATCATCGCC..............................-M--L--V--L--L--A--F--I--I--A-TTCCACATCACCTCTGCAGCCTTGCTGTTCATTGCCACCGTCGACAATGCCTGGTGGGTA-F--H--I--T--S--A--A--L--L--F--I--A--T--V--D--N--A--W--W--V-GGAGATGAGTTTTTTGCAGATGTCTGGAGAATATGTACCAACAACACGAATTGCACAGTC-G--D--E--F--F--A--D--V--W--R--I--C--T--N--N--T--N--C--T--V-ATCAATGACAGCTTTCAAGAGTACTCCACGCTGCAGGCGGTCCAGGCCACCATGATCCTC-I--N--D--S--F--Q--E--Y--S--T--L--Q--A--V--Q--A--T--M--I--L-TCCACCATTCTCTGCTGCATCGCCTTCTTCATCTTCGTGCTCCAGCTCTTCCGCCTGAAG-S--T--I--L--C--C--I--A--F--F--I--F--V--L--Q--L--F--R--L--K-CAGGGAGAGAGGTTTGTCCTAACCTCCATCATCCAGCTAATGTCATGTCTGTGTGTCATG-Q--G--E--R--F--V--L--T--S--I--I--Q--L--M--S--C--L--C--V--M-ATTGCGGCCTCCATTTATACAGACAGGCGTGAAGACATTCACGACAAAAACGCGAAATTC-I--A--A--S--I--Y--T--D--R--R--E--D--I--H--D--K--N--A--K--F-TATCCCGTGACCAGAGAAGGCAGCTACGGCTACTCCTACATCCTGGCGTGGGTGGCCTTC-Y--P--V--T--R--E--G--S--Y--G--Y--S--Y--I--L--A--W--V--A--F-GCCTGCACCTTCATCAGCGGCATGATGTACCTGATACTGAGGAAGCGCAAATAGAGTTCC-A--C--T--F--I--S--G--M--M--Y--L--I--L--R--K--R--K--*-......GGAGCTGGGTTGCTTCTGCTGCAGTACAGAATCCACATTCAGATAACCATTTTGTATATA............................................................ATCATTATTTTTTGAGGTTTTTCTAGCAAACGTATTGTTTCCTTTAAAAGCCAAAAAAAA............................................................AAAAAAAAAAAAAAAAAAAAGAAAAAAGAAAAAAAAAATCCAAAAGAGAGAAGAGTTTTT............................................................GCATTCTTGAGATCAGAGAATAGACTATGAAGGCTGGTATTCAGAACTGCTGCCCACTCA............................................................AAAGTCTCAACAAGACACAAGCAAAAATCCAGCAATGCTCAAATCCAAAAGCACTCGGCA............................................................GGACATTTCTTAACCATGGGGCTGTGATGGGAGGAGAGGAGAGGCTGGGAAAGCCGGGTC............................................................TCTGGGGACGTGCTTCCTATGGGTTTCAGCTGGCCCAAGCCCCTCCCGAATCTCTCTGCT............................................................AGTGGTGGGTGGAAGAGGGTGAGGTGGGGTATAGGAGAAGAATGACAGCTTCCTGAGAGG............................................................TTTCACCCAAGTTCCAAGTGAGAAGCAGGTGTAGTCCCTGGCATTCTGTCTGTATCCAAA............................................................CCAGAGCCCAGCCATCCCTCCGGTATCGGGGTGGGTCAGAAAAAGTCTCACCTCAATTTG............................................................CCGACAGTGTCACCTGCTTGCCTTAGGAATGGTCATCCTTAACCTGCGTGCCAGATTTAG............................................................ACTCGTCTTTAGGCAAAACCTACAGCGCCCCCCCCCTCACCCCAGACCTACAGAATCAGA............................................................GTCTTCAAGGGATGGGGCCAGGGAATCTGCATTTCTAACGCGCTCCCTGGGCAACGCTTC............................................................AGATGCGTTGAAGTTGGGGACCACGGTGCCTGGGCCAGGTCAGCAGAGCTGCCTCGTAAA............................................................TGCTGGGGTATCGTCATGTGGAGATGGGGAGGTGAATGCAACCCCCACAGCAGGCCAAAA............................................................CCTTGGCCTCCATCGCCACAGCTGTCTACATCTAGGGCCCCAAAACTCCATTCCTGAGCC............................................................ATGTGAACTCATAGACACCTTCAGGGTGTGGGGTACAGCCTCCTTCCCATCTTATCCCAG............................................................AAGGCCTCTCCCTTCTTGTCCAGCCCTTCATGCTACACCTGGCTGGCCTCTCACCCCTAT............................................................TTCTAGAGCCTCAGAGGACCCATCCACCATTCATTCATTCATTCATTCATTCATTCATTC............................................................ATTCATTCATCAACATAAATCATAACTTGCATGCATGTGCCAGGCACAGGGGATACCCTC............................................................TAGAGACAATCTCCTCCTAGGGCTCATGGCCTAGTGGAGGAGACAGATTAAAACTTAATT............................................................AGAAAAACTGGCTGGGTACAGTGGCTCATGCTTGTAATCCCAGCACTTTGGGAGGCTGAG............................................................GCGGGTGGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAAAATGGTAAAACCTG............................................................TCTCTACTAAAAATACAAAAATGAGCTGGGCGTGGTGGTGCATGCCTGTAATCCCAGCTA............................................................TCAGGTGGCTGAGGCAGGAGAATCACTTGAAATGGGAGGTGGAGGTTGCAGTGAGCCGAG............................................................ACCGTGCCACTGCACTCCAGCCTGGGTGACAGAGTGAGACTCCATCTCAAAAAAAGAAAA............................................................AAAAGAAAAGAAACTAATTACACACTGTGATGGAGGCTGCAAAGAACACCACTAAGAATT............................................................CAAAATCAGCTGGGTGCGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGC............................................................AGGTGGATCACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCT............................................................CTACCGAAAATACAACAAAATTAGCCCGGTGTGGTGGCAGGTGCCTGTAATCCCAGCTAC............................................................TTAGGAGGCTGAGGCAGGAGAATCGCTTGAAACTGGGAGGCGGAGGTCGCAGTGAGCCGA............................................................GATTCACCACTGCACTCCAGCCCAGGCGACAGTCTGAGACTCCGTCTCAAAAATAAAACG............................................................ATTCAAAATCGAGGCCTGTGGCATGGTAGGGAGGCTGCTTTACGCGTGCCTATTATTAAA............................................................TGCTCCTGGAGGCATTTAGGTATTTAGATCAGTCTAAATATAGCTCCATTCAGTTCGTGC............................................................AGATGACAGTTATTGGGCAGTACCTGTCTGTGTAACACCCAGAAAACATGTCTGTGGAGG............................................................GGCCCATGGTCCCGACAGTAAATGCGGTGAGAGGGTCCCATAGAGCTGGAGTTTTCAAGC............................................................TTTAGGGGTTCCCGTGCTGCTTGGGACAGGCTGATTCAGAGGGTCTGGGTGAATGATTTC............................................................CAGGTGATTTTAAGACTGTGCTGAGAAATAGGGCTTTTGGGGCCTTGTCCTTCAGGATCA............................................................AAGCATGATGCTGTGTGGCAATGCAGACCACCCAGGAACCATCCCAGGAGATAAGCTCTT............................................................TGCACCTCATTGTCTTTTTCTGCTTATGTTGGAGCAGGATGCTGGGGGCTGTCCTGGGAT............................................................GGGGTGTGGGACCTCGTGCTATTTAAATACTTTTGCACTTGACCTTCTGCTGAGTGGAGT...........................................................GGTGGTTTGCCATCAGCTCAGTTCCAGTGGAGCTGAAGAGACATCTGGTTTGAGTAGTTT............................................................TAGGGCCACCATGGATATCTCTTCAATGCAGGATTGGCTCTTTCCATCTGCTCTTTCATT............................................................CATTTGTTTTTGACAGATAGTATTAAATGTTTACCATGTTCCAGGCACTGTGTGAGGCTC............................................................TGAAAATACAGGGGTGAGCAAATCCAGATATCCTCCCTGCCATCATGAAGTTTGGAGTCT............................................................ATGAGATAGGACCCCCTCCCTATGGAGAAGCCACCAATGCAGTACAGGGTGACCTGGGGC............................................................CAGAGACAGGACAAATGTCACCTCCTGCCTCCATGAGATACTCTCACTAGTCATATTGTG............................................................GGCAAGAATGTGGCTTACACCCCTAGGGTTAACAGGATGCTACCCAAGCTCATGGAGGAA............................................................GTTGAATCTTAAGTTCCCTTGAAACTTTCTACCTTGGTGGCTTTTCTATAATTTTCTTTT............................................................TTCTTTTTCTTTTTTTTTTTTTTTTTTGAGACTGAGTTTTGCTCTTGTTGCCCAGGCTGG............................................................AGTGCAGTGGCACCATCTTGGCTCACCGCAACCTCTGCCTCCTGGGTTCAAGTGATTCTC............................................................CTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATGTCCCACCATGCCCAGCTAATTT............................................................TTGTATTTTTAGTAGAGATGGGGTTTCTCCATGTTGGTCAGGCTGGTTTCGAACTCCCAA............................................................CCTCAGGTGATCCGCCCACCTCAGCCTTCCAAAGTGCTGGGATTACAGGCATGAGCCACT............................................................GCGTCTGGCCTTCTATAATTTTCTGGTAGTCACGATGGAAACAAACAAAACACCTTAGAA............................................................CCAGAGATCGACCCCCTCAAGCAATACATCAATTCCCTTCACAAGAAACGTCGGGGCTAC............................................................ATGAGTATCTGTGTTGAATGCGGTCTGAAATGATCCTATGGATTTTCCCGGCTGGTTGCC............................................................ACTGCTGTACAACATTCAGTGCCCACATCCACCTGTGCCATTAAGCTTTTTTGAGACATG............................................................AGAGATGCCTCTTCCCTGCTGTATGACATGCATTTGGGAAGTTGGAAAGAAATGACAAAA............................................................TCAGGGAGAAAACATCCAAGCTTCTTACCTGTAGATAGAATCAGCCCTCACTTGGTGCTT............................................................ATTACCAGTTATTCAAGAACAATAACAACAACAAAATTAGTAGACATCCAAGAAGCACAT............................................................ATTAGGACCAAAGATAGCATCAACTGTATTTGAAGGAACTGTAGTTTGCGCATTTTATGA............................................................CATTTTTATAAAGTACTGTAATTCTTTCATTGAGGGGCTATGTGATGGAGACAGACTAAC............................................................TCATTTTGTTATTTGCATTAAAATTATTTTGGGTCTCTGTTCAAATGAGTTTGGAGAATG............................................................CTTGACTTGTTGGTCTGTGTGAATGTGTATATATATATACCTGAATACAGGAACATCGGA............................................................GACCTATTCACTCCCACACACTCTGCTATAGTTTGCGTGCTTTTGTGGACACCCCTCATG............................................................AACAGGCTGGCGCTCTAGGACGCTCTGTGTTCACTGATGATGAAGAAACCTAGAACTCCA............................................................AGCCTGTTTGTAAACACACTAAACACAGTGGCCTAGATAGAAACTGTATCGTAGTTTAAA............................................................ATCTGCCTCGCGGGATGTTACTAAACTCGCTAATAGTTTAAAGGTTACTTACAATAGAGC............................................................AAGTTGGACAATTTTGTGGTGTTGGGGAAATGTTAGGGCAAGGCCTAGAGGTTCATTTTG............................................................AATCTTGGTTTGTGACTTTAGGGTAGTTAGAAACTTTCTACTTAATGTACCTTTAAAATA............................................................GTCCATTTTCTATGTTTTGTATAATCTGAAACTGTACATGGAAAATAAAGTTTAAAACCA............................................................GATTGCCCAGAGCAAGACTCTAATGTTCCCAACGGTGATGACATCTAGGGCAGAATGCTG.............................................................CCATTTTGAGGGGCAGGGGGTCAGCTGATTTCTCATCAAGATAATAATGTATGGTTTTTA............................................................CACTAAGCAACTGATAAATGGACAATTTATCACTGGA.....................................
转录物:EMP2-001 ENST00000359543
蛋白质序列(SEQ I D N0.:96)
MLVLLAFIIAFHITSAALLFIATVDNAWWVGDEFFADVWRICTNNTNCTVINDSFQEYSTLQAVQATMILSTILCCIAFFIFVLQLFRLKQGERFVLTSIIQLMSCLCVMIAASIYTDRREDIHDKNAKFYPVTREGSYGYSYILAWVAFACTFISGMMYLILRKRK
CLEC16A-EMP2融合序列外显子9到外显子2UTR
cDNA序列(SEQ ID NO.:97),EMP2加下划线。
ATGTTTGGCCGCTCGCGGAGCTGGGTGGGCGGGGGCCATGGCAAGACTTCCCGCAACATCCACTCCTTGGACCACCTCAAGTATCTGTACCACGTTTTGACCAAAAACACCACAGTCACAGAACAGAACCGGAACCTGCTAGTGGAGACCATCCGTTCCATCACTGAGATCCTGATCTGGGGAGATCAAAATGACAGCTCTGTATTTGACTTCTTCCTGGAGAAGAATATGTTTGTTTTCTTCTTGAACATCTTGCGGCAAAAGTCGGGCCGTTACGTGTGCGTTCAGCTGCTGCAGACCTTGAACATCCTCTTTGAGAACATCAGTCACGAGACCTCACTTTATTATTTGCTCTCAAATAACTACGTAAATTCTATCATCGTTCATAAATTTGACTTTTCTGATGAGGAGATTATGGCCTATTATATATCGTTCCTGAAAACACTTTCGTTAAAACTCAACAACCACACTGTCCATTTCTTTTATAATGAGCACACCAATGACTTTGCCCTGTACACAGAAGCCATCAAGTTTTTCAACCACCCTGAAAGCATGGTTAGAATTGCTGTAAGAACCATAACTTTGAATGTCTATAAAGTGTCATTGGATAACCAGGCCATGCTGCACTACATCCGAGATAAAACTGCTGTTCCTTACTTCTCCAATTTGGTCTGGTTCATTGGGAGCCATGTGATCGAACTCGATGACTGCGTGCAGACTGATGAGGAGCATCGGAATCGGGGTAAACTGAGTGATCTGGTGGCAGAGCACCTAGACCACCTGCACTATCTCAATGACATCCTGATCATCAACTGTGAGTTCCTCAACGATGTGCTCACTGACCACCTGCTCAACAGGCTCTTCCTGCCCCTCTACGTGTACTCACTGGAGAACCAGGACAAGGGAGGAGAACGGCCGAAAATTAGCCTGCCGGTGTCTCTTTATCTTCTGTCACAGCACATCCCGCTCTGGGCTTTAAACGTGACCCCTCGCCTCGACTCGCCCTGCCCTGTGAAAATGTTGGTGCTTCTTGCTTTCATCATCGCCTTCCACATCACCTCTGCAGCCTTGCTGTTCATTGCCACCGTCGACAATGCCTGGTGGGTAGGAGATGAGTTTTTTGCAGATGTCTGGAGAATATGTACCAACAACACGAATTGCACAGTCATCAATGACAGCTTTCAAGAGTACTCCACGCTGCAGGCGGTCCAGGCCACCATGATCCTCTCCACCATTCTCTGCTGCATCGCCTTCTTCATCTTCGTGCTCCAGCTCTTCCGCCTGAAGCAGGGAGAGAGGTTTGTCCTAACCTCCATCATCCAGCTAATGTCATGTCTGTGTGTCATGATTGCGGCCTCCATTTATACAGACAGGCGTGAAGACATTCACGACAAAAACGCGAAATTCTATCCCGTGACCAGAGAAGGCAGCTACGGCTACTCCTACATCCTGGCGTGGGTGGCCTTCGCCTGCACCTTCATCAGCGGCATGATGTACCTGATACTGAGGAAGCGCAAATAG
蛋白质序列(SEQ ID NO.:98),EMP2加下划线。
MFGRSRSWVGGGHGKTSRNIHSLDHLKYLYHVLTKNTTVTEQNRNLLVETIRSITEILIWGDQNDSSVFDFFLEKNMFVFFLNILRQKSGRYVCVQLLQTLNILFENISHETSLYYLLSNNYVNSIIVHKFDFSDEEIMAYYISFLKTLSLKLNNHTVHFFYNEHTNDFALYTEAIKFFNHPESMVRIAVRTITLNVYKVSLDNQAMLHYIRDKTAVPYFSNLVWFIGSHVIELDDCVQTDEEHRNRGKLSDLVAEHLDHLHYLNDILIINCEFLNDVLTDHLLNRLFLPLYVYSLENQDKGGERPKISLPVSLYLLSQHIPLWALNVTPRLDSPCPVKMLVLLAFIIAFHITSAALLFIATVDNAWWVGDEFFADVWRICTNNTNCTVINDSFQEYSTLQAVQATMILSTILCCIAFFIFVLQLFRLKQGERFVLTSIIQLMSCLCVMIAASIYTDRREDIHDKNAKFYPVTREGSYGYSYILAWVAFACTFISGMMYLILRKRK
蛋白质结构域
具有506个残基的查询序列内的结构域
名称 开端 末端 跨膜区 341 363 跨膜区 400 422 跨膜区 434 456 跨膜区 480 502
CLEC16A-EMP2融合序列外显子4到外显子2UTR
cDNA序列(SEQ ID NO.:99),EMP2加下划线。
ATGTTTGGCCGCTCGCGGAGCTGGGTGGGCGGGGGCCATGGCAAGACTTCCCGCAACATCCACTCCTTGGACCACCTCAAGTATCTGTACCACGTTTTGACCAAAAACACCACAGTCACAGAACAGAACCGGAACCTGCTAGTGGAGACCATCCGTTCCATCACTGAGATCCTGATCTGGGGAGATCAAAATGACAGCTCTGTATTTGACTTCTTCCTGGAGAAGAATATGTTTGTTTTCTTCTTGAACATCTTGCGGCAAAAGTCGGGCCGTTACGTGTGCGTTCAGCTGCTGCAGACCTTGAACATCCTCTTTGAGAACATCAGTCACGAGACCTCACTTTATTATTTGCTCTCAAATAACTACGTAAATTCTATCATCGTTCATAAATTTGACTTTTCTGATGAGGAGATTATGGCCTATTATATATCGTTCCTGAAAACACTTTCGTTAAAACTCAACAACCACACTGTCCATTTCTTTTATAATGAGCACATCCCGCTCTGGGCTTTAAACGTGACCCCTCGCCTCGACTCGCCCTGCCCTGTGAAAATGTTGGTGCTTCTTGCTTTCATCATCGCCTTCCACATCACCTCTGCAGCCTTGCTGTTCATTGCCACCGTCGACAATGCCTGGTGGGTAGGAGATGAGTTTTTTGCAGATGTCTGGAGAATATGTACCAACAACACGAATTGCACAGTCATCAATGACAGCTTTCAAGAGTACTCCACGCTGCAGGCGGTCCAGGCCACCATGATCCTCTCCACCATTCTCTGCTGCATCGCCTTCTTCATCTTCGTGCTCCAGCTCTTCCGCCTGAAGCAGGGAGAGAGGTTTGTCCTAACCTCCATCATCCAGCTAATGTCATGTCTGTGTGTCATGATTGCGGCCTCCATTTATACAGACAGGCGTGAAGACATTCACGACAAAAACGCGAAATTCTATCCCGTGACCAGAGAAGGCAGCTACGGCTACTCCTACATCCTGGCGTGGGTGGCCTTCGCCTGCACCTTCATCAGCGGCATGATGTACCTGATACTGAGGAAGCGCAAATAG
蛋白质序列(SEQ ID NO.:100)
M F G R S R S W V G G G H G K T S R N I H S L D H L K Y L Y H V L T K N T T V T E Q N R N L L V E T I R S I T E I L I W G D Q N D S S V F D F F L E K N M F V F F L N I L R Q K S G R Y V C V Q L L Q T L N I L F E N I S H E T S L Y Y L L S N N Y V N S I I V H K F D F S D E E I M A Y Y I S F L K T L S L K L N N H T V H F F Y N E H I P L W A L N V T P R L D S P C P V K M L V L L A F I I A F H I T S A A L L F I A T V D N A W W V G D E F F A D V W R I C T N N T N C T V I N D S F Q E Y S T L Q A V Q A T M I L S T I L C C I A F F I F V L Q L F R L K Q G E R F V L T S I I Q L M S C L C V M I A A S I Y T D R R E D I H D K N A K F Y P V T R E G S Y G Y S Y I L A W V A F A C T F I S G M M Y L I L R K R K终止密码子
蛋白质结构域
具有351个残基的查询序列内的结构域
CLEC16A-EMP2融合序列外显子10到外显子2UTR
cDNA序列(SEQ ID NO.:101),EMP2加下划线。
ATGTTTGGCCGCTCGCGGAGCTGGGTGGGCGGGGGCCATGGCAAGACTTCCCGCAACATCCACTCCTTGGACCACCTCAAGTATCTGTACCACGTTTTGACCAAAAACACCACAGTCACAGAACAGAACCGGAACCTGCTAGTGGAGACCATCCGTTCCATCACTGAGATCCTGATCTGGGGAGATCAAAATGACAGCTCTGTATTTGACTTCTTCCTGGAGAAGAATATGTTTGTTTTCTTCTTGAACATCTTGCGGCAAAAGTCGGGCCGTTACGTGTGCGTTCAGCTGCTGCAGACCTTGAACATCCTCTTTGAGAACATCAGTCACGAGACCTCACTTTATTATTTGCTCTCAAATAACTACGTAAATTCTATCATCGTTCATAAATTTGACTTTTCTGATGAGGAGATTATGGCCTATTATATATCGTTCCTGAAAACACTTTCGTTAAAACTCAACAACCACACTGTCCATTTCTTTTATAATGAGCACACCAATGACTTTGCCCTGTACACAGAAGCCATCAAGTTTTTCAACCACCCTGAAAGCATGGTTAGAATTGCTGTAAGAACCATAACTTTGAATGTCTATAAAGTGTCATTGGATAACCAGGCCATGCTGCACTACATCCGAGATAAAACTGCTGTTCCTTACTTCTCCAATTTGGTCTGGTTCATTGGGAGCCATGTGATCGAACTCGATGACTGCGTGCAGACTGATGAGGAGCATCGGAATCGGGGTAAACTGAGTGATCTGGTGGCAGAGCACCTAGACCACCTGCACTATCTCAATGACATCCTGATCATCAACTGTGAGTTCCTCAACGATGTGCTCACTGACCACCTGCTCAACAGGCTCTTCCTGCCCCTCTACGTGTACTCACTGGAGAACCAGGACAAGGGAGGAGAACGGCCGAAAATTAGCCTGCCGGTGTCTCTTTATCTTCTGTCACAGGTCTTCTTAATTATACATCATGCACCGCTGGTGAACTCGTTAGCTGAAGTCATTCTGAATGGTGATCTGTCTGAGATGTACGCTAAGACTGAACAGGATATTCAGAGAAGTTCTCACATCCCGCTCTGGGCTTTAAACGTGACCCCTCGCCTCGACTCGCCCTGCCCTGTGAAAATGTTGGTGCTTCTTGCTTTCATCATCGCCTTCCACATCACCTCTGCAGCCTTGCTGTTCATTGCCACCGTCGACAATGCCTGGTGGGTAGGAGATGAGTTTTTTGCAGATGTCTGGAGAATATGTACCAACAACACGAATTGCACAGTCATCAATGACAGCTTTCAAGAGTACTCCACGCTGCAGGCGGTCCAGGCCACCATGATCCTCTCCACCATTCTCTGCTGCATCGCCTTCTTCATCTTCGTGCTCCAGCTCTTCCGCCTGAAGCAGGGAGAGAGGTTTGTCCTAACCTCCATCATCCAGCTAATGTCATGTCTGTGTGTCATGATTGCGGCCTCCATTTATACAGACAGGCGTGAAGACATTCACGACAAAAACGCGAAATTCTATCCCGTGACCAGAGAAGGCAGCTACGGCTACTCCTACATCCTGGCGTGGGTGGCCTTCGCCTGCACCTTCATCAGCGGCATGATGTACCTGATACTGAGGAAGCGCAAATAG
蛋白质序列(SEQ ID NO.:102)
M F G R S R S W V G G G H G K T S R N I H S L D H L K Y L Y H V L T K N T T V T E Q N R N L L V E T I R S I T E I L I W G D Q N D S S V F D F F L E K N M F V F F L N I L R Q K S G R Y V C V Q L L Q T L N I L F E N I S H E T S L Y Y L L S N N Y V N S I I V H K F D F S D E E I M A Y Y I S F L K T L S L K L N N H T V H F F Y N E H T N D F A L Y T E A I K F F N H P E S M V R I A V R T I T L N V Y K V S L D N Q A M L H Y I R D K T A V P Y F S N L V W F I G S H V I E L D D C V Q T D E E H R N R G K L S D L V A E H L D H L H Y L N D I L I I N C E F L N D V L T D H L L N R L F L P L Y V Y S L E N Q D K G G E R P K I S L P V S L Y L L S Q V F L I I H H A P L V N S L A E V I L N G D L S E M Y A K T E Q D I Q R S S H I P L W A L N V T P R L D S P C P V K M L V L L A F I I A F H I T S A A L L F I A T V D N A W W V G D E F F A D V W R I C T N N T N C T V I N D S F Q E Y S T L Q A V Q A T M I L S T I L C C I A F F I F V L Q L F R L K Q G E R F V L T S I I Q L M S C L C V M I A A S I Y T D R R E D I H D K N A K F Y P V T R E G S Y G Y S Y I L A W V A F A C T F I S G M M Y L I L R K R K终止密码子
蛋白质结构域
具有544个残基的查询序列内的结构域
融合基因#2:CLDN18-ARHGAP26
CLDN18
发现样品中的由基因组PCR确认的断裂点-chr3:137,752,065
外显子5中由RT-PCR确认的RNA融合点-chr3:137,749,947
ARHGAP26
发现样品中的由基因组PCR确认的断裂点-chr5:142318274
外显子12中由RT-PCR确认的RNA融合点-chr5:142393645
转录物:CLDN18-001 ENST00000343735
cDNA序列(SEQ ID NO.:103),融合基因的编码部分加阴影。
AACCGCCTCCATTACATGGTCCGTTCCTGACGTGTACACCAGCCTCTCAGAGAAAACTCCATCCCTACACTCGGTAGTCTCAGAATTGCGCTGTCCACTTGTCGTGTGGCTCTGTGTCGACACTGTGCGCCACCATGGCCGTGACTGCCTGTCAGGGCTTGGGGTTCGTGGTTTCACTGATTGGGATTGCGGGCATCATTGCTGCCACCTGCATGGACCAGTGGAGCACCCAAGACTTGTACAACAACCCCGTAACAGCTGTTTTCAACTACCAGGGGCTGTGGCGCTCCTGTGTCCGAGAGAGCTCTGGCTTCACCGAGTGCCGGGGCTACTTCACCCTGCTGGGGCTGCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCTGGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTGCCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTGTCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGGGATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCCTCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAAGCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTTTGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATCCTTCCAAGCACGACTATGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCACCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCTCGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACAGCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACTTTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTCTTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAAGAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAAACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTTACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTGTTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGTCGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAAAATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAGGAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCCATGAAAAGCTCACTGATCCTAATTCTTTCCCTTTGAGGTCTCTATGGCTCTGATTGTACATGATAGTAAGTGTAAGCCATGTAAAAAGTAAATAATGTCTGGGCACAGTGGCTCACGCCTGTAATCCTAGCACTTTGGGAGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGTTCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCATGGTGGCCTACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGTTGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAAAATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAAAAAAAAAAAAAAGTTGAGCCTGAATTAAATGTAATGTTTCCAAGTGACAGGTATCCACATTTGCATGGTTACAAGCCACTGCCAGTTAGCAGTAGCACTTTCCTGGCACTGTGGTCGGTTTTGTTTTGTTTTGCTTTGTTTAGAGACGGGGTCTCACTTTCCAGGCTGGCCTCAAACTCCTGCACTCAAGCAATTCTTCTACCCTGGCCTCCCAAGTAGCTGGAATTACAGGTGTGCGCCATCACAACTAGCTGGTGGTCAGTTTTGTTACTCTGAGAGCTGTTCACTTCTCTGAATTCACCTAGAGTGGTTGGACCATCAGATGTTTGGGCAAAACTGAAAGCTCTTTGCAACCACACACCTTCCCTGAGCTTACATCACTGCCCTTTTGAGCAGAAAGTCTAAATTCCTTCCAAGACAGTAGAATTCCATCCCAGTACCAAAGCCAGATAGGCCCCCTAGGAAACTGAGGTAAGAGCAGTCTCTAAAAACTACCCACAGCAGCATTGGTGCAGGGGAACTTGGCCATTAGGTTATTATTTGAGAGGAAAGTCCTCACATCAATAGTACATATGAAAGTGACCTCCAAGGGGATTGGTGAATACTCATAAGGATCTTCAGGCTGAACAGACTATGTCTGGGGAAAGAACGGATTATGCCCCATTAAATAACAAGTTGTGTTCAAGAGTCAGAGCAGTGAGCTCAGAGGCCCTTCTCACTGAGACAGCAACATTTAAACCAAACCAGAGGAAGTATTTGTGGAACTCACTGCCTCAGTTTGGGTAAAGGATGAGCAGACAAGTCAACTAAAGAAAAAAGAAAAGCAAGGAGGAGGGTTGAGCAATCTAGAGCATGGAGTTTGTTAAGTGCTCTCTGGATTTGAGTTGAAGAGCATCCATTTGAGTTGAAGGCCACAGGGCACAATGAGCTCTCCCTTCTACCACCAGAAAGTCCCTGGTCAGGTCTCAGGTAGTGCGGTGTGGCTCAGCTGGGTTTTTAATTAGCGCATTCTCTATCCAACATTTAATTGTTTGAAAGCCTCCATATAGTTAGATTGTGCTTTGTAATTTTGTTGTTGTTGCTCTATCTTATTGTATATGCATTGAGTATTAACCTGAATGTTTTGTTACTTAAATATTAAAAACACTGTTATCCTACAGTT
转录物:CLDN18-001 ENST00000343735
蛋白质序列(SEQ ID NO.:104),融合基因的编码部分加阴影。
MAVTACQGLGFVVSLIGIAGIIAATCMDQWSTQDLYNNPVTAVFNYQGLWRSCVRESSGFTECRGYFTLLGLPAMLQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGLCAIAGVSVFANMLVTNFWMSTANMYTGMGGMVQTVQTRYTFGAALFVGWVAGGLTLIGGVMMCIACRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDYV
转录物:ARHGAP26-001 ENST00000274498
cDNA序列(SEQ ID NO.:105),融合基因的编码部分加阴影。
GGCGGGGCGGCCGAGGCTGCTGTGAGAGGGCGCTCGAGGCTGCCGAGAGCTAGCTAGCGAAGGAGGCGGGGAGGCGGCGTCTGCACTCGCTCGCCCGCTCGCTCGCTTCCCGGCGCCGCTGCGGGTCCGCGCTGCGTTTCCTGCTCGCGATCCGCTCCGTTGCCCGCGCCCGGAACAGCAGCACCTCGGCCGGGTCCGAGCTCGGTTCGGGAGTCTTGCGCGCCGGCGGACACCGCGCGCGGAGTGAGCCAGCGCCACACCTGTGGAGCCGGCGGCCGTCGGGGGAGCCGGCCGGGGTCCCGCCGCGTGAGTGCTCTGGGCGGCGGGCGGCCCGGGCCCCGGCGGAGGCGCGCCCCCCGGCTGGGCGCCGCGCGCACCATGGGGCTCCCAGCGCTCGAGTTCAGCGACTGCTGCCTCGATAGTCCGCACTTCCGAGAGACGCTCAAGTCGCACGAAGCAGAGCTGGACAAGACCAACAAATTCATCAAGGAGCTCATCAAGGACGGGAAGTCACTCATAAGCGCGCTCAAGAATTTGTCTTCAGCGAAGCGGAAGTTTGCAGATTCCTTAAATGAATTTAAATTTCAGTGCATAGGAGATGCAGAAACAGATGATGAGATGTGTATAGCAAGATCTTTGCAGGAGTTTGCCACTGTCCTCAGGAATCTTGAAGATGAACGGATACGGATGATTGAGAATGCCAGCGAGGTGCTCATCACTCCCTTGGAGAAGTTTCGAAAGGAACAGATCGGGGCTGCCAAGGAAGCCAAAAAGAAGTATGACAAAGAGACAGAAAAGTATTGTGGCATCTTAGAAAAACACTTGAATTTGTCTTCCAAAAAGAAAGAATCTCAGCTTCAGGAGGCAGACAGCCAAGTGGACCTGGTCCGGCAGCATTTCTATGAAGTATCCCTGGAATATGTCTTCAAGGTGCAGGAAGTCCAAGAGAGAAAGATGTTTGAGTTTGTGGAGCCTCTGCTGGCCTTCCTGCAAGGACTCTTCACTTTCTATCACCATGGTTACGAACTGGCCAAGGATTTCGGGGACTTCAAGACACAGTTAACCATTAGCATACAGAACACAAGAAATCGCTTTGAAGGCACTAGATCAGAAGTGGAATCACTGATGAAAAAGATGAAGGAGAATCCCCTTGAGCACAAGACCATCAGTCCCTACACCATGGAGGGATACCTCTACGTGCAGGAGAAACGTCACTTTGGAACTTCTTGGGTGAAGCACTACTGTACATATCAACGGGATTCCAAACAAATCACCATGGTACCATTTGACCAAAAGTCAGGAGGAAAAGGGGGAGAAGATGAATCAGTTATCCTCAAATCCTGCACACGGCGGAAAACAGACTCCATTGAGAAGAGGTTTTGCTTTGATGTGGAAGCAGTAGACAGGCCAGGGGTTATCACCATGCAAGCTTTGTCGGAAGAGGACCGGAGGCTCTGGATGGAAGCCATGGATGGCCGGGAACCTGTCTACAACTCGAACAAAGACAGCCAGAGTGAAGGGACTGCGCAGTTGGACAGCATTGGCTTCAGCATAATCAGGAAATGCATCCATGCTGTGGAAACCAGAGGGATCAACGAGCAAGGGCTGTATCGAATTGTGGGTGTCAACTCCAGAGTGCAGAAGTTGCTGAGTGTCCTGATGGACCCCAAGACTGCTTCTGAGACAGAAACAGATATCTGTGCTGAATGGGAGATAAAGACCATCACTAGTGCTCTGAAGACCTACCTAAGAATGCTTCCAGGACCACTCATGATGTACCAGTTTCAAAGAAGTTTCATCAAAGCAGCAAAACTGGAGAACCAGGAGTCTCGGGTCTCTGAAATCCACAGCCTTGTTCATCGGCTCCCAGAGAAAAATCGGCAGATGTTACAGCTGCTCATGAACCACTTGGCAAATGTTGCTAACAACCACAAGCAGAATTTGATGACGGTGGCAAACCTTGGTGTGGTGTTTGGACCCACTCTGCTGAGGCCTCAGGAAGAAACAGTAGCAGCCATCATGGACATCAAATTTCAGAACATTGTCATTGAGATCCTAATAGAAAACCACGAAAAGATATTTAACACCGTGCCCGATATGCCTCTCACCAATGCCCAGCTGCACCTGTCTCGGAAGAAGAGCAGTGACTCCAAGCCCCCGTCCTGCAGCGAGAGGCCCCTGACGCTCTTCCACACCGTTCAGTCAACAGAGAAACAGGAACAAAGGAACAGCATCATCAACTCCAGTTTGGAATCTGTCTCATCAAATCCAAACAGCATCCTTAATTCCAGCAGCAGCTTACAGCCCAACATGAACTCCAGTGACCCAGACCTGGCTGTGGTCAAACCCACCCGGCCCAACTCACTCCCCCCGAATCCAAGCCCAACTTCACCCCTCTCGCCATCTTGGCCCATGTTCTCGGCGCCATCCAGCCCTATGCCCACCTCATCCACGTCCAGCGACTCATCCCCCGTCAGGTCTGTTGCAGGGTTTGTTTGGTTTTCTGTTGCTGCCGTTGTTCTCTCATTGGCTCGGTCCTCTCTTCATGCAGTGTTCAGCCTCCTCGTCAACTTTGTTCCCTGCCATCCAAACCTGCACTTGCTTTTTGACAGGCCAGAAGAAGCGGTACATGAAGACTCCAGCACACCGTTCCGGAAGGCAAAAGCCTTGTATGCCTGCAAAGCTGAACATGACTCAGAACTTTCGTTCACAGCAGGCACGGTCTTCGATAACGTTCACCCATCTCAGGAGCCTGGCTGGTTGGAGGGGACTCTGAACGGAAAGACTGGCCTCATCCCTGAGAATTACGTGGAGTTCCTCTAACCGTGGGCCCCAGCAGAACTGCTGAGCTTTACATGGTATCCATGACAACTGCTGATTCCAGTGTCGAGGCCATTTCTCTTTGCCACTGAGAAATGCAGCGTGACTGACTCTGTTGCTACCTGTCAACATGAATGTTTCTGTGAGCTCTGGTGTCACTCATCTCCATGATCATCTCAGCCAACATGCATCAGTACTGCAAGAAAAGAAGTCAATCAGCAGAGGAGAGCATTTGATAACTAAGAGGAAGACTTGCAAAGCCGTTTTCTCATGAGTACCCTGAATAGGGGGCACTCATTTTGTTTCAACGGTCCAAACGCCCAACCTTCAGAAAGAGGAAGTCAGATAGAAATAGTCCCTGAGAGCACACTGTGTAGCTAAGCCTGCTGGGGCTGGGTGAAGAAATTGGCGCTGAGATCCAGGCTGGATCCATTGCTTTTGTTTACAATAGGCACTCTCTCTACCCCACCTCTCAGTACTTGAGACTTAAAGTGCTACAGGCAGCTGGATCTGTTTGCATGCAGGATGAAGAGGGTTAAAACACTGTTTATATAAGATCCAATCTCTCACCATCTCTAAAGCAGCCGTTGGCCTGTCATCAGTGAGATACAATCCAGTCTTCTCATGCACGGGAACACACACACCCTGCGTTTCTCCCTCCCAGGCTAGGAACCTCTCTGCCACCAAGGGCTGCCATCCATCGCCTAGTAACCACGGCAACCCAACCTACTCTAAAACCAAACCAAAAAAATAAAATAACACATCCTCTTTGCATGACACATTTTTTTTCTCCCCTTTTTGGTACACTTTTTTTGAATGGTTTTCTAACAACTTGAAGCACAGGATCAAGGAATTAGGGTGGTCTACTTGAGGCAGATGGGATAGTAGCTGGGAACTGTTCCCTTTCTGATTAATTTCAGCAGCATCGGAATATATTTGGAGCACACCCTAGTAACCTCTTGAGATTAAATTACATAGTCTTAATATTTCTGTTCCTCCATGCAACTGATGTTTGTTTTTTAAAGGGTAAGATGCTGCCTCCCAATGGGTGATGCCATCTGACTGGTTTCCCCATGTCCTCCCATTCACCCATCTCTGCTCCCACCCTTGCCTGCCTCTAACCCACCACTGGCCAGCCCCCTTGCCCTACTCTGGGCTGCTGAACACTGGTGCTGTGGTGGTTTTCAAGGTTAATTCCTAGGCTAACCGTATGGCCTATAGTTTAAAAGCACATCTATGTTCACTGCCACTCTGAAAAAGGGAATTATTTCTCAGTCTTTCAAGGCTTGAGACTAATATAGGCCATTGTGATTCAGGAAGAAACCCAAGGTTGGAGGGTGGGATGAGTACCCTCTGAAAAAGGGAATTTGCTGGTGAAAAGAGGCTGGATCTTGTGGAAGACTGTCTTGGATGGGGAAGTACTACCTGGAGATTTCAAATTCACTTGGCCTGCAAACAACAGAGTTATCCGTATCTTCCACATGTGAATGTCATTGCAAGGGTGACTCTAGACAAACTACAAACCGATGGACCGTCAAGCTCCCCAGGAGCCCCTTGGATGGCAGCGTTGCTTCAGAGTGTTTCCTGTTTCTGGAATTCCTTGTTAGGGAACTTTAAAGAAGAAAAGAAAAACTTGAATTGTGTTGAATTACTGTATCTTTTACTTTTTTTTTTTTGAAAAGATAAACTTGTAAATAGAGTGATTTGAAATACTATATGGCAAAGTTTTATATTTGATATTCTTTAAGTTAGTTGCTCACACACTTAGGCTTTGATTGCTGAAGAAGTATGTTTAAGAGGGAGAGAGGGGAGGCAAAGCTGAAGAGAGTCAAGGTCACTGTCCCCGCTTCGGCCTGAAGGAAAGAGAAGACATTTCTATGGCCTTGCTCTCTGCTGTCCTGTTGGTGGGCACGACACATCAGTGGTGTTCAGTCTTTATGTGTTTTTAAGCATCCCTTGGGCTTTGGATTTGGAGATGGGAAGAGCATCTCCAGGCAATGAGTTTTTCAAAGAATGCCTACTTAGTAGTAAGATGAAGCTCAGGATTTAAATAAGTGGGGTCAGGCATTCGAGTTTTTGTCTTTCTTCTCAGGTGTATTTCTTGGTACCCCCAAGATATCAGGCCAGAAAGAGATGAGTCAGTTGCTGTGCTCTTTACTTCTTTTTCTCCACATCTTCTGAGGCTTTAGAAATGTGGACAAGCTAGTTTTCAAATTTTGTGTGCGTCTGTAAGTTCTTAAAGAACCAGCTTCTTAGAATGTTCAGTTCTCAATGTGCTGCTGCTTTCCCTTCTCCTAAACATTTTAAAACTCTTCCCTTTCACCTCCAATTCCCGTGATCCCAAAAGAAGAGGAAGACTCCAGGAGGGGTATAGATTGTGCCGTCATAGCTTTACAGGTGGTTTTAAAGTTAACAGGGGTTTGTCATGGTGATTCACTACTCAGTTTATCAGCTCAAGGATTATACAGCTCTTTTCCGGGAACTCACCCAGGAGCAAGCGAGACACTACCATTGAATCAGGGAATGAGAATTAAGAATGGACAGGACCAAGACAGAACTCAAGAAAGCCACTGGGGAAAACTCGAGAAGAAAGGGAGTATACTAGTAGGTTAGATCTGTGAACCTGAGGACAAGAAGACCTTGGGAAATGGAGGCCTCAGGGGATGTGCATTCACATACTATTACGCTTCTCAAAGAGAGACCAACATCATGCTTTTAACACATTTGATGAGGTTTTTTATTTGTGTTTTTGTTTGTTTTTTGAGATGGAGTCTCACTCTGTGGCCCAGGCTGGAGTGCAGTGGCGCAATCTTGGCTCACTGCAACCTCCACCTCCCAGGTTCAAGTGATTCTCCTGTCTCAGCCTCCCAAGTAGCTGGGACTACAGGCATGAGCCATCACACCCAGCTAGTTTTTTGTATTTTTAGTAAAGATGGGGTTTTGCCATGTTTGCCAGGCTGATCTCGAACTCCTGACCTCAAGTGATCTGCCCACTTCAGACCCCCAAAGTGCTGGGATTCCAGGTGTGAGCCGCTGCGGCCGACCACATTTGATGTTTGAAGTTGTAATCTGTCCCATCATAAACTTACCTGGAGCTCATGTGGAGGAACAGAAGGCCAAGATCCTTGCTTTGGGGGTGCCTCACGAAGCATCCCTGTAGACATTTGGCCCCAGCTTCACTGCTTGGAAGCATGTCCCTCCCTCTTGAGTTGGCTCTGATTTGAAATCGGGAGAAACAGAGCTGCTGCCAATGGGATCTTTTAGGTAACTCCCTCCCTAGCTTCCGTGTGTCTGTGCAGTGCCCATGAGCTGCTGCCAATGGGATCTTTCAGGTACCCCCTCCCCAGCTTCCCTGTGGCTGTGCGGTGCCCTTGACAGATGGCTTCTCTGTTTCCCTTTGCCCAGCCAGGCTCCCCTCCTTCCTATTAGCTACAAAACTGGATAAACTTCAGAATATGAGCCAATGAGTAGGAAGGAACTTGAAGACTAAAGATTTTACTCTCTCCCCTATCCATGCCCCCTACCTCTGACTCTCTCTGTGTGAACAGGAAACTTTAGGGCAGATGAGGAGAATGAATTGGTTATCAGAGTGGAAGACCATGGCCCAGGATCCCTGAGCTTTCCCAGTAGCCTCCAGTTTCCTTTGTAAGACCCAGGGATCACTTAGCCATAGCCTGAATCTTTTAGGGGTATTAAGGTCAGCCTCTCACTCTTCCTTCAGGTTACTAACAAAATTTCGTAGCTAAAGAATGCCATGGCCGGGTGCAGTGGCTCACGCCTATAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGAGGTCAGGAGATTGAGACCATCCTGGCTACGACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGTGTGGTGGCGGGCGCCTGTAGTCCCAGCTACTCTGGAGGCTGAGGCAGGAGAATGGCATGAACCCAGGAGGCAGAGATTGCAGTGAGCCAAGATCACGCCCCTGCACTCCAGCCTGGGTGACAGAGCCAGACTCCGTCTCAAAGG
转录物:ARHGAP26-001 ENST00000274498
蛋白质序列(SEQ ID NO.:106),融合基因的编码部分加阴影。
MGLPALEFSDCCLDSPHFRETLKSHEAELDKTNKFIKELIKDGKSLISALKNLSSAKRKFADSLNEFKFQCIGDAETDDEMCIARSLQEFATVLRNLEDERIRMIENASEVLITPLEKFRKEQIGAAKEAKKKYDKETEKYCGILEKHLNLSSKKKESQLQEADSQVDLVRQHFYEVSLEYVFKVQEVQERKMFEFVEPLLAFLQGLFTFYHHGYELAKDFGDFKTQLTISIQNTRNRFEGTRSEVESLMKKMKENPLEHKTISPYTMEGYLYVQEKRHFGTSWVKHYCTYQRDSKQITMVPFDQKSGGKGGEDESVILKSCTRRKTDSIEKRFCFDVEAVDRPGVITMQALSEEDRRLWMEAMDGREPVYNSNKDSQSEGTAQLDSIGFSIIRKCIHAVETRGINEQGLYRIVGVNSRVQKLLSVLMDPKTASETETDICAEWEIKTITSALKTYLRMLPGPLMMYQFQRSFIKAAKLENQESRVSEIHSLVHRLPEKNRQMLQLLMNHLANVANNHKQNLMTVANLGVVFGPTLLRPQEETVAAIMDIKFQNIVIEILIENHEKIFNTVPDMPLTNAQLHLSRKKSSDSKPPSCSERPLTLFHTVQSTEKQEQRNSIINSSLESVSSNPNSILNSSSSLQPNMNSSDPDLAVVKPTRPNSLPPNPSPTSPLSPSWPMFSAPSSPMPTSSTSSDSSPVRSVAGFVWFSVAAVVLSLARSSLHAVFSLLVNFVPCHPNLHLLFDRPEEAVHEDSSTPFRKAKALYACKAEHDSELSFTAGTVFDNVHPSQEPGWLEGTLNGKTGLIPENYVEFL
CLDN18-ARHGAP26融合序列
cDNA序列(SEQ ID NO.:107),ARHGAP26加下划线。
ATGGCCGTGACTGCCTGTCAGGGCTTGGGGTTCGTGGTTTCACTGATTGGGATTGCGGGCATCATTGCTGCCACCTGCATGGACCAGTGGAGCACCCAAGACTTGTACAACAACCCCGTAACAGCTGTTTTCAACTACCAGGGGCTGTGGCGCTCCTGTGTCCGAGAGAGCTCTGGCTTCACCGAGTGCCGGGGCTACTTCACCCTGCTGGGGCTGCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCTGGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTGCCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTGTCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGGGATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCCTCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAAGCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTTTGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTCTACAACTCGAACAAAGACAGCCAGAGTGAAGGGACTGCGCAGTTGGACAGCATTGGCTTCAGCATAATCAGGAAATGCATCCATGCTGTGGAAACCAGAGGGATCAACGAGCAAGGGCTGTATCGAATTGTGGGTGTCAACTCCAGAGTGCAGAAGTTGCTGAGTGTCCTGATGGACCCCAAGACTGCTTCTGAGACAGAAACAGATATCTGTGCTGAATGGGAGATAAAGACCATCACTAGTGCTCTGAAGACCTACCTAAGAATGCTTCCAGGACCACTCATGATGTACCAGTTTCAAAGAAGTTTCATCAAAGCAGCAAAACTGGAGAACCAGGAGTCTCGGGTCTCTGAAATCCACAGCCTTGTTCATCGGCTCCCAGAGAAAAATCGGCAGATGTTACAGCTGCTCATGAACCACTTGGCAAATGTTGCTAACAACCACAAGCAGAATTTGATGACGGTGGCAAACCTTGGTGTGGTGTTTGGACCCACTCTGCTGAGGCCTCAGGAAGAAACAGTAGCAGCCATCATGGACATCAAATTTCAGAACATTGTCATTGAGATCCTAATAGAAAACCACGAAAAGATATTTAACACCGTGCCCGATATGCCTCTCACCAATGCCCAGCTGCACCTGTCTCGGAAGAAGAGCAGTGACTCCAAGCCCCCGTCCTGCAGCGAGAGGCCCCTGACGCTCTTCCACACCGTTCAGTCAACAGAGAAACAGGAACAAAGGAACAGCATCATCAACTCCAGTTTGGAATCTGTCTCATCAAATCCAAACAGCATCCTTAATTCCAGCAGCAGCTTACAGCCCAACATGAACTCCAGTGACCCAGACCTGGCTGTGGTCAAACCCACCCGGCCCAACTCACTCCCCCCGAATCCAAGCCCAACTTCACCCCTCTCGCCATCTTGGCCCATGTTCTCGGCGCCATCCAGCCCTATGCCCACCTCATCCACGTCCAGCGACTCATCCCCCGTCAGGTCTGTTGCAGGGTTTGTTTGGTTTTCTGTTGCTGCCGTTGTTCTCTCATTGGCTCGGTCCTCTCTTCATGCAGTGTTCAGCCTCCTCGTCAACTTTGTTCCCTGCCATCCAAACCTGCACTTGCTTTTTGACAGGCCAGAAGAAGCGGTACATGAAGACTCCAGCACACCGTTCCGGAAGGCAAAAGCCTTGTATGCCTGCAAAGCTGAACATGACTCAGAACTTTCGTTCACAGCAGGCACGGTCTTCGATAACGTTCACCCATCTCAGGAGCCTGGCTGGTTGGAGGGGACTCTGAACGGAAAGACTGGCCTCATCCCTGAGAATTACGTGGAGTTCCTCTAA
蛋白质序列(SEQ ID NO.:108),ARHGAP26加下划线。
MAVTACQGLGFVVSLIGIAGIIAATCMDQWSTQDLYNNPVTAVFNYQGLWRSCVRESSGFTECRGYFTLLGLPAMLQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGLCAIAGVSVFANMLVTNFWMSTANMYTGMGGMVQTVQTRYTFGAALFVGWVAGGLTLIGGVMMCIACRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVYNSNKDSQSEGTAQLDSIGFSIIRKCIHAVETRGINEQGLYRIVGVNSRVQKLLSVLMDPKTASETETDICAEWEIKTITSALKTYLRMLPGPLMMYQFQRSFIKAAKLENQESRVSEIHSLVHRLPEKNRQMLQLLMNHLANVANNHKQNLMTVANLGVVFGPTLLRPQEETVAAIMDIKFQNIVIEILIENHEKIFNTVPDMPLTNAQLHLSRKKSSDSKPPSCSERPLTLFHTVQSTEKQEQRNSIINSSLESVSSNPNSILNSSSSLQPNMNSSDPDLAVVKPTRPNSLPPNPSPTSPLSPSWPMFSAPSSPMPTSSTSSDSSPVRSVAGFVWFSVAAVVLSLARSSLHAVFSLLVNFVPCHPNLHLLFDRPEEAVHEDSSTPFRKAKALYACKAEHDSELSFTAGTVFDNVHPSQEPGWLEGTLNGKTGLIPENYVEFL
蛋白质结构域
具有695个残基的查询序列内的结构域
融合基因#3:SNX2-PRDM6
位于以下转录物的内含子12-13中的chr5:122162808上的SNX2的确认的基因组断裂点:SNX2-001(ENST00000379516)
位于以下转录物的内含子3-4处的chr5∶122437347上的PRDM6的确认的基因组断裂点:PRDM6-001(ENST00000407847)
转录物:SNX2-001 ENST00000379516
cDNA序列(SEQ ID NO.:109),融合基因的编码部分加阴影。
AGGCCGGCCGGGGGCGGGGAGGCTGGCGGGTCGGCGCGGGCCCAGCCGTGCGTGCTCACGTGACGGGTCCGCGAGGCCCAGCTCGCGCAGTCGTTCGGGTGAGCGAAGATGGCGGCCGAGAGGGAACCTCCTCCGCTGGGGGACGGGAAGCCCACCGACTTTGAGGATCTGGAGGACGGAGAGGACCTGTTCACCAGCACTGTCTCCACCCTAGAGTCAAGTCCATCATCTCCAGAACCAGCTAGTCTTCCTGCAGAAGATATTAGTGCAAACTCCAATGGCCCAAAACCCACAGAAGTTGTATTAGATGATGACAGAGAAGATCTTTTTGCAGAAGCCACAGAAGAAGTTTCTTTGGACAGCCCTGAAAGGGAACCTATCCTATCCTCGGAACCTTCTCCTGCAGTCACACCTGTCACTCCTACTACACTCATTGCTCCTAGAATTGAATCAAAGAGTATGTCTGCTCCCGTGATCTTTGATAGATCCAGGGAAGAGATTGAAGAAGAAGCAAATGGAGACATTTTTGACATAGAAATTGGTGTATCAGATCCAGAAAAAGTTGGTGATGGCATGAATGCCTATATGGCATATAGAGTAACAACAAAGACATCTCTTTCCATGTTCAGTAAGAGTGAATTTTCAGTGAAAAGAAGATTCAGCGACTTTCTTGGTTTGCACAGCAAATTAGCAAGCAAATATTTACATGTTGGTTATATTGTGCCACCAGCTCCAGAAAAGAGTATAGTAGGGATGACCAAGGTCAAAGTGGGTAAAGAAGACTCATCATCCACTGAGTTTGTAGAAAAACGGAGAGCAGCTCTTGAAAGGTATCTTCAAAGAACAGTAAAACATCCAACTTTACTACAGGATCCTGATTTAAGGCAGTTCTTGGAAAGTTCAGAGCTGCCTAGAGCAGTTAATACACAGGCTCTGAGTGGAGCAGGAATATTGAGGATGGTGAACAAGGCTGCCGACGCTGTCAACAAAATGACAATCAAGATGAATGAATCGGATGCATGGTTTGAAGAAAAGCAGCAGCAATTTGAGAATCTGGATCAGCAACTTAGGAAACTTCATGTCAGTGTTGAAGCCTTGGTCTGTCATAGAAAAGAACTTTCAGCCAACACAGCTGCCTTTGCTAAAAGTGCTGCCATGTTAGGTAATTCTGAGGATCATACTGCTTTATCTAGAGCTTTGTCTCAGCTTGCAGAGGTTGAGGAGAAGATAGACCAGTTACATCAAGAACAAGCTTTTGCTGACTTTTATATGTTTTCAGAACTACTTAGTGACTACATTCGTCTTATTGCTGCAGTGAAAGGTGTGTTTGACCATCGAATGAAGTGCTGGCAGAAATGGGAAGATGCTCAAATTACTTTGCTCAAAAAACGTGAAGCTGAAGCAAAAATGATGGTTGCTAACAAACCAGATAAAATACAGCAAGCTAAAAATGAAATAAGAGAGTGGGAGGCGAAAGTGCAACAAGGGGAAAGAGATTTTGAACAGATATCTAAAACGATTCGAAAAGAAGTGGGAAGATTTGAGAAAGAACGAGTGAAGGATTTTAAAACCGTTATCATCAAGTACTTAGAATCACTAGTTCAAACACAACAACAGCTGATAAAATACTGGGAAGCATTCCTACCTGAAGCCAAAGCCATTGCCTAGCAATAAGATTGTTGCCGTTAAGAAGACCTTGGATGTTGTTCCAGTTATGCTGGATTCCACAGTGAAATCATTTAAAACCATCTAAATAAACCACTATATATTTTATGAATTACATGTGGTTTTATATACACACACACACACACACACACACACACACACACACTCTGACATTTTATTACAAGCTGCATGTCCTGACCCTCTTTGAATTAAGTGGACTGTGGCATGACATTCTGCAATACTTTGCTGAATTGAACACTATTGTGTCTTAAATACTTGCACTAAATAGTGCACTGCAAGACCAGAAAATTTTACAATATTTTTTCTTTACAATATGTTCTGTAGTATGTTTACCCTCTTTATGAAGTGAATTACCAATGCTTTGAATAATGTTCACTTATACATTCCTGTACAGAAATTACGATTTTGTGATTACAGTAATAAAATGATATTCCTTGTGAAA
转录物:SNX2-001 ENST00000379516
蛋白质序列(SEQ ID NO.:110),融合基因的编码部分加阴影。
MAAEREPPPLGDGKPTDFEDLEDGEDLFTSTVSTLESSPSSPEPASLPAEDISANSNGPKPTEVVLDDDREDLFAEATEEVSLDSPEREPILSSEPSPAVTPVTPTTLIAPRIESKSMSAPVIFDRSREEIEEEANGDIFDIEIGVSDPEKVGDGMNAYMAYRVTTKTSLSMFSKSEFSVKRRFSDFLGLHSKLASKYLHVGYIVPPAPEKSIVGMTKVKVGKEDSSSTEFVEKRRAALERYLQRTVKHPTLLQDPDLRQFLESSELPRAVNTQALSGAGILRMVNKAADAVNKMTIKMNESDAWFEEKQQQFENLDQQLRKLHVSVEALVCHRKELSANTAAFAKSAAMLGNSEDHTALSRALSQLAEVEEKIDQLHQEQAFADFYMFSELLSDYIRLIAAVKGVFDHRMKCWQKWEDAQITLLKKREAEAKMMVANKPDKIQQAKNEIREWEAKVQQGERDFEQISKTIRKEVGRFEKERVKDFKTVIIKYLESLVQTQQQLIKYWEAFLPEAKAIA
转录物:PRDM6-001 ENST00000407847
cDNA序列(SEQ ID NO.:111),融合基因的编码部分加阴影。
CTCTCTCACACACACACACACACACACACACACACACACACACACACACACACACACACACACACACACACTCACTCTATTTTGTGCTGTCGTAAAACCCACGTGTCCAGCCGGGAAGCTGCCAGAGCGTGGAACCAAGGAGCCAGGACGCGGCAGCGGCCAAGCGCAGCAGCCCACGGCGGTTGAGTCGGGCGCCCAGGTCCGTCCGCACTCTCGCGCCCTCCGCGGGCCTCCCAATTTTCTCGCTTGCAGGTCGGGAGGTTTCCGGGCGGCACAATCTCTAGGACTCTCCTCCCGCGCTGCTCAGGGGCATGTAGCGCACGCAGGGCGCACACTCTCGCGCACCCGCACGCTCACCGAGACACCCGCACGCACCCACCGGCAGCACCGAGTTTTCAGTTCGAGGCGCCGGACATGCTGAAGCCCGGAGACCCCGGCGGTTCGGCCTTCCTCAAAGTGGACCCAGCCTACCTGCAGCACTGGCAGCAACTCTTCCCTCACGGAGGCGCAGGCCCGCTCAAGGGCAGCGGCGCCGCGGGTCTCCTGAGCGCGCCGCAGCCTCTTCAGCCGCCGCCGCCGCCCCCGCCCCCGGAGCGCGCTGAGCCTCCGCCGGACAGCCTGCGCCCGCGGCCCGCCTCTCTCTCCTCCGCCTCGTCCACGCCGGCTTCCTCTTCCACCTCCGCCTCCTCCGCCTCCTCCTGCGCTGCTGCGGCCGCTGCCGCCGCGCTGGCTGGTCTCTCGGCCCTGCCGGTGTCGCAGCTGCCGGTGTTCGCGCCTCTAGCCGCCGCTGCCGTCGCCGCCGAGCCGCTGCCCCCCAAGGAACTGTGCCTCGGCGCCACCTCCGGCCCCGGGCCCGTCAAGTGCGGTGGTGGTGGCGGCGGCGGCGGGGAGGGTCGCGGCGCCCCGCGCTTCCGCTGCAGCGCAGAGGAGCTGGACTATTACCTGTATGGCCAGCAGCGCATGGAGATCATCCCGCTCAACCAGCACACCAGCGACCCCAACAACCGTTGCGACATGTGCGCGGACAACCGCAACGGCGAGTGCCCTATGCATGGGCCACTGCACTCGCTGCGCCGGCTTGTGGGCACCAGCAGCGCTGCGGCCGCCGCGCCCCCGCCGGAGCTGCCGGAGTGGCTGCGGGACCTGCCTCGCGAGGTGTGCCTCTGCACCAGTACTGTGCCCGGCCTGGCCTACGGCATCTGCGCGGCGCAGAGGATCCAGCAAGGCACCTGGATTGGACCTTTCCAAGGCGTGCTTCTGCCCCCAGAGAAGGTGCAGGCAGGCGCCGTGAGGAACACGCAGCATCTCTGGGAGATATATGACCAGGATGGGACACTACAGCACTTTATTGATGGTGGGGAACCTAGTAAGTCGAGCTGGATGAGGTATATCCGATGTGCAAGGCACTGCGGAGAACAGAATCTAACAGTAGTTCAGTACAGGTCGAATATATTCTACCGAGCCTGTATAGATATCCCTAGGGGCACCGAGCTTCTGGTGTGGTACAATGACAGCTATACGTCTTTCTTTGGGATCCCCTTACAATGCATTGCCCAGGATGAAAACTTAAATGTCCCTTCAACGGTAATGGAAGCCATGTGCAGACAAGACGCCCTGCAGCCCTTCAACAAAAGCAGCAAACTCGCCCCTACCACCCAGCAGCGCTCCGTTGTTTTCCCCCAGACTCCGTGCAGCAGGAACTTCTCTCTTCTGGATAAGTCTGGGCCCATTGAATCAGGATTTAATCAAATCAACGTGAAAAACCAGCGAGTCCTGGCAAGCCCAACTTCCACAAGCCAGCTCCACTCGGAGTTCAGTGACTGGCATCTTTGGAAATGTGGGCAGTGCTTTAAGACTTTCACCCAGCGGATCCTCTTACAGATGCACGTGTGCACGCAGAACCCCGACAGACCCTACCAATGCGGCCACTGCTCCCAGTCCTTTTCCCAGCCTTCAGAACTGAGGAACCACGTGGTCACTCACTCTAGTGACCGGCCTTTCAAGTGCGGCTACTGTGGTCGTGCCTTTGCCGGGGCCACCACCCTCAACAACCACATCCGAACCCACACTGGAGAAAAGCCCTTCAAGTGCGAGAGGTGTGAGAGGAGCTTCACGCAGGCCACCCAGCTGAGCCGACACCAGCGGATGCCCAATGAGTGCAAGCCAATAACTGAGAGCCCAGAATCAATCGAAGTGGATTAACGGATTGACTGGTTGGAATTAAACTGCAAGGAAAGTCATGATTAAATGTCACGGACACTTAAGCAAAACCAAAGATTTCCTCTGAGCAACTTTCAATCAGTCCCAGAAAACCAAAAGCAGTAATAAAATAAGTAAGATGTTAAGAGATATTGATCCTGGCATGGAAGTCAGACCAGGAAAGAGATTATTTATTTATGACTTAGGGATGAGACTTATTTCAGTGGACAACTAACCTGGGATGGTTAACATTTCCAGTCCCACCATGTATTTTGCTTTGTTTCTAAAAAGCTTTTTAAAAACTGTTATTTAATACCAAAGGGAGGAATCGTATGGGTTCTTCTGCCCACCGTTGTGACTAAGAATGCACAGGGACTTGGTTCTCGTTGCACCTTTTTTTAGTAACATGTTTCATGGGGACCCACTGTACAGCCCTTCATTCTGCTGTGTCAGTTTGGCCTGGCCTGACACTGGCTGCCCCAGCGGGGACCACGGAAGCAGAGTGAGAGCCTTCGCTGAGTCAATGCTACCTTCAGCCCCAGACGCATCCCATTTCCATGTCTTCCATGCTCACTGCTCATGCACTTTTTACACGGTTTCTTCCAAACAGCCCGGTCTTGATGCAGGAGAGTCTGGAAAAGGAAGAAAATGGTTTCAGTTTCAAAATTCAAAGGAAAAAGTTGAGGACTTATTTTGTCCTGTCAAGATTGCAAGAACATGTAAAATGTACGGAGCTTCATAATACGTTATATTGTTCCGAAGCAGCTCGTTGAGAAACATTTGTTTTCAATAACATTTTAGCTTAAAAAAAAAAAAAGAAAATGAAAATAAAGTTCTTTGGTTTAAGGCTGGA
转录物:PRDM6-001 ENST00000407847
蛋白质序列(SEQ ID NO.:112),融合基因的编码部分加阴影。
MLKPGDPGGSAFLKVDPAYLQHWQQLFPHGGAGPLKGSGAAGLLSAPQPLQPPPPPPPPERAEPPPDSLRPRPASLSSASSTPASSSTSASSASSCAAAAAAAALAGLSALPVSQLPVFAPLAAAAVAAEPLPPKELCLGATSGPGPVKCGGGGGGGGEGRGAPRFRCSAEELDYYLYGQQRMEIIPLNQHTSDPNNRCDMCADNRNGECPMHGPLHSLRRLVGTSSAAAAAPPPELPEWLRDLPREVCLCTSTVPGLAYGICAAQRIQQGTWIGPFQGVLLPPEKVQAGAVRNTQHLWEIYDQDGTLQHFIDGGEPSKSSWMRYIRCARHCGEQNLTVVQYRSNIFYRACIDIPRGTELLVWYNDSYTSFFGIPLQCIAQDENLNVPSTVMEAMCRQDALQPFNKSSKLAPTTQQRSVVFPQTPCSRNFSLLDKSGPIESGFNQINVKNQRVLASPTSTSQLHSEFSDWHLWKCGQCFKTFTQRILLQMHVCTQNPDRPYQCGHCSQSFSQPSELRNHVVTHSSDRPFKCGYCGRAFAGATTLNNHIRTHTGEKPFKCERCERSFTQATQLSRHQRMPNECKPITESPESIEVD
SNX2-PRDM6融合序列外显子12到外显子4
cDNA序列(SEQ ID NO.:113)
ATGGCGGCCGAGAGGGAACCTCCTCCGCTGGGGGACGGGAAGCCCACCGACTTTGAGGATCTGGAGGACGGAGAGGACCTGTTCACCAGCACTGTCTCCACCCTAGAGTCAAGTCCATCATCTCCAGAACCAGCTAGTCTTCCTGCAGAAGATATTAGTGCAAACTCCAATGGCCCAAAACCCACAGAAGTTGTATTAGATGATGACAGAGAAGATCTTTTTGCAGAAGCCACAGAAGAAGTTTCTTTGGACAGCCCTGAAAGGGAACCTATCCTATCCTCGGAACCTTCTCCTGCAGTCACACCTGTCACTCCTACTACACTCATTGCTCCTAGAATTGAATCAAAGAGTATGTCTGCTCCCGTGATCTTTGATAGATCCAGGGAAGAGATTGAAGAAGAAGCAAATGGAGACATTTTTGACATAGAAATTGGTGTATCAGATCCAGAAAAAGTTGGTGATGGCATGAATGCCTATATGGCATATAGAGTAACAACAAAGACATCTCTTTCCATGTTCAGTAAGAGTGAATTTTCAGTGAAAAGAAGATTCAGCGACTTTCTTGGTTTGCACAGCAAATTAGCAAGCAAATATTTACATGTTGGTTATATTGTGCCACCAGCTCCAGAAAAGAGTATAGTAGGGATGACCAAGGTCAAAGTGGGTAAAGAAGACTCATCATCCACTGAGTTTGTAGAAAAACGGAGAGCAGCTCTTGAAAGGTATCTTCAAAGAACAGTAAAACATCCAACTTTACTACAGGATCCTGATTTAAGGCAGTTCTTGGAAAGTTCAGAGCTGCCTAGAGCAGTTAATACACAGGCTCTGAGTGGAGCAGGAATATTGAGGATGGTGAACAAGGCTGCCGACGCTGTCAACAAAATGACAATCAAGATGAATGAATCGGATGCATGGTTTGAAGAAAAGCAGCAGCAATTTGAGAATCTGGATCAGCAACTTAGGAAACTTCATGTCAGTGTTGAAGCCTTGGTCTGTCATAGAAAAGAACTTTCAGCCAACACAGCTGCCTTTGCTAAAAGTGCTGCCATGTTAGGTAATTCTGAGGATCATACTGCTTTATCTAGAGCTTTGTCTCAGCTTGCAGAGGTTGAGGAGAAGATAGACCAGTTACATCAAGAACAAGCTTTTGCTGACTTTTATATGTTTTCAGAACTACTTAGTGACTACATTCGTCTTATTGCTGCAGTGAAAGGTGTGTTTGACCATCGAATGAAGTGCTGGCAGAAATGGGAAGATGCTCAAATTACTTTGCTCAAAAAACGTGAAGCTGAAGCAAAAATGATGGTTGCTAACAAACCAGATAAAATACAGCAAGCTAAAAATGAAATAAGAGAGATATATGACCAGGATGGGACACTACAGCACTTTATTGATGGTGGGGAACCTAGTAAGTCGAGCTGGATGAGGTATATCCGATGTGCAAGGCACTGCGGAGAACAGAATCTAACAGTAGTTCAGTACAGGTCGAATATATTCTACCGAGCCTGTATAGATATCCCTAGGGGCACCGAGCTTCTGGTGTGGTACAATGACAGCTATACGTCTTTCTTTGGGATCCCCTTACAATGCATTGCCCAGGATGAAAACTTAAATGTCCCTTCAACGGTAATGGAAGCCATGTGCAGACAAGACGCCCTGCAGCCCTTCAACAAAAGCAGCAAACTCGCCCCTACCACCCAGCAGCGCTCCGTTGTTTTCCCCCAGACTCCGTGCAGCAGGAACTTCTCTCTTCTGGATAAGTCTGGGCCCATTGAATCAGGATTTAATCAAATCAACGTGAAAAACCAGCGAGTCCTGGCAAGCCCAACTTCCACAAGCCAGCTCCACTCGGAGTTCAGTGACTGGCATCTTTGGAAATGTGGGCAGTGCTTTAAGACTTTCACCCAGCGGATCCTCTTACAGATGCACGTGTGCACGCAGAACCCCGACAGACCCTACCAATGCGGCCACTGCTCCCAGTCCTTTTCCCAGCCTTCAGAACTGAGGAACCACGTGGTCACTCACTCTAGTGACCGGCCTTTCAAGTGCGGCTACTGTGGTCGTGCCTTTGCCGGGGCCACCACCCTCAACAACCACATCCGAACCCACACTGGAGAAAAGCCCTTCAAGTGCGAGAGGTGTGAGAGGAGCTTCACGCAGGCCACCCAGCTGAGCCGACACCAGCGGATGCCCAATGAGTGCAAGCCAATAACTGAGAGCCCAGAATCAATCGAAGTGGATTAA
蛋白质序列(SEQ ID NO.:114)
MAAEREPPPLGDGKPTDFEDLEDGEDLFTSTVSTLESSPSSPEPASLPAEDISANSNGPKPTEVVLDDDREDLFAEATEEVSLDSPEREPILSSEPSPAVTPVTPTTLIAPRIESKSMSAPVIFDRSREEIEEEANGDIFDIEIGVSDPEKVGDGMNAYMAYRVTTKTSLSMFSKSEFSVKRRFSDFLGLHSKLASKYLHVGYIVPPAPEKSIVGMTKVKVGKEDSSSTEFVEKRRAALERYLQRTVKHPTLLQDPDLRQFLESSELPRAVNTQALSGAGILRMVNKAADAVNKMTIKMNESDAWFEEKQQQFENLDQQLRKLHVSVEALVCHRKELSANTAAFAKSAAMLGNSEDHTALSRALSQLAEVEEKIDQLHQEQAFADFYMFSELLSDYIRLIAAVKGVFDHRMKCWQKWEDAQITLLKKREAEAKMMVANKPDKIQQAKNEIREIYDQDGTLQHFIDGGEPSKSSWMRYIRCARHCGEQNLTVVQYRSNIFYRACIDIPRGTELLVWYNDSYTSFFGIPLQCIAQDENLNVPSTVMEAMCRQDALQPFNKSSKLAPTTQQRSVVFPQTPCSRNFSLLDKSGPIESGFNQINVKNQRVLASPTSTSQLHSEFSDWHLWKCGQCFKTFTQRILLQMHVCTQNPDRPYQCGHCSQSFSQPSELRNHVVTHSSDRPFKCGYCGRAFAGATTLNNHIRTHTGEKPFKCERCERSFTQATQLSRHQRMPNECKPITESPESIEVD
蛋白质结构域
无跨膜结构域。
SNX2-PRDM6融合序列外显子2到外显子7
cDNA序列(SEQ ID NO.:115)
ATGGCGGCCGAGAGGGAACCTCCTCCGCTGGGGGACGGGAAGCCCACCGACTTTGAGGATCTGGAGGACGGAGAGGACCTGTTCACCAGCACTGTCTCCACCCTAGAGTCAAGTCCATCATCTCCAGAACCAGCTAGTCTTCCTGCAGAAGATATTAGTGCAAACTCCAATGGCCCAAAACCCACAGAAGTTGTATTAGATGATGACAGAGAAGATCTTTTTGCAGACCCTACCAATGCGGCCACTGCTCCCAGTCCTTTTCCCAGCCTTCAGAACTGAGGAACCACGTGGTCACTCACTCTAGTGACCGGCCTTTCAAGTGCGGCTACTGTGGTCGTGCCTTTGCCGGGGCCACCACCCTCAACAACCACATCCGAACCCACACTGGAGAAAAGCCCTTCAAGTGCGAGAGGTGTGAGAGGAGCTTCACGCAGGCCACCCAGCTGAGCCGACACCAGCGGATGCCCAATGAGTGCAAGCCAATAACTGAGAGCCCAGAATCAATCGAAGTGGATTAA
蛋白质序列(SEQ ID NO.:116)
MAAEREPPPLGDGKPTDFEDLEDGEDLFTSTVSTLESSPSSPEPASLPAEDISANSNGPKPTEVVLDDDREDLFAEPYQCGHCSQSFSQPSELRNHVVTHSSDRPFKCGYCGRAFAGATTLNNHIRTHTGEKPFKCERCERSFTQATQLSRHQRMPNECKPITESPESIEVD
蛋白质结构域
无跨膜结构域。
融合基因#4:MLL3-PRKAG2
chr7:151365906(参考转录物:MLL3-001(ENST00000262189))上的MLL3的确认的基因组断裂点
chr7∶151951997(参考转录物:PRKAG2-001(ENST00000287878))上的PRKAG2的确认的基因组断裂点
转录物:MLL3-001 ENST00000262189
cDNA序列(SEQ ID NO.:117),融合基因的一部分加阴影。
GAGGTGCGCGCGCCCGCGCCGATGTGTGTGAGTGCGTGTCCTGCTCGCTCCATGTTGCCGCCTCTCCCGGTACCTGCTGCTGCTCCCGGGGCTGCGGGAAATGCGAGAGGCTGAGCCGGGGAGGAGGAACCCGAGCAGCAGCGGCGGCGGCGGCGGCCGCGGCGGCGGGAGCCCCCCAGGAGGAGGACCGGGATCCATGTGTCTTTCCTGGTGACTAGGATGTCGTCGGAGGAGGACAAGAGCGTGGAGCAGCCGCAGCCGCCGCCACCACCCCCCGAGGAGCCTGGAGCCCCGGCCCCGAGCCCCGCAGCCGCAGACAAAAGACCTCGGGGCCGGCCTCGCAAAGATGGCGCTTCCCCTTTCCAGAGAGCCAGAAAGAAACCTCGAAGTAGGGGGAAAACTGCAGTGGAAGATGAGGACAGCATGGATGGGCTGGAGACAACAGAAACAGAAACGATTGTGGAAACAGAAATCAAAGAACAATCTGCAGAAGAGGATGCTGAAGCAGAAGTGGATAACAGCAAACAGCTAATTCCAACTCTTCAGCGATCTGTGTCTGAGGAATCGGCAAACTCCCTGGTCTCTGTTGGTGTAGAAGCCAAAATCAGTGAACAGCTCTGCGCTTTTTGTTACTGTGGGGAAAAAAGTTCCTTAGGACAAGGAGACTTAAAACAATTCAGAATAACGCCTGGATTTATCTTGCCATGGAGAAACCAACCTTCTAACAAGAAGGACATTGATGACAACAGCAATGGAACCTATGAGAAAATGCAAAACTCAGCACCACGAAAACAAAGAGGACAGAGAAAAGAACGATCTCCTCAGCAGAATATAGTATCTTGTGTAAGTGTAAGCACCCAGACAGCTTCAGATGATCAAGCTGGTAAACTGTGGGATGAACTCAGTCTGGTTGGGCTTCCAGATGCCATTGATATCCAAGCCTTATTTGATTCTACAGGCACTTGTTGGGCTCATCACCGTTGTGTGGAGTGGTCACTAGGAGTATGCCAGATGGAAGAACCATTGTTAGTGAACGTGGACAAAGCTGTTGTCTCAGGGAGCACAGAACGATGTGCATTTTGTAAGCACCTTGGAGCCACTATCAAATGCTGTGAAGAGAAATGTACCCAGATGTATCATTATCCTTGTGCTGCAGGAGCCGGCACCTTTCAGGATTTCAGTCACATCTTCCTGCTTTGTCCAGAACACATTGACCAAGCTCCTGAAAGATCGAAGGAAGATGCAAACTGTGCAGTGTGCGACAGCCCGGGAGACCTCTTAGATCAGTTCTTTTGTACTACTTGTGGTCAGCACTATCATGGAATGTGCCTGGATATAGCGGTTACTCCATTAAAACGTGCAGGTTGGCAATGTCCTGAGTGCAAAGTGTGCCAGAACTGCAAACAATCGGGAGAAGATAGCAAGATGCTAGTGTGTGATACGTGTGACAAAGGGTATCATACTTTTTGTCTTCAACCAGTTATGAAATCAGTACCAACCAATGGCTGGAAATGCAAAAATTGCAGAATATGTATAGAGTGTGGCACACGGTCTAGTTCTCAGTGGCACCACAATTGCCTGATATGTGACAATTGTTACCAACAGCAGGATAACTTATGTCCCTTCTGTGGGAAGTGTTATCATCCAGAATTGCAGAAAGACATGCTTCATTGTAATATGTGCAAAAGGTGGGTTCACCTAGAGTGTGACAAACCAACAGATCATGAACTGGATACTCAGCTCAAAGAAGAGTATATCTGCATGTATTGTAAACACCTGGGAGCTGAGATGGATCGTTTACAGCCAGGTGAGGAAGTGGAGATAGCTGAGCTCACTACAGATTATAACAATGAAATGGAAGTTGAAGGCCCTGAAGATCAAATGGTATTCTCAGAGCAGGCAGCTAATAAAGATGTCAACGGTCAGGAGTCCACTCCTGGAATTGTTCCAGATGCGGTTCAAGTCCACACTGAAGAGCAACAGAAGAGTCATCCCTCAGAAAGTCTTGACACAGATAGTCTTCTTATTGCTGTATCATCCCAACATACAGTGAATACTGAATTGGAAAAACAGATTTCTAATGAAGTTGATAGTGAAGACCTGAAAATGTCTTCTGAAGTGAAGCATATTTGTGGCGAAGATCAAATTGAAGATAAAATGGAAGTGACAGAAAACATTGAAGTCGTTACACACCAGATCACTGTGCAGCAAGAACAACTGCAGTTGTTAGAGGAACCTGAAACAGTGGTATCCAGAGAAGAATCAAGGCCTCCAAAATTAGTCATGGAATCTGTCACTCTTCCACTAGAAACCTTAGTGTCCCCACATGAGGAAAGTATTTCATTATGTCCTGAGGAACAGTTGGTTATAGAAAGGCTACAAGGAGAAAAGGAACAGAAAGAAAATTCTGAACTTTCTACTGGATTGATGGACTCTGAAATGACTCCTACAATTGAGGGTTGTGTGAAAGATGTTTCATACCAAGGAGGCAAATCTATAAAGTTATCATCTGAGACAGAGTCATCATTTTCATCATCAGCAGACATAAGCAAGGCAGATGTGTCTTCCTCCCCAACACCTTCTTCAGACTTGCCTTCGCATGACATGCTGCATAATTACCCTTCAGCTCTTAGTTCCTCTGCTGGAAACATCATGCCAACAACTTACATCTCAGTCACTCCAAAAATTGGCATGGGTAAACCAGCTATTACTAAGAGAAAATTTTCTCCTGGTAGACCTCGGTCCAAACAGGGGGCTTGGAGTACCCATAATACAGTGAGCCCACCTTCCTGGTCCCCAGACATTTCAGAAGGTCGGGAAATTTTTAAACCCAGGCAGCTTCCTGGCAGTGCCATTTGGAGCATCAAAGTGGGCCGTGGGTCTGGATTTCCAGGAAAGCGGAGACCTCGAGGTGCAGGACTGTCGGGGCGAGGTGGCCGAGGCAGGTCAAAGCTGAAAAGTGGAATCGGAGCTGTTGTATTACCTGGGGTGTCTACTGCAGATATTTCATCAAATAAGGATGATGAAGAAAACTCTATGCACAATACAGTTGTGTTGTTTTCTAGCAGTGACAAGTTCACTTTGAATCAGGATATGTGTGTAGTTTGTGGCAGTTTTGGCCAAGGAGCAGAAGGAAGATTACTTGCCTGTTCTCAGTGTGGTCAGTGTTACCATCCATACTGTGTCAGTATTAAGATCACTAAAGTGGTTCTTAGCAAAGGTTGGAGGTGTCTTGAGTGCACTGTGTGTGAGGCCTGTGGGAAGGCAACTGACCCAGGAAGACTCCTGCTGTGTGATGACTGTGACATAAGTTATCACACCTACTGCCTAGACCCTCCATTGCAGACAGTTCCCAAAGGAGGCTGGAAGTGCAAATGGTGTGTTTGGTGCAGACACTGTGGAGCAACATCTGCAGGTCTAAGATGTGAATGGCAGAACAATTACACACAGTGCGCTCCTTGTGCAAGCTTATCTTCCTGTCCAGTCTGCTATCGAAACTATAGAGAAGAAGATCTTATTCTGCAATGTAGACAATGTGATAGATGGATGCATGCAGTTTGTCAGAACTTAAATACTGAGGAAGAAGTGGAAAATGTAGCAGACATTGGTTTTGATTGTAGCATGTGCAGACCCTATATGCCTGCGTCTAATGTGCCTTCCTCAGACTGCTGTGAATCTTCACTTGTAGCACAAATTGTCACAAAAGTAAAAGAGCTAGACCCACCCAAGACTTATACCCAGGATGGTGTGTGTTTGACTGAATCAGGGATGACTCAGTTACAGAGCCTCACAGTTACAGTTCCAAGAAGAAAACGGTCAAAACCAAAATTGAAATTGAAGATTATAAATCAGAATAGCGTGGCCGTCCTTCAGACCCCTCCAGACATCCAATCAGAGCATTCAAGGGATGGTGAAATGGATGATAGTCGAGAAGGAGAACTTATGGATTGTGATGGAAAATCAGAATCTAGTCCTGAGCGGGAAGCTGTGGATGATGAAACTAAGGGAGTGGAAGGAACAGATGGTGTCAAAAAGAGAAAAAGGAAACCATACAGACCAGGTATTGGTGGATTTATGGTGCGGCAAAGAAGTCGAACTGGGCAAGGGAAAACCAAAAGATCTGTGATCAGAAAAGATTCCTCAGGCTCTATTTCCGAGCAGTTACCTTGCAGAGATGATGGCTGGAGTGAGCAGTTACCAGATACTTTAGTTGATGAATCTGTTTCTGTTACTGAAAGCACTGAAAAAATAAAGAAGAGATACCGAAAAAGGAAAAATAAGCTTGAAGAAACTTTCCCTGCCTATTTACAAGAAGCTTTCTTTGGAAAAGATCTTCTAGATACAAGTAGACAAAGCAAGATAAGTTTAGATAATCTGTCAGAAGATGGAGCTCAGCTTTTATATAAAACAAACATGAACACAGGTTTCTTGGATCCTTCCTTAGATCCACTACTTAGTTCATCCTCGGCTCCAACAAAATCTGGAACTCACGGTCCTGCTGATGACCCATTAGCTGATATTTCTGAAGTTTTAAACACAGATGATGACATTCTTGGAATAATTTCAGATGATCTAGCAAAATCAGTTGATCATTCAGATATTGGTCCTGTCACTGATGATCCTTCCTCTTTGCCTCAGCCAAATGTCAATCAGAGTTCACGACCATTAAGTGAAGAACAGCTAGATGGGATCCTCAGTCCTGAACTAGACAAAATGGTCACAGATGGAGCAATTCTTGGAAAATTATATAAAATTCCAGAGCTTGGCGGAAAAGATGTTGAAGACTTATTTACAGCTGTACTTAGTCCTGCGAACACTCAGCCAACTCCATTGCCACAGCCTCCCCCACCAACACAGCTGTTGCCAATACACAATCAGGATGCTTTTTCACGGATGCCTCTCATGAATGGCCTTATTGGATCCAGTCCTCATCTCCCACATAATTCTTTGCCACCTGGAAGCGGACTGGGAACTTTCTCTGCAATTGCACAATCCTCTTATCCTGATGCCAGGGATAAAAATTCAGCCTTTAATCCAATGGCAAGTGATCCTAACAACTCTTGGACATCATCAGCTCCCACTGTGGAAGGAGAAAATGACACAATGTCGAATGCCCAGAGAAGCACGCTTAAGTGGGAGAAAGAGGAGGCTCTGGGTGAAATGGCAACTGTTGCCCCAGTTCTCTACACCAATATTAATTTCCCCAACTTAAAGGAAGAATTCCCTGATTGGACTACTAGAGTGAAGCAAATTGCCAAATTGTGGAGAAAAGCAAGCTCACAAGAAAGAGCACCATATGTGCAAAAAGCCAGAGATAACAGAGCTGCTTTACGCATTAATAAAGTACAGATGTCAAATGATTCCATGAAAAGGCAGCAACAGCAAGATAGCATTGATCCCAGCTCTCGTATTGATTCGGAGCTTTTTAAAGATCCTTTAAAGCAAAGAGAATCAGAACATGAACAGGAATGGAAATTTAGACAGCAAATGCGTCAGAAAAGTAAGCAGCAAGCTAAAATTGAAGCCACACAGAAACTTGAACAGGTGAAAAATGAGCAGCAGCAGCAGCAACAACAGCAATTTGGTTCTCAGCATCTTCTGGTGCAGTCTGGTTCAGATACACCAAGTAGTGGGATACAGAGTCCCTTGACACCTCAGCCTGGCAATGGAAATATGTCTCCTGCACAGTCATTCCATAAAGAACTGTTTACAAAACAGCCACCCAGTACCCCTACGTCTACATCTTCAGATGATGTGTTTGTAAAGCCACAAGCTCCACCTCCTCCTCCAGCCCCATCCCGGATTCCCATCCAGGATAGTCTTTCTCAGGCTCAGACTTCTCAGCCACCCTCACCGCAAGTGTTTTCACCTGGGTCCTCTAACTCACGACCACCATCTCCAATGGATCCATATGCAAAAATGGTTGGTACCCCTCGACCACCTCCTGTGGGCCATAGTTTTTCCAGAAGAAATTCTGCTGCACCAGTGGAAAACTGTACACCTTTATCATCGGTATCTAGGCCCCTTCAAATGAATGAGACAACAGCAAATAGGCCATCCCCTGTCAGAGATTTATGTTCTTCTTCCACGACAAATAATGACCCCTATGCAAAACCTCCAGACACACCTAGGCCTGTGATGACAGATCAATTTCCCAAATCCTTGGGCCTATCCCGGTCTCCTGTAGTTTCAGAACAAACTGCAAAAGGCCCTATAGCAGCTGGAACCAGTGATCACTTTACTAAACCATCTCCTAGGGCAGATGTGTTTCAAAGACAAAGGATACCTGACTCATATGCACGACCCTTGTTGACACCTGCACCTCTTGATAGTGGTCCTGGACCTTTTAAGACTCCAATGCAACCTCCTCCATCCTCTCAGGATCCTTATGGATCAGTGTCACAGGCATCAAGGCGATTGTCTGTTGACCCTTATGAAAGGCCTGCTTTGACACCAAGACCTATAGATAATTTTTCTCATAATCAGTCAAATGATCCATATAGTCAGCCTCCCCTTACCCCACATCCAGCAGTGAATGAATCTTTTGCCCATCCTTCAAGGGCTTTTTCCCAGCCTGGAACCATATCAAGGCCAACATCTCAGGACCCATACTCCCAACCCCCAGGAACTCCACGACCTGTTGTAGATTCTTATTCCCAATCTTCAGGAACAGCTAGGTCCAATACAGACCCTTACTCTCAACCTCCTGGAACTCCCCGGCCTACTACTGTTGACCCATATAGTCAGCAGCCCCAAACCCCAAGACCATCTACACAAACTGACTTGTTTGTTACACCTGTAACAAATCAGAGGCATTCTGATCCATATGCTCATCCTCCTGGAACACCAAGACCTGGAATTTCTGTCCCTTACTCTCAGCCACCAGCAACACCAAGGCCAAGGATTTCAGAGGGTTTTACTAGGTCCTCAATGACAAGACCAGTCCTCATGCCAAATCAGGATCCTTTCCTGCAAGCAGCACAAAACCGAGGACCAGCTTTACCTGGCCCGTTGGTAAGGCCACCTGATACATGTTCCCAGACACCTAGGCCCCCTGGACCTGGTCTTTCAGACACATTTAGCCGTGTTTCCCCATCTGCTGCCCGTGATCCCTATGATCAGTCTCCAATGACTCCAAGATCTCAGTCTGACTCTTTTGGAACAAGTCAAACTGCCCATGATGTTGCTGATCAGCCAAGGCCTGGATCAGAGGGGAGCTTCTGTGCATCTTCAAACTCTCCAATGCACTCCCAAGGCCAGCAGTTCTCTGGTGTCTCCCAACTTCCTGGACCTGTGCCAACTTCAGGAGTAACTGATACACAGAATACTGTAAATATGGCCCAAGCAGATACAGAGAAATTGAGACAGCGGCAGAAGTTACGTGAAATCATTCTCCAGCAGCAACAGCAGAAGAAGATTGCAGGTCGACAGGAGAAGGGGTCACAGGACTCACCCGCAGTGCCTCATCCAGGGCCTCTTCAACACTGGCAACCAGAGAATGTTAACCAGGCTTTCACCAGACCCCCACCTCCCTATCCTGGGAACATTAGGTCTCCTGTTGCCCCTCCTTTAGGACCTAGATATGCTGTTTTCCCAAAAGATCAGCGTGGACCCTATCCTCCTGATGTTGCTAGTATGGGGATGAGACCTCATGGATTTAGATTTGGATTTCCAGGAGGTAGTCATGGTACCATGCCGAGTCAAGAGCGCTTCCTTGTGCCTCCTCAGCAAATACAGGGATCTGGAGTTTCTCCACAGCTAAGAAGATCAGTATCTGTAGATATGCCTAGGCCTTTAAATAACTCACAAATGAATAATCCAGTTGGACTTCCTCAGCATTTTTCACCACAGAGCTTGCCAGTTCAGCAGCACAACATACTGGGCCAAGCATATATTGAACTGAGACATAGGGCTCCTGACGGAAGGCAACGGCTGCCTTTCAGTGCTCCACCTGGCAGCGTTGTAGAGGCATCTTCTAATCTGAGACATGGAAACTTCATTCCCCGGCCAGACTTTCCGGGCCCTAGACACACAGACCCCATGCGACGACCTCCCCAGGGTCTACCTAATCAGCTACCTGTGCACCCAGATTTGGAACAAGTGCCACCATCTCAACAAGAGCAAGGTCATTCTGTCCATTCATCTTCTATGGTCATGAGGACTCTGAACCATCCACTAGGTGGTGAATTTTCAGAAGCTCCTTTGTCAACATCTGTACCGTCTGAAACAACGTCTGATAATTTACAGATAACCACCCAGCCTTCTGATGGTCTAGAGGAAAAACTTGATTCTGATGACCCTTCTGTGAAGGAACTGGATGTTAAAGACCTTGAGGGGGTTGAAGTCAAAGACTTAGATGATGAAGATCTTGAAAACTTAAATTTAGATACAGAGGATGGCAAGGTAGTTGAATTGGATACTTTAGATAATTTGGAAACTAATGATCCCAACCTGGATGACCTCTTAAGGTCAGGAGAGTTTGATATCATTGCATATACAGATCCAGAACTTGACATGGGAGATAAGAAAAGCATGTTTAATGAGGAACTAGACCTTCCAATTGATGATAAGTTAGATAATCAGTGTGTATCTGTTGAACCAAAAAAAAAGGAACAAGAAAACAAAACTCTGGTTCTCTCTGATAAACATTCACCACAGAAAAAATCCACTGTTACCAATGAGGTAAAAACGGAAGTACTGTCTCCAAATTCTAAGGTGGAATCCAAATGTGAAACTGAAAAAAATGATGAGAATAAAGATAATGTTGACACTCCTTGCTCACAGGCTTCTGCTCACTCAGACCTAAATGATGGAGAAAAGACTTCTTTGCATCCTTGTGATCCAGATCTATTTGAGAAAAGAACCAATCGAGAAACTGCTGGCCCCAGTGCAAATGTCATTCAGGCATCCACTCAACTACCTGCTCAAGATGTAATAAACTCTTGTGGCATAACTGGATCAACTCCAGTTCTCTCAAGTTTACTTGCTAATGAGAAATCTGATAATTCAGACATTAGGCCATCGGGGTCTCCACCACCACCAACTCTGCCGGCCTCCCCATCCAATCATGTGTCAAGTTTGCCTCCTTTCATAGCACCGCCTGGCCGTGTTTTGGATAATGCCATGAATTCTAATGTGACAGTAGTCTCTAGGGTAAACCATGTTTTTTCTCAGGGTGTGCAGGTAAACCCAGGGCTCATTCCAGGTCAATCAACAGTTAACCACAGTCTGGGGACAGGAAAACCTGCAACTCAAACTGGGCCTCAAACAAGTCAGTCTGGTACCAGTAGCATGTCTGGACCCCAACAGCTAATGATTCCTCAAACATTAGCACAGCAGAATAGAGAGAGGCCCCTTCTTCTAGAAGAACAGCCTCTACTTCTACAGGATCTTTTGGATCAAGAAAGGCAAGAACAGCAGCAGCAAAGACAGATGCAAGCCATGATTCGTCAGCGATCAGAACCGTTCTTCCCTAATATTGATTTTGATGCAATTACAGATCCTATAATGAAAGCCAAAATGGTGGCCCTTAAAGGTATAAATAAAGTGATGGCACAAAACAATCTGGGCATGCCACCAATGGTGATGAGCAGGTTCCCTTTTATGGGCCAGGTGGTAACTGGAACACAGAACAGTGAAGGACAGAACCTTGGACCACAGGCCATTCCTCAGGATGGCAGTATAACACATCAGATTTCTAGGCCTAATCCTCCAAATTTTGGTCCAGGCTTTGTCAATGATTCACAGCGTAAGCAGTATGAAGAGTGGCTCCAGGAGACCCAACAGCTGCTTCAAATGCAGCAGAAGTATCTTGAAGAACAAATTGGTGCTCACAGAAAATCTAAGAAGGCCCTTTCAGCTAAACAACGTACTGCCAAGAAAGCTGGGCGTGAATTTCCAGAGGAAGATGCAGAACAACTCAAGCATGTTACTGAACAGCAAAGCATGGTTCAGAAACAGCTAGAACAGATTCGTAAACAACAGAAAGAACATGCTGAATTGATTGAAGATTATCGGATCAAACAGCAGCAGCAATGTGCAATGGCCCCACCTACCATGATGCCCAGTGTCCAGCCCCAGCCACCCCTAATTCCAGGTGCCACTCCACCCACCATGAGCCAACCCACCTTTCCCATGGTGCCACAGCAGCTTCAGCACCAGCAGCACACAACAGTTATTTCTGGCCATACTAGCCCTGTTAGAATGCCCAGTTTACCTGGATGGCAACCCAACAGTGCTCCTGCCCACCTGCCCCTCAATCCTCCTAGAATTCAGCCCCCAATTGCCCAGTTACCAATAAAAACTTGTACACCAGCCCCAGGGACAGTCTCAAATGCAAATCCACAGAGTGGACCACCACCTCGGGTAGAATTTGATGACAACAATCCCTTTAGTGAAAGTTTTCAAGAACGGGAACGTAAGGAACGTTTACGAGAACAGCAAGAGAGACAACGGATCCAACTCATGCAGGAGGTAGATAGACAAAGAGCTTTGCAGCAGAGGATGGAAATGGAGCAGCATGGTATGGTGGGCTCTGAGATAAGTAGTAGTAGGACATCTGTGTCCCAGATTCCCTTCTACAGTTCCGACTTACCTTGTGATTTTATGCAACCTCTAGGACCCCTTCAGCAGTCTCCACAACACCAACAGCAAATGGGGCAGGTTTTACAGCAGCAGAATATACAACAAGGATCAATTAATTCACCCTCCACCCAAACTTTCATGCAGACTAATGAGCGAAGGCAGGTAGGCCCTCCTTCATTTGTTCCTGATTCACCATCAATCCCTGTTGGAAGCCCAAATTTTTCTTCTGTGAAGCAGGGACATGGAAATCTTTCTGGGACCAGCTTCCAGCAGTCCCCAGTGAGGCCTTCTTTTACACCTGCTTTACCAGCAGCACCTCCAGTAGCTAATAGCAGTCTCCCATGTGGCCAAGATTCTACTATAACCCATGGACACAGTTATCCGGGATCAACCCAATCGCTCATTCAGTTGTATTCTGATATAATCCCAGAGGAAAAAGGGAAAAAGAAAAGAACAAGAAAGAAGAAAAGAGATGATGATGCAGAATCCACCAAGGCTCCATCAACTCCCCATTCAGATATAACTGCCCCACCGACTCCAGGCATCTCAGAAACTACCTCTACTCCTGCAGTGAGCACACCCAGTGAGCTTCCTCAACAAGCCGACCAAGAGTCGGTGGAACCAGTCGGCCCATCCACTCCCAATATGGCAGCAGGCCAGCTATGTACAGAATTAGAGAACAAACTGCCCAATAGTGATTTCTCACAAGCAACTCCAAATCAACAGACGTATGCAAATTCAGAAGTAGACAAGCTCTCCATGGAAACCCCTGCCAAAACAGAAGAGATAAAACTGGAAAAGGCTGAGACAGAGTCCTGCCCAGGCCAAGAGGAGCCTAAATTGGAGGAACAGAATGGTAGTAAGGTAGAAGGAAACGCTGTAGCCTGTCCTGTCTCCTCAGCACAGAGTCCTCCCCATTCTGCTGGGGCCCCTGCTGCCAAAGGAGACTCAGGGAATGAACTTCTGAAACACTTGTTGAAAAATAAAAAGTCATCTTCTCTTTTGAATCAAAAACCTGAGGGCAGTATTTGTTCAGAAGATGACTGTACAAAGGATAATAAACTAGTTGAGAAGCAGAACCCAGCTGAAGGACTGCAAACTTTGGGGGCTCAAATGCAAGGTGGTTTTGGATGTGGCAACCAGTTGCCAAAAACAGATGGAGGAAGTGAAACCAAGAAACAGCGAAGCAAACGGACTCAGAGGACGGGTGAGAAAGCAGCACCTCGCTCAAAGAAAAGGAAAAAGGACGAAGAGGAGAAACAAGCTATGTACTCTAGCACTGACACGTTTACCCACTTGAAACAGCAGAATAATTTAAGTAATCCTCCAACACCCCCTGCCTCTCTTCCTCCTACACCACCTCCTATGGCTTGTCAGAAGATGGCCAATGGTTTTGCAACAACTGAAGAACTTGCTGGAAAAGCCGGAGTGTTAGTGAGCCATGAAGTTACCAAAACTCTAGGACCTAAACCATTTCAGCTGCCCTTCAGACCCCAGGACGACTTGTTGGCCCGAGCTCTTGCTCAGGGCCCCAAGACAGTTGATGTGCCAGCCTCCCTCCCAACACCACCTCATAACAATCAGGAAGAATTAAGGATACAGGATCACTGTGGTGATCGAGATACTCCTGACAGTTTTGTTCCCTCATCCTCTCCTGAGAGTGTGGTTGGGGTAGAAGTGAGCAGGTATCCAGATCTGTCATTGGTCAAGGAGGAGCCTCCAGAACCGGTGCCGTCCCCCATCATTCCAATTCTTCCTAGCACTGCTGGGAAAAGTTCAGAATCAAGAAGGAATGACATCAAAACTGAGCCAGGCACTTTATATTTTGCGTCACCTTTTGGTCCTTCCCCAAATGGTCCCAGATCAGGTCTTATATCTGTAGCAATTACTCTGCATCCTACAGCTGCTGAGAACATTAGCAGTGTTGTGGCTGCATTTTCCGACCTTCTTCACGTCCGAATCCCTAACAGCTATGAGGTTAGCAGTGCTCCAGATGTCCCATCCATGGGTTTGGTCAGTAGCCACAGAATCAACCCGGGTTTGGAGTATCGACAGCATTTACTTCTCCGTGGGCCTCCGCCAGGATCTGCAAACCCTCCCAGATTAGTGAGCTCTTACCGGCTGAAGCAGCCTAATGTACCATTTCCTCCAACAAGCAATGGTCTTTCTGGATATAAGGATTCTAGTCATGGTATTGCAGAAAGCGCAGCACTCAGACCACAGTGGTGTTGTCATTGTAAAGTGGTTATTCTTGGAAGTGGTGTGCGGAAATCTTTCAAAGATCTGACCCTTTTGAACAAGGATTCCCGAGAAAGCACCAAGAGGGTAGAGAAGGACATTGTCTTCTGTAGTAATAACTGCTTTATTCTTTATTCATCAACTGCACAAGCGAAAAACTCAGAAAACAAGGAATCCATTCCTTCATTGCCACAATCACCTATGAGAGAAACGCCTTCCAAAGCATTTCATCAGTACAGCAACAACATCTCCACTTTGGATGTGCACTGTCTCCCCCAGCTCCCAGAGAAAGCTTCTCCCCCTGCCTCACCACCCATCGCCTTCCCTCCTGCTTTTGAAGCAGCCCAAGTCGAGGCCAAGCCAGATGAGCTGAAGGTGACAGTCAAGCTGAAGCCTCGGCTAAGAGCTGTCCATGGTGGGTTTGAAGATTGCAGGCCGCTCAATAAAAAATGGAGAGGAATGAAATGGAAGAAGTGGAGCATTCATATTGTAATCCCTAAGGGGACATTTAAACCACCTTGTGAGGATGAAATAGATGAATTTCTAAAGAAATTGGGCACTTCCCTTAAACCTGATCCTGTGCCCAAAGACTATCGGAAATGTTGCTTTTGTCATGAAGAAGGTGATGGATTGACAGATGGACCAGCAAGGCTACTCAACCTTGACTTGGATCTGTGGGTCCACTTGAACTGCGCTCTGTGGTCCACGGAGGTCTATGAGACTCAGGCTGGTGCCTTAATAAATGTGGAGCTAGCTCTGAGGAGAGGCCTACAAATGAAATGTGTCTTCTGTCACAAGACGGGTGCCACTAGTGGATGCCACAGATTTCGATGCACCAACATTTATCACTTCACTTGCGCCATTAAAGCACAATGCATGTTTTTTAAGGACAAAACTATGCTTTGCCCCATGCACAAACCAAAGGGAATTCATGAGCAAGAATTAAGTTACTTTGCAGTCTTCAGGAGGGTCTATGTTCAGCGTGATGAGGTGCGACAGATTGCTAGCATCGTGCAACGAGGAGAACGGGACCATACCTTTCGCGTGGGTAGCCTCATCTTCCACACAATTGGTCAGCTGCTTCCACAGCAGATGCAAGCATTCCATTCTCCTAAAGCACTCTTCCCTGTGGGCTATGAAGCCAGCCGGCTGTACTGGAGCACTCGCTATGCCAATAGGCGCTGCCGCTACCTGTGCTCCATTGAGGAGAAGGATGGGCGCCCAGTGTTTGTCATCAGGATTGTGGAACAAGGCCATGAAGACCTGGTTCTAAGTGACATCTCACCTAAAGGTGTCTGGGATAAGATTTTGGAGCCTGTGGCATGTGTGAGAAAAAAGTCTGAAATGCTCCAGCTTTTCCCAGCGTATTTAAAAGGAGAGGATCTGTTTGGCCTGACCGTCTCTGCAGTGGCACGCATAGCGGAATCACTTCCTGGGGTTGAGGCATGTGAAAATTATACCTTCCGATACGGCCGAAATCCTCTCATGGAACTTCCTCTTGCCGTTAACCCCACAGGTTGTGCCCGTTCTGAACCTAAAATGAGTGCCCATGTCAAGAGGTTTGTGTTAAGGCCTCACACCTTAAACAGCACCAGCACCTCAAAGTCATTTCAGAGCACAGTCACTGGAGAACTGAACGCACCTTATAGTAAACAGTTTGTTCACTCCAAGTCATCGCAGTACCGGAAGATGAAAACTGAATGGAAATCCAATGTGTATCTGGCACGGTCTCGGATTCAGGGGCTGGGCCTGTATGCTGCTCGAGACATTGAGAAACACACCATGGTCATTGAGTACATCGGGACTATCATTCGAAACGAAGTAGCCAACAGGAAAGAGAAGCTTTATGAGTCTCAGAACCGTGGTGTGTACATGTTCCGCATGGATAACGACCATGTGATTGACGCGACGCTCACAGGAGGGCCCGCAAGGTATATCAACCATTCGTGTGCACCTAATTGTGTGGCTGAAGTGGTGACTTTTGAGAGAGGACACAAAATTATCATCAGCTCCAGTCGGAGAATCCAGAAAGGAGAAGAGCTCTGCTATGACTATAAGTTTGACTTTGAAGATGACCAGCACAAGATTCCGTGTCACTGTGGAGCTGTGAACTGCCGGAAGTGGATGAACTGAAATGCATTCCTTGCTAGCTCAGCGGGCGGCTTGTCCCTAGGAAGAGGCGATTCAACACACCATTGGAATTTTGCAGACAGAAAGAGATTTTTGTTTTCTGTTTTATGACTTTTTGAAAAAGCTTCTGGGAGTTCTGATTTCCTCAGTCCTTTAGGTTAAAGCAGCGCCAGGAGGAAGCTGACAGAAGCAGCGTTCCTGAAGTGGCCGAGGTTAAACGGAATCACAGAATGGTCCAGCACTTTTGCTTTTTTTTCTTTTCCTTTTCTTTTTTTTTTGTTTGTTTTTTGTTTTGTTTTTCCCTTGTGGGTGGGTTTCATTGTTTTGGTTTTCTAGTCTCACTAAGGAGAAACTTTTACTGGGGCAAAGAGCCGATGGCTGCCCTGCCCCGGGCAGGGGCCTTCCTATGAATGTAAGACTGAAATCACCAGCGAGGGGGACAGAGAGTGCTGGCCACGGCCTTATTAAAAAGGGGCAGGCCCTCTAACTTCAAAATGTTTTTAAATAAAGTAGACACCACTGAACAAGGAATGTACTGAAATGACTTCCTTAGGGATAGAGCTAAGGGATAATAACTTGCACTAAATACATTTAAATACTTGATTCCATGAGTCAGTTTATTGTAGTTTTTGATTTCTGTAAAATAAGAGAAACTTTTGTATTTATTATTGAATAAGTGAATGAAGCTATTTTTAAATAAAGTTAGAAGAAAGCCAAGCTGCTGCTGTTACCTGCAGAACTAACAAACCCTGTTACTTTGTACAGATATGTAAATATTTTGAGAAAAAATACAGTATAAAAATAGTTATTGACCAAATGCTACCAGGCTCTGCAGCAGCTCGGGGGCTTATAAAATGTTCATAGGGATGTTACAATATAATTTTGTGTTATAAAATATGCCATTATAATTATGTAATAACCAAAATTTCAACCTAGAGTGTTGGGGGTTTTTTGGAAACCGCAGTCTATTAGTACTCAATGGTTTTATACACCTTACTTCTGACAGAGCGGGGCGTATGCTACGACTACAACTTTTATAGCTGTTTTGGTAATTTAAACTAATTTTTTCATATTATATTGTTGCATCCCTACTTCTTCAGTCAGGTTTTTTTGTGCTTACAATTTGTGATAACTGTGAATAACTGCTTAAAAATACACCCAAATGGAGGCTGAATTTTTTCTTCAGCAAAAGTAGTTTTGATTAGAACTTTGTTTCAGCCACAGAGAATCATGTAAACGTAATAGGATCATGTAGCAGAAACTTAAATCTAACCCTTTAGCCTTCTATTTAACACAAAAATTTGAAAAAGTTAAAAAAAAAAAGGAGATGTGATTATGCTTACAGCTGCAGGACTCTGGCAATAGGGTTTTTGGAAGATGTAATTTTAAAATGTGTTTGTATGAACTGTTTGTTTACATTTCTTTAATAAAAAAAACACTGTTTTGTGTTTGCTTGTAGAAACTTAATCAGCATTTTGAACCAGGTTAGCTTTTTATTTTGTACTTAAAATTCTGGTACTGACACTTCACAGGCTAAGTATAAAATGAAGTTTTGTGTGCACAATTCAAGTGGACTGTAAACTGTTGGTATATTCAGTGATGCAGTTCTGAACTTGTATATGGCATGATGTATTTTTATCTTACAGAATAAATCAATTGTATATATTTTTCTCTTGATAAATAGCTGTATGAAATTTGTTTCCTGAATATTTTTCTTCTCTTGTACAATATCCTGACATCCTACCAGTATTTGTCCTACCGGGTTTTTGTTGTTTTCTGTTCTGTATAATAGTATCTAATGTTGGCAAAAATTGAATTTTTTGAAGTATACAGAGTGTTATGGGTTTTGGAATTTGTGGACACAGATTTAGAAGATCACCATTTACAAATAAAATATTTTACATCTATAA
转录物:MLL3-001 ENST00000262189
蛋白质序列(SEQ ID NO.:118),融合基因的一部分加阴影。
MSSEEDKSVEQPQPPPPPPEEPGAPAPSPAAADKRPRGRPRKDGASPFQRARKKPRSRGKTAVEDEDSMDGLETTETETIVETEIKEQSAEEDAEAEVDNSKQLIPTLQRSVSEESANSLVSVGVEAKISEQLCAFCYCGEKSSLGQGDLKQFRITPGFILPWRNQPSNKKDIDDNSNGTYEKMQNSAPRKQRGQRKERSPQQNIVSCVSVSTQTASDDQAGKLWDELSLVGLPDAIDIQALFDSTGTCWAHHRCVEWSLGVCQMEEPLLVNVDKAVVSGSTERCAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFSHIFLLCPEHIDQAPERSKEDANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQPVMKSVPTNGWKCKNCRICIECGTRSSSQWHHNCLICDNCYQQQDNLCPFCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDHELDTQLKEEYICMYCKHLGAEMDRLQPGEEVEIAELTTDYNNEMEVEGPEDQMVFSEQAANKDVNGQESTPGIVPDAVQVHTEEQQKSHPSESLDTDSLLIAVSSQHTVNTELEKQISNEVDSEDLKMSSEVKHICGEDQIEDKMEVTENIEVVTHQITVQQEQLQLLEEPETVVSREESRPPKLVMESVTLPLETLVSPHEESISLCPEEQLVIERLQGEKEQKENSELSTGLMDSEMTPTIEGCVKDVSYQGGKSIKLSSETESSFSSSADISKADVSSSPTPSSDLPSHDMLHNYPSALSSSAGNIMPTTYISVTPKIGMGKPAITKRKFSPGRPRSKQGAWSTHNTVSPPSWSPDISEGREIFKPRQLPGSAIWSIKVGRGSGFPGKRRPRGAGLSGRGGRGRSKLKSGIGAVVLPGVSTADISSNKDDEENSMHNTVVLFSSSDKFTLNQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSSCPVCYRNYREEDLILQCRQCDRWMHAVCQNLNTEEEVENVADIGFDCSMCRPYMPASNVPSSDCCESSLVAQIVTKVKELDPPKTYTQDGVCLTESGMTQLQSLTVTVPRRKRSKPKLKLKIINQNSVAVLQTPPDIQSEHSRDGEMDDSREGELMDCDGKSESSPEREAVDDETKGVEGTDGVKKRKRKPYRPGIGGFMVRQRSRTGQGKTKRSVIRKDSSGSISEQLPCRDDGWSEQLPDTLVDESVSVTESTEKIKKRYRKRKNKLEETFPAYLQEAFFGKDLLDTSRQSKISLDNLSEDGAQLLYKTNMNTGFLDPSLDPLLSSSSAPTKSGTHGPADDPLADISEVLNTDDDILGIISDDLAKSVDHSDIGPVTDDPSSLPQPNVNQSSRPLSEEQLDGILSPELDKMVTDGAILGKLYKIPELGGKDVEDLFTAVLSPANTQPTPLPQPPPPTQLLPIHNQDAFSRMPLMNGLIGSSPHLPHNSLPPGSGLGTFSAIAQSSYPDARDKNSAFNPMASDPNNSWTSSAPTVEGENDTMSNAQRSTLKWEKEEALGEMATVAPVLYTNINFPNLKEEFPDWTTRVKQIAKLWRKASSQERAPYVQKARDNRAALRINKVQMSNDSMKRQQQQDSIDPSSRIDSELFKDPLKQRESEHEQEWKFRQQMRQKSKQQAKIEATQKLEQVKNEQQQQQQQQFGSQHLLVQSGSDTPSSGIQSPLTPQPGNGNMSPAQSFHKELFTKQPPSTPTSTSSDDVFVKPQAPPPPPAPSRIPIQDSLSQAQTSQPPSPQVFSPGSSNSRPPSPMDPYAKMVGTPRPPPVGHSFSRRNSAAPVENCTPLSSVSRPLQMNETTANRPSPVRDLCSSSTTNNDPYAKPPDTPRPVMTDQFPKSLGLSRSPVVSEQTAKGPIAAGTSDHFTKPSPRADVFQRQRIPDSYARPLLTPAPLDSGPGPFKTPMQPPPSSQDPYGSVSQASRRLSVDPYERPALTPRPIDNFSHNQSNDPYSQPPLTPHPAVNESFAHPSRAFSQPGTISRPTSQDPYSQPPGTPRPVVDSYSQSSGTARSNTDPYSQPPGTPRPTTVDPYSQQPQTPRPSTQTDLFVTPVTNQRHSDPYAHPPGTPRPGISVPYSQPPATPRPRISEGFTRSSMTRPVLMPNQDPFLQAAQNRGPALPGPLVRPPDTCSQTPRPPGPGLSDTFSRVSPSAARDPYDQSPMTPRSQSDSFGTSQTAHDVADQPRPGSEGSFCASSNSPMHSQGQQFSGVSQLPGPVPTSGVTDTQNTVNMAQADTEKLRQRQKLREIILQQQQQKKIAGRQEKGSQDSPAVPHPGPLQHWQPENVNQAFTRPPPPYPGNIRSPVAPPLGPRYAVFPKDQRGPYPPDVASMGMRPHGFRFGFPGGSHGTMPSQERFLVPPQQIQGSGVSPQLRRSVSVDMPRPLNNSQMNNPVGLPQHFSPQSLPVQQHNILGQAYIELRHRAPDGRQRLPFSAPPGSVVEASSNLRHGNFIPRPDFPGPRHTDPMRRPPQGLPNQLPVHPDLEQVPPSQQEQGHSVHSSSMVMRTLNHPLGGEFSEAPLSTSVPSETTSDNLQITTQPSDGLEEKLDSDDPSVKELDVKDLEGVEVKDLDDEDLENLNLDTEDGKVVELDTLDNLETNDPNLDDLLRSGEFDIIAYTDPELDMGDKKSMFNEELDLPIDDKLDNQCVSVEPKKKEQENKTLVLSDKHSPQKKSTVTNEVKTEVLSPNSKVESKCETEKNDENKDNVDTPCSQASAHSDLNDGEKTSLHPCDPDLFEKRTNRETAGPSANVIQASTQLPAQDVINSCGITGSTPVLSSLLANEKSDNSDIRPSGSPPPPTLPASPSNHVSSLPPFIAPPGRVLDNAMNSNVTVVSRVNHVFSQGVQVNPGLIPGQSTVNHSLGTGKPATQTGPQTSQSGTSSMSGPQQLMIPQTLAQQNRERPLLLEEQPLLLQDLLDQERQEQQQQRQMQAMIRQRSEPFFPNIDFDAITDPIMKAKMVALKGINKVMAQNNLGMPPMVMSRFPFMGQVVTGTQNSEGQNLGPQAIPQDGSITHQISRPNPPNFGPGFVNDSQRKQYEEWLQETQQLLQMQQKYLEEQIGAHRKSKKALSAKQRTAKKAGREFPEEDAEQLKHVTEQQSMVQKQLEQIRKQQKEHAELIEDYRIKQQQQCAMAPPTMMPSVQPQPPLIPGATPPTMSQPTFPMVPQQLQHQQHTTVISGHTSPVRMPSLPGWQPNSAPAHLPLNPPRIQPPIAQLPIKTCTPAPGTVSNANPQSGPPPRVEFDDNNPFSESFQERERKERLREQQERQRIQLMQEVDRQRALQQRMEMEQHGMVGSEISSSRTSVSQIPFYSSDLPCDFMQPLGPLQQSPQHQQQMGQVLQQQNIQQGSINSPSTQTFMQTNERRQVGPPSFVPDSPSIPVGSPNFSSVKQGHGNLSGTSFQQSPVRPSFTPALPAAPPVANSSLPCGQDSTITHGHSYPGSTQSLIQLYSDIIPEEKGKKKRTRKKKRDDDAESTKAPSTPHSDITAPPTPGISETTSTPAVSTPSELPQQADQESVEPVGPSTPNMAAGQLCTELENKLPNSDFSQATPNQQTYANSEVDKLSMETPAKTEEIKLEKAETESCPGQEEPKLEEQNGSKVEGNAVACPVSSAQSPPHSAGAPAAKGDSGNELLKHLLKNKKSSSLLNQKPEGSICSEDDCTKDNKLVEKQNPAEGLQTLGAQMQGGFGCGNQLPKTDGGSETKKQRSKRTQRTGEKAAPRSKKRKKDEEEKQAMYSSTDTFTHLKQQNNLSNPPTPPASLPPTPPPMACQKMANGFATTEELAGKAGVLVSHEVTKTLGPKPFQLPFRPQDDLLARALAQGPKTVDVPASLPTPPHNNQEELRIQDHCGDRDTPDSFVPSSSPESVVGVEVSRYPDLSLVKEEPPEPVPSPIIPILPSTAGKSSESRRNDIKTEPGTLYFASPFGPSPNGPRSGLISVAITLHPTAAENISSVVAAFSDLLHVRIPNSYEVSSAPDVPSMGLVSSHRINPGLEYRQHLLLRGPPPGSANPPRLVSSYRLKQPNVPFPPTSNGLSGYKDSSHGIAESAALRPQWCCHCKVVILGSGVRKSFKDLTLLNKDSRESTKRVEKDIVFCSNNCFILYSSTAQAKNSENKESIPSLPQSPMRETPSKAFHQYSNNISTLDVHCLPQLPEKASPPASPPIAFPPAFEAAQVEAKPDELKVTVKLKPRLRAVHGGFEDCRPLNKKWRGMKWKKWSIHIVIPKGTFKPPCEDEIDEFLKKLGTSLKPDPVPKDYRKCCFCHEEGDGLTDGPARLLNLDLDLWVHLNCALWSTEVYETQAGALINVELALRRGLQMKCVFCHKTGATSGCHRFRCTNIYHFTCAIKAQCMFFKDKTMLCPMHKPKGIHEQELSYFAVFRRVYVQRDEVRQIASIVQRGERDHTFRVGSLIFHTIGQLLPQQMQAFHSPKALFPVGYEASRLYWSTRYANRRCRYLCSIEEKDGRPVFVIRIVEQGHEDLVLSDISPKGVWDKILEPVACVRKKSEMLQLFPAYLKGEDLFGLTVSAVARIAESLPGVEACENYTFRYGRNPLMELPLAVNPTGCARSEPKMSAHVKRFVLKPHTLNSTSTSKSFQSTVTGELNAPYSKQFVHSKSSQYRKMKTEWKSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCHCGAVNCRKWMN
转录物:PRKAG2-001 ENST00000287878
cDNA序列(SEQ ID NO.:119),融合基因的一部分加阴影。
GAGCTGGTTTATTCTGCGGCCGAGGATTACATTTATGCACGAACGGGCTTACTGGTTCCAGATTCCCCACTTGGGCACAGGCATAGGAGGCTTGTTTTCCAAATTGCTGGTTTTAATTGCACCTGCCTTTCAGATTACCTCTGGGAATCTGTGGGAGGAGCCGAGAGGGTGGAAAATGTTTCTTAGCTTTGCAAAAGGAAGAAAACTTTGTCACCCAGCGGGAGACCTCAGCCACGAGTAACCCGGGGAGACACCAGAACCGGGACGGGCTTTGACTGATTTGCCTACGAGGGTTCCGTAGGAAAGGACGCTTGAATTCGGCGCTTCGGCGGCGGCGGCGGCCGCGCGAGTTCCCTGCTCACCCTCCCTCTCCGCGGAAGTCCCCACGAGGTGGCTTCAGGGTGTAACAGAGCGCGCGGCTCCAGTCCGAAGGCAGCGGCCGGGGGAGGGAAGGAGGGGACCGAACCCCCGAGGAGTTTCGCAGAATCAACTTCTGGTTAGAGTTATGGGAAGCGCGGTTATGGACACCAAGAAGAAAAAAGATGTTTCCAGCCCCGGCGGGAGCGGCGGCAAGAAAAATGCCAGCCAGAAGAGGCGTTCGCTGCGCGTGCACATTCCGGACCTGAGCTCCTTCGCCATGCCGCTCCTGGACGGAGACCTGGAGGGTTCCGGAAAGCATTCCTCTCGAAAGGTGGACAGCCCCTTCGGCCCGGGCAGCCCCTCCAAAGGGTTCTTCTCCAGAGGCCCCCAGCCCCGGCCCTCCAGCCCCATGTCTGCACCTGTGAGGCCCAAGACCAGCCCCGGCTCTCCCAAAACCGTGTTCCCGTTCTCCTACCAGGAGTCCCCGCCACGCTCCCCTCGACGCATGAGCTTCAGTGGGATCTTCCGCTCCTCCTCCAAAGAGTCTTCCCCCAACTCCAACCCTGCTACCTCGCCCGGGGGCATCAGGTTTTTCTCCCGCTCCAGAAAAACCTCCGGCCTCTCCTCCTCTCCGTCAACACCCACCCAAGTGACCAAGCAGCACACGTTTCCCCTGGAATCCTATAAGCACGAGCCTGAACGGTTAGAGAATCGCATCTATGCCTCGTCTTCCCCCCCGGACACAGGGCAGAGGTTCTGCCCGTCTTCCTTCCAGAGCCCGACCAGGCCTCCACTGGCATCACCGACACACTATGCTCCCTCCAAAGCCGCGGCGCTGGCGGCGGCCCTGGGACCCGCGGAAGCCGGCATGCTGGAGAAGCTGGAGTTCGAGGACGAAGCAGTAGAAGACTCAGAAAGTGGTGTTTACATGCGATTCATGAGGTCACACAAGTGTTATGACATCGTTCCAACCAGTTCAAAGCTTGTTGTCTTTGATACTACATTACAAGTTAAAAAGGCCTTCTTTGCTTTGGTAGCCAACGGTGTCCGAGCAGCGCCACTGTGGGAGAGTAAAAAACAAAGTTTTGTAGGAATGCTAACAATTACAGATTTCATAAATATACTACATAGATACTATAAATCACCTATGGTACAGATTTATGAATTAGAGGAACATAAAATTGAAACATGGAGGGAGCTTTATTTACAAGAAACATTTAAGCCTTTAGTGAATATATCTCCAGATGCAAGCCTCTTCGATGCTGTATACTCCTTGATCAAAAATAAAATCCACAGATTGCCCGTTATTGACCCTATCAGTGGGAATGCACTTTATATACTTACCCACAAAAGAATCCTCAAGTTCCTCCAGCTTTTTATGTCTGATATGCCAAAGCCTGCCTTCATGAAGCAGAACCTGGATGAGCTTGGAATAGGAACGTACCACAACATTGCCTTCATACATCCAGACACTCCCATCATCAAAGCCTTGAACATATTTGTGGAAAGACGAATATCAGCTCTGCCTGTTGTGGATGAGTCAGGAAAAGTTGTAGATATTTATTCCAAATTTGATGTAATTAATCTTGCTGCTGAGAAAACATACAATAACCTAGATATCACGGTGACCCAGGCCCTTCAGCACCGTTCACAGTATTTTGAAGGTGTTGTGAAGTGCAATAAGCTGGAAATACTGGAGACCATCGTGGACAGAATAGTAAGAGCTGAGGTCCATCGGCTGGTGGTGGTAAATGAAGCAGATAGTATTGTGGGTATTATTTCCCTGTCGGACATTCTGCAAGCCCTGATCCTCACACCAGCAGGTGCCAAACAAAAGGAGACAGAAACGGAGTGACCGCCGTGAATGTAGACGCCCTAGGAGGAGAACTTGAACAAAGTCTCTGGGTCACGTTTTGCCTCATGAACACTGGCTGCAAGTGGTTAAGAATGTATATCAGGGTTTAACAATAGGTATTTCTTCCAGTGATGTTGAAATTAAGCTTAAAAAAGAAAGATTTTATGTGCTTGAAGATTCAGGCTTGCATTAAAAGACTGTTTTCAGACCTTTGTCTGAAGGATTTTAAATGCTGTATGTCATTAAAGTGCACTGTGTCCTGAAGTTTTCATTATTTTTCATTTCAAAGAATTCACTGGTATGGAACAGGTGATGTGGCATAAGGTGAGTGCACGGTATGTTCAGATCACAGTGCCTTATGTCCGAATACAGCAATATGTCACCGCCGCAGCCGGGGCGCACGCGTGTGAAACAACACCGAGCTTGAATGTGGAAGTCTTTGAACCTTTTACCAAATCAGTTTGTTTTCTTTAGATTTGTCAAAAAGTTGTAATTTGAATATAAATAATTACTTTAAAATTGTAATGACACTTTTACACGTAAGTGTTTTGTTCTGGGCTACCGTGTCAACGAGGCTGCTTTACAACAGCTTTATTTATTTTTACTTTCATGCAATTTTTTTACACATCTTTTGGTGGAGTAAACTTCACCACATCCATGAATAAACTCTCAGTTATTTTGAAATGGCAAATTTCTCATTATTTAAGTTTGGATCTGGAAAGGACATGACTTCTGAAATAGCCGCTGCTGGGTTTTAAAAGCTGAGGTCTCTCAAAGTGTGGAGGAGACGTTGCCGTCAGGCGGGAGCCAAGTGCCGGGAAGATGTCTATTTTTTTTCTTGTGTATTGAAATGTAAAATCATGATGTTTGTTATGACTGCTGATGCGATTGTTTTTGTAAATTTTATTGTGGCATATACAGTATTGTCATACAGTTGAAGAGAAACAATGTTTCCTAATGTAAGTGCTCTGAAAATGTTGACACTGTATATATATATATGAGGATAGTTTGTTTTTTTTTTGTTTTGGGTTTTTTTTTTTCAGATTGAAAAATTAAAATAGATCCTA
转录物:PRKAG2-001 ENST00000287878
蛋白质序列(SEQ ID NO.:120),融合基因的一部分加阴影。
MGSAVMDTKKKKDVSSPGGSGGKKNASQKRRSLRVHIPDLSSFAMPLLDGDLEGSGKHSSRKVDSPFGPGSPSKGFFSRGPQPRPSSPMSAPVRPKTSPGSPKTVFPFSYQESPPRSPRRMSFSGIFRSSSKESSPNSNPATSPGGIRFFSRSRKTSGLSSSPSTPTQVTKQHTFPLESYKHEPERLENRIYASSSPPDTGQRFCPSSFQSPTRPPLASPTHYAPSKAAALAAALGPAEAGMLEKLEFEDEAVEDSESGVYMRFMRSHKCYDIVPTSSKLVVFDTTLQVKKAFFALVANGVRAAPLWESKKQSFVGMLTITDFINILHRYYKSPMVQIYELEEHKIETWRELYLQETFKPLVNISPDASLFDAVYSLIKNKIHRLPVIDPISGNALYILTHKRILKFLQLFMSDMPKPAFMKQNLDELGIGTYHNIAFIHPDTPIIKALNIFVERRISALPVVDESGKVVDIYSKFDVINLAAEKTYNNLDITVTQALQHRSQYFEGVVKCNKLEILETIVDRIVRAEVHRLVVVNEADSIVGIISLSDILQALILTPAGAKQKETETE
MLL3-PRKAG2融合序列外显子9到外显子5
cDNA序列(SEQ ID NO.:121),PRKAG2加下划线。
ATGTCGTCGGAGGAGGACAAGAGCGTGGAGCAGCCGCAGCCGCCGCCACCACCCCCCGAGGAGCCTGGAGCCCCGGCCCCGAGCCCCGCAGCCGCAGACAAAAGACCTCGGGGCCGGCCTCGCAAAGATGGCGCTTCCCCTTTCCAGAGAGCCAGAAAGAAACCTCGAAGTAGGGGGAAAACTGCAGTGGAAGATGAGGACAGCATGGATGGGCTGGAGACAACAGAAACAGAAACGATTGTGGAAACAGAAATCAAAGAACAATCTGCAGAAGAGGATGCTGAAGCAGAAGTGGATAACAGCAAACAGCTAATTCCAACTCTTCAGCGATCTGTGTCTGAGGAATCGGCAAACTCCCTGGTCTCTGTTGGTGTAGAAGCCAAAATCAGTGAACAGCTCTGCGCTTTTTGTTACTGTGGGGAAAAAAGTTCCTTAGGACAAGGAGACTTAAAACAATTCAGAATAACGCCTGGATTTATCTTGCCATGGAGAAACCAACCTTCTAACAAGAAGGACATTGATGACAACAGCAATGGAACCTATGAGAAAATGCAAAACTCAGCACCACGAAAACAAAGAGGACAGAGAAAAGAACGATCTCCTCAGCAGAATATAGTATCTTGTGTAAGTGTAAGCACCCAGACAGCTTCAGATGATCAAGCTGGTAAACTGTGGGATGAACTCAGTCTGGTTGGGCTTCCAGATGCCATTGATATCCAAGCCTTATTTGATTCTACAGGCACTTGTTGGGCTCATCACCGTTGTGTGGAGTGGTCACTAGGAGTATGCCAGATGGAAGAACCATTGTTAGTGAACGTGGACAAAGCTGTTGTCTCAGGGAGCACAGAACGATGTGCATTTTGTAAGCACCTTGGAGCCACTATCAAATGCTGTGAAGAGAAATGTACCCAGATGTATCATTATCCTTGTGCTGCAGGAGCCGGCACCTTTCAGGATTTCAGTCACATCTTCCTGCTTTGTCCAGAACACATTGACCAAGCTCCTGAAAGATCGAAGGAAGATGCAAACTGTGCAGTGTGCGACAGCCCGGGAGACCTCTTAGATCAGTTCTTTTGTACTACTTGTGGTCAGCACTATCATGGAATGTGCCTGGATATAGCGGTTACTCCATTAAAACGTGCAGGTTGGCAATGTCCTGAGTGCAAAGTGTGCCAGAACTGCAAACAATCGGGAGAAGATAGCAAGATGCTAGTGTGTGATACGTGTGACAAAGGGTATCATACTTTTTGTCTTCAACCAGTTATGAAATCAGTACCAACCAATGGCTGGAAATGCAAAGCGGCGCTGGCGGCGGCCCTGGGACCCGCGGAAGCCGGCATGCTGGAGAAGCTGGAGTTCGAGGACGAAGCAGTAGAAGACTCAGAAAGTGGTGTTTACATGCGATTCATGAGGTCACACAAGTGTTATGACATCGTTCCAACCAGTTCAAAGCTTGTTGTCTTTGATACTACATTACAAGTTAAAAAGGCCTTCTTTGCTTTGGTAGCCAACGGTGTCCGAGCAGCGCCACTGTGGGAGAGTAAAAAACAAAGTTTTGTAGGAATGCTAACAATTACAGATTTCATAAATATACTACATAGATACTATAAATCACCTATGGTACAGATTTATGAATTAGAGGAACATAAAATTGAAACATGGAGGGAGCTTTATTTACAAGAAACATTTAAGCCTTTAGTGAATATATCTCCAGATGCAAGCCTCTTCGATGCTGTATACTCCTTGATCAAAAATAAAATCCACAGATTGCCCGTTATTGACCCTATCAGTGGGAATGCACTTTATATACTTACCCACAAAAGAATCCTCAAGTTCCTCCAGCTTTTTATGTCTGATATGCCAAAGCCTGCCTTCATGAAGCAGAACCTGGATGAGCTTGGAATAGGAACGTACCACAACATTGCCTTCATACATCCAGACACTCCCATCATCAAAGCCTTGAACATATTTGTGGAAAGACGAATATCAGCTCTGCCTGTTGTGGATGAGTCAGGAAAAGTTGTAGATATTTATTCCAAATTTGATGTAATTAATCTTGCTGCTGAGAAAACATACAATAACCTAGATATCACGGTGACCCAGGCCCTTCAGCACCGTTCACAGTATTTTGAAGGTGTTGTGAAGTGCAATAAGCTGGAAATACTGGAGACCATCGTGGACAGAATAGTAAGAGCTGAGGTCCATCGGCTGGTGGTGGTAAATGAAGCAGATAGTATTGTGGGTATTATTTCCCTGTCGGACATTCTGCAAGCCCTGATCCTCACACCAGCAGGTGCCAAACAAAAGGAGACAGAAACGGAGTGA
蛋白质序列外显子9到外显子5(SEQ ID NO.:122),PRKAG2加下划线。
MSSEEDKSVEQPQPPPPPPEEPGAPAPSPAAADKRPRGRPRKDGASPFQRARKKPRSRGKTAVEDEDSMDGLETTETETIVETEIKEQSAEEDAEAEVDNSKQLIPTLQRSVSEESANSLVSVGVEAKISEQLCAFCYCGEKSSLGQGDLKQFRITPGFILPWRNQPSNKKDIDDNSNGTYEKMQNSAPRKQRGQRKERSPQQNIVSCVSVSTQTASDDQAGKLWDELSLVGLPDAIDIQALFDSTGTCWAHHRCVEWSLGVCQMEEPLLVNVDKAVVSGSTERCAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFSHIFLLCPEHIDQAPERSKEDANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQPVMKSVPTNGWKCKAALAAALGPAEAGMLEKLEFEDEAVEDSESGVYMRFMRSHKCYDIVPTSSKLVVFDTTLQVKKAFFALVANGVRAAPLWESKKQSFVGMLTITDFINILHRYYKSPMVQIYELEEHKIETWRELYLQETFKPLVNISPDASLFDAVYSLIKNKIHRLPVIDPISGNALYILTHKRILKFLQLFMSDMPKPAFMKQNLDELGIGTYHNIAFIHPDTPIIKALNIFVERRISALPVVDESGKVVDIYSKFDVINLAAEKTYNNLDITVTQALQHRSQYFEGVVKCNKLEILETIVDRIVRAEVHRLVVVNEADSIVGIISLSDILQALILTPAGAKQKETETE
蛋白质结构域外显子9到外显子5
由于重叠的结构域,因此存在蛋白质的4种表示。无跨膜结构域。
MLL3-PRKAG2融合序列外显子6到外显子7
cDNA序列(SEQ ID NO.:123),PRKAG2加下划线。
ATGTCGTCGGAGGAGGACAAGAGCGTGGAGCAGCCGCAGCCGCCGCCACCACCCCCCGAGGAGCCTGGAGCCCCGGCCCCGAGCCCCGCAGCCGCAGACAAAAGACCTCGGGGCCGGCCTCGCAAAGATGGCGCTTCCCCTTTCCAGAGAGCCAGAAAGAAACCTCGAAGTAGGGGGAAAACTGCAGTGGAAGATGAGGACAGCATGGATGGGCTGGAGACAACAGAAACAGAAACGATTGTGGAAACAGAAATCAAAGAACAATCTGCAGAAGAGGATGCTGAAGCAGAAGTGGATAACAGCAAACAGCTAATTCCAACTCTTCAGCGATCTGTGTCTGAGGAATCGGCAAACTCCCTGGTCTCTGTTGGTGTAGAAGCCAAAATCAGTGAACAGCTCTGCGCTTTTTGTTACTGTGGGGAAAAAAGTTCCTTAGGACAAGGAGACTTAAAACAATTCAGAATAACGCCTGGATTTATCTTGCCATGGAGAAACCAACCTTCTAACAAGAAGGACATTGATGACAACAGCAATGGAACCTATGAGAAAATGCAAAACTCAGCACCACGAAAACAAAGAGGACAGAGAAAAGAACGATCTCCTCAGCAGAATATAGTATCTTGTGTAAGTGTAAGCACCCAGACAGCTTCAGATGATCAAGCTGGTAAACTGTGGGATGAACTCAGTCTGGTTGGGCTTCCAGATGCCATTGATATCCAAGCCTTATTTGATTCTACAGGCACTTGTTGGGCTCATCACCGTTGTGTGGAGTGGTCACTAGGAGTATGCCAGATGGAAGAACCATTGTTAGTGAACGTGGACAAAGCTGTTGTCTCAGGGAGCACAGAAGTTAAAAAGGCCTTCTTTGCTTTGGTAGCCAACGGTGTCCGAGCAGCGCCACTGTGGGAGAGTAAAAAACAAAGTTTTGTAGGAATGCTAACAATTACAGATTTCATAAATATACTACATAGATACTATAAATCACCTATGGTACAGATTTATGAATTAGAGGAACATAAAATTGAAACATGGAGGGAGCTTTATTTACAAGAAACATTTAAGCCTTTAGTGAATATATCTCCAGATGCAAGCCTCTTCGATGCTGTATACTCCTTGATCAAAAATAAAATCCACAGATTGCCCGTTATTGACCCTATCAGTGGGAATGCACTTTATATACTTACCCACAAAAGAATCCTCAAGTTCCTCCAGCTTTTTATGTCTGATATGCCAAAGCCTGCCTTCATGAAGCAGAACCTGGATGAGCTTGGAATAGGAACGTACCACAACATTGCCTTCATACATCCAGACACTCCCATCATCAAAGCCTTGAACATATTTGTGGAAAGACGAATATCAGCTCTGCCTGTTGTGGATGAGTCAGGAAAAGTTGTAGATATTTATTCCAAATTTGATGTAATTAATCTTGCTGCTGAGAAAACATACAATAACCTAGATATCACGGTGACCCAGGCCCTTCAGCACCGTTCACAGTATTTTGAAGGTGTTGTGAAGTGCAATAAGCTGGAAATACTGGAGACCATCGTGGACAGAATAGTAAGAGCTGAGGTCCATCGGCTGGTGGTGGTAAATGAAGCAGATAGTATTGTGGGTATTATTTCCCTGTCGGACATTCTGCAAGCCCTGATCCTCACACCAGCAGGTGCCAAACAAAAGGAGACAGAAACGGAGTGA
蛋白质序列外显子6到外显子7(SEQ ID NO.:124)
M S S E E D K S V E Q P Q P P P P P P E E P G A P A P S P A A A D K R P R G R P R K D G A S P F Q R A R K K P R S R G K T A V E D E D S Met D G L E T T E T E T I V E T E I K E Q S A E E D A E A E V D N S K Q L I P T L Q R S V S E E S A N S L V S V G V E A K I S E Q L C A F C Y C G E K S S L G Q G D L K Q F R I T P G F I L P W R N Q P S N K K D I D D N S N G T Y E K M Q N S A P R K Q R G Q R K E R S P Q Q N I V S C V S V S T Q T A S D D Q A G K L W D E L S L V G L P D A I D I Q A L F D S T G T C W A H H R C V E W S L G V C Q M E E P L L V N V D K A V V S G S T E V K K A F F A L V A N G V R A A P L W E S K K Q S F V G M L T I T D F I N I L H R Y Y K S P M V Q I Y E L E E H K I E T W R E L Y L Q E T F K P L V N I S P D A S L F D A V Y S L I K N K I H R L P V I D P I S G N A L Y I L T H K R I L K F L Q L F M S D M P K P A F M K Q N L D EL G I G T Y H N I A F I H P D T P I I K A L N I F V E R R I S A L P V V D E S G K V V D I Y S K F D V I N L A A E K T Y N N L D I T V T Q A L Q H R S Q Y F E G V V K C N K L E I L E T I V D R I V R A E V H R L V V V N E A D S I V G I I S L S D I L Q A L I L T PA G A K Q K E T E T E终止密码子
蛋白质结构域外显子6到外显子7
具有566个残基的查询序列内无跨膜结构域。
MLL3-PRKAG2融合序列外显子23到外显子6
cDNA序列(SEQ ID NO.:125),PRKAG2加下划线。
ATGTCGTCGGAGGAGGACAAGAGCGTGGAGCAGCCGCAGCCGCCGCCACCACCCCCCGAGGAGCCTGGAGCCCCGGCCCCGAGCCCCGCAGCCGCAGACAAAAGACCTCGGGGCCGGCCTCGCAAAGATGGCGCTTCCCCTTTCCAGAGAGCCAGAAAGAAACCTCGAAGTAGGGGGAAAACTGCAGTGGAAGATGAGGACAGCATGGATGGGCTGGAGACAACAGAAACAGAAACGATTGTGGAAACAGAAATCAAAGAACAATCTGCAGAAGAGGATGCTGAAGCAGAAGTGGATAACAGCAAACAGCTAATTCCAACTCTTCAGCGATCTGTGTCTGAGGAATCGGCAAACTCCCTGGTCTCTGTTGGTGTAGAAGCCAAAATCAGTGAACAGCTCTGCGCTTTTTGTTACTGTGGGGAAAAAAGTTCCTTAGGACAAGGAGACTTAAAACAATTCAGAATAACGCCTGGATTTATCTTGCCATGGAGAAACCAACCTTCTAACAAGAAGGACATTGATGACAACAGCAATGGAACCTATGAGAAAATGCAAAACTCAGCACCACGAAAACAAAGAGGACAGAGAAAAGAACGATCTCCTCAGCAGAATATAGTATCTTGTGTAAGTGTAAGCACCCAGACAGCTTCAGATGATCAAGCTGGTAAACTGTGGGATGAACTCAGTCTGGTTGGGCTTCCAGATGCCATTGATATCCAAGCCTTATTTGATTCTACAGGCACTTGTTGGGCTCATCACCGTTGTGTGGAGTGGTCACTAGGAGTATGCCAGATGGAAGAACCATTGTTAGTGAACGTGGACAAAGCTGTTGTCTCAGGGAGCACAGAACGATGTGCATTTTGTAAGCACCTTGGAGCCACTATCAAATGCTGTGAAGAGAAATGTACCCAGATGTATCATTATCCTTGTGCTGCAGGAGCCGGCACCTTTCAGGATTTCAGTCACATCTTCCTGCTTTGTCCAGAACACATTGACCAAGCTCCTGAAAGATCGAAGGAAGATGCAAACTGTGCAGTGTGCGACAGCCCGGGAGACCTCTTAGATCAGTTCTTTTGTACTACTTGTGGTCAGCACTATCATGGAATGTGCCTGGATATAGCGGTTACTCCATTAAAACGTGCAGGTTGGCAATGTCCTGAGTGCAAAGTGTGCCAGAACTGCAAACAATCGGGAGAAGATAGCAAGATGCTAGTGTGTGATACGTGTGACAAAGGGTATCATACTTTTTGTCTTCAACCAGTTATGAAATCAGTACCAACCAATGGCTGGAAATGCAAAAATTGCAGAATATGTATAGAGTGTGGCACACGGTCTAGTTCTCAGTGGCACCACAATTGCCTGATATGTGACAATTGTTACCAACAGCAGGATAACTTATGTCCCTTCTGTGGGAAGTGTTATCATCCAGAATTGCAGAAAGACATGCTTCATTGTAATATGTGCAAAAGGTGGGTTCACCTAGAGTGTGACAAACCAACAGATCATGAACTGGATACTCAGCTCAAAGAAGAGTATATCTGCATGTATTGTAAACACCTGGGAGCTGAGATGGATCGTTTACAGCCAGGTGAGGAAGTGGAGATAGCTGAGCTCACTACAGATTATAACAATGAAATGGAAGTTGAAGGCCCTGAAGATCAAATGGTATTCTCAGAGCAGGCAGCTAATAAAGATGTCAACGGTCAGGAGTCCACTCCTGGAATTGTTCCAGATGCGGTTCAAGTCCACACTGAAGAGCAACAGAAGAGTCATCCCTCAGAAAGTCTTGACACAGATAGTCTTCTTATTGCTGTATCATCCCAACATACAGTGAATACTGAATTGGAAAAACAGATTTCTAATGAAGTTGATAGTGAAGACCTGAAAATGTCTTCTGAAGTGAAGCATATTTGTGGCGAAGATCAAATTGAAGATAAAATGGAAGTGACAGAAAACATTGAAGTCGTTACACACCAGATCACTGTGCAGCAAGAACAACTGCAGTTGTTAGAGGAACCTGAAACAGTGGTATCCAGAGAAGAATCAAGGCCTCCAAAATTAGTCATGGAATCTGTCACTCTTCCACTAGAAACCTTAGTGTCCCCACATGAGGAAAGTATTTCATTATGTCCTGAGGAACAGTTGGTTATAGAAAGGCTACAAGGAGAAAAGGAACAGAAAGAAAATTCTGAACTTTCTACTGGATTGATGGACTCTGAAATGACTCCTACAATTGAGGGTTGTGTGAAAGATGTTTCATACCAAGGAGGCAAATCTATAAAGTTATCATCTGAGACAGAGTCATCATTTTCATCATCAGCAGACATAAGCAAGGCAGATGTGTCTTCCTCCCCAACACCTTCTTCAGACTTGCCTTCGCATGACATGCTGCATAATTACCCTTCAGCTCTTAGTTCCTCTGCTGGAAACATCATGCCAACAACTTACATCTCAGTCACTCCAAAAATTGGCATGGGTAAACCAGCTATTACTAAGAGAAAATTTTCTCCTGGTAGACCTCGGTCCAAACAGGGGGCTTGGAGTACCCATAATACAGTGAGCCCACCTTCCTGGTCCCCAGACATTTCAGAAGGTCGGGAAATTTTTAAACCCAGGCAGCTTCCTGGCAGTGCCATTTGGAGCATCAAAGTGGGCCGTGGGTCTGGATTTCCAGGAAAGCGGAGACCTCGAGGTGCAGGACTGTCGGGGCGAGGTGGCCGAGGCAGGTCAAAGCTGAAAAGTGGAATCGGAGCTGTTGTATTACCTGGGGTGTCTACTGCAGATATTTCATCAAATAAGGATGATGAAGAAAACTCTATGCACAATACAGTTGTGTTGTTTTCTAGCAGTGACAAGTTCACTTTGAATCAGGATATGTGTGTAGTTTGTGGCAGTTTTGGCCAAGGAGCAGAAGGAAGATTACTTGCCTGTTCTCAGTGTGGTCAGTGTTACCATCCATACTGTGTCAGTATTAAGATCACTAAAGTGGTTCTTAGCAAAGGTTGGAGGTGTCTTGAGTGCACTGTGTGTGAGGCCTGTGGGAAGGCAACTGACCCAGGAAGACTCCTGCTGTGTGATGACTGTGACATAAGTTATCACACCTACTGCCTAGACCCTCCATTGCAGACAGTTCCCAAAGGAGGCTGGAAGTGCAAATGGTGTGTTTGGTGCAGACACTGTGGAGCAACATCTGCAGGTCTAAGATGTGAATGGCAGAACAATTACACACAGTGCGCTCCTTGTGCAAGCTTATCTTCCTGTCCAGTCTGCTATCGAAACTATAGAGAAGAAGATCTTATTCTGCAATGTAGACAATGTGATAGATGGATGCATGCAGTTTGTCAGAACTTAAATACTGAGGAAGAAGTGGAAAATGTAGCAGACATTGGTTTTGATTGTAGCATGTGCAGACCCTATATGCCTGCGTCTAATGTGCCTTCCTCAGACTGCTGTGAATCTTCACTTGTAGCACAAATTGTCACAAAAGTAAAAGAGCTAGACCCACCCAAGACTTATACCCAGGATGGTGTGTGTTTGACTGAATCAGGGATGACTCAGTTACAGAGCCTCACAGTTACAGTTCCAAGAAGAAAACGGTCAAAACCAAAATTGAAATTGAAGATTATAAATCAGAATAGCGTGGCCGTCCTTCAGACCCCTCCAGACATCCAATCAGAGCATTCAAGGGATGGTGAAATGGATGATAGTCGAGCAGTAGAAGACTCAGAAAGTGGTGTTTACATGCGATTCATGAGGTCACACAAGTGTTATGACATCGTTCCAACCAGTTCAAAGCTTGTTGTCTTTGATACTACATTACAAGTTAAAAAGGCCTTCTTTGCTTTGGTAGCCAACGGTGTCCGAGCAGCGCCACTGTGGGAGAGTAAAAAACAAAGTTTTGTAGGAATGCTAACAATTACAGATTTCATAAATATACTACATAGATACTATAAATCACCTATGGTACAGATTTATGAATTAGAGGAACATAAAATTGAAACATGGAGGGAGCTTTATTTACAAGAAACATTTAAGCCTTTAGTGAATATATCTCCAGATGCAAGCCTCTTCGATGCTGTATACTCCTTGATCAAAAATAAAATCCACAGATTGCCCGTTATTGACCCTATCAGTGGGAATGCACTTTATATACTTACCCACAAAAGAATCCTCAAGTTCCTCCAGCTTTTTATGTCTGATATGCCAAAGCCTGCCTTCATGAAGCAGAACCTGGATGAGCTTGGAATAGGAACGTACCACAACATTGCCTTCATACATCCAGACACTCCCATCATCAAAGCCTTGAACATATTTGTGGAAAGACGAATATCAGCTCTGCCTGTTGTGGATGAGTCAGGAAAAGTTGTAGATATTTATTCCAAATTTGATGTAATTAATCTTGCTGCTGAGAAAACATACAATAACCTAGATATCACGGTGACCCAGGCCCTTCAGCACCGTTCACAGTATTTTGAAGGTGTTGTGAAGTGCAATAAGCTGGAAATACTGGAGACCATCGTGGACAGAATAGTAAGAGCTGAGGTCCATCGGCTGGTGGTGGTAAATGAAGCAGATAGTATTGTGGGTATTATTTCCCTGTCGGACATTCTGCAAGCCCTGATCCTCACACCAGCAGGTGCCAAACAAAAGGAGACAGAAACGGAGTGA
蛋白质序列外显子23到外显子6(SEQ ID NO.:126)
M S S E E D K S V E Q P Q P P P P P P E E P G A P A P S P A A A D K R P R G R P R K D G A S P F Q R A R K K P R S R G K T A V E D E D S M D G L E T T E T E T I V E T E I K E Q S A E E D A E A E V D N S K Q L I P T L Q R S V S E E S A N S L V S V G V E A K I S E Q L C A F C Y C G E K S S L G Q G D L K Q F R I T P G F I L P W R N Q P S N K K D I D D N S N G T Y E K M Q N S A P R K Q R G Q R K E R S P Q Q N I V S C V S V S T Q T A S D D Q A G K L W D E L S L V G L P D A I D I Q A L F D S T G T C W A H H R C V E W S L G V C Q M E E P L L V N V D K A V V S G S T E R C A F C K H L G A T I K C C E E K C T Q M Y H Y P C A A G A G T F Q D F S H I F L L C P E H I D Q A P E R S K E D A N C A V C D S P G D L L D Q F F C T T C G Q H Y H G M C L D I A V T P L K R A G W Q C P E C K V C Q N C K Q S G E D S K M L V C D T C D K G Y H T F C L Q P V M K S V P T N G W K C K N C R I C I E C G T R S S S Q W H H N C L I C D N C Y Q Q Q D N L C P F C G K C Y H P E L Q K D M L H C N M C K R W V H L E C D K P T D H E L D T Q L K E E Y I C M Y C K H L G A E M D R L Q P G E E V E I A E L T T D Y N N E M E V E G P E D Q Met V F S E Q A A N K D V N G Q E S T P G I V P D A V Q V H T E E Q Q K S H P S E S L D T D S L L I A V S S Q H T V N T E L E K Q I S N E V D S E D L K M S S E V K H I C G E D Q I E D K M E V T E N I E V V T H Q I T V Q Q E Q L Q L L E E P E T V V S R E E S R P P K L V M E S V T L P L E T L V S P H E E S I S L C P E E Q L V I E R L Q G E K E Q K E NS E L S T G L M D S E M T P T I E G C V K D V S Y Q G G K S I K L S S E T E S S F S S S A D I S K A D V S S S P T P S S D L P S H D M L H N Y P S A L S S S A G N I M P T T Y I S V T P K I G M G K P A I T K R K F S P G R P R S K Q G A W S T H N T V S P P S W S P D I S E G R E I F K P R Q L P G S A I W S I K V G R G S G F P G K R R P R G A G L S G R G G R G R S K L K S G I G A V V L P G V S T A D I S S N K D D E E N S M H N T V V L F S S S D K F T L N Q D M C V V C G S F G Q G A E G R L L A C S Q C G Q C Y H P Y C V S I K I T K V V L S K G W R C L E C T V C E A C G K A T D P G R L L L C D D C D I S Y H T Y C L D P P L Q T V P K G G W K C K W C V W C R H C G A T S A G L R C E W Q N N Y T Q C A P C A S L S S C P V C Y R N Y R E E D L I L Q C R Q C D R W M H A V C Q N L N T E E E V E N V A D I G F D C S M C R P Y M P A S N V P S S D C C E S S L V A Q I V T K V K E L D P P K T Y T Q D G V C L T E S G M T Q L Q S L T V T V P R R K R S K P K L K L K I I N Q N S V A V L Q T P P D I Q S E H S R D G E M D D S R A V E D S E S G V Y M R F M R S H K C Y D I V P T S S K L V V F D T T L Q V K K A F F A L V A N G V R A A P L W E S K K Q S F V G M L T I T D F I N I L H R Y Y K S P M V Q I Y E L E E H K I E T W R E L Y L Q E T F K P L V N I S P D A S L F D A V Y S L I K N K I H R L P V I D P I S G N A L Y I L T H K R I L K F L Q L F M S D M P K P A F M K Q N L D E L G I G T Y H N I A F I H P D T P I I K A L N I F V E R R I S A L P V V D E S G K V V D I Y S K F D V I N L A A E K T Y N N L D I T V T Q A L Q H R S Q Y F E G V V K C N K L E I L E T I V D R I V R A E V H R L V V V N E A D S I V G I I S L S D I L Q A L I L T P A G A K Q K E T E T E终止密码子
蛋白质结构域外显子23到外显子6
由于重叠的结构域,因此存在蛋白质的40种表示。无跨膜结构域。
融合基因#5:DUS2L-PSKH1
确认的基因组断裂点:DUS2L-chr16:67930935、PSKH1-chr16:68103638
转录物:DUS2L-001 ENST00000565263
cDNA序列(SEQ ID NO.:127),融合基因的一部分加阴影。
TGAGGCGCGCCGGCTGGTTCAACTCCGGCCGCCGCGCCGAAACCAGCAGCGGTCCGGGTCGAACCAGCACCGGCCTCGGGAGGTTCCGCCGCCTGCTCTGCCGCTGTTCCAACTGCCGCTGTAGAGCCACTGGGATGCGCACCACCGGCAGGGGTTCGTCGGGACTGCGGACCGTGAGGCCCCGTCGCGGCGCCAGGAGCAACCGAGTCACGAGGGAAAAGAGCCGCACCGGCCGCGTTAGAGCCATGTTTCCCTTAGTGCGGGAGAAGCGCACATCAGTGACGTCACGGACGCGCCGCGACCTCGCGTACGGTGGCTGGCGAGGCTCAGTACGGTGTGTGGAGCTGGAGCACCGTGAGGAAGAAGCGAGGTTCTTTTTAAGAGTTCAGCTGCGAGATATCAAACAAAGAATTACTCTGTACAAAGCCAGAACACATATATCAAAGTAATCCTGAAGTATCAGAACAAAATAATAGGCTGTAACAGAGGAGGAAATGATTTTGAATAGCCTCTCTCTGTGTTACCATAATAAGCTAATCCTGGCCCCAATGGTTCGGGTAGGGACTCTTCCAATGAGGCTGCTGGCCCTGGATTATGGAGCGGACATTGTTTACTGTGAGGAGCTGATCGACCTCAAGATGATTCAGTGCAAGAGAGTTGTTAATGAGGTGCTCAGCACAGTGGACTTTGTCGCCCCTGATGATCGAGTTGTCTTCCGCACCTGTGAAAGAGAGCAGAACAGGGTGGTCTTCCAGATGGGGACTTCAGACGCAGAGCGAGCCCTTGCTGTGGCCAGGCTTGTAGAAAATGATGTGGCTGGTATTGATGTCAACATGGGCTGTCCAAAACAATATTCCACCAAGGGAGGAATGGGAGCTGCCCTGCTGTCAGACCCTGACAAGATTGAGAAGATCCTCAGCACTCTTGTTAAAGGGACACGCAGACCTGTGACCTGCAAGATTCGCATCCTGCCATCGCTAGAAGATACCCTGAGCCTTGTGAAGCGGATAGAGAGGACTGGCATTGCTGCCATCGCAGTTCATGGGAGGAAGCGGGAGGAGCGACCTCAGCATCCTGTCAGCTGTGAAGTCATCAAAGCCATTGCTGATACCCTCTCCATTCCTGTCATAGCCAACGGAGGATCTCATGACCACATCCAACAGTATTCGGACATAGAGGACTTTCGACAAGCCACGGCAGCCTCTTCCGTGATGGTGGCCCGAGCAGCCATGTGGAACCCATCTATCTTCCTCAAGGAGGGTCTGCGGCCCCTGGAGGAGGTCATGCAGAAATACATCAGATACGCGGTGCAGTATGACAACCACTACACCAACACCAAGTACTGCTTGTGCCAGATGCTACGAGAACAGCTGGAGTCGCCCCAGGGAAGGTTGCTCCATGCTGCCCAGTCTTCCCGGGAAATTTGTGAGGCCTTTGGCCTTGGTGCCTTCTATGAGGAGACCACACAGGAGCTGGATGCCCAGCAGGCCAGGCTCTCAGCCAAGACTTCAGAGCAGACAGGGGAGCCAGCTGAAGATACCTCTGGTGTCATTAAGATGGCTGTCAAGTTTGACCGGAGAGCATACCCAGCCCAGATCACCCCTAAGATGTGCCTACTAGAGTGGTGCCGGAGGGAGAAGTTGGCACAGCCTGTGTATGAAACGGTTCAACGCCCTCTAGATCGCCTGTTCTCCTCTATTGTCACCGTTGCTGAACAAAAGTATCAGTCTACCTTGTGGGACAAGTCCAAGAAACTGGCGGAGCAGGCTGCAGCCATCGTCTGTCTGCGGAGCCAGGGCCTCCCTGAGGGTCGGCTGGGTGAGGAGAGCCCTTCCTTGCACAAGCGAAAGAGGGAGGCTCCTGACCAAGACCCTGGGGGCCCCAGAGCTCAGGAGCTAGCACAACCTGGGGATCTGTGCAAGAAGCCCTTTGTGGCCTTGGGAAGTGGTGAAGAAAGCCCCCTGGAAGGCTGGTGACTACTCTTCCTGCCTTAGTCACCCCTCCATGGGCCTGGTGCTAAGGTGGCTGTGGATGCCACAGCATGAACCAGATGCCGTTGAACAGTTTGCTGGTCTTGCCTGGCAGAAGTTAGATGTCCTGGCAGGGGCCATCAGCCTAGAGCATGGACCAGGGGCCGCCCAGGGGTGGATCCTGGCCCCTTTGGTGGATCTGAGTGACAGGGTCAAGTTCTCTTTGAAAACAGGAGCTTTTCAGGTGGTAACTCCCCAACCTGACATTGGTACTGTGCAATAAAGACACCCCCTACCCTCACCCACGGCTGGCTGCTTCAGCCTTGGGCATCTTCATAAA
转录物:DUS2L-001 ENST00000565263
cDNA序列
TGAGGCGCGCCGGCTGGTTCAACTCCGGCCGCCGCGCCGAAACCAGCAGCGGTCCGGGTC............................................................GAACCAGCACCGGCCTCGGGAGGTTCCGCCGCCTGCTCTGCCGCTGTTCCAACTGCCGCT............................................................GTAGAGCCACTGGGATGCGCACCACCGGCAGGGGTTCGTCGGGACTGCGGACCGTGAGGC............................................................CCCGTCGCGGCGCCAGGAGCAACCGAGTCACGAGGGAAAAGAGCCGCACCGGCCGCGTTA............................................................GAGCCATGTTTCCCTTAGTGCGGGAGAAGCGCACATCAGTGACGTCACGGACGCGCCGCG............................................................ACCTCGCGTACGGTGGCTGGCGAGGCTCAGTACGGTGTGTGGAGCTGGAGCACCGTGAGG............................................................AAGAAGCGAGGTTCTTTTTAAGAGTTCAGCTGCGAGATATCAAACAAAGAATTACTCTGT............................................................ACAAAGCCAGAACACATATATCAAAGTAATCCTGAAGTATCAGAACAAAATAATAGGCTG............................................................TAACAGAGGAGGAAATGATTTTGAATAGCCTCTCTCTGTGTTACCATAATAAGCTAATCC..............-M--I--L--N--S--L--S--L--C--Y--H--N--K--L--I--TGGCCCCAATGGTTCGGGTAGGGACTCTTCCAATGAGGCTGCTGGCCCTGGATTATGGAGL--A--P--M--V--R--V--G--T--L--P--M--R--L--L--A--L--D--Y--G--CGGACATTGTTTACTGTGAGGAGCTGATCGACCTCAAGATGATTCAGTGCAAGAGAGTTGA--D--I--V--Y--C--E--E--L--I--D--L--K--M--I--Q--C--K--R--V--TTAATGAGGTGCTCAGCACAGTGGACTTTGTCGCCCCTGATGATCGAGTTGTCTTCCGCAV--N--E--V--L--S--T--V--D--F--V--A--P--D--D--R--V--V--F--R--CCTGTGAAAGAGAGCAGAACAGGGTGGTCTTCCAGATGGGGACTTCAGACGCAGAGCGAGT--C--E--R--E--Q--N--R--V--V--F--Q--M--G--T--S--D--A--E--R--CCCTTGCTGTGGCCAGGCTTGTAGAAAATGATGTGGCTGGTATTGATGTCAACATGGGCTA--L--A--V--A--R--L--V--E--N--D--V--A--G--I--D--V--N--M--G--GTCCAAAACAATATTCCACCAAGGGAGGAATGGGAGCTGCCCTGCTGTCAGACCCTGACAC--P--K--Q--Y--S--T--K--G--G--M--G--A--A--L--L--S--D--P--D--AGATTGAGAAGATCCTCAGCACTCTTGTTAAAGGGACACGCAGACCTGTGACCTGCAAGAK--I--E--K--I--L--S--T--L--V--K--G--T--R--R--P--V--T--C--K--TTCGCATCCTGCCATCGCTAGAAGATACCCTGAGCCTTGTGAAGCGGATAGAGAGGACTGI--R--I--L--P--S--L--E--D--T--L--S--L--V--K--R--I--E--R--T--DUS2L断裂点GCATTGCTGCCATCGCAGTTCATGGGAGGAAGCGGGAGGAGCGACCTCAGCATCCTGTCAG--I--A--A--I--A--V--H--G--R--K--R--E--E--R--P--Q--H--P--V--GCTGTGAAGTCATCAAAGCCATTGCTGATACCCTCTCCATTCCTGTCATAGCCAACGGAGS--C--E--V--I--K--A--I--A--D--T--L--S--I--P--V--I--A--N--G--GATCTCATGACCACATCCAACAGTATTCGGACATAGAGGACTTTCGACAAGCCACGGCAGG--S--H--D--H--I--Q--Q--Y--S--D--I--E--D--F--R--Q--A--T--A--CCTCTTCCGTGATGGTGGCCCGAGCAGCCATGTGGAACCCATCTATCTTCCTCAAGGAGGA--S--S--V--M--V--A--R--A--A--M--W--N--P--S--I--F--L--K--E--GTCTGCGGCCCCTGGAGGAGGTCATGCAGAAATACATCAGATACGCGGTGCAGTATGACAG--L--R--P--L--E--E--V--M--Q--K--Y--I--R--Y--A--V--Q--Y--D--ACCACTACACCAACACCAAGTACTGCTTGTGCCAGATGCTACGAGAACAGCTGGAGTCGCN--H--Y--T--N--T--K--Y--C--L--C--Q--M--L--R--E--Q--L--E--S--CCCAGGGAAGGTTGCTCCATGCTGCCCAGTCTTCCCGGGAAATTTGTGAGGCCTTTGGCCP--Q--G--R--L--L--H--A--A--Q--S--S--R--E--I--C--E--A--F--G--TTGGTGCCTTCTATGAGGAGACCACACAGGAGCTGGATGCCCAGCAGGCCAGGCTCTCAGL--G--A--F--Y--E--E--T--T--Q--E--L--D--A--Q--Q--A--R--L--S--CCAAGACTTCAGAGCAGACAGGGGAGCCAGCTGAAGATACCTCTGGTGTCATTAAGATGGA--K--T--S--E--Q--T--G--E--P--A--E--D--T--S--G--V--I--K--M--CTGTCAAGTTTGACCGGAGAGCATACCCAGCCCAGATCACCCCTAAGATGTGCCTACTAGA--V--K--F--D--R--R--A--Y--P--A--Q--I--T--P--K--M--C--L--L--AGTGGTGCCGGAGGGAGAAGTTGGCACAGCCTGTGTATGAAACGGTTCAACGCCCTCTAGE--W--C--R--R--E--K--L--A--Q--P--V--Y--E--T--V--Q--R--P--L--ATCGCCTGTTCTCCTCTATTGTCACCGTTGCTGAACAAAAGTATCAGTCTACCTTGTGGGD--R--L--F--S--S--I--V--T--V--A--E--Q--K--Y--Q--S--T--L--W--ACAAGTCCAAGAAACTGGCGGAGCAGGCTGCAGCCATCGTCTGTCTGCGGAGCCAGGGCCD--K--S--K--K--L--A--E--Q--A--A--A--I--V--C--L--R--S--Q--G--TCCCTGAGGGTCGGCTGGGTGAGGAGAGCCCTTCCTTGCACAAGCGAAAGAGGGAGGCTCL--P--E--G--R--L--G--E--E--S--P--S--L--H--K--R--K--R--E--A--CTGACCAAGACCCTGGGGGCCCCAGAGCTCAGGAGCTAGCACAACCTGGGGATCTGTGCAP--D--Q--D--P--G--G--P--R--A--Q--E--L--A--Q--P--G--D--L--C--AGAAGCCCTTTGTGGCCTTGGGAAGTGGTGAAGAAAGCCCCCTGGAAGGCTGGTGACTACK--K--P--F--V--A--L--G--S--G--E--E--S--P--L--E--G--W--*-....TCTTCCTGCCTTAGTCACCCCTCCATGGGCCTGGTGCTAAGGTGGCTGTGGATGCCACAG............................................................CATGAACCAGATGCCGTTGAACAGTTTGCTGGTCTTGCCTGGCAGAAGTTAGATGTCCTG............................................................GCAGGGGCCATCAGCCTAGAGCATGGACCAGGGGCCGCCCAGGGGTGGATCCTGGCCCCT............................................................TTGGTGGATCTGAGTGACAGGGTCAAGTTCTCTTTGAAAACAGGAGCTTTTCAGGTGGTA............................................................ACTCCCCAACCTGACATTGGTACTGTGCAATAAAGACACCCCCTACCCTCACCCACGGCT............................................................GGCTGCTTCAGCCTTGGGCATCTTCATAAA..............................
转录物:DUS2L-001 ENST00000565263
蛋白质序列(SEQ ID NO.:128),融合基因的一部分加阴影。
MILNSLSLCYHNKLILAPMVRVGTLPMRLLALDYGADIVYCEELIDLKMIQCKRVVNEVLSTVDFVAPDDRVVFRTCEREQNRVVFQMGTSDAERALAVARLVENDVAGIDVNMGCPKQYSTKGGMGAALLSDPDKIEKILSTLVKGTRRPVTCKIRILPSLEDTLSLVKRIERTGIAAIAVHGRKREERPQHPVSCEVIKAIADTLSIPVIANGGSHDHIQQYSDIEDFRQATAASSVMVARAAMWNPSIFLKEGLRPLEEVMQKYIRYAVQYDNHYTNTKYCLCQMLREQLESPQGRLLHAAQSSREICEAFGLGAFYEETTQELDAQQARLSAKTSEQTGEPAEDTSGVIKMAVKFDRRAYPAQITPKMCLLEWCRREKLAQPVYETVQRPLDRLFSSIVTVAEQKYQSTLWDKSKKLAEQAAAIVCLRSQGLPEGRLGEESPSLHKRKREAPDQDPGGPRAQELAQPGDLCKKPFVALGSGEESPLEGW
转录物:PSKH1-001 ENST00000291041
cDNA序列(SEQ ID NO.:129),融合基因的一部分加阴影。
GAGAATGGCGGCGGCGGCGGCGGCGGCGGCGGCCGCTGCCATTGCCCGGAGATGGCCGGCPSKH1断裂点AGAGCCGCCGAGACGCCGAAGAGCCCGCCGCCCGCGCGAGGTGTAGACGGGGCACTGCCTTCAGAGCAGGTCCTGCCAGCCTCGCTGGAGAGGATGCCCTCGTGTCCGTGATGGGCTGTGGGACAAGCAAGGTCCTTCCCGAGCCACCCAAGGATGTCCAGCTGGATCTGGTCAAGAAGGTGGAGCCCTTCAGTGGCACTAAGAGTGACGTGTACAAGCACTTCATCACAGAGGTGGACAGTGTTGGCCCTGTCAAAGCCGGGTTCCCAGCAGCAAGTCAGTATGCACACCCCTGCCCCGGTCCCCCGACTGCTGGCCACACGGAGCCTCCCTCAGAACCACCACGCAGGGCCAGGGTAGCTAAGTACAGGGCCAAGTTTGACCCACGTGTTACAGCTAAGTATGACATCAAGGCCCTAATTGGCCGAGGCAGCTTCAGCCGAGTGGTACGTGTAGAGCACCGGGCAACCCGGCAGCCGTATGCCATCAAGATGATTGAGACCAAGTACCGGGAGGGGCGGGAGGTGTGTGAGTCGGAGCTGCGTGTGCTGCGTCGGGTGCGTCATGCCAACATCATCCAGCTGGTGGAGGTGTTCGAGACACAGGAGCGGGTGTACATGGTGATGGAGCTGGCCACTGGTGGAGAGCTCTTTGACCGCATCATTGCCAAGGGCTCCTTCACCGAGCGTGACGCCACGCGGGTGCTGCAGATGGTGCTGGATGGCGTCCGGTATCTGCATGCACTGGGCATCACACACCGAGACCTCAAACCTGAGAATCTGCTCTACTACCATCCGGGCACTGACTCCAAGATCATCATCACCGACTTCGGCCTGGCCAGTGCTCGCAAGAAGGGTGATGACTGCTTGATGAAGACCACCTGTGGCACGCCTGAGTACATTGCCCCAGAAGTCCTGGTCCGCAAGCCATACACCAACTCAGTGGACATGTGGGCGCTGGGCGTCATTGCCTACATCCTACTCAGTGGCACCATGCCGTTTGAGGATGACAACCGTACCCGGCTGTACCGGCAGATCCTCAGGGGCAAGTACAGTTACTCTGGGGAGCCCTGGCCTAGTGTGTCCAACCTGGCCAAGGACTTCATTGACCGCCTGCTGACAGTGGACCCTGGAGCCCGTATGACTGCACTGCAGGCCCTGAGGCACCCGTGGGTGGTGAGCATGGCTGCCTCTTCATCCATGAAGAACCTGCACCGCTCCATATCCCAGAACCTCCTTAAACGTGCCTCCTCGCGCTGCCAGAGCACCAAATCTGCCCAGTCCACGCGTTCCAGCCGCTCCACACGCTCCAATAAGTCACGCCGTGTGCGGGAACGGGAGCTGCGGGAGCTCAACCTGCGCTACCAGCAGCAATACAATGGCTGAGCCGCCTGGCTGTGCACACATGCAGCACGACCCAGCCTGGCCACACACTGTGGTGCCATCTGGGTCCGATGCCCTCTCTGGAGATAGGCCTATGTGGCCCACAGTAGGTGAAGAATGTCTGGCTCCAGCCCTTTCTCTGTGCCTTCAGCAGCCCCTGTCCTCACCATGGGCCTGGGCCAGGTGTGACAGAGTAGAGGTAGCACAGGGGGCTGTGACTCCCCCTGAACTGGGAGCCTGGCCTGGCACTGATACCCCTCTTGGTGGGCAGCTGCTCTGGTGGAGTTGGGAAGGGATAGGACCTGGCCTTCACTGTCTCCCTTGCCCTTTGACTTTTCCCCAATCAAAGGGAACTGCAGTGCTGGGTGGAGTGTCCTGTGGCCTCAGGACCCTTTGGGACAGTTACTTCTGGGACCCCCTTTCCTCCACAGAGCCCTTCTCCCTGGTTTCACACATTCCCATGCATCCTGATCCTTAAGATTATGCTCCAGTGGGAGACCCTGGTAGGCACAAAGCTTGTGCCTTGACTGGACCCGTAGCCCCTGGCTAGGTCGAAACAGCCCTCCACCTCCCAGCCAAGATCTGTCTTCCTTCATGGTGCCTCCAGGGAGCCTTCCTGGTCCCAGGACCTCTGGTGGAGGGCCATGGCGTGGACCTTCACCCTTCTGGACTGTGTGGCCATGCTGGTCATCGGCTTGCCCAGGCTCCAGCCTCTCCAGATTCTGAGGGGTCTCAGCCCACCGCCCTTGGTGCCTTCTTTGTAGAGCCCACCGCTACCTCCCTCTCCCCGTTGGATGTCCATTCCATTCCCCAGGTGCCTCCTTCCCAACTGGGGGTGGTTAAAGGGAGCCCCACTGCTGCTACCTGGGGAATGGGGCACCTGGGGGCCAAGGCAGAGGGAAGGGGGTCCTCCCGATTAGGGTCGAGTGTCAGCCTGGGTTCTATCCTTTGGTGCAGCCCCATTGCCTTTTCCCTTCAGGCTCTGTTGCTCCCTCCTCTGCAGCTGCACGAAGGCGCCATCTGGTGTCTGCATGGGTGTTGGCAGCCTGGGAGTGATCACTGCACGCCCATCGTGCACACCTGCCCATCGTGCACACCCACCCATGGTGCACACCTGTAGTCCTCCATGAGGACATGGGAAGGTAGGAGTTGCCGCCCTGGGGGAGGGTCCCGGGCTGCTCACCTCTCCCCTTCTGCTGAGCTTCTGCGCACCCCTCCCTGGAACTTAGCCATACTGTGTGACCTGCCTCTGAAACCAGGGTGCCAGGGGCACTGCCTTCTCACAGCTGGCCTTGCCCCGTCCACCCTGTGCTGCTTCCCTTCACAGCATTAACCTTCCAGTCTGGGTCCCACTGAGCCTCAAGCTGGAAGGAGCCCCTGCGGGAGGTGGGTGGGGTTGGGTGGCTGCTTTCCCAGAGGCCTGAGCCAGAACCATCCCCATTTCTTTTGTGGTATCTCCCCCTACCACAAACCAGGCTGGAACCCAAGCCCCTTCCTCCACAGCTGCCTTCAGTGGGTAGAATGGGGCCAGGGCCCAGCTTTGGCCTTAGCTTGACGGCAGGGCCCCTGCCATTGCAGGAGGGTTTGGTTCCCACTCAGCTTCTGCCGGTCGGCAGCCTGGGCCAGGCCCTTTTCCTGCATGTGCCACCTCCAGTGGGAAACAAAACTAAAGAGACCACTCTGTGCCAAGTCGACTATGCCTTAGACACATCCTCCTACCGTCCCCAATGCCCCCTGGGCAGGAGGCAGTGGAGAACCAAGCCCCATGGCCTCAGAATTTCCCCCCAGTTCCCCAAGTGTCTCTGGGGACCTGAAGCCCTGGGGCTTACGTTCTCTCTTGCCCAGGGTGGGCCTGGTCCTGAGGGCAGGACAGGGGGTTTGGAGATGTGGGCCTTTGATAGACCCACTTGGGCCTTCATGCCATGGCCTGTGGATGGAGAATGTGCAGTTATTTATTATGCGTATTCAGTTTGTAAACGTATCCTCTGTATTCAGTAAACAGGCTGCCTCTCCAGGGAGGGCTGCCATTCATTCCAACAGTTCTGGCTTCTTGCTGTAGGACCAAGGGGTTGCCCTGGAGGAGGGGTGGGGGCCCCGGCCTCGGCATGGCTACTCTAGGAAGAGCCACTGCTACTCAAGGAGTCACTCAGCCCCTTCTGTGCCAGAAGTCCAAGTAGGGAGTCGGACCCTCAACAGCCTCTTCTTTCTCCTGAGCCAGGAAGACAGACATGAATGCATGATGGGACAGGGCCTGGGTCTTTAATGGGTTGAGCTGGGGAGGGCCTGTGGTGAGCTCAGTTGTAGGCTATGACCTGGTT
转录物:PSKH1-001 ENST00000291041
cDNA序列
GAGAATGGCGGCGGCGGCGGCGGCGGCGGCGGCCGCTGCCATTGCCCGGAGATGGCCGGC............................................................
PSK1断裂点AGAGCCGCCGAGACGCCGAAGAGCCCGCCGCCCGCGCGAGGTGTAGACGGGGCACTGCCT............................................................TCAGAGCAGGTCCTGCCAGCCTCGCTGGAGAGGATGCCCTCGTGTCCGTGATGGGCTGTG..................................................-M--G--C--GGACAAGCAAGGTCCTTCCCGAGCCACCCAAGGATGTCCAGCTGGATCTGGTCAAGAAGGG--T--S--K--V--L--P--E--P--P--K--D--V--Q--L--D--L--V--K--K--TGGAGCCCTTCAGTGGCACTAAGAGTGACGTGTACAAGCACTTCATCACAGAGGTGGACAV--E--P--F--S--G--T--K--S--D--V--Y--K--H--F--I--T--E--V--D--GTGTTGGCCCTGTCAAAGCCGGGTTCCCAGCAGCAAGTCAGTATGCACACCCCTGCCCCGS--V--G--P--V--K--A--G--F--P--A--A--S--Q--Y--A--H--P--C--P--GTCCCCCGACTGCTGGCCACACGGAGCCTCCCTCAGAACCACCACGCAGGGCCAGGGTAGG--P--P--T--A--G--H--T--E--P--P--S--E--P--P--R--R--A--R--V--CTAAGTACAGGGCCAAGTTTGACCCACGTGTTACAGCTAAGTATGACATCAAGGCCCTAAA--K--Y--R--A--K--F--D--P--R--V--T--A--K--Y--D--I--K--A--L--TTGGCCGAGGCAGCTTCAGCCGAGTGGTACGTGTAGAGCACCGGGCAACCCGGCAGCCGTI--G--R--G--S--F--S--R--V--V--R--V--E--H--R--A--T--R--Q--P--ATGCCATCAAGATGATTGAGACCAAGTACCGGGAGGGGCGGGAGGTGTGTGAGTCGGAGCY--A--I--K--M--I--E--T--K--Y--R--E--G--R--E--V--C--E--S--E--TGCGTGTGCTGCGTCGGGTGCGTCATGCCAACATCATCCAGCTGGTGGAGGTGTTCGAGAL--R--V--L--R--R--V--R--H--A--N--I--I--Q--L--V--E--V--F--E--CACAGGAGCGGGTGTACATGGTGATGGAGCTGGCCACTGGTGGAGAGCTCTTTGACCGCAT--Q--E--R--V--Y--M--V--M--E--L--A--T--G--G--E--L--F--D--R--TCATTGCCAAGGGCTCCTTCACCGAGCGTGACGCCACGCGGGTGCTGCAGATGGTGCTGGI--I--A--K--G--S--F--T--E--R--D--A--T--R--V--L--Q--M--V--L--ATGGCGTCCGGTATCTGCATGCACTGGGCATCACACACCGAGACCTCAAACCTGAGAATCD--G--V--R--Y--L--H--A--L--G--I--T--H--R--D--L--K--P--E--N--TGCTCTACTACCATCCGGGCACTGACTCCAAGATCATCATCACCGACTTCGGCCTGGCCAL--L--Y--Y--H--P--G--T--D--S--K--I--I--I--T--D--F--G--L--A--GTGCTCGCAAGAAGGGTGATGACTGCTTGATGAAGACCACCTGTGGCACGCCTGAGTACAS--A--R--K--K--G--D--D--C--L--M--K--T--T--C--G--T--P--E--Y--TTGCCCCAGAAGTCCTGGTCCGCAAGCCATACACCAACTCAGTGGACATGTGGGCGCTGGI--A--P--E--V--L--V--R--K--P--Y--T--N--S--V--D--M--W--A--L--GCGTCATTGCCTACATCCTACTCAGTGGCACCATGCCGTTTGAGGATGACAACCGTACCCG--V--I--A--Y--I--L--L--S--G--T--M--P--F--E--D--D--N--R--T--GGCTGTACCGGCAGATCCTCAGGGGCAAGTACAGTTACTCTGGGGAGCCCTGGCCTAGTGR--L--Y--R--Q--I--L--R--G--K--Y--S--Y--S--G--E--P--W--P--S--TGTCCAACCTGGCCAAGGACTTCATTGACCGCCTGCTGACAGTGGACCCTGGAGCCCGTAV--S--N--L--A--K--D--F--I--D--R--L--L--T--V--D--P--G--A--R--TGACTGCACTGCAGGCCCTGAGGCACCCGTGGGTGGTGAGCATGGCTGCCTCTTCATCCAM--T--A--L--Q--A--L--R--H--P--W--V--V--S--M--A--A--S--S--S--TGAAGAACCTGCACCGCTCCATATCCCAGAACCTCCTTAAACGTGCCTCCTCGCGCTGCCM--K--N--L--H--R--S--I--S--Q--N--L--L--K--R--A--S--S--R--C--AGAGCACCAAATCTGCCCAGTCCACGCGTTCCAGCCGCTCCACACGCTCCAATAAGTCACQ--S--T--K--S--A--Q--S--T--R--S--S--R--S--T--R--S--N--K--S--GCCGTGTGCGGGAACGGGAGCTGCGGGAGCTCAACCTGCGCTACCAGCAGCAATACAATGR--R--V--R--E--R--E--L--R--E--L--N--L--R--Y--Q--Q--Q--Y--N--GCTGAGCCGCCTGGCTGTGCACACATGCAGCACGACCCAGCCTGGCCACACACTGTGGTGG--*-.......................................................CCATCTGGGTCCGATGCCCTCTCTGGAGATAGGCCTATGTGGCCCACAGTAGGTGAAGAA............................................................TGTCTGGCTCCAGCCCTTTCTCTGTGCCTTCAGCAGCCCCTGTCCTCACCATGGGCCTGG............................................................GCCAGGTGTGACAGAGTAGAGGTAGCACAGGGGGCTGTGACTCCCCCTGAACTGGGAGCC............................................................TGGCCTGGCACTGATACCCCTCTTGGTGGGCAGCTGCTCTGGTGGAGTTGGGAAGGGATA............................................................GGACCTGGCCTTCACTGTCTCCCTTGCCCTTTGACTTTTCCCCAATCAAAGGGAACTGCA............................................................GTGCTGGGTGGAGTGTCCTGTGGCCTCAGGACCCTTTGGGACAGTTACTTCTGGGACCCC............................................................CTTTCCTCCACAGAGCCCTTCTCCCTGGTTTCACACATTCCCATGCATCCTGATCCTTAA............................................................GATTATGCTCCAGTGGGAGACCCTGGTAGGCACAAAGCTTGTGCCTTGACTGGACCCGTA............................................................GCCCCTGGCTAGGTCGAAACAGCCCTCCACCTCCCAGCCAAGATCTGTCTTCCTTCATGG............................................................TGCCTCCAGGGAGCCTTCCTGGTCCCAGGACCTCTGGTGGAGGGCCATGGCGTGGACCTT............................................................CACCCTTCTGGACTGTGTGGCCATGCTGGTCATCGGCTTGCCCAGGCTCCAGCCTCTCCA............................................................GATTCTGAGGGGTCTCAGCCCACCGCCCTTGGTGCCTTCTTTGTAGAGCCCACCGCTACC............................................................TCCCTCTCCCCGTTGGATGTCCATTCCATTCCCCAGGTGCCTCCTTCCCAACTGGGGGTG............................................................GTTAAAGGGAGCCCCACTGCTGCTACCTGGGGAATGGGGCACCTGGGGGCCAAGGCAGAG............................................................GGAAGGGGGTCCTCCCGATTAGGGTCGAGTGTCAGCCTGGGTTCTATCCTTTGGTGCAGC............................................................CCCATTGCCTTTTCCCTTCAGGCTCTGTTGCTCCCTCCTCTGCAGCTGCACGAAGGCGCC............................................................ATCTGGTGTCTGCATGGGTGTTGGCAGCCTGGGAGTGATCACTGCACGCCCATCGTGCAC............................................................ACCTGCCCATCGTGCACACCCACCCATGGTGCACACCTGTAGTCCTCCATGAGGACATGG............................................................GAAGGTAGGAGTTGCCGCCCTGGGGGAGGGTCCCGGGCTGCTCACCTCTCCCCTTCTGCT............................................................GAGCTTCTGCGCACCCCTCCCTGGAACTTAGCCATACTGTGTGACCTGCCTCTGAAACCA............................................................GGGTGCCAGGGGCACTGCCTTCTCACAGCTGGCCTTGCCCCGTCCACCCTGTGCTGCTTC............................................................CCTTCACAGCATTAACCTTCCAGTCTGGGTCCCACTGAGCCTCAAGCTGGAAGGAGCCCC............................................................TGCGGGAGGTGGGTGGGGTTGGGTGGCTGCTTTCCCAGAGGCCTGAGCCAGAACCATCCC............................................................CATTTCTTTTGTGGTATCTCCCCCTACCACAAACCAGGCTGGAACCCAAGCCCCTTCCTC............................................................CACAGCTGCCTTCAGTGGGTAGAATGGGGCCAGGGCCCAGCTTTGGCCTTAGCTTGACGG............................................................CAGGGCCCCTGCCATTGCAGGAGGGTTTGGTTCCCACTCAGCTTCTGCCGGTCGGCAGCC............................................................TGGGCCAGGCCCTTTTCCTGCATGTGCCACCTCCAGTGGGAAACAAAACTAAAGAGACCA............................................................CTCTGTGCCAAGTCGACTATGCCTTAGACACATCCTCCTACCGTCCCCAATGCCCCCTGG............................................................GCAGGAGGCAGTGGAGAACCAAGCCCCATGGCCTCAGAATTTCCCCCCAGTTCCCCAAGT............................................................GTCTCTGGGGACCTGAAGCCCTGGGGCTTACGTTCTCTCTTGCCCAGGGTGGGCCTGGTC............................................................CTGAGGGCAGGACAGGGGGTTTGGAGATGTGGGCCTTTGATAGACCCACTTGGGCCTTCA.............................................................TGCCATGGCCTGTGGATGGAGAATGTGCAGTTATTTATTATGCGTATTCAGTTTGTAAAC............................................................GTATCCTCTGTATTCAGTAAACAGGCTGCCTCTCCAGGGAGGGCTGCCATTCATTCCAAC............................................................AGTTCTGGCTTCTTGCTGTAGGACCAAGGGGTTGCCCTGGAGGAGGGGTGGGGGCCCCGG............................................................CCTCGGCATGGCTACTCTAGGAAGAGCCACTGCTACTCAAGGAGTCACTCAGCCCCTTCT............................................................GTGCCAGAAGTCCAAGTAGGGAGTCGGACCCTCAACAGCCTCTTCTTTCTCCTGAGCCAG............................................................GAAGACAGACATGAATGCATGATGGGACAGGGCCTGGGTCTTTAATGGGTTGAGCTGGGG............................................................
AGGGCCTGTGGTGAGCTCAGTTGTAGGCTATGACCTGGTT........................................
转录物:PSKH1-001 ENST00000291041
蛋白质序列(SEQ ID NO.:130)
MGCGTSKVLPEPPKDVQLDLVKKVEPFSGTKSDVYKHFITEVDSVGPVKAGFPAASQYAHPCPGPPTAGHTEPPSEPPRRARVAKYRAKFDPRVTAKYDIKALIGRGSFSRVVRVEHRATRQPYAIKMIETKYREGREVCESELRVLRRVRHANIIQLVEVFETQERVYMVMELATGGELFDRIIAKGSFTERDATRVLQMVLDGVRYLHALGITHRDLKPENLLYYHPGTDSKIIITDFGLASARKKGDDCLMKTTCGTPEYIAPEVLVRKPYTNSVDMWALGVIAYILLSGTMPFEDDNRTRLYRQILRGKYSYSGEPWPSVSNLAKDFIDRLLTVDPGARMTALQALRHPWVVSMAASSSMKNLHRSISQNLLKRASSRCQSTKSAQSTRSSRSTRSNKSRRVRERELRELNLRYQQQYNG
DUS2L-PSKH1融合序列外显子10到外显子2UTR
cDNA序列(SEQ ID NO.:131)。PSKH1加下划线。
ATGATTTTGAATAGCCTCTCTCTGTGTTACCATAATAAGCTAATCCTGGCCCCAATGGTTCGGGTAGGGACTCTTCCAATGAGGCTGCTGGCCCTGGATTATGGAGCGGACATTGTTTACTGTGAGGAGCTGATCGACCTCAAGATGATTCAGTGCAAGAGAGTTGTTAATGAGGTGCTCAGCACAGTGGACTTTGTCGCCCCTGATGATCGAGTTGTCTTCCGCACCTGTGAAAGAGAGCAGAACAGGGTGGTCTTCCAGATGGGGACTTCAGACGCAGAGCGAGCCCTTGCTGTGGCCAGGCTTGTAGAAAATGATGTGGCTGGTATTGATGTCAACATGGGCTGTCCAAAACAATATTCCACCAAGGGAGGAATGGGAGCTGCCCTGCTGTCAGACCCTGACAAGATTGAGAAGATCCTCAGCACTCTTGTTAAAGGGACACGCAGACCTGTGACCTGCAAGATTCGCATCCTGCCATCGCTAGAAGATACCCTGAGCCTTGTGAAGCGGATAGAGAGGACTGGCATTGCTGCCATCGCAGTTCATGGGAGGTGTAGACGGGGCACTGCCTTCAGAGCAGGTCCTGCCAGCCTCGCTGGAGAGGATGCCCTCGTGTCCGTGATGGGCTGTGGGACAAGCAAGGTCCTTCCCGAGCCACCCAAGGATGTCCAGCTGGATCTGGTCAAGAAGGTGGAGCCCTTCAGTGGCACTAAGAGTGACGTGTACAAGCACTTCATCACAGAGGTGGACAGTGTTGGCCCTGTCAAAGCCGGGTTCCCAGCAGCAAGTCAGTATGCACACCCCTGCCCCGGTCCCCCGACTGCTGGCCACACGGAGCCTCCCTCAGAACCACCACGCAGGGCCAGGGTAGCTAAGTACAGGGCCAAGTTTGACCCACGTGTTACAGCTAAGTATGACATCAAGGCCCTAATTGGCCGAGGCAGCTTCAGCCGAGTGGTACGTGTAGAGCACCGGGCAACCCGGCAGCCGTATGCCATCAAGATGATTGAGACCAAGTACCGGGAGGGGCGGGAGGTGTGTGAGTCGGAGCTGCGTGTGCTGCGTCGGGTGCGTCATGCCAACATCATCCAGCTGGTGGAGGTGTTCGAGACACAGGAGCGGGTGTACATGGTGATGGAGCTGGCCACTGGTGGAGAGCTCTTTGACCGCATCATTGCCAAGGGCTCCTTCACCGAGCGTGACGCCACGCGGGTGCTGCAGATGGTGCTGGATGGCGTCCGGTATCTGCATGCACTGGGCATCACACACCGAGACCTCAAACCTGAGAATCTGCTCTACTACCATCCGGGCACTGACTCCAAGATCATCATCACCGACTTCGGCCTGGCCAGTGCTCGCAAGAAGGGTGATGACTGCTTGATGAAGACCACCTGTGGCACGCCTGAGTACATTGCCCCAGAAGTCCTGGTCCGCAAGCCATACACCAACTCAGTGGACATGTGGGCGCTGGGCGTCATTGCCTACATCCTACTCAGTGGCACCATGCCGTTTGAGGATGACAACCGTACCCGGCTGTACCGGCAGATCCTCAGGGGCAAGTACAGTTACTCTGGGGAGCCCTGGCCTAGTGTGTCCAACCTGGCCAAGGACTTCATTGACCGCCTGCTGACAGTGGACCCTGGAGCCCGTATGACTGCACTGCAGGCCCTGAGGCACCCGTGGGTGGTGAGCATGGCTGCCTCTTCATCCATGAAGAACCTGCACCGCTCCATATCCCAGAACCTCCTTAAACGTGCCTCCTCGCGCTGCCAGAGCACCAAATCTGCCCAGTCCACGCGTTCCAGCCGCTCCACACGCTCCAATAAGTCACGCCGTGTGCGGGAACGGGAGCTGCGGGAGCTCAACCTGCGCTACCAGCAGCAATACAATGGCTGA
DUS2L-PSKH1融合序列外显子10到外显子2UTR
蛋白质序列(SEQ ID NO.:132),PSKH1加下划线。
MILNSLSLCYHNKLILAPMVRVGTLPMRLLALDYGADIVYCEELIDLKMIQCKRVVNEVLSTVDFVAPDDRVVFRTCEREQNRVVFQMGTSDAERALAVARLVENDVAGIDVNMGCPKQYSTKGGMGAALLSDPDKIEKILSTLVKGTRRPVTCKIRILPSLEDTLSLVKRIERTGIAAIAVHGRCRRGTAFRAGPASLAGEDALVSVMGCGTSKVLPEPPKDVQLDLVKKVEPFSGTKSDVYKHFITEVDSVGPVKAGFPAASQYAHPCPGPPTAGHTEPPSEPPRRARVAKYRAKFDPRVTAKYDIKALIGRGSFSRVVRVEHRATRQPYAIKMIETKYREGREVCESELRVLRRVRHANIIQLVEVFETQERVYMVMELATGGELFDRIIAKGSFTERDATRVLQMVLDGVRYLHALGITHRDLKPENLLYYHPGTDSKIIITDFGLASARKKGDDCLMKTTCGTPEYIAPEVLVRKPYTNSVDMWALGVIAYILLSGTMPFEDDNRTRLYRQILRGKYSYSGEPWPSVSNLAKDFIDRLLTVDPGARMTALQALRHPWVVSMAASSSMKNLHRSISQNLLKRASSRCQSTKSAQSTRSSRSTRSNKSRRVRERELRELNLRYQQQYNG
蛋白质结构域
无跨膜结构域。
DUS2L-PSKH1融合序列外显子3到外显子2UTR
cDNA序列(SEQ ID NO.:133),PSKH1加下划线。
ATGATTTTGAATAGCCTCTCTCTGTGTTACCATAATAAGCTAATCCTGGCCCCAATGGTTCGGGTAGGGACTCTTCCAATGAGGCTGCTGGCCCTGGATTATGGAGCGGACATTGTTTACTGTGAGGAGCTGATCGACCTCAAGATGATTCAGTGCAAGAGAGTTGTTAATGAGGTGCTCAGCACAGTGGACTTTGTCGCCCCTGATGATCGAGTTGTCTTCCGCACCTGTGAAAGAGAGCAGAACAGGGTGGTCTTCCAGATGGTGTAGACGGGGCACTGCCTTCAGAGCAGGTCCTGCCAGCCTCGCTGGAGAGGATGCCCTCGTGTCCGTGATGGGCTGTGGGACAAGCAAGGTCCTTCCCGAGCCACCCAAGGATGTCCAGCTGGATCTGGTCAAGAAGGTGGAGCCCTTCAGTGGCACTAAGAGTGACGTGTACAAGCACTTCATCACAGAGGTGGACAGTGTTGGCCCTGTCAAAGCCGGGTTCCCAGCAGCAAGTCAGTATGCACACCCCTGCCCCGGTCCCCCGACTGCTGGCCACACGGAGCCTCCCTCAGAACCACCACGCAGGGCCAGGGTAGCTAAGTACAGGGCCAAGTTTGACCCACGTGTTACAGCTAAGTATGACATCAAGGCCCTAATTGGCCGAGGCAGCTTCAGCCGAGTGGTACGTGTAGAGCACCGGGCAACCCGGCAGCCGTATGCCATCAAGATGATTGAGACCAAGTACCGGGAGGGGCGGGAGGTGTGTGAGTCGGAGCTGCGTGTGCTGCGTCGGGTGCGTCATGCCAACATCATCCAGCTGGTGGAGGTGTTCGAGACACAGGAGCGGGTGTACATGGTGATGGAGCTGGCCACTGGTGGAGAGCTCTTTGACCGCATCATTGCCAAGGGCTCCTTCACCGAGCGTGACGCCACGCGGGTGCTGCAGATGGTGCTGGATGGCGTCCGGTATCTGCATGCACTGGGCATCACACACCGAGACCTCAAACCTGAGAATCTGCTCTACTACCATCCGGGCACTGACTCCAAGATCATCATCACCGACTTCGGCCTGGCCAGTGCTCGCAAGAAGGGTGATGACTGCTTGATGAAGACCACCTGTGGCACGCCTGAGTACATTGCCCCAGAAGTCCTGGTCCGCAAGCCATACACCAACTCAGTGGACATGTGGGCGCTGGGCGTCATTGCCTACATCCTACTCAGTGGCACCATGCCGTTTGAGGATGACAACCGTACCCGGCTGTACCGGCAGATCCTCAGGGGCAAGTACAGTTACTCTGGGGAGCCCTGGCCTAGTGTGTCCAACCTGGCCAAGGACTTCATTGACCGCCTGCTGACAGTGGACCCTGGAGCCCGTATGACTGCACTGCAGGCCCTGAGGCACCCGTGGGTGGTGAGCATGGCTGCCTCTTCATCCATGAAGAACCTGCACCGCTCCATATCCCAGAACCTCCTTAAACGTGCCTCCTCGCGCTGCCAGAGCACCAAATCTGCCCAGTCCACGCGTTCCAGCCGCTCCACACGCTCCAATAAGTCACGCCGTGTGCGGGAACGGGAGCTGCGGGAGCTCAACCTGCGCTACCAGCAGCAATACAATGGCTGA
蛋白质序列(SEQ ID NO.:134)
M I L N S L S L C Y H N K L I L A P M V R V G T L P M R L L A L D Y G A D I V Y C E E L I D L K M I Q C K R V V N E V L S T V D F V A P D D R V V F R T C E R E Q N R V V F Q M V终止密码子
蛋白质结构域
无结构域。
这一研究中的融合基因中的每一种的mRNA融合点的基因组位置呈现于表4中。
表4:对应于这一研究中的五种复现的融合基因的mRNA融合点的基因组位置。
实验程序
实施例1
在胃癌(GC)中通过全基因组DNA-PET测序所鉴定的结构变异(SV)
通过DNA-PET由包括十个配对正常样品在内的14个原发性胃肿瘤和胃癌细胞系TMK1对基因组DNA进行测序。在基因组的约2倍碱基对覆盖度和200倍物理覆盖度的情况下,1,945个体细胞SV被鉴定(图1A-C),其中在生殖系SV与体细胞SV之间的SV分布方面有显著性差异(P=2.2×10-16,χ2检验,图1D),这表明了不同的突变机制或选择机制。与已经针对SV经过详细分析的其它癌症类型相比,GC显示出比前列腺癌高的比例的串联重复以及比胰腺癌多的倒位(图1E),这表明每一种癌症类型带有它自身的重排模式。
实施例2
GC中体细胞SV的特征提供了对重排机制的深入了解
生殖系断裂点和体细胞断裂点这两者在重复区(P<10-5,图2A)和开放染色质结构域(P<10-21,χ2检验;图2B)中均是富集的,而只有体细胞断裂点在基因中是富集的(P<10-15,χ2检验)而生殖系断裂点在基因中是贫化的(P<10-15,χ2检验,图2C)。这可以反映出对生殖系中基因破坏性重排的负选择,以及与此相反,改变基因结构的体细胞重排的促癌潜能。这些观测结果表明基因组的转录活性部分更容易发生GC中的体细胞重排。
观测到验证的融合点中有2%具有特征模式,其中插入序列源自于接近融合点的基因座(图2D)。这些情况中的三种情况产生融合基因(ARHGAP26-CLDN18、LIFR-GATA4以及MLL3-PRKAG2)。在相同的基因座处观测到这些重排特征可以表明可能是转录偶联的特定机制。
通过搜索SV与通过配对末端标签(ChIA-PET)测序数据进行的染色质相互作用分析之间的重叠来测试体细胞SV的重排伴侣位点倾向于在核内在空间上接近的可能性。作为概念验证,分别对乳腺癌和慢性骨髓性白血病(CML)的细胞系衍生的(MCF-7和K562)染色质相互作用和肿瘤衍生的体细胞SV进行比较并且观测到显著的重叠。
为了研究所述研究的生殖系SV和体细胞SV的两个伴侣位点是否对于核中彼此接近的基因座是富集的,测试SV与通过对乳腺癌细胞系MCF-7进行ChIA-PET测序所获得的全基因组染色质相互作用数据集的重叠,基本原理是一些染色质相互作用在不同的细胞类型间可能是保守的(图3)。
由于胃细胞系的ChIA-PET数据是不可获得的,因此使用来自乳腺癌细胞系MCF-7的数据,假设一些染色质相互作用在不同的组织间是稳定的。15例GC的1,667个生殖系SV和1,945个体细胞SV与MCF-7的87,253个染色质相互作用重叠,并且61个(3.7%)生殖系SV和19个(1%)体细胞SV重叠被发现,多于随机期望(P<0.001,基于置换,图2E),这表明染色质相互作用促成生殖系GC SV和体细胞GC SV的形状。
实施例3
GC中的重排热点
用严格的搜索标准鉴定出14个复现的体细胞SV并且用宽松的搜索标准鉴定出另外的173个SV。复现的重排簇集在七个热点中,FHIT、WWOX、MACROD2、PARK2以及PDE4D处于已知的脆性位点处并且NAALADL2和CCSER1(FAM190A)处于新的热点处。所有反复重排的基因对于癌症具有相关性。有趣的是,具有七个重排热点中最高数目的体细胞SV(分别是12个和11个)的肿瘤17和TMK1也在具有最大数目的体细胞SV的GC的范围内(图1B),这表明这些重排热点使重排在具有基因组不稳定性的肿瘤中快速积聚或热点基因的破坏在机制上导致基因组不稳定性。我们还发现MYC基因座处复现的串联重复和ATM基因座处复现的缺失,它们是癌症生物学中的两个关键的基因,这进一步证实复现的体细胞SV可能与癌症生物学具有相关性。
实施例4
GC中复现的融合基因
使用15例GC的体细胞SV,预测了136个融合基因,通过基因组PCR和Sanger测序验证了它们中的97个,并且在对应的肿瘤中通过逆转录聚合酶链反应(RT-PCR)确认了44个的表达。十五个表达的融合基因在框内。由于组成型活性致癌融合基因通常是框内融合,因此将重点放在这一类别上以通过RT-PCR对一组另外的85个GC肿瘤/正常对进行筛选并且发现一例另外的肿瘤中的SNX2-PRDM6、两例另外的肿瘤中的CLDN18-ARHGAP26和DUS2L-PSKH1、三例另外的肿瘤中的MLL3-PRKAG2、以及四例另外的肿瘤中的CLEC16A-EMP2,这给出了2%-5%的总频率(图4A-C和图5至8)。进行统计模拟以评估这样的复现率的显著性。使用随机化框架来评估融合基因的观测频率的统计显著性。限定了15个SV谱,所述SV谱模拟通过DNA-PET测序的样品中所鉴定出的SV的类型、数目以及大小分布。使用SV谱模拟15例GC测试数据集的SV并且对85个GC样品的模拟验证集评估复现的SV的频率。假设N=10,000是随机模拟的次数并且es是测试数据集中存在的SV s在验证数据集中的频率,我们将P值(es)定义为p/N,其中p是模拟次数,其中在频率ek≥es的情况下,SV k存在。
已发现它们不是随机期望的(P=0.00472),两个重新发现率(P=9.98×10-5)和三个重新发现率(P=1.11×10-5)具有更高的显著性水平。这表明这些融合基因不是随机产生的,而最有可能是通过靶向重排机制产生的和/或所得的融合基因提供了选择优势。
实施例5
融合基因对细胞增殖的作用
为了探究融合基因是否提供选择优势,使用生物信息学和细胞生物学方法。经由计算机模拟,使用网络融合中心分析来预测驱动融合基因。在这一研究的136个融合基因中,38个被分类为潜在的驱动融合基因,包括CLDN18-ARHGAP26、SNX2-PRDM6以及MLL3-PRKAG2(表5)。由于鉴定出TMK1中的MLL3-PRKAG2和DUS2L-PSKH1,因此进行对MLL3-PRKAG2和DUS2L-PSKH1转录物的融合点具有特异性的短干扰RNA(siRNA)实验。在使MLL3-PRKAG2沉默时,观测到细胞增殖减少了63%(图5),但是对于DUS2L-PSKH1基因敲低的细胞观测到非决定性的变化(图6)。因此,基于GC中4%的频率、预测的驱动特性、以及促增殖作用的实验证据,表明了MLL3-PRKAG2对于GC具有促致癌性。
表5:驱动融合基因预测。
为了研究CLDN18-ARHGAP26、CLEC16A-EMP2以及SNX2-PRDM6在GC中的功能,在GC细胞系HGC27中产生稳定的过表达,并且对于CLDN18-ARHGAP26(增加85%,P=4.2×10-6,T检验;图4G、H)和CLEC16A-EMP2(增加50%,P=7.9×10-5,T检验;图7)显示细胞增殖率增加,但是对于SNXX2-PRDM6(降低46%,P=9×10-6,T检验;图8)显示增殖率降低。
由CLDN18-ARHGAP26的过表达所引起的高增殖率表明这种融合基因的致癌作用,并且对它的功能进行进一步研究。CLDN18-ARHGAP26编码75.6kDa的融合蛋白,所述融合蛋白含有CLDN18的所有四个跨膜结构域以及ARHGAP26的RhoGAP结构域,但是缺少CLDN18的C末端PDZ结合基序(图4E),所述基序介导与闭锁小带支架蛋白(ZO-1、ZO-2、ZO-3)的相互作用。CLDN18属于密封蛋白(claudin)蛋白质家族,所述密封蛋白是紧密连接(TJ)的组分。ARHGAP26(GRAF1)与粘着斑激酶(FAK)结合,所述粘着斑激酶调节细胞生长、增殖、存活、粘附以及迁移。ARHGAP26还可以负调节小GTP结合蛋白RhoA,所述RhoA在RAS介导的恶性转化中的生长促进作用是公知的。
在带有CLDN18-ARHGAP26融合体的所有三例肿瘤中,转录物是通过CLDN18的外显子5的编码区内的隐蔽剪接位点和ARHGAP26的外显子12的常规剪接位点接合的(图4D)。在基因组水平上,我们通过荧光原位杂交(FISH,图4B)和PCR/Sanger测序(图4C)来验证肿瘤136中的CLDN18-ARHGAP26重排。使用定制的捕获测序,肿瘤07K611T中的基因组融合点被鉴定为CLDN18下游2,342bp(图4A),这表明隐蔽剪接位点甚至在断裂点处于CLDN18基因下游时仍介导了框内融合。
实施例6
表达CLDN18-ARHGAP26的患者样品和MDCK细胞中的上皮表型丧失
对于肿瘤样品中的免疫荧光,使用CLDN18抗体和ARHGAP26抗体,所述抗体均能够检测CLDN18-ARHGAP26融合蛋白(图9A)。在正常胃样品和表达融合体的肿瘤胃样品中,在内衬胃小凹区域的上皮细胞的质膜中以及在胃腺的基部处观测到CLDN18蛋白(图10A)。先前在HeLa细胞中的多形管状和点状膜结构上检测到ARHGAP26。在这一研究中,在正常胃中在整个胃粘膜上的囊泡结构上观测到ARHGAP26(图10B)。与分化良好的正常胃上皮相反,表达CLDN18-ARHGAP26的胃肿瘤样品显示出无组织的结构。虽然上皮标志物CDH1(E-钙粘蛋白)在对照组织中的上皮细胞的膜处表达,但是它在肿瘤样品中显示出细胞内点状分布或不存在于细胞中(图10A、B)。CLDN18-ARHGAP26存在于肿瘤样品中的E-钙粘蛋白阳性细胞和E-钙粘蛋白阴性细胞这两者中,E-钙粘蛋白阴性细胞显示出间充质特征(图10A、B),这与改变细胞-细胞粘附从而使得上皮表型丧失的融合蛋白相一致。总体而言,融合基因与对胃上皮完整性的毁灭性损伤相关。
为了了解融合蛋白对在肿瘤样品中所观测到的上皮完整性变化所起的作用,使CLDN18、ARHGAP26或CLDN18-ARHGAP26在未转化的上皮MDCK细胞中稳定表达。通过相衬观测到,对照细胞培养物和MDCK-CLDN18细胞培养物显示出特征性上皮形态(图10C)。虽然MDCK-ARHGAP26细胞略微更加呈梭形并且具有短的突起,但是MDCK-CLDN18-ARHGAP26细胞表现出上皮表型的显著丧失和长的突起,这指示了上皮-间充质转化(EMT)(图10C)。细胞聚集测定表明MDCK-CLDN18-ARHGAP26细胞的聚集不佳(图10D),这表明了的确,融合基因引起了所观测到的上皮变化。还用HGC27细胞获得了相似的结果。
为了评价由CLDN18-ARHGAP26所诱导的表型变化是否反映了EMT,使用定量PCR(qPCR)来研究各种EMT标志物的表达。虽然在表达ARHGAP26和CLDN18-ARHGAP26的细胞中E-钙粘蛋白mRNA水平保持不变,但是主EMT调节因子SNAI1(Snail)和SNAI2(Slug)的mRNA减少(图10E)。相对于对照MDCK细胞,MDCK-CLDN18-ARHGAP26显示出MMP2(基质金属蛋白酶2)mRNA水平提高5.2倍(图10E),这表明融合基因诱导细胞外基质(ECM)粘附发生变化。
有趣的是,在CLDN18,而不是融合蛋白表达的情况下,在转化的HeLa细胞中观测到N-钙粘蛋白和β-连环蛋白表达下调(图10F和9B-D),这分别表明CLDN18可以逆转在EMT期间所观测到的从上皮钙粘蛋白向间充质钙粘蛋白的转换和抑制Wnt信号转导。Wnt信号转导在许多癌症中是过度激活的,并且N-钙粘蛋白表达激活AKT信号转导,所述AKT信号转导在许多肿瘤中是超激活的。的确,与对照相比,在过表达CLDN18的HeLa细胞中,pAKT蛋白质水平以及下游效应子p21激活激酶(PAK)的蛋白质水平降低(图10G)。这表明了CLDN18作为肿瘤抑制因子的作用,所述作用是通过阻抑AKT信号转导和Wnt信号转导而实现的。
实施例7
CLDN18-ARHGAP26减少细胞-细胞外基质粘附
ARHGAP26可能经由它与FAK的相互作用以及它对RhoA的调节来影响细胞与ECM的粘附,这进而调节粘着斑。粘附测定显示对照细胞和MDCK-CLDN18细胞在未处理的表面或ECM包被的表面上附着和扩散。不仅表达ARHGAP26的细胞,而且甚至更是这样,表达CLDN18-ARHGAP26的细胞不太高效地附着到表面(图11A),但是确实附着的细胞在接种之后两小时之时仍变圆(图11A),这证实融合基因增强了ARHGAP26的作用并且强烈地影响了细胞-ECM粘附特性。存在于融合蛋白中的ARHGAP26的SH3结构域与粘着斑分子FAK和PXN(桩蛋白)结合。因此研究CLDN18-ARHGAP26的表达对粘着斑蛋白的影响。pFAK和桩蛋白在MDCK-CLDN18和MDCK-ARHGAP26的游离边缘处被检测到,但是不存在于MDCK-CLDN18-ARHGAP26细胞中的这一位置处(图11B、C)。对与ARHGAP26或粘着斑复合物蛋白质相关的粘附分子进行的蛋白质印迹分析显示MDCK-ARHGAP26中的β-Pix、LIMS1(PINCH1)以及桩蛋白的水平降低,并且在MDCK-CLDN18-ARHGAP26细胞中这样的情况更加显著(图11D)。
在反映出蛋白质水平的变化的情况下,通过qPCR在MDCK-ARHGAP26细胞和MDCK-CLDN18-ARHGAP26细胞中观测到PINCH1和桩蛋白转录物的水平显著降低(图11E)。还观测到在表达融合蛋白的细胞中踝蛋白-1、踝蛋白-2以及SDC1(多配体蛋白聚糖1)mRNA水平的大幅降低,进一步指示了CLDN18-ARHGAP26细胞的不佳ECM粘附(图11E)。
除了粘着斑的细胞质组分之外,还分析了直接与ECM组分相互作用的整合素家族成员的蛋白质水平。与MDCK-CLDN18-ARHGAP26细胞在胶原包被的表面上不佳的附着(图11A)相一致,这些细胞表达降低水平的ITGB1(整合素β1)和ITGB5(整合素β5)(图11F)。的确,在MDCK-CLDN18-ARHGAP26细胞中观测到许多整合素亚基,特别是整合素α5的转录物水平降低(图11G)。综上所述,ARHGAP26的过表达以及甚至更是这样的融合基因的过表达破坏ECM粘附。
实施例8
由CLDN18促进的上皮屏障被CLDN18-ARHGAP26损害
密封蛋白是细胞旁上皮屏障的关键组分,所述屏障包括保护胃组织不受胃腔中的酸性环境损伤。这种屏障功能的改变可能导致慢性炎症,即产生GC的一种风险因素。因此,研究了CLDN18和融合蛋白在屏障形成中的作用。并非内源性表达在MDCK细胞中的CLDN18的过表达使得MDCK-CLDN18单层的跨上皮电阻(TER)显著提高。虽然ARHGAP26对TER没有显著的影响,但是CLDN18-ARHGAP26完全消除了TER(图11H)。这一效应并非单纯反映了C末端PDZ结合基序的缺乏,这是因为其中这一C末端PDZ结合基序失活的CLDN18构建体(CLDN18ΔP)仍提高了MDCK细胞的基线TER。汇合的表达CLDN18-ARHGAP26融合体的MDCK细胞的相衬图像显示这些细胞不能形成紧密的单层,从而解释了TER的丧失(图11I)。虽然TJP1(ZO-1)(使密封蛋白与肌动蛋白细胞骨架直接连接的一种支架蛋白)的表达水平和亚细胞定位在表达融合蛋白的MDCK细胞中没有发生改变(图9E、F),但是若干种其它TJ组分的表达在MDCK-CLDN18-ARHGAP26中上调,这可能是作为一种代偿机制(图9E)。
实施例9
CLDN18-ARHGAP26对细胞增殖、侵袭以及迁移发挥细胞背景特异性作用
在GC细胞系HGC27中,CLDN18-ARHGAP26诱导增殖的增加(图4H)。然而,有趣的是,在未转化的MDCK细胞中,与对照相比,MDCK-CLDN18-AHGAP26细胞的增殖率更低(图12A)。虽然伤口闭合实验显示与对照相比MDCK-CLDN18-ARHGAP26细胞的细胞迁移减少(图12B),但是MDCK细胞中CLDN18-ARHGAP26的表达对作为癌症进展和转移的特征的侵袭和非锚定依赖性生长没有影响。因此测试这些过程以确定它们在癌细胞系HGC27和HeLa中是否发生改变。稳定表达CLDN18-ARHGAP26的两种独立的HeLa细胞系显示出细胞侵袭增加3倍至4倍(图12C)并且稳定表达融合蛋白的HeLa细胞和HGC27细胞在软琼脂生长测定中多形成30%的集落(图12D)。这些发现突出了融合蛋白对未转化的细胞和转化的细胞的增殖、侵袭以及非锚定依赖性生长的不同作用,并且表明了融合蛋白驱动晚期癌症事件,如侵袭和转移的作用。
实施例10
ARHGAP26和CLDN18-ARHGAP26这两者抑制RhoA和应力纤维形成
RhoA在生长因子受体或整合素与它们对应的配体结合时调节许多肌动蛋白事件,如肌动蛋白聚合、收缩以及应力纤维形成。ARHGAP26经由它的GAP结构域刺激CDC42和RhoA的GTP酶活性,从而使它们失活。由于CLDN18-ARHGAP26融合蛋白保留ARHGAP26的GAP结构域,因此它可以仍能够使RhoA失活。为了对此进行测试,分析CLDN18-ARHGAP26的表达对应力纤维形成以及活性RhoA(例如GTP结合的RhoA)的存在和亚细胞定位的影响。在HeLa细胞中,ARHGAP26或CLDN18-ARHGAP26的稳定过表达诱导细胞骨架变化,特别是应力纤维的减少,这指示了RhoA失活(图13A)。将稳定细胞系用特异性识别激活的RhoA的抗体标记显示出表达ARHGAP26和CLDN18-ARHGAP26融合蛋白的细胞中标记减少,而总RhoA水平保持不变(图13B、C)。测量活性RhoA水平的GLISA测定进一步确认了这些结果(图13D)。这些发现表明了CLDN18-ARHGAP26融合蛋白中的GAP结构域保留了它对RhoA的抑制活性。
实施例11
CLDN18-ARHGAP26融合蛋白抑制非网格蛋白依赖性内吞
内吞的变化可以影响细胞表面停留时间和/或细胞-ECM粘附蛋白和细胞-细胞粘附蛋白以及受体酪氨酸激酶(RTK)的降解,从而改变细胞粘附、迁移以及RTK信号转导,这些可能驱动致癌作用。与其它细胞系相反,表达CLDN18-ARHGAP26融合蛋白的HeLa细胞显示出内吞的显著减少(图13E和实施例13),这与融合蛋白中BAR结构域和PH结构域的不存在相一致,这些结构域对于内吞来说是必要的。
实施例12
复现的融合基因CLEC16A-EMP2、SNX2-PRDM6、MLL3-PRKAG2以及DUS2L-PSKH1的生物学背景
在癌细胞系TMK1中并且随后在两例原发性胃肿瘤中鉴定出DUS2L与PSKH1之间的融合转录物。然而,在一例肿瘤中,DUS2L的外显子3与PSKH1的外显子2(UTR区)融合,从而产生框外融合转录物(图6)。在TMK1和第二例肿瘤中,DUS2L的外显子10与PSKH1的外显子2在框内融合。非小细胞肺癌细胞中DUS2L的siRNA基因敲低抑制了生长并且已经报道了肿瘤中高水平的DUS2L与肺癌患者的更差预后之间的关联。PSKH1被鉴定为前列腺癌细胞生长的一种调节因子。对于DUS2L-PSKH1没有发现一致的增殖作用(图6)。然而,增殖只是(融合)基因可能促进肿瘤发生或进展的一个可能的机制,并且仍有可能的是,DUS2L-PSKH1在GC中起作用。
非配对倒位产生融合基因CLEC16A-EMP2,所述融合基因在100例GC中的五例中被鉴定出。在CLEC16A中,外显子4(一例肿瘤)、外显子9(两例肿瘤)或外显子10(两例肿瘤)与EMP2的外显子2融合(图7)。EMP2外显子2的前60bp是5′UTR并且融合使得在EMP2的标准起始甲硫氨酸前面包括了20个氨基酸。预测的开放阅读框编码328个、486个以及524个氨基酸,从而保留了具有其功能结构域的整个EMP2蛋白。在B细胞淋巴瘤细胞系中进行的实验表明EMP2充当肿瘤抑制因子。相反,EMP2被发现在>70%的卵巢肿瘤中高表达,针对EMP2的抗体在具有卵巢癌细胞系的小鼠异种移植物中显著抑制肿瘤生长并且诱导细胞死亡。EMP2因此可能是药物靶标。这两个研究表明了EMP2在癌症中的作用,但是作用可能具有组织特异性。通过表达微阵列来分析15例经过测序的GC中的14例,并且发现EMP2在所有的GC中的高表达水平以及在带有CLEC16A-EMP2融合体的肿瘤113中最高的表达(数据未示)。这与作为融合体的一部分的EMP2的致癌作用是一致的。用稳定表达融合基因的HGC27进行的增殖测定(图7)进一步支持了CLEC16A-EMP2可能具有致癌特性。
SNX2-PRDM6被发现在一例胃肿瘤中框内融合(SNX2的外显子12与PRDM6的外显子4融合)以及在第二例肿瘤中框外融合(SNX2的外显子2与PRDM6的外显子7融合,图8)。SNX2编码分选连接蛋白家族的成员并且这个家族的成员参与细胞内运输。PRDM6有可能具有组蛋白甲基转移酶功能并且可能用作转录阻遏因子。小鼠胚胎内皮细胞中PRDM6的过表达诱导细胞凋亡以及减少的管形成,这表明PRDM6可能通过染色质重塑在脉管系统中起作用。观测到稳定表达SNX2-PRDM6的HGC27的增殖率降低,但是潜在致癌作用可能与增强的脉管系统相关,而不是与增殖相关。
实施例13
CLDN18-ARHGAP26融合蛋白抑制非网格蛋白依赖性内吞
ARHGAP26被报道为对非网格蛋白依赖性内吞来说是不可缺少的,并且许多受体酪氨酸激酶(RTK)可以通过网格蛋白依赖性途径和非网格蛋白依赖性途径这两者被内化。为了评价CLDN18-ARHGAP26融合蛋白对非网格蛋白依赖性内吞的作用,将与异硫氰酸荧光素(FITC)缀合的CTxB(非网格蛋白依赖性内吞的一种标志物)与活的对照HeLa细胞或稳定表达CLDN18、ARHGAP26或CLDN18-ARHAGP26的细胞一起孵育15分钟。然后将细胞固定并且通过荧光显微术使内化的FITC-CTxB可视化。与其它细胞系相反,表达CLDN18-ARHGAP26融合蛋白的HeLa细胞显示出内吞的CTxB的量显著减少(图13),这与融合蛋白中BAR结构域和PH结构域的不存在相一致,这些结构域对于内吞来说是必要的。
在这一研究中观测到复现的体细胞SV和复现的融合基因。模拟证实复现的融合基因的比率不能用随机来解释,这表明与其它重排相比,特定的重排更有可能发生和/或选择过程富集了这些重排。通过将体细胞SV与染色质相互作用的全基因组视图相比较,相较于随机期望,观测到重排位点与染色质相互作用的显著更多的重叠,这表明染色质结构促成了GC中复现的远隔基因座的融合。
这是首次对在癌症中的体细胞SV与染色质相互作用之间进行的系统的相关性分析。由于染色质结构是在不同于GC的细胞类型中被表征的,因此染色质相互作用与重排之间实际的重叠率可能已经被低估。
评价了136种融合基因的有效性、表达以及阅读框特征,并且通过延伸的筛选鉴定出五种复现的融合基因。详细地分析了CLDN18-ARHGAP26并且发现促进早期癌症产生和晚期疾病进展这两者的功能特性。CLDN18和ARHGAP26在胃粘膜上皮中表达,其中CLDN18定位于紧密连接(TJ)并且ARHGAP26定位于上皮细胞的点状管状囊泡结构。CLDN18-ARHGAP26融合基因因此使RhoA的调节因子的功能蛋白结构域与TJ蛋白连接,从而产生改变的特性。这些以及GAP活性的异常定位使得向与GC有关的细胞功能发生变化。
虽然CLDN18-ARHGAP26与致瘤性HeLa细胞和HGC27细胞的增殖增加、锚定依赖性生长和侵袭有关,但是这些细胞过程在未转化的MDCK细胞中减少(增殖、伤口闭合),这表明转化的程度影响融合蛋白的效应中的一些,这与致癌作用的多步模型相一致。在相关GC中原位过表达以及当在MDCK细胞中过表达时,CLDN18-ARHGAP26与上皮表型的丧失有关。
164844PCT-CN-ELLA序列表
<110> 新加坡科技研究局
<120> 癌症中的融合基因
<130> 9869SG3063
<160> 135
<170> PatentIn 3.5版
<210> 1
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 1
tttcaactac caggggctgt 20
<210> 2
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 2
gccagtcttt ccgttcagag 20
<210> 3
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 3
tagtggagac catccgttcc 20
<210> 4
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 4
ccttctctgg tcacgggata 20
<210> 5
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 5
cagtacggtg tgtggagctg 20
<210> 6
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 6
ggtgcaggtt cttcatggat 20
<210> 7
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 7
cctttccaga gagccagaaa 20
<210> 8
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 8
gcaaaacgtg acccagagac 20
<210> 9
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 9
ttcaccagca ctgtctccac 20
<210> 10
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 10
ttcgattgat tctgggctct 20
<210> 11
<211> 40
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 11
ggcgcggatc cgccgccacc atgtttggcc gctcgcggag 40
<210> 12
<211> 73
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 12
tgatagcggc cgctcatcaa gcgtaatctg gaacatcgta tgggtactcg agtttgcgct 60
tcctcagtat cag 73
<210> 13
<211> 40
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 13
ggcgcggatc cgccgccacc atggccgtga ctgcctgtca 40
<210> 14
<211> 73
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 14
gatagcggcc gctcatcaag cgtaatctgg aacatcgtat gggtactcga ggaggaactc 60
cacgtaattc tca 73
<210> 15
<211> 42
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 15
ggcgcttaat taagccgcca ccatggcggc cgagagggaa cc 42
<210> 16
<211> 73
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 16
tgatagcggc cgctcatcaa gcgtaatctg gaacatcgta tgggtactcg agatccactt 60
cgattgattc tgg 73
<210> 17
<211> 40
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 17
ggcgcggatc cgccgccacc atgattttga atagcctctc 40
<210> 18
<211> 74
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 18
tgatagcggc cgctcatcaa gcgtaatctg gaacatcgta tgggtactcg aggccattgt 60
attgctgctg gtag 74
<210> 19
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 19
aaaacccaca gcctcatgtc 20
<210> 20
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 20
cacctggtcc ttgttctggt 20
<210> 21
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 21
ggtttcccat tatgccattg 20
<210> 22
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 22
ttccaagaca tgtgcagctc 20
<210> 23
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 23
ccgacaggat gttgacaatg 20
<210> 24
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 24
tcagagaggt cggcaaactt 20
<210> 25
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 25
ggatgctgcc tttaattgga 20
<210> 26
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 26
cgcacccttg aagaagtagc 20
<210> 27
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 27
caaactctac ggcttctgcc 20
<210> 28
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 28
tggcaccgat gaatgatcta 20
<210> 29
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 29
aagcagttgc actgtgatgc 20
<210> 30
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 30
gcagtgaggg caagaaaaag 20
<210> 31
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 31
caaggccttc aactgcaaat 20
<210> 32
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 32
aaggttcggg aacaggtctt 20
<210> 33
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 33
ctgaagtagc ttccccagg 19
<210> 34
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 34
tgttgatgag tgagtccact g 21
<210> 35
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 35
acacggatcc cagagcagc 19
<210> 36
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 36
tgcagcgata aaacaaaagg c 21
<210> 37
<211> 15
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 37
gcccctgcac cgtgg 15
<210> 38
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 38
tctctgaccc tccagccaat 20
<210> 39
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 39
gcgacggttc tttctaggga 20
<210> 40
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 40
tccccttgag gaaatgggag 20
<210> 41
<211> 17
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 41
ccagggacag tcccccc 17
<210> 42
<211> 17
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 42
gcgtcgggtt ccgagat 17
<210> 43
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 43
ggtgggcatg agatgcact 19
<210> 44
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 44
caccaccgcc agtctgtctt 20
<210> 45
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 45
gagggcctgt ggatgaactg 20
<210> 46
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 46
agtcgtacac cttgcactgc a 21
<210> 47
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 47
tccaccacct cgcatatctc t 21
<210> 48
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 48
gccatttagg gcctcactgg a 21
<210> 49
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 49
ccagaaggtt cctttgtgga 20
<210> 50
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 50
ggctggtgtt tgacttggtt 20
<210> 51
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 51
ggtggccctg tccttaaag 19
<210> 52
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 52
cgtacccgtc ccttcctcc 19
<210> 53
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 53
aagtgtgctc tggggtcaag 20
<210> 54
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 54
agcctttgtc cgtgaggtaa 20
<210> 55
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 55
agctcaactt tctggcgaag 20
<210> 56
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 56
cttcacgacg atgtcattgc 20
<210> 57
<211> 17
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 57
ccatttaaag atctccg 17
<210> 58
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 58
catttggaag tcatgttcg 19
<210> 59
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 59
aggacgaggg gagctatgac c 21
<210> 60
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 60
gtgggggcct tctgataag 19
<210> 61
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 61
atcccagagg ctccaaagat 20
<210> 62
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 62
gctggagctt ctctgctgtt 20
<210> 63
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 63
gacctttgag tgtggggtgt 20
<210> 64
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 64
tcttccgagc attcacactg 20
<210> 65
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 65
acagtcccaa gaaacggatg 20
<210> 66
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 66
ccttcaccgt gtagcggtat 20
<210> 67
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 67
aagcccatct ccacacactc 20
<210> 68
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 68
aggagaaggg gctctcagtc 20
<210> 69
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 69
tgagaccagg cagtgaacag 20
<210> 70
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 70
ccgagaggtc catgaggtaa 20
<210> 71
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 71
cgtgacttcc gtcttggatt 20
<210> 72
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 72
cctttctggg tggatgctaa 20
<210> 73
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 73
atttggaaac tgccacaagc 20
<210> 74
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 74
atttggaaac tgccacaagc 20
<210> 75
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 75
catctaccac agcagctcca 20
<210> 76
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 76
ctcctcccca tggattacct 20
<210> 77
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 77
gacgacacgg aggactttgt 20
<210> 78
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 78
tgtctgagcc attgaggatg 20
<210> 79
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 79
agtggagctg tggttttgct 20
<210> 80
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 80
agaccttccc cgtcaaaaat 20
<210> 81
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 81
tccaggtgga gcttcttttg 20
<210> 82
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 82
ttcttagagt gacctggaga cc 22
<210> 83
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 83
aacatcatcc ctgcttccac 20
<210> 84
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 84
gaccacctgg tcctcagtgt 20
<210> 85
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 85
acagtggcca cctacaaagg 20
<210> 86
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 86
ccgagatggg gttgataatg 20
<210> 87
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 87
aaaatggcag tgcgtttag 19
<210> 88
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 88
tttgaaggca gtctgtcgta 20
<210> 89
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 89
cgtggctaca tctcccattt 20
<210> 90
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 90
tccctcatga ccaggatctc 20
<210> 91
<211> 14
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 91
gaccccttca ttga 14
<210> 92
<211> 14
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 92
cttctccatg gtgg 14
<210> 93
<211> 6891
<212> DNA
<213> 智人
<400> 93
aactgcattt cccagcgccc cacgcggcgg cggccgtaaa gcgcggcggt cgaacggccg 60
gttccggctg aatgtcagtg ctgggctgtg ggccggggag gaaggcggct cgcggttcct 120
ccaccgcctc cgccgccgca tcctccgctt gtgctaccgc cgcgggcgct gggccgctct 180
gctggtccgg catgagaccg tgagacgaga gacgggtcgg ggccgccgac atgtttggcc 240
gctcgcggag ctgggtgggc gggggccatg gcaagacttc ccgcaacatc cactccttgg 300
accacctcaa gtatctgtac cacgttttga ccaaaaacac cacagtcaca gaacagaacc 360
ggaacctgct agtggagacc atccgttcca tcactgagat cctgatctgg ggagatcaaa 420
atgacagctc tgtatttgac ttcttcctgg agaagaatat gtttgttttc ttcttgaaca 480
tcttgcggca aaagtcgggc cgttacgtgt gcgttcagct gctgcagacc ttgaacatcc 540
tctttgagaa catcagtcac gagacctcac tttattattt gctctcaaat aactacgtaa 600
attctatcat cgttcataaa tttgactttt ctgatgagga gattatggcc tattatatat 660
cgttcctgaa aacactttcg ttaaaactca acaaccacac tgtccatttc ttttataatg 720
agcacaccaa tgactttgcc ctgtacacag aagccatcaa gtttttcaac caccctgaaa 780
gcatggttag aattgctgta agaaccataa ctttgaatgt ctataaagtg tcattggata 840
accaggccat gctgcactac atccgagata aaactgctgt tccttacttc tccaatttgg 900
tctggttcat tgggagccat gtgatcgaac tcgatgactg cgtgcagact gatgaggagc 960
atcggaatcg gggtaaactg agtgatctgg tggcagagca cctagaccac ctgcactatc 1020
tcaatgacat cctgatcatc aactgtgagt tcctcaacga tgtgctcact gaccacctgc 1080
tcaacaggct cttcctgccc ctctacgtgt actcactgga gaaccaggac aagggaggag 1140
aacggccgaa aattagcctg ccggtgtctc tttatcttct gtcacaggtc ttcttaatta 1200
tacatcatgc accgctggtg aactcgttag ctgaagtcat tctgaatggt gatctgtctg 1260
agatgtacgc taagactgaa caggatattc agagaagttc tgccaagccc agcattcggt 1320
gcttcattaa acccaccgag acactcgagc ggtcccttga gatgaacaag cacaagggca 1380
agaggcgggt gcaaaagaga cccaactaca aaaacgttgg ggaagaagaa gatgaggaga 1440
aagggcccac cgaggatgcc caagaagacg ccgagaaggc taaaggtaca gagggtggtt 1500
caaaaggcat caagacgagt ggggagagtg aagagatcga gatggtgatc atggagcgta 1560
gcaagctctc agagctggcc gccagcacct ccgtgcagga gcagaacacc acggacgagg 1620
agaaaagcgc cgccgccacc tgctctgaga gcacgcaatg gagcagaccc ttcctggata 1680
tggtgtacca cgcgctggac agcccggatg atgattacca tgccctgttc gtgctctgcc 1740
tcctctatgc catgtctcat aataaaggca tggatcctga aaaattagag cgaatccagc 1800
tccccgtgcc aaatgcggcc gagaagacca cctacaacca cccgctagct gaaagactca 1860
tcaggatcat gaacaacgct gcccagccag atgggaagat ccggctggcg acgctggagc 1920
tgagctgcct gcttctgaag cagcaagtcc tgatgagtgc tggctgcatc atgaaggacg 1980
tgcacctggc ctgcctggag ggtgcgagag aagaaagtgt tcaccttgta cgacattttt 2040
ataagggaga agacattttt ttggacatgt ttgaagatga gtataggagc atgacaatga 2100
agcccatgaa cgtggaatat ctcatgatgg acgcctccat cctgctgccc ccaacaggca 2160
cgccactgac gggcattgac ttcgtgaagc ggctgccgtg tggcgatgtg gagaagaccc 2220
ggcgggccat ccgggtgttc ttcatgctgc gttccctgtc actgcaattg cgaggggagc 2280
ctgagacaca gttgccgctg actcgggagg aggacctgat caagactgat gatgtcctgg 2340
atctgaataa cagcgacttg attgcatgta cagtgatcac caaggatggc ggcatggtcc 2400
agcgattcct ggctgtggat atttaccaga tgagtttggt ggagcctgat gtgtccaggc 2460
ttggctgggg agtggtcaag tttgcaggcc tattgcagga catgcaggtg actggcgtgg 2520
aggacgacag ccgtgccctg aacatcacca tccacaagcc tgcgtccagc ccccattcca 2580
agcccttccc catcctccag gccaccttca tcttctcaga ccacatccgc tgcatcatcg 2640
ccaagcagcg cctggccaaa ggccgcatcc aggcaaggcg catgaagatg cagagaatag 2700
ctgccctcct ggacctccca atccagccca ccactgaagt cctggggttt ggactcggct 2760
cctccacctc cactcagcac ctgcctttcc gcttctacga ccaggggcgc cggggcagca 2820
gcgaccccac agtgcagcgc tccgtgtttg catcggtgga caaggtgcca ggcttcgccg 2880
tggcccagtg cataaaccag cacagctccc cgtccctgtc ctcacagtcg ccaccctccg 2940
ccagcgggag ccccagcggc agcgggagca ccagccactg cgactctgga ggcaccagct 3000
cgtcctccac cccctccaca gcccagagtc cagcagatgc ccccatgagt ccagaactgc 3060
ctaagcctca ccttcctgac cagttggtaa tcgtcaacga aacggaagca gactctaagc 3120
ccagcaagaa cgtggccagg agcgcagccg tggagacagc cagcctgtcc cccagcctcg 3180
tccctgcccg gcagcccacc atttccctgc tctgcgagga cacggctgac acgctgagcg 3240
tcgaatcgct gacccttgtc cccccagttg acccccacag cctccgcagc ctcaccggca 3300
tgcccccgct gtccacgccg gctgccgcct gcacagagcc cgtgggcgaa gaggctgcat 3360
gtgctgagcc tgtgggcacc gctgaggact gagtcagtgc cggggcctcc ctttgtgtgt 3420
gtggccccgc tggtagggac cccagtgccg ctgactggca agacacactg ggagcaccca 3480
ccattctgtg cggcccccag cagccatctc aaccacctat ccctgcgctc ccttgaatgg 3540
gaagaagccc cacgttgtcc ttgaattcct ttttcacttt gcatctcttc acgtgcaggc 3600
tgggaccagc ggagacaccg cggcgaatgc agatgactgc accggccact cagggagctg 3660
cctgggctcc gtgtctctga gccccgggtg gcaggaccca ccggcacctc tttcttcctc 3720
tgtcatatgg ctcctctgtc accagcccca gtgtgcacag aagaattgga ccaggtcact 3780
gtacgtagaa atttgtagaa aagcagactt agataaacat ctcctttgga tatttatttc 3840
cgcttttggc agcaggtgaa catttatttt taaaacttct atttaaaaga agtccaaaaa 3900
catcaacact aaggtttgat gtcatgtgaa aagtgtaata ataacagtta agatttcatg 3960
atcattttca ctggaccttt cctgatattt tgtttcagag ttcttagtgt ggctttttcc 4020
atttatttaa gtgattcttt gttactcact aactctgcaa gcctgtggaa taatgaagta 4080
ccttcctgga aagtttggat tattttttaa acaaaaacaa gggagataca tgtattctca 4140
ggtacacaca gagctgagag ggctgaatgg ttttctgcta tagcagccga gaggcctccc 4200
atcatggaaa gatttctcca ggaaaaggag gaatgtagcc agctccccac tcaggacgct 4260
tcctcatttc tcttcaccaa aaccaaacag agacagcttc cagcaccttc ttcagtgtta 4320
ccatctctaa gaaggaacca gttgggaccg tgaagactcc cgaccctgtg gccatgatgg 4380
aaatcaaagg aagacaccct ctacgtcacc tgccctcgac tgtgtgtgcc cacatgtgcc 4440
gagagatggc ccagagccag ttcccctcca gctgcaaggg catggtgtcc ccagagctct 4500
gagtctgtca ctctccctct gctactgctg ctgatctgaa tatggaaacc ccatggttcc 4560
cttccccatt cggactgggt gtgtacaagc aaggacccag atgcatcaga cacagccccc 4620
aagatgttcc tttctactcg gccagctcgg gagccagaca cagcactcac agcccaggcc 4680
gtgatccacc ctccccaagt ccaccagggc cagcggcccc tcacctctct ggtcactggt 4740
gagaccttcc acaactttcc tccagacctg ccagcagatg tgcccaccag gggcattagg 4800
tatccgccgg agcctggcca tagggtagtc tcgggagccg cgctgagatc ttttgccacc 4860
tgcattttag aagaacatgg tctctgtctc ctcggcccag ccagctgtcc cggcaaggcc 4920
tgccgagggc agttttcaac ctcatgaagg aaacacagtc ctgccaagga gggggagtgg 4980
cgcccatggg gacaggcctc agtccttaga agccctctgg gtagctgtgc ccacccagcc 5040
ttcatggctg caggtacaag gacctttgct tccatagaga aaacgcacag ctcagaaagg 5100
gggccacatg ggcagaaacc caaaggaagg acaaaccacg accaccgtgg ccatctgcag 5160
aatccctgga agagaaggaa ggcagggtgg agcgggggga agaccatcat ggagagaagg 5220
accacagcat caggagacgg gacacgccac acccagcagg cagcctgtgt gttgcttaat 5280
tttttaagag caagaggggt agagaggatc aagctggccc tggctggaga tggctagccc 5340
ctgagacatg cacttctggt tttgaaatga ctctgtctgt ggggcagcag aaactagaga 5400
aggcaagtgg ctgccccacc ccaaggcgtg accaggagga acagcctgca gctcactcca 5460
tgccacacgg gtgggccacc agcctgctgt cagaagtctc tgggctccaa ctggtcttgt 5520
aaccactgag cactgaagga gagaggtctt ggtcagggct ggacagcatg cccgggagga 5580
ccagcagagg attaaaggtg actgggagga ccagcggagg ataaaagaca ctgctcaggg 5640
cagggcttct accctgcatc cctggccaag aaaagggcag tccccatgtg ggcttgcagg 5700
gtcactctca ggggcctctt tcagctgggg ctggcaactt gcgtctgggg gacacctcca 5760
ggtgtgtggg gtgaggattt cctataacca gggctcccag aagctttgct tatgtaagga 5820
ggtctgggag ccagcccatt ggaggccacc agccattttg gcttcaaagg accccacctc 5880
acccaggtct cagcggcagt gggcacagct atgtcttcag gagctcccgt caaacctcat 5940
agctggggcg ctcccagaca ggccagtcca gacaggacac gctgggcccc tggcatccag 6000
aggaagagcc aggagtgtgg gaaggcccac agtgggggct gtggcttctg acactcaggt 6060
catagcctca gaggtctgag gtcagccccc acagacccat ccggcccgcc ccccaagtcc 6120
ctgcagagag cacttagagt tatggcccag gccctggtcc acccttcccc tgtgcacctc 6180
cggctgggtt tgccaagtca gggagcaggg ctggccgcag gaactcccaa accttggctt 6240
tgaatattgt tgtggaggtg tgctcgtccc tttctggacg tgcaaggtac ctgtcccagc 6300
aggtcagatg gggccagctg aggcgctccc ccaggcagga agggccagcc ttcaccatcg 6360
cgtgggattg ggaggagggg cctccgtgag cagcccctcc tctgccgctg tcccagccca 6420
gtccctctcc cggagccttg gcagcctccc acaacccaga cacttgcgtt cacaagcaac 6480
ctaaggggca ggtgaagaag cgcagccctg ccagacgcgc tagattcctc taaggtctct 6540
gagatgcacc gttttttaaa aaggcgtggg gtgaactgat tttgatcttc ttgtctagat 6600
gcaataaata aatctgaagc atttaatgta gtcatcttga cattgggcct acactgtacg 6660
agttccttat gtttccttga gctaaaaata tgtaaataat ttttgtccca gtgagaaccg 6720
agggttagaa aacctcgatg cctctgagcc tcgggaccgc tctagggaag tacctgcttt 6780
cgccagcatg actcatgctt cgtgggtact gaacacgagg gtggaaatga aaactggaac 6840
ttccttgtaa atttaaactt ggcaataaaa gagaaaaaaa gttaccaaga a 6891
<210> 94
<211> 1053
<212> PRT
<213> 智人
<400> 94
Met Phe Gly Arg Ser Arg Ser Trp Val Gly Gly Gly His Gly Lys Thr
1 5 10 15
Ser Arg Asn Ile His Ser Leu Asp His Leu Lys Tyr Leu Tyr His Val
20 25 30
Leu Thr Lys Asn Thr Thr Val Thr Glu Gln Asn Arg Asn Leu Leu Val
35 40 45
Glu Thr Ile Arg Ser Ile Thr Glu Ile Leu Ile Trp Gly Asp Gln Asn
50 55 60
Asp Ser Ser Val Phe Asp Phe Phe Leu Glu Lys Asn Met Phe Val Phe
65 70 75 80
Phe Leu Asn Ile Leu Arg Gln Lys Ser Gly Arg Tyr Val Cys Val Gln
85 90 95
Leu Leu Gln Thr Leu Asn Ile Leu Phe Glu Asn Ile Ser His Glu Thr
100 105 110
Ser Leu Tyr Tyr Leu Leu Ser Asn Asn Tyr Val Asn Ser Ile Ile Val
115 120 125
His Lys Phe Asp Phe Ser Asp Glu Glu Ile Met Ala Tyr Tyr Ile Ser
130 135 140
Phe Leu Lys Thr Leu Ser Leu Lys Leu Asn Asn His Thr Val His Phe
145 150 155 160
Phe Tyr Asn Glu His Thr Asn Asp Phe Ala Leu Tyr Thr Glu Ala Ile
165 170 175
Lys Phe Phe Asn His Pro Glu Ser Met Val Arg Ile Ala Val Arg Thr
180 185 190
Ile Thr Leu Asn Val Tyr Lys Val Ser Leu Asp Asn Gln Ala Met Leu
195 200 205
His Tyr Ile Arg Asp Lys Thr Ala Val Pro Tyr Phe Ser Asn Leu Val
210 215 220
Trp Phe Ile Gly Ser His Val Ile Glu Leu Asp Asp Cys Val Gln Thr
225 230 235 240
Asp Glu Glu His Arg Asn Arg Gly Lys Leu Ser Asp Leu Val Ala Glu
245 250 255
His Leu Asp His Leu His Tyr Leu Asn Asp Ile Leu Ile Ile Asn Cys
260 265 270
Glu Phe Leu Asn Asp Val Leu Thr Asp His Leu Leu Asn Arg Leu Phe
275 280 285
Leu Pro Leu Tyr Val Tyr Ser Leu Glu Asn Gln Asp Lys Gly Gly Glu
290 295 300
Arg Pro Lys Ile Ser Leu Pro Val Ser Leu Tyr Leu Leu Ser Gln Val
305 310 315 320
Phe Leu Ile Ile His His Ala Pro Leu Val Asn Ser Leu Ala Glu Val
325 330 335
Ile Leu Asn Gly Asp Leu Ser Glu Met Tyr Ala Lys Thr Glu Gln Asp
340 345 350
Ile Gln Arg Ser Ser Ala Lys Pro Ser Ile Arg Cys Phe Ile Lys Pro
355 360 365
Thr Glu Thr Leu Glu Arg Ser Leu Glu Met Asn Lys His Lys Gly Lys
370 375 380
Arg Arg Val Gln Lys Arg Pro Asn Tyr Lys Asn Val Gly Glu Glu Glu
385 390 395 400
Asp Glu Glu Lys Gly Pro Thr Glu Asp Ala Gln Glu Asp Ala Glu Lys
405 410 415
Ala Lys Gly Thr Glu Gly Gly Ser Lys Gly Ile Lys Thr Ser Gly Glu
420 425 430
Ser Glu Glu Ile Glu Met Val Ile Met Glu Arg Ser Lys Leu Ser Glu
435 440 445
Leu Ala Ala Ser Thr Ser Val Gln Glu Gln Asn Thr Thr Asp Glu Glu
450 455 460
Lys Ser Ala Ala Ala Thr Cys Ser Glu Ser Thr Gln Trp Ser Arg Pro
465 470 475 480
Phe Leu Asp Met Val Tyr His Ala Leu Asp Ser Pro Asp Asp Asp Tyr
485 490 495
His Ala Leu Phe Val Leu Cys Leu Leu Tyr Ala Met Ser His Asn Lys
500 505 510
Gly Met Asp Pro Glu Lys Leu Glu Arg Ile Gln Leu Pro Val Pro Asn
515 520 525
Ala Ala Glu Lys Thr Thr Tyr Asn His Pro Leu Ala Glu Arg Leu Ile
530 535 540
Arg Ile Met Asn Asn Ala Ala Gln Pro Asp Gly Lys Ile Arg Leu Ala
545 550 555 560
Thr Leu Glu Leu Ser Cys Leu Leu Leu Lys Gln Gln Val Leu Met Ser
565 570 575
Ala Gly Cys Ile Met Lys Asp Val His Leu Ala Cys Leu Glu Gly Ala
580 585 590
Arg Glu Glu Ser Val His Leu Val Arg His Phe Tyr Lys Gly Glu Asp
595 600 605
Ile Phe Leu Asp Met Phe Glu Asp Glu Tyr Arg Ser Met Thr Met Lys
610 615 620
Pro Met Asn Val Glu Tyr Leu Met Met Asp Ala Ser Ile Leu Leu Pro
625 630 635 640
Pro Thr Gly Thr Pro Leu Thr Gly Ile Asp Phe Val Lys Arg Leu Pro
645 650 655
Cys Gly Asp Val Glu Lys Thr Arg Arg Ala Ile Arg Val Phe Phe Met
660 665 670
Leu Arg Ser Leu Ser Leu Gln Leu Arg Gly Glu Pro Glu Thr Gln Leu
675 680 685
Pro Leu Thr Arg Glu Glu Asp Leu Ile Lys Thr Asp Asp Val Leu Asp
690 695 700
Leu Asn Asn Ser Asp Leu Ile Ala Cys Thr Val Ile Thr Lys Asp Gly
705 710 715 720
Gly Met Val Gln Arg Phe Leu Ala Val Asp Ile Tyr Gln Met Ser Leu
725 730 735
Val Glu Pro Asp Val Ser Arg Leu Gly Trp Gly Val Val Lys Phe Ala
740 745 750
Gly Leu Leu Gln Asp Met Gln Val Thr Gly Val Glu Asp Asp Ser Arg
755 760 765
Ala Leu Asn Ile Thr Ile His Lys Pro Ala Ser Ser Pro His Ser Lys
770 775 780
Pro Phe Pro Ile Leu Gln Ala Thr Phe Ile Phe Ser Asp His Ile Arg
785 790 795 800
Cys Ile Ile Ala Lys Gln Arg Leu Ala Lys Gly Arg Ile Gln Ala Arg
805 810 815
Arg Met Lys Met Gln Arg Ile Ala Ala Leu Leu Asp Leu Pro Ile Gln
820 825 830
Pro Thr Thr Glu Val Leu Gly Phe Gly Leu Gly Ser Ser Thr Ser Thr
835 840 845
Gln His Leu Pro Phe Arg Phe Tyr Asp Gln Gly Arg Arg Gly Ser Ser
850 855 860
Asp Pro Thr Val Gln Arg Ser Val Phe Ala Ser Val Asp Lys Val Pro
865 870 875 880
Gly Phe Ala Val Ala Gln Cys Ile Asn Gln His Ser Ser Pro Ser Leu
885 890 895
Ser Ser Gln Ser Pro Pro Ser Ala Ser Gly Ser Pro Ser Gly Ser Gly
900 905 910
Ser Thr Ser His Cys Asp Ser Gly Gly Thr Ser Ser Ser Ser Thr Pro
915 920 925
Ser Thr Ala Gln Ser Pro Ala Asp Ala Pro Met Ser Pro Glu Leu Pro
930 935 940
Lys Pro His Leu Pro Asp Gln Leu Val Ile Val Asn Glu Thr Glu Ala
945 950 955 960
Asp Ser Lys Pro Ser Lys Asn Val Ala Arg Ser Ala Ala Val Glu Thr
965 970 975
Ala Ser Leu Ser Pro Ser Leu Val Pro Ala Arg Gln Pro Thr Ile Ser
980 985 990
Leu Leu Cys Glu Asp Thr Ala Asp Thr Leu Ser Val Glu Ser Leu Thr
995 1000 1005
Leu Val Pro Pro Val Asp Pro His Ser Leu Arg Ser Leu Thr Gly
1010 1015 1020
Met Pro Pro Leu Ser Thr Pro Ala Ala Ala Cys Thr Glu Pro Val
1025 1030 1035
Gly Glu Glu Ala Ala Cys Ala Glu Pro Val Gly Thr Ala Glu Asp
1040 1045 1050
<210> 95
<211> 5197
<212> DNA
<213> 智人
<400> 95
ggcgggatcg gggaaggagg ggccccgccg cctagagggt ggagggaggg cgcgcagtcc 60
cagcccagag cttcaaaaca gcccggcggc ctcgcctcgc acccccagcc agtccgtcga 120
tccagctgcc agcgcagccg ccagcgccgg cacatcccgc tctgggcttt aaacgtgacc 180
cctcgcctcg actcgccctg ccctgtgaaa atgttggtgc ttcttgcttt catcatcgcc 240
ttccacatca cctctgcagc cttgctgttc attgccaccg tcgacaatgc ctggtgggta 300
ggagatgagt tttttgcaga tgtctggaga atatgtacca acaacacgaa ttgcacagtc 360
atcaatgaca gctttcaaga gtactccacg ctgcaggcgg tccaggccac catgatcctc 420
tccaccattc tctgctgcat cgccttcttc atcttcgtgc tccagctctt ccgcctgaag 480
cagggagaga ggtttgtcct aacctccatc atccagctaa tgtcatgtct gtgtgtcatg 540
attgcggcct ccatttatac agacaggcgt gaagacattc acgacaaaaa cgcgaaattc 600
tatcccgtga ccagagaagg cagctacggc tactcctaca tcctggcgtg ggtggccttc 660
gcctgcacct tcatcagcgg catgatgtac ctgatactga ggaagcgcaa atagagttcc 720
ggagctgggt tgcttctgct gcagtacaga atccacattc agataaccat tttgtatata 780
atcattattt tttgaggttt ttctagcaaa cgtattgttt cctttaaaag ccaaaaaaaa 840
aaaaaaaaaa aaaaaaaaaa gaaaaaagaa aaaaaaaatc caaaagagag aagagttttt 900
gcattcttga gatcagagaa tagactatga aggctggtat tcagaactgc tgcccactca 960
aaagtctcaa caagacacaa gcaaaaatcc agcaatgctc aaatccaaaa gcactcggca 1020
ggacatttct taaccatggg gctgtgatgg gaggagagga gaggctggga aagccgggtc 1080
tctggggacg tgcttcctat gggtttcagc tggcccaagc ccctcccgaa tctctctgct 1140
agtggtgggt ggaagagggt gaggtggggt ataggagaag aatgacagct tcctgagagg 1200
tttcacccaa gttccaagtg agaagcaggt gtagtccctg gcattctgtc tgtatccaaa 1260
ccagagccca gccatccctc cggtatcggg gtgggtcaga aaaagtctca cctcaatttg 1320
ccgacagtgt cacctgcttg ccttaggaat ggtcatcctt aacctgcgtg ccagatttag 1380
actcgtcttt aggcaaaacc tacagcgccc cccccctcac cccagaccta cagaatcaga 1440
gtcttcaagg gatggggcca gggaatctgc atttctaacg cgctccctgg gcaacgcttc 1500
agatgcgttg aagttgggga ccacggtgcc tgggccaggt cagcagagct gcctcgtaaa 1560
tgctggggta tcgtcatgtg gagatgggga ggtgaatgca acccccacag caggccaaaa 1620
ccttggcctc catcgccaca gctgtctaca tctagggccc caaaactcca ttcctgagcc 1680
atgtgaactc atagacacct tcagggtgtg gggtacagcc tccttcccat cttatcccag 1740
aaggcctctc ccttcttgtc cagcccttca tgctacacct ggctggcctc tcacccctat 1800
ttctagagcc tcagaggacc catccaccat tcattcattc attcattcat tcattcattc 1860
attcattcat caacataaat cataacttgc atgcatgtgc caggcacagg ggataccctc 1920
tagagacaat ctcctcctag ggctcatggc ctagtggagg agacagatta aaacttaatt 1980
agaaaaactg gctgggtaca gtggctcatg cttgtaatcc cagcactttg ggaggctgag 2040
gcgggtggat cacctgaggt caggagttca agaccagcct ggccaaaatg gtaaaacctg 2100
tctctactaa aaatacaaaa atgagctggg cgtggtggtg catgcctgta atcccagcta 2160
tcaggtggct gaggcaggag aatcacttga aatgggaggt ggaggttgca gtgagccgag 2220
accgtgccac tgcactccag cctgggtgac agagtgagac tccatctcaa aaaaagaaaa 2280
aaaagaaaag aaactaatta cacactgtga tggaggctgc aaagaacacc actaagaatt 2340
caaaatcagc tgggtgcggt ggctcacacc tgtaatccca gcactttggg aggctgaggc 2400
aggtggatca caaggtcagg agttcaagac cagcctggcc aacatggtga aaccccgtct 2460
ctaccgaaaa tacaacaaaa ttagcccggt gtggtggcag gtgcctgtaa tcccagctac 2520
ttaggaggct gaggcaggag aatcgcttga aactgggagg cggaggtcgc agtgagccga 2580
gattcaccac tgcactccag cccaggcgac agtctgagac tccgtctcaa aaataaaacg 2640
attcaaaatc gaggcctgtg gcatggtagg gaggctgctt tacgcgtgcc tattattaaa 2700
tgctcctgga ggcatttagg tatttagatc agtctaaata tagctccatt cagttcgtgc 2760
agatgacagt tattgggcag tacctgtctg tgtaacaccc agaaaacatg tctgtggagg 2820
ggcccatggt cccgacagta aatgcggtga gagggtccca tagagctgga gttttcaagc 2880
tttaggggtt cccgtgctgc ttgggacagg ctgattcaga gggtctgggt gaatgatttc 2940
caggtgattt taagactgtg ctgagaaata gggcttttgg ggccttgtcc ttcaggatca 3000
aagcatgatg ctgtgtggca atgcagacca cccaggaacc atcccaggag ataagctctt 3060
tgcacctcat tgtctttttc tgcttatgtt ggagcaggat gctgggggct gtcctgggat 3120
ggggtgtggg acctcgtgct atttaaatac ttttgcactt gaccttctgc tgagtggagt 3180
ggtggtttgc catcagctca gttccagtgg agctgaagag acatctggtt tgagtagttt 3240
tagggccacc atggatatct cttcaatgca ggattggctc tttccatctg ctctttcatt 3300
catttgtttt tgacagatag tattaaatgt ttaccatgtt ccaggcactg tgtgaggctc 3360
tgaaaataca ggggtgagca aatccagata tcctccctgc catcatgaag tttggagtct 3420
atgagatagg accccctccc tatggagaag ccaccaatgc agtacagggt gacctggggc 3480
cagagacagg acaaatgtca cctcctgcct ccatgagata ctctcactag tcatattgtg 3540
ggcaagaatg tggcttacac ccctagggtt aacaggatgc tacccaagct catggaggaa 3600
gttgaatctt aagttccctt gaaactttct accttggtgg cttttctata attttctttt 3660
ttctttttct tttttttttt tttttttgag actgagtttt gctcttgttg cccaggctgg 3720
agtgcagtgg caccatcttg gctcaccgca acctctgcct cctgggttca agtgattctc 3780
ctgcctcagc ctcccgagta gctgggatta caggcatgtc ccaccatgcc cagctaattt 3840
ttgtattttt agtagagatg gggtttctcc atgttggtca ggctggtttc gaactcccaa 3900
cctcaggtga tccgcccacc tcagccttcc aaagtgctgg gattacaggc atgagccact 3960
gcgtctggcc ttctataatt ttctggtagt cacgatggaa acaaacaaaa caccttagaa 4020
ccagagatcg accccctcaa gcaatacatc aattcccttc acaagaaacg tcggggctac 4080
atgagtatct gtgttgaatg cggtctgaaa tgatcctatg gattttcccg gctggttgcc 4140
actgctgtac aacattcagt gcccacatcc acctgtgcca ttaagctttt ttgagacatg 4200
agagatgcct cttccctgct gtatgacatg catttgggaa gttggaaaga aatgacaaaa 4260
tcagggagaa aacatccaag cttcttacct gtagatagaa tcagccctca cttggtgctt 4320
attaccagtt attcaagaac aataacaaca acaaaattag tagacatcca agaagcacat 4380
attaggacca aagatagcat caactgtatt tgaaggaact gtagtttgcg cattttatga 4440
catttttata aagtactgta attctttcat tgaggggcta tgtgatggag acagactaac 4500
tcattttgtt atttgcatta aaattatttt gggtctctgt tcaaatgagt ttggagaatg 4560
cttgacttgt tggtctgtgt gaatgtgtat atatatatac ctgaatacag gaacatcgga 4620
gacctattca ctcccacaca ctctgctata gtttgcgtgc ttttgtggac acccctcatg 4680
aacaggctgg cgctctagga cgctctgtgt tcactgatga tgaagaaacc tagaactcca 4740
agcctgtttg taaacacact aaacacagtg gcctagatag aaactgtatc gtagtttaaa 4800
atctgcctcg cgggatgtta ctaaactcgc taatagttta aaggttactt acaatagagc 4860
aagttggaca attttgtggt gttggggaaa tgttagggca aggcctagag gttcattttg 4920
aatcttggtt tgtgacttta gggtagttag aaactttcta cttaatgtac ctttaaaata 4980
gtccattttc tatgttttgt ataatctgaa actgtacatg gaaaataaag tttaaaacca 5040
gattgcccag agcaagactc taatgttccc aacggtgatg acatctaggg cagaatgctg 5100
ccattttgag gggcaggggg tcagctgatt tctcatcaag ataataatgt atggttttta 5160
cactaagcaa ctgataaatg gacaatttat cactgga 5197
<210> 96
<211> 167
<212> PRT
<213> 智人
<400> 96
Met Leu Val Leu Leu Ala Phe Ile Ile Ala Phe His Ile Thr Ser Ala
1 5 10 15
Ala Leu Leu Phe Ile Ala Thr Val Asp Asn Ala Trp Trp Val Gly Asp
20 25 30
Glu Phe Phe Ala Asp Val Trp Arg Ile Cys Thr Asn Asn Thr Asn Cys
35 40 45
Thr Val Ile Asn Asp Ser Phe Gln Glu Tyr Ser Thr Leu Gln Ala Val
50 55 60
Gln Ala Thr Met Ile Leu Ser Thr Ile Leu Cys Cys Ile Ala Phe Phe
65 70 75 80
Ile Phe Val Leu Gln Leu Phe Arg Leu Lys Gln Gly Glu Arg Phe Val
85 90 95
Leu Thr Ser Ile Ile Gln Leu Met Ser Cys Leu Cys Val Met Ile Ala
100 105 110
Ala Ser Ile Tyr Thr Asp Arg Arg Glu Asp Ile His Asp Lys Asn Ala
115 120 125
Lys Phe Tyr Pro Val Thr Arg Glu Gly Ser Tyr Gly Tyr Ser Tyr Ile
130 135 140
Leu Ala Trp Val Ala Phe Ala Cys Thr Phe Ile Ser Gly Met Met Tyr
145 150 155 160
Leu Ile Leu Arg Lys Arg Lys
165
<210> 97
<211> 1521
<212> DNA
<213> 智人
<400> 97
atgtttggcc gctcgcggag ctgggtgggc gggggccatg gcaagacttc ccgcaacatc 60
cactccttgg accacctcaa gtatctgtac cacgttttga ccaaaaacac cacagtcaca 120
gaacagaacc ggaacctgct agtggagacc atccgttcca tcactgagat cctgatctgg 180
ggagatcaaa atgacagctc tgtatttgac ttcttcctgg agaagaatat gtttgttttc 240
ttcttgaaca tcttgcggca aaagtcgggc cgttacgtgt gcgttcagct gctgcagacc 300
ttgaacatcc tctttgagaa catcagtcac gagacctcac tttattattt gctctcaaat 360
aactacgtaa attctatcat cgttcataaa tttgactttt ctgatgagga gattatggcc 420
tattatatat cgttcctgaa aacactttcg ttaaaactca acaaccacac tgtccatttc 480
ttttataatg agcacaccaa tgactttgcc ctgtacacag aagccatcaa gtttttcaac 540
caccctgaaa gcatggttag aattgctgta agaaccataa ctttgaatgt ctataaagtg 600
tcattggata accaggccat gctgcactac atccgagata aaactgctgt tccttacttc 660
tccaatttgg tctggttcat tgggagccat gtgatcgaac tcgatgactg cgtgcagact 720
gatgaggagc atcggaatcg gggtaaactg agtgatctgg tggcagagca cctagaccac 780
ctgcactatc tcaatgacat cctgatcatc aactgtgagt tcctcaacga tgtgctcact 840
gaccacctgc tcaacaggct cttcctgccc ctctacgtgt actcactgga gaaccaggac 900
aagggaggag aacggccgaa aattagcctg ccggtgtctc tttatcttct gtcacagcac 960
atcccgctct gggctttaaa cgtgacccct cgcctcgact cgccctgccc tgtgaaaatg 1020
ttggtgcttc ttgctttcat catcgccttc cacatcacct ctgcagcctt gctgttcatt 1080
gccaccgtcg acaatgcctg gtgggtagga gatgagtttt ttgcagatgt ctggagaata 1140
tgtaccaaca acacgaattg cacagtcatc aatgacagct ttcaagagta ctccacgctg 1200
caggcggtcc aggccaccat gatcctctcc accattctct gctgcatcgc cttcttcatc 1260
ttcgtgctcc agctcttccg cctgaagcag ggagagaggt ttgtcctaac ctccatcatc 1320
cagctaatgt catgtctgtg tgtcatgatt gcggcctcca tttatacaga caggcgtgaa 1380
gacattcacg acaaaaacgc gaaattctat cccgtgacca gagaaggcag ctacggctac 1440
tcctacatcc tggcgtgggt ggccttcgcc tgcaccttca tcagcggcat gatgtacctg 1500
atactgagga agcgcaaata g 1521
<210> 98
<211> 506
<212> PRT
<213> 智人
<400> 98
Met Phe Gly Arg Ser Arg Ser Trp Val Gly Gly Gly His Gly Lys Thr
1 5 10 15
Ser Arg Asn Ile His Ser Leu Asp His Leu Lys Tyr Leu Tyr His Val
20 25 30
Leu Thr Lys Asn Thr Thr Val Thr Glu Gln Asn Arg Asn Leu Leu Val
35 40 45
Glu Thr Ile Arg Ser Ile Thr Glu Ile Leu Ile Trp Gly Asp Gln Asn
50 55 60
Asp Ser Ser Val Phe Asp Phe Phe Leu Glu Lys Asn Met Phe Val Phe
65 70 75 80
Phe Leu Asn Ile Leu Arg Gln Lys Ser Gly Arg Tyr Val Cys Val Gln
85 90 95
Leu Leu Gln Thr Leu Asn Ile Leu Phe Glu Asn Ile Ser His Glu Thr
100 105 110
Ser Leu Tyr Tyr Leu Leu Ser Asn Asn Tyr Val Asn Ser Ile Ile Val
115 120 125
His Lys Phe Asp Phe Ser Asp Glu Glu Ile Met Ala Tyr Tyr Ile Ser
130 135 140
Phe Leu Lys Thr Leu Ser Leu Lys Leu Asn Asn His Thr Val His Phe
145 150 155 160
Phe Tyr Asn Glu His Thr Asn Asp Phe Ala Leu Tyr Thr Glu Ala Ile
165 170 175
Lys Phe Phe Asn His Pro Glu Ser Met Val Arg Ile Ala Val Arg Thr
180 185 190
Ile Thr Leu Asn Val Tyr Lys Val Ser Leu Asp Asn Gln Ala Met Leu
195 200 205
His Tyr Ile Arg Asp Lys Thr Ala Val Pro Tyr Phe Ser Asn Leu Val
210 215 220
Trp Phe Ile Gly Ser His Val Ile Glu Leu Asp Asp Cys Val Gln Thr
225 230 235 240
Asp Glu Glu His Arg Asn Arg Gly Lys Leu Ser Asp Leu Val Ala Glu
245 250 255
His Leu Asp His Leu His Tyr Leu Asn Asp Ile Leu Ile Ile Asn Cys
260 265 270
Glu Phe Leu Asn Asp Val Leu Thr Asp His Leu Leu Asn Arg Leu Phe
275 280 285
Leu Pro Leu Tyr Val Tyr Ser Leu Glu Asn Gln Asp Lys Gly Gly Glu
290 295 300
Arg Pro Lys Ile Ser Leu Pro Val Ser Leu Tyr Leu Leu Ser Gln His
305 310 315 320
Ile Pro Leu Trp Ala Leu Asn Val Thr Pro Arg Leu Asp Ser Pro Cys
325 330 335
Pro Val Lys Met Leu Val Leu Leu Ala Phe Ile Ile Ala Phe His Ile
340 345 350
Thr Ser Ala Ala Leu Leu Phe Ile Ala Thr Val Asp Asn Ala Trp Trp
355 360 365
Val Gly Asp Glu Phe Phe Ala Asp Val Trp Arg Ile Cys Thr Asn Asn
370 375 380
Thr Asn Cys Thr Val Ile Asn Asp Ser Phe Gln Glu Tyr Ser Thr Leu
385 390 395 400
Gln Ala Val Gln Ala Thr Met Ile Leu Ser Thr Ile Leu Cys Cys Ile
405 410 415
Ala Phe Phe Ile Phe Val Leu Gln Leu Phe Arg Leu Lys Gln Gly Glu
420 425 430
Arg Phe Val Leu Thr Ser Ile Ile Gln Leu Met Ser Cys Leu Cys Val
435 440 445
Met Ile Ala Ala Ser Ile Tyr Thr Asp Arg Arg Glu Asp Ile His Asp
450 455 460
Lys Asn Ala Lys Phe Tyr Pro Val Thr Arg Glu Gly Ser Tyr Gly Tyr
465 470 475 480
Ser Tyr Ile Leu Ala Trp Val Ala Phe Ala Cys Thr Phe Ile Ser Gly
485 490 495
Met Met Tyr Leu Ile Leu Arg Lys Arg Lys
500 505
<210> 99
<211> 1056
<212> DNA
<213> 智人
<400> 99
atgtttggcc gctcgcggag ctgggtgggc gggggccatg gcaagacttc ccgcaacatc 60
cactccttgg accacctcaa gtatctgtac cacgttttga ccaaaaacac cacagtcaca 120
gaacagaacc ggaacctgct agtggagacc atccgttcca tcactgagat cctgatctgg 180
ggagatcaaa atgacagctc tgtatttgac ttcttcctgg agaagaatat gtttgttttc 240
ttcttgaaca tcttgcggca aaagtcgggc cgttacgtgt gcgttcagct gctgcagacc 300
ttgaacatcc tctttgagaa catcagtcac gagacctcac tttattattt gctctcaaat 360
aactacgtaa attctatcat cgttcataaa tttgactttt ctgatgagga gattatggcc 420
tattatatat cgttcctgaa aacactttcg ttaaaactca acaaccacac tgtccatttc 480
ttttataatg agcacatccc gctctgggct ttaaacgtga cccctcgcct cgactcgccc 540
tgccctgtga aaatgttggt gcttcttgct ttcatcatcg ccttccacat cacctctgca 600
gccttgctgt tcattgccac cgtcgacaat gcctggtggg taggagatga gttttttgca 660
gatgtctgga gaatatgtac caacaacacg aattgcacag tcatcaatga cagctttcaa 720
gagtactcca cgctgcaggc ggtccaggcc accatgatcc tctccaccat tctctgctgc 780
atcgccttct tcatcttcgt gctccagctc ttccgcctga agcagggaga gaggtttgtc 840
ctaacctcca tcatccagct aatgtcatgt ctgtgtgtca tgattgcggc ctccatttat 900
acagacaggc gtgaagacat tcacgacaaa aacgcgaaat tctatcccgt gaccagagaa 960
ggcagctacg gctactccta catcctggcg tgggtggcct tcgcctgcac cttcatcagc 1020
ggcatgatgt acctgatact gaggaagcgc aaatag 1056
<210> 100
<211> 351
<212> PRT
<213> 智人
<400> 100
Met Phe Gly Arg Ser Arg Ser Trp Val Gly Gly Gly His Gly Lys Thr
1 5 10 15
Ser Arg Asn Ile His Ser Leu Asp His Leu Lys Tyr Leu Tyr His Val
20 25 30
Leu Thr Lys Asn Thr Thr Val Thr Glu Gln Asn Arg Asn Leu Leu Val
35 40 45
Glu Thr Ile Arg Ser Ile Thr Glu Ile Leu Ile Trp Gly Asp Gln Asn
50 55 60
Asp Ser Ser Val Phe Asp Phe Phe Leu Glu Lys Asn Met Phe Val Phe
65 70 75 80
Phe Leu Asn Ile Leu Arg Gln Lys Ser Gly Arg Tyr Val Cys Val Gln
85 90 95
Leu Leu Gln Thr Leu Asn Ile Leu Phe Glu Asn Ile Ser His Glu Thr
100 105 110
Ser Leu Tyr Tyr Leu Leu Ser Asn Asn Tyr Val Asn Ser Ile Ile Val
115 120 125
His Lys Phe Asp Phe Ser Asp Glu Glu Ile Met Ala Tyr Tyr Ile Ser
130 135 140
Phe Leu Lys Thr Leu Ser Leu Lys Leu Asn Asn His Thr Val His Phe
145 150 155 160
Phe Tyr Asn Glu His Ile Pro Leu Trp Ala Leu Asn Val Thr Pro Arg
165 170 175
Leu Asp Ser Pro Cys Pro Val Lys Met Leu Val Leu Leu Ala Phe Ile
180 185 190
Ile Ala Phe His Ile Thr Ser Ala Ala Leu Leu Phe Ile Ala Thr Val
195 200 205
Asp Asn Ala Trp Trp Val Gly Asp Glu Phe Phe Ala Asp Val Trp Arg
210 215 220
Ile Cys Thr Asn Asn Thr Asn Cys Thr Val Ile Asn Asp Ser Phe Gln
225 230 235 240
Glu Tyr Ser Thr Leu Gln Ala Val Gln Ala Thr Met Ile Leu Ser Thr
245 250 255
Ile Leu Cys Cys Ile Ala Phe Phe Ile Phe Val Leu Gln Leu Phe Arg
260 265 270
Leu Lys Gln Gly Glu Arg Phe Val Leu Thr Ser Ile Ile Gln Leu Met
275 280 285
Ser Cys Leu Cys Val Met Ile Ala Ala Ser Ile Tyr Thr Asp Arg Arg
290 295 300
Glu Asp Ile His Asp Lys Asn Ala Lys Phe Tyr Pro Val Thr Arg Glu
305 310 315 320
Gly Ser Tyr Gly Tyr Ser Tyr Ile Leu Ala Trp Val Ala Phe Ala Cys
325 330 335
Thr Phe Ile Ser Gly Met Met Tyr Leu Ile Leu Arg Lys Arg Lys
340 345 350
<210> 101
<211> 1635
<212> DNA
<213> 智人
<400> 101
atgtttggcc gctcgcggag ctgggtgggc gggggccatg gcaagacttc ccgcaacatc 60
cactccttgg accacctcaa gtatctgtac cacgttttga ccaaaaacac cacagtcaca 120
gaacagaacc ggaacctgct agtggagacc atccgttcca tcactgagat cctgatctgg 180
ggagatcaaa atgacagctc tgtatttgac ttcttcctgg agaagaatat gtttgttttc 240
ttcttgaaca tcttgcggca aaagtcgggc cgttacgtgt gcgttcagct gctgcagacc 300
ttgaacatcc tctttgagaa catcagtcac gagacctcac tttattattt gctctcaaat 360
aactacgtaa attctatcat cgttcataaa tttgactttt ctgatgagga gattatggcc 420
tattatatat cgttcctgaa aacactttcg ttaaaactca acaaccacac tgtccatttc 480
ttttataatg agcacaccaa tgactttgcc ctgtacacag aagccatcaa gtttttcaac 540
caccctgaaa gcatggttag aattgctgta agaaccataa ctttgaatgt ctataaagtg 600
tcattggata accaggccat gctgcactac atccgagata aaactgctgt tccttacttc 660
tccaatttgg tctggttcat tgggagccat gtgatcgaac tcgatgactg cgtgcagact 720
gatgaggagc atcggaatcg gggtaaactg agtgatctgg tggcagagca cctagaccac 780
ctgcactatc tcaatgacat cctgatcatc aactgtgagt tcctcaacga tgtgctcact 840
gaccacctgc tcaacaggct cttcctgccc ctctacgtgt actcactgga gaaccaggac 900
aagggaggag aacggccgaa aattagcctg ccggtgtctc tttatcttct gtcacaggtc 960
ttcttaatta tacatcatgc accgctggtg aactcgttag ctgaagtcat tctgaatggt 1020
gatctgtctg agatgtacgc taagactgaa caggatattc agagaagttc tcacatcccg 1080
ctctgggctt taaacgtgac ccctcgcctc gactcgccct gccctgtgaa aatgttggtg 1140
cttcttgctt tcatcatcgc cttccacatc acctctgcag ccttgctgtt cattgccacc 1200
gtcgacaatg cctggtgggt aggagatgag ttttttgcag atgtctggag aatatgtacc 1260
aacaacacga attgcacagt catcaatgac agctttcaag agtactccac gctgcaggcg 1320
gtccaggcca ccatgatcct ctccaccatt ctctgctgca tcgccttctt catcttcgtg 1380
ctccagctct tccgcctgaa gcagggagag aggtttgtcc taacctccat catccagcta 1440
atgtcatgtc tgtgtgtcat gattgcggcc tccatttata cagacaggcg tgaagacatt 1500
cacgacaaaa acgcgaaatt ctatcccgtg accagagaag gcagctacgg ctactcctac 1560
atcctggcgt gggtggcctt cgcctgcacc ttcatcagcg gcatgatgta cctgatactg 1620
aggaagcgca aatag 1635
<210> 102
<211> 544
<212> PRT
<213> 智人
<400> 102
Met Phe Gly Arg Ser Arg Ser Trp Val Gly Gly Gly His Gly Lys Thr
1 5 10 15
Ser Arg Asn Ile His Ser Leu Asp His Leu Lys Tyr Leu Tyr His Val
20 25 30
Leu Thr Lys Asn Thr Thr Val Thr Glu Gln Asn Arg Asn Leu Leu Val
35 40 45
Glu Thr Ile Arg Ser Ile Thr Glu Ile Leu Ile Trp Gly Asp Gln Asn
50 55 60
Asp Ser Ser Val Phe Asp Phe Phe Leu Glu Lys Asn Met Phe Val Phe
65 70 75 80
Phe Leu Asn Ile Leu Arg Gln Lys Ser Gly Arg Tyr Val Cys Val Gln
85 90 95
Leu Leu Gln Thr Leu Asn Ile Leu Phe Glu Asn Ile Ser His Glu Thr
100 105 110
Ser Leu Tyr Tyr Leu Leu Ser Asn Asn Tyr Val Asn Ser Ile Ile Val
115 120 125
His Lys Phe Asp Phe Ser Asp Glu Glu Ile Met Ala Tyr Tyr Ile Ser
130 135 140
Phe Leu Lys Thr Leu Ser Leu Lys Leu Asn Asn His Thr Val His Phe
145 150 155 160
Phe Tyr Asn Glu His Thr Asn Asp Phe Ala Leu Tyr Thr Glu Ala Ile
165 170 175
Lys Phe Phe Asn His Pro Glu Ser Met Val Arg Ile Ala Val Arg Thr
180 185 190
Ile Thr Leu Asn Val Tyr Lys Val Ser Leu Asp Asn Gln Ala Met Leu
195 200 205
His Tyr Ile Arg Asp Lys Thr Ala Val Pro Tyr Phe Ser Asn Leu Val
210 215 220
Trp Phe Ile Gly Ser His Val Ile Glu Leu Asp Asp Cys Val Gln Thr
225 230 235 240
Asp Glu Glu His Arg Asn Arg Gly Lys Leu Ser Asp Leu Val Ala Glu
245 250 255
His Leu Asp His Leu His Tyr Leu Asn Asp Ile Leu Ile Ile Asn Cys
260 265 270
Glu Phe Leu Asn Asp Val Leu Thr Asp His Leu Leu Asn Arg Leu Phe
275 280 285
Leu Pro Leu Tyr Val Tyr Ser Leu Glu Asn Gln Asp Lys Gly Gly Glu
290 295 300
Arg Pro Lys Ile Ser Leu Pro Val Ser Leu Tyr Leu Leu Ser Gln Val
305 310 315 320
Phe Leu Ile Ile His His Ala Pro Leu Val Asn Ser Leu Ala Glu Val
325 330 335
Ile Leu Asn Gly Asp Leu Ser Glu Met Tyr Ala Lys Thr Glu Gln Asp
340 345 350
Ile Gln Arg Ser Ser His Ile Pro Leu Trp Ala Leu Asn Val Thr Pro
355 360 365
Arg Leu Asp Ser Pro Cys Pro Val Lys Met Leu Val Leu Leu Ala Phe
370 375 380
Ile Ile Ala Phe His Ile Thr Ser Ala Ala Leu Leu Phe Ile Ala Thr
385 390 395 400
Val Asp Asn Ala Trp Trp Val Gly Asp Glu Phe Phe Ala Asp Val Trp
405 410 415
Arg Ile Cys Thr Asn Asn Thr Asn Cys Thr Val Ile Asn Asp Ser Phe
420 425 430
Gln Glu Tyr Ser Thr Leu Gln Ala Val Gln Ala Thr Met Ile Leu Ser
435 440 445
Thr Ile Leu Cys Cys Ile Ala Phe Phe Ile Phe Val Leu Gln Leu Phe
450 455 460
Arg Leu Lys Gln Gly Glu Arg Phe Val Leu Thr Ser Ile Ile Gln Leu
465 470 475 480
Met Ser Cys Leu Cys Val Met Ile Ala Ala Ser Ile Tyr Thr Asp Arg
485 490 495
Arg Glu Asp Ile His Asp Lys Asn Ala Lys Phe Tyr Pro Val Thr Arg
500 505 510
Glu Gly Ser Tyr Gly Tyr Ser Tyr Ile Leu Ala Trp Val Ala Phe Ala
515 520 525
Cys Thr Phe Ile Ser Gly Met Met Tyr Leu Ile Leu Arg Lys Arg Lys
530 535 540
<210> 103
<211> 3431
<212> DNA
<213> 智人
<400> 103
aaccgcctcc attacatggt ccgttcctga cgtgtacacc agcctctcag agaaaactcc 60
atccctacac tcggtagtct cagaattgcg ctgtccactt gtcgtgtggc tctgtgtcga 120
cactgtgcgc caccatggcc gtgactgcct gtcagggctt ggggttcgtg gtttcactga 180
ttgggattgc gggcatcatt gctgccacct gcatggacca gtggagcacc caagacttgt 240
acaacaaccc cgtaacagct gttttcaact accaggggct gtggcgctcc tgtgtccgag 300
agagctctgg cttcaccgag tgccggggct acttcaccct gctggggctg ccagccatgc 360
tgcaggcagt gcgagccctg atgatcgtag gcatcgtcct gggtgccatt ggcctcctgg 420
tatccatctt tgccctgaaa tgcatccgca ttggcagcat ggaggactct gccaaagcca 480
acatgacact gacctccggg atcatgttca ttgtctcagg tctttgtgca attgctggag 540
tgtctgtgtt tgccaacatg ctggtgacta acttctggat gtccacagct aacatgtaca 600
ccggcatggg tgggatggtg cagactgttc agaccaggta cacatttggt gcggctctgt 660
tcgtgggctg ggtcgctgga ggcctcacac taattggggg tgtgatgatg tgcatcgcct 720
gccggggcct ggcaccagaa gaaaccaact acaaagccgt ttcttatcat gcctcaggcc 780
acagtgttgc ctacaagcct ggaggcttca aggccagcac tggctttggg tccaacacca 840
aaaacaagaa gatatacgat ggaggtgccc gcacagagga cgaggtacaa tcttatcctt 900
ccaagcacga ctatgtgtaa tgctctaaga cctctcagca cgggcggaag aaactcccgg 960
agagctcacc caaaaaacaa ggagatccca tctagatttc ttcttgcttt tgactcacag 1020
ctggaagtta gaaaagcctc gatttcatct ttggagaggc caaatggtct tagcctcagt 1080
ctctgtctct aaatattcca ccataaaaca gctgagttat ttatgaatta gaggctatag 1140
ctcacatttt caatcctcta tttctttttt taaatataac tttctactct gatgagagaa 1200
tgtggtttta atctctctct cacattttga tgatttagac agactccccc tcttcctcct 1260
agtcaataaa cccattgatg atctatttcc cagcttatcc ccaagaaaac ttttgaaagg 1320
aaagagtaga cccaaagatg ttattttctg ctgtttgaat tttgtctccc cacccccaac 1380
ttggctagta ataaacactt actgaagaag aagcaataag agaaagatat ttgtaatctc 1440
tccagcccat gatctcggtt ttcttacact gtgatcttaa aagttaccaa accaaagtca 1500
ttttcagttt gaggcaacca aacctttcta ctgctgttga catcttctta ttacagcaac 1560
accattctag gagtttcctg agctctccac tggagtcctc tttctgtcgc gggtcagaaa 1620
ttgtccctag atgaatgaga aaattatttt ttttaattta agtcctaaat atagttaaaa 1680
taaataatgt tttagtaaaa tgatacacta tctctgtgaa atagcctcac ccctacatgt 1740
ggatagaagg aaatgaaaaa ataattgctt tgacattgtc tatatggtac tttgtaaagt 1800
catgcttaag tacaaattcc atgaaaagct cactgatcct aattctttcc ctttgaggtc 1860
tctatggctc tgattgtaca tgatagtaag tgtaagccat gtaaaaagta aataatgtct 1920
gggcacagtg gctcacgcct gtaatcctag cactttggga ggctgaggag gaaggatcac 1980
ttgagcccag aagttcgaga ctagcctggg caacatggag aagccctgtc tctacaaaat 2040
acagagagaa aaaatcagcc agtcatggtg gcctacacct gtagtcccag cattccggga 2100
ggctgaggtg ggaggatcac ttgagcccag ggaggttggg gctgcagtga gccatgatca 2160
caccactgca ctccagccag gtgacatagc gagatcctgt ctaaaaaaat aaaaaataaa 2220
taatggaaca cagcaagtcc taggaagtag gttaaaacta attctttaaa aaaaaaaaaa 2280
agttgagcct gaattaaatg taatgtttcc aagtgacagg tatccacatt tgcatggtta 2340
caagccactg ccagttagca gtagcacttt cctggcactg tggtcggttt tgttttgttt 2400
tgctttgttt agagacgggg tctcactttc caggctggcc tcaaactcct gcactcaagc 2460
aattcttcta ccctggcctc ccaagtagct ggaattacag gtgtgcgcca tcacaactag 2520
ctggtggtca gttttgttac tctgagagct gttcacttct ctgaattcac ctagagtggt 2580
tggaccatca gatgtttggg caaaactgaa agctctttgc aaccacacac cttccctgag 2640
cttacatcac tgcccttttg agcagaaagt ctaaattcct tccaagacag tagaattcca 2700
tcccagtacc aaagccagat aggcccccta ggaaactgag gtaagagcag tctctaaaaa 2760
ctacccacag cagcattggt gcaggggaac ttggccatta ggttattatt tgagaggaaa 2820
gtcctcacat caatagtaca tatgaaagtg acctccaagg ggattggtga atactcataa 2880
ggatcttcag gctgaacaga ctatgtctgg ggaaagaacg gattatgccc cattaaataa 2940
caagttgtgt tcaagagtca gagcagtgag ctcagaggcc cttctcactg agacagcaac 3000
atttaaacca aaccagagga agtatttgtg gaactcactg cctcagtttg ggtaaaggat 3060
gagcagacaa gtcaactaaa gaaaaaagaa aagcaaggag gagggttgag caatctagag 3120
catggagttt gttaagtgct ctctggattt gagttgaaga gcatccattt gagttgaagg 3180
ccacagggca caatgagctc tcccttctac caccagaaag tccctggtca ggtctcaggt 3240
agtgcggtgt ggctcagctg ggtttttaat tagcgcattc tctatccaac atttaattgt 3300
ttgaaagcct ccatatagtt agattgtgct ttgtaatttt gttgttgttg ctctatctta 3360
ttgtatatgc attgagtatt aacctgaatg ttttgttact taaatattaa aaacactgtt 3420
atcctacagt t 3431
<210> 104
<211> 261
<212> PRT
<213> 智人
<400> 104
Met Ala Val Thr Ala Cys Gln Gly Leu Gly Phe Val Val Ser Leu Ile
1 5 10 15
Gly Ile Ala Gly Ile Ile Ala Ala Thr Cys Met Asp Gln Trp Ser Thr
20 25 30
Gln Asp Leu Tyr Asn Asn Pro Val Thr Ala Val Phe Asn Tyr Gln Gly
35 40 45
Leu Trp Arg Ser Cys Val Arg Glu Ser Ser Gly Phe Thr Glu Cys Arg
50 55 60
Gly Tyr Phe Thr Leu Leu Gly Leu Pro Ala Met Leu Gln Ala Val Arg
65 70 75 80
Ala Leu Met Ile Val Gly Ile Val Leu Gly Ala Ile Gly Leu Leu Val
85 90 95
Ser Ile Phe Ala Leu Lys Cys Ile Arg Ile Gly Ser Met Glu Asp Ser
100 105 110
Ala Lys Ala Asn Met Thr Leu Thr Ser Gly Ile Met Phe Ile Val Ser
115 120 125
Gly Leu Cys Ala Ile Ala Gly Val Ser Val Phe Ala Asn Met Leu Val
130 135 140
Thr Asn Phe Trp Met Ser Thr Ala Asn Met Tyr Thr Gly Met Gly Gly
145 150 155 160
Met Val Gln Thr Val Gln Thr Arg Tyr Thr Phe Gly Ala Ala Leu Phe
165 170 175
Val Gly Trp Val Ala Gly Gly Leu Thr Leu Ile Gly Gly Val Met Met
180 185 190
Cys Ile Ala Cys Arg Gly Leu Ala Pro Glu Glu Thr Asn Tyr Lys Ala
195 200 205
Val Ser Tyr His Ala Ser Gly His Ser Val Ala Tyr Lys Pro Gly Gly
210 215 220
Phe Lys Ala Ser Thr Gly Phe Gly Ser Asn Thr Lys Asn Lys Lys Ile
225 230 235 240
Tyr Asp Gly Gly Ala Arg Thr Glu Asp Glu Val Gln Ser Tyr Pro Ser
245 250 255
Lys His Asp Tyr Val
260
<210> 105
<211> 6862
<212> DNA
<213> 智人
<400> 105
ggcggggcgg ccgaggctgc tgtgagaggg cgctcgaggc tgccgagagc tagctagcga 60
aggaggcggg gaggcggcgt ctgcactcgc tcgcccgctc gctcgcttcc cggcgccgct 120
gcgggtccgc gctgcgtttc ctgctcgcga tccgctccgt tgcccgcgcc cggaacagca 180
gcacctcggc cgggtccgag ctcggttcgg gagtcttgcg cgccggcgga caccgcgcgc 240
ggagtgagcc agcgccacac ctgtggagcc ggcggccgtc gggggagccg gccggggtcc 300
cgccgcgtga gtgctctggg cggcgggcgg cccgggcccc ggcggaggcg cgccccccgg 360
ctgggcgccg cgcgcaccat ggggctccca gcgctcgagt tcagcgactg ctgcctcgat 420
agtccgcact tccgagagac gctcaagtcg cacgaagcag agctggacaa gaccaacaaa 480
ttcatcaagg agctcatcaa ggacgggaag tcactcataa gcgcgctcaa gaatttgtct 540
tcagcgaagc ggaagtttgc agattcctta aatgaattta aatttcagtg cataggagat 600
gcagaaacag atgatgagat gtgtatagca agatctttgc aggagtttgc cactgtcctc 660
aggaatcttg aagatgaacg gatacggatg attgagaatg ccagcgaggt gctcatcact 720
cccttggaga agtttcgaaa ggaacagatc ggggctgcca aggaagccaa aaagaagtat 780
gacaaagaga cagaaaagta ttgtggcatc ttagaaaaac acttgaattt gtcttccaaa 840
aagaaagaat ctcagcttca ggaggcagac agccaagtgg acctggtccg gcagcatttc 900
tatgaagtat ccctggaata tgtcttcaag gtgcaggaag tccaagagag aaagatgttt 960
gagtttgtgg agcctctgct ggccttcctg caaggactct tcactttcta tcaccatggt 1020
tacgaactgg ccaaggattt cggggacttc aagacacagt taaccattag catacagaac 1080
acaagaaatc gctttgaagg cactagatca gaagtggaat cactgatgaa aaagatgaag 1140
gagaatcccc ttgagcacaa gaccatcagt ccctacacca tggagggata cctctacgtg 1200
caggagaaac gtcactttgg aacttcttgg gtgaagcact actgtacata tcaacgggat 1260
tccaaacaaa tcaccatggt accatttgac caaaagtcag gaggaaaagg gggagaagat 1320
gaatcagtta tcctcaaatc ctgcacacgg cggaaaacag actccattga gaagaggttt 1380
tgctttgatg tggaagcagt agacaggcca ggggttatca ccatgcaagc tttgtcggaa 1440
gaggaccgga ggctctggat ggaagccatg gatggccggg aacctgtcta caactcgaac 1500
aaagacagcc agagtgaagg gactgcgcag ttggacagca ttggcttcag cataatcagg 1560
aaatgcatcc atgctgtgga aaccagaggg atcaacgagc aagggctgta tcgaattgtg 1620
ggtgtcaact ccagagtgca gaagttgctg agtgtcctga tggaccccaa gactgcttct 1680
gagacagaaa cagatatctg tgctgaatgg gagataaaga ccatcactag tgctctgaag 1740
acctacctaa gaatgcttcc aggaccactc atgatgtacc agtttcaaag aagtttcatc 1800
aaagcagcaa aactggagaa ccaggagtct cgggtctctg aaatccacag ccttgttcat 1860
cggctcccag agaaaaatcg gcagatgtta cagctgctca tgaaccactt ggcaaatgtt 1920
gctaacaacc acaagcagaa tttgatgacg gtggcaaacc ttggtgtggt gtttggaccc 1980
actctgctga ggcctcagga agaaacagta gcagccatca tggacatcaa atttcagaac 2040
attgtcattg agatcctaat agaaaaccac gaaaagatat ttaacaccgt gcccgatatg 2100
cctctcacca atgcccagct gcacctgtct cggaagaaga gcagtgactc caagcccccg 2160
tcctgcagcg agaggcccct gacgctcttc cacaccgttc agtcaacaga gaaacaggaa 2220
caaaggaaca gcatcatcaa ctccagtttg gaatctgtct catcaaatcc aaacagcatc 2280
cttaattcca gcagcagctt acagcccaac atgaactcca gtgacccaga cctggctgtg 2340
gtcaaaccca cccggcccaa ctcactcccc ccgaatccaa gcccaacttc acccctctcg 2400
ccatcttggc ccatgttctc ggcgccatcc agccctatgc ccacctcatc cacgtccagc 2460
gactcatccc ccgtcaggtc tgttgcaggg tttgtttggt tttctgttgc tgccgttgtt 2520
ctctcattgg ctcggtcctc tcttcatgca gtgttcagcc tcctcgtcaa ctttgttccc 2580
tgccatccaa acctgcactt gctttttgac aggccagaag aagcggtaca tgaagactcc 2640
agcacaccgt tccggaaggc aaaagccttg tatgcctgca aagctgaaca tgactcagaa 2700
ctttcgttca cagcaggcac ggtcttcgat aacgttcacc catctcagga gcctggctgg 2760
ttggagggga ctctgaacgg aaagactggc ctcatccctg agaattacgt ggagttcctc 2820
taaccgtggg ccccagcaga actgctgagc tttacatggt atccatgaca actgctgatt 2880
ccagtgtcga ggccatttct ctttgccact gagaaatgca gcgtgactga ctctgttgct 2940
acctgtcaac atgaatgttt ctgtgagctc tggtgtcact catctccatg atcatctcag 3000
ccaacatgca tcagtactgc aagaaaagaa gtcaatcagc agaggagagc atttgataac 3060
taagaggaag acttgcaaag ccgttttctc atgagtaccc tgaatagggg gcactcattt 3120
tgtttcaacg gtccaaacgc ccaaccttca gaaagaggaa gtcagataga aatagtccct 3180
gagagcacac tgtgtagcta agcctgctgg ggctgggtga agaaattggc gctgagatcc 3240
aggctggatc cattgctttt gtttacaata ggcactctct ctaccccacc tctcagtact 3300
tgagacttaa agtgctacag gcagctggat ctgtttgcat gcaggatgaa gagggttaaa 3360
acactgttta tataagatcc aatctctcac catctctaaa gcagccgttg gcctgtcatc 3420
agtgagatac aatccagtct tctcatgcac gggaacacac acaccctgcg tttctccctc 3480
ccaggctagg aacctctctg ccaccaaggg ctgccatcca tcgcctagta accacggcaa 3540
cccaacctac tctaaaacca aaccaaaaaa ataaaataac acatcctctt tgcatgacac 3600
attttttttc tccccttttt ggtacacttt ttttgaatgg ttttctaaca acttgaagca 3660
caggatcaag gaattagggt ggtctacttg aggcagatgg gatagtagct gggaactgtt 3720
ccctttctga ttaatttcag cagcatcgga atatatttgg agcacaccct agtaacctct 3780
tgagattaaa ttacatagtc ttaatatttc tgttcctcca tgcaactgat gtttgttttt 3840
taaagggtaa gatgctgcct cccaatgggt gatgccatct gactggtttc cccatgtcct 3900
cccattcacc catctctgct cccacccttg cctgcctcta acccaccact ggccagcccc 3960
cttgccctac tctgggctgc tgaacactgg tgctgtggtg gttttcaagg ttaattccta 4020
ggctaaccgt atggcctata gtttaaaagc acatctatgt tcactgccac tctgaaaaag 4080
ggaattattt ctcagtcttt caaggcttga gactaatata ggccattgtg attcaggaag 4140
aaacccaagg ttggagggtg ggatgagtac cctctgaaaa agggaatttg ctggtgaaaa 4200
gaggctggat cttgtggaag actgtcttgg atggggaagt actacctgga gatttcaaat 4260
tcacttggcc tgcaaacaac agagttatcc gtatcttcca catgtgaatg tcattgcaag 4320
ggtgactcta gacaaactac aaaccgatgg accgtcaagc tccccaggag ccccttggat 4380
ggcagcgttg cttcagagtg tttcctgttt ctggaattcc ttgttaggga actttaaaga 4440
agaaaagaaa aacttgaatt gtgttgaatt actgtatctt ttactttttt ttttttgaaa 4500
agataaactt gtaaatagag tgatttgaaa tactatatgg caaagtttta tatttgatat 4560
tctttaagtt agttgctcac acacttaggc tttgattgct gaagaagtat gtttaagagg 4620
gagagagggg aggcaaagct gaagagagtc aaggtcactg tccccgcttc ggcctgaagg 4680
aaagagaaga catttctatg gccttgctct ctgctgtcct gttggtgggc acgacacatc 4740
agtggtgttc agtctttatg tgtttttaag catcccttgg gctttggatt tggagatggg 4800
aagagcatct ccaggcaatg agtttttcaa agaatgccta cttagtagta agatgaagct 4860
caggatttaa ataagtgggg tcaggcattc gagtttttgt ctttcttctc aggtgtattt 4920
cttggtaccc ccaagatatc aggccagaaa gagatgagtc agttgctgtg ctctttactt 4980
ctttttctcc acatcttctg aggctttaga aatgtggaca agctagtttt caaattttgt 5040
gtgcgtctgt aagttcttaa agaaccagct tcttagaatg ttcagttctc aatgtgctgc 5100
tgctttccct tctcctaaac attttaaaac tcttcccttt cacctccaat tcccgtgatc 5160
ccaaaagaag aggaagactc caggaggggt atagattgtg ccgtcatagc tttacaggtg 5220
gttttaaagt taacaggggt ttgtcatggt gattcactac tcagtttatc agctcaagga 5280
ttatacagct cttttccggg aactcaccca ggagcaagcg agacactacc attgaatcag 5340
ggaatgagaa ttaagaatgg acaggaccaa gacagaactc aagaaagcca ctggggaaaa 5400
ctcgagaaga aagggagtat actagtaggt tagatctgtg aacctgagga caagaagacc 5460
ttgggaaatg gaggcctcag gggatgtgca ttcacatact attacgcttc tcaaagagag 5520
accaacatca tgcttttaac acatttgatg aggtttttta tttgtgtttt tgtttgtttt 5580
ttgagatgga gtctcactct gtggcccagg ctggagtgca gtggcgcaat cttggctcac 5640
tgcaacctcc acctcccagg ttcaagtgat tctcctgtct cagcctccca agtagctggg 5700
actacaggca tgagccatca cacccagcta gttttttgta tttttagtaa agatggggtt 5760
ttgccatgtt tgccaggctg atctcgaact cctgacctca agtgatctgc ccacttcaga 5820
cccccaaagt gctgggattc caggtgtgag ccgctgcggc cgaccacatt tgatgtttga 5880
agttgtaatc tgtcccatca taaacttacc tggagctcat gtggaggaac agaaggccaa 5940
gatccttgct ttgggggtgc ctcacgaagc atccctgtag acatttggcc ccagcttcac 6000
tgcttggaag catgtccctc cctcttgagt tggctctgat ttgaaatcgg gagaaacaga 6060
gctgctgcca atgggatctt ttaggtaact ccctccctag cttccgtgtg tctgtgcagt 6120
gcccatgagc tgctgccaat gggatctttc aggtaccccc tccccagctt ccctgtggct 6180
gtgcggtgcc cttgacagat ggcttctctg tttccctttg cccagccagg ctcccctcct 6240
tcctattagc tacaaaactg gataaacttc agaatatgag ccaatgagta ggaaggaact 6300
tgaagactaa agattttact ctctccccta tccatgcccc ctacctctga ctctctctgt 6360
gtgaacagga aactttaggg cagatgagga gaatgaattg gttatcagag tggaagacca 6420
tggcccagga tccctgagct ttcccagtag cctccagttt cctttgtaag acccagggat 6480
cacttagcca tagcctgaat cttttagggg tattaaggtc agcctctcac tcttccttca 6540
ggttactaac aaaatttcgt agctaaagaa tgccatggcc gggtgcagtg gctcacgcct 6600
ataatcccag cactttggga ggccgaggcg ggcggatcac gaggtcagga gattgagacc 6660
atcctggcta cgacggtgaa accccgtctc tactaaaaat acaaaaaatt agccgggtgt 6720
ggtggcgggc gcctgtagtc ccagctactc tggaggctga ggcaggagaa tggcatgaac 6780
ccaggaggca gagattgcag tgagccaaga tcacgcccct gcactccagc ctgggtgaca 6840
gagccagact ccgtctcaaa gg 6862
<210> 106
<211> 814
<212> PRT
<213> 智人
<400> 106
Met Gly Leu Pro Ala Leu Glu Phe Ser Asp Cys Cys Leu Asp Ser Pro
1 5 10 15
His Phe Arg Glu Thr Leu Lys Ser His Glu Ala Glu Leu Asp Lys Thr
20 25 30
Asn Lys Phe Ile Lys Glu Leu Ile Lys Asp Gly Lys Ser Leu Ile Ser
35 40 45
Ala Leu Lys Asn Leu Ser Ser Ala Lys Arg Lys Phe Ala Asp Ser Leu
50 55 60
Asn Glu Phe Lys Phe Gln Cys Ile Gly Asp Ala Glu Thr Asp Asp Glu
65 70 75 80
Met Cys Ile Ala Arg Ser Leu Gln Glu Phe Ala Thr Val Leu Arg Asn
85 90 95
Leu Glu Asp Glu Arg Ile Arg Met Ile Glu Asn Ala Ser Glu Val Leu
100 105 110
Ile Thr Pro Leu Glu Lys Phe Arg Lys Glu Gln Ile Gly Ala Ala Lys
115 120 125
Glu Ala Lys Lys Lys Tyr Asp Lys Glu Thr Glu Lys Tyr Cys Gly Ile
130 135 140
Leu Glu Lys His Leu Asn Leu Ser Ser Lys Lys Lys Glu Ser Gln Leu
145 150 155 160
Gln Glu Ala Asp Ser Gln Val Asp Leu Val Arg Gln His Phe Tyr Glu
165 170 175
Val Ser Leu Glu Tyr Val Phe Lys Val Gln Glu Val Gln Glu Arg Lys
180 185 190
Met Phe Glu Phe Val Glu Pro Leu Leu Ala Phe Leu Gln Gly Leu Phe
195 200 205
Thr Phe Tyr His His Gly Tyr Glu Leu Ala Lys Asp Phe Gly Asp Phe
210 215 220
Lys Thr Gln Leu Thr Ile Ser Ile Gln Asn Thr Arg Asn Arg Phe Glu
225 230 235 240
Gly Thr Arg Ser Glu Val Glu Ser Leu Met Lys Lys Met Lys Glu Asn
245 250 255
Pro Leu Glu His Lys Thr Ile Ser Pro Tyr Thr Met Glu Gly Tyr Leu
260 265 270
Tyr Val Gln Glu Lys Arg His Phe Gly Thr Ser Trp Val Lys His Tyr
275 280 285
Cys Thr Tyr Gln Arg Asp Ser Lys Gln Ile Thr Met Val Pro Phe Asp
290 295 300
Gln Lys Ser Gly Gly Lys Gly Gly Glu Asp Glu Ser Val Ile Leu Lys
305 310 315 320
Ser Cys Thr Arg Arg Lys Thr Asp Ser Ile Glu Lys Arg Phe Cys Phe
325 330 335
Asp Val Glu Ala Val Asp Arg Pro Gly Val Ile Thr Met Gln Ala Leu
340 345 350
Ser Glu Glu Asp Arg Arg Leu Trp Met Glu Ala Met Asp Gly Arg Glu
355 360 365
Pro Val Tyr Asn Ser Asn Lys Asp Ser Gln Ser Glu Gly Thr Ala Gln
370 375 380
Leu Asp Ser Ile Gly Phe Ser Ile Ile Arg Lys Cys Ile His Ala Val
385 390 395 400
Glu Thr Arg Gly Ile Asn Glu Gln Gly Leu Tyr Arg Ile Val Gly Val
405 410 415
Asn Ser Arg Val Gln Lys Leu Leu Ser Val Leu Met Asp Pro Lys Thr
420 425 430
Ala Ser Glu Thr Glu Thr Asp Ile Cys Ala Glu Trp Glu Ile Lys Thr
435 440 445
Ile Thr Ser Ala Leu Lys Thr Tyr Leu Arg Met Leu Pro Gly Pro Leu
450 455 460
Met Met Tyr Gln Phe Gln Arg Ser Phe Ile Lys Ala Ala Lys Leu Glu
465 470 475 480
Asn Gln Glu Ser Arg Val Ser Glu Ile His Ser Leu Val His Arg Leu
485 490 495
Pro Glu Lys Asn Arg Gln Met Leu Gln Leu Leu Met Asn His Leu Ala
500 505 510
Asn Val Ala Asn Asn His Lys Gln Asn Leu Met Thr Val Ala Asn Leu
515 520 525
Gly Val Val Phe Gly Pro Thr Leu Leu Arg Pro Gln Glu Glu Thr Val
530 535 540
Ala Ala Ile Met Asp Ile Lys Phe Gln Asn Ile Val Ile Glu Ile Leu
545 550 555 560
Ile Glu Asn His Glu Lys Ile Phe Asn Thr Val Pro Asp Met Pro Leu
565 570 575
Thr Asn Ala Gln Leu His Leu Ser Arg Lys Lys Ser Ser Asp Ser Lys
580 585 590
Pro Pro Ser Cys Ser Glu Arg Pro Leu Thr Leu Phe His Thr Val Gln
595 600 605
Ser Thr Glu Lys Gln Glu Gln Arg Asn Ser Ile Ile Asn Ser Ser Leu
610 615 620
Glu Ser Val Ser Ser Asn Pro Asn Ser Ile Leu Asn Ser Ser Ser Ser
625 630 635 640
Leu Gln Pro Asn Met Asn Ser Ser Asp Pro Asp Leu Ala Val Val Lys
645 650 655
Pro Thr Arg Pro Asn Ser Leu Pro Pro Asn Pro Ser Pro Thr Ser Pro
660 665 670
Leu Ser Pro Ser Trp Pro Met Phe Ser Ala Pro Ser Ser Pro Met Pro
675 680 685
Thr Ser Ser Thr Ser Ser Asp Ser Ser Pro Val Arg Ser Val Ala Gly
690 695 700
Phe Val Trp Phe Ser Val Ala Ala Val Val Leu Ser Leu Ala Arg Ser
705 710 715 720
Ser Leu His Ala Val Phe Ser Leu Leu Val Asn Phe Val Pro Cys His
725 730 735
Pro Asn Leu His Leu Leu Phe Asp Arg Pro Glu Glu Ala Val His Glu
740 745 750
Asp Ser Ser Thr Pro Phe Arg Lys Ala Lys Ala Leu Tyr Ala Cys Lys
755 760 765
Ala Glu His Asp Ser Glu Leu Ser Phe Thr Ala Gly Thr Val Phe Asp
770 775 780
Asn Val His Pro Ser Gln Glu Pro Gly Trp Leu Glu Gly Thr Leu Asn
785 790 795 800
Gly Lys Thr Gly Leu Ile Pro Glu Asn Tyr Val Glu Phe Leu
805 810
<210> 107
<211> 2088
<212> DNA
<213> 智人
<400> 107
atggccgtga ctgcctgtca gggcttgggg ttcgtggttt cactgattgg gattgcgggc 60
atcattgctg ccacctgcat ggaccagtgg agcacccaag acttgtacaa caaccccgta 120
acagctgttt tcaactacca ggggctgtgg cgctcctgtg tccgagagag ctctggcttc 180
accgagtgcc ggggctactt caccctgctg gggctgccag ccatgctgca ggcagtgcga 240
gccctgatga tcgtaggcat cgtcctgggt gccattggcc tcctggtatc catctttgcc 300
ctgaaatgca tccgcattgg cagcatggag gactctgcca aagccaacat gacactgacc 360
tccgggatca tgttcattgt ctcaggtctt tgtgcaattg ctggagtgtc tgtgtttgcc 420
aacatgctgg tgactaactt ctggatgtcc acagctaaca tgtacaccgg catgggtggg 480
atggtgcaga ctgttcagac caggtacaca tttggtgcgg ctctgttcgt gggctgggtc 540
gctggaggcc tcacactaat tgggggtgtg atgatgtgca tcgcctgccg gggcctggca 600
ccagaagaaa ccaactacaa agccgtttct tatcatgcct caggccacag tgttgcctac 660
aagcctggag gcttcaaggc cagcactggc tttgggtcca acaccaaaaa caagaagata 720
tacgatggag gtgcccgcac agaggacgag gtctacaact cgaacaaaga cagccagagt 780
gaagggactg cgcagttgga cagcattggc ttcagcataa tcaggaaatg catccatgct 840
gtggaaacca gagggatcaa cgagcaaggg ctgtatcgaa ttgtgggtgt caactccaga 900
gtgcagaagt tgctgagtgt cctgatggac cccaagactg cttctgagac agaaacagat 960
atctgtgctg aatgggagat aaagaccatc actagtgctc tgaagaccta cctaagaatg 1020
cttccaggac cactcatgat gtaccagttt caaagaagtt tcatcaaagc agcaaaactg 1080
gagaaccagg agtctcgggt ctctgaaatc cacagccttg ttcatcggct cccagagaaa 1140
aatcggcaga tgttacagct gctcatgaac cacttggcaa atgttgctaa caaccacaag 1200
cagaatttga tgacggtggc aaaccttggt gtggtgtttg gacccactct gctgaggcct 1260
caggaagaaa cagtagcagc catcatggac atcaaatttc agaacattgt cattgagatc 1320
ctaatagaaa accacgaaaa gatatttaac accgtgcccg atatgcctct caccaatgcc 1380
cagctgcacc tgtctcggaa gaagagcagt gactccaagc ccccgtcctg cagcgagagg 1440
cccctgacgc tcttccacac cgttcagtca acagagaaac aggaacaaag gaacagcatc 1500
atcaactcca gtttggaatc tgtctcatca aatccaaaca gcatccttaa ttccagcagc 1560
agcttacagc ccaacatgaa ctccagtgac ccagacctgg ctgtggtcaa acccacccgg 1620
cccaactcac tccccccgaa tccaagccca acttcacccc tctcgccatc ttggcccatg 1680
ttctcggcgc catccagccc tatgcccacc tcatccacgt ccagcgactc atcccccgtc 1740
aggtctgttg cagggtttgt ttggttttct gttgctgccg ttgttctctc attggctcgg 1800
tcctctcttc atgcagtgtt cagcctcctc gtcaactttg ttccctgcca tccaaacctg 1860
cacttgcttt ttgacaggcc agaagaagcg gtacatgaag actccagcac accgttccgg 1920
aaggcaaaag ccttgtatgc ctgcaaagct gaacatgact cagaactttc gttcacagca 1980
ggcacggtct tcgataacgt tcacccatct caggagcctg gctggttgga ggggactctg 2040
aacggaaaga ctggcctcat ccctgagaat tacgtggagt tcctctaa 2088
<210> 108
<211> 695
<212> PRT
<213> 智人
<400> 108
Met Ala Val Thr Ala Cys Gln Gly Leu Gly Phe Val Val Ser Leu Ile
1 5 10 15
Gly Ile Ala Gly Ile Ile Ala Ala Thr Cys Met Asp Gln Trp Ser Thr
20 25 30
Gln Asp Leu Tyr Asn Asn Pro Val Thr Ala Val Phe Asn Tyr Gln Gly
35 40 45
Leu Trp Arg Ser Cys Val Arg Glu Ser Ser Gly Phe Thr Glu Cys Arg
50 55 60
Gly Tyr Phe Thr Leu Leu Gly Leu Pro Ala Met Leu Gln Ala Val Arg
65 70 75 80
Ala Leu Met Ile Val Gly Ile Val Leu Gly Ala Ile Gly Leu Leu Val
85 90 95
Ser Ile Phe Ala Leu Lys Cys Ile Arg Ile Gly Ser Met Glu Asp Ser
100 105 110
Ala Lys Ala Asn Met Thr Leu Thr Ser Gly Ile Met Phe Ile Val Ser
115 120 125
Gly Leu Cys Ala Ile Ala Gly Val Ser Val Phe Ala Asn Met Leu Val
130 135 140
Thr Asn Phe Trp Met Ser Thr Ala Asn Met Tyr Thr Gly Met Gly Gly
145 150 155 160
Met Val Gln Thr Val Gln Thr Arg Tyr Thr Phe Gly Ala Ala Leu Phe
165 170 175
Val Gly Trp Val Ala Gly Gly Leu Thr Leu Ile Gly Gly Val Met Met
180 185 190
Cys Ile Ala Cys Arg Gly Leu Ala Pro Glu Glu Thr Asn Tyr Lys Ala
195 200 205
Val Ser Tyr His Ala Ser Gly His Ser Val Ala Tyr Lys Pro Gly Gly
210 215 220
Phe Lys Ala Ser Thr Gly Phe Gly Ser Asn Thr Lys Asn Lys Lys Ile
225 230 235 240
Tyr Asp Gly Gly Ala Arg Thr Glu Asp Glu Val Tyr Asn Ser Asn Lys
245 250 255
Asp Ser Gln Ser Glu Gly Thr Ala Gln Leu Asp Ser Ile Gly Phe Ser
260 265 270
Ile Ile Arg Lys Cys Ile His Ala Val Glu Thr Arg Gly Ile Asn Glu
275 280 285
Gln Gly Leu Tyr Arg Ile Val Gly Val Asn Ser Arg Val Gln Lys Leu
290 295 300
Leu Ser Val Leu Met Asp Pro Lys Thr Ala Ser Glu Thr Glu Thr Asp
305 310 315 320
Ile Cys Ala Glu Trp Glu Ile Lys Thr Ile Thr Ser Ala Leu Lys Thr
325 330 335
Tyr Leu Arg Met Leu Pro Gly Pro Leu Met Met Tyr Gln Phe Gln Arg
340 345 350
Ser Phe Ile Lys Ala Ala Lys Leu Glu Asn Gln Glu Ser Arg Val Ser
355 360 365
Glu Ile His Ser Leu Val His Arg Leu Pro Glu Lys Asn Arg Gln Met
370 375 380
Leu Gln Leu Leu Met Asn His Leu Ala Asn Val Ala Asn Asn His Lys
385 390 395 400
Gln Asn Leu Met Thr Val Ala Asn Leu Gly Val Val Phe Gly Pro Thr
405 410 415
Leu Leu Arg Pro Gln Glu Glu Thr Val Ala Ala Ile Met Asp Ile Lys
420 425 430
Phe Gln Asn Ile Val Ile Glu Ile Leu Ile Glu Asn His Glu Lys Ile
435 440 445
Phe Asn Thr Val Pro Asp Met Pro Leu Thr Asn Ala Gln Leu His Leu
450 455 460
Ser Arg Lys Lys Ser Ser Asp Ser Lys Pro Pro Ser Cys Ser Glu Arg
465 470 475 480
Pro Leu Thr Leu Phe His Thr Val Gln Ser Thr Glu Lys Gln Glu Gln
485 490 495
Arg Asn Ser Ile Ile Asn Ser Ser Leu Glu Ser Val Ser Ser Asn Pro
500 505 510
Asn Ser Ile Leu Asn Ser Ser Ser Ser Leu Gln Pro Asn Met Asn Ser
515 520 525
Ser Asp Pro Asp Leu Ala Val Val Lys Pro Thr Arg Pro Asn Ser Leu
530 535 540
Pro Pro Asn Pro Ser Pro Thr Ser Pro Leu Ser Pro Ser Trp Pro Met
545 550 555 560
Phe Ser Ala Pro Ser Ser Pro Met Pro Thr Ser Ser Thr Ser Ser Asp
565 570 575
Ser Ser Pro Val Arg Ser Val Ala Gly Phe Val Trp Phe Ser Val Ala
580 585 590
Ala Val Val Leu Ser Leu Ala Arg Ser Ser Leu His Ala Val Phe Ser
595 600 605
Leu Leu Val Asn Phe Val Pro Cys His Pro Asn Leu His Leu Leu Phe
610 615 620
Asp Arg Pro Glu Glu Ala Val His Glu Asp Ser Ser Thr Pro Phe Arg
625 630 635 640
Lys Ala Lys Ala Leu Tyr Ala Cys Lys Ala Glu His Asp Ser Glu Leu
645 650 655
Ser Phe Thr Ala Gly Thr Val Phe Asp Asn Val His Pro Ser Gln Glu
660 665 670
Pro Gly Trp Leu Glu Gly Thr Leu Asn Gly Lys Thr Gly Leu Ile Pro
675 680 685
Glu Asn Tyr Val Glu Phe Leu
690 695
<210> 109
<211> 2128
<212> DNA
<213> 智人
<400> 109
aggccggccg ggggcgggga ggctggcggg tcggcgcggg cccagccgtg cgtgctcacg 60
tgacgggtcc gcgaggccca gctcgcgcag tcgttcgggt gagcgaagat ggcggccgag 120
agggaacctc ctccgctggg ggacgggaag cccaccgact ttgaggatct ggaggacgga 180
gaggacctgt tcaccagcac tgtctccacc ctagagtcaa gtccatcatc tccagaacca 240
gctagtcttc ctgcagaaga tattagtgca aactccaatg gcccaaaacc cacagaagtt 300
gtattagatg atgacagaga agatcttttt gcagaagcca cagaagaagt ttctttggac 360
agccctgaaa gggaacctat cctatcctcg gaaccttctc ctgcagtcac acctgtcact 420
cctactacac tcattgctcc tagaattgaa tcaaagagta tgtctgctcc cgtgatcttt 480
gatagatcca gggaagagat tgaagaagaa gcaaatggag acatttttga catagaaatt 540
ggtgtatcag atccagaaaa agttggtgat ggcatgaatg cctatatggc atatagagta 600
acaacaaaga catctctttc catgttcagt aagagtgaat tttcagtgaa aagaagattc 660
agcgactttc ttggtttgca cagcaaatta gcaagcaaat atttacatgt tggttatatt 720
gtgccaccag ctccagaaaa gagtatagta gggatgacca aggtcaaagt gggtaaagaa 780
gactcatcat ccactgagtt tgtagaaaaa cggagagcag ctcttgaaag gtatcttcaa 840
agaacagtaa aacatccaac tttactacag gatcctgatt taaggcagtt cttggaaagt 900
tcagagctgc ctagagcagt taatacacag gctctgagtg gagcaggaat attgaggatg 960
gtgaacaagg ctgccgacgc tgtcaacaaa atgacaatca agatgaatga atcggatgca 1020
tggtttgaag aaaagcagca gcaatttgag aatctggatc agcaacttag gaaacttcat 1080
gtcagtgttg aagccttggt ctgtcataga aaagaacttt cagccaacac agctgccttt 1140
gctaaaagtg ctgccatgtt aggtaattct gaggatcata ctgctttatc tagagctttg 1200
tctcagcttg cagaggttga ggagaagata gaccagttac atcaagaaca agcttttgct 1260
gacttttata tgttttcaga actacttagt gactacattc gtcttattgc tgcagtgaaa 1320
ggtgtgtttg accatcgaat gaagtgctgg cagaaatggg aagatgctca aattactttg 1380
ctcaaaaaac gtgaagctga agcaaaaatg atggttgcta acaaaccaga taaaatacag 1440
caagctaaaa atgaaataag agagtgggag gcgaaagtgc aacaagggga aagagatttt 1500
gaacagatat ctaaaacgat tcgaaaagaa gtgggaagat ttgagaaaga acgagtgaag 1560
gattttaaaa ccgttatcat caagtactta gaatcactag ttcaaacaca acaacagctg 1620
ataaaatact gggaagcatt cctacctgaa gccaaagcca ttgcctagca ataagattgt 1680
tgccgttaag aagaccttgg atgttgttcc agttatgctg gattccacag tgaaatcatt 1740
taaaaccatc taaataaacc actatatatt ttatgaatta catgtggttt tatatacaca 1800
cacacacaca cacacacaca cacacacaca ctctgacatt ttattacaag ctgcatgtcc 1860
tgaccctctt tgaattaagt ggactgtggc atgacattct gcaatacttt gctgaattga 1920
acactattgt gtcttaaata cttgcactaa atagtgcact gcaagaccag aaaattttac 1980
aatatttttt ctttacaata tgttctgtag tatgtttacc ctctttatga agtgaattac 2040
caatgctttg aataatgttc acttatacat tcctgtacag aaattacgat tttgtgatta 2100
cagtaataaa atgatattcc ttgtgaaa 2128
<210> 110
<211> 519
<212> PRT
<213> 智人
<400> 110
Met Ala Ala Glu Arg Glu Pro Pro Pro Leu Gly Asp Gly Lys Pro Thr
1 5 10 15
Asp Phe Glu Asp Leu Glu Asp Gly Glu Asp Leu Phe Thr Ser Thr Val
20 25 30
Ser Thr Leu Glu Ser Ser Pro Ser Ser Pro Glu Pro Ala Ser Leu Pro
35 40 45
Ala Glu Asp Ile Ser Ala Asn Ser Asn Gly Pro Lys Pro Thr Glu Val
50 55 60
Val Leu Asp Asp Asp Arg Glu Asp Leu Phe Ala Glu Ala Thr Glu Glu
65 70 75 80
Val Ser Leu Asp Ser Pro Glu Arg Glu Pro Ile Leu Ser Ser Glu Pro
85 90 95
Ser Pro Ala Val Thr Pro Val Thr Pro Thr Thr Leu Ile Ala Pro Arg
100 105 110
Ile Glu Ser Lys Ser Met Ser Ala Pro Val Ile Phe Asp Arg Ser Arg
115 120 125
Glu Glu Ile Glu Glu Glu Ala Asn Gly Asp Ile Phe Asp Ile Glu Ile
130 135 140
Gly Val Ser Asp Pro Glu Lys Val Gly Asp Gly Met Asn Ala Tyr Met
145 150 155 160
Ala Tyr Arg Val Thr Thr Lys Thr Ser Leu Ser Met Phe Ser Lys Ser
165 170 175
Glu Phe Ser Val Lys Arg Arg Phe Ser Asp Phe Leu Gly Leu His Ser
180 185 190
Lys Leu Ala Ser Lys Tyr Leu His Val Gly Tyr Ile Val Pro Pro Ala
195 200 205
Pro Glu Lys Ser Ile Val Gly Met Thr Lys Val Lys Val Gly Lys Glu
210 215 220
Asp Ser Ser Ser Thr Glu Phe Val Glu Lys Arg Arg Ala Ala Leu Glu
225 230 235 240
Arg Tyr Leu Gln Arg Thr Val Lys His Pro Thr Leu Leu Gln Asp Pro
245 250 255
Asp Leu Arg Gln Phe Leu Glu Ser Ser Glu Leu Pro Arg Ala Val Asn
260 265 270
Thr Gln Ala Leu Ser Gly Ala Gly Ile Leu Arg Met Val Asn Lys Ala
275 280 285
Ala Asp Ala Val Asn Lys Met Thr Ile Lys Met Asn Glu Ser Asp Ala
290 295 300
Trp Phe Glu Glu Lys Gln Gln Gln Phe Glu Asn Leu Asp Gln Gln Leu
305 310 315 320
Arg Lys Leu His Val Ser Val Glu Ala Leu Val Cys His Arg Lys Glu
325 330 335
Leu Ser Ala Asn Thr Ala Ala Phe Ala Lys Ser Ala Ala Met Leu Gly
340 345 350
Asn Ser Glu Asp His Thr Ala Leu Ser Arg Ala Leu Ser Gln Leu Ala
355 360 365
Glu Val Glu Glu Lys Ile Asp Gln Leu His Gln Glu Gln Ala Phe Ala
370 375 380
Asp Phe Tyr Met Phe Ser Glu Leu Leu Ser Asp Tyr Ile Arg Leu Ile
385 390 395 400
Ala Ala Val Lys Gly Val Phe Asp His Arg Met Lys Cys Trp Gln Lys
405 410 415
Trp Glu Asp Ala Gln Ile Thr Leu Leu Lys Lys Arg Glu Ala Glu Ala
420 425 430
Lys Met Met Val Ala Asn Lys Pro Asp Lys Ile Gln Gln Ala Lys Asn
435 440 445
Glu Ile Arg Glu Trp Glu Ala Lys Val Gln Gln Gly Glu Arg Asp Phe
450 455 460
Glu Gln Ile Ser Lys Thr Ile Arg Lys Glu Val Gly Arg Phe Glu Lys
465 470 475 480
Glu Arg Val Lys Asp Phe Lys Thr Val Ile Ile Lys Tyr Leu Glu Ser
485 490 495
Leu Val Gln Thr Gln Gln Gln Leu Ile Lys Tyr Trp Glu Ala Phe Leu
500 505 510
Pro Glu Ala Lys Ala Ile Ala
515
<210> 111
<211> 3052
<212> DNA
<213> 智人
<400> 111
ctctctcaca cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 60
cacacacaca ctcactctat tttgtgctgt cgtaaaaccc acgtgtccag ccgggaagct 120
gccagagcgt ggaaccaagg agccaggacg cggcagcggc caagcgcagc agcccacggc 180
ggttgagtcg ggcgcccagg tccgtccgca ctctcgcgcc ctccgcgggc ctcccaattt 240
tctcgcttgc aggtcgggag gtttccgggc ggcacaatct ctaggactct cctcccgcgc 300
tgctcagggg catgtagcgc acgcagggcg cacactctcg cgcacccgca cgctcaccga 360
gacacccgca cgcacccacc ggcagcaccg agttttcagt tcgaggcgcc ggacatgctg 420
aagcccggag accccggcgg ttcggccttc ctcaaagtgg acccagccta cctgcagcac 480
tggcagcaac tcttccctca cggaggcgca ggcccgctca agggcagcgg cgccgcgggt 540
ctcctgagcg cgccgcagcc tcttcagccg ccgccgccgc ccccgccccc ggagcgcgct 600
gagcctccgc cggacagcct gcgcccgcgg cccgcctctc tctcctccgc ctcgtccacg 660
ccggcttcct cttccacctc cgcctcctcc gcctcctcct gcgctgctgc ggccgctgcc 720
gccgcgctgg ctggtctctc ggccctgccg gtgtcgcagc tgccggtgtt cgcgcctcta 780
gccgccgctg ccgtcgccgc cgagccgctg ccccccaagg aactgtgcct cggcgccacc 840
tccggccccg ggcccgtcaa gtgcggtggt ggtggcggcg gcggcgggga gggtcgcggc 900
gccccgcgct tccgctgcag cgcagaggag ctggactatt acctgtatgg ccagcagcgc 960
atggagatca tcccgctcaa ccagcacacc agcgacccca acaaccgttg cgacatgtgc 1020
gcggacaacc gcaacggcga gtgccctatg catgggccac tgcactcgct gcgccggctt 1080
gtgggcacca gcagcgctgc ggccgccgcg cccccgccgg agctgccgga gtggctgcgg 1140
gacctgcctc gcgaggtgtg cctctgcacc agtactgtgc ccggcctggc ctacggcatc 1200
tgcgcggcgc agaggatcca gcaaggcacc tggattggac ctttccaagg cgtgcttctg 1260
cccccagaga aggtgcaggc aggcgccgtg aggaacacgc agcatctctg ggagatatat 1320
gaccaggatg ggacactaca gcactttatt gatggtgggg aacctagtaa gtcgagctgg 1380
atgaggtata tccgatgtgc aaggcactgc ggagaacaga atctaacagt agttcagtac 1440
aggtcgaata tattctaccg agcctgtata gatatcccta ggggcaccga gcttctggtg 1500
tggtacaatg acagctatac gtctttcttt gggatcccct tacaatgcat tgcccaggat 1560
gaaaacttaa atgtcccttc aacggtaatg gaagccatgt gcagacaaga cgccctgcag 1620
cccttcaaca aaagcagcaa actcgcccct accacccagc agcgctccgt tgttttcccc 1680
cagactccgt gcagcaggaa cttctctctt ctggataagt ctgggcccat tgaatcagga 1740
tttaatcaaa tcaacgtgaa aaaccagcga gtcctggcaa gcccaacttc cacaagccag 1800
ctccactcgg agttcagtga ctggcatctt tggaaatgtg ggcagtgctt taagactttc 1860
acccagcgga tcctcttaca gatgcacgtg tgcacgcaga accccgacag accctaccaa 1920
tgcggccact gctcccagtc cttttcccag ccttcagaac tgaggaacca cgtggtcact 1980
cactctagtg accggccttt caagtgcggc tactgtggtc gtgcctttgc cggggccacc 2040
accctcaaca accacatccg aacccacact ggagaaaagc ccttcaagtg cgagaggtgt 2100
gagaggagct tcacgcaggc cacccagctg agccgacacc agcggatgcc caatgagtgc 2160
aagccaataa ctgagagccc agaatcaatc gaagtggatt aacggattga ctggttggaa 2220
ttaaactgca aggaaagtca tgattaaatg tcacggacac ttaagcaaaa ccaaagattt 2280
cctctgagca actttcaatc agtcccagaa aaccaaaagc agtaataaaa taagtaagat 2340
gttaagagat attgatcctg gcatggaagt cagaccagga aagagattat ttatttatga 2400
cttagggatg agacttattt cagtggacaa ctaacctggg atggttaaca tttccagtcc 2460
caccatgtat tttgctttgt ttctaaaaag ctttttaaaa actgttattt aataccaaag 2520
ggaggaatcg tatgggttct tctgcccacc gttgtgacta agaatgcaca gggacttggt 2580
tctcgttgca ccttttttta gtaacatgtt tcatggggac ccactgtaca gcccttcatt 2640
ctgctgtgtc agtttggcct ggcctgacac tggctgcccc agcggggacc acggaagcag 2700
agtgagagcc ttcgctgagt caatgctacc ttcagcccca gacgcatccc atttccatgt 2760
cttccatgct cactgctcat gcacttttta cacggtttct tccaaacagc ccggtcttga 2820
tgcaggagag tctggaaaag gaagaaaatg gtttcagttt caaaattcaa aggaaaaagt 2880
tgaggactta ttttgtcctg tcaagattgc aagaacatgt aaaatgtacg gagcttcata 2940
atacgttata ttgttccgaa gcagctcgtt gagaaacatt tgttttcaat aacattttag 3000
cttaaaaaaa aaaaaagaaa atgaaaataa agttctttgg tttaaggctg ga 3052
<210> 112
<211> 595
<212> PRT
<213> 智人
<400> 112
Met Leu Lys Pro Gly Asp Pro Gly Gly Ser Ala Phe Leu Lys Val Asp
1 5 10 15
Pro Ala Tyr Leu Gln His Trp Gln Gln Leu Phe Pro His Gly Gly Ala
20 25 30
Gly Pro Leu Lys Gly Ser Gly Ala Ala Gly Leu Leu Ser Ala Pro Gln
35 40 45
Pro Leu Gln Pro Pro Pro Pro Pro Pro Pro Pro Glu Arg Ala Glu Pro
50 55 60
Pro Pro Asp Ser Leu Arg Pro Arg Pro Ala Ser Leu Ser Ser Ala Ser
65 70 75 80
Ser Thr Pro Ala Ser Ser Ser Thr Ser Ala Ser Ser Ala Ser Ser Cys
85 90 95
Ala Ala Ala Ala Ala Ala Ala Ala Leu Ala Gly Leu Ser Ala Leu Pro
100 105 110
Val Ser Gln Leu Pro Val Phe Ala Pro Leu Ala Ala Ala Ala Val Ala
115 120 125
Ala Glu Pro Leu Pro Pro Lys Glu Leu Cys Leu Gly Ala Thr Ser Gly
130 135 140
Pro Gly Pro Val Lys Cys Gly Gly Gly Gly Gly Gly Gly Gly Glu Gly
145 150 155 160
Arg Gly Ala Pro Arg Phe Arg Cys Ser Ala Glu Glu Leu Asp Tyr Tyr
165 170 175
Leu Tyr Gly Gln Gln Arg Met Glu Ile Ile Pro Leu Asn Gln His Thr
180 185 190
Ser Asp Pro Asn Asn Arg Cys Asp Met Cys Ala Asp Asn Arg Asn Gly
195 200 205
Glu Cys Pro Met His Gly Pro Leu His Ser Leu Arg Arg Leu Val Gly
210 215 220
Thr Ser Ser Ala Ala Ala Ala Ala Pro Pro Pro Glu Leu Pro Glu Trp
225 230 235 240
Leu Arg Asp Leu Pro Arg Glu Val Cys Leu Cys Thr Ser Thr Val Pro
245 250 255
Gly Leu Ala Tyr Gly Ile Cys Ala Ala Gln Arg Ile Gln Gln Gly Thr
260 265 270
Trp Ile Gly Pro Phe Gln Gly Val Leu Leu Pro Pro Glu Lys Val Gln
275 280 285
Ala Gly Ala Val Arg Asn Thr Gln His Leu Trp Glu Ile Tyr Asp Gln
290 295 300
Asp Gly Thr Leu Gln His Phe Ile Asp Gly Gly Glu Pro Ser Lys Ser
305 310 315 320
Ser Trp Met Arg Tyr Ile Arg Cys Ala Arg His Cys Gly Glu Gln Asn
325 330 335
Leu Thr Val Val Gln Tyr Arg Ser Asn Ile Phe Tyr Arg Ala Cys Ile
340 345 350
Asp Ile Pro Arg Gly Thr Glu Leu Leu Val Trp Tyr Asn Asp Ser Tyr
355 360 365
Thr Ser Phe Phe Gly Ile Pro Leu Gln Cys Ile Ala Gln Asp Glu Asn
370 375 380
Leu Asn Val Pro Ser Thr Val Met Glu Ala Met Cys Arg Gln Asp Ala
385 390 395 400
Leu Gln Pro Phe Asn Lys Ser Ser Lys Leu Ala Pro Thr Thr Gln Gln
405 410 415
Arg Ser Val Val Phe Pro Gln Thr Pro Cys Ser Arg Asn Phe Ser Leu
420 425 430
Leu Asp Lys Ser Gly Pro Ile Glu Ser Gly Phe Asn Gln Ile Asn Val
435 440 445
Lys Asn Gln Arg Val Leu Ala Ser Pro Thr Ser Thr Ser Gln Leu His
450 455 460
Ser Glu Phe Ser Asp Trp His Leu Trp Lys Cys Gly Gln Cys Phe Lys
465 470 475 480
Thr Phe Thr Gln Arg Ile Leu Leu Gln Met His Val Cys Thr Gln Asn
485 490 495
Pro Asp Arg Pro Tyr Gln Cys Gly His Cys Ser Gln Ser Phe Ser Gln
500 505 510
Pro Ser Glu Leu Arg Asn His Val Val Thr His Ser Ser Asp Arg Pro
515 520 525
Phe Lys Cys Gly Tyr Cys Gly Arg Ala Phe Ala Gly Ala Thr Thr Leu
530 535 540
Asn Asn His Ile Arg Thr His Thr Gly Glu Lys Pro Phe Lys Cys Glu
545 550 555 560
Arg Cys Glu Arg Ser Phe Thr Gln Ala Thr Gln Leu Ser Arg His Gln
565 570 575
Arg Met Pro Asn Glu Cys Lys Pro Ile Thr Glu Ser Pro Glu Ser Ile
580 585 590
Glu Val Asp
595
<210> 113
<211> 2244
<212> DNA
<213> 智人
<400> 113
atggcggccg agagggaacc tcctccgctg ggggacggga agcccaccga ctttgaggat 60
ctggaggacg gagaggacct gttcaccagc actgtctcca ccctagagtc aagtccatca 120
tctccagaac cagctagtct tcctgcagaa gatattagtg caaactccaa tggcccaaaa 180
cccacagaag ttgtattaga tgatgacaga gaagatcttt ttgcagaagc cacagaagaa 240
gtttctttgg acagccctga aagggaacct atcctatcct cggaaccttc tcctgcagtc 300
acacctgtca ctcctactac actcattgct cctagaattg aatcaaagag tatgtctgct 360
cccgtgatct ttgatagatc cagggaagag attgaagaag aagcaaatgg agacattttt 420
gacatagaaa ttggtgtatc agatccagaa aaagttggtg atggcatgaa tgcctatatg 480
gcatatagag taacaacaaa gacatctctt tccatgttca gtaagagtga attttcagtg 540
aaaagaagat tcagcgactt tcttggtttg cacagcaaat tagcaagcaa atatttacat 600
gttggttata ttgtgccacc agctccagaa aagagtatag tagggatgac caaggtcaaa 660
gtgggtaaag aagactcatc atccactgag tttgtagaaa aacggagagc agctcttgaa 720
aggtatcttc aaagaacagt aaaacatcca actttactac aggatcctga tttaaggcag 780
ttcttggaaa gttcagagct gcctagagca gttaatacac aggctctgag tggagcagga 840
atattgagga tggtgaacaa ggctgccgac gctgtcaaca aaatgacaat caagatgaat 900
gaatcggatg catggtttga agaaaagcag cagcaatttg agaatctgga tcagcaactt 960
aggaaacttc atgtcagtgt tgaagccttg gtctgtcata gaaaagaact ttcagccaac 1020
acagctgcct ttgctaaaag tgctgccatg ttaggtaatt ctgaggatca tactgcttta 1080
tctagagctt tgtctcagct tgcagaggtt gaggagaaga tagaccagtt acatcaagaa 1140
caagcttttg ctgactttta tatgttttca gaactactta gtgactacat tcgtcttatt 1200
gctgcagtga aaggtgtgtt tgaccatcga atgaagtgct ggcagaaatg ggaagatgct 1260
caaattactt tgctcaaaaa acgtgaagct gaagcaaaaa tgatggttgc taacaaacca 1320
gataaaatac agcaagctaa aaatgaaata agagagatat atgaccagga tgggacacta 1380
cagcacttta ttgatggtgg ggaacctagt aagtcgagct ggatgaggta tatccgatgt 1440
gcaaggcact gcggagaaca gaatctaaca gtagttcagt acaggtcgaa tatattctac 1500
cgagcctgta tagatatccc taggggcacc gagcttctgg tgtggtacaa tgacagctat 1560
acgtctttct ttgggatccc cttacaatgc attgcccagg atgaaaactt aaatgtccct 1620
tcaacggtaa tggaagccat gtgcagacaa gacgccctgc agcccttcaa caaaagcagc 1680
aaactcgccc ctaccaccca gcagcgctcc gttgttttcc cccagactcc gtgcagcagg 1740
aacttctctc ttctggataa gtctgggccc attgaatcag gatttaatca aatcaacgtg 1800
aaaaaccagc gagtcctggc aagcccaact tccacaagcc agctccactc ggagttcagt 1860
gactggcatc tttggaaatg tgggcagtgc tttaagactt tcacccagcg gatcctctta 1920
cagatgcacg tgtgcacgca gaaccccgac agaccctacc aatgcggcca ctgctcccag 1980
tccttttccc agccttcaga actgaggaac cacgtggtca ctcactctag tgaccggcct 2040
ttcaagtgcg gctactgtgg tcgtgccttt gccggggcca ccaccctcaa caaccacatc 2100
cgaacccaca ctggagaaaa gcccttcaag tgcgagaggt gtgagaggag cttcacgcag 2160
gccacccagc tgagccgaca ccagcggatg cccaatgagt gcaagccaat aactgagagc 2220
ccagaatcaa tcgaagtgga ttaa 2244
<210> 114
<211> 747
<212> PRT
<213> 智人
<400> 114
Met Ala Ala Glu Arg Glu Pro Pro Pro Leu Gly Asp Gly Lys Pro Thr
1 5 10 15
Asp Phe Glu Asp Leu Glu Asp Gly Glu Asp Leu Phe Thr Ser Thr Val
20 25 30
Ser Thr Leu Glu Ser Ser Pro Ser Ser Pro Glu Pro Ala Ser Leu Pro
35 40 45
Ala Glu Asp Ile Ser Ala Asn Ser Asn Gly Pro Lys Pro Thr Glu Val
50 55 60
Val Leu Asp Asp Asp Arg Glu Asp Leu Phe Ala Glu Ala Thr Glu Glu
65 70 75 80
Val Ser Leu Asp Ser Pro Glu Arg Glu Pro Ile Leu Ser Ser Glu Pro
85 90 95
Ser Pro Ala Val Thr Pro Val Thr Pro Thr Thr Leu Ile Ala Pro Arg
100 105 110
Ile Glu Ser Lys Ser Met Ser Ala Pro Val Ile Phe Asp Arg Ser Arg
115 120 125
Glu Glu Ile Glu Glu Glu Ala Asn Gly Asp Ile Phe Asp Ile Glu Ile
130 135 140
Gly Val Ser Asp Pro Glu Lys Val Gly Asp Gly Met Asn Ala Tyr Met
145 150 155 160
Ala Tyr Arg Val Thr Thr Lys Thr Ser Leu Ser Met Phe Ser Lys Ser
165 170 175
Glu Phe Ser Val Lys Arg Arg Phe Ser Asp Phe Leu Gly Leu His Ser
180 185 190
Lys Leu Ala Ser Lys Tyr Leu His Val Gly Tyr Ile Val Pro Pro Ala
195 200 205
Pro Glu Lys Ser Ile Val Gly Met Thr Lys Val Lys Val Gly Lys Glu
210 215 220
Asp Ser Ser Ser Thr Glu Phe Val Glu Lys Arg Arg Ala Ala Leu Glu
225 230 235 240
Arg Tyr Leu Gln Arg Thr Val Lys His Pro Thr Leu Leu Gln Asp Pro
245 250 255
Asp Leu Arg Gln Phe Leu Glu Ser Ser Glu Leu Pro Arg Ala Val Asn
260 265 270
Thr Gln Ala Leu Ser Gly Ala Gly Ile Leu Arg Met Val Asn Lys Ala
275 280 285
Ala Asp Ala Val Asn Lys Met Thr Ile Lys Met Asn Glu Ser Asp Ala
290 295 300
Trp Phe Glu Glu Lys Gln Gln Gln Phe Glu Asn Leu Asp Gln Gln Leu
305 310 315 320
Arg Lys Leu His Val Ser Val Glu Ala Leu Val Cys His Arg Lys Glu
325 330 335
Leu Ser Ala Asn Thr Ala Ala Phe Ala Lys Ser Ala Ala Met Leu Gly
340 345 350
Asn Ser Glu Asp His Thr Ala Leu Ser Arg Ala Leu Ser Gln Leu Ala
355 360 365
Glu Val Glu Glu Lys Ile Asp Gln Leu His Gln Glu Gln Ala Phe Ala
370 375 380
Asp Phe Tyr Met Phe Ser Glu Leu Leu Ser Asp Tyr Ile Arg Leu Ile
385 390 395 400
Ala Ala Val Lys Gly Val Phe Asp His Arg Met Lys Cys Trp Gln Lys
405 410 415
Trp Glu Asp Ala Gln Ile Thr Leu Leu Lys Lys Arg Glu Ala Glu Ala
420 425 430
Lys Met Met Val Ala Asn Lys Pro Asp Lys Ile Gln Gln Ala Lys Asn
435 440 445
Glu Ile Arg Glu Ile Tyr Asp Gln Asp Gly Thr Leu Gln His Phe Ile
450 455 460
Asp Gly Gly Glu Pro Ser Lys Ser Ser Trp Met Arg Tyr Ile Arg Cys
465 470 475 480
Ala Arg His Cys Gly Glu Gln Asn Leu Thr Val Val Gln Tyr Arg Ser
485 490 495
Asn Ile Phe Tyr Arg Ala Cys Ile Asp Ile Pro Arg Gly Thr Glu Leu
500 505 510
Leu Val Trp Tyr Asn Asp Ser Tyr Thr Ser Phe Phe Gly Ile Pro Leu
515 520 525
Gln Cys Ile Ala Gln Asp Glu Asn Leu Asn Val Pro Ser Thr Val Met
530 535 540
Glu Ala Met Cys Arg Gln Asp Ala Leu Gln Pro Phe Asn Lys Ser Ser
545 550 555 560
Lys Leu Ala Pro Thr Thr Gln Gln Arg Ser Val Val Phe Pro Gln Thr
565 570 575
Pro Cys Ser Arg Asn Phe Ser Leu Leu Asp Lys Ser Gly Pro Ile Glu
580 585 590
Ser Gly Phe Asn Gln Ile Asn Val Lys Asn Gln Arg Val Leu Ala Ser
595 600 605
Pro Thr Ser Thr Ser Gln Leu His Ser Glu Phe Ser Asp Trp His Leu
610 615 620
Trp Lys Cys Gly Gln Cys Phe Lys Thr Phe Thr Gln Arg Ile Leu Leu
625 630 635 640
Gln Met His Val Cys Thr Gln Asn Pro Asp Arg Pro Tyr Gln Cys Gly
645 650 655
His Cys Ser Gln Ser Phe Ser Gln Pro Ser Glu Leu Arg Asn His Val
660 665 670
Val Thr His Ser Ser Asp Arg Pro Phe Lys Cys Gly Tyr Cys Gly Arg
675 680 685
Ala Phe Ala Gly Ala Thr Thr Leu Asn Asn His Ile Arg Thr His Thr
690 695 700
Gly Glu Lys Pro Phe Lys Cys Glu Arg Cys Glu Arg Ser Phe Thr Gln
705 710 715 720
Ala Thr Gln Leu Ser Arg His Gln Arg Met Pro Asn Glu Cys Lys Pro
725 730 735
Ile Thr Glu Ser Pro Glu Ser Ile Glu Val Asp
740 745
<210> 115
<211> 518
<212> DNA
<213> 智人
<400> 115
atggcggccg agagggaacc tcctccgctg ggggacggga agcccaccga ctttgaggat 60
ctggaggacg gagaggacct gttcaccagc actgtctcca ccctagagtc aagtccatca 120
tctccagaac cagctagtct tcctgcagaa gatattagtg caaactccaa tggcccaaaa 180
cccacagaag ttgtattaga tgatgacaga gaagatcttt ttgcagaccc taccaatgcg 240
gccactgctc ccagtccttt tcccagcctt cagaactgag gaaccacgtg gtcactcact 300
ctagtgaccg gcctttcaag tgcggctact gtggtcgtgc ctttgccggg gccaccaccc 360
tcaacaacca catccgaacc cacactggag aaaagccctt caagtgcgag aggtgtgaga 420
ggagcttcac gcaggccacc cagctgagcc gacaccagcg gatgcccaat gagtgcaagc 480
caataactga gagcccagaa tcaatcgaag tggattaa 518
<210> 116
<211> 172
<212> PRT
<213> 智人
<400> 116
Met Ala Ala Glu Arg Glu Pro Pro Pro Leu Gly Asp Gly Lys Pro Thr
1 5 10 15
Asp Phe Glu Asp Leu Glu Asp Gly Glu Asp Leu Phe Thr Ser Thr Val
20 25 30
Ser Thr Leu Glu Ser Ser Pro Ser Ser Pro Glu Pro Ala Ser Leu Pro
35 40 45
Ala Glu Asp Ile Ser Ala Asn Ser Asn Gly Pro Lys Pro Thr Glu Val
50 55 60
Val Leu Asp Asp Asp Arg Glu Asp Leu Phe Ala Glu Pro Tyr Gln Cys
65 70 75 80
Gly His Cys Ser Gln Ser Phe Ser Gln Pro Ser Glu Leu Arg Asn His
85 90 95
Val Val Thr His Ser Ser Asp Arg Pro Phe Lys Cys Gly Tyr Cys Gly
100 105 110
Arg Ala Phe Ala Gly Ala Thr Thr Leu Asn Asn His Ile Arg Thr His
115 120 125
Thr Gly Glu Lys Pro Phe Lys Cys Glu Arg Cys Glu Arg Ser Phe Thr
130 135 140
Gln Ala Thr Gln Leu Ser Arg His Gln Arg Met Pro Asn Glu Cys Lys
145 150 155 160
Pro Ile Thr Glu Ser Pro Glu Ser Ile Glu Val Asp
165 170
<210> 117
<211> 16862
<212> DNA
<213> 智人
<400> 117
gaggtgcgcg cgcccgcgcc gatgtgtgtg agtgcgtgtc ctgctcgctc catgttgccg 60
cctctcccgg tacctgctgc tgctcccggg gctgcgggaa atgcgagagg ctgagccggg 120
gaggaggaac ccgagcagca gcggcggcgg cggcggccgc ggcggcggga gccccccagg 180
aggaggaccg ggatccatgt gtctttcctg gtgactagga tgtcgtcgga ggaggacaag 240
agcgtggagc agccgcagcc gccgccacca ccccccgagg agcctggagc cccggccccg 300
agccccgcag ccgcagacaa aagacctcgg ggccggcctc gcaaagatgg cgcttcccct 360
ttccagagag ccagaaagaa acctcgaagt agggggaaaa ctgcagtgga agatgaggac 420
agcatggatg ggctggagac aacagaaaca gaaacgattg tggaaacaga aatcaaagaa 480
caatctgcag aagaggatgc tgaagcagaa gtggataaca gcaaacagct aattccaact 540
cttcagcgat ctgtgtctga ggaatcggca aactccctgg tctctgttgg tgtagaagcc 600
aaaatcagtg aacagctctg cgctttttgt tactgtgggg aaaaaagttc cttaggacaa 660
ggagacttaa aacaattcag aataacgcct ggatttatct tgccatggag aaaccaacct 720
tctaacaaga aggacattga tgacaacagc aatggaacct atgagaaaat gcaaaactca 780
gcaccacgaa aacaaagagg acagagaaaa gaacgatctc ctcagcagaa tatagtatct 840
tgtgtaagtg taagcaccca gacagcttca gatgatcaag ctggtaaact gtgggatgaa 900
ctcagtctgg ttgggcttcc agatgccatt gatatccaag ccttatttga ttctacaggc 960
acttgttggg ctcatcaccg ttgtgtggag tggtcactag gagtatgcca gatggaagaa 1020
ccattgttag tgaacgtgga caaagctgtt gtctcaggga gcacagaacg atgtgcattt 1080
tgtaagcacc ttggagccac tatcaaatgc tgtgaagaga aatgtaccca gatgtatcat 1140
tatccttgtg ctgcaggagc cggcaccttt caggatttca gtcacatctt cctgctttgt 1200
ccagaacaca ttgaccaagc tcctgaaaga tcgaaggaag atgcaaactg tgcagtgtgc 1260
gacagcccgg gagacctctt agatcagttc ttttgtacta cttgtggtca gcactatcat 1320
ggaatgtgcc tggatatagc ggttactcca ttaaaacgtg caggttggca atgtcctgag 1380
tgcaaagtgt gccagaactg caaacaatcg ggagaagata gcaagatgct agtgtgtgat 1440
acgtgtgaca aagggtatca tactttttgt cttcaaccag ttatgaaatc agtaccaacc 1500
aatggctgga aatgcaaaaa ttgcagaata tgtatagagt gtggcacacg gtctagttct 1560
cagtggcacc acaattgcct gatatgtgac aattgttacc aacagcagga taacttatgt 1620
cccttctgtg ggaagtgtta tcatccagaa ttgcagaaag acatgcttca ttgtaatatg 1680
tgcaaaaggt gggttcacct agagtgtgac aaaccaacag atcatgaact ggatactcag 1740
ctcaaagaag agtatatctg catgtattgt aaacacctgg gagctgagat ggatcgttta 1800
cagccaggtg aggaagtgga gatagctgag ctcactacag attataacaa tgaaatggaa 1860
gttgaaggcc ctgaagatca aatggtattc tcagagcagg cagctaataa agatgtcaac 1920
ggtcaggagt ccactcctgg aattgttcca gatgcggttc aagtccacac tgaagagcaa 1980
cagaagagtc atccctcaga aagtcttgac acagatagtc ttcttattgc tgtatcatcc 2040
caacatacag tgaatactga attggaaaaa cagatttcta atgaagttga tagtgaagac 2100
ctgaaaatgt cttctgaagt gaagcatatt tgtggcgaag atcaaattga agataaaatg 2160
gaagtgacag aaaacattga agtcgttaca caccagatca ctgtgcagca agaacaactg 2220
cagttgttag aggaacctga aacagtggta tccagagaag aatcaaggcc tccaaaatta 2280
gtcatggaat ctgtcactct tccactagaa accttagtgt ccccacatga ggaaagtatt 2340
tcattatgtc ctgaggaaca gttggttata gaaaggctac aaggagaaaa ggaacagaaa 2400
gaaaattctg aactttctac tggattgatg gactctgaaa tgactcctac aattgagggt 2460
tgtgtgaaag atgtttcata ccaaggaggc aaatctataa agttatcatc tgagacagag 2520
tcatcatttt catcatcagc agacataagc aaggcagatg tgtcttcctc cccaacacct 2580
tcttcagact tgccttcgca tgacatgctg cataattacc cttcagctct tagttcctct 2640
gctggaaaca tcatgccaac aacttacatc tcagtcactc caaaaattgg catgggtaaa 2700
ccagctatta ctaagagaaa attttctcct ggtagacctc ggtccaaaca gggggcttgg 2760
agtacccata atacagtgag cccaccttcc tggtccccag acatttcaga aggtcgggaa 2820
atttttaaac ccaggcagct tcctggcagt gccatttgga gcatcaaagt gggccgtggg 2880
tctggatttc caggaaagcg gagacctcga ggtgcaggac tgtcggggcg aggtggccga 2940
ggcaggtcaa agctgaaaag tggaatcgga gctgttgtat tacctggggt gtctactgca 3000
gatatttcat caaataagga tgatgaagaa aactctatgc acaatacagt tgtgttgttt 3060
tctagcagtg acaagttcac tttgaatcag gatatgtgtg tagtttgtgg cagttttggc 3120
caaggagcag aaggaagatt acttgcctgt tctcagtgtg gtcagtgtta ccatccatac 3180
tgtgtcagta ttaagatcac taaagtggtt cttagcaaag gttggaggtg tcttgagtgc 3240
actgtgtgtg aggcctgtgg gaaggcaact gacccaggaa gactcctgct gtgtgatgac 3300
tgtgacataa gttatcacac ctactgccta gaccctccat tgcagacagt tcccaaagga 3360
ggctggaagt gcaaatggtg tgtttggtgc agacactgtg gagcaacatc tgcaggtcta 3420
agatgtgaat ggcagaacaa ttacacacag tgcgctcctt gtgcaagctt atcttcctgt 3480
ccagtctgct atcgaaacta tagagaagaa gatcttattc tgcaatgtag acaatgtgat 3540
agatggatgc atgcagtttg tcagaactta aatactgagg aagaagtgga aaatgtagca 3600
gacattggtt ttgattgtag catgtgcaga ccctatatgc ctgcgtctaa tgtgccttcc 3660
tcagactgct gtgaatcttc acttgtagca caaattgtca caaaagtaaa agagctagac 3720
ccacccaaga cttataccca ggatggtgtg tgtttgactg aatcagggat gactcagtta 3780
cagagcctca cagttacagt tccaagaaga aaacggtcaa aaccaaaatt gaaattgaag 3840
attataaatc agaatagcgt ggccgtcctt cagacccctc cagacatcca atcagagcat 3900
tcaagggatg gtgaaatgga tgatagtcga gaaggagaac ttatggattg tgatggaaaa 3960
tcagaatcta gtcctgagcg ggaagctgtg gatgatgaaa ctaagggagt ggaaggaaca 4020
gatggtgtca aaaagagaaa aaggaaacca tacagaccag gtattggtgg atttatggtg 4080
cggcaaagaa gtcgaactgg gcaagggaaa accaaaagat ctgtgatcag aaaagattcc 4140
tcaggctcta tttccgagca gttaccttgc agagatgatg gctggagtga gcagttacca 4200
gatactttag ttgatgaatc tgtttctgtt actgaaagca ctgaaaaaat aaagaagaga 4260
taccgaaaaa ggaaaaataa gcttgaagaa actttccctg cctatttaca agaagctttc 4320
tttggaaaag atcttctaga tacaagtaga caaagcaaga taagtttaga taatctgtca 4380
gaagatggag ctcagctttt atataaaaca aacatgaaca caggtttctt ggatccttcc 4440
ttagatccac tacttagttc atcctcggct ccaacaaaat ctggaactca cggtcctgct 4500
gatgacccat tagctgatat ttctgaagtt ttaaacacag atgatgacat tcttggaata 4560
atttcagatg atctagcaaa atcagttgat cattcagata ttggtcctgt cactgatgat 4620
ccttcctctt tgcctcagcc aaatgtcaat cagagttcac gaccattaag tgaagaacag 4680
ctagatggga tcctcagtcc tgaactagac aaaatggtca cagatggagc aattcttgga 4740
aaattatata aaattccaga gcttggcgga aaagatgttg aagacttatt tacagctgta 4800
cttagtcctg cgaacactca gccaactcca ttgccacagc ctcccccacc aacacagctg 4860
ttgccaatac acaatcagga tgctttttca cggatgcctc tcatgaatgg ccttattgga 4920
tccagtcctc atctcccaca taattctttg ccacctggaa gcggactggg aactttctct 4980
gcaattgcac aatcctctta tcctgatgcc agggataaaa attcagcctt taatccaatg 5040
gcaagtgatc ctaacaactc ttggacatca tcagctccca ctgtggaagg agaaaatgac 5100
acaatgtcga atgcccagag aagcacgctt aagtgggaga aagaggaggc tctgggtgaa 5160
atggcaactg ttgccccagt tctctacacc aatattaatt tccccaactt aaaggaagaa 5220
ttccctgatt ggactactag agtgaagcaa attgccaaat tgtggagaaa agcaagctca 5280
caagaaagag caccatatgt gcaaaaagcc agagataaca gagctgcttt acgcattaat 5340
aaagtacaga tgtcaaatga ttccatgaaa aggcagcaac agcaagatag cattgatccc 5400
agctctcgta ttgattcgga gctttttaaa gatcctttaa agcaaagaga atcagaacat 5460
gaacaggaat ggaaatttag acagcaaatg cgtcagaaaa gtaagcagca agctaaaatt 5520
gaagccacac agaaacttga acaggtgaaa aatgagcagc agcagcagca acaacagcaa 5580
tttggttctc agcatcttct ggtgcagtct ggttcagata caccaagtag tgggatacag 5640
agtcccttga cacctcagcc tggcaatgga aatatgtctc ctgcacagtc attccataaa 5700
gaactgttta caaaacagcc acccagtacc cctacgtcta catcttcaga tgatgtgttt 5760
gtaaagccac aagctccacc tcctcctcca gccccatccc ggattcccat ccaggatagt 5820
ctttctcagg ctcagacttc tcagccaccc tcaccgcaag tgttttcacc tgggtcctct 5880
aactcacgac caccatctcc aatggatcca tatgcaaaaa tggttggtac ccctcgacca 5940
cctcctgtgg gccatagttt ttccagaaga aattctgctg caccagtgga aaactgtaca 6000
cctttatcat cggtatctag gccccttcaa atgaatgaga caacagcaaa taggccatcc 6060
cctgtcagag atttatgttc ttcttccacg acaaataatg acccctatgc aaaacctcca 6120
gacacaccta ggcctgtgat gacagatcaa tttcccaaat ccttgggcct atcccggtct 6180
cctgtagttt cagaacaaac tgcaaaaggc cctatagcag ctggaaccag tgatcacttt 6240
actaaaccat ctcctagggc agatgtgttt caaagacaaa ggatacctga ctcatatgca 6300
cgacccttgt tgacacctgc acctcttgat agtggtcctg gaccttttaa gactccaatg 6360
caacctcctc catcctctca ggatccttat ggatcagtgt cacaggcatc aaggcgattg 6420
tctgttgacc cttatgaaag gcctgctttg acaccaagac ctatagataa tttttctcat 6480
aatcagtcaa atgatccata tagtcagcct ccccttaccc cacatccagc agtgaatgaa 6540
tcttttgccc atccttcaag ggctttttcc cagcctggaa ccatatcaag gccaacatct 6600
caggacccat actcccaacc cccaggaact ccacgacctg ttgtagattc ttattcccaa 6660
tcttcaggaa cagctaggtc caatacagac ccttactctc aacctcctgg aactccccgg 6720
cctactactg ttgacccata tagtcagcag ccccaaaccc caagaccatc tacacaaact 6780
gacttgtttg ttacacctgt aacaaatcag aggcattctg atccatatgc tcatcctcct 6840
ggaacaccaa gacctggaat ttctgtccct tactctcagc caccagcaac accaaggcca 6900
aggatttcag agggttttac taggtcctca atgacaagac cagtcctcat gccaaatcag 6960
gatcctttcc tgcaagcagc acaaaaccga ggaccagctt tacctggccc gttggtaagg 7020
ccacctgata catgttccca gacacctagg ccccctggac ctggtctttc agacacattt 7080
agccgtgttt ccccatctgc tgcccgtgat ccctatgatc agtctccaat gactccaaga 7140
tctcagtctg actcttttgg aacaagtcaa actgcccatg atgttgctga tcagccaagg 7200
cctggatcag aggggagctt ctgtgcatct tcaaactctc caatgcactc ccaaggccag 7260
cagttctctg gtgtctccca acttcctgga cctgtgccaa cttcaggagt aactgataca 7320
cagaatactg taaatatggc ccaagcagat acagagaaat tgagacagcg gcagaagtta 7380
cgtgaaatca ttctccagca gcaacagcag aagaagattg caggtcgaca ggagaagggg 7440
tcacaggact cacccgcagt gcctcatcca gggcctcttc aacactggca accagagaat 7500
gttaaccagg ctttcaccag acccccacct ccctatcctg ggaacattag gtctcctgtt 7560
gcccctcctt taggacctag atatgctgtt ttcccaaaag atcagcgtgg accctatcct 7620
cctgatgttg ctagtatggg gatgagacct catggattta gatttggatt tccaggaggt 7680
agtcatggta ccatgccgag tcaagagcgc ttccttgtgc ctcctcagca aatacaggga 7740
tctggagttt ctccacagct aagaagatca gtatctgtag atatgcctag gcctttaaat 7800
aactcacaaa tgaataatcc agttggactt cctcagcatt tttcaccaca gagcttgcca 7860
gttcagcagc acaacatact gggccaagca tatattgaac tgagacatag ggctcctgac 7920
ggaaggcaac ggctgccttt cagtgctcca cctggcagcg ttgtagaggc atcttctaat 7980
ctgagacatg gaaacttcat tccccggcca gactttccgg gccctagaca cacagacccc 8040
atgcgacgac ctccccaggg tctacctaat cagctacctg tgcacccaga tttggaacaa 8100
gtgccaccat ctcaacaaga gcaaggtcat tctgtccatt catcttctat ggtcatgagg 8160
actctgaacc atccactagg tggtgaattt tcagaagctc ctttgtcaac atctgtaccg 8220
tctgaaacaa cgtctgataa tttacagata accacccagc cttctgatgg tctagaggaa 8280
aaacttgatt ctgatgaccc ttctgtgaag gaactggatg ttaaagacct tgagggggtt 8340
gaagtcaaag acttagatga tgaagatctt gaaaacttaa atttagatac agaggatggc 8400
aaggtagttg aattggatac tttagataat ttggaaacta atgatcccaa cctggatgac 8460
ctcttaaggt caggagagtt tgatatcatt gcatatacag atccagaact tgacatggga 8520
gataagaaaa gcatgtttaa tgaggaacta gaccttccaa ttgatgataa gttagataat 8580
cagtgtgtat ctgttgaacc aaaaaaaaag gaacaagaaa acaaaactct ggttctctct 8640
gataaacatt caccacagaa aaaatccact gttaccaatg aggtaaaaac ggaagtactg 8700
tctccaaatt ctaaggtgga atccaaatgt gaaactgaaa aaaatgatga gaataaagat 8760
aatgttgaca ctccttgctc acaggcttct gctcactcag acctaaatga tggagaaaag 8820
acttctttgc atccttgtga tccagatcta tttgagaaaa gaaccaatcg agaaactgct 8880
ggccccagtg caaatgtcat tcaggcatcc actcaactac ctgctcaaga tgtaataaac 8940
tcttgtggca taactggatc aactccagtt ctctcaagtt tacttgctaa tgagaaatct 9000
gataattcag acattaggcc atcggggtct ccaccaccac caactctgcc ggcctcccca 9060
tccaatcatg tgtcaagttt gcctcctttc atagcaccgc ctggccgtgt tttggataat 9120
gccatgaatt ctaatgtgac agtagtctct agggtaaacc atgttttttc tcagggtgtg 9180
caggtaaacc cagggctcat tccaggtcaa tcaacagtta accacagtct ggggacagga 9240
aaacctgcaa ctcaaactgg gcctcaaaca agtcagtctg gtaccagtag catgtctgga 9300
ccccaacagc taatgattcc tcaaacatta gcacagcaga atagagagag gccccttctt 9360
ctagaagaac agcctctact tctacaggat cttttggatc aagaaaggca agaacagcag 9420
cagcaaagac agatgcaagc catgattcgt cagcgatcag aaccgttctt ccctaatatt 9480
gattttgatg caattacaga tcctataatg aaagccaaaa tggtggccct taaaggtata 9540
aataaagtga tggcacaaaa caatctgggc atgccaccaa tggtgatgag caggttccct 9600
tttatgggcc aggtggtaac tggaacacag aacagtgaag gacagaacct tggaccacag 9660
gccattcctc aggatggcag tataacacat cagatttcta ggcctaatcc tccaaatttt 9720
ggtccaggct ttgtcaatga ttcacagcgt aagcagtatg aagagtggct ccaggagacc 9780
caacagctgc ttcaaatgca gcagaagtat cttgaagaac aaattggtgc tcacagaaaa 9840
tctaagaagg ccctttcagc taaacaacgt actgccaaga aagctgggcg tgaatttcca 9900
gaggaagatg cagaacaact caagcatgtt actgaacagc aaagcatggt tcagaaacag 9960
ctagaacaga ttcgtaaaca acagaaagaa catgctgaat tgattgaaga ttatcggatc 10020
aaacagcagc agcaatgtgc aatggcccca cctaccatga tgcccagtgt ccagccccag 10080
ccacccctaa ttccaggtgc cactccaccc accatgagcc aacccacctt tcccatggtg 10140
ccacagcagc ttcagcacca gcagcacaca acagttattt ctggccatac tagccctgtt 10200
agaatgccca gtttacctgg atggcaaccc aacagtgctc ctgcccacct gcccctcaat 10260
cctcctagaa ttcagccccc aattgcccag ttaccaataa aaacttgtac accagcccca 10320
gggacagtct caaatgcaaa tccacagagt ggaccaccac ctcgggtaga atttgatgac 10380
aacaatccct ttagtgaaag ttttcaagaa cgggaacgta aggaacgttt acgagaacag 10440
caagagagac aacggatcca actcatgcag gaggtagata gacaaagagc tttgcagcag 10500
aggatggaaa tggagcagca tggtatggtg ggctctgaga taagtagtag taggacatct 10560
gtgtcccaga ttcccttcta cagttccgac ttaccttgtg attttatgca acctctagga 10620
ccccttcagc agtctccaca acaccaacag caaatggggc aggttttaca gcagcagaat 10680
atacaacaag gatcaattaa ttcaccctcc acccaaactt tcatgcagac taatgagcga 10740
aggcaggtag gccctccttc atttgttcct gattcaccat caatccctgt tggaagccca 10800
aatttttctt ctgtgaagca gggacatgga aatctttctg ggaccagctt ccagcagtcc 10860
ccagtgaggc cttcttttac acctgcttta ccagcagcac ctccagtagc taatagcagt 10920
ctcccatgtg gccaagattc tactataacc catggacaca gttatccggg atcaacccaa 10980
tcgctcattc agttgtattc tgatataatc ccagaggaaa aagggaaaaa gaaaagaaca 11040
agaaagaaga aaagagatga tgatgcagaa tccaccaagg ctccatcaac tccccattca 11100
gatataactg ccccaccgac tccaggcatc tcagaaacta cctctactcc tgcagtgagc 11160
acacccagtg agcttcctca acaagccgac caagagtcgg tggaaccagt cggcccatcc 11220
actcccaata tggcagcagg ccagctatgt acagaattag agaacaaact gcccaatagt 11280
gatttctcac aagcaactcc aaatcaacag acgtatgcaa attcagaagt agacaagctc 11340
tccatggaaa cccctgccaa aacagaagag ataaaactgg aaaaggctga gacagagtcc 11400
tgcccaggcc aagaggagcc taaattggag gaacagaatg gtagtaaggt agaaggaaac 11460
gctgtagcct gtcctgtctc ctcagcacag agtcctcccc attctgctgg ggcccctgct 11520
gccaaaggag actcagggaa tgaacttctg aaacacttgt tgaaaaataa aaagtcatct 11580
tctcttttga atcaaaaacc tgagggcagt atttgttcag aagatgactg tacaaaggat 11640
aataaactag ttgagaagca gaacccagct gaaggactgc aaactttggg ggctcaaatg 11700
caaggtggtt ttggatgtgg caaccagttg ccaaaaacag atggaggaag tgaaaccaag 11760
aaacagcgaa gcaaacggac tcagaggacg ggtgagaaag cagcacctcg ctcaaagaaa 11820
aggaaaaagg acgaagagga gaaacaagct atgtactcta gcactgacac gtttacccac 11880
ttgaaacagc agaataattt aagtaatcct ccaacacccc ctgcctctct tcctcctaca 11940
ccacctccta tggcttgtca gaagatggcc aatggttttg caacaactga agaacttgct 12000
ggaaaagccg gagtgttagt gagccatgaa gttaccaaaa ctctaggacc taaaccattt 12060
cagctgccct tcagacccca ggacgacttg ttggcccgag ctcttgctca gggccccaag 12120
acagttgatg tgccagcctc cctcccaaca ccacctcata acaatcagga agaattaagg 12180
atacaggatc actgtggtga tcgagatact cctgacagtt ttgttccctc atcctctcct 12240
gagagtgtgg ttggggtaga agtgagcagg tatccagatc tgtcattggt caaggaggag 12300
cctccagaac cggtgccgtc ccccatcatt ccaattcttc ctagcactgc tgggaaaagt 12360
tcagaatcaa gaaggaatga catcaaaact gagccaggca ctttatattt tgcgtcacct 12420
tttggtcctt ccccaaatgg tcccagatca ggtcttatat ctgtagcaat tactctgcat 12480
cctacagctg ctgagaacat tagcagtgtt gtggctgcat tttccgacct tcttcacgtc 12540
cgaatcccta acagctatga ggttagcagt gctccagatg tcccatccat gggtttggtc 12600
agtagccaca gaatcaaccc gggtttggag tatcgacagc atttacttct ccgtgggcct 12660
ccgccaggat ctgcaaaccc tcccagatta gtgagctctt accggctgaa gcagcctaat 12720
gtaccatttc ctccaacaag caatggtctt tctggatata aggattctag tcatggtatt 12780
gcagaaagcg cagcactcag accacagtgg tgttgtcatt gtaaagtggt tattcttgga 12840
agtggtgtgc ggaaatcttt caaagatctg acccttttga acaaggattc ccgagaaagc 12900
accaagaggg tagagaagga cattgtcttc tgtagtaata actgctttat tctttattca 12960
tcaactgcac aagcgaaaaa ctcagaaaac aaggaatcca ttccttcatt gccacaatca 13020
cctatgagag aaacgccttc caaagcattt catcagtaca gcaacaacat ctccactttg 13080
gatgtgcact gtctccccca gctcccagag aaagcttctc cccctgcctc accacccatc 13140
gccttccctc ctgcttttga agcagcccaa gtcgaggcca agccagatga gctgaaggtg 13200
acagtcaagc tgaagcctcg gctaagagct gtccatggtg ggtttgaaga ttgcaggccg 13260
ctcaataaaa aatggagagg aatgaaatgg aagaagtgga gcattcatat tgtaatccct 13320
aaggggacat ttaaaccacc ttgtgaggat gaaatagatg aatttctaaa gaaattgggc 13380
acttccctta aacctgatcc tgtgcccaaa gactatcgga aatgttgctt ttgtcatgaa 13440
gaaggtgatg gattgacaga tggaccagca aggctactca accttgactt ggatctgtgg 13500
gtccacttga actgcgctct gtggtccacg gaggtctatg agactcaggc tggtgcctta 13560
ataaatgtgg agctagctct gaggagaggc ctacaaatga aatgtgtctt ctgtcacaag 13620
acgggtgcca ctagtggatg ccacagattt cgatgcacca acatttatca cttcacttgc 13680
gccattaaag cacaatgcat gttttttaag gacaaaacta tgctttgccc catgcacaaa 13740
ccaaagggaa ttcatgagca agaattaagt tactttgcag tcttcaggag ggtctatgtt 13800
cagcgtgatg aggtgcgaca gattgctagc atcgtgcaac gaggagaacg ggaccatacc 13860
tttcgcgtgg gtagcctcat cttccacaca attggtcagc tgcttccaca gcagatgcaa 13920
gcattccatt ctcctaaagc actcttccct gtgggctatg aagccagccg gctgtactgg 13980
agcactcgct atgccaatag gcgctgccgc tacctgtgct ccattgagga gaaggatggg 14040
cgcccagtgt ttgtcatcag gattgtggaa caaggccatg aagacctggt tctaagtgac 14100
atctcaccta aaggtgtctg ggataagatt ttggagcctg tggcatgtgt gagaaaaaag 14160
tctgaaatgc tccagctttt cccagcgtat ttaaaaggag aggatctgtt tggcctgacc 14220
gtctctgcag tggcacgcat agcggaatca cttcctgggg ttgaggcatg tgaaaattat 14280
accttccgat acggccgaaa tcctctcatg gaacttcctc ttgccgttaa ccccacaggt 14340
tgtgcccgtt ctgaacctaa aatgagtgcc catgtcaaga ggtttgtgtt aaggcctcac 14400
accttaaaca gcaccagcac ctcaaagtca tttcagagca cagtcactgg agaactgaac 14460
gcaccttata gtaaacagtt tgttcactcc aagtcatcgc agtaccggaa gatgaaaact 14520
gaatggaaat ccaatgtgta tctggcacgg tctcggattc aggggctggg cctgtatgct 14580
gctcgagaca ttgagaaaca caccatggtc attgagtaca tcgggactat cattcgaaac 14640
gaagtagcca acaggaaaga gaagctttat gagtctcaga accgtggtgt gtacatgttc 14700
cgcatggata acgaccatgt gattgacgcg acgctcacag gagggcccgc aaggtatatc 14760
aaccattcgt gtgcacctaa ttgtgtggct gaagtggtga cttttgagag aggacacaaa 14820
attatcatca gctccagtcg gagaatccag aaaggagaag agctctgcta tgactataag 14880
tttgactttg aagatgacca gcacaagatt ccgtgtcact gtggagctgt gaactgccgg 14940
aagtggatga actgaaatgc attccttgct agctcagcgg gcggcttgtc cctaggaaga 15000
ggcgattcaa cacaccattg gaattttgca gacagaaaga gatttttgtt ttctgtttta 15060
tgactttttg aaaaagcttc tgggagttct gatttcctca gtcctttagg ttaaagcagc 15120
gccaggagga agctgacaga agcagcgttc ctgaagtggc cgaggttaaa cggaatcaca 15180
gaatggtcca gcacttttgc ttttttttct tttccttttc tttttttttt gtttgttttt 15240
tgttttgttt ttcccttgtg ggtgggtttc attgttttgg ttttctagtc tcactaagga 15300
gaaactttta ctggggcaaa gagccgatgg ctgccctgcc ccgggcaggg gccttcctat 15360
gaatgtaaga ctgaaatcac cagcgagggg gacagagagt gctggccacg gccttattaa 15420
aaaggggcag gccctctaac ttcaaaatgt ttttaaataa agtagacacc actgaacaag 15480
gaatgtactg aaatgacttc cttagggata gagctaaggg ataataactt gcactaaata 15540
catttaaata cttgattcca tgagtcagtt tattgtagtt tttgatttct gtaaaataag 15600
agaaactttt gtatttatta ttgaataagt gaatgaagct atttttaaat aaagttagaa 15660
gaaagccaag ctgctgctgt tacctgcaga actaacaaac cctgttactt tgtacagata 15720
tgtaaatatt ttgagaaaaa atacagtata aaaatagtta ttgaccaaat gctaccaggc 15780
tctgcagcag ctcgggggct tataaaatgt tcatagggat gttacaatat aattttgtgt 15840
tataaaatat gccattataa ttatgtaata accaaaattt caacctagag tgttgggggt 15900
tttttggaaa ccgcagtcta ttagtactca atggttttat acaccttact tctgacagag 15960
cggggcgtat gctacgacta caacttttat agctgttttg gtaatttaaa ctaatttttt 16020
catattatat tgttgcatcc ctacttcttc agtcaggttt ttttgtgctt acaatttgtg 16080
ataactgtga ataactgctt aaaaatacac ccaaatggag gctgaatttt ttcttcagca 16140
aaagtagttt tgattagaac tttgtttcag ccacagagaa tcatgtaaac gtaataggat 16200
catgtagcag aaacttaaat ctaacccttt agccttctat ttaacacaaa aatttgaaaa 16260
agttaaaaaa aaaaaggaga tgtgattatg cttacagctg caggactctg gcaatagggt 16320
ttttggaaga tgtaatttta aaatgtgttt gtatgaactg tttgtttaca tttctttaat 16380
aaaaaaaaca ctgttttgtg tttgcttgta gaaacttaat cagcattttg aaccaggtta 16440
gctttttatt ttgtacttaa aattctggta ctgacacttc acaggctaag tataaaatga 16500
agttttgtgt gcacaattca agtggactgt aaactgttgg tatattcagt gatgcagttc 16560
tgaacttgta tatggcatga tgtattttta tcttacagaa taaatcaatt gtatatattt 16620
ttctcttgat aaatagctgt atgaaatttg tttcctgaat atttttcttc tcttgtacaa 16680
tatcctgaca tcctaccagt atttgtccta ccgggttttt gttgttttct gttctgtata 16740
atagtatcta atgttggcaa aaattgaatt ttttgaagta tacagagtgt tatgggtttt 16800
ggaatttgtg gacacagatt tagaagatca ccatttacaa ataaaatatt ttacatctat 16860
aa 16862
<210> 118
<211> 4911
<212> PRT
<213> 智人
<400> 118
Met Ser Ser Glu Glu Asp Lys Ser Val Glu Gln Pro Gln Pro Pro Pro
1 5 10 15
Pro Pro Pro Glu Glu Pro Gly Ala Pro Ala Pro Ser Pro Ala Ala Ala
20 25 30
Asp Lys Arg Pro Arg Gly Arg Pro Arg Lys Asp Gly Ala Ser Pro Phe
35 40 45
Gln Arg Ala Arg Lys Lys Pro Arg Ser Arg Gly Lys Thr Ala Val Glu
50 55 60
Asp Glu Asp Ser Met Asp Gly Leu Glu Thr Thr Glu Thr Glu Thr Ile
65 70 75 80
Val Glu Thr Glu Ile Lys Glu Gln Ser Ala Glu Glu Asp Ala Glu Ala
85 90 95
Glu Val Asp Asn Ser Lys Gln Leu Ile Pro Thr Leu Gln Arg Ser Val
100 105 110
Ser Glu Glu Ser Ala Asn Ser Leu Val Ser Val Gly Val Glu Ala Lys
115 120 125
Ile Ser Glu Gln Leu Cys Ala Phe Cys Tyr Cys Gly Glu Lys Ser Ser
130 135 140
Leu Gly Gln Gly Asp Leu Lys Gln Phe Arg Ile Thr Pro Gly Phe Ile
145 150 155 160
Leu Pro Trp Arg Asn Gln Pro Ser Asn Lys Lys Asp Ile Asp Asp Asn
165 170 175
Ser Asn Gly Thr Tyr Glu Lys Met Gln Asn Ser Ala Pro Arg Lys Gln
180 185 190
Arg Gly Gln Arg Lys Glu Arg Ser Pro Gln Gln Asn Ile Val Ser Cys
195 200 205
Val Ser Val Ser Thr Gln Thr Ala Ser Asp Asp Gln Ala Gly Lys Leu
210 215 220
Trp Asp Glu Leu Ser Leu Val Gly Leu Pro Asp Ala Ile Asp Ile Gln
225 230 235 240
Ala Leu Phe Asp Ser Thr Gly Thr Cys Trp Ala His His Arg Cys Val
245 250 255
Glu Trp Ser Leu Gly Val Cys Gln Met Glu Glu Pro Leu Leu Val Asn
260 265 270
Val Asp Lys Ala Val Val Ser Gly Ser Thr Glu Arg Cys Ala Phe Cys
275 280 285
Lys His Leu Gly Ala Thr Ile Lys Cys Cys Glu Glu Lys Cys Thr Gln
290 295 300
Met Tyr His Tyr Pro Cys Ala Ala Gly Ala Gly Thr Phe Gln Asp Phe
305 310 315 320
Ser His Ile Phe Leu Leu Cys Pro Glu His Ile Asp Gln Ala Pro Glu
325 330 335
Arg Ser Lys Glu Asp Ala Asn Cys Ala Val Cys Asp Ser Pro Gly Asp
340 345 350
Leu Leu Asp Gln Phe Phe Cys Thr Thr Cys Gly Gln His Tyr His Gly
355 360 365
Met Cys Leu Asp Ile Ala Val Thr Pro Leu Lys Arg Ala Gly Trp Gln
370 375 380
Cys Pro Glu Cys Lys Val Cys Gln Asn Cys Lys Gln Ser Gly Glu Asp
385 390 395 400
Ser Lys Met Leu Val Cys Asp Thr Cys Asp Lys Gly Tyr His Thr Phe
405 410 415
Cys Leu Gln Pro Val Met Lys Ser Val Pro Thr Asn Gly Trp Lys Cys
420 425 430
Lys Asn Cys Arg Ile Cys Ile Glu Cys Gly Thr Arg Ser Ser Ser Gln
435 440 445
Trp His His Asn Cys Leu Ile Cys Asp Asn Cys Tyr Gln Gln Gln Asp
450 455 460
Asn Leu Cys Pro Phe Cys Gly Lys Cys Tyr His Pro Glu Leu Gln Lys
465 470 475 480
Asp Met Leu His Cys Asn Met Cys Lys Arg Trp Val His Leu Glu Cys
485 490 495
Asp Lys Pro Thr Asp His Glu Leu Asp Thr Gln Leu Lys Glu Glu Tyr
500 505 510
Ile Cys Met Tyr Cys Lys His Leu Gly Ala Glu Met Asp Arg Leu Gln
515 520 525
Pro Gly Glu Glu Val Glu Ile Ala Glu Leu Thr Thr Asp Tyr Asn Asn
530 535 540
Glu Met Glu Val Glu Gly Pro Glu Asp Gln Met Val Phe Ser Glu Gln
545 550 555 560
Ala Ala Asn Lys Asp Val Asn Gly Gln Glu Ser Thr Pro Gly Ile Val
565 570 575
Pro Asp Ala Val Gln Val His Thr Glu Glu Gln Gln Lys Ser His Pro
580 585 590
Ser Glu Ser Leu Asp Thr Asp Ser Leu Leu Ile Ala Val Ser Ser Gln
595 600 605
His Thr Val Asn Thr Glu Leu Glu Lys Gln Ile Ser Asn Glu Val Asp
610 615 620
Ser Glu Asp Leu Lys Met Ser Ser Glu Val Lys His Ile Cys Gly Glu
625 630 635 640
Asp Gln Ile Glu Asp Lys Met Glu Val Thr Glu Asn Ile Glu Val Val
645 650 655
Thr His Gln Ile Thr Val Gln Gln Glu Gln Leu Gln Leu Leu Glu Glu
660 665 670
Pro Glu Thr Val Val Ser Arg Glu Glu Ser Arg Pro Pro Lys Leu Val
675 680 685
Met Glu Ser Val Thr Leu Pro Leu Glu Thr Leu Val Ser Pro His Glu
690 695 700
Glu Ser Ile Ser Leu Cys Pro Glu Glu Gln Leu Val Ile Glu Arg Leu
705 710 715 720
Gln Gly Glu Lys Glu Gln Lys Glu Asn Ser Glu Leu Ser Thr Gly Leu
725 730 735
Met Asp Ser Glu Met Thr Pro Thr Ile Glu Gly Cys Val Lys Asp Val
740 745 750
Ser Tyr Gln Gly Gly Lys Ser Ile Lys Leu Ser Ser Glu Thr Glu Ser
755 760 765
Ser Phe Ser Ser Ser Ala Asp Ile Ser Lys Ala Asp Val Ser Ser Ser
770 775 780
Pro Thr Pro Ser Ser Asp Leu Pro Ser His Asp Met Leu His Asn Tyr
785 790 795 800
Pro Ser Ala Leu Ser Ser Ser Ala Gly Asn Ile Met Pro Thr Thr Tyr
805 810 815
Ile Ser Val Thr Pro Lys Ile Gly Met Gly Lys Pro Ala Ile Thr Lys
820 825 830
Arg Lys Phe Ser Pro Gly Arg Pro Arg Ser Lys Gln Gly Ala Trp Ser
835 840 845
Thr His Asn Thr Val Ser Pro Pro Ser Trp Ser Pro Asp Ile Ser Glu
850 855 860
Gly Arg Glu Ile Phe Lys Pro Arg Gln Leu Pro Gly Ser Ala Ile Trp
865 870 875 880
Ser Ile Lys Val Gly Arg Gly Ser Gly Phe Pro Gly Lys Arg Arg Pro
885 890 895
Arg Gly Ala Gly Leu Ser Gly Arg Gly Gly Arg Gly Arg Ser Lys Leu
900 905 910
Lys Ser Gly Ile Gly Ala Val Val Leu Pro Gly Val Ser Thr Ala Asp
915 920 925
Ile Ser Ser Asn Lys Asp Asp Glu Glu Asn Ser Met His Asn Thr Val
930 935 940
Val Leu Phe Ser Ser Ser Asp Lys Phe Thr Leu Asn Gln Asp Met Cys
945 950 955 960
Val Val Cys Gly Ser Phe Gly Gln Gly Ala Glu Gly Arg Leu Leu Ala
965 970 975
Cys Ser Gln Cys Gly Gln Cys Tyr His Pro Tyr Cys Val Ser Ile Lys
980 985 990
Ile Thr Lys Val Val Leu Ser Lys Gly Trp Arg Cys Leu Glu Cys Thr
995 1000 1005
Val Cys Glu Ala Cys Gly Lys Ala Thr Asp Pro Gly Arg Leu Leu
1010 1015 1020
Leu Cys Asp Asp Cys Asp Ile Ser Tyr His Thr Tyr Cys Leu Asp
1025 1030 1035
Pro Pro Leu Gln Thr Val Pro Lys Gly Gly Trp Lys Cys Lys Trp
1040 1045 1050
Cys Val Trp Cys Arg His Cys Gly Ala Thr Ser Ala Gly Leu Arg
1055 1060 1065
Cys Glu Trp Gln Asn Asn Tyr Thr Gln Cys Ala Pro Cys Ala Ser
1070 1075 1080
Leu Ser Ser Cys Pro Val Cys Tyr Arg Asn Tyr Arg Glu Glu Asp
1085 1090 1095
Leu Ile Leu Gln Cys Arg Gln Cys Asp Arg Trp Met His Ala Val
1100 1105 1110
Cys Gln Asn Leu Asn Thr Glu Glu Glu Val Glu Asn Val Ala Asp
1115 1120 1125
Ile Gly Phe Asp Cys Ser Met Cys Arg Pro Tyr Met Pro Ala Ser
1130 1135 1140
Asn Val Pro Ser Ser Asp Cys Cys Glu Ser Ser Leu Val Ala Gln
1145 1150 1155
Ile Val Thr Lys Val Lys Glu Leu Asp Pro Pro Lys Thr Tyr Thr
1160 1165 1170
Gln Asp Gly Val Cys Leu Thr Glu Ser Gly Met Thr Gln Leu Gln
1175 1180 1185
Ser Leu Thr Val Thr Val Pro Arg Arg Lys Arg Ser Lys Pro Lys
1190 1195 1200
Leu Lys Leu Lys Ile Ile Asn Gln Asn Ser Val Ala Val Leu Gln
1205 1210 1215
Thr Pro Pro Asp Ile Gln Ser Glu His Ser Arg Asp Gly Glu Met
1220 1225 1230
Asp Asp Ser Arg Glu Gly Glu Leu Met Asp Cys Asp Gly Lys Ser
1235 1240 1245
Glu Ser Ser Pro Glu Arg Glu Ala Val Asp Asp Glu Thr Lys Gly
1250 1255 1260
Val Glu Gly Thr Asp Gly Val Lys Lys Arg Lys Arg Lys Pro Tyr
1265 1270 1275
Arg Pro Gly Ile Gly Gly Phe Met Val Arg Gln Arg Ser Arg Thr
1280 1285 1290
Gly Gln Gly Lys Thr Lys Arg Ser Val Ile Arg Lys Asp Ser Ser
1295 1300 1305
Gly Ser Ile Ser Glu Gln Leu Pro Cys Arg Asp Asp Gly Trp Ser
1310 1315 1320
Glu Gln Leu Pro Asp Thr Leu Val Asp Glu Ser Val Ser Val Thr
1325 1330 1335
Glu Ser Thr Glu Lys Ile Lys Lys Arg Tyr Arg Lys Arg Lys Asn
1340 1345 1350
Lys Leu Glu Glu Thr Phe Pro Ala Tyr Leu Gln Glu Ala Phe Phe
1355 1360 1365
Gly Lys Asp Leu Leu Asp Thr Ser Arg Gln Ser Lys Ile Ser Leu
1370 1375 1380
Asp Asn Leu Ser Glu Asp Gly Ala Gln Leu Leu Tyr Lys Thr Asn
1385 1390 1395
Met Asn Thr Gly Phe Leu Asp Pro Ser Leu Asp Pro Leu Leu Ser
1400 1405 1410
Ser Ser Ser Ala Pro Thr Lys Ser Gly Thr His Gly Pro Ala Asp
1415 1420 1425
Asp Pro Leu Ala Asp Ile Ser Glu Val Leu Asn Thr Asp Asp Asp
1430 1435 1440
Ile Leu Gly Ile Ile Ser Asp Asp Leu Ala Lys Ser Val Asp His
1445 1450 1455
Ser Asp Ile Gly Pro Val Thr Asp Asp Pro Ser Ser Leu Pro Gln
1460 1465 1470
Pro Asn Val Asn Gln Ser Ser Arg Pro Leu Ser Glu Glu Gln Leu
1475 1480 1485
Asp Gly Ile Leu Ser Pro Glu Leu Asp Lys Met Val Thr Asp Gly
1490 1495 1500
Ala Ile Leu Gly Lys Leu Tyr Lys Ile Pro Glu Leu Gly Gly Lys
1505 1510 1515
Asp Val Glu Asp Leu Phe Thr Ala Val Leu Ser Pro Ala Asn Thr
1520 1525 1530
Gln Pro Thr Pro Leu Pro Gln Pro Pro Pro Pro Thr Gln Leu Leu
1535 1540 1545
Pro Ile His Asn Gln Asp Ala Phe Ser Arg Met Pro Leu Met Asn
1550 1555 1560
Gly Leu Ile Gly Ser Ser Pro His Leu Pro His Asn Ser Leu Pro
1565 1570 1575
Pro Gly Ser Gly Leu Gly Thr Phe Ser Ala Ile Ala Gln Ser Ser
1580 1585 1590
Tyr Pro Asp Ala Arg Asp Lys Asn Ser Ala Phe Asn Pro Met Ala
1595 1600 1605
Ser Asp Pro Asn Asn Ser Trp Thr Ser Ser Ala Pro Thr Val Glu
1610 1615 1620
Gly Glu Asn Asp Thr Met Ser Asn Ala Gln Arg Ser Thr Leu Lys
1625 1630 1635
Trp Glu Lys Glu Glu Ala Leu Gly Glu Met Ala Thr Val Ala Pro
1640 1645 1650
Val Leu Tyr Thr Asn Ile Asn Phe Pro Asn Leu Lys Glu Glu Phe
1655 1660 1665
Pro Asp Trp Thr Thr Arg Val Lys Gln Ile Ala Lys Leu Trp Arg
1670 1675 1680
Lys Ala Ser Ser Gln Glu Arg Ala Pro Tyr Val Gln Lys Ala Arg
1685 1690 1695
Asp Asn Arg Ala Ala Leu Arg Ile Asn Lys Val Gln Met Ser Asn
1700 1705 1710
Asp Ser Met Lys Arg Gln Gln Gln Gln Asp Ser Ile Asp Pro Ser
1715 1720 1725
Ser Arg Ile Asp Ser Glu Leu Phe Lys Asp Pro Leu Lys Gln Arg
1730 1735 1740
Glu Ser Glu His Glu Gln Glu Trp Lys Phe Arg Gln Gln Met Arg
1745 1750 1755
Gln Lys Ser Lys Gln Gln Ala Lys Ile Glu Ala Thr Gln Lys Leu
1760 1765 1770
Glu Gln Val Lys Asn Glu Gln Gln Gln Gln Gln Gln Gln Gln Phe
1775 1780 1785
Gly Ser Gln His Leu Leu Val Gln Ser Gly Ser Asp Thr Pro Ser
1790 1795 1800
Ser Gly Ile Gln Ser Pro Leu Thr Pro Gln Pro Gly Asn Gly Asn
1805 1810 1815
Met Ser Pro Ala Gln Ser Phe His Lys Glu Leu Phe Thr Lys Gln
1820 1825 1830
Pro Pro Ser Thr Pro Thr Ser Thr Ser Ser Asp Asp Val Phe Val
1835 1840 1845
Lys Pro Gln Ala Pro Pro Pro Pro Pro Ala Pro Ser Arg Ile Pro
1850 1855 1860
Ile Gln Asp Ser Leu Ser Gln Ala Gln Thr Ser Gln Pro Pro Ser
1865 1870 1875
Pro Gln Val Phe Ser Pro Gly Ser Ser Asn Ser Arg Pro Pro Ser
1880 1885 1890
Pro Met Asp Pro Tyr Ala Lys Met Val Gly Thr Pro Arg Pro Pro
1895 1900 1905
Pro Val Gly His Ser Phe Ser Arg Arg Asn Ser Ala Ala Pro Val
1910 1915 1920
Glu Asn Cys Thr Pro Leu Ser Ser Val Ser Arg Pro Leu Gln Met
1925 1930 1935
Asn Glu Thr Thr Ala Asn Arg Pro Ser Pro Val Arg Asp Leu Cys
1940 1945 1950
Ser Ser Ser Thr Thr Asn Asn Asp Pro Tyr Ala Lys Pro Pro Asp
1955 1960 1965
Thr Pro Arg Pro Val Met Thr Asp Gln Phe Pro Lys Ser Leu Gly
1970 1975 1980
Leu Ser Arg Ser Pro Val Val Ser Glu Gln Thr Ala Lys Gly Pro
1985 1990 1995
Ile Ala Ala Gly Thr Ser Asp His Phe Thr Lys Pro Ser Pro Arg
2000 2005 2010
Ala Asp Val Phe Gln Arg Gln Arg Ile Pro Asp Ser Tyr Ala Arg
2015 2020 2025
Pro Leu Leu Thr Pro Ala Pro Leu Asp Ser Gly Pro Gly Pro Phe
2030 2035 2040
Lys Thr Pro Met Gln Pro Pro Pro Ser Ser Gln Asp Pro Tyr Gly
2045 2050 2055
Ser Val Ser Gln Ala Ser Arg Arg Leu Ser Val Asp Pro Tyr Glu
2060 2065 2070
Arg Pro Ala Leu Thr Pro Arg Pro Ile Asp Asn Phe Ser His Asn
2075 2080 2085
Gln Ser Asn Asp Pro Tyr Ser Gln Pro Pro Leu Thr Pro His Pro
2090 2095 2100
Ala Val Asn Glu Ser Phe Ala His Pro Ser Arg Ala Phe Ser Gln
2105 2110 2115
Pro Gly Thr Ile Ser Arg Pro Thr Ser Gln Asp Pro Tyr Ser Gln
2120 2125 2130
Pro Pro Gly Thr Pro Arg Pro Val Val Asp Ser Tyr Ser Gln Ser
2135 2140 2145
Ser Gly Thr Ala Arg Ser Asn Thr Asp Pro Tyr Ser Gln Pro Pro
2150 2155 2160
Gly Thr Pro Arg Pro Thr Thr Val Asp Pro Tyr Ser Gln Gln Pro
2165 2170 2175
Gln Thr Pro Arg Pro Ser Thr Gln Thr Asp Leu Phe Val Thr Pro
2180 2185 2190
Val Thr Asn Gln Arg His Ser Asp Pro Tyr Ala His Pro Pro Gly
2195 2200 2205
Thr Pro Arg Pro Gly Ile Ser Val Pro Tyr Ser Gln Pro Pro Ala
2210 2215 2220
Thr Pro Arg Pro Arg Ile Ser Glu Gly Phe Thr Arg Ser Ser Met
2225 2230 2235
Thr Arg Pro Val Leu Met Pro Asn Gln Asp Pro Phe Leu Gln Ala
2240 2245 2250
Ala Gln Asn Arg Gly Pro Ala Leu Pro Gly Pro Leu Val Arg Pro
2255 2260 2265
Pro Asp Thr Cys Ser Gln Thr Pro Arg Pro Pro Gly Pro Gly Leu
2270 2275 2280
Ser Asp Thr Phe Ser Arg Val Ser Pro Ser Ala Ala Arg Asp Pro
2285 2290 2295
Tyr Asp Gln Ser Pro Met Thr Pro Arg Ser Gln Ser Asp Ser Phe
2300 2305 2310
Gly Thr Ser Gln Thr Ala His Asp Val Ala Asp Gln Pro Arg Pro
2315 2320 2325
Gly Ser Glu Gly Ser Phe Cys Ala Ser Ser Asn Ser Pro Met His
2330 2335 2340
Ser Gln Gly Gln Gln Phe Ser Gly Val Ser Gln Leu Pro Gly Pro
2345 2350 2355
Val Pro Thr Ser Gly Val Thr Asp Thr Gln Asn Thr Val Asn Met
2360 2365 2370
Ala Gln Ala Asp Thr Glu Lys Leu Arg Gln Arg Gln Lys Leu Arg
2375 2380 2385
Glu Ile Ile Leu Gln Gln Gln Gln Gln Lys Lys Ile Ala Gly Arg
2390 2395 2400
Gln Glu Lys Gly Ser Gln Asp Ser Pro Ala Val Pro His Pro Gly
2405 2410 2415
Pro Leu Gln His Trp Gln Pro Glu Asn Val Asn Gln Ala Phe Thr
2420 2425 2430
Arg Pro Pro Pro Pro Tyr Pro Gly Asn Ile Arg Ser Pro Val Ala
2435 2440 2445
Pro Pro Leu Gly Pro Arg Tyr Ala Val Phe Pro Lys Asp Gln Arg
2450 2455 2460
Gly Pro Tyr Pro Pro Asp Val Ala Ser Met Gly Met Arg Pro His
2465 2470 2475
Gly Phe Arg Phe Gly Phe Pro Gly Gly Ser His Gly Thr Met Pro
2480 2485 2490
Ser Gln Glu Arg Phe Leu Val Pro Pro Gln Gln Ile Gln Gly Ser
2495 2500 2505
Gly Val Ser Pro Gln Leu Arg Arg Ser Val Ser Val Asp Met Pro
2510 2515 2520
Arg Pro Leu Asn Asn Ser Gln Met Asn Asn Pro Val Gly Leu Pro
2525 2530 2535
Gln His Phe Ser Pro Gln Ser Leu Pro Val Gln Gln His Asn Ile
2540 2545 2550
Leu Gly Gln Ala Tyr Ile Glu Leu Arg His Arg Ala Pro Asp Gly
2555 2560 2565
Arg Gln Arg Leu Pro Phe Ser Ala Pro Pro Gly Ser Val Val Glu
2570 2575 2580
Ala Ser Ser Asn Leu Arg His Gly Asn Phe Ile Pro Arg Pro Asp
2585 2590 2595
Phe Pro Gly Pro Arg His Thr Asp Pro Met Arg Arg Pro Pro Gln
2600 2605 2610
Gly Leu Pro Asn Gln Leu Pro Val His Pro Asp Leu Glu Gln Val
2615 2620 2625
Pro Pro Ser Gln Gln Glu Gln Gly His Ser Val His Ser Ser Ser
2630 2635 2640
Met Val Met Arg Thr Leu Asn His Pro Leu Gly Gly Glu Phe Ser
2645 2650 2655
Glu Ala Pro Leu Ser Thr Ser Val Pro Ser Glu Thr Thr Ser Asp
2660 2665 2670
Asn Leu Gln Ile Thr Thr Gln Pro Ser Asp Gly Leu Glu Glu Lys
2675 2680 2685
Leu Asp Ser Asp Asp Pro Ser Val Lys Glu Leu Asp Val Lys Asp
2690 2695 2700
Leu Glu Gly Val Glu Val Lys Asp Leu Asp Asp Glu Asp Leu Glu
2705 2710 2715
Asn Leu Asn Leu Asp Thr Glu Asp Gly Lys Val Val Glu Leu Asp
2720 2725 2730
Thr Leu Asp Asn Leu Glu Thr Asn Asp Pro Asn Leu Asp Asp Leu
2735 2740 2745
Leu Arg Ser Gly Glu Phe Asp Ile Ile Ala Tyr Thr Asp Pro Glu
2750 2755 2760
Leu Asp Met Gly Asp Lys Lys Ser Met Phe Asn Glu Glu Leu Asp
2765 2770 2775
Leu Pro Ile Asp Asp Lys Leu Asp Asn Gln Cys Val Ser Val Glu
2780 2785 2790
Pro Lys Lys Lys Glu Gln Glu Asn Lys Thr Leu Val Leu Ser Asp
2795 2800 2805
Lys His Ser Pro Gln Lys Lys Ser Thr Val Thr Asn Glu Val Lys
2810 2815 2820
Thr Glu Val Leu Ser Pro Asn Ser Lys Val Glu Ser Lys Cys Glu
2825 2830 2835
Thr Glu Lys Asn Asp Glu Asn Lys Asp Asn Val Asp Thr Pro Cys
2840 2845 2850
Ser Gln Ala Ser Ala His Ser Asp Leu Asn Asp Gly Glu Lys Thr
2855 2860 2865
Ser Leu His Pro Cys Asp Pro Asp Leu Phe Glu Lys Arg Thr Asn
2870 2875 2880
Arg Glu Thr Ala Gly Pro Ser Ala Asn Val Ile Gln Ala Ser Thr
2885 2890 2895
Gln Leu Pro Ala Gln Asp Val Ile Asn Ser Cys Gly Ile Thr Gly
2900 2905 2910
Ser Thr Pro Val Leu Ser Ser Leu Leu Ala Asn Glu Lys Ser Asp
2915 2920 2925
Asn Ser Asp Ile Arg Pro Ser Gly Ser Pro Pro Pro Pro Thr Leu
2930 2935 2940
Pro Ala Ser Pro Ser Asn His Val Ser Ser Leu Pro Pro Phe Ile
2945 2950 2955
Ala Pro Pro Gly Arg Val Leu Asp Asn Ala Met Asn Ser Asn Val
2960 2965 2970
Thr Val Val Ser Arg Val Asn His Val Phe Ser Gln Gly Val Gln
2975 2980 2985
Val Asn Pro Gly Leu Ile Pro Gly Gln Ser Thr Val Asn His Ser
2990 2995 3000
Leu Gly Thr Gly Lys Pro Ala Thr Gln Thr Gly Pro Gln Thr Ser
3005 3010 3015
Gln Ser Gly Thr Ser Ser Met Ser Gly Pro Gln Gln Leu Met Ile
3020 3025 3030
Pro Gln Thr Leu Ala Gln Gln Asn Arg Glu Arg Pro Leu Leu Leu
3035 3040 3045
Glu Glu Gln Pro Leu Leu Leu Gln Asp Leu Leu Asp Gln Glu Arg
3050 3055 3060
Gln Glu Gln Gln Gln Gln Arg Gln Met Gln Ala Met Ile Arg Gln
3065 3070 3075
Arg Ser Glu Pro Phe Phe Pro Asn Ile Asp Phe Asp Ala Ile Thr
3080 3085 3090
Asp Pro Ile Met Lys Ala Lys Met Val Ala Leu Lys Gly Ile Asn
3095 3100 3105
Lys Val Met Ala Gln Asn Asn Leu Gly Met Pro Pro Met Val Met
3110 3115 3120
Ser Arg Phe Pro Phe Met Gly Gln Val Val Thr Gly Thr Gln Asn
3125 3130 3135
Ser Glu Gly Gln Asn Leu Gly Pro Gln Ala Ile Pro Gln Asp Gly
3140 3145 3150
Ser Ile Thr His Gln Ile Ser Arg Pro Asn Pro Pro Asn Phe Gly
3155 3160 3165
Pro Gly Phe Val Asn Asp Ser Gln Arg Lys Gln Tyr Glu Glu Trp
3170 3175 3180
Leu Gln Glu Thr Gln Gln Leu Leu Gln Met Gln Gln Lys Tyr Leu
3185 3190 3195
Glu Glu Gln Ile Gly Ala His Arg Lys Ser Lys Lys Ala Leu Ser
3200 3205 3210
Ala Lys Gln Arg Thr Ala Lys Lys Ala Gly Arg Glu Phe Pro Glu
3215 3220 3225
Glu Asp Ala Glu Gln Leu Lys His Val Thr Glu Gln Gln Ser Met
3230 3235 3240
Val Gln Lys Gln Leu Glu Gln Ile Arg Lys Gln Gln Lys Glu His
3245 3250 3255
Ala Glu Leu Ile Glu Asp Tyr Arg Ile Lys Gln Gln Gln Gln Cys
3260 3265 3270
Ala Met Ala Pro Pro Thr Met Met Pro Ser Val Gln Pro Gln Pro
3275 3280 3285
Pro Leu Ile Pro Gly Ala Thr Pro Pro Thr Met Ser Gln Pro Thr
3290 3295 3300
Phe Pro Met Val Pro Gln Gln Leu Gln His Gln Gln His Thr Thr
3305 3310 3315
Val Ile Ser Gly His Thr Ser Pro Val Arg Met Pro Ser Leu Pro
3320 3325 3330
Gly Trp Gln Pro Asn Ser Ala Pro Ala His Leu Pro Leu Asn Pro
3335 3340 3345
Pro Arg Ile Gln Pro Pro Ile Ala Gln Leu Pro Ile Lys Thr Cys
3350 3355 3360
Thr Pro Ala Pro Gly Thr Val Ser Asn Ala Asn Pro Gln Ser Gly
3365 3370 3375
Pro Pro Pro Arg Val Glu Phe Asp Asp Asn Asn Pro Phe Ser Glu
3380 3385 3390
Ser Phe Gln Glu Arg Glu Arg Lys Glu Arg Leu Arg Glu Gln Gln
3395 3400 3405
Glu Arg Gln Arg Ile Gln Leu Met Gln Glu Val Asp Arg Gln Arg
3410 3415 3420
Ala Leu Gln Gln Arg Met Glu Met Glu Gln His Gly Met Val Gly
3425 3430 3435
Ser Glu Ile Ser Ser Ser Arg Thr Ser Val Ser Gln Ile Pro Phe
3440 3445 3450
Tyr Ser Ser Asp Leu Pro Cys Asp Phe Met Gln Pro Leu Gly Pro
3455 3460 3465
Leu Gln Gln Ser Pro Gln His Gln Gln Gln Met Gly Gln Val Leu
3470 3475 3480
Gln Gln Gln Asn Ile Gln Gln Gly Ser Ile Asn Ser Pro Ser Thr
3485 3490 3495
Gln Thr Phe Met Gln Thr Asn Glu Arg Arg Gln Val Gly Pro Pro
3500 3505 3510
Ser Phe Val Pro Asp Ser Pro Ser Ile Pro Val Gly Ser Pro Asn
3515 3520 3525
Phe Ser Ser Val Lys Gln Gly His Gly Asn Leu Ser Gly Thr Ser
3530 3535 3540
Phe Gln Gln Ser Pro Val Arg Pro Ser Phe Thr Pro Ala Leu Pro
3545 3550 3555
Ala Ala Pro Pro Val Ala Asn Ser Ser Leu Pro Cys Gly Gln Asp
3560 3565 3570
Ser Thr Ile Thr His Gly His Ser Tyr Pro Gly Ser Thr Gln Ser
3575 3580 3585
Leu Ile Gln Leu Tyr Ser Asp Ile Ile Pro Glu Glu Lys Gly Lys
3590 3595 3600
Lys Lys Arg Thr Arg Lys Lys Lys Arg Asp Asp Asp Ala Glu Ser
3605 3610 3615
Thr Lys Ala Pro Ser Thr Pro His Ser Asp Ile Thr Ala Pro Pro
3620 3625 3630
Thr Pro Gly Ile Ser Glu Thr Thr Ser Thr Pro Ala Val Ser Thr
3635 3640 3645
Pro Ser Glu Leu Pro Gln Gln Ala Asp Gln Glu Ser Val Glu Pro
3650 3655 3660
Val Gly Pro Ser Thr Pro Asn Met Ala Ala Gly Gln Leu Cys Thr
3665 3670 3675
Glu Leu Glu Asn Lys Leu Pro Asn Ser Asp Phe Ser Gln Ala Thr
3680 3685 3690
Pro Asn Gln Gln Thr Tyr Ala Asn Ser Glu Val Asp Lys Leu Ser
3695 3700 3705
Met Glu Thr Pro Ala Lys Thr Glu Glu Ile Lys Leu Glu Lys Ala
3710 3715 3720
Glu Thr Glu Ser Cys Pro Gly Gln Glu Glu Pro Lys Leu Glu Glu
3725 3730 3735
Gln Asn Gly Ser Lys Val Glu Gly Asn Ala Val Ala Cys Pro Val
3740 3745 3750
Ser Ser Ala Gln Ser Pro Pro His Ser Ala Gly Ala Pro Ala Ala
3755 3760 3765
Lys Gly Asp Ser Gly Asn Glu Leu Leu Lys His Leu Leu Lys Asn
3770 3775 3780
Lys Lys Ser Ser Ser Leu Leu Asn Gln Lys Pro Glu Gly Ser Ile
3785 3790 3795
Cys Ser Glu Asp Asp Cys Thr Lys Asp Asn Lys Leu Val Glu Lys
3800 3805 3810
Gln Asn Pro Ala Glu Gly Leu Gln Thr Leu Gly Ala Gln Met Gln
3815 3820 3825
Gly Gly Phe Gly Cys Gly Asn Gln Leu Pro Lys Thr Asp Gly Gly
3830 3835 3840
Ser Glu Thr Lys Lys Gln Arg Ser Lys Arg Thr Gln Arg Thr Gly
3845 3850 3855
Glu Lys Ala Ala Pro Arg Ser Lys Lys Arg Lys Lys Asp Glu Glu
3860 3865 3870
Glu Lys Gln Ala Met Tyr Ser Ser Thr Asp Thr Phe Thr His Leu
3875 3880 3885
Lys Gln Gln Asn Asn Leu Ser Asn Pro Pro Thr Pro Pro Ala Ser
3890 3895 3900
Leu Pro Pro Thr Pro Pro Pro Met Ala Cys Gln Lys Met Ala Asn
3905 3910 3915
Gly Phe Ala Thr Thr Glu Glu Leu Ala Gly Lys Ala Gly Val Leu
3920 3925 3930
Val Ser His Glu Val Thr Lys Thr Leu Gly Pro Lys Pro Phe Gln
3935 3940 3945
Leu Pro Phe Arg Pro Gln Asp Asp Leu Leu Ala Arg Ala Leu Ala
3950 3955 3960
Gln Gly Pro Lys Thr Val Asp Val Pro Ala Ser Leu Pro Thr Pro
3965 3970 3975
Pro His Asn Asn Gln Glu Glu Leu Arg Ile Gln Asp His Cys Gly
3980 3985 3990
Asp Arg Asp Thr Pro Asp Ser Phe Val Pro Ser Ser Ser Pro Glu
3995 4000 4005
Ser Val Val Gly Val Glu Val Ser Arg Tyr Pro Asp Leu Ser Leu
4010 4015 4020
Val Lys Glu Glu Pro Pro Glu Pro Val Pro Ser Pro Ile Ile Pro
4025 4030 4035
Ile Leu Pro Ser Thr Ala Gly Lys Ser Ser Glu Ser Arg Arg Asn
4040 4045 4050
Asp Ile Lys Thr Glu Pro Gly Thr Leu Tyr Phe Ala Ser Pro Phe
4055 4060 4065
Gly Pro Ser Pro Asn Gly Pro Arg Ser Gly Leu Ile Ser Val Ala
4070 4075 4080
Ile Thr Leu His Pro Thr Ala Ala Glu Asn Ile Ser Ser Val Val
4085 4090 4095
Ala Ala Phe Ser Asp Leu Leu His Val Arg Ile Pro Asn Ser Tyr
4100 4105 4110
Glu Val Ser Ser Ala Pro Asp Val Pro Ser Met Gly Leu Val Ser
4115 4120 4125
Ser His Arg Ile Asn Pro Gly Leu Glu Tyr Arg Gln His Leu Leu
4130 4135 4140
Leu Arg Gly Pro Pro Pro Gly Ser Ala Asn Pro Pro Arg Leu Val
4145 4150 4155
Ser Ser Tyr Arg Leu Lys Gln Pro Asn Val Pro Phe Pro Pro Thr
4160 4165 4170
Ser Asn Gly Leu Ser Gly Tyr Lys Asp Ser Ser His Gly Ile Ala
4175 4180 4185
Glu Ser Ala Ala Leu Arg Pro Gln Trp Cys Cys His Cys Lys Val
4190 4195 4200
Val Ile Leu Gly Ser Gly Val Arg Lys Ser Phe Lys Asp Leu Thr
4205 4210 4215
Leu Leu Asn Lys Asp Ser Arg Glu Ser Thr Lys Arg Val Glu Lys
4220 4225 4230
Asp Ile Val Phe Cys Ser Asn Asn Cys Phe Ile Leu Tyr Ser Ser
4235 4240 4245
Thr Ala Gln Ala Lys Asn Ser Glu Asn Lys Glu Ser Ile Pro Ser
4250 4255 4260
Leu Pro Gln Ser Pro Met Arg Glu Thr Pro Ser Lys Ala Phe His
4265 4270 4275
Gln Tyr Ser Asn Asn Ile Ser Thr Leu Asp Val His Cys Leu Pro
4280 4285 4290
Gln Leu Pro Glu Lys Ala Ser Pro Pro Ala Ser Pro Pro Ile Ala
4295 4300 4305
Phe Pro Pro Ala Phe Glu Ala Ala Gln Val Glu Ala Lys Pro Asp
4310 4315 4320
Glu Leu Lys Val Thr Val Lys Leu Lys Pro Arg Leu Arg Ala Val
4325 4330 4335
His Gly Gly Phe Glu Asp Cys Arg Pro Leu Asn Lys Lys Trp Arg
4340 4345 4350
Gly Met Lys Trp Lys Lys Trp Ser Ile His Ile Val Ile Pro Lys
4355 4360 4365
Gly Thr Phe Lys Pro Pro Cys Glu Asp Glu Ile Asp Glu Phe Leu
4370 4375 4380
Lys Lys Leu Gly Thr Ser Leu Lys Pro Asp Pro Val Pro Lys Asp
4385 4390 4395
Tyr Arg Lys Cys Cys Phe Cys His Glu Glu Gly Asp Gly Leu Thr
4400 4405 4410
Asp Gly Pro Ala Arg Leu Leu Asn Leu Asp Leu Asp Leu Trp Val
4415 4420 4425
His Leu Asn Cys Ala Leu Trp Ser Thr Glu Val Tyr Glu Thr Gln
4430 4435 4440
Ala Gly Ala Leu Ile Asn Val Glu Leu Ala Leu Arg Arg Gly Leu
4445 4450 4455
Gln Met Lys Cys Val Phe Cys His Lys Thr Gly Ala Thr Ser Gly
4460 4465 4470
Cys His Arg Phe Arg Cys Thr Asn Ile Tyr His Phe Thr Cys Ala
4475 4480 4485
Ile Lys Ala Gln Cys Met Phe Phe Lys Asp Lys Thr Met Leu Cys
4490 4495 4500
Pro Met His Lys Pro Lys Gly Ile His Glu Gln Glu Leu Ser Tyr
4505 4510 4515
Phe Ala Val Phe Arg Arg Val Tyr Val Gln Arg Asp Glu Val Arg
4520 4525 4530
Gln Ile Ala Ser Ile Val Gln Arg Gly Glu Arg Asp His Thr Phe
4535 4540 4545
Arg Val Gly Ser Leu Ile Phe His Thr Ile Gly Gln Leu Leu Pro
4550 4555 4560
Gln Gln Met Gln Ala Phe His Ser Pro Lys Ala Leu Phe Pro Val
4565 4570 4575
Gly Tyr Glu Ala Ser Arg Leu Tyr Trp Ser Thr Arg Tyr Ala Asn
4580 4585 4590
Arg Arg Cys Arg Tyr Leu Cys Ser Ile Glu Glu Lys Asp Gly Arg
4595 4600 4605
Pro Val Phe Val Ile Arg Ile Val Glu Gln Gly His Glu Asp Leu
4610 4615 4620
Val Leu Ser Asp Ile Ser Pro Lys Gly Val Trp Asp Lys Ile Leu
4625 4630 4635
Glu Pro Val Ala Cys Val Arg Lys Lys Ser Glu Met Leu Gln Leu
4640 4645 4650
Phe Pro Ala Tyr Leu Lys Gly Glu Asp Leu Phe Gly Leu Thr Val
4655 4660 4665
Ser Ala Val Ala Arg Ile Ala Glu Ser Leu Pro Gly Val Glu Ala
4670 4675 4680
Cys Glu Asn Tyr Thr Phe Arg Tyr Gly Arg Asn Pro Leu Met Glu
4685 4690 4695
Leu Pro Leu Ala Val Asn Pro Thr Gly Cys Ala Arg Ser Glu Pro
4700 4705 4710
Lys Met Ser Ala His Val Lys Arg Phe Val Leu Arg Pro His Thr
4715 4720 4725
Leu Asn Ser Thr Ser Thr Ser Lys Ser Phe Gln Ser Thr Val Thr
4730 4735 4740
Gly Glu Leu Asn Ala Pro Tyr Ser Lys Gln Phe Val His Ser Lys
4745 4750 4755
Ser Ser Gln Tyr Arg Lys Met Lys Thr Glu Trp Lys Ser Asn Val
4760 4765 4770
Tyr Leu Ala Arg Ser Arg Ile Gln Gly Leu Gly Leu Tyr Ala Ala
4775 4780 4785
Arg Asp Ile Glu Lys His Thr Met Val Ile Glu Tyr Ile Gly Thr
4790 4795 4800
Ile Ile Arg Asn Glu Val Ala Asn Arg Lys Glu Lys Leu Tyr Glu
4805 4810 4815
Ser Gln Asn Arg Gly Val Tyr Met Phe Arg Met Asp Asn Asp His
4820 4825 4830
Val Ile Asp Ala Thr Leu Thr Gly Gly Pro Ala Arg Tyr Ile Asn
4835 4840 4845
His Ser Cys Ala Pro Asn Cys Val Ala Glu Val Val Thr Phe Glu
4850 4855 4860
Arg Gly His Lys Ile Ile Ile Ser Ser Ser Arg Arg Ile Gln Lys
4865 4870 4875
Gly Glu Glu Leu Cys Tyr Asp Tyr Lys Phe Asp Phe Glu Asp Asp
4880 4885 4890
Gln His Lys Ile Pro Cys His Cys Gly Ala Val Asn Cys Arg Lys
4895 4900 4905
Trp Met Asn
4910
<210> 119
<211> 3282
<212> DNA
<213> 智人
<400> 119
gagctggttt attctgcggc cgaggattac atttatgcac gaacgggctt actggttcca 60
gattccccac ttgggcacag gcataggagg cttgttttcc aaattgctgg ttttaattgc 120
acctgccttt cagattacct ctgggaatct gtgggaggag ccgagagggt ggaaaatgtt 180
tcttagcttt gcaaaaggaa gaaaactttg tcacccagcg ggagacctca gccacgagta 240
acccggggag acaccagaac cgggacgggc tttgactgat ttgcctacga gggttccgta 300
ggaaaggacg cttgaattcg gcgcttcggc ggcggcggcg gccgcgcgag ttccctgctc 360
accctccctc tccgcggaag tccccacgag gtggcttcag ggtgtaacag agcgcgcggc 420
tccagtccga aggcagcggc cgggggaggg aaggagggga ccgaaccccc gaggagtttc 480
gcagaatcaa cttctggtta gagttatggg aagcgcggtt atggacacca agaagaaaaa 540
agatgtttcc agccccggcg ggagcggcgg caagaaaaat gccagccaga agaggcgttc 600
gctgcgcgtg cacattccgg acctgagctc cttcgccatg ccgctcctgg acggagacct 660
ggagggttcc ggaaagcatt cctctcgaaa ggtggacagc cccttcggcc cgggcagccc 720
ctccaaaggg ttcttctcca gaggccccca gccccggccc tccagcccca tgtctgcacc 780
tgtgaggccc aagaccagcc ccggctctcc caaaaccgtg ttcccgttct cctaccagga 840
gtccccgcca cgctcccctc gacgcatgag cttcagtggg atcttccgct cctcctccaa 900
agagtcttcc cccaactcca accctgctac ctcgcccggg ggcatcaggt ttttctcccg 960
ctccagaaaa acctccggcc tctcctcctc tccgtcaaca cccacccaag tgaccaagca 1020
gcacacgttt cccctggaat cctataagca cgagcctgaa cggttagaga atcgcatcta 1080
tgcctcgtct tcccccccgg acacagggca gaggttctgc ccgtcttcct tccagagccc 1140
gaccaggcct ccactggcat caccgacaca ctatgctccc tccaaagccg cggcgctggc 1200
ggcggccctg ggacccgcgg aagccggcat gctggagaag ctggagttcg aggacgaagc 1260
agtagaagac tcagaaagtg gtgtttacat gcgattcatg aggtcacaca agtgttatga 1320
catcgttcca accagttcaa agcttgttgt ctttgatact acattacaag ttaaaaaggc 1380
cttctttgct ttggtagcca acggtgtccg agcagcgcca ctgtgggaga gtaaaaaaca 1440
aagttttgta ggaatgctaa caattacaga tttcataaat atactacata gatactataa 1500
atcacctatg gtacagattt atgaattaga ggaacataaa attgaaacat ggagggagct 1560
ttatttacaa gaaacattta agcctttagt gaatatatct ccagatgcaa gcctcttcga 1620
tgctgtatac tccttgatca aaaataaaat ccacagattg cccgttattg accctatcag 1680
tgggaatgca ctttatatac ttacccacaa aagaatcctc aagttcctcc agctttttat 1740
gtctgatatg ccaaagcctg ccttcatgaa gcagaacctg gatgagcttg gaataggaac 1800
gtaccacaac attgccttca tacatccaga cactcccatc atcaaagcct tgaacatatt 1860
tgtggaaaga cgaatatcag ctctgcctgt tgtggatgag tcaggaaaag ttgtagatat 1920
ttattccaaa tttgatgtaa ttaatcttgc tgctgagaaa acatacaata acctagatat 1980
cacggtgacc caggcccttc agcaccgttc acagtatttt gaaggtgttg tgaagtgcaa 2040
taagctggaa atactggaga ccatcgtgga cagaatagta agagctgagg tccatcggct 2100
ggtggtggta aatgaagcag atagtattgt gggtattatt tccctgtcgg acattctgca 2160
agccctgatc ctcacaccag caggtgccaa acaaaaggag acagaaacgg agtgaccgcc 2220
gtgaatgtag acgccctagg aggagaactt gaacaaagtc tctgggtcac gttttgcctc 2280
atgaacactg gctgcaagtg gttaagaatg tatatcaggg tttaacaata ggtatttctt 2340
ccagtgatgt tgaaattaag cttaaaaaag aaagatttta tgtgcttgaa gattcaggct 2400
tgcattaaaa gactgttttc agacctttgt ctgaaggatt ttaaatgctg tatgtcatta 2460
aagtgcactg tgtcctgaag ttttcattat ttttcatttc aaagaattca ctggtatgga 2520
acaggtgatg tggcataagg tgagtgcacg gtatgttcag atcacagtgc cttatgtccg 2580
aatacagcaa tatgtcaccg ccgcagccgg ggcgcacgcg tgtgaaacaa caccgagctt 2640
gaatgtggaa gtctttgaac cttttaccaa atcagtttgt tttctttaga tttgtcaaaa 2700
agttgtaatt tgaatataaa taattacttt aaaattgtaa tgacactttt acacgtaagt 2760
gttttgttct gggctaccgt gtcaacgagg ctgctttaca acagctttat ttatttttac 2820
tttcatgcaa tttttttaca catcttttgg tggagtaaac ttcaccacat ccatgaataa 2880
actctcagtt attttgaaat ggcaaatttc tcattattta agtttggatc tggaaaggac 2940
atgacttctg aaatagccgc tgctgggttt taaaagctga ggtctctcaa agtgtggagg 3000
agacgttgcc gtcaggcggg agccaagtgc cgggaagatg tctatttttt ttcttgtgta 3060
ttgaaatgta aaatcatgat gtttgttatg actgctgatg cgattgtttt tgtaaatttt 3120
attgtggcat atacagtatt gtcatacagt tgaagagaaa caatgtttcc taatgtaagt 3180
gctctgaaaa tgttgacact gtatatatat atatgaggat agtttgtttt ttttttgttt 3240
tgggtttttt tttttcagat tgaaaaatta aaatagatcc ta 3282
<210> 120
<211> 569
<212> PRT
<213> 智人
<400> 120
Met Gly Ser Ala Val Met Asp Thr Lys Lys Lys Lys Asp Val Ser Ser
1 5 10 15
Pro Gly Gly Ser Gly Gly Lys Lys Asn Ala Ser Gln Lys Arg Arg Ser
20 25 30
Leu Arg Val His Ile Pro Asp Leu Ser Ser Phe Ala Met Pro Leu Leu
35 40 45
Asp Gly Asp Leu Glu Gly Ser Gly Lys His Ser Ser Arg Lys Val Asp
50 55 60
Ser Pro Phe Gly Pro Gly Ser Pro Ser Lys Gly Phe Phe Ser Arg Gly
65 70 75 80
Pro Gln Pro Arg Pro Ser Ser Pro Met Ser Ala Pro Val Arg Pro Lys
85 90 95
Thr Ser Pro Gly Ser Pro Lys Thr Val Phe Pro Phe Ser Tyr Gln Glu
100 105 110
Ser Pro Pro Arg Ser Pro Arg Arg Met Ser Phe Ser Gly Ile Phe Arg
115 120 125
Ser Ser Ser Lys Glu Ser Ser Pro Asn Ser Asn Pro Ala Thr Ser Pro
130 135 140
Gly Gly Ile Arg Phe Phe Ser Arg Ser Arg Lys Thr Ser Gly Leu Ser
145 150 155 160
Ser Ser Pro Ser Thr Pro Thr Gln Val Thr Lys Gln His Thr Phe Pro
165 170 175
Leu Glu Ser Tyr Lys His Glu Pro Glu Arg Leu Glu Asn Arg Ile Tyr
180 185 190
Ala Ser Ser Ser Pro Pro Asp Thr Gly Gln Arg Phe Cys Pro Ser Ser
195 200 205
Phe Gln Ser Pro Thr Arg Pro Pro Leu Ala Ser Pro Thr His Tyr Ala
210 215 220
Pro Ser Lys Ala Ala Ala Leu Ala Ala Ala Leu Gly Pro Ala Glu Ala
225 230 235 240
Gly Met Leu Glu Lys Leu Glu Phe Glu Asp Glu Ala Val Glu Asp Ser
245 250 255
Glu Ser Gly Val Tyr Met Arg Phe Met Arg Ser His Lys Cys Tyr Asp
260 265 270
Ile Val Pro Thr Ser Ser Lys Leu Val Val Phe Asp Thr Thr Leu Gln
275 280 285
Val Lys Lys Ala Phe Phe Ala Leu Val Ala Asn Gly Val Arg Ala Ala
290 295 300
Pro Leu Trp Glu Ser Lys Lys Gln Ser Phe Val Gly Met Leu Thr Ile
305 310 315 320
Thr Asp Phe Ile Asn Ile Leu His Arg Tyr Tyr Lys Ser Pro Met Val
325 330 335
Gln Ile Tyr Glu Leu Glu Glu His Lys Ile Glu Thr Trp Arg Glu Leu
340 345 350
Tyr Leu Gln Glu Thr Phe Lys Pro Leu Val Asn Ile Ser Pro Asp Ala
355 360 365
Ser Leu Phe Asp Ala Val Tyr Ser Leu Ile Lys Asn Lys Ile His Arg
370 375 380
Leu Pro Val Ile Asp Pro Ile Ser Gly Asn Ala Leu Tyr Ile Leu Thr
385 390 395 400
His Lys Arg Ile Leu Lys Phe Leu Gln Leu Phe Met Ser Asp Met Pro
405 410 415
Lys Pro Ala Phe Met Lys Gln Asn Leu Asp Glu Leu Gly Ile Gly Thr
420 425 430
Tyr His Asn Ile Ala Phe Ile His Pro Asp Thr Pro Ile Ile Lys Ala
435 440 445
Leu Asn Ile Phe Val Glu Arg Arg Ile Ser Ala Leu Pro Val Val Asp
450 455 460
Glu Ser Gly Lys Val Val Asp Ile Tyr Ser Lys Phe Asp Val Ile Asn
465 470 475 480
Leu Ala Ala Glu Lys Thr Tyr Asn Asn Leu Asp Ile Thr Val Thr Gln
485 490 495
Ala Leu Gln His Arg Ser Gln Tyr Phe Glu Gly Val Val Lys Cys Asn
500 505 510
Lys Leu Glu Ile Leu Glu Thr Ile Val Asp Arg Ile Val Arg Ala Glu
515 520 525
Val His Arg Leu Val Val Val Asn Glu Ala Asp Ser Ile Val Gly Ile
530 535 540
Ile Ser Leu Ser Asp Ile Leu Gln Ala Leu Ile Leu Thr Pro Ala Gly
545 550 555 560
Ala Lys Gln Lys Glu Thr Glu Thr Glu
565
<210> 121
<211> 2325
<212> DNA
<213> 智人
<400> 121
atgtcgtcgg aggaggacaa gagcgtggag cagccgcagc cgccgccacc accccccgag 60
gagcctggag ccccggcccc gagccccgca gccgcagaca aaagacctcg gggccggcct 120
cgcaaagatg gcgcttcccc tttccagaga gccagaaaga aacctcgaag tagggggaaa 180
actgcagtgg aagatgagga cagcatggat gggctggaga caacagaaac agaaacgatt 240
gtggaaacag aaatcaaaga acaatctgca gaagaggatg ctgaagcaga agtggataac 300
agcaaacagc taattccaac tcttcagcga tctgtgtctg aggaatcggc aaactccctg 360
gtctctgttg gtgtagaagc caaaatcagt gaacagctct gcgctttttg ttactgtggg 420
gaaaaaagtt ccttaggaca aggagactta aaacaattca gaataacgcc tggatttatc 480
ttgccatgga gaaaccaacc ttctaacaag aaggacattg atgacaacag caatggaacc 540
tatgagaaaa tgcaaaactc agcaccacga aaacaaagag gacagagaaa agaacgatct 600
cctcagcaga atatagtatc ttgtgtaagt gtaagcaccc agacagcttc agatgatcaa 660
gctggtaaac tgtgggatga actcagtctg gttgggcttc cagatgccat tgatatccaa 720
gccttatttg attctacagg cacttgttgg gctcatcacc gttgtgtgga gtggtcacta 780
ggagtatgcc agatggaaga accattgtta gtgaacgtgg acaaagctgt tgtctcaggg 840
agcacagaac gatgtgcatt ttgtaagcac cttggagcca ctatcaaatg ctgtgaagag 900
aaatgtaccc agatgtatca ttatccttgt gctgcaggag ccggcacctt tcaggatttc 960
agtcacatct tcctgctttg tccagaacac attgaccaag ctcctgaaag atcgaaggaa 1020
gatgcaaact gtgcagtgtg cgacagcccg ggagacctct tagatcagtt cttttgtact 1080
acttgtggtc agcactatca tggaatgtgc ctggatatag cggttactcc attaaaacgt 1140
gcaggttggc aatgtcctga gtgcaaagtg tgccagaact gcaaacaatc gggagaagat 1200
agcaagatgc tagtgtgtga tacgtgtgac aaagggtatc atactttttg tcttcaacca 1260
gttatgaaat cagtaccaac caatggctgg aaatgcaaag cggcgctggc ggcggccctg 1320
ggacccgcgg aagccggcat gctggagaag ctggagttcg aggacgaagc agtagaagac 1380
tcagaaagtg gtgtttacat gcgattcatg aggtcacaca agtgttatga catcgttcca 1440
accagttcaa agcttgttgt ctttgatact acattacaag ttaaaaaggc cttctttgct 1500
ttggtagcca acggtgtccg agcagcgcca ctgtgggaga gtaaaaaaca aagttttgta 1560
ggaatgctaa caattacaga tttcataaat atactacata gatactataa atcacctatg 1620
gtacagattt atgaattaga ggaacataaa attgaaacat ggagggagct ttatttacaa 1680
gaaacattta agcctttagt gaatatatct ccagatgcaa gcctcttcga tgctgtatac 1740
tccttgatca aaaataaaat ccacagattg cccgttattg accctatcag tgggaatgca 1800
ctttatatac ttacccacaa aagaatcctc aagttcctcc agctttttat gtctgatatg 1860
ccaaagcctg ccttcatgaa gcagaacctg gatgagcttg gaataggaac gtaccacaac 1920
attgccttca tacatccaga cactcccatc atcaaagcct tgaacatatt tgtggaaaga 1980
cgaatatcag ctctgcctgt tgtggatgag tcaggaaaag ttgtagatat ttattccaaa 2040
tttgatgtaa ttaatcttgc tgctgagaaa acatacaata acctagatat cacggtgacc 2100
caggcccttc agcaccgttc acagtatttt gaaggtgttg tgaagtgcaa taagctggaa 2160
atactggaga ccatcgtgga cagaatagta agagctgagg tccatcggct ggtggtggta 2220
aatgaagcag atagtattgt gggtattatt tccctgtcgg acattctgca agccctgatc 2280
ctcacaccag caggtgccaa acaaaaggag acagaaacgg agtga 2325
<210> 122
<211> 774
<212> PRT
<213> 智人
<400> 122
Met Ser Ser Glu Glu Asp Lys Ser Val Glu Gln Pro Gln Pro Pro Pro
1 5 10 15
Pro Pro Pro Glu Glu Pro Gly Ala Pro Ala Pro Ser Pro Ala Ala Ala
20 25 30
Asp Lys Arg Pro Arg Gly Arg Pro Arg Lys Asp Gly Ala Ser Pro Phe
35 40 45
Gln Arg Ala Arg Lys Lys Pro Arg Ser Arg Gly Lys Thr Ala Val Glu
50 55 60
Asp Glu Asp Ser Met Asp Gly Leu Glu Thr Thr Glu Thr Glu Thr Ile
65 70 75 80
Val Glu Thr Glu Ile Lys Glu Gln Ser Ala Glu Glu Asp Ala Glu Ala
85 90 95
Glu Val Asp Asn Ser Lys Gln Leu Ile Pro Thr Leu Gln Arg Ser Val
100 105 110
Ser Glu Glu Ser Ala Asn Ser Leu Val Ser Val Gly Val Glu Ala Lys
115 120 125
Ile Ser Glu Gln Leu Cys Ala Phe Cys Tyr Cys Gly Glu Lys Ser Ser
130 135 140
Leu Gly Gln Gly Asp Leu Lys Gln Phe Arg Ile Thr Pro Gly Phe Ile
145 150 155 160
Leu Pro Trp Arg Asn Gln Pro Ser Asn Lys Lys Asp Ile Asp Asp Asn
165 170 175
Ser Asn Gly Thr Tyr Glu Lys Met Gln Asn Ser Ala Pro Arg Lys Gln
180 185 190
Arg Gly Gln Arg Lys Glu Arg Ser Pro Gln Gln Asn Ile Val Ser Cys
195 200 205
Val Ser Val Ser Thr Gln Thr Ala Ser Asp Asp Gln Ala Gly Lys Leu
210 215 220
Trp Asp Glu Leu Ser Leu Val Gly Leu Pro Asp Ala Ile Asp Ile Gln
225 230 235 240
Ala Leu Phe Asp Ser Thr Gly Thr Cys Trp Ala His His Arg Cys Val
245 250 255
Glu Trp Ser Leu Gly Val Cys Gln Met Glu Glu Pro Leu Leu Val Asn
260 265 270
Val Asp Lys Ala Val Val Ser Gly Ser Thr Glu Arg Cys Ala Phe Cys
275 280 285
Lys His Leu Gly Ala Thr Ile Lys Cys Cys Glu Glu Lys Cys Thr Gln
290 295 300
Met Tyr His Tyr Pro Cys Ala Ala Gly Ala Gly Thr Phe Gln Asp Phe
305 310 315 320
Ser His Ile Phe Leu Leu Cys Pro Glu His Ile Asp Gln Ala Pro Glu
325 330 335
Arg Ser Lys Glu Asp Ala Asn Cys Ala Val Cys Asp Ser Pro Gly Asp
340 345 350
Leu Leu Asp Gln Phe Phe Cys Thr Thr Cys Gly Gln His Tyr His Gly
355 360 365
Met Cys Leu Asp Ile Ala Val Thr Pro Leu Lys Arg Ala Gly Trp Gln
370 375 380
Cys Pro Glu Cys Lys Val Cys Gln Asn Cys Lys Gln Ser Gly Glu Asp
385 390 395 400
Ser Lys Met Leu Val Cys Asp Thr Cys Asp Lys Gly Tyr His Thr Phe
405 410 415
Cys Leu Gln Pro Val Met Lys Ser Val Pro Thr Asn Gly Trp Lys Cys
420 425 430
Lys Ala Ala Leu Ala Ala Ala Leu Gly Pro Ala Glu Ala Gly Met Leu
435 440 445
Glu Lys Leu Glu Phe Glu Asp Glu Ala Val Glu Asp Ser Glu Ser Gly
450 455 460
Val Tyr Met Arg Phe Met Arg Ser His Lys Cys Tyr Asp Ile Val Pro
465 470 475 480
Thr Ser Ser Lys Leu Val Val Phe Asp Thr Thr Leu Gln Val Lys Lys
485 490 495
Ala Phe Phe Ala Leu Val Ala Asn Gly Val Arg Ala Ala Pro Leu Trp
500 505 510
Glu Ser Lys Lys Gln Ser Phe Val Gly Met Leu Thr Ile Thr Asp Phe
515 520 525
Ile Asn Ile Leu His Arg Tyr Tyr Lys Ser Pro Met Val Gln Ile Tyr
530 535 540
Glu Leu Glu Glu His Lys Ile Glu Thr Trp Arg Glu Leu Tyr Leu Gln
545 550 555 560
Glu Thr Phe Lys Pro Leu Val Asn Ile Ser Pro Asp Ala Ser Leu Phe
565 570 575
Asp Ala Val Tyr Ser Leu Ile Lys Asn Lys Ile His Arg Leu Pro Val
580 585 590
Ile Asp Pro Ile Ser Gly Asn Ala Leu Tyr Ile Leu Thr His Lys Arg
595 600 605
Ile Leu Lys Phe Leu Gln Leu Phe Met Ser Asp Met Pro Lys Pro Ala
610 615 620
Phe Met Lys Gln Asn Leu Asp Glu Leu Gly Ile Gly Thr Tyr His Asn
625 630 635 640
Ile Ala Phe Ile His Pro Asp Thr Pro Ile Ile Lys Ala Leu Asn Ile
645 650 655
Phe Val Glu Arg Arg Ile Ser Ala Leu Pro Val Val Asp Glu Ser Gly
660 665 670
Lys Val Val Asp Ile Tyr Ser Lys Phe Asp Val Ile Asn Leu Ala Ala
675 680 685
Glu Lys Thr Tyr Asn Asn Leu Asp Ile Thr Val Thr Gln Ala Leu Gln
690 695 700
His Arg Ser Gln Tyr Phe Glu Gly Val Val Lys Cys Asn Lys Leu Glu
705 710 715 720
Ile Leu Glu Thr Ile Val Asp Arg Ile Val Arg Ala Glu Val His Arg
725 730 735
Leu Val Val Val Asn Glu Ala Asp Ser Ile Val Gly Ile Ile Ser Leu
740 745 750
Ser Asp Ile Leu Gln Ala Leu Ile Leu Thr Pro Ala Gly Ala Lys Gln
755 760 765
Lys Glu Thr Glu Thr Glu
770
<210> 123
<211> 1695
<212> DNA
<213> 智人
<400> 123
atgtcgtcgg aggaggacaa gagcgtggag cagccgcagc cgccgccacc accccccgag 60
gagcctggag ccccggcccc gagccccgca gccgcagaca aaagacctcg gggccggcct 120
cgcaaagatg gcgcttcccc tttccagaga gccagaaaga aacctcgaag tagggggaaa 180
actgcagtgg aagatgagga cagcatggat gggctggaga caacagaaac agaaacgatt 240
gtggaaacag aaatcaaaga acaatctgca gaagaggatg ctgaagcaga agtggataac 300
agcaaacagc taattccaac tcttcagcga tctgtgtctg aggaatcggc aaactccctg 360
gtctctgttg gtgtagaagc caaaatcagt gaacagctct gcgctttttg ttactgtggg 420
gaaaaaagtt ccttaggaca aggagactta aaacaattca gaataacgcc tggatttatc 480
ttgccatgga gaaaccaacc ttctaacaag aaggacattg atgacaacag caatggaacc 540
tatgagaaaa tgcaaaactc agcaccacga aaacaaagag gacagagaaa agaacgatct 600
cctcagcaga atatagtatc ttgtgtaagt gtaagcaccc agacagcttc agatgatcaa 660
gctggtaaac tgtgggatga actcagtctg gttgggcttc cagatgccat tgatatccaa 720
gccttatttg attctacagg cacttgttgg gctcatcacc gttgtgtgga gtggtcacta 780
ggagtatgcc agatggaaga accattgtta gtgaacgtgg acaaagctgt tgtctcaggg 840
agcacagaag ttaaaaaggc cttctttgct ttggtagcca acggtgtccg agcagcgcca 900
ctgtgggaga gtaaaaaaca aagttttgta ggaatgctaa caattacaga tttcataaat 960
atactacata gatactataa atcacctatg gtacagattt atgaattaga ggaacataaa 1020
attgaaacat ggagggagct ttatttacaa gaaacattta agcctttagt gaatatatct 1080
ccagatgcaa gcctcttcga tgctgtatac tccttgatca aaaataaaat ccacagattg 1140
cccgttattg accctatcag tgggaatgca ctttatatac ttacccacaa aagaatcctc 1200
aagttcctcc agctttttat gtctgatatg ccaaagcctg ccttcatgaa gcagaacctg 1260
gatgagcttg gaataggaac gtaccacaac attgccttca tacatccaga cactcccatc 1320
atcaaagcct tgaacatatt tgtggaaaga cgaatatcag ctctgcctgt tgtggatgag 1380
tcaggaaaag ttgtagatat ttattccaaa tttgatgtaa ttaatcttgc tgctgagaaa 1440
acatacaata acctagatat cacggtgacc caggcccttc agcaccgttc acagtatttt 1500
gaaggtgttg tgaagtgcaa taagctggaa atactggaga ccatcgtgga cagaatagta 1560
agagctgagg tccatcggct ggtggtggta aatgaagcag atagtattgt gggtattatt 1620
tccctgtcgg acattctgca agccctgatc ctcacaccag caggtgccaa acaaaaggag 1680
acagaaacgg agtga 1695
<210> 124
<211> 566
<212> PRT
<213> 智人
<400> 124
Met Ser Ser Glu Glu Asp Lys Ser Val Glu Gln Pro Gln Pro Pro Pro
1 5 10 15
Pro Pro Pro Glu Glu Pro Gly Ala Pro Ala Pro Ser Pro Ala Ala Ala
20 25 30
Asp Lys Arg Pro Arg Gly Arg Pro Arg Lys Asp Gly Ala Ser Pro Phe
35 40 45
Gln Arg Ala Arg Lys Lys Pro Arg Ser Arg Gly Lys Thr Ala Val Glu
50 55 60
Asp Glu Asp Ser Met Glu Thr Asp Gly Leu Glu Thr Thr Glu Thr Glu
65 70 75 80
Thr Ile Val Glu Thr Glu Ile Lys Glu Gln Ser Ala Glu Glu Asp Ala
85 90 95
Glu Ala Glu Val Asp Asn Ser Lys Gln Leu Ile Pro Thr Leu Gln Arg
100 105 110
Ser Val Ser Glu Glu Ser Ala Asn Ser Leu Val Ser Val Gly Val Glu
115 120 125
Ala Lys Ile Ser Glu Gln Leu Cys Ala Phe Cys Tyr Cys Gly Glu Lys
130 135 140
Ser Ser Leu Gly Gln Gly Asp Leu Lys Gln Phe Arg Ile Thr Pro Gly
145 150 155 160
Phe Ile Leu Pro Trp Arg Asn Gln Pro Ser Asn Lys Lys Asp Ile Asp
165 170 175
Asp Asn Ser Asn Gly Thr Tyr Glu Lys Met Gln Asn Ser Ala Pro Arg
180 185 190
Lys Gln Arg Gly Gln Arg Lys Glu Arg Ser Pro Gln Gln Asn Ile Val
195 200 205
Ser Cys Val Ser Val Ser Thr Gln Thr Ala Ser Asp Asp Gln Ala Gly
210 215 220
Lys Leu Trp Asp Glu Leu Ser Leu Val Gly Leu Pro Asp Ala Ile Asp
225 230 235 240
Ile Gln Ala Leu Phe Asp Ser Thr Gly Thr Cys Trp Ala His His Arg
245 250 255
Cys Val Glu Trp Ser Leu Gly Val Cys Gln Met Glu Glu Pro Leu Leu
260 265 270
Val Asn Val Asp Lys Ala Val Val Ser Gly Ser Thr Glu Val Lys Lys
275 280 285
Ala Phe Phe Ala Leu Val Ala Asn Gly Val Arg Ala Ala Pro Leu Trp
290 295 300
Glu Ser Lys Lys Gln Ser Phe Val Gly Met Leu Thr Ile Thr Asp Phe
305 310 315 320
Ile Asn Ile Leu His Arg Tyr Tyr Lys Ser Pro Met Val Gln Ile Tyr
325 330 335
Glu Leu Glu Glu His Lys Ile Glu Thr Trp Arg Glu Leu Tyr Leu Gln
340 345 350
Glu Thr Phe Lys Pro Leu Val Asn Ile Ser Pro Asp Ala Ser Leu Phe
355 360 365
Asp Ala Val Tyr Ser Leu Ile Lys Asn Lys Ile His Arg Leu Pro Val
370 375 380
Ile Asp Pro Ile Ser Gly Asn Ala Leu Tyr Ile Leu Thr His Lys Arg
385 390 395 400
Ile Leu Lys Phe Leu Gln Leu Phe Met Ser Asp Met Pro Lys Pro Ala
405 410 415
Phe Met Lys Gln Asn Leu Asp Glu Leu Gly Ile Gly Thr Tyr His Asn
420 425 430
Ile Ala Phe Ile His Pro Asp Thr Pro Ile Ile Lys Ala Leu Asn Ile
435 440 445
Phe Val Glu Arg Arg Ile Ser Ala Leu Pro Val Val Asp Glu Ser Gly
450 455 460
Lys Val Val Asp Ile Tyr Ser Lys Phe Asp Val Ile Asn Leu Ala Ala
465 470 475 480
Glu Lys Thr Tyr Asn Asn Leu Asp Ile Thr Val Thr Gln Ala Leu Gln
485 490 495
His Arg Ser Gln Tyr Phe Glu Gly Val Val Lys Cys Asn Lys Leu Glu
500 505 510
Ile Leu Glu Thr Ile Val Asp Arg Ile Val Arg Ala Glu Val His Arg
515 520 525
Leu Val Val Val Asn Glu Ala Asp Ser Ile Val Gly Ile Ile Ser Leu
530 535 540
Ser Asp Ile Leu Gln Ala Leu Ile Leu Thr Pro Ala Gly Ala Lys Gln
545 550 555 560
Lys Glu Thr Glu Thr Glu
565
<210> 125
<211> 4668
<212> DNA
<213> 智人
<400> 125
atgtcgtcgg aggaggacaa gagcgtggag cagccgcagc cgccgccacc accccccgag 60
gagcctggag ccccggcccc gagccccgca gccgcagaca aaagacctcg gggccggcct 120
cgcaaagatg gcgcttcccc tttccagaga gccagaaaga aacctcgaag tagggggaaa 180
actgcagtgg aagatgagga cagcatggat gggctggaga caacagaaac agaaacgatt 240
gtggaaacag aaatcaaaga acaatctgca gaagaggatg ctgaagcaga agtggataac 300
agcaaacagc taattccaac tcttcagcga tctgtgtctg aggaatcggc aaactccctg 360
gtctctgttg gtgtagaagc caaaatcagt gaacagctct gcgctttttg ttactgtggg 420
gaaaaaagtt ccttaggaca aggagactta aaacaattca gaataacgcc tggatttatc 480
ttgccatgga gaaaccaacc ttctaacaag aaggacattg atgacaacag caatggaacc 540
tatgagaaaa tgcaaaactc agcaccacga aaacaaagag gacagagaaa agaacgatct 600
cctcagcaga atatagtatc ttgtgtaagt gtaagcaccc agacagcttc agatgatcaa 660
gctggtaaac tgtgggatga actcagtctg gttgggcttc cagatgccat tgatatccaa 720
gccttatttg attctacagg cacttgttgg gctcatcacc gttgtgtgga gtggtcacta 780
ggagtatgcc agatggaaga accattgtta gtgaacgtgg acaaagctgt tgtctcaggg 840
agcacagaac gatgtgcatt ttgtaagcac cttggagcca ctatcaaatg ctgtgaagag 900
aaatgtaccc agatgtatca ttatccttgt gctgcaggag ccggcacctt tcaggatttc 960
agtcacatct tcctgctttg tccagaacac attgaccaag ctcctgaaag atcgaaggaa 1020
gatgcaaact gtgcagtgtg cgacagcccg ggagacctct tagatcagtt cttttgtact 1080
acttgtggtc agcactatca tggaatgtgc ctggatatag cggttactcc attaaaacgt 1140
gcaggttggc aatgtcctga gtgcaaagtg tgccagaact gcaaacaatc gggagaagat 1200
agcaagatgc tagtgtgtga tacgtgtgac aaagggtatc atactttttg tcttcaacca 1260
gttatgaaat cagtaccaac caatggctgg aaatgcaaaa attgcagaat atgtatagag 1320
tgtggcacac ggtctagttc tcagtggcac cacaattgcc tgatatgtga caattgttac 1380
caacagcagg ataacttatg tcccttctgt gggaagtgtt atcatccaga attgcagaaa 1440
gacatgcttc attgtaatat gtgcaaaagg tgggttcacc tagagtgtga caaaccaaca 1500
gatcatgaac tggatactca gctcaaagaa gagtatatct gcatgtattg taaacacctg 1560
ggagctgaga tggatcgttt acagccaggt gaggaagtgg agatagctga gctcactaca 1620
gattataaca atgaaatgga agttgaaggc cctgaagatc aaatggtatt ctcagagcag 1680
gcagctaata aagatgtcaa cggtcaggag tccactcctg gaattgttcc agatgcggtt 1740
caagtccaca ctgaagagca acagaagagt catccctcag aaagtcttga cacagatagt 1800
cttcttattg ctgtatcatc ccaacataca gtgaatactg aattggaaaa acagatttct 1860
aatgaagttg atagtgaaga cctgaaaatg tcttctgaag tgaagcatat ttgtggcgaa 1920
gatcaaattg aagataaaat ggaagtgaca gaaaacattg aagtcgttac acaccagatc 1980
actgtgcagc aagaacaact gcagttgtta gaggaacctg aaacagtggt atccagagaa 2040
gaatcaaggc ctccaaaatt agtcatggaa tctgtcactc ttccactaga aaccttagtg 2100
tccccacatg aggaaagtat ttcattatgt cctgaggaac agttggttat agaaaggcta 2160
caaggagaaa aggaacagaa agaaaattct gaactttcta ctggattgat ggactctgaa 2220
atgactccta caattgaggg ttgtgtgaaa gatgtttcat accaaggagg caaatctata 2280
aagttatcat ctgagacaga gtcatcattt tcatcatcag cagacataag caaggcagat 2340
gtgtcttcct ccccaacacc ttcttcagac ttgccttcgc atgacatgct gcataattac 2400
ccttcagctc ttagttcctc tgctggaaac atcatgccaa caacttacat ctcagtcact 2460
ccaaaaattg gcatgggtaa accagctatt actaagagaa aattttctcc tggtagacct 2520
cggtccaaac agggggcttg gagtacccat aatacagtga gcccaccttc ctggtcccca 2580
gacatttcag aaggtcggga aatttttaaa cccaggcagc ttcctggcag tgccatttgg 2640
agcatcaaag tgggccgtgg gtctggattt ccaggaaagc ggagacctcg aggtgcagga 2700
ctgtcggggc gaggtggccg aggcaggtca aagctgaaaa gtggaatcgg agctgttgta 2760
ttacctgggg tgtctactgc agatatttca tcaaataagg atgatgaaga aaactctatg 2820
cacaatacag ttgtgttgtt ttctagcagt gacaagttca ctttgaatca ggatatgtgt 2880
gtagtttgtg gcagttttgg ccaaggagca gaaggaagat tacttgcctg ttctcagtgt 2940
ggtcagtgtt accatccata ctgtgtcagt attaagatca ctaaagtggt tcttagcaaa 3000
ggttggaggt gtcttgagtg cactgtgtgt gaggcctgtg ggaaggcaac tgacccagga 3060
agactcctgc tgtgtgatga ctgtgacata agttatcaca cctactgcct agaccctcca 3120
ttgcagacag ttcccaaagg aggctggaag tgcaaatggt gtgtttggtg cagacactgt 3180
ggagcaacat ctgcaggtct aagatgtgaa tggcagaaca attacacaca gtgcgctcct 3240
tgtgcaagct tatcttcctg tccagtctgc tatcgaaact atagagaaga agatcttatt 3300
ctgcaatgta gacaatgtga tagatggatg catgcagttt gtcagaactt aaatactgag 3360
gaagaagtgg aaaatgtagc agacattggt tttgattgta gcatgtgcag accctatatg 3420
cctgcgtcta atgtgccttc ctcagactgc tgtgaatctt cacttgtagc acaaattgtc 3480
acaaaagtaa aagagctaga cccacccaag acttataccc aggatggtgt gtgtttgact 3540
gaatcaggga tgactcagtt acagagcctc acagttacag ttccaagaag aaaacggtca 3600
aaaccaaaat tgaaattgaa gattataaat cagaatagcg tggccgtcct tcagacccct 3660
ccagacatcc aatcagagca ttcaagggat ggtgaaatgg atgatagtcg agcagtagaa 3720
gactcagaaa gtggtgttta catgcgattc atgaggtcac acaagtgtta tgacatcgtt 3780
ccaaccagtt caaagcttgt tgtctttgat actacattac aagttaaaaa ggccttcttt 3840
gctttggtag ccaacggtgt ccgagcagcg ccactgtggg agagtaaaaa acaaagtttt 3900
gtaggaatgc taacaattac agatttcata aatatactac atagatacta taaatcacct 3960
atggtacaga tttatgaatt agaggaacat aaaattgaaa catggaggga gctttattta 4020
caagaaacat ttaagccttt agtgaatata tctccagatg caagcctctt cgatgctgta 4080
tactccttga tcaaaaataa aatccacaga ttgcccgtta ttgaccctat cagtgggaat 4140
gcactttata tacttaccca caaaagaatc ctcaagttcc tccagctttt tatgtctgat 4200
atgccaaagc ctgccttcat gaagcagaac ctggatgagc ttggaatagg aacgtaccac 4260
aacattgcct tcatacatcc agacactccc atcatcaaag ccttgaacat atttgtggaa 4320
agacgaatat cagctctgcc tgttgtggat gagtcaggaa aagttgtaga tatttattcc 4380
aaatttgatg taattaatct tgctgctgag aaaacataca ataacctaga tatcacggtg 4440
acccaggccc ttcagcaccg ttcacagtat tttgaaggtg ttgtgaagtg caataagctg 4500
gaaatactgg agaccatcgt ggacagaata gtaagagctg aggtccatcg gctggtggtg 4560
gtaaatgaag cagatagtat tgtgggtatt atttccctgt cggacattct gcaagccctg 4620
atcctcacac cagcaggtgc caaacaaaag gagacagaaa cggagtga 4668
<210> 126
<211> 1557
<212> PRT
<213> 智人
<400> 126
Met Ser Ser Glu Glu Asp Lys Ser Val Glu Gln Pro Gln Pro Pro Pro
1 5 10 15
Pro Pro Pro Glu Glu Pro Gly Ala Pro Ala Pro Ser Pro Ala Ala Ala
20 25 30
Asp Lys Arg Pro Arg Gly Arg Pro Arg Lys Asp Gly Ala Ser Pro Phe
35 40 45
Gln Arg Ala Arg Lys Lys Pro Arg Ser Arg Gly Lys Thr Ala Val Glu
50 55 60
Asp Glu Asp Ser Met Asp Gly Leu Glu Thr Thr Glu Thr Glu Thr Ile
65 70 75 80
Val Glu Thr Glu Ile Lys Glu Gln Ser Ala Glu Glu Asp Ala Glu Ala
85 90 95
Glu Val Asp Asn Ser Lys Gln Leu Ile Pro Thr Leu Gln Arg Ser Val
100 105 110
Ser Glu Glu Ser Ala Asn Ser Leu Val Ser Val Gly Val Glu Ala Lys
115 120 125
Ile Ser Glu Gln Leu Cys Ala Phe Cys Tyr Cys Gly Glu Lys Ser Ser
130 135 140
Leu Gly Gln Gly Asp Leu Lys Gln Phe Arg Ile Thr Pro Gly Phe Ile
145 150 155 160
Leu Pro Trp Arg Asn Gln Pro Ser Asn Lys Lys Asp Ile Asp Asp Asn
165 170 175
Ser Asn Gly Thr Tyr Glu Lys Met Gln Asn Ser Ala Pro Arg Lys Gln
180 185 190
Arg Gly Gln Arg Lys Glu Arg Ser Pro Gln Gln Asn Ile Val Ser Cys
195 200 205
Val Ser Val Ser Thr Gln Thr Ala Ser Asp Asp Gln Ala Gly Lys Leu
210 215 220
Trp Asp Glu Leu Ser Leu Val Gly Leu Pro Asp Ala Ile Asp Ile Gln
225 230 235 240
Ala Leu Phe Asp Ser Thr Gly Thr Cys Trp Ala His His Arg Cys Val
245 250 255
Glu Trp Ser Leu Gly Val Cys Gln Met Glu Glu Pro Leu Leu Val Asn
260 265 270
Val Asp Lys Ala Val Val Ser Gly Ser Thr Glu Arg Cys Ala Phe Cys
275 280 285
Lys His Leu Gly Ala Thr Ile Lys Cys Cys Glu Glu Lys Cys Thr Gln
290 295 300
Met Tyr His Tyr Pro Cys Ala Ala Gly Ala Gly Thr Phe Gln Asp Phe
305 310 315 320
Ser His Ile Phe Leu Leu Cys Pro Glu His Ile Asp Gln Ala Pro Glu
325 330 335
Arg Ser Lys Glu Asp Ala Asn Cys Ala Val Cys Asp Ser Pro Gly Asp
340 345 350
Leu Leu Asp Gln Phe Phe Cys Thr Thr Cys Gly Gln His Tyr His Gly
355 360 365
Met Cys Leu Asp Ile Ala Val Thr Pro Leu Lys Arg Ala Gly Trp Gln
370 375 380
Cys Pro Glu Cys Lys Val Cys Gln Asn Cys Lys Gln Ser Gly Glu Asp
385 390 395 400
Ser Lys Met Leu Val Cys Asp Thr Cys Asp Lys Gly Tyr His Thr Phe
405 410 415
Cys Leu Gln Pro Val Met Lys Ser Val Pro Thr Asn Gly Trp Lys Cys
420 425 430
Lys Asn Cys Arg Ile Cys Ile Glu Cys Gly Thr Arg Ser Ser Ser Gln
435 440 445
Trp His His Asn Cys Leu Ile Cys Asp Asn Cys Tyr Gln Gln Gln Asp
450 455 460
Asn Leu Cys Pro Phe Cys Gly Lys Cys Tyr His Pro Glu Leu Gln Lys
465 470 475 480
Asp Met Leu His Cys Asn Met Cys Lys Arg Trp Val His Leu Glu Cys
485 490 495
Asp Lys Pro Thr Asp His Glu Leu Asp Thr Gln Leu Lys Glu Glu Tyr
500 505 510
Ile Cys Met Tyr Cys Lys His Leu Gly Ala Glu Met Asp Arg Leu Gln
515 520 525
Pro Gly Glu Glu Val Glu Ile Ala Glu Leu Thr Thr Asp Tyr Asn Asn
530 535 540
Glu Met Glu Val Glu Gly Pro Glu Asp Gln Met Glu Thr Val Phe Ser
545 550 555 560
Glu Gln Ala Ala Asn Lys Asp Val Asn Gly Gln Glu Ser Thr Pro Gly
565 570 575
Ile Val Pro Asp Ala Val Gln Val His Thr Glu Glu Gln Gln Lys Ser
580 585 590
His Pro Ser Glu Ser Leu Asp Thr Asp Ser Leu Leu Ile Ala Val Ser
595 600 605
Ser Gln His Thr Val Asn Thr Glu Leu Glu Lys Gln Ile Ser Asn Glu
610 615 620
Val Asp Ser Glu Asp Leu Lys Met Ser Ser Glu Val Lys His Ile Cys
625 630 635 640
Gly Glu Asp Gln Ile Glu Asp Lys Met Glu Val Thr Glu Asn Ile Glu
645 650 655
Val Val Thr His Gln Ile Thr Val Gln Gln Glu Gln Leu Gln Leu Leu
660 665 670
Glu Glu Pro Glu Thr Val Val Ser Arg Glu Glu Ser Arg Pro Pro Lys
675 680 685
Leu Val Met Glu Ser Val Thr Leu Pro Leu Glu Thr Leu Val Ser Pro
690 695 700
His Glu Glu Ser Ile Ser Leu Cys Pro Glu Glu Gln Leu Val Ile Glu
705 710 715 720
Arg Leu Gln Gly Glu Lys Glu Gln Lys Glu Asn Ser Glu Leu Ser Thr
725 730 735
Gly Leu Met Asp Ser Glu Met Thr Pro Thr Ile Glu Gly Cys Val Lys
740 745 750
Asp Val Ser Tyr Gln Gly Gly Lys Ser Ile Lys Leu Ser Ser Glu Thr
755 760 765
Glu Ser Ser Phe Ser Ser Ser Ala Asp Ile Ser Lys Ala Asp Val Ser
770 775 780
Ser Ser Pro Thr Pro Ser Ser Asp Leu Pro Ser His Asp Met Leu His
785 790 795 800
Asn Tyr Pro Ser Ala Leu Ser Ser Ser Ala Gly Asn Ile Met Pro Thr
805 810 815
Thr Tyr Ile Ser Val Thr Pro Lys Ile Gly Met Gly Lys Pro Ala Ile
820 825 830
Thr Lys Arg Lys Phe Ser Pro Gly Arg Pro Arg Ser Lys Gln Gly Ala
835 840 845
Trp Ser Thr His Asn Thr Val Ser Pro Pro Ser Trp Ser Pro Asp Ile
850 855 860
Ser Glu Gly Arg Glu Ile Phe Lys Pro Arg Gln Leu Pro Gly Ser Ala
865 870 875 880
Ile Trp Ser Ile Lys Val Gly Arg Gly Ser Gly Phe Pro Gly Lys Arg
885 890 895
Arg Pro Arg Gly Ala Gly Leu Ser Gly Arg Gly Gly Arg Gly Arg Ser
900 905 910
Lys Leu Lys Ser Gly Ile Gly Ala Val Val Leu Pro Gly Val Ser Thr
915 920 925
Ala Asp Ile Ser Ser Asn Lys Asp Asp Glu Glu Asn Ser Met His Asn
930 935 940
Thr Val Val Leu Phe Ser Ser Ser Asp Lys Phe Thr Leu Asn Gln Asp
945 950 955 960
Met Cys Val Val Cys Gly Ser Phe Gly Gln Gly Ala Glu Gly Arg Leu
965 970 975
Leu Ala Cys Ser Gln Cys Gly Gln Cys Tyr His Pro Tyr Cys Val Ser
980 985 990
Ile Lys Ile Thr Lys Val Val Leu Ser Lys Gly Trp Arg Cys Leu Glu
995 1000 1005
Cys Thr Val Cys Glu Ala Cys Gly Lys Ala Thr Asp Pro Gly Arg
1010 1015 1020
Leu Leu Leu Cys Asp Asp Cys Asp Ile Ser Tyr His Thr Tyr Cys
1025 1030 1035
Leu Asp Pro Pro Leu Gln Thr Val Pro Lys Gly Gly Trp Lys Cys
1040 1045 1050
Lys Trp Cys Val Trp Cys Arg His Cys Gly Ala Thr Ser Ala Gly
1055 1060 1065
Leu Arg Cys Glu Trp Gln Asn Asn Tyr Thr Gln Cys Ala Pro Cys
1070 1075 1080
Ala Ser Leu Ser Ser Cys Pro Val Cys Tyr Arg Asn Tyr Arg Glu
1085 1090 1095
Glu Asp Leu Ile Leu Gln Cys Arg Gln Cys Asp Arg Trp Met His
1100 1105 1110
Ala Val Cys Gln Asn Leu Asn Thr Glu Glu Glu Val Glu Asn Val
1115 1120 1125
Ala Asp Ile Gly Phe Asp Cys Ser Met Cys Arg Pro Tyr Met Pro
1130 1135 1140
Ala Ser Asn Val Pro Ser Ser Asp Cys Cys Glu Ser Ser Leu Val
1145 1150 1155
Ala Gln Ile Val Thr Lys Val Lys Glu Leu Asp Pro Pro Lys Thr
1160 1165 1170
Tyr Thr Gln Asp Gly Val Cys Leu Thr Glu Ser Gly Met Thr Gln
1175 1180 1185
Leu Gln Ser Leu Thr Val Thr Val Pro Arg Arg Lys Arg Ser Lys
1190 1195 1200
Pro Lys Leu Lys Leu Lys Ile Ile Asn Gln Asn Ser Val Ala Val
1205 1210 1215
Leu Gln Thr Pro Pro Asp Ile Gln Ser Glu His Ser Arg Asp Gly
1220 1225 1230
Glu Met Asp Asp Ser Arg Ala Val Glu Asp Ser Glu Ser Gly Val
1235 1240 1245
Tyr Met Arg Phe Met Arg Ser His Lys Cys Tyr Asp Ile Val Pro
1250 1255 1260
Thr Ser Ser Lys Leu Val Val Phe Asp Thr Thr Leu Gln Val Lys
1265 1270 1275
Lys Ala Phe Phe Ala Leu Val Ala Asn Gly Val Arg Ala Ala Pro
1280 1285 1290
Leu Trp Glu Ser Lys Lys Gln Ser Phe Val Gly Met Leu Thr Ile
1295 1300 1305
Thr Asp Phe Ile Asn Ile Leu His Arg Tyr Tyr Lys Ser Pro Met
1310 1315 1320
Val Gln Ile Tyr Glu Leu Glu Glu His Lys Ile Glu Thr Trp Arg
1325 1330 1335
Glu Leu Tyr Leu Gln Glu Thr Phe Lys Pro Leu Val Asn Ile Ser
1340 1345 1350
Pro Asp Ala Ser Leu Phe Asp Ala Val Tyr Ser Leu Ile Lys Asn
1355 1360 1365
Lys Ile His Arg Leu Pro Val Ile Asp Pro Ile Ser Gly Asn Ala
1370 1375 1380
Leu Tyr Ile Leu Thr His Lys Arg Ile Leu Lys Phe Leu Gln Leu
1385 1390 1395
Phe Met Ser Asp Met Pro Lys Pro Ala Phe Met Lys Gln Asn Leu
1400 1405 1410
Asp Glu Leu Gly Ile Gly Thr Tyr His Asn Ile Ala Phe Ile His
1415 1420 1425
Pro Asp Thr Pro Ile Ile Lys Ala Leu Asn Ile Phe Val Glu Arg
1430 1435 1440
Arg Ile Ser Ala Leu Pro Val Val Asp Glu Ser Gly Lys Val Val
1445 1450 1455
Asp Ile Tyr Ser Lys Phe Asp Val Ile Asn Leu Ala Ala Glu Lys
1460 1465 1470
Thr Tyr Asn Asn Leu Asp Ile Thr Val Thr Gln Ala Leu Gln His
1475 1480 1485
Arg Ser Gln Tyr Phe Glu Gly Val Val Lys Cys Asn Lys Leu Glu
1490 1495 1500
Ile Leu Glu Thr Ile Val Asp Arg Ile Val Arg Ala Glu Val His
1505 1510 1515
Arg Leu Val Val Val Asn Glu Ala Asp Ser Ile Val Gly Ile Ile
1520 1525 1530
Ser Leu Ser Asp Ile Leu Gln Ala Leu Ile Leu Thr Pro Ala Gly
1535 1540 1545
Ala Lys Gln Lys Glu Thr Glu Thr Glu
1550 1555
<210> 127
<211> 2310
<212> DNA
<213> 智人
<400> 127
tgaggcgcgc cggctggttc aactccggcc gccgcgccga aaccagcagc ggtccgggtc 60
gaaccagcac cggcctcggg aggttccgcc gcctgctctg ccgctgttcc aactgccgct 120
gtagagccac tgggatgcgc accaccggca ggggttcgtc gggactgcgg accgtgaggc 180
cccgtcgcgg cgccaggagc aaccgagtca cgagggaaaa gagccgcacc ggccgcgtta 240
gagccatgtt tcccttagtg cgggagaagc gcacatcagt gacgtcacgg acgcgccgcg 300
acctcgcgta cggtggctgg cgaggctcag tacggtgtgt ggagctggag caccgtgagg 360
aagaagcgag gttcttttta agagttcagc tgcgagatat caaacaaaga attactctgt 420
acaaagccag aacacatata tcaaagtaat cctgaagtat cagaacaaaa taataggctg 480
taacagagga ggaaatgatt ttgaatagcc tctctctgtg ttaccataat aagctaatcc 540
tggccccaat ggttcgggta gggactcttc caatgaggct gctggccctg gattatggag 600
cggacattgt ttactgtgag gagctgatcg acctcaagat gattcagtgc aagagagttg 660
ttaatgaggt gctcagcaca gtggactttg tcgcccctga tgatcgagtt gtcttccgca 720
cctgtgaaag agagcagaac agggtggtct tccagatggg gacttcagac gcagagcgag 780
cccttgctgt ggccaggctt gtagaaaatg atgtggctgg tattgatgtc aacatgggct 840
gtccaaaaca atattccacc aagggaggaa tgggagctgc cctgctgtca gaccctgaca 900
agattgagaa gatcctcagc actcttgtta aagggacacg cagacctgtg acctgcaaga 960
ttcgcatcct gccatcgcta gaagataccc tgagccttgt gaagcggata gagaggactg 1020
gcattgctgc catcgcagtt catgggagga agcgggagga gcgacctcag catcctgtca 1080
gctgtgaagt catcaaagcc attgctgata ccctctccat tcctgtcata gccaacggag 1140
gatctcatga ccacatccaa cagtattcgg acatagagga ctttcgacaa gccacggcag 1200
cctcttccgt gatggtggcc cgagcagcca tgtggaaccc atctatcttc ctcaaggagg 1260
gtctgcggcc cctggaggag gtcatgcaga aatacatcag atacgcggtg cagtatgaca 1320
accactacac caacaccaag tactgcttgt gccagatgct acgagaacag ctggagtcgc 1380
cccagggaag gttgctccat gctgcccagt cttcccggga aatttgtgag gcctttggcc 1440
ttggtgcctt ctatgaggag accacacagg agctggatgc ccagcaggcc aggctctcag 1500
ccaagacttc agagcagaca ggggagccag ctgaagatac ctctggtgtc attaagatgg 1560
ctgtcaagtt tgaccggaga gcatacccag cccagatcac ccctaagatg tgcctactag 1620
agtggtgccg gagggagaag ttggcacagc ctgtgtatga aacggttcaa cgccctctag 1680
atcgcctgtt ctcctctatt gtcaccgttg ctgaacaaaa gtatcagtct accttgtggg 1740
acaagtccaa gaaactggcg gagcaggctg cagccatcgt ctgtctgcgg agccagggcc 1800
tccctgaggg tcggctgggt gaggagagcc cttccttgca caagcgaaag agggaggctc 1860
ctgaccaaga ccctgggggc cccagagctc aggagctagc acaacctggg gatctgtgca 1920
agaagccctt tgtggccttg ggaagtggtg aagaaagccc cctggaaggc tggtgactac 1980
tcttcctgcc ttagtcaccc ctccatgggc ctggtgctaa ggtggctgtg gatgccacag 2040
catgaaccag atgccgttga acagtttgct ggtcttgcct ggcagaagtt agatgtcctg 2100
gcaggggcca tcagcctaga gcatggacca ggggccgccc aggggtggat cctggcccct 2160
ttggtggatc tgagtgacag ggtcaagttc tctttgaaaa caggagcttt tcaggtggta 2220
actccccaac ctgacattgg tactgtgcaa taaagacacc ccctaccctc acccacggct 2280
ggctgcttca gccttgggca tcttcataaa 2310
<210> 128
<211> 493
<212> PRT
<213> 智人
<400> 128
Met Ile Leu Asn Ser Leu Ser Leu Cys Tyr His Asn Lys Leu Ile Leu
1 5 10 15
Ala Pro Met Val Arg Val Gly Thr Leu Pro Met Arg Leu Leu Ala Leu
20 25 30
Asp Tyr Gly Ala Asp Ile Val Tyr Cys Glu Glu Leu Ile Asp Leu Lys
35 40 45
Met Ile Gln Cys Lys Arg Val Val Asn Glu Val Leu Ser Thr Val Asp
50 55 60
Phe Val Ala Pro Asp Asp Arg Val Val Phe Arg Thr Cys Glu Arg Glu
65 70 75 80
Gln Asn Arg Val Val Phe Gln Met Gly Thr Ser Asp Ala Glu Arg Ala
85 90 95
Leu Ala Val Ala Arg Leu Val Glu Asn Asp Val Ala Gly Ile Asp Val
100 105 110
Asn Met Gly Cys Pro Lys Gln Tyr Ser Thr Lys Gly Gly Met Gly Ala
115 120 125
Ala Leu Leu Ser Asp Pro Asp Lys Ile Glu Lys Ile Leu Ser Thr Leu
130 135 140
Val Lys Gly Thr Arg Arg Pro Val Thr Cys Lys Ile Arg Ile Leu Pro
145 150 155 160
Ser Leu Glu Asp Thr Leu Ser Leu Val Lys Arg Ile Glu Arg Thr Gly
165 170 175
Ile Ala Ala Ile Ala Val His Gly Arg Lys Arg Glu Glu Arg Pro Gln
180 185 190
His Pro Val Ser Cys Glu Val Ile Lys Ala Ile Ala Asp Thr Leu Ser
195 200 205
Ile Pro Val Ile Ala Asn Gly Gly Ser His Asp His Ile Gln Gln Tyr
210 215 220
Ser Asp Ile Glu Asp Phe Arg Gln Ala Thr Ala Ala Ser Ser Val Met
225 230 235 240
Val Ala Arg Ala Ala Met Trp Asn Pro Ser Ile Phe Leu Lys Glu Gly
245 250 255
Leu Arg Pro Leu Glu Glu Val Met Gln Lys Tyr Ile Arg Tyr Ala Val
260 265 270
Gln Tyr Asp Asn His Tyr Thr Asn Thr Lys Tyr Cys Leu Cys Gln Met
275 280 285
Leu Arg Glu Gln Leu Glu Ser Pro Gln Gly Arg Leu Leu His Ala Ala
290 295 300
Gln Ser Ser Arg Glu Ile Cys Glu Ala Phe Gly Leu Gly Ala Phe Tyr
305 310 315 320
Glu Glu Thr Thr Gln Glu Leu Asp Ala Gln Gln Ala Arg Leu Ser Ala
325 330 335
Lys Thr Ser Glu Gln Thr Gly Glu Pro Ala Glu Asp Thr Ser Gly Val
340 345 350
Ile Lys Met Ala Val Lys Phe Asp Arg Arg Ala Tyr Pro Ala Gln Ile
355 360 365
Thr Pro Lys Met Cys Leu Leu Glu Trp Cys Arg Arg Glu Lys Leu Ala
370 375 380
Gln Pro Val Tyr Glu Thr Val Gln Arg Pro Leu Asp Arg Leu Phe Ser
385 390 395 400
Ser Ile Val Thr Val Ala Glu Gln Lys Tyr Gln Ser Thr Leu Trp Asp
405 410 415
Lys Ser Lys Lys Leu Ala Glu Gln Ala Ala Ala Ile Val Cys Leu Arg
420 425 430
Ser Gln Gly Leu Pro Glu Gly Arg Leu Gly Glu Glu Ser Pro Ser Leu
435 440 445
His Lys Arg Lys Arg Glu Ala Pro Asp Gln Asp Pro Gly Gly Pro Arg
450 455 460
Ala Gln Glu Leu Ala Gln Pro Gly Asp Leu Cys Lys Lys Pro Phe Val
465 470 475 480
Ala Leu Gly Ser Gly Glu Glu Ser Pro Leu Glu Gly Trp
485 490
<210> 129
<211> 3760
<212> DNA
<213> 智人
<400> 129
gagaatggcg gcggcggcgg cggcggcggc ggccgctgcc attgcccgga gatggccggc 60
agagccgccg agacgccgaa gagcccgccg cccgcgcgag gtgtagacgg ggcactgcct 120
tcagagcagg tcctgccagc ctcgctggag aggatgccct cgtgtccgtg atgggctgtg 180
ggacaagcaa ggtccttccc gagccaccca aggatgtcca gctggatctg gtcaagaagg 240
tggagccctt cagtggcact aagagtgacg tgtacaagca cttcatcaca gaggtggaca 300
gtgttggccc tgtcaaagcc gggttcccag cagcaagtca gtatgcacac ccctgccccg 360
gtcccccgac tgctggccac acggagcctc cctcagaacc accacgcagg gccagggtag 420
ctaagtacag ggccaagttt gacccacgtg ttacagctaa gtatgacatc aaggccctaa 480
ttggccgagg cagcttcagc cgagtggtac gtgtagagca ccgggcaacc cggcagccgt 540
atgccatcaa gatgattgag accaagtacc gggaggggcg ggaggtgtgt gagtcggagc 600
tgcgtgtgct gcgtcgggtg cgtcatgcca acatcatcca gctggtggag gtgttcgaga 660
cacaggagcg ggtgtacatg gtgatggagc tggccactgg tggagagctc tttgaccgca 720
tcattgccaa gggctccttc accgagcgtg acgccacgcg ggtgctgcag atggtgctgg 780
atggcgtccg gtatctgcat gcactgggca tcacacaccg agacctcaaa cctgagaatc 840
tgctctacta ccatccgggc actgactcca agatcatcat caccgacttc ggcctggcca 900
gtgctcgcaa gaagggtgat gactgcttga tgaagaccac ctgtggcacg cctgagtaca 960
ttgccccaga agtcctggtc cgcaagccat acaccaactc agtggacatg tgggcgctgg 1020
gcgtcattgc ctacatccta ctcagtggca ccatgccgtt tgaggatgac aaccgtaccc 1080
ggctgtaccg gcagatcctc aggggcaagt acagttactc tggggagccc tggcctagtg 1140
tgtccaacct ggccaaggac ttcattgacc gcctgctgac agtggaccct ggagcccgta 1200
tgactgcact gcaggccctg aggcacccgt gggtggtgag catggctgcc tcttcatcca 1260
tgaagaacct gcaccgctcc atatcccaga acctccttaa acgtgcctcc tcgcgctgcc 1320
agagcaccaa atctgcccag tccacgcgtt ccagccgctc cacacgctcc aataagtcac 1380
gccgtgtgcg ggaacgggag ctgcgggagc tcaacctgcg ctaccagcag caatacaatg 1440
gctgagccgc ctggctgtgc acacatgcag cacgacccag cctggccaca cactgtggtg 1500
ccatctgggt ccgatgccct ctctggagat aggcctatgt ggcccacagt aggtgaagaa 1560
tgtctggctc cagccctttc tctgtgcctt cagcagcccc tgtcctcacc atgggcctgg 1620
gccaggtgtg acagagtaga ggtagcacag ggggctgtga ctccccctga actgggagcc 1680
tggcctggca ctgatacccc tcttggtggg cagctgctct ggtggagttg ggaagggata 1740
ggacctggcc ttcactgtct cccttgccct ttgacttttc cccaatcaaa gggaactgca 1800
gtgctgggtg gagtgtcctg tggcctcagg accctttggg acagttactt ctgggacccc 1860
ctttcctcca cagagccctt ctccctggtt tcacacattc ccatgcatcc tgatccttaa 1920
gattatgctc cagtgggaga ccctggtagg cacaaagctt gtgccttgac tggacccgta 1980
gcccctggct aggtcgaaac agccctccac ctcccagcca agatctgtct tccttcatgg 2040
tgcctccagg gagccttcct ggtcccagga cctctggtgg agggccatgg cgtggacctt 2100
cacccttctg gactgtgtgg ccatgctggt catcggcttg cccaggctcc agcctctcca 2160
gattctgagg ggtctcagcc caccgccctt ggtgccttct ttgtagagcc caccgctacc 2220
tccctctccc cgttggatgt ccattccatt ccccaggtgc ctccttccca actgggggtg 2280
gttaaaggga gccccactgc tgctacctgg ggaatggggc acctgggggc caaggcagag 2340
ggaagggggt cctcccgatt agggtcgagt gtcagcctgg gttctatcct ttggtgcagc 2400
cccattgcct tttcccttca ggctctgttg ctccctcctc tgcagctgca cgaaggcgcc 2460
atctggtgtc tgcatgggtg ttggcagcct gggagtgatc actgcacgcc catcgtgcac 2520
acctgcccat cgtgcacacc cacccatggt gcacacctgt agtcctccat gaggacatgg 2580
gaaggtagga gttgccgccc tgggggaggg tcccgggctg ctcacctctc cccttctgct 2640
gagcttctgc gcacccctcc ctggaactta gccatactgt gtgacctgcc tctgaaacca 2700
gggtgccagg ggcactgcct tctcacagct ggccttgccc cgtccaccct gtgctgcttc 2760
ccttcacagc attaaccttc cagtctgggt cccactgagc ctcaagctgg aaggagcccc 2820
tgcgggaggt gggtggggtt gggtggctgc tttcccagag gcctgagcca gaaccatccc 2880
catttctttt gtggtatctc cccctaccac aaaccaggct ggaacccaag ccccttcctc 2940
cacagctgcc ttcagtgggt agaatggggc cagggcccag ctttggcctt agcttgacgg 3000
cagggcccct gccattgcag gagggtttgg ttcccactca gcttctgccg gtcggcagcc 3060
tgggccaggc ccttttcctg catgtgccac ctccagtggg aaacaaaact aaagagacca 3120
ctctgtgcca agtcgactat gccttagaca catcctccta ccgtccccaa tgccccctgg 3180
gcaggaggca gtggagaacc aagccccatg gcctcagaat ttccccccag ttccccaagt 3240
gtctctgggg acctgaagcc ctggggctta cgttctctct tgcccagggt gggcctggtc 3300
ctgagggcag gacagggggt ttggagatgt gggcctttga tagacccact tgggccttca 3360
tgccatggcc tgtggatgga gaatgtgcag ttatttatta tgcgtattca gtttgtaaac 3420
gtatcctctg tattcagtaa acaggctgcc tctccaggga gggctgccat tcattccaac 3480
agttctggct tcttgctgta ggaccaaggg gttgccctgg aggaggggtg ggggccccgg 3540
cctcggcatg gctactctag gaagagccac tgctactcaa ggagtcactc agccccttct 3600
gtgccagaag tccaagtagg gagtcggacc ctcaacagcc tcttctttct cctgagccag 3660
gaagacagac atgaatgcat gatgggacag ggcctgggtc tttaatgggt tgagctgggg 3720
agggcctgtg gtgagctcag ttgtaggcta tgacctggtt 3760
<210> 130
<211> 424
<212> PRT
<213> 智人
<400> 130
Met Gly Cys Gly Thr Ser Lys Val Leu Pro Glu Pro Pro Lys Asp Val
1 5 10 15
Gln Leu Asp Leu Val Lys Lys Val Glu Pro Phe Ser Gly Thr Lys Ser
20 25 30
Asp Val Tyr Lys His Phe Ile Thr Glu Val Asp Ser Val Gly Pro Val
35 40 45
Lys Ala Gly Phe Pro Ala Ala Ser Gln Tyr Ala His Pro Cys Pro Gly
50 55 60
Pro Pro Thr Ala Gly His Thr Glu Pro Pro Ser Glu Pro Pro Arg Arg
65 70 75 80
Ala Arg Val Ala Lys Tyr Arg Ala Lys Phe Asp Pro Arg Val Thr Ala
85 90 95
Lys Tyr Asp Ile Lys Ala Leu Ile Gly Arg Gly Ser Phe Ser Arg Val
100 105 110
Val Arg Val Glu His Arg Ala Thr Arg Gln Pro Tyr Ala Ile Lys Met
115 120 125
Ile Glu Thr Lys Tyr Arg Glu Gly Arg Glu Val Cys Glu Ser Glu Leu
130 135 140
Arg Val Leu Arg Arg Val Arg His Ala Asn Ile Ile Gln Leu Val Glu
145 150 155 160
Val Phe Glu Thr Gln Glu Arg Val Tyr Met Val Met Glu Leu Ala Thr
165 170 175
Gly Gly Glu Leu Phe Asp Arg Ile Ile Ala Lys Gly Ser Phe Thr Glu
180 185 190
Arg Asp Ala Thr Arg Val Leu Gln Met Val Leu Asp Gly Val Arg Tyr
195 200 205
Leu His Ala Leu Gly Ile Thr His Arg Asp Leu Lys Pro Glu Asn Leu
210 215 220
Leu Tyr Tyr His Pro Gly Thr Asp Ser Lys Ile Ile Ile Thr Asp Phe
225 230 235 240
Gly Leu Ala Ser Ala Arg Lys Lys Gly Asp Asp Cys Leu Met Lys Thr
245 250 255
Thr Cys Gly Thr Pro Glu Tyr Ile Ala Pro Glu Val Leu Val Arg Lys
260 265 270
Pro Tyr Thr Asn Ser Val Asp Met Trp Ala Leu Gly Val Ile Ala Tyr
275 280 285
Ile Leu Leu Ser Gly Thr Met Pro Phe Glu Asp Asp Asn Arg Thr Arg
290 295 300
Leu Tyr Arg Gln Ile Leu Arg Gly Lys Tyr Ser Tyr Ser Gly Glu Pro
305 310 315 320
Trp Pro Ser Val Ser Asn Leu Ala Lys Asp Phe Ile Asp Arg Leu Leu
325 330 335
Thr Val Asp Pro Gly Ala Arg Met Thr Ala Leu Gln Ala Leu Arg His
340 345 350
Pro Trp Val Val Ser Met Ala Ala Ser Ser Ser Met Lys Asn Leu His
355 360 365
Arg Ser Ile Ser Gln Asn Leu Leu Lys Arg Ala Ser Ser Arg Cys Gln
370 375 380
Ser Thr Lys Ser Ala Gln Ser Thr Arg Ser Ser Arg Ser Thr Arg Ser
385 390 395 400
Asn Lys Ser Arg Arg Val Arg Glu Arg Glu Leu Arg Glu Leu Asn Leu
405 410 415
Arg Tyr Gln Gln Gln Tyr Asn Gly
420
<210> 131
<211> 1899
<212> DNA
<213> 智人
<400> 131
atgattttga atagcctctc tctgtgttac cataataagc taatcctggc cccaatggtt 60
cgggtaggga ctcttccaat gaggctgctg gccctggatt atggagcgga cattgtttac 120
tgtgaggagc tgatcgacct caagatgatt cagtgcaaga gagttgttaa tgaggtgctc 180
agcacagtgg actttgtcgc ccctgatgat cgagttgtct tccgcacctg tgaaagagag 240
cagaacaggg tggtcttcca gatggggact tcagacgcag agcgagccct tgctgtggcc 300
aggcttgtag aaaatgatgt ggctggtatt gatgtcaaca tgggctgtcc aaaacaatat 360
tccaccaagg gaggaatggg agctgccctg ctgtcagacc ctgacaagat tgagaagatc 420
ctcagcactc ttgttaaagg gacacgcaga cctgtgacct gcaagattcg catcctgcca 480
tcgctagaag ataccctgag ccttgtgaag cggatagaga ggactggcat tgctgccatc 540
gcagttcatg ggaggtgtag acggggcact gccttcagag caggtcctgc cagcctcgct 600
ggagaggatg ccctcgtgtc cgtgatgggc tgtgggacaa gcaaggtcct tcccgagcca 660
cccaaggatg tccagctgga tctggtcaag aaggtggagc ccttcagtgg cactaagagt 720
gacgtgtaca agcacttcat cacagaggtg gacagtgttg gccctgtcaa agccgggttc 780
ccagcagcaa gtcagtatgc acacccctgc cccggtcccc cgactgctgg ccacacggag 840
cctccctcag aaccaccacg cagggccagg gtagctaagt acagggccaa gtttgaccca 900
cgtgttacag ctaagtatga catcaaggcc ctaattggcc gaggcagctt cagccgagtg 960
gtacgtgtag agcaccgggc aacccggcag ccgtatgcca tcaagatgat tgagaccaag 1020
taccgggagg ggcgggaggt gtgtgagtcg gagctgcgtg tgctgcgtcg ggtgcgtcat 1080
gccaacatca tccagctggt ggaggtgttc gagacacagg agcgggtgta catggtgatg 1140
gagctggcca ctggtggaga gctctttgac cgcatcattg ccaagggctc cttcaccgag 1200
cgtgacgcca cgcgggtgct gcagatggtg ctggatggcg tccggtatct gcatgcactg 1260
ggcatcacac accgagacct caaacctgag aatctgctct actaccatcc gggcactgac 1320
tccaagatca tcatcaccga cttcggcctg gccagtgctc gcaagaaggg tgatgactgc 1380
ttgatgaaga ccacctgtgg cacgcctgag tacattgccc cagaagtcct ggtccgcaag 1440
ccatacacca actcagtgga catgtgggcg ctgggcgtca ttgcctacat cctactcagt 1500
ggcaccatgc cgtttgagga tgacaaccgt acccggctgt accggcagat cctcaggggc 1560
aagtacagtt actctgggga gccctggcct agtgtgtcca acctggccaa ggacttcatt 1620
gaccgcctgc tgacagtgga ccctggagcc cgtatgactg cactgcaggc cctgaggcac 1680
ccgtgggtgg tgagcatggc tgcctcttca tccatgaaga acctgcaccg ctccatatcc 1740
cagaacctcc ttaaacgtgc ctcctcgcgc tgccagagca ccaaatctgc ccagtccacg 1800
cgttccagcc gctccacacg ctccaataag tcacgccgtg tgcgggaacg ggagctgcgg 1860
gagctcaacc tgcgctacca gcagcaatac aatggctga 1899
<210> 132
<211> 632
<212> PRT
<213> 智人
<400> 132
Met Ile Leu Asn Ser Leu Ser Leu Cys Tyr His Asn Lys Leu Ile Leu
1 5 10 15
Ala Pro Met Val Arg Val Gly Thr Leu Pro Met Arg Leu Leu Ala Leu
20 25 30
Asp Tyr Gly Ala Asp Ile Val Tyr Cys Glu Glu Leu Ile Asp Leu Lys
35 40 45
Met Ile Gln Cys Lys Arg Val Val Asn Glu Val Leu Ser Thr Val Asp
50 55 60
Phe Val Ala Pro Asp Asp Arg Val Val Phe Arg Thr Cys Glu Arg Glu
65 70 75 80
Gln Asn Arg Val Val Phe Gln Met Gly Thr Ser Asp Ala Glu Arg Ala
85 90 95
Leu Ala Val Ala Arg Leu Val Glu Asn Asp Val Ala Gly Ile Asp Val
100 105 110
Asn Met Gly Cys Pro Lys Gln Tyr Ser Thr Lys Gly Gly Met Gly Ala
115 120 125
Ala Leu Leu Ser Asp Pro Asp Lys Ile Glu Lys Ile Leu Ser Thr Leu
130 135 140
Val Lys Gly Thr Arg Arg Pro Val Thr Cys Lys Ile Arg Ile Leu Pro
145 150 155 160
Ser Leu Glu Asp Thr Leu Ser Leu Val Lys Arg Ile Glu Arg Thr Gly
165 170 175
Ile Ala Ala Ile Ala Val His Gly Arg Cys Arg Arg Gly Thr Ala Phe
180 185 190
Arg Ala Gly Pro Ala Ser Leu Ala Gly Glu Asp Ala Leu Val Ser Val
195 200 205
Met Gly Cys Gly Thr Ser Lys Val Leu Pro Glu Pro Pro Lys Asp Val
210 215 220
Gln Leu Asp Leu Val Lys Lys Val Glu Pro Phe Ser Gly Thr Lys Ser
225 230 235 240
Asp Val Tyr Lys His Phe Ile Thr Glu Val Asp Ser Val Gly Pro Val
245 250 255
Lys Ala Gly Phe Pro Ala Ala Ser Gln Tyr Ala His Pro Cys Pro Gly
260 265 270
Pro Pro Thr Ala Gly His Thr Glu Pro Pro Ser Glu Pro Pro Arg Arg
275 280 285
Ala Arg Val Ala Lys Tyr Arg Ala Lys Phe Asp Pro Arg Val Thr Ala
290 295 300
Lys Tyr Asp Ile Lys Ala Leu Ile Gly Arg Gly Ser Phe Ser Arg Val
305 310 315 320
Val Arg Val Glu His Arg Ala Thr Arg Gln Pro Tyr Ala Ile Lys Met
325 330 335
Ile Glu Thr Lys Tyr Arg Glu Gly Arg Glu Val Cys Glu Ser Glu Leu
340 345 350
Arg Val Leu Arg Arg Val Arg His Ala Asn Ile Ile Gln Leu Val Glu
355 360 365
Val Phe Glu Thr Gln Glu Arg Val Tyr Met Val Met Glu Leu Ala Thr
370 375 380
Gly Gly Glu Leu Phe Asp Arg Ile Ile Ala Lys Gly Ser Phe Thr Glu
385 390 395 400
Arg Asp Ala Thr Arg Val Leu Gln Met Val Leu Asp Gly Val Arg Tyr
405 410 415
Leu His Ala Leu Gly Ile Thr His Arg Asp Leu Lys Pro Glu Asn Leu
420 425 430
Leu Tyr Tyr His Pro Gly Thr Asp Ser Lys Ile Ile Ile Thr Asp Phe
435 440 445
Gly Leu Ala Ser Ala Arg Lys Lys Gly Asp Asp Cys Leu Met Lys Thr
450 455 460
Thr Cys Gly Thr Pro Glu Tyr Ile Ala Pro Glu Val Leu Val Arg Lys
465 470 475 480
Pro Tyr Thr Asn Ser Val Asp Met Trp Ala Leu Gly Val Ile Ala Tyr
485 490 495
Ile Leu Leu Ser Gly Thr Met Pro Phe Glu Asp Asp Asn Arg Thr Arg
500 505 510
Leu Tyr Arg Gln Ile Leu Arg Gly Lys Tyr Ser Tyr Ser Gly Glu Pro
515 520 525
Trp Pro Ser Val Ser Asn Leu Ala Lys Asp Phe Ile Asp Arg Leu Leu
530 535 540
Thr Val Asp Pro Gly Ala Arg Met Thr Ala Leu Gln Ala Leu Arg His
545 550 555 560
Pro Trp Val Val Ser Met Ala Ala Ser Ser Ser Met Lys Asn Leu His
565 570 575
Arg Ser Ile Ser Gln Asn Leu Leu Lys Arg Ala Ser Ser Arg Cys Gln
580 585 590
Ser Thr Lys Ser Ala Gln Ser Thr Arg Ser Ser Arg Ser Thr Arg Ser
595 600 605
Asn Lys Ser Arg Arg Val Arg Glu Arg Glu Leu Arg Glu Leu Asn Leu
610 615 620
Arg Tyr Gln Gln Gln Tyr Asn Gly
625 630
<210> 133
<211> 1609
<212> DNA
<213> 智人
<400> 133
atgattttga atagcctctc tctgtgttac cataataagc taatcctggc cccaatggtt 60
cgggtaggga ctcttccaat gaggctgctg gccctggatt atggagcgga cattgtttac 120
tgtgaggagc tgatcgacct caagatgatt cagtgcaaga gagttgttaa tgaggtgctc 180
agcacagtgg actttgtcgc ccctgatgat cgagttgtct tccgcacctg tgaaagagag 240
cagaacaggg tggtcttcca gatggtgtag acggggcact gccttcagag caggtcctgc 300
cagcctcgct ggagaggatg ccctcgtgtc cgtgatgggc tgtgggacaa gcaaggtcct 360
tcccgagcca cccaaggatg tccagctgga tctggtcaag aaggtggagc ccttcagtgg 420
cactaagagt gacgtgtaca agcacttcat cacagaggtg gacagtgttg gccctgtcaa 480
agccgggttc ccagcagcaa gtcagtatgc acacccctgc cccggtcccc cgactgctgg 540
ccacacggag cctccctcag aaccaccacg cagggccagg gtagctaagt acagggccaa 600
gtttgaccca cgtgttacag ctaagtatga catcaaggcc ctaattggcc gaggcagctt 660
cagccgagtg gtacgtgtag agcaccgggc aacccggcag ccgtatgcca tcaagatgat 720
tgagaccaag taccgggagg ggcgggaggt gtgtgagtcg gagctgcgtg tgctgcgtcg 780
ggtgcgtcat gccaacatca tccagctggt ggaggtgttc gagacacagg agcgggtgta 840
catggtgatg gagctggcca ctggtggaga gctctttgac cgcatcattg ccaagggctc 900
cttcaccgag cgtgacgcca cgcgggtgct gcagatggtg ctggatggcg tccggtatct 960
gcatgcactg ggcatcacac accgagacct caaacctgag aatctgctct actaccatcc 1020
gggcactgac tccaagatca tcatcaccga cttcggcctg gccagtgctc gcaagaaggg 1080
tgatgactgc ttgatgaaga ccacctgtgg cacgcctgag tacattgccc cagaagtcct 1140
ggtccgcaag ccatacacca actcagtgga catgtgggcg ctgggcgtca ttgcctacat 1200
cctactcagt ggcaccatgc cgtttgagga tgacaaccgt acccggctgt accggcagat 1260
cctcaggggc aagtacagtt actctgggga gccctggcct agtgtgtcca acctggccaa 1320
ggacttcatt gaccgcctgc tgacagtgga ccctggagcc cgtatgactg cactgcaggc 1380
cctgaggcac ccgtgggtgg tgagcatggc tgcctcttca tccatgaaga acctgcaccg 1440
ctccatatcc cagaacctcc ttaaacgtgc ctcctcgcgc tgccagagca ccaaatctgc 1500
ccagtccacg cgttccagcc gctccacacg ctccaataag tcacgccgtg tgcgggaacg 1560
ggagctgcgg gagctcaacc tgcgctacca gcagcaatac aatggctga 1609
<210> 134
<211> 89
<212> PRT
<213> 智人
<400> 134
Met Ile Leu Asn Ser Leu Ser Leu Cys Tyr His Asn Lys Leu Ile Leu
1 5 10 15
Ala Pro Met Val Arg Val Gly Thr Leu Pro Met Arg Leu Leu Ala Leu
20 25 30
Asp Tyr Gly Ala Asp Ile Val Tyr Cys Glu Glu Leu Ile Asp Leu Lys
35 40 45
Met Ile Gln Cys Lys Arg Val Val Asn Glu Val Leu Ser Thr Val Asp
50 55 60
Phe Val Ala Pro Asp Asp Arg Val Val Phe Arg Thr Cys Glu Arg Glu
65 70 75 80
Gln Asn Arg Val Val Phe Gln Met Val
85
<210> 135
<211> 2277
<212> DNA
<213> 智人
<400> 135
atggggctcc cagcgctcga gttcagcgac tgctgcctcg atagtccgca cttccgagag 60
acgctcaagt cgcacgaagc agagctggac aagaccaaca aattcatcaa ggagctcatc 120
aaggacggga agtcactcat aagcgcgctc aagaatttgt cttcagcgaa gcggaagttt 180
gcagattcct taaatgaatt taaatttcag tgcataggag atgcagaaac agatgatgag 240
atgtgtatag caagatcttt gcaggagttt gccactgtcc tcaggaatct tgaagatgaa 300
cggatacgga tgattgagaa tgccagcgag gtgctcatca ctcccttgga gaagtttcga 360
aaggaacaga tcggggctgc caaggaagcc aaaaagaagt atgacaaaga gacagaaaag 420
tattgtggca tcttagaaaa acacttgaat ttgtcttcca aaaagaaaga atctcagctt 480
caggaggcag acagccaagt ggacctggtc cggcagcatt tctatgaagt atccctggaa 540
tatgtcttca aggtgcagga agtccaagag agaaagatgt ttgagtttgt ggagcctctg 600
ctggccttcc tgcaaggact cttcactttc tatcaccatg gttacgaact ggccaaggat 660
ttcggggact tcaagacaca gttaaccatt agcatacaga acacaagaaa tcgctttgaa 720
ggcactagat cagaagtgga atcactgatg aaaaagatga aggagaatcc ccttgagcac 780
aagaccatca gtccctacac catggaggga tacctctacg tgcaggagaa acgtcacttt 840
ggaacttctt gggtgaagca ctactgtaca tatcaacggg attccaaaca aatcaccatg 900
gtaccatttg accaaaagtc aggaggaaaa gggggagaag atgaatcagt tatcctcaaa 960
tcctgcacac ggcggaaaac agactccatt gagaagaggt tttgctttga tgtggaagca 1020
gtagacaggc caggggttat caccatgcaa gctttgtcgg aagaggaccg gaggctctgg 1080
atggaagcca tggatggccg ggaacctgtc tacaactcga acaaagacag ccagagtgaa 1140
gggactgcgc agttggacag cattggcttc agcataatca ggaaatgcat ccatgctgtg 1200
gaaaccagag ggatcaacga gcaagggctg tatcgaattg tgggtgtcaa ctccagagtg 1260
cagaagttgc tgagtgtcct gatggacccc aagactgctt ctgagacaga aacagatatc 1320
tgtgctgaat gggagataaa gaccatcact agtgctctga agacctacct aagaatgctt 1380
ccaggaccac tcatgatgta ccagtttcaa agaagtttca tcaaagcagc aaaactggag 1440
aaccaggagt ctcgggtctc tgaaatccac agccttgttc atcggctccc agagaaaaat 1500
cggcagatgt tacagctgct catgaaccac ttggcaaatg ttgctaacaa ccacaagcag 1560
aatttgatga cggtggcaaa ccttggtgtg gtgtttggac ccactctgct gaggcctcag 1620
gaagaaacag tagcagccat catggacatc aaatttcaga acattgtcat tgagatccta 1680
atagaaaacc acgaaaagat atttaacacc gtgcccgata tgcctctcac caatgcccag 1740
ctgcacctgt ctcggaagaa gagcagtgac tccaagcccc cgtcctgcag cgagaggccc 1800
ctgacgctct tccacaccgt tcagtcaaca gagaaacagg aacaaaggaa cagcatcatc 1860
aactccagtt tggaatctgt ctcatcaaat ccaaacagca tccttaattc cagcagcagc 1920
ttacagccca acatgaactc cagtgaccca gacctggctg tggtcaaacc cacccggccc 1980
aactcactcc ccccgaatcc aagcccaact tcacccctct cgccatcttg gcccatgttc 2040
tcggcgccat ccagccctat gcccacctca tccacgtcca gcgactcatc ccccgtcagc 2100
acaccgttcc ggaaggcaaa agccttgtat gcctgcaaag ctgaacatga ctcagaactt 2160
tcgttcacag caggcacggt cttcgataac gttcacccat ctcaggagcc tggctggttg 2220
gaggggactc tgaacggaaa gactggcctc atccctgaga attacgtgga gttcctc 2277