技术领域
本发明涉及的是一种化学合成的单纯疱疹病毒1型(Herpes simplex virus 1,HSV1)糖蛋白B(glycoproteinB,gB糖蛋白)胞外区的全新基因片段,利用基因工程技术,制备重组HSV1病毒gB糖蛋白。通过计算机分析,筛选出含强抗原表位的HSV1病毒的gB糖蛋白胞外区片段,选择真核和原核生物均偏爱的密码子,化学合成全新的基因序列,利用基因工程技术表达,表达的蛋白可用于疫苗及HSV1病毒抗体或抗原的检测等,本发明涉及基因工程技术、疫苗和诊断试剂领域。
背景技术
单纯疱疹病毒(Herpes simplex virus,HSV)是危害人类健康的常见病原体,容易导致复发性感染和潜伏感染,对人体危害严重,分为HSV-1及HSV-2两型。1型单纯疱疹病毒主要引起口面部感染、眼部感染和疱疹性脑炎等;而2型主要引起生殖器感染,并且和女性宫颈癌的发生密切相关。近年HSV1的感染显著增多,约占该病的10%~40%。目前药物治疗HSV感染时常出现相应耐药性,所以研制疫苗是切实可行的有效方法,它能使机体在抗HSV感染免疫中,发挥体液免疫和细胞免疫功能来消除HSV感染。至今已研制了多种HSV疫苗,已有两种HSV糖蛋白疫苗进入III期临床试验,即Chiron公司研制的重组gD2/gB2糖蛋白与MF59佐剂配伍而成的疫苗以及Glaxo SmithKline(GSK)公司开发的重组gD2糖蛋白与另一佐剂(3-de-O-酰化单磷酸类脂A和明矾)配伍而成的疫苗,这两种疫苗均有一定的保护作用,但临床效果有限。国内目前对HSV疫苗的研究,主要集中在DNA核酸疫苗的探索研究方面。
HSV包膜糖蛋白在病毒的吸附、入侵和刺激机体产生免疫应答及疫苗研制中具有重要地位。目前正式命名的HSV包膜糖蛋白有12种,其中gB糖蛋白在感染的细胞中含量最多,是疱疹病毒家族中高度保守的蛋白,各毒株间差异较小,且与所有的疱疹类病毒亚群有较大的同源性,同时gB蛋白具有较强的免疫原性,可诱导机体产生中和抗体和细胞免疫反应,因此HSV1gB糖蛋白是构建HSV疫苗理想的目的基因。
HSV1gB糖蛋白基因全长2712bp,编码29个氨基酸序列的信号肽、696氨基酸序列的胞外区、69氨基酸序列的跨膜区、109氨基酸序列的胞内区。去除部分信号肽序列不影响其在原核细胞中的分泌表达。研究表明,为增强编码抗原蛋白的免疫原性,可去除其部分功能性编码序列,特别在完整抗原蛋白对宿主具有毒性或免疫抑制作用的情况下,对抗原蛋白进行截短表达就尤其重要。而截短后的gB基因构建的疫苗,与用完整gB基因构建的疫苗对HSV-1病毒攻击具有相同的保护作用。目前应用的HSV1病毒抗体的酶联免疫检测试剂盒,其使用的抗原是全病毒抗原,具有生产较危险、成本高、与其它病毒有交叉反应等缺点。目前缺乏HSV1病毒疫苗。灭活疫苗的研究取得了较大进展,但是生产成本高、危险大。
发明内容
本发明目的是针对上述不足之处提供一种化学合成的HSV1病毒gB糖蛋白胞外区的全新基因片段,利用基因工程技术,制备重组HSV1病毒gB糖蛋白胞外区片段。通过计算机分析,筛选出含强抗原表位的HSV1病毒gB糖蛋白胞外区片段,第1个氨基酸-第696个氨基酸,共696个氨基酸,选择真核和原核生物均偏爱的密码子,化学合成全新的基因序列,利用基因工程技术表达该基因。表达的蛋白可用于疫苗、HSV1病毒抗体或抗原的检测及用于免疫制备抗HSV1病毒单抗和多抗等。
化学合成的HSV1病毒gB糖蛋白胞外区基因片段及其表达、应用是采取以下方案实现的:
1.HSV1病毒gB糖蛋白胞外区抗原表位的筛选及其基因片段的化学合成:
利用ANTHEWIN等软件,通过计算机分析HSV1病毒gB糖蛋白的氨基酸序列,筛选出gB糖蛋白的N端(第1氨基酸-第696氨基酸)含有较强的抗原决定簇。选择真核和原核生物均偏爱的密码子,化学合成全新的基因序列,并且在5’端增加了BamHI酶切位点(下画线部分)及两个保护碱基(GC),在3’端增加了终止密码子(TGA)和EcoRI酶切位点(下画线部分)及两个保护碱基(GC),使该基因片段易于克隆至质粒pGEX4T-2内的BamHI和EcoRI酶切位点内。
筛选的HSV1病毒gB糖蛋白内的抗原表位氨基酸序列(第1个aa-第696个aa):
Pro Ser Ser Pro Gly Thr Pro Gly Val Ala Ala Ala Thr Gln Ala Ala Asn Gly
Gly Pro Ala Thr Pro Ala Pro Pro Ala Leu Gly Ala Ala Pro Thr Gly Asp Pro
Lys Pro Lys Lys Asn Lys Lys Pro Lys Asn Pro Thr Pro Pro Arg Pro Ala Gly
Asp Asn Ala Thr Val Ala Ala Gly His Ala Thr Leu Arg Glu His Leu Arg Asp
Ile Lys Ala Glu Asn Thr Asp Ala Asn Phe Tyr Val Cys Pro Pro Pro Thr Gly
Ala Thr Val Val Gln Phe Glu Gln Pro Arg Arg Cys Pro Thr Arg Pro Glu Gly
Gln Asn Tyr Thr Glu Gly Ile Ala Val Val Phe Lys Glu Asn Ile Ala Pro Tyr
Lys Phe Lys Ala Thr Met Tyr Tyr Lys Asp Val Thr Val Ser Gln Val Trp Phe
Gly His Arg Tyr Ser Gln Phe Met Gly Ile Phe Glu Asp Arg Ala Pro Val Pro
Phe Glu Glu Val Ile Asp Lys Ile Asn Ala Lys Gly Val Cys Arg Ser Thr Ala
Lys Tyr Val Arg Asn Asn Leu Glu Thr Thr Ala Phe His Arg Asp Asp His Glu
Thr Asp Met Glu Leu Lys Pro Ala Asn Ala Ala Thr Arg Thr Ser Arg Gly Trp
His Thr Thr Asp Leu Lys Tyr Asn Pro Ser Arg Val Glu Ala Phe His Arg Tyr
Gly Thr Thr Val Asn Cys Ile Val Glu Glu Val Asp Ala Arg Ser Val Tyr Pro
Tyr Asp Glu Phe Val Leu Ala Thr Gly Asp Phe Val Tyr Met Ser Pro Phe Tyr
Gly Tyr Arg Glu Gly Ser His Thr Glu His Thr Ser Tyr Ala Ala Asp Arg Phe
Lys Gln Val Asp Gly Phe Tyr Ala Arg Asp Leu Thr Thr Lys Ala Arg Ala Thr
Ala Pro Thr Thr Arg Asn Leu Leu Thr Thr Pro Lys Phe Thr Val Ala Trp Asp
Trp Val Pro Lys Arg Pro Ser Val Cys Thr Met Thr Lys Trp Gln Glu Val Asp
Glu Met Leu Arg Ser Glu Tyr Gly Gly Ser Phe Arg Phe Ser Ser Asp Ala Ile
Ser Thr Thr Phe Thr Thr Asn Leu Thr Glu Tyr Pro Leu Ser Arg Val Asp Leu
Gly Asp Cys Ile Gly Lys Asp Ala Arg Asp Ala Met Asp Arg Ile Phe Ala Arg
Arg Tyr Asn Ala Thr His Ile Lys Val Gly Gln Pro Gln Tyr Tyr Leu Ala Asn
Gly Gly Phe Leu Ile Ala Tyr Gln Pro Leu Leu Ser Asn Thr Leu Ala Glu Leu
Tyr Val Arg Glu His Leu Arg Glu Gln Ser Pro Lys Pro Pro Asn Pro Thr Pro
Pro Pro Pro Gly Ala Ser Ala Asn Ala Ser Val Glu Arg Ile Lys Thr Thr Ser
Ser Ile Glu Phe Ala Arg Leu Gln Phe Thr Tyr Asn His Ile Gln Asn His Val
Asn Asp Met Leu Gly Arg Val Ala Ile Ala Trp Cys Glu Leu Gln Asn His Glu
Leu Thr Leu Trp Asn Glu Ala Arg Lys Leu Asn Pro Asn Ala Ile Ala Ser Ala
Thr Val Gly Arg Arg Val Ser Ala Arg Met Leu Gly Asp Val Met Ala Val Ser
Thr Cys Val Pro Val Ala Ala Asp Asn Val Ile Val Gln Asn Ser Met Arg Ile
Ser Ser Arg Pro Gly Ala Cys Tyr Ser Arg Pro Leu Val Ser Phe Arg Tyr Glu
Asp Gln Gly Pro Leu Val Glu Gly Gln Leu Gly Glu Asn Asn Glu Leu Arg Leu
Thr Arg Asp Ala Ile Glu Pro Cys Thr Val Gly His Arg Arg Tyr Phe Thr Phe
Gly Gly GlY Tyr Val Tyr Phe Glu Glu Ser Ala Tyr Ser His Gln Leu Ser Arg
Ala Asp Ile Thr Thr Val Ser Thr Phe Ile Asp Leu Asn Ile Thr Met Leu Glu
Asp His Glu Phe Val Pro Leu Glu Val Tyr Thr Arg His Glu Ile Lys Asp Ser
Gly Leu Leu Asp Tyr Thr Glu Val Gln Arg Arg Asn Gln Leu His Asp Leu Arg
Phe Ala Asp Ile Asp Thr Val Ile His Ala Asp Ala
化学合成的含HSV1病毒gB糖蛋白胞外区抗原表位基因的DNA序列(2107bp):
GCGGATCC CCT ACC TCT CCT GGT ACT CCT GGT GTG GCT GCC GCT ACC CAG GCC GCT AAC
GGT GGT CCT GCC ACT CCT GCT CCT CCT CCT CTG GGT GCC GCT CCT ACT GGT GAC CCT
AAG CCT AAG AAG AAC AAG AAG CCT AAG AAC CCT ACT CCT CCT AGG CCT GCT GGT GAC
AAC GCC ACC GTG GCC GCT GGC CAC GCC ACT CTG AGA GAG CAC CTG AGA GAC ATC AAG
GCT GAG AAC ACC GAC GCT AAC TTC TAC GTG TGT CCT CCT CCT ACT GGT GCC ACC GTG
GTG CAG TTC GAG CAG CCT CGC AGG TGC CCT ACC AGG CCT GAA GGT CAG AAC TAC ACC
GAA GGC ATC GCC GTG GTG TTC AAG GAG AAC ATC GCT CCT TAC AAG TTC AAG GCC ACC
ATG TAC TAC AAG GAC GTG ACC GTG AGC CAG GTG TGG TTC GGC CAC CGC TAC TCC CAG
TTC ATG GGC ATC TTC GAG GAC CGC GCT CCT GTG CCT TTC GAG GAG GTG ATC GAC AAG
ATC AAC GCC AAG GGT GTG TGT AGG TCC ACC GCT AAG TAC GTG CGC AAC AAC CTG GAG
ACC ACT GCT TTC CAC AGA GAC GAT CAC GAG ACC GAC ATG GAG CTG AAG CCT GCC AAC
GCC GCT ACT CGC ACC AGC AGA GGC TGG CAC ACC ACT GAC CTG AAG TAC AAC CCT AGC
AGA GTG GAG GCC TTC CAC AGG TAC GGC ACC ACC GTG AAC TGC ATC GTG GAG GAG GTG
GAC GCT CGC AGC GTG TAC CCT TAC GAC GAG TTC GTG CTG GCC ACT GGT GAC TTC GTG
TAC ATG TCT CCT TTC TAC GGC TAC AGA GAA GGT AGC CAC ACC GAG CAC ACT ACC TAC
GCT GCT GAC AGG TTC AAG CAG GTG GAC GGC TTC TAC GCT CGC GAC CTG ACC ACC AAG
GCT AGA GCC ACT GCT CCT ACC ACT AGG AAC CTG CTG ACC ACT CCT AAG TTC ACC GTG
GCT TGG GAC TGG GTG CCT AAG CGC CCT AGC GTG TGC ACC ATG ACC AAG TGG CAG GAG
GTG GAC GAG ATG CTG CGC TCC GAG TAC GGC GGT TCC TTC AGG TTC TCC TCT GAC GCT
ATC TCC ACT ACC TTC ACT ACC AAC CTG ACC GAG TAC CCT CTG TCC AGA GTG GAC CTG
GGT GAC TGC ATC GGT AAG GAC GCT CGC GAC GCC ATG GAC CGC ATC TTC GCT CGC AGG
TAC AAC GCT ACT CAC ATC AAG GTG GGC CAG CCT CAG TAC TAC CAG GCC AAC GGT GGT
TTC CTG ATC GCC TAC CAG CCT CTG CTG AGC AAC ACT CTG GCT GAG CTG TAC GTG AGA
GAG CAC CTG AGA GAG CAG AGC CGC AAG CCT CCT AAC CCT ACG CCT CCT CCT CCC GGT
GCT AGC GCC AAC GCT TCC GTG GAG CGC ATC AAG ACT ACC TCT AGC ATC GAG TTC GCC
AGG CTG CAG TTC ACC TAC AAC CAC ATC CAG CGC CAC GTG AAC GAC ATG CTG GGT CGC
GTG GCT ATC GCT TGG TGC GAG CTG CAG AAC CAC GAG CTG ACT CTG TGG AAC GAG GCT
CGC AAG CTG AAC CCT AAC GCT ATC GCC AGC GTG ACC GTG GGC AGG AGA GTG AGC GCT
AGA ATG CTG GGC GAC GTG ATG GCC GTG TCC ACC TGC GTG CCT GTG GCT GCT GAC AAC
GTG ATC GTG CAG AAC AGC ATG CGC ATC AGC TCC AGA CCT GGT GCC TGC TAC AGC AGA
CCT CTG GTG AGC TTC AGG TAC GAG GAC CAA GGT CCT CTG GTG GAA GGT CAG CTG GGT
GAG AAC AAC GAG CTG AGG CTG ACT CGC GAC GCT ATC GAG CCT TGC ACC GTC GGT CAC
AGA CGC TAC TTC ACC TTC GGT GGC GGT TAC GTG TAC TTC GAG GAG TAC GCT TAC TCT
CAC CAG CTG AGC CGC GCT GAC ATC ACT ACC GTG AGC ACC TTC ATC GAC CTG AAC ATC
ACC ATG CTG GAG GAC CAC GAG TTC GTG CCT CTG GAG GTG TAC ACC CGC CAC GAG ATC
AAG GAC AGC GGC CTG CTG GAC TAC ACC GAG GTG CAG CGC CGC AAC CAG CTG CAC GAC
CTG CGC TTC GCT GAC ATC GAC ACC GTG ATC CAC GCC GAC GCC TAA GAATTCGC
2.表达HSV1病毒gB糖蛋白胞外区片段重组质粒的构建:
提取质粒pGEX4T-2,用Bam H I和EcoRI双酶切,电泳后回收酶切的质粒大片段,溶于去离子水内。同样用BamHI和EcoR I双酶切化学合成的HSV1病毒gB糖蛋白基因片段,电泳回收后,溶于去离子水内。
取等摩尔浓度的上述两种酶切后DNA片段,在同一离心管内用T4 DNA连接酶连接,使HSV1病毒gB糖蛋白基因片段插入到载体pGEX4T-2内的BamH I和EcoR I位点之间,与载体上的起始密码子翻译框架一致,表达一个融合蛋白。
3.重组质粒的筛选与鉴定:
将重组质粒转化大肠杆菌TG1,涂布含氨苄青霉素(100μg/ml)LB平板,置37℃过夜。次日随机挑取转化菌落和1个对照菌(质粒pGEX4T-2转化菌),分别提取质粒,以提取的质粒为模板,PCR扩增HSV1病毒gB糖蛋白胞外区基因片段,含HSV1病毒gB糖蛋白胞外区基因片段的阳性重组质粒,应扩增出长约2107bp的基因片段。将含有外源基因的质粒进行DNA序列分析,序列分析证实重组质粒含有合成的HSV1病毒gB糖蛋白基因片段,序列完全正确:
CCT ACC TCT CCT GGT ACT CCT GGT GTG GCT GCC GCT ACC CAG GCC GCT AAC GGT GGT
CCT GCC ACT CCT GCT CCT CCT CCT CTG GGT GCC GCT CCT ACT GGT GAC CCT AAG CCT
AAG AAG AAC AAG AAG CCT AAG AAC CCT ACT CCT CCT AGG CCT GCT GGT GAC AAC GCC
ACC GTG GCC GCT GGC CAC GCC ACT CTG AGA GAG CAC CTG AGA GAC ATC AAG GCT GAG
AAC ACC GAC GCT AAC TTC TAC GTG TGT CCT CCT CCT ACT GGT GCC ACC GTG GTG CAG
TTC GAG CAG CCT CGC AGG TGC CCT ACC AGG CCT GAA GGT CAG AAC TAC ACC GAA GGC
ATC GCC GTG GTG TTC AAG GAG AAC ATC GCT CCT TAC AAG TTC AAG GCC ACC ATG TAC
TAC AAG GAC GTG ACC GTG AGC CAG GTG TGG TTC GGC CAC CGC TAC TCC CAG TTC ATG
GGC ATC TTC GAG GAC CGC GCT CCT GTG CCT TTC GAG GAG GTG ATC GAC AAG ATC AAC
GCC AAG GGT GTG TGT AGG TCC ACC GCT AAG TAC GTG CGC AAC AAC CTG GAG ACC ACT
GCT TTC CAC AGA GAC GAT CAC GAG ACC GAC ATG GAG CTG AAG CCT GCC AAC GCC GCT
ACT CGC ACC AGC AGA GGC TGG CAC ACC ACT GAC CTG AAG TAC AAC CCT AGC AGA GTG
GAG GCC TTC CAC AGG TAC GGC ACC ACC GTG AAC TGC ATC GTG GAG GAG GTG GAC GCT
CGC AGC GTG TAC CCT TAC GAC GAG TTC GTG CTG GCC ACT GGT GAC TTC GTG TAC ATG
TCT CCT TTC TAC GGC TAC AGA GAA GGT AGC CAC ACC GAG CAC ACT ACC TAC GCT GCT
GAC AGG TTC AAG CAG GTG GAC GGC TTC TAC GCT CGC GAC CTG ACC ACC AAG GCT AGA
GCC ACT GCT CCT ACC ACT AGG AAC CTG CTG ACC ACT CCT AAG TTC ACC GTG GCT TGG
GAC TGG GTG CCT AAG CGC CCT AGC GTG TGC ACC ATG ACC AAG TGG CAG GAG GTG GAC
GAG ATG CTG CGC TCC GAG TAC GGC GGT TCC TTC AGG TTC TCC TCT GAC GCT ATC TCC
ACT ACC TTC ACT ACC AAC CTG ACC GAG TAC CCT CTG TCC AGA GTG GAC CTG GGT GAC
TGC ATC GGT AAG GAC GCT CGC GAC GCC ATG GAC CGC ATC TTC GCT CGC AGG TAC AAC
GCT ACT CAC ATC AAG GTG GGC CAG CCT CAG TAC TAC CAG GCC AAC GGT GGT TTC CTG
ATC GCC TAC CAG CCT CTG CTG AGC AAC ACT CTG GCT GAG CTG TAC GTG AGA GAG CAC
CTG AGA GAG CAG AGC CGC AAG CCT CCT AAC CCT ACG CCT CCT CCT CCC GGT GCT AGC
GCC AAC GCT TCC GTG GAG CGC ATC AAG ACT ACC TCT AGC ATC GAG TTC GCC AGG CTG
CAG TTC ACC TAC AAC CAC ATC CAG CGC CAC GTG AAC GAC ATG CTG GGT CGC GTG GCT
ATC GCT TGG TGC GAG CTG CAG AAC CAC GAG CTG ACT CTG TGG AAC GAG GCT CGC AAG
CTG AAC CCT AAC GCT ATC GCC AGC GTG ACC GTG GGC AGG AGA GTG AGC GCT AGA ATG
CTG GGC GAC GTG ATG GCC GTG TCC ACC TGC GTG CCT GTG GCT GCT GAC AAC GTG ATC
GTG CAG AAC AGC ATG CGC ATC AGC TCC AGA CCT GGT GCC TGC TAC AGC AGA CCT CTG
GTG AGC TTC AGG TAC GAG GAC CAA GGT CCT CTG GTG GAA GGT CAG CTG GGT GAG AAC
AAC GAG CTG AGG CTG ACT CGC GAC GCT ATC GAG CCT TGC ACC GTC GGT CAC AGA CGC
TAC TTC ACC TTC GGT GGC GGT TAC GTG TAC TTC GAG GAG TAC GCT TAC TCT CAC CAG
CTG AGC CGC GCT GAC ATC ACT ACC GTG AGC ACC TTC ATC GAC CTG AAC ATC ACC ATG
CTG GAG GAC CAC GAG TTC GTG CCT CTG GAG GTG TAC ACC CGC CAC GAG ATC AAG GAC
AGC GGC CTG CTG GAC TAC ACC GAG GTG CAG CGC CGC AAC CAG CTG CAC GAC CTG CGC
TTC GCT GAC ATC GAC ACC GTG ATC CAC GCC GAC GCC TGA
构建的重组质粒表达HSV1的病毒gB糖蛋白胞外区片段(696个氨基酸),在其N端融合了载体上的238个氨基酸,全长934个氨基酸,其氨基酸序列如下:
Met Pro Met Ile Leu Gly Tyr Trp Asp Ile Arg Gly Leu Ala His Ala Ile Arg
Leu Leu Leu Glu Tyr Thr Asp Ser Ser Tyr Glu Glu Lys Lys Tyr Thr Met Gly
Gly Ala Pro Asp Tyr Asp Arg Ser Gln Trp Leu Asn Glu Lys Phe Lys Leu Gly
Leu Asp Phe Pro Asn Leu Pro Tyr Leu Ile Asp Gly Ala His Lys Ile Thr Gln
Ser Asn Ala Ile Leu Cys Tyr Ile Ala Arg Lys His Asn Leu Cys Gly Glu Thr
Glu Glu Glu Lys Ile Arg Val Asp Ile Leu Glu Asn Gln Ala Met Asp Val Ser
Asn Gln Leu Ala Arg Val Cys Tyr Ser Pro Asp Phe Glu Lys Leu Lys Pro Glu
Tyr Leu Glu Glu Leu Pro Thr Met Met Gln His Phe Ser Gln Phe Leu Gly Lys
Arg Pro Trp Phe Val Gly Asp Lys Ile Thr Phe Val Asp Phe Leu Ala Tyr Asp
Val Leu Asp Leu His Arg Ile Phe Glu Pro Asn Cys Leu Asp Ala Phe Pro Asn
Leu Lys Asp Phe Ile Ser Arg Phe Glu Gly Leu Glu Lys Ile Ser Ala Tyr Met
Lys Ser Ser Arg Phe Leu Pro Lys Pro Leu Tyr Thr Arg Met Ala Val Trp Gly
Asn Lys Pro Ser Ser Pro Gly Thr Pro Gly Val Ala Ala Ala Thr Gln Ala Ala
Asn Gly Gly Pro Ala Thr Pro Ala Pro Pro Ala Leu Gly Ala Ala Pro Thr Gly
Asp Pro Lys Pro Lys Lys Asn Lys Lys Pro Lys Asn Pro Thr Pro Pro Arg Pro
Ala Gly Asp Asn Ala Thr Val Ala Ala Gly His Ala Thr Leu Arg Glu His Leu
Arg Asp Ile Lys Ala Glu Asn Thr Asp Ala Asn Phe Tyr Val Cys Pro Pro Pro
Thr Gly Ala Thr Val Val Gln Phe Glu Gln Pro Arg Arg Cys Pro Thr Arg Pro
Glu Gly Gln Asn Tyr Thr Glu Gly Ile Ala Val Val Phe Lys Glu Asn Ile Ala
Pro Tyr Lys Phe Lys Ala Thr Met Tyr Tyr Lys Asp Val Thr Val Ser Gln Val
Trp Phe Gly His Arg Tyr Ser Gln Phe Met Gly Ile Phe Glu Asp Arg Ala Pro
Val Pro Phe Glu Glu Val Ile Asp Lys Ile Asn Ala Lys Gly Val Cys Arg Ser
Thr Ala Lys Tyr Val Arg Asn Asn Leu Glu Thr Thr Ala Phe His Arg Asp Asp
His Glu Thr Asp Met Glu Leu Lys Pro Ala Asn Ala Ala Thr Arg Thr Ser Arg
Gly Trp His Thr Thr Asp Leu Lys Tyr Asn Pro Ser Arg Val Glu Ala Phe His
Arg Tyr Gly Thr Thr Val Asn Cys Ile Val Glu Glu Val Asp Ala Arg Ser Val
Tyr Pro Tyr Asp Glu Phe Val Leu Ala Thr GlY Asp Phe Val Tyr Met Ser Pro
Phe Tyr Gly Tyr Arg Glu Gly Ser His Thr Glu His Thr Ser Tyr Ala Ala Asp
Arg Phe Lys Gln Val Asp Gly Phe Tyr Ala Arg Asp Leu Thr Thr Lys Ala Arg
Ala Thr Ala Pro Thr Thr Arg Asn Leu Leu Thr Thr Pro Lys Phe Thr Val Ala
Trp Asp Trp Val Pro Lys Arg Pro Ser Val Cys Thr Met Thr Lys Trp Gln Glu
Val Asp Glu Met Leu Arg Ser Glu Tyr Gly Gly Ser Phe Arg Phe Ser Ser Asp
Ala Ile Ser Thr Thr Phe Thr Thr Asn Leu Thr Glu Tyr Pro Leu Ser Arg Val
Asp Leu Gly Asp Cys Ile Gly Lys Asp Ala Arg Asp Ala Met Asp Arg Ile Phe
Ala Arg Arg Tyr Asn Ala Thr His Ile Lys Val Gly Gln Pro Gln Tyr Tyr Leu
Ala Asn Gly Gly Phe Leu Ile Ala Tyr Gln Pro Leu Leu Ser Asn Thr Leu Ala
Glu Leu Tyr Val Arg Glu His Leu Arg Glu Gln Ser Pro Lys Pro Pro Asn Pro
Thr Pro Pro Pro Pro Gly Ala Ser Ala Asn Ala Ser Val Glu Arg Ile Lys Thr
Thr Ser Ser Ile Glu Phe Ala Arg Leu Gln Phe Thr Tyr Asn His Ile Gln Asn
His Val Asn Asp Met Leu Gly Arg Val Ala Ile Ala Trp Cys Glu Leu Gln Asn
His Glu Leu Thr Leu Trp Asn Glu Ala Arg Lys Leu Asn Pro Asn Ala Ile Ala
Ser Ala Thr Val Gly Arg Arg Val Ser Ala Arg Met Leu Gly Asp Val Met Ala
Val Ser Thr Cys Val Pro Val Ala Ala Asp Asn Val Ile Val Gln Asn Ser Met
Arg Ile Ser Ser Arg Pro Gly Ala Cys Tyr Ser Arg Pro Leu Val Ser Phe Arg
Tyr Glu Asp Gln Gly Pro Leu Val Glu Gly Gln Leu Gly Glu Asn Asn Glu Leu
Arg Leu Thr Arg Asp Ala Ile Glu Pro Cys Thr Val GlY His Arg Arg Tyr Phe
Thr Phe Gly Gly Gly Tyr Val Tyr Phe Glu Glu Ser Ala Tyr Ser His Gln Leu
Ser Arg Ala Asp Ile Thr Thr Val Ser Thr Phe Ile Asp Leu Asn Ile Thr Met
Leu Glu Asp His Glu Phe Val Pro Leu Glu Val Tyr Thr Arg His Glu Ile Lys
Asp Ser Gly Leu Leu Asp Tyr Thr Glu Val Gln Arg Arg Asn Gln Leu His Asp
Leu Arg Phe Ala Asp Ile Asp Thr Val Ile His Ala Asp Ala
4.表达融合蛋白工程菌的筛选鉴定:
将重组质粒转化大肠杆菌TG1,涂布含100μg/ml氨苄青霉素的LB平板,置37℃过夜,次日随机挑取转化菌落和含质粒pGEX4T-2的对照菌,提取质粒,以提取的质粒为模板,PCR扩增HSV1病毒的gB糖蛋白胞外区基因片段,含HSV1病毒的gB糖蛋白胞外区基因片段的阳性重组质粒,应扩增出长约2107bp的基因片段,将含有重组质粒的阳性转化子,接种至含氨苄青霉素100μg/mL的LB培养基内,37℃振荡培养3h,加IPTG至终浓度0.5~1.0mmol/L,继续振荡培养诱导4~6h,离心收集菌体进行SDS-PAGE检测,重组子表达相对分子量约为96kD的HSV病毒gB糖蛋白,表达量约为30%,而对照菌TG1无此蛋白带;
5.表达HSV1病毒gB糖蛋白的纯化:
1)表达HSV1病毒GB糖蛋白工程菌的超声裂解
将诱导表达融合蛋白的工程菌离心(8000rpm、20min、4℃)收菌,菌体重悬于原培养液1/10体积的裂解液内,裂解液为50mmol/L Tris-HCl pH8.0、10mmol/LEDTA、10mmol/L DTT,冰浴超声破菌10min,离心(8000rpm、20min、4℃)收集上清,弃沉淀。收集的上清用于下一步的亲和层析纯化。
2)表达HSV1病毒gB糖蛋白的纯化
上清溶液加已经平衡的Glutathione Sepharose 4B凝胶3ml,室温结合60min,上样,收集穿透液。用十倍柱床体积的1×PBS洗涤柱子,接着用15ml高浓度的GSH洗脱液,洗脱液为50mmol/LTris-HCL pH8.0+10mmol/LGST,分三次洗脱目的蛋白,即为纯化的HSV1病毒gB糖蛋白胞外区片段。
6.纯化的HSV1病毒gB糖蛋白的ELISA检测:
将初步纯化获得的融合蛋白按1∶1000~1∶32000倍比稀释,并将阴性对照以同样浓度倍比稀释,用Human HSV-1 ELISA试剂盒检测表达蛋白的抗原性(具体操作方法见试剂盒说明书)。结果显示(表1)此融合表达蛋白具有较好的抗原性和特异性。
7.将表达的HSV1病毒gB糖蛋白片段,用于疫苗、HSV1病毒抗体或抗原的检测及用于免疫制备抗HSV1病毒单抗和多抗等。
8.将合成的HSV1病毒gB糖蛋白基因片段与其它基因片段连接,以融合蛋白的形式进行表达、制备。
上述方法制备的HSV1病毒gB糖蛋白胞外区基因片段在制备HSV1病毒亚单位疫苗中的应用。
化学合成的HSV1病毒gB糖蛋白胞外区基因片段,可利用细菌、酵母细胞、昆虫细胞、哺乳动物细胞及转基因动植物进行重组表达、制备。
本发明与现有技术相比具有的优点:
本发明表达的HSV1病毒的gB糖蛋白片段,有较多优点:
1.本发明表达的HSV1病毒gB糖蛋白胞外区片段用作抗原制备HSV1病毒抗体的酶联免疫检测试剂盒,具有生产较安全、成本低、与其它病毒交叉反应少等优点。
2.HSV1病毒的gB糖蛋白在感染的细胞中含量最多,是疱疹病毒家族中高度保守的蛋白,各毒株间差异较小,且与所有的疱疹类病毒亚群有较大的同源性,同时gB蛋白具有较强的免疫原性,可诱导机体产生中和抗体和细胞免疫反应,因此HSV1gB糖蛋白是构建HSV疫苗理想的目的基因。本发明选择了其强抗原表位,利用基因工程技术表达制备,为研制基因工程疫苗奠定基础。基因工程疫苗安全、成本低。
3.本发明根据筛选出的HSV1病毒的gB糖蛋白胞外区片段氨基酸序列,选择真核和原核生物均偏爱的密码子,化学合成全新的基因序列,该基因适宜在真核和原核细胞内高表达。
4.本发明构建的表达HSV1病毒的gB糖蛋白的工程菌,表达量可达菌体蛋白的30%,易于纯化,可大量纯化制备该蛋白。
附图说明
以下将结合附图对本发明作进一步说明。
图1是本发明表达HSV1病毒的gB糖蛋白的重组质粒构建流程图。
图2是本发明PCR扩增HSV1病毒gB糖蛋白胞外区基因片断的电泳图。
图3是本发明用1.0%的Agarose凝胶检测5个重组子的PCR扩增产物。附图3中M:低分子量DNA标准(TaKaRa,500bp);1~5∶5个转化子均扩增出2107bp的目的基因片段。
图4是本发明表达HSV1病毒的gB糖蛋白重组菌的SDS-PAGE分析结果。附图4中M:低分子量蛋白标准(Pharmacia);7:对照菌E.coli TG1;1~6表示1~6号重组菌,2号、5号2个重组子表达相对分子量约为96000的融合蛋白,1、3、4、6号4个转化子无此蛋白带。
图5是本发明表达HSV1病毒的gB糖蛋白纯化前后的SDS-PAGE分析结果。附图5中M:低分子量蛋白标准(Pharmacia);1:对照菌TG1;2:表达HSV1病毒的gB糖蛋白的工程菌;3:HSV1病毒的gB糖蛋白的包涵体;4:HSV1病毒的gB糖蛋白的纯化过程中的穿过峰;5:Glutathione Sepharose 4B亲和层析柱纯化后的的HSV1病毒的gB糖蛋白;6:HSV1病毒的gB糖蛋白的纯化过程中的上清。
具体实施方式
本发明实施方式的详细说明:
HSV1病毒的gB糖蛋白抗原表位的分析、基因合成及表达
通过计算机分析HSV1病毒的gB糖蛋白的全部氨基酸序列,筛选出HSV1病毒的gB糖蛋白内的强抗原表位,选用细菌偏爱的密码子,化学合成HSV1病毒的gB糖蛋白强抗原表位的全新基因片段。将基因片段克隆至质粒pGEX4T-2内的BamHI/EcoRI位点,与载体上的起始密码子的翻译框架一致,可表达一个融合蛋白。将重组质粒转化大肠杆菌TG1,筛选获得了高效表达HSV1病毒的gB糖蛋白的工程菌,表达的HSV1病毒的gB糖蛋白约占菌体蛋白总量的30%左右。
材料与方法
1.菌种与质粒:宿主菌TG1及表达载体pGEX4T-2为实验室保存。
2.分子生物学试剂:限制性内切酶BamHI、EcoRI、及T4 DNA连接酶为市售TaKaRa公司产品。质粒纯化试剂盒及从琼脂糖凝胶内回收DNA片段的试剂盒为市售TaKaRa公司产品。DTT及IPTG为市售Promega公司产品。其它试剂为进口或国产分析纯试剂。
3.基因片段的合成:由公司按设计合成。
4.基因克隆方法:DNA的酶切、连接、电泳;质粒的提取、转化;蛋白的SDS-PAGE分析等一般分子克隆方法按常规方法进行。其它试剂盒按说明书进行操作。
5.DNA序列分析:用QIAGEN公司质粒纯化试剂盒纯化质粒,用DNA全自动测序仪测序。
实施方式
1.HSV1病毒的gB糖蛋白抗原表位筛选及基因片段的合成:
利用ANTHEWIN等软件,通过计算机分析HSV1病毒的gB糖蛋白的全部氨基酸序列(GeneBank,接通号:AY278488),筛选出HSV1病毒的gB糖蛋白内的强抗原表位(图1),即从第1个氨基酸到第696个氨基酸,其氨基酸序列如下:
Pro Ser Ser Pro Gly Thr Pro GIY Val Ala Ala Ala Thr Gln Ala Ala Asn Gly
Gly Pro Ala Thr Pro Ala Pro Pro Ala Leu GlY Ala Ala Pro Thr GlY Asp Pro
Lys Pro Lys Lys Asn Lys Lys Pro Lys Asn Pro Thr Pro Pro Arg Pro Ala Gly
Asp Asn Ala Thr Val Ala Ala Gly His Ala Thr Leu Arg Glu His Leu Arg Asp
Ile Lys Ala Glu Asn Thr Asp Ala Asn Phe Tyr Val Cys Pro Pro Pro Thr Gly
Ala Thr Val Val Gln Phe Glu Gln Pro Arg Arg Cys Pro Thr Arg Pro Glu Gly
Gln Asn Tyr Thr Glu Gly Ile Ala Val Val Phe Lys Glu Asn Ile Ala Pro Tyr
Lys Phe Lys Ala Thr Met Tyr Tyr Lys Asp Val Thr Val Ser Gln Val Trp Phe
Gly His Arg Tyr Ser Gln Phe Met Gly Ile Phe Glu Asp Arg Ala Pro Val Pro
Phe Glu Glu Val Ile Asp LYs Ile Asn Ala Lys Gly Val Cys Arg Ser Thr Ala
Lys Tyr Val Arg Asn Asn Leu Glu Thr Thr Ala Phe His Arg Asp Asp His Glu
Thr Asp Met Glu Leu Lys Pro Ala Asn Ala Ala Thr Arg Thr Ser Arg Gly Trp
His Thr Thr Asp Leu Lys Tyr Asn Pro Ser Arg Val Glu Ala Phe His Arg Tyr
Gly Thr Thr Val Asn Cys Ile Val Glu Glu Val Asp Ala Arg Ser Val Tyr Pro
Tyr Asp Glu Phe Val Leu Ala Thr Gly Asp Phe Val Tyr Met Ser Pro Phe Tyr
Gly Tyr Arg Glu Gly Ser His Thr Glu His Thr Ser Tyr Ala Ala Asp Arg Phe
Lys Gln Val Asp Gly Phe Tyr Ala Arg Asp Leu Thr Thr Lys Ala Arg Ala Thr
Ala Pro Thr Thr Arg Asn Leu Leu Thr Thr Pro Lys Phe Thr Val Ala Trp Asp
Trp Val Pro Lys Arg Pro Ser Val Cys Thr Met Thr Lys Trp Gln Glu Val Asp
Glu Met Leu Arg Ser Glu Tyr Gly Gly Ser Phe Arg Phe Ser Ser Asp Ala Ile
Ser Thr Thr Phe Thr Thr Asn Leu Thr Glu Tyr Pro Leu Ser Arg Val Asp Leu
Gly Asp Cys Ile Gly Lys Asp Ala Arg Asp Ala Met Asp Arg Ile Phe Ala Arg
Arg Tyr Asn Ala Thr His Ile Lys Val Gly Gln Pro Gln Tyr Tyr Leu Ala Asn
Gly Gly Phe Leu Ile Ala Tyr Gln Pro Leu Leu Ser Asn Thr Leu Ala Glu Leu
Tyr Val Arg Glu His Leu Arg Glu Gln Ser Pro Lys Pro Pro Asn Pro Thr Pro
Pro Pro Pro Gly Ala Ser Ala Asn Ala Ser Val Glu Arg Ile Lys Thr Thr Ser
Ser Ile Glu Phe Ala Arg Leu Gln Phe Thr Tyr Asn His Ile Gln Asn His Val
Asn Asp Met Leu Gly Arg Val Ala Ile Ala Trp Cys Glu Leu Gln Asn His Glu
Leu Thr Leu Trp Asn Glu Ala Arg Lys Leu Asn Pro Asn Ala Ile Ala Ser Ala
Thr Val Gly Arg Arg Val Ser Ala Arg Met Leu Gly Asp Val Met Ala Val Ser
Thr Cys Val Pro Val Ala Ala Asp Asn Val Ile Val Gln Asn Ser Met Arg Ile
Ser Ser Arg Pro Gly Ala Cys Tyr Ser Arg Pro Leu Val Ser Phe Arg Tyr Glu
Asp Gln Gly Pro Leu Val Glu Gly Gln Leu Gly Glu Asn Asn Glu Leu Arg Leu
Thr Arg Asp Ala Ile Glu Pro Cys Thr Val Gly His Arg Arg Tyr Phe Thr Phe
Gly Gly Gly Tyr Val Tyr Phe Glu Glu Ser Ala Tyr Ser His Gln Leu Ser Arg
Ala Asp Ile Thr Thr Val Ser Thr Phe Ile Asp Leu Asn Ile Thr Met Leu Glu
Asp His Glu Phe Val Pro Leu Glu Val Tyr Thr Arg His Glu Ile Lys Asp Ser
Gly Leu Leu Asp Tyr Thr Glu Val Gln Arg Arg Asn Gln Leu His Asp Leu Arg
Phe Ala Asp Ile Asp Thr Val Ile His Ala Asp Ala
根据筛选的HSV1病毒gB糖蛋白内的抗原表位氨基酸序列,选择真核和原核生物均偏爱的密码子,化学合成全新的基因序列,并且在5’端增加了BamHI酶切位点(下画线部分)及两个保护碱基(GC),在3’端增加了终止密码子(TGA)和EcoRI酶切位点(下画线部分)及两个保护碱基(GC),使该基因片段易于克隆至质粒pGEX4T-2内的BamHI和EcoRI酶切位点内。化学合成的含HSV1病毒gB糖蛋白抗原表位基因的DNA序列(2107bp)如下:
GCGGATCC CCT ACC TCT CCT GGT ACT CCT GGT GTG GCT GCC GCT ACC CAG GCC GCT AAC
GGT GGT CCT GCC ACT CCT GCT CCT CCT CCT CTG GGT GCC GCT CCT ACT GGT GAC CCT
AAG CCT AAG AAG AAC AAG AAG CCT AAG AAC CCT ACT CCT CCT AGG CCT GCT GGT GAC
AAC GCC ACC GTG GCC GCT GGC CAC GCC ACT CTG AGA GAG CAC CTG AGA GAC ATC AAG
GCT GAG AAC ACC GAC GCT AAC TTC TAC GTG TGT CCT CCT CCT ACT GGT GCC ACC GTG
GTG CAG TTC GAG CAG CCT CGC AGG TGC CCT ACC AGG CCT GAA GGT CAG AAC TAC ACC
GAA GGC ATC GCC GTG GTG TTC AAG GAG AAC ATC GCT CCT TAC AAG TTC AAG GCC ACC
ATG TAC TAC AAG GAC GTG ACC GTG AGC CAG GTG TGG TTC GGC CAC CGC TAC TCC CAG
TTC ATG GGC ATC TTC GAG GAC CGC GCT CCT GTG CCT TTC GAG GAG GTG ATC GAC AAG
ATC AAC GCC AAG GGT GTG TGT AGG TCC ACC GCT AAG TAC GTG CGC AAC AAC CTG GAG
ACC ACT GCT TTC CAC AGA GAC GAT CAC GAG ACC GAC ATG GAG CTG AAG CCT GCC AAC
GCC GCT ACT CGC ACC AGC AGA GGC TGG CAC ACC ACT GAC CTG AAG TAC AAC CCT AGC
AGA GTG GAG GCC TTC CAC AGG TAC GGC ACC ACC GTG AAC TGC ATC GTG GAG GAG GTG
GAC GCT CGC AGC GTG TAC CCT TAC GAC GAG TTC GTG CTG GCC ACT GGT GAC TTC GTG
TAC ATG TCT CCT TTC TAC GGC TAC AGA GAA GGT AGC CAC ACC GAG CAC ACT ACC TAC
GCT GCT GAC AGG TTC AAG CAG GTG GAC GGC TTC TAC GCT CGC GAC CTG ACC ACC AAG
GCT AGA GCC ACT GCT CCT ACC ACT AGG AAC CTG CTG ACC ACT CCT AAG TTC ACC GTG
GCT TGG GAC TGG GTG CCT AAG CGC CCT AGC GTG TGC ACC ATG ACC AAG TGG CAG GAG
GTG GAC GAG ATG CTG CGC TCC GAG TAC GGC GGT TCC TTC AGG TTC TCC TCT GAC GCT
ATC TCC ACT ACC TTC ACT ACC AAC CTG ACC GAG TAC CCT CTG TCC AGA GTG GAC CTG
GGT GAC TGC ATC GGT AAG GAC GCT CGC GAC GCC ATG GAC CGC ATC TTC GCT CGC AGG
TAC AAC GCT ACT CAC ATC AAG GTG GGC CAG CCT CAG TAC TAC CAG GCC AAC GGT GGT
TTC CTG ATC GCC TAC CAG CCT CTG CTG AGC AAC ACT CTG GCT GAG CTG TAC GTG AGA
GAG CAC CTG AGA GAG CAG AGC CGC AAG CCT CCT AAC CCT ACG CCT CCT CCT CCC GGT
GCT AGC GCC AAC GCT TCC GTG GAG CGC ATC AAG ACT ACC TCT AGC ATC GAG TTC GCC
AGG CTG CAG TTC ACC TAC AAC CAC ATC CAG CGC CAC GTG AAC GAC ATG CTG GGT CGC
GTG GCT ATC GCT TGG TGC GAG CTG CAG AAC CAC GAG CTG ACT CTG TGG AAC GAG GCT
CGC AAG CTG AAC CCT AAC GCT ATC GCC AGC GTG ACC GTG GGC AGG AGA GTG AGC GCT
AGA ATG CTG GGC GAC GTG ATG GCC GTG TCC ACC TGC GTG CCT GTG GCT GCT GAC AAC
GTG ATC GTG CAG AAC AGC ATG CGC ATC AGC TCC AGA CCT GGT GCC TGC TAC AGC AGA
CCT CTG GTG AGC TTC AGG TAC GAG GAC CAA GGT CCT CTG GTG GAA GGT CAG CTG GGT
GAG AAC AAC GAG CTG AGG CTG ACT CGC GAC GCT ATC GAG CCT TGC ACC GTC GGT CAC
AGA CGC TAC TTC ACC TTC GGT GGC GGT TAC GTG TAC TTC GAG GAG TAC GCT TAC TCT
CAC CAG CTG AGC CGC GCT GAC ATC ACT ACC GTG AGC ACC TTC ATC GAC CTG AAC ATC
ACC ATG CTG GAG GAC CAC GAG TTC GTG CCT CTG GAG GTG TAC ACC CGC CAC GAG ATC
AAG GAC AGC GGC CTG CTG GAC TAC ACC GAG GTG CAG CGC CGC AAC CAG CTG CAC GAC
CTG CGC TTC GCT GAC ATC GAC ACC GTG ATC CAC GCC GAC GCC TGA GAATTCGC
2.表达HSV1病毒gB糖蛋白胞外区片段重组质粒的构建:
提取质粒pGEX4T-2,用Bam H I和EcoRI双酶切,电泳后回收酶切的质粒大片段,溶于去离子水内。同样用BamH I和EcoR I双酶切化学合成的HSV1病毒gB糖蛋白胞外区基因片段,电泳回收后,溶于去离子水内。
取等摩尔浓度的上述两种酶切后DNA片段,在同一离心管内用T4 DNA连接酶连接,使HSV1病毒gB糖蛋白胞外区基因片段插入到载体pGEX4T-2内的BamH I和EcoRI位点之间,与载体上的起始密码子翻译框架一致,表达一个融合蛋白(构建流程见图2)。
3.重组质粒的筛选与鉴定:
将上步连接的重组质粒转化到大肠杆菌TGl,将转化产物涂布含氨苄青霉素(100μg/ml)的固体LB培养基上,置37℃培养过夜。次日随机挑选5个转化子菌落(分别标记为1-5号)和1个对照菌(质粒pGEX4T-2转化菌),分别接种到含3ml液体LB培养基(含氨苄青霉素100μg/ml)的试管内,置37℃振荡培养5h,取菌液1ml,离心收菌。分别用50μl去离子水悬浮菌体,沸水煮5min,离心(4℃,12000rpm)5min,取上清(内有质粒)2μl用作PCR模板,PCR扩增插入载体内的HSV1病毒gB糖蛋白胞外区基因片段,PCR反应浓度为:质粒模板2μl、HSV1病毒gB糖蛋白胞外区基因片段的正链P1(GCGGATCCCCTACCTCTCCTGGTACTC)和负链引物P2(GCGAATTCTTAGGCGTCGGC)各1μl、10×pyrobest buffer5.0μl、2.5mmol/L dNTP4.0μl、Pyrobest DNA Taq酶0.5μl(2.5U)、去离子水36.5μl,总体积50μl。扩增条件为:94℃30秒、55℃30秒、72℃4分钟,35个循环;最后72℃延伸7分钟。取PCR扩增产物5μl,用1.0%的Agarose凝胶检测,结果,5个转化子均扩增出2107bp的目的基因片段(见图3),而含质粒pGEX4T-2的对照菌没有扩增出该基因片段。初步证实,这5个转化子均含有HSV1病毒gB糖蛋白基因片段。
提取1号重组子的质粒,测定质粒内的HSV1病毒gB糖蛋白基因序列,DNA序列分析证实,重组质粒含有合成的HSV1病毒gB糖蛋白基因片段,序列完全正确:
CCT ACC TCT CCT GGT ACT CCT GGT GTG GCT GCC GCT ACC CAG GCC GCT AAC GGT GGT
CCT GCC ACT CCT GCT CCT CCT CCT CTG GGT GCC GCT CCT ACT GGT GAC CCT AAG CCT
AAG AAG AAC AAG AAG CCT AAG AAC CCT ACT CCT CCT AGG CCT GCT GGT GAC AAC GCC
ACC GTG GCC GCT GGC CAC GCC ACT CTG AGA GAG CAC CTG AGA GAC ATC AAG GCT GAG
AAC ACC GAC GCT AAC TTC TAC GTG TGT CCT CCT CCT ACT GGT GCC ACC GTG GTG CAG
TTC GAG CAG CCT CGC AGG TGC CCT ACC AGG CCT GAA GGT CAG AAC TAC ACC GAA GGC
ATC GCC GTG GTG TTC AAG GAG AAC ATC GCT CCT TAC AAG TTC AAG GCC ACC ATG TAC
TAC AAG GAC GTG ACC GTG AGC CAG GTG TGG TTC GGC CAC CGC TAC TCC CAG TTC ATG
GGC ATC TTC GAG GAC CGC GCT CCT GTG CCT TTC GAG GAG GTG ATC GAC AAG ATC AAC
GCC AAG GGT GTG TGT AGG TCC ACC GCT AAG TAC GTG CGC AAC AAC CTG GAG ACC ACT
GCT TTC CAC AGA GAC GAT CAC GAG ACC GAC ATG GAG CTG AAG CCT GCC AAC GCC GCT
ACT CGC ACC AGC AGA GGC TGG CAC ACC ACT GAC CTG AAG TAC AAC CCT AGC AGA GTG
GAG GCC TTC CAC AGG TAC GGC ACC ACC GTG AAC TGC ATC GTG GAG GAG GTG GAC GCT
CGC AGC GTG TAC CCT TAC GAC GAG TTC GTG CTG GCC ACT GGT GAC TTC GTG TAC ATG
TCT CCT TTC TAC GGC TAC AGA GAA GGT AGC CAC ACC GAG CAC ACT ACC TAC GCT GCT
GAC AGG TTC AAG CAG GTG GAC GGC TTC TAC GCT CGC GAC CTG ACC ACC AAG GCT AGA
GCC ACT GCT CCT ACC ACT AGG AAC CTG CTG ACC ACT CCT AAG TTC ACC GTG GCT TGG
GAC TGG GTG CCT AAG CGC CCT AGC GTG TGC ACC ATG ACC AAG TGG CAG GAG GTG GAC
GAG ATG CTG CGC TCC GAG TAC GGC GGT TCC TTC AGG TTC TCC TCT GAC GCT ATC TCC
ACT ACC TTC ACT ACC AAC CTG ACC GAG TAC CCT CTG TCC AGA GTG GAC CTG GGT GAC
TGC ATC GGT AAG GAC GCT CGC GAC GCC ATG GAC CGC ATC TTC GCT CGC AGG TAC AAC
GCT ACT CAC ATC AAG GTG GGC CAG CCT CAG TAC TAC CAG GCC AAC GGT GGT TTC CTG
ATC GCC TAC CAG CCT CTG CTG AGC AAC ACT CTG GCT GAG CTG TAC GTG AGA GAG CAC
CTG AGA GAG CAG AGC CGC AAG CCT CCT AAC CCT ACG CCT CCT CCT CCC GGT GCT AGC
GCC AAC GCT TCC GTG GAG CGC ATC AAG ACT ACC TCT AGC ATC GAG TTC GCC AGG CTG
CAG TTC ACC TAC AAC CAC ATC CAG CGC CAC GTG AAC GAC ATG CTG GGT CGC GTG GCT
ATC GCT TGG TGC GAG CTG CAG AAC CAC GAG CTG ACT CTG TGG AAC GAG GCT CGC AAG
CTG AAC CCT AAC GCT ATC GCC AGC GTG ACC GTG GGC AGG AGA GTG AGC GCT AGA ATG
CTG GGC GAC GTG ATG GCC GTG TCC ACC TGC GTG CCT GTG GCT GCT GAC AAC GTG ATC
GTG CAG AAC AGC ATG CGC ATC AGC TCC AGA CCT GGT GCC TGC TAC AGC AGA CCT CTG
GTG AGC TTC AGG TAC GAG GAC CAA GGT CCT CTG GTG GAA GGT CAG CTG GGT GAG AAC
AAC GAG CTG AGG CTG ACT CGC GAC GCT ATC GAG CCT TGC ACC GTC GGT CAC AGA CGC
TAC TTC ACC TTC GGT GGC GGT TAC GTG TAC TTC GAG GAG TAC GCT TAC TCT CAC CAG
CTG AGC CGC GCT GAC ATC ACT ACC GTG AGC ACC TTC ATC GAC CTG AAC ATC ACC ATG
CTG GAG GAC CAC GAG TTC GTG CCT CTG GAG GTG TAC ACC CGC CAC GAG ATC AAG GAC
AGC GGC CTG CTG GAC TAC ACC GAG GTG CAG CGC CGC AAC CAG CTG CAC GAC CTG CGC
TTC GCT GAC ATC GAC ACC GTG ATC CAC GCC GAC GCC TGA
构建的重组质粒表达HSV1的病毒gB糖蛋白片段(696个氨基酸),在其N端融合了载体上的238个氨基酸,全长934个氨基酸,其氨基酸序列如下:
Met Pro Met Ile Leu Gly Tyr Trp Asp Ile Arg Gly Leu Ala His Ala Ile Arg
Leu Leu Leu Glu Tyr Thr Asp Ser Ser Tyr Glu Glu Lys Lys Tyr Thr Met Gly
Gly Ala Pro Asp Tyr Asp Arg Ser Gln Trp Leu Asn Glu Lys Phe Lys Leu Gly
Leu Asp Phe Pro Asn Leu Pro Tyr Leu Ile Asp Gly Ala His Lys Ile Thr Gln
Ser Asn Ala Ile Leu Cys Tyr Ile Ala Arg Lys His Asn Leu Cys Gly Glu Thr
Glu Glu Glu Lys Ile Arg Val Asp Ile Leu Glu Asn Gln Ala Met Asp Val Ser
Asn Gln Leu Ala Arg Val Cys Tyr Ser Pro Asp Phe Glu Lys Leu Lys Pro Glu
Tyr Leu Glu Glu Leu Pro Thr Met Met Gln His Phe Ser Gln Phe Leu Gly Lys
Arg Pro Trp Phe Val Gly Asp Lys Ile Thr Phe Val Asp Phe Leu Ala Tyr Asp
Val Leu Asp Leu His Arg Ile Phe Glu Pro Asn Cys Leu Asp Ala Phe Pro Asn
Leu Lys Asp Phe Ile Ser Arg Phe Glu Gly Leu Glu Lys Ile Ser Ala Tyr Met
Lys Ser Ser Arg Phe Leu Pro Lys Pro Leu Tyr Thr Arg Met Ala Val Trp Gly
Asn Lys Pro Ser Ser Pro Gly Thr Pro Gly Val Ala Ala Ala Thr Gln Ala Ala
Asn Gly Gly Pro Ala Thr Pro Ala Pro Pro Ala Leu Gly Ala Ala Pro Thr Gly
Asp Pro Lys Pro Lys Lys Asn Lys Lys Pro Lys Asn Pro Thr Pro Pro Arg Pro
Ala Gly Asp Asn Ala Thr Val Ala Ala Gly His Ala Thr Leu Arg Glu His Leu
Arg Asp Ile Lys Ala Glu Asn Thr Asp Ala Asn Phe Tyr Val Cys Pro Pro Pro
Thr Gly Ala Thr Val Val Gln Phe Glu Gln Pro Arg Arg Cys Pro Thr Arg Pro
Glu Gly Gln Asn Tyr Thr Glu Gly Ile Ala Val Val Phe Lys Glu Asn Ile Ala
Pro Tyr Lys Phe Lys Ala Thr Met Tyr Tyr Lys Asp Val Thr Val Ser Gln Val
Trp Phe Gly His Arg Tyr Ser Gln Phe Met Gly Ile Phe Glu Asp Arg Ala Pro
Val Pro Phe Glu Glu Val Ile Asp Lys Ile Asn Ala Lys Gly Val Cys Arg Ser
Thr Ala Lys Tyr Val Arg Asn Asn Leu Glu Thr Thr Ala Phe His Arg Asp Asp
His Glu Thr Asp Met Glu Leu Lys Pro Ala Asn Ala Ala Thr Arg Thr Ser Arg
Gly Trp His Thr Thr Asp Leu Lys Tyr Asn Pro Ser Arg Val Glu Ala Phe His
Arg Tyr Gly Thr Thr Val Asn Cys Ile Val Glu Glu Val Asp Ala Arg Ser Val
Tyr Pro Tyr Asp Glu Phe Val Leu Ala Thr Gly Asp Phe Val Tyr Met Ser Pro
Phe Tyr Gly Tyr Arg Glu Gly Ser His Thr Glu His Thr Ser Tyr Ala Ala Asp
Arg Phe Lys Gln Val Asp Gly Phe Tyr Ala Arg Asp Leu Thr Thr Lys Ala Arg
Ala Thr Ala Pro Thr Thr Arg Asn Leu Leu Thr Thr Pro Lys Phe Thr Val Ala
Trp Asp Trp Val Pro Lys Arg Pro Ser Val Cys Thr Met Thr Lys Trp Gln Glu
Val Asp Glu Met Leu Arg Ser Glu Tyr Gly Gly Ser Phe Arg Phe Ser Ser Asp
Ala Ile Ser Thr Thr Phe Thr Thr Asn Leu Thr Glu Tyr Pro Leu Ser Arg Val
Asp Leu Gly Asp Cys Ile Gly Lys Asp Ala Arg Asp Ala Met Asp Arg Ile Phe
Ala Arg Arg Tyr Asn Ala Thr His Ile Lys Val Gly Gln Pro Gln Tyr Tyr Leu
Ala Asn Gly Gly Phe Leu Ile Ala Tyr Gln Pro Leu Leu Ser Asn Thr Leu Ala
Glu Leu Tyr Val Arg Glu His Leu Arg Glu Gln Ser Pro Lys Pro Pro Asn Pro
Thr Pro Pro Pro Pro Gly Ala Ser Ala Asn Ala Ser Val Glu Arg Ile Lys Thr
Thr Ser Ser Ile Glu Phe Ala Arg Leu Gln Phe Thr Tyr Asn His Ile Gln Asn
His Val Asn Asp Met Leu Gly Arg Val Ala Ile Ala Trp Cys Glu Leu Gln Asn
His Glu Leu Thr Leu Trp Asn Glu Ala Arg Lys Leu Asn Pro Asn Ala Ile Ala
Ser Ala Thr Val Gly Arg Arg Val Ser Ala Arg Met Leu Gly Asp Val Met Ala
Val Ser Thr Cys Val Pro Val Ala Ala Asp Asn Val Ile Val Gln Asn Ser Met
Arg Ile Ser Ser Arg Pro Gly Ala Cys Tyr Ser Arg Pro Leu Val Ser Phe Arg
Tyr Glu Asp Gln Gly Pro Leu Val Glu Gly Gln Leu Gly Glu Asn Asn Glu Leu
Arg Leu Thr Arg Asp Ala Ile Glu Pro Cys Thr Val Gly His Arg Arg Tyr Phe
Thr Phe Gly Gly Gly Tyr Val Tyr Phe Glu Glu Ser Ala Tyr Ser His Gln Leu
Ser Arg Ala Asp Ile Thr Thr Val Ser Thr Phe Ile Asp Leu Asn Ile Thr Met
Leu Glu Asp His Glu Phe Val Pro Leu Glu Val Tyr Thr Arg His Glu Ile Lys
Asp Ser Gly Leu Leu Asp Tyr Thr Glu Val Gln Arg Arg Asn Gln Leu His Asp
Leu Arg Phe Ala Asp Ile Asp Thr Val Ile His Ala Asp Ala
4.表达HSV1病毒gB糖蛋白工程菌的筛选鉴定:
将含有重组质粒的5个阳性转化子和1个对照菌(质粒pGEX4T-2转化菌),接种至含3ml LB培养基(含氨苄青霉素100μg/ml)的试管内,37℃振荡培养3h,加IPTG至终浓度1.0mmol/L,继续振荡培养诱导5h,离心收集菌体进行SDS-PAGE检测,重组子表达相对分子量约为96kD的HSV1病毒gB糖蛋白,表达量约为30%,而对照菌TG1无此蛋白带(图4)。
表达HSV1病毒gB糖蛋白的纯化
根据表达HSV1病毒gB糖蛋白的氨基酸序列,分析其理化特性,确定适当的纯化方法。我们所表达的HSV1gB糖蛋白融合有载体上的GST蛋白,GST融合蛋白在表达时同时表达了谷胱甘肽s转酶,可很方便用GST琼脂糖凝胶FF分离,因此我们决定采用亲和层析法,用Glutathione Sepharose 4B凝胶进行纯化。具体步骤如下:
材料和方法
1.主要试剂:
Glutathione Sepharose 4B凝胶为GE Heathcare公司产品,IPTG、DTT为Promega公司产品。其它试剂为国产或进口分析纯试剂。
2.表达HSV1病毒的gB糖蛋白的超声裂解:
将培养的表达HSV1病毒的gB糖蛋白的工程菌离心(8000rpm,20mins,4℃),弃上清,菌体重悬于原培养液1/10体积的裂解液(50mmol/L Tris-HCl pH8.0、10mmol/LEDTA、10mmol/L DTT)内,冰浴超声破菌10mins,离心(8000rpm、20mins,4℃)收集上清,弃沉淀。收集的上清用于下一步的亲和层析纯化。
3.表达HSV1病毒gB糖蛋白的纯化:
上清溶液加已经平衡的Glutathione Sepharose 4B凝胶3ml室温结合60min,上样,收集穿透液。用十倍柱床体积的1×PBS洗涤柱子,接着用15ml高浓度的GSH洗脱液(50mmol/LTris-HCL pH8.0+10mmol/LGST)分三次洗脱目的蛋白,即为纯化的HSV1病毒gB糖蛋白胞外区片段。
结果:
将从Glutathione Sepharose 4B凝胶柱上洗脱的蛋白进行SDS-PAGE分析,结果显示(见图5),经诱导明显表达出HSV1gB/GST融合蛋白,表达产物主要存在于上清液中。SDS-PAGE显示表达产物约96kDa。
纯化HSV1病毒的gB糖蛋白的鉴定及应用
将纯化的重组HSV1病毒gB糖蛋白抗原,通过双抗体夹心ELISA试验方法检测,以鉴定表达的HSV1病毒gB糖蛋白的抗原性和特异性。实验结果显示,该重组蛋白有很好的抗原性和特异性。
材料和方法
1.酶联反应试剂盒:Human HSV-1 ELISA试剂盒为美国Uscnlife公司产品。
2.ELISA试验:将初步纯化获得的融合蛋白按1∶1000~1∶32000倍比稀释,并将阴性对照以同样浓度倍比稀释,用Human HSV-1 ELISA试剂盒检测表达蛋白的抗原性(具体操作方法见试剂盒说明书)。
结果
HSV1gB蛋白的ELISA检测
将初步纯化获得的融合蛋白按1∶1000~1∶32000倍比稀释,并将阴性对照以同样浓度倍比稀释,用Human HSV-1 ELISA试剂盒检测表达蛋白的抗原性。结果显示(表1)此融合表达蛋白具有较好的抗原性和特异性。
表1表达蛋白的ELISA实验结果
Table1.ELISA results of the expressed protein
蛋白稀释浓度1∶10001∶2000 1∶40001∶8000 1∶16000 1∶32000HSVlgBControl0.8140.0210.5390.019 0.428 0.0250.4000.011 0.396 0.003 0.129 0.007
化学合成的HSV病毒gB蛋白胞外区基因片段序列表
<110>李越希
<120>化学合成的HSV病毒gB蛋白胞外区基因片段及其表达、应用
<160>2
<210>1
<211>696
<212>PRT
<213>HSV病毒gB蛋白胞外区片段
<220>
<223>含强抗原表位的HSV病毒gB蛋白胞外区片段,第1个氨基酸-第696个氨基酸,共696个氨基酸。
<400>1
Pro Ser Ser Pro Gly Thr Pro Gly Val Ala Ala Ala Thr Gln Ala Ala
1 5 10 15
Asn Gly Gly Pro Ala Thr Pro Ala Pro Pro Ala Leu Gly Ala Ala Pro
20 25 30
Thr Gly Asp Pro Lys Pro Lys Lys Asn Lys Lys Pro Lys Asn Pro Thr
35 40 45
Pro Pro Arg Pro Ala Gly Asp Asn Ala Thr Val Ala Ala Gly His Ala
50 55 60
Thr Leu Arg Glu His Leu Arg Asp Ile Lys Ala Glu Asn Thr Asp Ala
65 70 75 80
Asn Phe Tyr Val Cys Pro Pro Pro Thr Gly Ala Thr Val Val Gln Phe
85 90 95
Glu Gln Pro Arg Arg Cys Pro Thr Arg Pro Glu Gly Gln Asn Tyr Thr
100 105 110
Glu Gly Ile Ala Val Val Phe Lys Glu Asn Ile Ala Pro Tyr Lys Phe
115 120 125
Lys Ala Thr Met Tyr Tyr Lys Asp Val Thr Val Ser Gln Val Trp Phe
130 135 140
Gly His Arg Tyr Ser Gln Phe Met Gly Ile Phe Glu Asp Arg Ala Pro
145 150 155 160
Val Pro Phe Glu Glu Val Ile Asp Lys Ile Asn Ala Lys Gly Val Cys
165 170 175
Arg Ser Thr Ala Lys Tyr Val Arg Asn Asn Leu Glu Thr Thr Ala Phe
180 185 190
His Arg Asp Asp His Glu Thr Asp Met Glu Leu Lys Pro Ala Asn Ala
195 200 205
Ala Thr Arg Thr Ser Arg Gly Trp His Thr Thr Asp Leu Lys Tyr Asn
210 215 220
Pro Ser Arg Val Glu Ala Phe His Arg Tyr Gly Thr Thr Val Asn Cys
225 230 235 240
Ile Val Glu Glu Val Asp Ala Arg Ser Val Tyr Pro Tyr Asp Glu Phe
245 250 255
Val Leu Ala Thr Gly Asp Phe Val Tyr Met Ser Pro Phe Tyr Gly Tyr
260 265 270
Arg Glu Gly Ser His Thr Glu His Thr Ser Tyr Ala Ala Asp Arg Phe
275 280 285
Lys Gln Val Asp Gly Phe Tyr Ala Arg Asp Leu Thr Thr Lys Ala Arg
290 295 300
Ala Thr Ala Pro Thr Thr Arg Asn Leu Leu Thr Thr Pro Lys Phe Thr
305 310 315 320
Val Ala Trp Asp Trp Val Pro Lys Arg Pro Ser Val Cys Thr Met Thr
325 330 335
Lys Trp Gln Glu Val Asp Glu Met Leu Arg Ser Glu Tyr Gly Gly Ser
340 345 350
Phe Arg Phe Ser Ser Asp Ala Ile Ser Thr Thr Phe Thr Thr Asn Leu
355 360 365
Thr Glu Tyr Pro Leu Ser Arg Val Asp Leu Gly Asp Cys Ile Gly Lys
370 375 380
Asp Ala Arg Asp Ala Met Asp Arg Ile Phe Ala Arg Arg Tyr Asn Ala
385 390 395 400
Thr His Ile Lys Val Gly Gln Pro Gln Tyr Tyr Leu Ala Asn Gly Gly
405 410 415
Phe Leu Ile Ala Tyr Gln Pro Leu Leu Ser Asn Thr Leu Ala Glu Leu
420 425 430
Tyr Val Arg Glu His Leu Arg Glu Gln Ser Pro Lys Pro Pro Asn Pro
435 440 445
Thr Pro Pro Pro Pro Gly Ala Ser Ala Asn Ala Ser Val Glu Arg Ile
450 455 460
Lys Thr Thr Ser Ser Ile Glu Phe Ala Arg Leu Gln Phe Thr Tyr Asn
465 470 475 480
His Ile Gln Asn His Val Asn Asp Met Leu Gly Arg Val Ala Ile Ala
485 490 495
Trp Cys Glu Leu Gln Asn His Glu Leu Thr Leu Trp Asn Glu Ala Arg
500 505 510
Lys Leu Asn Pro Asn Ala Ile Ala Ser Ala Thr Val Gly Arg Arg Val
515 520 525
Ser Ala Arg Met Leu Gly Asp Val Met Ala Val Ser Thr Cys Val Pro
530 535 540
Val Ala Ala Asp Asn Val Ile Val Gln Asn Ser Met Arg Ile Ser Ser
545 550 555 560
Arg Pro Gly Ala Cys Tyr Ser Arg Pro Leu Val Ser Phe Arg Tyr Glu
565 570 575
Asp Gln Gly Pro Leu Val Glu Gly Gln Leu Gly Glu Asn Asn Glu Leu
580 585 590
Arg Leu Thr Arg Asp Ala Ile Glu Pro Cys Thr Val Gly His Arg Arg
595 600 605
Tyr Phe Thr Phe Gly Gly Gly Tyr Val Tyr Phe Glu Glu Ser Ala Tyr
610 615 620
Ser His Gln Leu Ser Arg Ala Asp Ile Thr Thr Val Ser Thr Phe Ile
625 630 635 640
Asp Leu Asn Ile Thr Met Leu Glu Asp His Glu Phe Val Pro Leu Glu
645 650 655
Val Tyr Thr Arg His Glu Ile Lys Asp Ser Gly Leu Leu Asp Tyr Thr
660 665 670
Glu Val Gln Arg Arg Asn Gln Leu His Asp Leu Arg Phe Ala Asp Ile
675 680 685
Asp Thr Val Ile His Ala Asp Ala
690 695
<210>2
<211>2107
<212>DNA
<213>人工序列
<220>
<221>CDS
<222>(1)...(2104)
<223>人工合成的全新基因片段,编码HSV病毒gB蛋白胞外区片段。
<220>
<221>mis-feature
<222>(2105)...(2107)
<223>合成基因时增加的终止密码子。
<400>2
CCT ACC TCT CCT GGT ACT CCT GGT GTG GCT GCC GCT ACC CAG GCC GCT AAC GGT GGT CCT 60
Pro Ser Ser Pro Gly Thr Pro Gly Val Ala Ala Ala Thr Gln Ala Ala Asn Gly Gly Pro
1 5 10 15 20
GCC ACT CCT GCT CCT CCT CCT CTG GGT GCC GCT CCT ACT GGT GAC CCT AAG CCT AAG AAG 120
Ala Thr Pro Ala Pro Pro Ala Leu Gly Ala Ala Pro Thr Gly Asp Pro Lys Pro Lys Lys
25 30 35 40
AAC AAG AAG CCT AAG AAC CCT ACT CCT CCT AGG CCT GCT GGT GAC AAC GCC ACC GTG GCC 180
Asn Lys Lys Pro Lys Asn Pro Thr Pro Pro Arg Pro Ala Gly Asp Asn Ala Thr Val Ala
45 50 55 60
GCT GGC CAC GCC ACT CTG AGA GAG CAC CTG AGA GAC ATC AAG GCT GAG AAC ACC GAC GCT 240
Ala Gly His Ala Thr Leu Arg Glu His Leu Arg Asp Ile Lys Ala Glu Asn Thr Asp Ala
65 70 75 80
AAC TTC TAC GTG TGT CCT CCT CCT ACT GGT GCC ACC GTG GTG CAG TTC GAG CAG CCT CGC 300
Asn Phe Tyr Val Cys Pro Pro Pro Thr Gly Ala Thr Val Val Gln Phe Glu Gln Pro Arg
85 90 95 100
AGG TGC CCT ACC AGG CCT GAA GGT CAG AAC TAC ACC GAA GGC ATC GCC GTG GTG TTC AAG 360
Arg Cys Pro Thr Arg Pro Glu Gly Gln Asn Tyr Thr Glu Gly Ile Ala Val Val Phe Lys
105 110 115 120
GAG AAC ATC GCT CCT TAC AAG TTC AAG GCC ACC ATG TAC TAC AAG GAC GTG ACC GTG AGC 420
Glu Asn Ile Ala Pro Tyr Lys Phe Lys Ala Thr Met Tyr Tyr Lys Asp Val Thr Val Ser
125 130 135 140
CAG GTG TGG TTC GGC CAC CGC TAC TCC CAG TTC ATG GGC ATC TTC GAG GAC CGC GCT CCT 480
Gln Val Trp Phe Gly His Arg Tyr Ser Gln Phe Met Gly Ile Phe Glu Asp Arg Ala Pro
145 150 155 160
GTG CCT TTC GAG GAG GTG ATC GAC AAG ATC AAC GCC AAG GGT GTG TGT AGG TCC ACC GCT 540
Val Pro Phe Glu Glu Val Ile Asp Lys Ile Asn Ala Lys Gly Val Cys Arg Ser Thr Ala
165 170 175 180
AAG TAC GTG CGC AAC AAC CTG GAG ACC ACT GCT TTC CAC AGA GAC GAT CAC GAG ACC GAC 600
Lys Tyr Val Arg Asn Asn Leu Glu Thr Thr Ala Phe His Arg Asp Asp His Glu Thr Asp
185 190 195 200
ATG GAG CTG AAG CCT GCC AAC GCC GCT ACT CGC ACC AGC AGA GGC TGG CAC ACC ACT GAC 660
Met Glu Leu Lys Pro Ala Asn Ala Ala Thr Arg Thr Ser Arg Gly Trp His Thr Thr Asp
205 210 215 220
CTG AAG TAC AAC CCT AGC AGA GTG GAG GCC TTC CAC AGG TAC GGC ACC ACC GTG AAC TGC 720
Leu Lys Tyr Asn Pro Ser Arg Val Glu Ala Phe His Arg Tyr Gly Thr Thr Val Asn Cys
225 230 235 240
ATC GTG GAG GAG GTG GAC GCT CGC AGC GTG TAC CCT TAC GAC GAG TTC GTG CTG GCC ACT 780
Ile Val Glu Glu Val Asp Ala Arg Ser Val Tyr Pro Tyr Asp Glu Phe Val Leu Ala Thr
245 250 255 260
GGT GAC TTC GTG TAC ATG TCT CCT TTC TAC GGC TAC AGA GAA GGT AGC CAC ACC GAG CAC 840
Gly Asp Phe Val Tyr Met Ser Pro Phe Tyr GlY Tyr Arg Glu Gly Ser His Thr Glu His
265 270 275 280
ACT ACC TAC GCT GCT GAC AGG TTC AAG CAG GTG GAC GGC TTC TAC GCT CGC GAC CTG ACC 900
Thr Ser Tyr Ala Ala Asp Arg Phe Lys Gln Val Asp Gly Phe Tyr Ala Arg Asp Leu Thr
285 290 295 300
ACC AAG GCT AGA GCC ACT GCT CCT ACC ACT AGG AAC CTG CTG ACC ACT CCT AAG TTC ACC 960
Thr Lys Ala Arg Ala Thr Ala Pro Thr Thr Arg Asn Leu Leu Thr Thr Pro Lys Phe Thr
305 310 315 320
GTG GCT TGG GAC TGG GTG CCT AAG CGC CCT AGC GTG TGC ACC ATG ACC AAG TGG CAG GAG 1020
Val Ala Trp Asp Trp Val Pro Lys Arg Pro Ser Val Cys Thr Met Thr Lys Trp Gln Glu
325 330 335 340
GTG GAC GAG ATG CTG CGC TCC GAG TAC GGC GGT TCC TTC AGG TTC TCC TCT GAC GCT ATC 1080
Val Asp Glu Met Leu Arg Ser Glu Tyr Gly Gly Ser Phe Arg Phe Ser Ser Asp Ala Ile
345 350 355 360
TCC ACT ACC TTC ACT ACC AAC CTG ACC GAG TAC CCT CTG TCC AGA GTG GAC CTG GGT GAC 1140
Ser Thr Thr Phe Thr Thr Asn Leu Thr Glu Tyr Pro Leu Ser Arg Val Asp Leu Gly Asp
365 370 375 380
TGC ATC GGT AAG GAC GCT CGC GAC GCC ATG GAC CGC ATC TTC GCT CGC AGG TAC AAC GCT 1200
Cys Ile Gly Lys Asp Ala Arg Asp Ala Met Asp Arg Ile Phe Ala Arg Arg Tyr Ash Ala
385 390 395 400
ACT CAC ATC AAG GTG GGC CAG CCT CAG TAC TAC CAG GCC AAC GGT GGT TTC CTG ATC GCC 1260
Thr His Ile Lys Val Gly Gln Pro Gln Tyr Tyr Leu Ala Asn GlY Gly Phe Leu Ile Ala
405 410 415 420
TAC CAG CCT CTG CTG AGC AAC ACT CTG GCT GAG CTG TAC GTG AGA GAG CAC CTG AGA GAG 1320
Tyr Gln Pro Leu Leu Ser Asn Thr Leu Ala Glu Leu Tyr Val Arg Glu His Leu Arg Glu
425 430 435 440
CAG AGC CGC AAG CCT CCT AAC CCT ACG CCT CCT CCT CCC GGT GCT AGC GCC AAC GCT TCC 1380
Gln Ser Pro Lys Pro Pro Asn Pro Thr Pro Pro Pro Pro Gly Ala Ser Ala Asn Ala Ser
445 450 455 460
GTG GAG CGC ATC AAG ACT ACC TCT AGC ATC GAG TTC GCC AGG CTG CAG TTC ACC TAC AAC 1440
Val Glu Arg Ile Lys Thr Thr Ser Ser Ile Glu Phe Ala Arg Leu Gln Phe Thr Tyr Asn
465 470 475 480
CAC ATC CAG CGC CAC GTG AAC GAC ATG CTG GGT CGC GTG GCT ATC GCT TGG TGC GAG CTG 1500
His Ile Gln Asn His Val Asn Asp Met Leu Gly Arg Val Ala Ile Ala Trp Cys Glu Leu
485 490 495 500
CAG AAC CAC GAG CTG ACT CTG TGG AAC GAG GCT CGC AAG CTG AAC CCT AAC GCT ATC GCC 1560
Gln Asn His Glu Leu Thr Leu Trp Asn Glu Ala Arg Lys Leu Asn Pro Asn Ala Ile Ala
505 510 515 520
AGC GTG ACC GTG GGC AGG AGA GTG AGC GCT AGA ATG CTG GGC GAC GTG ATG GCC GTG TCC 1620
Ser Ala Thr Val Gly Arg Arg Val Ser Ala Arg Met Leu Gly Asp Val Met Ala Val Ser
525 530 535 540
ACC TGC GTG CCT GTG GCT GCT GAC AAC GTG ATC GTG CAG AAC AGC ATG CGC ATC AGC TCC 1680
Thr Cys Val Pro Val Ala Ala Asp Asn Val Ile Val Gln Asn Ser Met Arg Ile Ser Ser
545 550 555 560
AGA CCT GGT GCC TGC TAC AGC AGA CCT CTG GTG AGC TTC AGG TAC GAG GAC CAA GGT CCT 1740
Arg Pro Gly Ala Cys Tyr Ser Arg Pro Leu Val Ser Phe Arg Tyr Glu Asp Gln Gly Pro
565 570 575 580
CTG GTG GAA GGT CAG CTG GGT GAG AAC AAC GAG CTG AGG CTG ACT CGC GAC GCT ATC GAG 1800
Leu Val Glu Gly Gln Leu Gly Glu Asn Asn Glu Leu Arg Leu Thr Arg Asp Ala Ile Glu
585 590 595 600
CCT TGC ACC GTC GGT CAC AGA CGC TAC TTC ACC TTC GGT GGC GGT TAC GTG TAC TTC GAG 1860
Pro Cys Thr Val Gly His Arg Arg Tyr Phe Thr Phe Gly Gly Gly Tyr Val Tyr Phe Glu
605 610 615 620
GAG TAC GCT TAC TCT CAC CAG CTG AGC CGC GCT GAC ATC ACT ACC GTG AGC ACC TTC ATC 1920
Glu Ser Ala Tyr Ser His Gln Leu Ser Arg Ala Asp Ile Thr Thr Val Ser Thr Phe Ile
625 630 635 640
GAC CTG AAC ATC ACC ATG CTG GAG GAC CAC GAG TTC GTG CCT CTG GAG GTG TAC ACC CGC 1980
Asp Leu Asn Ile Thr Met Leu Glu Asp His Glu Phe Val Pro Leu Glu Val Tyr Thr Arg
645 650 655 660
CAC GAG ATC AAG GAC AGC GGC CTG CTG GAC TAC ACC GAG GTG CAG CGC CGC AAC CAG CTG 2040
His Glu Ile Lys Asp Ser Gly Leu Leu Asp Tyr Thr Glu Val Gln Arg Arg Asn Gln Leu
665 670 675 680
CAC GAC CTG CGC TTC GCT GAC ATC GAC ACC GTG ATC CAC GCC GAC GCC TAA 2100
His Asp Leu Arg Phe Ala Asp Ile Asp Thr Val Ile His Ala Asp Ala
685 690 695