專利名稱::番紅霉素的生物合成基因簇的制作方法番紅霉素的生物合成基因簇
技術(shù)領(lǐng)域:
-本發(fā)明屬于微生物基因資源和基因工程領(lǐng)域,具體涉及抗腫瘤抗生素番紅霉素的生物合成基因簇的克隆、分析、功能研究及其應(yīng)用。技術(shù)背景四氫異喹啉生物堿是一類結(jié)構(gòu)復(fù)雜而獨特的天然產(chǎn)物家族,以番紅霉素(Saframycins,SFM)為代表的雙四氫異喹啉生物堿是其中的一個分支,它們具有良好的抗菌和抗腫瘤活性[C/ie附.iev.(2002)102,1669-1730]。SFM(如圖1所示)首次分離自鏈霉菌Sfre/^w^ce;y/ave"dw/ae314[/(1977)30,1015-1018],其中番紅霉素A(SFM-A)是活性較高的組份,其抗腫瘤活性的IDso約為5nM(1980)71,790-796]。SFM-A能夠以0.2ng/mL的濃度抑制RNA合成,以2.0pg/mL的濃度抑制DNA合成。它主要是通過DNA的烷基化發(fā)揮抗腫瘤生物活性,其C-21位的氰基有助于形成親電的亞胺離子從而在DNA的小溝對鳥嘌呤G的N-2位進行烷基化修飾,而且對烷基化修飾的位點具有中等程度的序列特異性,如5'-GGG,5'-GGC[Oew.iev.(2002)102,1669-1730]。2004年,Myers從牛腦組織裂解液中篩選到了與SFM-A-DNA復(fù)合物相結(jié)合的靶蛋白甘油醛-3-磷酸脫氫酶GADPHW加/.Jo^.,&/.2004(101),5862-5866]。該蛋白一般以四聚體的形式存在,為甘油醛磷酸脫氫酶,參與葡萄糖的代謝;以單聚體形式存在時,是細(xì)胞周期中S期轉(zhuǎn)錄復(fù)合物的重要組成部分,因此推測GAPDH應(yīng)是跟細(xì)胞周期有關(guān)的蛋白。究竟SFM-A抗腫瘤的作用機制主要是由烷基化修飾引起DNA的斷裂,還是通過與效應(yīng)蛋白相結(jié)合終止細(xì)胞周期,使DNA不能合成,目前還沒有定論,但SFM-A具有抗腫瘤活性,可以對DNA進行垸基化修飾卻是不爭的事實。SFM-A具有獨特的化學(xué)結(jié)構(gòu)(如圖1所示),雙四氫異喹啉骨架結(jié)構(gòu)是該家族化合物的典型特征。1979年,Arai等人首次通過X-my晶體衍射分析確定了SFM-C的結(jié)構(gòu)[re加/ieJraw丄饑(1979)20,2355-2358〗,隨后通過與SFM-C的13CNMR數(shù)據(jù)比較確定了SFM-B的結(jié)構(gòu)。SFM-A,C-21位存在一個氰基,其結(jié)構(gòu)通過與SFM-C的高場^NMR等波譜數(shù)據(jù)對比分析得以確定。其他各成員的結(jié)構(gòu)也陸續(xù)通過NMR,X-ray等方法被確定。之后,隨著該家族其他化合物的發(fā)現(xiàn),這一獨特的化學(xué)結(jié)構(gòu)吸引了許多有機化學(xué)家從事其化學(xué)合成研究[Og.丄e仏(1991)1,75-77;Org.(2000)2,3019-3022;/爿m.CAe附.(1999)121,10828-10829;乂爿附.C/zem.Soc.(2001)123,5114-5115;/爿肌Soc.(2002)124,12969-12971]。前體標(biāo)記實驗表明SFM骨架來源于2個酪氨酸,邊鏈來源于1個丙氨酸和1個甘氨酸,甲基來源于甲硫氨酸[乂所o/.C&m.(1985)260,344-348]。目前,一個來源于加勒比海的一種海鞘(Ecteinascidiaturbinate)中的四氣異喹啉天然產(chǎn)物Et743(見圖l)己進入PhannaMar公司的ni期臨床研究,非常有希望成為一種治療乳腺癌,惡性黑素瘤,肺癌及卵巢癌的新藥。該化合物體外活性LC50<lpM,ID500.2-0.6nM,活性比抗腫瘤藥物紫杉醇高約50倍[C/肚Ow.及"(2002)8,75-85]。作為一種具有良好臨床應(yīng)用前景的抗腫瘤藥物,Et743來源十分有限。據(jù)統(tǒng)計,從l噸海鞘中只能獲得lgEt743,不能滿足臨床科學(xué)研究的需要。有人用培養(yǎng)海鞘細(xì)胞的方法獲得Et743,但是其海洋來源使得研究非常困難。目前臨床研究所用Et743主要來源于熒光假單孢菌Aet^owo"as^ww^ceraA2-2發(fā)酵產(chǎn)生的番紅菌素B(safracin,SAC-B)的氰化衍生物經(jīng)21步化學(xué)半合成得至!j[Oie附.iev.(2002)102,1669-1730;Org.(2000)2,2545-2548]。SFM與Et743骨架結(jié)構(gòu)相似,而且其宿主菌是鏈霉菌,可以通過發(fā)酵的方法來獲得代謝產(chǎn)物。我們以微生物來源的番紅霉素為目標(biāo)分子,從克隆生物合成基因簇出發(fā),采用微生物學(xué)、分子生物學(xué)、生物化學(xué)及有機化學(xué)相結(jié)合的方法研究其生物合成,闡明生物合成機理,通過代謝工程改造生產(chǎn)Et743或產(chǎn)生SFM類似物進而結(jié)合化學(xué)半合成生成Et743或類似物,為新活性"非天然"天然產(chǎn)物的發(fā)現(xiàn)和藥物開發(fā)提供分子和活性多樣性。
發(fā)明內(nèi)容本發(fā)明涉及四氫異喹啉生物堿家族中一種鏈霉菌/avem/"/ae314產(chǎn)生的具有抗腫瘤活性的抗生素一番紅霉素(Saframycins,SFM)的生物合成基因簇的克隆、測序、分析、功能研究及其應(yīng)用。本發(fā)明中整個基因簇共包含30個基因的核苷酸序列或互補序列(序列1),其中3個(5>^,s加B,s/附C)用于編碼非核糖體聚肽合成酶(NRPS),共包含3個模塊,12個功能域,負(fù)責(zé)催化SFM聚肽鏈的生物合成;4個基因(,/附05,s/mM2,功wM3,s/mZ))編碼的蛋白負(fù)責(zé)催化前體分子3-羥基-5-甲基-0-甲基-酪氨酸的生物合成;2個基因G/mC>/,W"C>2)編碼的蛋白負(fù)責(zé)催化肽骨架的環(huán)合;5個基因(^nW,s力"S2,s/wW,s/w5V,s/wS5)編碼的蛋白負(fù)責(zé)S-腺苷化甲硫氨酸(SAM)的合成,為甲基化酶提供輔因子;7個基因0/wM7,s加OZ,s/w(92,s/wO丄s/附尺,s/附CM,s/w06)編碼的甲基化酶和氧化還原酶催化分子骨架的進一步修飾;還包括3個調(diào)節(jié)基因(^n及7,s/附i2,s/wi3)、2個抗性基因0/^G,s/An/Z)及4個功能不確定的基因(s/m五,s加F,s加/,s^nJ)。本發(fā)明還提供了一個編碼TetR家族轉(zhuǎn)錄調(diào)節(jié)因子的核苷酸序列,由序列2中的氨基酸序列組成,命名為sfinRl,其基因的核苷酸序列位于序列1中第786-1373堿基處。本發(fā)明還提供了一個編碼氧化還原酶的核苷酸序列,由序列3中的氨基酸序列組成,命名為sfin01,其基因的核苷酸序列位于序列1中第1370-2284堿基處。本發(fā)明還提供了一個編碼轉(zhuǎn)錄調(diào)節(jié)因子的核苷酸序列,由序列4中的氨基酸序列組成,命名為sfinR2,其基因的核苷酸序列位于序列1中第2487-2996堿基處。本發(fā)明還提供了一個編碼氧化還原環(huán)合酶的核苷酸序列,由序列5中的氨基酸序列組成,命名為sfinCyl,其基因的核苷酸序列位于序列1中第3166-4704堿基處。本發(fā)明還提供了一個編碼移位酶的核苷酸序列,由序列6中的氨基酸序列組成,命名為sfmG,其基因的核苷酸序列位于序列1中第4738-6177堿基處。本發(fā)明還提供了一個編碼紫外修復(fù)蛋白的核苷酸序列,由序列7中的氨基酸序列組成,命名為sfmH,其基因的核苷酸序列位于序列1中第8748-6289堿基處。本發(fā)明還提供了一個編碼單氧化酶的核苷酸序列,由序列8中的氨基酸序列組成,命名為sfin02,其基因的核苷酸序列位于序列l(wèi)中第10384-8819堿基處。本發(fā)明還提供了一個編碼甲基化酶的核苷酸序列,由序列9中的氨基酸序列組成,命名為sfinMl,其基因的核苷酸序列位于序列1中第11059-10460堿基處。本發(fā)明還提供了一個編碼未知功能蛋白的核苷酸序列,由序列IO中的氨基酸序列組成,命名為sfinl,其基因的核苷酸序列位于序列1中第11694-11212堿基處。本發(fā)明還提供了一個編碼吡哆胺-5'-磷酸氧化酶相關(guān)蛋白的核苷酸序列,由序列11中的氨基酸序列組成,命名為sfinJ,其基因的核苷酸序列位于序列1中第12188-11691堿基處。本發(fā)明還提供了一個編碼細(xì)胞色素P-450羥化酶的核苷酸序列,由序列12中的氨基酸序列組成,命名為sfm03,其基因的核苷酸序列位于序列1中第12372-13559堿基處。本發(fā)明還提供了一個編碼鐵氧化還原蛋白的核苷酸序列,由序列13中的氨基酸序列組成,命名為sfinK,其基因的核苷酸序列位于序列1中第13613-13798堿基處。本發(fā)明還提供了一個編碼氧化還原環(huán)合酶的核苷酸序列,由序列14中的氨基酸序列組成,命名為sfmCy2,其基因的核苷酸序列位于序列1中第15388-13874堿基處。本發(fā)明還提供了一個編碼細(xì)胞色素P-450羥化酶的核苷酸序列,由序列15中的氨基酸序列組成,命名為sfm04,其基因的核苷酸序列位于序列1中第16947-15520堿基處。本發(fā)明還提供了一個編碼包括AL,PCP,C,A,PCP非核糖體聚肽合成酶結(jié)構(gòu)域的核苷酸序列,由序列16中的氨基酸序列組成,命名為sfinA,其基因的核苷酸序列位于序列1中第16971-22481堿基處。本發(fā)明還提供了一個編碼包括C,A,PCP非核糖體聚肽合成酶結(jié)構(gòu)域的核苷酸序列,由序列17中的氨基酸序列組成,命名為sfinB,其基因的核苷酸序列位于序列1中第22618-25866堿基處。本發(fā)明還提供了一個編碼包括C,A,PCP,RE非核糖體聚肽合成酶結(jié)構(gòu)域的核苷酸序列,由序列18中的氨基酸序列組成,命名為sfmC,其基因的核苷酸序列位于序列1中第25863-30320堿基處。本發(fā)明還提供了一個編碼5-甲基-()-甲基-酪氨酸-3-羥化酶的核苷酸序列,由序列19中的氨基酸序列組成,命名為sfmD,其基因的核苷酸序列位于序列1中第30317-31414堿基處。本發(fā)明還提供了一個編碼肽酶的核苷酸序列,由序列20中的氨基酸序列組成,命名為sftnE,其基因的核苷酸序列位于序列1中第31411-33780堿基處。本發(fā)明還提供了一個編碼MbtH類似蛋白的核苷酸序列,由序列21中的氨基酸序列組成,命名為sfinF,其基因的核苷酸序列位于序列l(wèi)中第33777-33998堿基處。本發(fā)明還提供了一個編碼甲基化酶的核苷酸序列,由序列22中的氨基酸序列組成,命名為sfinM2,其基因的核苷酸序列位于序列1中第34031-35131堿基處。本發(fā)明還提供了一個編碼轉(zhuǎn)錄調(diào)節(jié)因子的核苷酸序列,由序列23中的氨基酸序列組成,命名為sfmR3,其基因的核苷酸序列位于序列1中第35223-35759堿基處。本發(fā)明還提供了一個編碼預(yù)苯酸脫氫酶的核苷酸序列,由序列24中的氨基酸序列組成,命名為sfin05,其基因的核苷酸序列位于序列1中第37069-35卯0堿基處。本發(fā)明還提供了一個編碼S-腺苷-L-高半胱氨酸水解酶的核苷酸序列,由序列25中的氨基酸序列組成,命名為sfmSl,其基因的核苷酸序列位于序列1中第38550-37096堿基處。本發(fā)明還提供了一個編碼亞甲基四氫葉酸還原酶的核苷酸序列,由序列26中的氨基酸序列組成,命名為sfmS2,其基因的核苷酸序列位于序列1中第39521-38580堿基處。本發(fā)明還提供了一個編碼5-甲基四氫葉酸-高半胱氨酸S-甲基轉(zhuǎn)移酶的核苷酸序列,由序列27中的氨基酸序列組成,命名為sftnS3,其基因的核苷酸序列位于序列1中第42940-39557堿基處。本發(fā)明還提供了一個編碼碳水化合物激酶的核苷酸序列,由序列28中的氨基酸序列組成,命名為sfinS4,其基因的核苷酸序列位于序列1中第44013-43036堿基處。本發(fā)明還提供了一個編碼S-腺苷甲硫氨酸合成酶的核苷酸序列,由序列29中的氨基酸序列組成,命名為sfmS5,其基因的核苷酸序列位于序列1中第45220-44015堿基處。本發(fā)明還提供了一個編碼甲基化酶的核苷酸序列,由序列30中的氨基酸序列組成,命名為sfmM3,其基因的核苷酸序列位于序列1中第46245-45241堿基處。本發(fā)明還提供了一個編碼FAD依賴的氧化酶的核苷酸序列,由序列31中的氨基酸序列組成,命名為sfm06,其基因的核苷酸序列位于序列1中第46553-47683堿基處。序列1的互補序列可根據(jù)DNA堿基互補原則隨時得到。序列1的核苷酸序列或部分核苷酸序列可以通過聚合酶鏈?zhǔn)椒磻?yīng)(PCR)或用合適的限制性內(nèi)切酶酶切相應(yīng)的DNA或使用其他合適的技術(shù)得到。本發(fā)明提供了得到至少包含部分序列1中DNA序列的重組DNA質(zhì)粒的途徑。本發(fā)明還提供了產(chǎn)生SFM生物合成基因被中斷或加倍的微生物體的途徑,至少其中之一的基因包含有序列1中的核苷酸序列。本發(fā)明所提供的核苷酸序列或部分核苷酸序列,可利用聚合酶鏈?zhǔn)椒磻?yīng)(PCR)的方法或包含本發(fā)明序列的DNA作為探針以Southern雜交等方法從其他生物體中得到與SFM生物合成基因相似的基因。包含本發(fā)明所提供的核苷酸序列或至少部分核苷酸序列的克隆DNA可用于從鏈霉菌S^^o/^ces/avem/w/fle314(NRRL11002)基因組文庫中定位更多的文庫質(zhì)粒。這些文庫質(zhì)粒至少包含本發(fā)明中的部分序列,也包含有S/reptof^c^/avem/w/ae314基因組中以前鄰近區(qū)域未克隆的DNA。包含本發(fā)明所提供的核苷酸序列或至少部分核苷酸序列可以被修飾或突變。這些途徑包括插入、置換或缺失,聚合酶鏈?zhǔn)椒磻?yīng),錯誤介導(dǎo)聚合酶鏈?zhǔn)椒磻?yīng),位點特異性突變,不同序列的重新連接,序列的不同部分或與其他來源的同源序列進行定向進化(DNAshuffling),或通過紫外線或化學(xué)試劑誘變等。包含本發(fā)明所提供的核苷酸序列或至少部分核苷酸序列的克隆基因可以通過合適的表達體系在外源宿主中表達以得到相應(yīng)的酶或其他更高的生物活性或產(chǎn)量。這些外源宿主包括鏈霉菌、假單孢菌、大腸桿菌、芽孢桿菌、酵母、植物和動物等。本發(fā)明所提供的氨基酸序列可以用來分離所需要的蛋白并可用于抗體的制備。包含本發(fā)明所提供的氨基酸序列或至少部分序列的多肽可能在去除或替代某些氨基酸之后仍有生物活性甚至有新的生物學(xué)活性,或者提高了產(chǎn)量或優(yōu)化了蛋白動力學(xué)特征或其他致力于得到的性質(zhì)。包含本發(fā)明所提供的核苷酸序列或至少部分核苷酸序列的基因或基因簇可以在異源宿主中表達并通過DNA芯片技術(shù)了解它們在宿主代謝鏈中的功能。包含本發(fā)明所提供的核苷酸序列編碼的蛋白可以催化合成3-羥基-5-甲基-0-甲基酪氨酸和5-甲基-0-甲基酪氨酸及SFM聚肽骨架,進一步催化合成抗生素SFM。包含本發(fā)明所提供的核苷酸序列或至少部分核苷酸序列的基因或基因簇可以通過遺傳重組來構(gòu)建重組質(zhì)粒以獲得新型生物合成途徑,也可以通過插入、置換、缺失或失活進而獲得新型生物合成途徑。包含本發(fā)明所提供的核苷酸序列或至少部分核苷酸序列的克隆基因或DNA片段可以通過中斷SFM生物合成的一個或幾個步驟而得到新的SFM結(jié)構(gòu)類似物或前體。包含DNA片段或基因可以用來提高SFM或其衍生物的產(chǎn)量,本發(fā)明提供了在基因工程微生物中提高產(chǎn)量的途徑。包含本發(fā)明所提供的非核糖體聚肽合成酶可以通過缺失、插入或失活來自于相同或不同的非核糖體聚肽合成酶系統(tǒng)的一個或多個非核糖體聚肽合成酶結(jié)構(gòu)域、模塊或基因而產(chǎn)生新的聚肽化合物。包含本發(fā)明所提供的核苷酸序列或至少部分核苷酸序列的片段或基因可以用來構(gòu)建非核糖體聚肽合成酶庫或非核糖體聚肽合成酶衍生庫或組合庫。本發(fā)明所提供的催化合成腺苷化甲硫氨酸(SAM)的合成基因可用于合成腺苷化甲硫氨酸。本發(fā)明所提供的成3-羥基-5-甲基-0-甲基酪氨酸和5-甲基-0-甲基酪氨酸生物合成基因可用于合成SFM家族獨特的成3-羥基-5-甲基-0-甲基酪氨酸和5-甲基-O-甲基酪氨酸及其他酪氨酸類似物。本發(fā)明所提供的SFM骨架的后修飾基因提供了通過遺傳修飾得到類似物的途徑,所包含的氧化還原反應(yīng)也可有其他應(yīng)用??傊景l(fā)明所提供的包含SFM生物合成相關(guān)的所有基因和蛋白信息可以幫助人們理解SFM家族天然產(chǎn)物的生物合成機制,為進一步遺傳改造提供了材料和知識。本發(fā)明所提供的基因及其蛋白質(zhì)也可以用來尋找和發(fā)現(xiàn)可用于醫(yī)藥、工業(yè)或農(nóng)業(yè)的化合物或基因、蛋白。圖l:番紅霉素(Saframycin,SFM)、Et743及番紅菌素(Safracin,SAC)的化學(xué)結(jié)構(gòu)圖2:SFM生物合成基因NRPS片段的克隆(A)NRPS催化的肽鏈延伸;(B)推測的SFM肽鏈的生物合成;(C)PCR得到NRPS基因片段的電泳泳道1為分子量標(biāo)準(zhǔn),泳道2,3為PCR產(chǎn)物;(D)NRPS基因片段中斷的突變菌株發(fā)酵產(chǎn)物的HPLC分析I是標(biāo)準(zhǔn)品,II野生型發(fā)酵產(chǎn)物,III是與SFM生物合成無關(guān)的NRPS基因中斷的突變體發(fā)酵產(chǎn)物,IV是與SFM生物合成相關(guān)的NRPS基因中斷的突變體發(fā)酵產(chǎn)物,V上述IV的發(fā)酵產(chǎn)物與標(biāo)準(zhǔn)品的共進樣。圖3:SFM生物合成基因RE片斷的克隆(A)NRPS中肽鏈終止的方式;(B)推測的SFM肽鏈的終止;(C)PCR得到RE基因片斷的電泳泳道l為分子量標(biāo)準(zhǔn),泳道2,3為PCR產(chǎn)物。圖4:SFM生物合成基因簇的基因結(jié)構(gòu)和限制性內(nèi)切酶譜(A)3個交疊的粘粒代表了抗生鏈霉菌基因組70kb的DNA區(qū)域,B代表限制性內(nèi)切酶BamHI,實體表示已被DNA測序的部分,Probe-2和Probe-3代表標(biāo)記的探針部分;(B)SFM生物合成基因簇的基因組成。圖5:SFM生物合成基因簇中包含的S-腺苷化甲硫氨酸(SAM)的生物合成途徑圖6:提出的3-羥基-5-甲基-0-甲基酪氨酸(A)和SFM-A(B)的生物合成途徑圖7:基因置換與基因互補突變菌株發(fā)酵產(chǎn)物的高效液相色譜(HPLC)分析(A)野生型發(fā)酵產(chǎn)物;(B)NRPS基因置換的突變體(/1S/附S)發(fā)酵產(chǎn)物;(C)/1S^^基因置換的突變體經(jīng)SfmB基因互補的突變體發(fā)酵產(chǎn)物;(D)4S/wfi基因置換的突變體經(jīng)SfinA—F基因互補的突變體發(fā)酵產(chǎn)物;(E)SFM-A標(biāo)準(zhǔn)品。圖8:四氫異喹啉家族化合物生物合成中非核糖體聚肽合成酶NRPS催化的肽骨架的合成(A)SFM-A的生物合成途徑;(B)SFM-Mxl生物合成途徑;(C)SAC-B的生物合成途徑。圖9:SFM-A生物合成途徑中三個NRPS的底物識別功能分析(A)重組蛋白的分離純化泳道1,SfmA(C-A-PCP),泳道2,SfinB,泳道3,SfinC;(B)三個NRPS重組蛋白的底物測定。圖10:在SAC的產(chǎn)生菌熒光假單孢菌中異源表達SFM-A的生物合成基因(A)SAC和SFM-A生物合成基因簇的對比;(B)突變菌株發(fā)酵產(chǎn)物的HPLC分析I,野生型假單孢菌,II,野生型假單孢菌發(fā)酵后經(jīng)KCN處理,III,含有^M(94異源表達質(zhì)粒的重組假單孢菌,IV,含有^n04異源表達質(zhì)粒的重組假單孢菌發(fā)酵后經(jīng)KCN處理。圖11:SFM-A的生物合成基因在大腸桿菌中的表達及重組蛋白分離純化泳道l,蛋白分子量標(biāo)準(zhǔn),泳道2,純化后的蛋白。(A)Sfm01,(B)Sfm03,(C)Sfin04,(D)SfinM2,(E)SfmM3,(F)SftnD。符號說明圖1Saframycin:番紅霉素;aminated-Saframycin:氨基化番紅霉素;Ecteinascidin:海鞘素;Safracin:番紅菌素;cyano-Safracin:氰基化番紅菌素。圖2Module:模塊;C:縮合酶;A:腺苷化酶;PCP:肽酰載體蛋白;Loading:起始模塊;NRPS:非核糖體肽合成酶;Wavelength:波長。圖3Acyl-S-T:?;蝓ポd體蛋白;TE:硫酯酶;C:縮合酶;red或RE:還原酶。圖4Probe:探針;Skeleton:骨架;Tailoring:后修飾;Reguaition:調(diào)節(jié);Resistance:抗性;SAMRecycle:酰苷化甲硫氨酸再生循環(huán);Unassigned:功能不確定;Beyondboundary:邊界之外。圖5S-adenosylhomocysteine:腺苷化高半胱.氨酸;S-adenosylmethionine:腺苷化甲硫氨酸;L-homocysteine:高半胱氨酸;L-methionine:甲硫氨酸;5-methyl-THF:NS-甲基四氫葉酸;THF:四氫葉酸;5,10-methylene-THF:#,>41()-亞甲基四氫葉酸;Adenosine:腺嘌呤核苷;AMP:腺嘌呤核苷單磷酸;ATP:腺嘌呤核苷三磷酸;PPi:焦磷酸;Pi:磷酸根;MT:甲基化酶。圖6AL:?;o酶A連接酶;PCP:肽酰載體蛋白;C:縮合酶;A:腺苷化酶;SAC-B:番紅菌素B;SFM-A:番紅霉素A。圖7WT:野生型;mLLS524B:NRPS基因置換的突變體;mLLS647:基因置換的突變體經(jīng)SfinB基因互補的突變體;mLLS925:基因置換的突變體經(jīng)S/m^—F基因互補的突變體;SFM-A:番紅霉素A。圖8AL:?;o酶A連接酶;PCP:肽酰載體蛋白;C:縮合酶;A:腺苷化酶;RE:還原酶;SFM-A:番紅霉素A;SFM-Mxl:番紅霉素Mxl;SAC-B:番紅菌素B。圖9Relativeactivity:相對活性;Ala:丙氨酸;D-Ala:D型丙氨酸;Ala-Gly:丙—甘二肽;Pyruvate:丙酮酸;Cys:半胱氨酸;Tyr:酪氨酸;3h5mOmTyr:3-羥基-5-甲基-0-甲基酪氨酸;5mOmTyr:5-甲基-0-甲基酪氨酸;OmTyr:O-甲基酪氨酸;3hTyr:3-羥基-酪氨酸;Phe:苯丙氨酸。圖10Skeleton:骨架;Tailoring:后修飾;Reguaition:調(diào)節(jié);Resistance:抗性;SAMRecycle:酰苷化甲硫氨酸再生循環(huán);Unassigned:功能不確定;Beyondboundary:邊界之外;Wavelength:波長;SAC-B:番紅菌素B;cyanoSAC-B:氰基化番紅菌素B;aminatedSFM-S:氨基化番紅霉素S;SFM-Y3:番紅霉素Y3。具體實施例方式以下結(jié)合圖l-圖11對本發(fā)明進一步詳細(xì)說明。1.克隆SFM-A的生物合成基因片斷SFM-A的前體標(biāo)記實驗證明,二聚四氫異喹啉環(huán)骨架來自兩個酪氨酸,邊鏈來自丙氨酸,甘氨酸,o-,c-,N-甲基來自甲硫氨酸[J.編.CZj謡.(1985)260,344-348]。來源于粘細(xì)菌Myjcococc^xw^M的類似物SFM-Mxl與SFM-A結(jié)構(gòu)極其類似,Pospiech等通過轉(zhuǎn)座子插入失活的方法克隆到SFM-Mxl的18kb的部分生物合成基因,表明其中含有兩個非核糖體聚肽合成酶(NRPS)和一個O-甲基轉(zhuǎn)移酶,證明SFM-Mxl的生物合成是采用NRPS機理[M/cro6/0/.(1995)141,1793-1803;M'croWo/.(1996)142,741-746]。最近來源于熒光假單孢菌尸wwfitowomwy7MomyceraA2-2的化合物番紅菌素b(sac-b)生物合成基因簇的克隆和分析表明SAC-B的生物合成也是采用NRPS機理[Mo/.M'cra6Zo/.(2005)56,144-154]。這些研究為克隆SFM的生物合成基因提供了極為重要的信息。我們推測SFM的生物合成可能是采用NRPS機理,Ala-Gly-Tyr-Tyr依次縮合(圖2A,B),結(jié)合還原成環(huán),氧化還原,脫水,N-,O-,C-甲基化,加氰,脫氨等后修飾完成。因此克隆策略就是從克隆NRPS基因入手。典型NRPS是以模塊形式存在的多功能酶,每一模塊含有一套獨特的,非重復(fù)使用的催化功能域,即催化功能域與產(chǎn)物合成順序是一一對應(yīng)的(圖2A)。我們基于已有的一些化合物的生物合成序列A功能域設(shè)計了NRPS通用引物,希望得到相關(guān)的NRPS基因片斷。采用兼并性引物(5'-TACACGTCCGGCACSACSGGCAARCCNAARGG-3'和5,-AWCGAGKSGCCGCCSRRGSMGAAGAA-3,)PCR克隆編碼NRPS基因部分序列的方法得到約1.2kb的PCR產(chǎn)物(圖2C),克隆入pGEM-TEasy載體,經(jīng)限制性內(nèi)切酶分組,DNA順序分析表明與已知的NRPS基因有很高的同源性。進一步經(jīng)基因中斷發(fā)現(xiàn)有的基因中斷突變菌株仍然產(chǎn)生SFM-A,說明它們與SFM的生物合成無關(guān);而另一組基因失活的突變菌株完全喪失了產(chǎn)生SFM-A的能力(圖2,D),證明已經(jīng)克隆了與SFM-A生物合成相關(guān)的NRPS基因片段。絕大多數(shù)非核糖體聚肽類的合成完畢后都是由硫酯酶(TE)將聚肽鏈從PCP上解離并催化縮合,但有些天然產(chǎn)物的生物合成中沒有NRPS常用的TE而是存在一個末端的還原酶,稱之為RE結(jié)構(gòu)域[/VocAto/.JcadSc/.{/&4(2001)98,11136-11141;(2004)325,35-42]。這類化合物一般不是形成大環(huán)的酰胺或內(nèi)酯,而是解離成醛,進而形成末端的醇,胺,酰胺或亞胺。SFM-A與SFM-Mxl結(jié)構(gòu)非常類似,所以我們猜測SFM-A也采用還原酶RE來解離連接在PCP上的四肽,首先RE在NADPH作用下將四肽解離下來形成醛,再分子內(nèi)反應(yīng)成環(huán)(圖3A,B)。根據(jù)部分化合物的生物合成序列的PCP功能域和RE功能域的保守序列設(shè)計了RE引物,希望能克隆到相關(guān)基因。采用兼并性引物(5'-GACAACTTCTTCGAGCTGGGSGGSSAYTC-3,禾05,-GCGGACCAACTTCTCCGCSRCCCAYTTRCT-3')PCR克隆編碼RE基因部分序列的方法得到約0.8kb的PCR產(chǎn)物(圖3C),克隆入pGEM-TEasy載體,經(jīng)限制性內(nèi)切酶分組,DNA順序分析表明與已知的RE基因有很髙的同源性。2.SFM-A的生物合成基因簇的克隆,順序分析及功能分析將上述NRPS基因片段和RE基因片段用地高辛標(biāo)記為探針Probe-2和Probe-3從5^印to附少ces/ove"由/ae314的基因組文庫約6000個克隆中篩選,分離得到的粘粒涵蓋了染色體約70kb的區(qū)域(圖4A)。DNA順序分析了50kb的染色體區(qū)域,GC含量71.8。/。。生物信息學(xué)分析包含了32個開放讀碼框(圖4B)。其中含有三個NRPS模塊,說明SFM-A的生物合成采取NRPS機理。同時序列分析還顯示該生物合成基因簇包含大量新奇的NRPS后修飾基因及2個抗性基因和3個調(diào)控基因(圖4B),這為我們研究非核糖體聚肽類化合物的后修飾提供了很好的一個平臺。詳細(xì)的分析結(jié)果列于表l。表1SFM-A的生物合成基因簇中各基因及編碼蛋白的功能分析<table>tableseeoriginaldocumentpage17</column></row><table>195TetR(ZP_01431206)57〃1轉(zhuǎn)錄調(diào)節(jié)因子咖o/304SAMR0789(CAJ88498)62〃5氧化還原酶咖"169MitQ(AAD28455)63/78轉(zhuǎn)錄調(diào)節(jié)因子512M欣(AAD28454)67/78氧化還原環(huán)合酶479Met(AAD32747)38/56移位酶咖//819CmrX(CAE17542)57〃2紫外修復(fù)蛋白咖02521SfcJ(AAL33754)43/58單氧化酶199Sfcl(AAL33755)43/56甲基轉(zhuǎn)移酶咖/160MflvDRAFT—079835/46未知功能蛋白(ZP一01194885)咖《/165MflvDRAFT079941/58與吡哆胺5'-磷酸氧化酶相(ZP—01194886)關(guān),結(jié)合黃素單核苷酸咖O395Orf3(AAD28449)50/67細(xì)胞色素P450羥化酶咖《61Fas2(P46374)44/57鐵氧化還原蛋白504MitR(AAD28454)45/60氧化還原環(huán)合酶咖o皁475HctH(AAY42400)25/45細(xì)胞色素P450羥化酶咖」1836Sa氾(AAC44128)38/51AL-PCP-C-A-PCP咖51082SfcB(AAL33757)42/58C-A-PCP咖C1485SfcC(AAL33758)47/60C-A-PCP-RE咖£>365SfcD(AAL33759)38/51未知功能蛋白789Orf(-15)(AAN85499)32/43肽酶咖F73SfcE(AAL33760)60/68MbtH類似蛋白咖M2366SfcF(AAL33761)63/76甲基轉(zhuǎn)移酶178MitQ(AAD28455)51/69轉(zhuǎn)錄調(diào)節(jié)因子咖05389Dbv5(CAD91200)46/58預(yù)苯酸脫氫酶咖57484F1I(CAJ20010)76/84S-腺苷-L-高半胱氨酸水解酶s何2313MetF(AAK98793)78/87亞甲基四氫葉酸還原酶咖《1127Orf49(NP—851471)84/905-甲基四氫葉酸-高半胱氨酸S-甲基轉(zhuǎn)移酶咖W325med-Orf21(BAC79034)63〃2碳水化合物激酶401Samll(CAJ43279)89/94S-腺苷甲硫氨酸合成酶334SfcG(AAL33762)43/62甲基轉(zhuǎn)移酶咖06376M加OII(CAK50777)37/46氧化酶153Cofl(Q9HF97)31/48Cofilin(肌動蛋白解聚因子1)3.SFM-A的生物合成基因簇邊界的確定根據(jù)基因編碼蛋白的功能分析,SFM-A的生物合成基因簇被確定為從基因s/附W到x加0《圖4B),涵蓋染色體46.9kb的區(qū)域,包含30個開放讀碼框。整個SFM-A的生物合成基因簇共30個基因,其中3個(功wj,s/wB,^wC)用于編碼非核糖^t^聚肽合成酶(NRPS),共包含3個模塊,負(fù)責(zé)催化SFM-A聚肽鏈的生物合成;5個基因(s/m57,^iSi,功"W,s/肌W,編碼的蛋白負(fù)責(zé)S-腺苷化甲硫氨酸的生物合成,為甲基化提供輔因子;4個基因(s/附OJ,^MM2,s//wM3,s/wD)編碼的蛋白負(fù)責(zé)催化酪氨酸衍生物3-羥基-5-甲基-0-甲基酪氨酸的生物合成;1個(s/wM7)用于編碼甲基化酵,負(fù)責(zé)催化分子中的N甲基化;2個基因(s/mQ;7,#"C>2)編碼的蛋白催化氧化還原反應(yīng),負(fù)責(zé)肽鏈骨架的氧化成環(huán);6個基因(;y/;w6^,s/w02,s/mds/w《,s/附04,^n06)編碼的蛋白催化氧化還原反應(yīng),負(fù)責(zé)分子骨架合成后的修飾;還包括3個調(diào)節(jié)基因(s/附i/,#jW,s/mW)和2個抗性基因(^wG,s/wi/)及4個功能不十分確定的基因4.S-腺苷化甲硫氨酸(SAM)的生物合成甲基化修飾是天然產(chǎn)物生物合成中經(jīng)常發(fā)生的反應(yīng),絕大多數(shù)情況下是一種甲基化酶催化的,這類甲基化酶利用S-腺苷化甲硫氨酸(S-adenosylmethionine,SAM)作為甲基供體,而SAM的生物合成在多數(shù)情況下來源于宿主的基礎(chǔ)代謝。在番紅霉素A的生物合成基因簇中包含5個基因(s/wW,s/wS2,s/m^,s/附S5)編碼的蛋白形成完整的SAM合成循環(huán)(圖5):當(dāng)甲基化發(fā)生時,SAM作為甲基供體失去甲基被轉(zhuǎn)化為S-腺苷-L-高半胱氨酸(S-adenosylhomocysteine);這時^nW編碼的S-腺苷-L-高半胱氨酸水解酶將其水解產(chǎn)生L-高半胱氨酸(homocysteine)和腺嘌呤核苷A;然后^"W編碼的5-甲基四氫葉酸-高半胱氨酸S-甲基轉(zhuǎn)移酶利用5-甲基四氫葉酸(5-methyl-THF)作為甲基供體將高半胱氨酸轉(zhuǎn)化為L-甲硫氨酸(methionine),而功"5'2編碼的亞甲基四氫葉酸還原酶則催化5,10-亞甲基四氫葉酸(5,10-methylene-THF)轉(zhuǎn)化為5-甲基四氫葉酸;s/wS4編碼的碳水化合物激酶則催化腺嘌呤核苷A轉(zhuǎn)化為腺嘌呤核苷單磷酸AMP,進而完成A到ATP的再生循環(huán);最后s/m&5編碼的S-腺苷甲硫氨酸合成酶催化L-甲硫氨酸和ATP轉(zhuǎn)化為SAM,從而完成SAM的再生循環(huán)。5.3-羥基-5-甲基-0-甲基酪氨酸的生物合成3-羥基-5-甲基-0-甲基酪氨酸的生物合成如圖6A所示編碼的預(yù)苯酸脫氫酶催化基礎(chǔ)代謝產(chǎn)物預(yù)苯酸轉(zhuǎn)化為酪氨酸(該反應(yīng)在基礎(chǔ)代謝酪氨酸的合成中也有),然后在酪氨酸功nM2和s>M3編碼的甲基化酶催化下發(fā)生羥基甲基化和苯環(huán)5位C甲基化修飾,進一步在^A/Z)編碼的酶作用下苯環(huán)3位羥基化,從而完成番紅霉素中3-羥基-5-甲基-0-甲基酪氨酸單元的生物合成。6.SFM-A骨架合成中NRPS的功能SFM-A生物合成基因簇中共有三個典型的NRPS基因和s>C。為了進一步確證它們與SFM-A生物合成的相關(guān)性構(gòu)建了NRPS基因SfinB基因置換的突變體mLLS524B(4S/w5),經(jīng)生物發(fā)酵、HPLC檢測發(fā)現(xiàn)該NRPS基因失活的突變體不再產(chǎn)生SFM-A(圖7B),而將S/zn5基因置于外源啟動子ErmE下進行基因互補可以部分恢復(fù)SFM-A的產(chǎn)生(圖7C),將S^^—尸完整的操縱子置于外源啟動子ErmE下進行基因互補可以將SFM-A的產(chǎn)生基本完全恢復(fù)至野生型水平(圖7D),進一步證實克隆到的生物合成基因簇確與SFM-A的生物合成相關(guān)。進一步分析這三個NRPS基因編碼蛋白的氨基酸序列,可以推測出了這三個NRPS模塊的組織結(jié)構(gòu)(圖8A)。SfmA包括五個功能域(AL-PCPo-d-A,-PCPO,SfinB包含三個功能域(C2-ArPCP2),SfinC包含四個功能域(C3-A3-PCP3-RE)。序列分析表明,位于SfinAN-端的AL結(jié)構(gòu)域與許多細(xì)菌來源的酰基輔酶A連接酶(Acyl-CoAligase)具有高度的同源性,同時AL缺乏腺苷化結(jié)構(gòu)域(A)所必需的一些核心氨基酸基序,因此SfinA的AL結(jié)構(gòu)域可能不具備A結(jié)構(gòu)域應(yīng)有的識別和激活氨基酸底物的功能。推測SfmA的Al、SfmB的A2和SfmC的A3結(jié)構(gòu)域分別識別和激活丙氨酸Ala、甘氨酸Gly和Tyr衍生物(3-羥基-5-甲基-0-甲基酪氨酸)。其中SfmC是一種非線性型NRPS,它重復(fù)催化激活了兩次Tyr衍生物。位于SfinC的C-端RE結(jié)構(gòu)域同與myxochelin生物合成相關(guān)的MxaAC端的R結(jié)構(gòu)域[iVoc.iVa"JcadSd.(20(H)98,11136-11141]同源性很高,后者的催化機理是先將底物催化還原成中間體醛,再將底物還原成醇解離。推測SfmC的RE結(jié)構(gòu)域也釆用了類似的催化機制。將SFM-A生物合成基因簇中負(fù)責(zé)骨架合成的SfmA、SfinB和SfmC與將SFM-Mxl和SAC-B中負(fù)責(zé)骨架合成的NRPS的組織結(jié)構(gòu)進行了對比可見(圖8),催化這三種物質(zhì)骨架生物合成的NRPS在組織構(gòu)上具有高度相似性。進一步分別將SfinA-AL和SafB-AQ,SfinA-A卜Saffi-A,和SacA-A!,SfinB-A2、SafA國A2和SacB-A2,SfinC-A3、SafA-A3和SacC-A3進行序列分析對照,這些NRPS不僅組織構(gòu)架具有高度相似性,而且它們相對應(yīng)的組織結(jié)構(gòu)域也具有高度的序列同源性。聯(lián)系到SFM-A、SFM-Mxl和SAC-B在分子結(jié)構(gòu)上所具有的相似性,可以推斷這三種物質(zhì)的生物合成應(yīng)該是采用相同的機制合成了共同的聚肽骨架。根據(jù)A.Marahid[五A/BOJ(1997)]6,4174-4183;C/zem.(1999)6,493—505]和Townsend[C77e附.5fo/.(2000)7,211-224]的總結(jié),A結(jié)構(gòu)域的底物特異性可以由位于該蛋白和底物結(jié)合區(qū)的8個位置固定的關(guān)鍵氨基酸殘基來預(yù)測。我們將這些A結(jié)構(gòu)域決定底物選擇性的8個核心氨基酸進行對比,結(jié)果如表2所示。表2.NRPSA結(jié)構(gòu)域決定底物選擇性的8個核心氨基酸進行對比<table>tableseeoriginaldocumentpage21</column></row><table>Pospiech等人關(guān)于SafA和SafB各個模塊A結(jié)構(gòu)域激活底物的推測主要是根據(jù)組織結(jié)構(gòu)和催化功能一對應(yīng)的線性邏輯關(guān)系推測的,而不是根據(jù)底物識別的規(guī)則[CAe/n.歷o/.(2000)7,623-642]。然而不符合線性邏輯關(guān)系的生物合成的例子已經(jīng)發(fā)現(xiàn)了很多[C臉w5/oC;e肌(2002)3,490-504],說明僅根據(jù)這一點而不考慮保守氨基酸的底物選擇性這種做法是不可靠的。Saffi-Al與micmcystin的McyA及bleomycin的NRPS-7中推測激活A(yù)la的結(jié)構(gòu)域中決定底物特異性的8個核心氨基酸一致,因此沈奔等人[Ojem.歷o/.(2000)7,623-642]認(rèn)為這三種蛋白決定底物特異性的氨基酸基序可能是一種新型的用來識別Ala的氨基酸序列。另一方面,SafA-1與已經(jīng)發(fā)現(xiàn)的Tal[乂Mo/.歷o/.(1999)286,465-474]和CdaPSII[C/ie肌歷o/.(1999)6,385-400]中激活甘氨酸的結(jié)構(gòu)域同源性很高,說明它極有可能激活甘氨酸而非作者所推測的酪氨酸衍生物。A.Velasco等人推測[Mo/.M!croWo/.(2005)56,144-154]SacA-A直接激活一個Ala-Gly二肽,這個二肽由一個非?;顫姷暮颂请幕D(zhuǎn)移酶獲得或來自蛋白質(zhì)水解。這一假設(shè)缺乏足夠證據(jù),且SacB-A和SacC-A的顯著差異也不支持它們同樣激活L-Tyr衍生物的假設(shè)。綜上所述,我們認(rèn)為,A.Velasco和Pospiech等人的推測存在著不合理之處,而我們的推理則能比較圓滿的解釋這三種類似物共同的聚肽骨架的生物合成途徑。NRPS模塊中A結(jié)構(gòu)域?qū)τ诘孜锏奶禺愋允峭茰y其催化機制的關(guān)鍵。SFM-A的多肽骨架是由SfinA、SfinB和SfinC三個NRPS模塊共同催化合成的。起始合成時,SfmA的Al結(jié)構(gòu)域首先識別并激活一個丙氨酸,活化的丙氨酸被PCP!的巰基捕獲形成氨酰-S-PCPl。下游的SfinB-A2結(jié)構(gòu)域識別激活一個甘氨酸。丙氨酸與PCPl相耦聯(lián)的肽鏈上的a-羰基碳原子受到下游模塊上的Gly-S-PCP2通過氨基基團孤對電子的親核進攻形成一個二肽。SfmC-A3識別并激活3-羥基-5-甲基-0-甲基酪氨酸,整個SfmC重復(fù)使用了兩次,催化了兩次3-羥基-5-甲基-0-甲基酪氨酸的延伸。然后四肽-S-PCP3被RE結(jié)構(gòu)域還原成醛并解離下來,環(huán)合后形成了SFM-A的多肽骨架(圖6B)。為了進一步驗證上述模型,需要對SfmA,SfmB,SfmC三個NRPS蛋白底物的選擇性進行實驗驗證。編碼SfinA的C-A-PCP三功能域和SfrnB,SfmC兩個NRPS蛋白的基因被克隆入表達載體pET28a中,在大腸桿菌BL21(DE3)中經(jīng)IPTG誘導(dǎo)得到表達。重組蛋白SfinA(C-A-PCP)和SfinB、SfmC三個NRPS重組蛋白經(jīng)Ni-NTA親和層析和FPLC得到分離純化(圖9A),采用ATP-32PPi同位素交換的方法進行測活。結(jié)果顯示SfmA(C-A-PCP)特異地識別丙氨酸,SfinB識別甘氨酸,而SfmC特異性識別L-酪氨酸衍生物3-羥基-5-甲基-0-甲基酪氨酸(圖9B)。這與我們的推測完全吻合。7.SFM-A完整的生物合成途徑及后修飾得到系列番紅霉素分子SFM-A分子骨架聚肽鏈部分是由非核糖體聚肽合成酶NRPS催化合成的(圖6B)。在上述NRPS的催化下1分子丙氨酸、l分子甘氨酸與2分子3-輕基-5-甲基-O-甲基酪氨酸縮合成四肽,然后四肽-S-PCP3被RE結(jié)構(gòu)域還原成醛并解離下來,環(huán)合后形成了SFM-A的多肽骨架。然后SfmCyl和SfinCy2的作用下進一步環(huán)合形成核心骨架結(jié)構(gòu),進而在SfinMl催化下發(fā)生N—甲基化、Sfin02催化下發(fā)生酚氧化成醌生成SAC-B,然后在Sfin04催化下發(fā)生酚氧化成醌,進而發(fā)生轉(zhuǎn)氨、氰化等生成SFM-A。而Sfin03和SfinOl可能催化14位進一步發(fā)生氧化形成SFM-F、D等系列類似物。8.SFM-A生物合成基因簇的應(yīng)用一生物合成基因簇中的基因可以在異源宿主熒光假單孢菌/^ew必;no"ayy/wow^raA2-2中進行異源表達可以催化SAC-B的生物轉(zhuǎn)化在克隆、分析了完整SFM生物合成基因簇,研究了各基因編碼蛋白可能的功能的基礎(chǔ)上,本發(fā)明對SFM-A的生物合成機制進行了初步探討,這有助于理解SFM-A和其他四氫異喹啉家族化合物的生物合成機制,同時也為進一步通過對生物合成基因簇進行遺傳修飾獲得SFM-A結(jié)構(gòu)類似物提供了可能。熒光假單孢菌Aewcfo/Momw_/7worwcewA2-2發(fā)酵產(chǎn)生SAC-B類化合物(圖1),該化合物的結(jié)構(gòu)與SFM結(jié)構(gòu)類似,但活性卻低很多。同時,在我們推測的SFM-A生物合成途徑中,SAC-B是中間產(chǎn)物(圖6B),這提示我們有可能利用SFM-A的生物合成基因在簡單、低等的熒光假單孢菌中部分重組相對復(fù)雜的SFM-A的生物合成途徑,從而將SAC-B進一步轉(zhuǎn)化為其他SFM-A結(jié)構(gòu)類似物。當(dāng)s/附6^基因以pVLT33[Gene(1993)123,17-24]為載體構(gòu)建的表達質(zhì)粒采用接合轉(zhuǎn)移的方法導(dǎo)入SAC-B產(chǎn)生的宿主菌i^ewflowowoyywo/^ceraA2-2中得到相應(yīng)的突變菌株,經(jīng)發(fā)酵、HPLC和LC-MS分析(圖10BIH)顯示s/m04導(dǎo)入的突變株可以將SAC-B轉(zhuǎn)化為新化合物一氨基化的SFM-S(aminatedSFM-S)。若在發(fā)酵產(chǎn)物中加入氰化鉀相應(yīng)的化合物均轉(zhuǎn)化為相應(yīng)含氰基單元的化合物SFM-Y3(圖10BIV)。S加04編碼細(xì)胞色素P450氧化酶,在SFM-A生物合成基因簇中有兩個基因編碼細(xì)胞色素P450氧化酶(sfin03和sfin04),Sfm04在異源宿主熒光假單孢菌尸ww^wo"twy7柳mycmyA2-2中成功地將SAC-B轉(zhuǎn)化為aminatedSFM-S同時也進一步顯示該基因編碼的酶在SFM-A生物合成途徑中的功能即催化苯酚環(huán)氧化為醌的結(jié)構(gòu)。9.SFM-A生物合成基因簇的應(yīng)用一生物合成基因簇中的基因可以在異源宿主大腸桿菌中進行異源表達并得到重組蛋白氧化還原酶催化機制的研究對于闡明SFM-A的整條生物合成途徑具有重要意義。在SFM-A生物合成基因簇中有8個基因編碼的蛋白(s加0—s加06及s加Q;/、5>C_y2)與已知的氧化還原酶同源性很高,推測它們屬于氧化還原酶系。其中7個氧化還原酶(s加0/i>6>5及s/wC_W、^wCy2)在大腸桿菌BL21(DE3)中以pET28a為載體可以很好地進行蛋白表達。另外N-甲基化酶基因和負(fù)責(zé)前體3-羥基-5-甲基-0-甲基酪氨酸生物合成的基因^mil"、功"M3及s/M)均可在大腸桿菌中獲得很好的表達。其中的蛋白SftnOl、Sfm03、Sfm04、SfinM2、SftnM3及SfinD可以被分離純化獲得可溶性重組蛋白(圖11)以進行進一步功能研究。以下進一步提供實施實例,這些實施實例有助于理解本發(fā)明,僅用作說明而不限制本發(fā)明的應(yīng)用范圍。實施例lSFM-A產(chǎn)生菌鏈霉菌S./flvem/M/ae314總DNA的提取將100nLlxl0851./avem/wtoe314孢子懸液接種到3mLISP-2(Yeastextract0.4%,Maltextract1.0%,Glucose0.4%,pH7.2)液體培養(yǎng)基中,30°C,230rpm培養(yǎng)約24hr后達到對數(shù)生長期后期,取2mL接種到50mLISP-2中(含25mM氯化鎂),30'C,250rpm培養(yǎng)約23hr后達到穩(wěn)定生長期前期,呈乳黃色渾濁,將菌液4。C,3500rpm,離心15min收集菌絲,用裂解液洗漆,收集淡乳黃色菌絲0.5mL。向lmL菌絲中加入10mL裂解液(含溶菌酶5mg/mL)共四管,渦旋至均一,37。C水浴15mim。加入0.1mL蛋白酶K(10mg/mL,用裂解液新鮮配制),1mL10%SDS,混勻后迅速放入70"水浴15mim,呈澄清。置冰上冷卻,加入2.5mL5MKAc,冰上冷卻15min。加入10mL飽和盼,混勻,10mL氯仿,混勻,12000rpm,4t離心20min。用破口的槍頭將水相吸出置于新的離心管,加等量的CHCl3-異戊醇(24:1)抽提,12000rpm,4'C離心10min。用破口的槍頭將水相吸出置于新的離心管,加2倍的無水乙醇,混勻,有大團的DNA出現(xiàn)。將其鉤出置于新的離心管,加5mL70Q/。乙醇洗滌,將液體傾出,用槍吸凈,加5mLTE溶解,加RNaseA使終濃度為5()pg/mL,37'C溫育0.5小時。依次用等體積的飽和酚抽提兩次,CHCl3-異戊醇抽提兩次,向水相中加入0.1體積的3MNaAc,2體積的無水乙醇,輕輕的混合充分,有絮狀DNA出現(xiàn)。將四管DNA合并到兩管(每管中有l(wèi)mL70%乙醇用于洗滌),將液體吸出,再加lmL無水乙醇洗滌,吸出乙醇,超凈臺中吹干,溶于適當(dāng)體積的TE(pH8.0)中。實施例2SFM-A產(chǎn)生菌鏈霉菌S./flvem^te314遺傳轉(zhuǎn)移系統(tǒng)的建立培養(yǎng)含有適當(dāng)質(zhì)粒的S17-l至OD6oo0.3-0.4,20mLLB培養(yǎng)液中的細(xì)菌細(xì)胞離心收集,用等體積的LB洗兩次,重懸于2mLLB中,作為大腸桿菌供體細(xì)胞。取適量凍存于一80'C的S./ave"dw/ae314的20%甘油孢子懸液500nL,用等體積的TES緩沖液(50mMTESNa^pH8.0)洗兩次,重懸于等體積的TES緩沖液,50"C熱激10min使孢子萌發(fā)。再加等體積的TSB培養(yǎng)基,37。C溫育2一5hr。離心重懸于0.5—lmLLB中作為鏈霉菌受體細(xì)胞。將不同濃度的受體細(xì)胞100pL與等體積的供體細(xì)胞混合直接涂布在含有10mMMgCl2的平板上,30'C溫浴20hr后,采用無菌水輕輕洗滌平板表面以洗去大部分大腸桿菌,在每一平板的表面覆蓋3mL含萘啶酮酸(終濃度為50ng4iL)和相應(yīng)抗生素的LB軟瓊脂或lmL無菌水。3(TC培養(yǎng)5天以上挑取接合子。由于S./flvem/"/^314對硫鏈絲菌肽,阿泊拉霉素和紅霉素都敏感,最后確定遺傳轉(zhuǎn)移所用抗生素的濃度硫鏈絲菌肽25pg/mL,阿泊拉霉素50pg/mL,紅霉素50pg/mL。IWL-4(可溶性淀粉1.0%,K2HP040.1%,MgS040.1%,NaCl0.1%,(NH4)2S040.2%,CaC030.2%,F(xiàn)eS040.0001%,MnCl2.6H200.0001%,ZnS040.0001%,酵母提取物0.05%,胰化蛋白胨0.1%,瓊脂2.0%,pH7.2)培養(yǎng)基最好,接合轉(zhuǎn)移效率最高。在IWL-4培養(yǎng)基上,無論是在鏈霉菌中可以自主復(fù)制的質(zhì)粒pKCl139,pHZ1358,還是位點特異性整合的質(zhì)粒pSET152,接合轉(zhuǎn)移的效率都較高。轉(zhuǎn)移效率約為IO"4。實施例3SFM-A產(chǎn)生菌鏈霉菌S./ovew/"/e314基因組文庫的構(gòu)建首先通過一系列的稀釋實驗來確定3AI的用量,在此基礎(chǔ)上大量酶切得到的DNA片段略大于40kb,脫磷。SuperCosl先用AaI從兩個cos序列中間切開并脫磷,然后再從多克隆位點處用SawHI切開,獲得兩個臂,與制備的40kb的DNA片段連接過夜。從-8(TC冰箱中取出包裝混合物置于干冰桶中,將包裝混合物在指間迅速融化,在剛剛開始融化時加入4nL的連接產(chǎn)物,用槍混勻,22'C水浴2小時。加入500^LSM緩沖液[(L"):5.8gNaCl,2.0gMgSO4,50.0mL1MTris.HCl(pH=7.5),5.0mL2%()明膠]和50氯仿,輕輕混勻,離心機離心4秒,上下界面出現(xiàn)沉淀,轉(zhuǎn)移上清至新管中,于4。C保存。將凍存于-80。C的菌株£.C0//VCS257涂布在LB培養(yǎng)基上復(fù)蘇。挑取單菌落接種于50mLLB培養(yǎng)基中(蛋白胨1.0%,酵母提取物0.5%,NaCl1.0%,pH=7.0;添加0.2%麥芽糖和10mMMgS04),37'C,220rpm振蕩6.5小時,測得其OD6oo=0.83時,取用100nL的包裝上清和100OD600=0.83的菌液,輕彈管壁使其混勻,平行操作5份。22。C水浴30分鐘,加入800的LB培養(yǎng)基,37。C水浴1.5小時,每15分鐘倒轉(zhuǎn)一次。每一份涂布一塊大板,LB瓊脂(Amp:10(^g/mL)(15mmxi5mm),每一份用200pL的LB培養(yǎng)基涮洗,37。C培養(yǎng)18個小時。測定噬菌斑形成單位(pfU)以估算文庫的效價。然后用LB培養(yǎng)基刮洗,加入氨芐西林和甘油,使氨芐西林的終濃度為50嗎/mL,甘油的終濃度為18%。分裝成lmLx12,0.5mLx48,025mLx24。保存于-80。C。共約10000個克隆,效價為4000cfii/昭DNA。對于鏈霉菌而言,其染色體DNA的大小約為8Mb,如果插入片段為20kb的文庫效價是20005000cfti,就足以代表它的整個基因組。我們文庫的效價達到4000cfu/嗎DNA,具有良好的質(zhì)量,滿足文庫篩選的需要。實施例4SFM-A產(chǎn)生菌鏈霉菌S./flve"^/ae314的發(fā)酵、產(chǎn)物分離純化與鑒定取lx108S./ove"甴/ae314孢子100pL涂布在YSA(酵母提取物0.1%,可溶性淀粉0.5%,瓊脂1.5%,自來水,pH7.5)平板上,30。C培養(yǎng)七天后,用打孔器打一塊長滿孢子的瓊脂塊接種到50mL(500mL錐形瓶)發(fā)酵培養(yǎng)基(葡萄糖0.1%,可溶性淀粉1.0%,NaCl0.5%,K2HP040.1%,Caseinacidhydrolysate0.5%,Meatextract0.5%,自來水,pH7.0)中,27°C,250rpm培養(yǎng)。約20小時后,產(chǎn)生深藍色色素,培養(yǎng)基呈渾濁,有很多泡沫。6小時后停止培養(yǎng),離心收集上清,調(diào)pH-6.8,加KCN使終濃度為lmM,35"C震蕩30min。將發(fā)酵液用乙酸乙酯萃取,有機相用去離子水洗,有機相濃縮抽干溶于100pL乙酸乙酯。取20pL經(jīng)HPLC檢測。HPLC分析UV-270證;柱子EC150/4.6NUCLEOSIL100-5C18Ser.No.2085011Batch21302092;流動相條件V=lmL/min;A=H2O(0.05%TFA)B=CH3CN(0.05%TFA)時間/min0525272930B/%101085851010實施例5PCR克隆SFM-A生物合成基因PCR體系包含DMSO(8%,v/v),MgCl2(25mM),dNTP(2.5mM),兼并性引物(40mM),TaqDNA聚合酶(2.5u)及適量模板S./avem/M/ae314總DNA。首先95。C,3min,l輪;然后94。C,lmin,68。C,lmin,72°C,2min,5輪;94°C,lmin,65。C,lmin,72°C,2min,30輪;最后72°C,10min,1輪。PCR結(jié)束后,1%瓊脂糖電泳檢査結(jié)果。低熔點膠回收預(yù)期大小的DNA片段,與pGEMTEasyvector連接,轉(zhuǎn)化大腸桿菌DH5a感受態(tài)細(xì)胞,涂布在含有氨芐青霉素,IPTG(異丙基硫代—P—D-半乳糖苷)和X-gal(5—溴—4一氯一3—吲哚—卩—D-半乳糖苷)的LB平板上進行藍白斑篩選。挑取白色菌落過夜培養(yǎng),抽提質(zhì)粒,五coRl酶切鑒定是否含有預(yù)期大小的DNA插入片段。插入有預(yù)期大小DNA片段的質(zhì)粒測序。實施例6核酸分子雜交l)DIGDNA標(biāo)記將待標(biāo)記的DNA用無菌水稀釋至總體積15nL,沸水浴中加熱變性10分鐘,立即置于冰鹽浴中冷卻。接著加入2pL引物混合物,2nLdNTP混合物,lnL酶,混合均勻后,37'C水浴約16小時。加入0.8hL0.8MEDTA(pH8.0)以終止反應(yīng),加入2.5pL4MLiCl混合均勻,再加入75pL預(yù)冷的無水乙醇沉淀標(biāo)記后的DNA,置于—80'C沉降40分鐘。4'C,12000rpm離心20分鐘收集DNA,用預(yù)冷的70。/。乙醇洗滌DNA沉淀,真空干燥后重新溶于50pLTE((pH8.0)中。2)DIGDNA探針標(biāo)記后的質(zhì)量檢測稀釋標(biāo)記的DNA探針,至以下六個梯度,1、10"、10-2、l(T3、IO"4、10-5。稀釋標(biāo)記的對照DNA至濃度分別為以下濃度lpg/mL,100ng/mL,10ng/mL,lng/mL,0.1ng/mL,0.01ng/mL。分別取lpL上述梯度的DNA樣品點在雜交用的尼龍膜上,根據(jù)7)所述步驟進行顯色反應(yīng),對比標(biāo)記的DNA探針和DIG標(biāo)記的對照DNA的顯色強度以決定標(biāo)記的DNA探針濃度。3)菌落雜交(文庫篩選)的膜轉(zhuǎn)移將保存于—80'C的基因文庫稍融,取50pL,用450jiLLB稀釋得到10"的稀釋倍數(shù),倍比稀釋得到10—2,10—3,IO"4,10—5,10—6。300pL涂平板(15cmxl5cm,平板為LB/50pg/mL卡那霉素)。選取合適的比例,使每塊平板約1200—1500個克隆。照選定的比例均勻涂布四塊平板,37'C培養(yǎng)過夜。根據(jù)平板的大小剪取尼龍膜,小心地覆蓋于平板表面不要產(chǎn)生氣泡,做好位置標(biāo)記,1分鐘后取下尼龍膜置于干燥濾紙上,干燥10分鐘直至菌落結(jié)合在尼龍膜上。原始的平板置于培養(yǎng)箱中4—5hr,使克隆重新生長作為原平板。將尼龍膜置于變性液(0.25MNaOH,1.5MNaCl)飽和的濾紙上15分鐘(不要浸過膜),轉(zhuǎn)移至中和液(1.0MTris.HCl,1.5MNaCl,pH7.5)飽和的濾紙上5分鐘。轉(zhuǎn)移至2xSSC(20xSSC儲備液(L"):NaCl,175.3g,檸檬酸鈉,88.2g,pH=7.0)飽和的濾紙上自然風(fēng)干。取下尼龍膜置于烘箱中,12(TC固定45分鐘。常溫下于3xSSC/0.P/。SDS溶液中振蕩洗滌3小時,以除去細(xì)胞碎片。4)Southern雜交的膜轉(zhuǎn)移:DNA樣品在適當(dāng)濃度的瓊脂糖凝膠上電泳至適當(dāng)距離,做好標(biāo)記。浸泡于400mL0.25MHCl中脫嘌呤20分鐘,使溴酚藍變黃,用去離子水洗數(shù)次。室溫下浸入堿性緩沖液(0.5MNaOH,1MNaCl)15分鐘并輕輕振蕩。換液一次繼續(xù)浸泡凝膠20分鐘,并輕輕振蕩,去離子水洗三次。取一張每邊都比凝膠大lmm的尼龍膜,用去離子水完全浸濕,做好標(biāo)記。采用向上毛細(xì)管轉(zhuǎn)移方法,用10xSSC轉(zhuǎn)移緩沖液轉(zhuǎn)移8—24hr。用2xSSC略微洗膜,120'C烘烤30分鐘。5)預(yù)雜交和雜交預(yù)熱雜交液(20mL/100cm2)至雜交溫度68'C,放入雜交尼龍膜,輕輕振蕩并保溫30分鐘。將DIG標(biāo)記的DNA探針在沸水浴中變性5分鐘,立即置于冰鹽浴中冷卻。冷卻后,將DNA探針與合適體積的DIG雜交液(2.5mL/100cm"混合均勻。去除預(yù)雜交液并立即把DNA探針/DIG雜交液加入,輕輕振蕩保持雜交溫度64'C或68'C約16小時。6)雜交后嚴(yán)緊洗脫室溫下用2xSSC/0.1%SDS漂洗兩次,每次5分鐘。68。C,用0.1xSSC/0.1%SDS振蕩漂洗兩次,每次15分鐘。7)顯色反應(yīng)和檢測嚴(yán)緊洗脫后的尼龍膜在洗滌緩沖液(0.1M馬來酸,0.15MNaCl,pH=7.5,0.3%(v/v)Tween20)中平衡1-5分鐘,接著在封閉緩沖液(封閉試劑以10%的濃度溶于0.1M馬來酸,0.15MNaCl,pH=7.5)中封閉30分鐘,然后在抗體中浸泡30分鐘。用洗滌緩沖液漂洗尼龍膜兩次后,用檢測緩沖液(0.1MTris-HCl,0.1MNaCl,pH-9.5)中平衡2-5分鐘,最后將尼龍膜置于10mL新配制的顯色溶液[NBT(nitrobluetetrazoliumchloride)溶于70%DMF,濃度為70mg/mL,BCIP(5-bromo-4-chloro-3國indolyl-phosphate)溶于水,濃度為50mg/mL。用時10mL顯色溶液中加45nLNBT,35pLBCIP]中,置于黑暗中顯色。顯色合適后用去離子水漂洗以終止反應(yīng)。實施例7基因中斷突變菌株的獲得將獲得的轉(zhuǎn)化子接種到液體培養(yǎng)基ISP-2(Am25pg/ml)中,3(TC振蕩約28hr。取出200^1涂布在ISP-2(Am50ng/ml)平板,3(TC培養(yǎng)6-8天,收孢子,保存于-80°C;取出1(Hil在ISP-2(Am50^ig/ml)平板畫線,37。C培養(yǎng),放置2—3天。挑37。C整合生長的單菌落,接種至液體培養(yǎng)基ISP-2(Am25pg/ml),37'C,振蕩2—3天。取出涂布在ISP-2平板(Am50ng/ml),37'C整合2-3天,收孢子,保存于-80。C。取出200pL涂布在ISP-2平板(Am5(Hig/ml),30。C放置2—5天,用于鑒定其生物活性。基因中斷或基因置換所用載體為pOJ260或pKC1139,基因置換用紅霉素抗性基因替代目標(biāo)基因中間的DNA片段。構(gòu)建好的質(zhì)粒通過實施例2所述屬間接合轉(zhuǎn)移的方式導(dǎo)入S./flvewdw/fle314中得到雙交換突變體,所得突變體通過Southern雜交在基因型上加以證明。實施例8基因在假單孢菌中表達及發(fā)酵產(chǎn)物分析首先將pET28a中的目標(biāo)蛋白基因以^al/歷"din切出,克隆入載體質(zhì)粒pVLT33相應(yīng)的位點,轉(zhuǎn)化£.CO//CC118(Kana50pg/mL),酶切鑒定。正確的質(zhì)粒以結(jié)合轉(zhuǎn)移的方式導(dǎo)入假單孢菌/'set^o附o"wA2-2中。接合轉(zhuǎn)移方法如下將構(gòu)建的pVLT33質(zhì)粒的衍生質(zhì)粒轉(zhuǎn)化到S17-l中,于LB中(Kana50嗎/mL)37'C培養(yǎng)。取Py7wore"wwA2-2的菌株在LB的平板上劃線培養(yǎng),于3(TC培養(yǎng)。S17-l(含有pVLT33的衍生質(zhì)粒)的單菌落,于LB(Kana50ng/mL)中37。C培養(yǎng)過夜。挑取Py"om^似A2-2的單菌落于LB中30。C培養(yǎng)過夜。取上述細(xì)菌的過夜培養(yǎng)物,以1X的量轉(zhuǎn)接到相應(yīng)的LB的培養(yǎng)基中,繼續(xù)在相應(yīng)的溫度下培養(yǎng)至OD60()至0.8—1.0左右。分別取上述兩種菌液lmL和4mL按1:4的比例混合均勻,用0.22nm的膜過濾,取下濾膜,有菌的一面朝上,放置在LB的平板上(無抗生素),37"培養(yǎng)8hr。將濾膜取下,用3mL10mM的MgS04洗膜。分別取lul、O.lul稀釋涂在LB的平板上(Kana50pg/mL同時加入Ap100嗎/mL或Tc12.5pg/mL以抑制E.coli的生長),于3(TC培養(yǎng)過夜。挑3~5個單菌落到LB培養(yǎng)基(Kana50pg/mL同時加入Ap100pg/mL或Tc12.5ng/mL)于27—3(TC培養(yǎng)過夜,保存菌種。進一步培養(yǎng)和表達同大腸桿菌,采用LB培養(yǎng)基(Kana50j^g/mL),培養(yǎng)溫度是25°C—30°C;在OD600在0.6—1.0加入IPTG誘導(dǎo),誘導(dǎo)后可適當(dāng)延長培養(yǎng)的時間(24hr)。尸^w^wwmwy7wowwe似A2-2及突變體的發(fā)酵接種0.1%的菌種到50mLYMP3(葡萄糖1%,牛肉提取物0.25%,胰化蛋白胨0.5%,CaCO30.8%,pH6.5)培養(yǎng)基(250mL搖瓶)中,27。C,250rpm培養(yǎng)30hr,轉(zhuǎn)接2^到50mL(500mL搖瓶)M-16培養(yǎng)基(D-mannitol15.2%,driedyeast3.5%,(NH4)2S041.4%,F(xiàn)eC130.018%,CaC032.6%,pH6.5)中,27°C,220rpm培養(yǎng)四天。發(fā)酵液離心,上清調(diào)pH-9,用乙酸乙酯萃取濃縮?;蛳日{(diào)pH-3,用乙酸乙酯萃取除去雜質(zhì),再將水相調(diào)pH-9,用乙酸乙酯萃取濃縮,濃縮液中含有SAC-B。如果將發(fā)酵液調(diào)pH-6.8,加KCN至終濃度為lmM,35度,250rpm反應(yīng)半小時,再用乙酸乙酯萃取濃縮,濃縮液中含有cyano-SAC-B。突變體的發(fā)酵與此類似,在轉(zhuǎn)接40小時后,加IPTG至終濃度為0.2mM,24°C,220rpm培養(yǎng)至96小時(或72小時)。發(fā)酵液離心上清調(diào)pH=9,用乙酸乙酯萃取濃縮?;蛳葘l^3,用乙酸乙酯萃取除去雜質(zhì),再將水相調(diào)pH-9,用乙酸乙酯萃取濃縮,濃縮液中含有化合物aminatedSFM-S。如果將發(fā)酵液調(diào)pH=6.8,加KCN至終濃度為lmM,35°C,250rpm反應(yīng)半小時。用乙酸乙酯萃取濃縮,濃縮液中含有SFM-Y3。實施例9基因在大腸桿菌中表達及重組蛋白分離純化主要以PCR方法獲得待表達的目標(biāo)基因(表NRPS基因時PCR獲得編碼N端和C端的基因,中間部分通過粘粒亞克隆獲得),以£CORI///z'"dIII克隆入普通載體如pSP72中,經(jīng)DNA順序分析確定后,插入片段再經(jīng)A^el/Z^"din切出,連入表達載體pET28a的相應(yīng)酶切位點,獲得了表達質(zhì)粒。轉(zhuǎn)化大腸桿菌BL21(DE3),挑取單菌落接入3mLLB(Kana,50嗎/mL),37"C過夜培養(yǎng)。取30nL過夜培養(yǎng)物接入3mLLB(Kana50pg/mL)培養(yǎng)液,37'C培養(yǎng)兩小時,加入IPTG至終濃度1mM/L,37。C繼續(xù)培養(yǎng)四小時后離心收集1mL菌體,加入5(HiL蛋白電泳緩沖液,渦旋后在沸水浴中煮沸3min,取出樣品,12000rpm離心5min,取10nL上清上樣,用SDS-PAGE蛋白電泳檢查表達情況。大量表達將表達良好的2ml過夜培養(yǎng)物接種于500mlLB培養(yǎng)基(卡那霉素50pg/ml),37°C培養(yǎng)2.5小時(A6Q。nm約0.5),轉(zhuǎn)移至25°C繼續(xù)培養(yǎng)0.5小時,加入IPTG至終濃度0.1mM,25°C表達6小時或15°C表達24小時以上。冰中冷卻,4°C5000rpm離心5分鐘收集菌體(取1ml菌液收集菌體作為SDS-PAGE樣品分析全菌蛋白),STE緩沖液UOmMTris-HCl,lOmMNaCl,lmMEDTA,pH8.0)洗滌1次,破菌緩沖液(50mMNaH2PO4,300mMNaCl,10mM咪唑,以NaOH調(diào)pH為8.0)洗滌1次,加入10ml(2ml/g濕菌體)破菌緩沖液(加入溶菌酵至終濃度lmg/ml)重懸均勻,4°C(或冰中)放置30分鐘。-80°C凍存30分鐘后室溫融化,冰浴狀態(tài)下超聲破碎10分鐘(20()~300W,超聲10秒間歇50秒)。上清液轉(zhuǎn)移至50ml尖底離心管。加入l~2mlNi-NTA親和層析柱填料(視可溶性蛋白表達量及大小而定),冰浴狀態(tài)下?lián)u蕩1~2小時(若蛋白大于150KD時間可延長,200KD以上可過夜)。4°C2,000rpm離心5分鐘,傾去上清,用10~20ml淋洗緩沖液(50mMNaH2P04,300mMNaCl,25mM咪唑,以NaOH調(diào)pH為8.0)輕輕旋起柱填料,冰浴狀態(tài)下?lián)u蕩10分鐘,4。C2,000rpm離心5分鐘,傾去上清,加入10ml淋洗緩沖液旋起柱填料,裝柱,以1020ml淋洗緩沖液淋洗2~3次。以0.5mlx6次洗脫緩沖液(50mMNaH2P04,300mMNaCl,250mM咪哇,以NaOH調(diào)pH為8.0)對目的蛋白進行洗脫。分析SDS-PAGE結(jié)果,若目的蛋白純度達到卯%以上,合并目的蛋白,4。C對透析液(50mMTris-HCl,25mMNaCl,5mMp-巰基乙醇,10%甘油,0.02%NaN3)透析過夜后定量,分裝保存于-80°(:待測活;若純度較差則合并目的蛋白,根據(jù)目的蛋白的性質(zhì)結(jié)合其他方法(如凝膠過濾)進一步分離純化。以下根據(jù)本
發(fā)明內(nèi)容提供基因和蛋白序列氨基酸/核苷酸序列表SEQUENCELISTING<110>中國科學(xué)院上海有機化學(xué)研究所<120>番紅霉素(Saframycin)生物合成基因簇<130>說明書、權(quán)利要求書<160>1<170>Patentlnversion3.3<210>1<211>49362<212>DNA<213>鏈霉菌S/re/^owyces/ave"dw/ae314(NRRL11002)<400>1tcatggcggaggcggcgaaggccggcgccasggtcctcaagcccgccgscgccctcccct60ggggcggttscggcggtaccttcgccgaccccgagggctacatctggagcctcggctaca120gcgcccsgggccctscgcggaatagccgaacgccsgsgcag3ccgg3g3g180ttccgcscccacgtcgagccccccgagcccatgcggggcctggaagtccc240gcccgaggtggtggccgcgctcggcgggggcgcgcggccsccggtgscgstcactctggg300cggccsctcctggsagsgccggatcgccctcctccgcggccgccacctgctcggcctcag360C33CgCC33Ccgccaggccgccggcgtcgccgtcggcgacgaggtcgaggtcgagctgga4203ctcgeiC3ccgasccccgcgtcgtcgtcgsgcctgcggacttcgcccsggccctggacgc480cgacccggcagcccgcgccgcctggcgtacagccgcasgcgcgagcacgt540acgcgccatcgagagcgcgsagsagcccga3scccgccg33gccgcstcgas33ggccct600caccsccctgcggggctgscaccgcaccgatgasggccag3acgtgcccc660cgggtggccgcagccgcgscattttgcstggtggacgcccgccctgcttt720cagtcatttattaactcgccssgctgg3gggcactgtggtgggacgtccccgaggggtcg780scgacgtggscatcctgcgtgcggcggcagacgtg3tcggccgggtggggcccgcgggac840tcacgctcgccgccgtggccggcgsggtgggcctcgtaiccgggcsccctcctgcsgcggt900tcggctccs3gcgcgggctc3tgctggccatcgcccgggagtcggccsaggaggccgaat960ccgtgtacgagagcatgcgtgscgsgc3cgcca.ctgccctcatggccgtgacggacctgg1020ccgtgacctcsatgggcgtccggaggtctacgcgaaccatctggccttcc1080tgtgcatcgacctcacggacccggagttccacgcgcacgcCCtC3CC3tCcaccsggccc1140agcggcgggtatscgsggccctgctcaccgaggcggtggac33cgggg3gctgttggccg1200gcscc33tgtcgcggcgctggccsccsccctccsggctgtcgtcgcgggcgcgggcctca1260cctgggccctggaccgcgagggcaccctgccccsccggctcgaacgcgaggtC3tc3cgg1320ccctagcccccc3catcacgccgcagggtg3ctccgc3ggggagggatcgtgaccgscgg1380cgtccgcscactggccggcaaggtcgccctcgtcgccggaggcacgcgcggsggcgggcg1440gggcstcgccgtcgaactcggtgccgccggcgcgacggtgtacgtcagcggccgcagc^g1500ctcggccascggssgctccgacstgggccgcccggagacccggcgcgggc1560ggtcaccgccgccggaggcgtgggcatcccggtgcgcaccgccccg33g31620ggtccgcgcgctggtcgsccggatcgacgccgsgcsgcaggggcgcctggacgtcctggt1680C3actgcgtct柳gcggggaccggctgaccgactgg33ccgtccgctgtggcagcagga1740cctcgacaaggggttgsggctgctgcgccsggccgtcgagsctcacgtgstcaccagccg1800gteicgccgtgcggctgatggccgcccgccgcagcggcctcgtcgtggsggtcaccgacgg1860C33C3CCgCCcgctaccgcggstccttcttctscgscgtggcgaagtccacggtgatccg1920cctggccttcgcacsggccgccgacctgaaggaa(:acggcgtcgcggccgtcgccatcac1980gcccggcttcctgcgctcggaggccatgctggagcacttcggcgtcsccgaggacaactg2040gcgcgacggcgtggccagggacccggacttcgcg(:3ttccgagaccccggcat3cctggg2100csgggcggtggccgccctggcggccgstcccgscdtcstggcgaagaccgggcgggcgct2160ggccacctggggcctgtacaaggaatacgggttcdccgsc3tcgscggcagccagccgga2220cttcgccgcccsctgggccaggsccctgg3gccgcggctcggsccgctgggtsccccctt2280gtsgggcgcttcccgg33sgagccacgctsatacctgcgcacagccgttgtccgcggcgc2340cccaggggttCtCC3CC3Ctgcgcggs3sgacgcttgagacceiactggaggtggctgcca2,ggacgggstctcccttggctactgtgcacacaggtacgtgccaatccggaatccaccgtg2楊cccatggcac3gggaggtctgcttcggtgcttaacgcaat2520tcsggtggccggtgga_cgcagcccgtcgggcggcatgcagggaccagggggtcatgcggc2580tcctggtgctgg3agcaggggtcgssgccccggtgtgtscggacgtccgggsggsctggg2640tccgtsccccggtcagtaggagcgaccttcgcgcgcggatcgacgcgctgcgcgtgsagg2700gcagcgcggacttcctgccgacggtggatccgaacggcgtcctgcgctatcgcsggatgt2760ccgtggtgctttccccgacctggtgaatcgcctcgcggaatcgttttccc2820atgtcgtcccgcgggaagtcctgctggaagcgctgtcccggscttcgcag2880atgccctgg3cttgcsc3tcatgcggatccgccgccgggtgaaacccctcgscctcgccc2940tgcgtacggtgtgggggcgcggctacgtcatggagtccgcgg3cgccgcctcgtsgcggg3000ccgcaccagggcctccccggcgggscscccgcggattcgtctttcagggactcgaaggst3060tcgscccggcsttcgaccgctgcgcgtcgac33tcgctggcgscgcgcatgtggatccgsc3120cgcgcgcgtccccgt己cccgcgcagagtccaccdgggggsg3scc3tgcttgagcgccgt3180gccgtgttgcggatgagtgcaggsgcggcggccgtcgccgtcggsggctccatggtcgcg3240ggcgtggcgtccgccgccgccgacaccggcgtccgccgctccgcgggcgtggsctggsgc3300aggctgsgcsgccgcctctccggcgatctggtg(:ttcccgccgacgcccgttaicgsgcsg3360gccagccgtctggcgstcggscsgttcgacgcgatccgcccgcaggccgtcgcctactgc34203gg3cgtgcgcacggccctcgccttcgcscaggacaacagcctgcgcctg3480gtgccgcgc3gcggtggccacsgcttcggcggctactccacgscgactggtctggtsctc3540gacgtctcgcgcctcaactcggtccgcctg3cggccgacacggtggtggtgggtcccggc3600ctccsgcaggtcgscgcgctgacccsgctggcg(:cgcgcggcgtccaggtcgtgagcgga3660ctgtgcccgggcgtctgcccgggcgggttcatccagggcggcggcct卿ctggcagacc3720cgcaagttcggcstggcgctcgsccgtctcgtg二cggccagggtcgtgcttgccg3cggc3780cgtgccgtcaccgcctccgctgcgacctgttctgggcgctgcgcggcggt3840ggcggcggcsacttcggcgtcgtc3cc3gtttcgsggtcgsgcccacgc3cgtcccgtcc3900atcgtgascttcaacctgscctggccctgggaagccgcccagaaggtcatcgcggcgtgg3960csgcgctgggtC3tcggcggctctcgcgacctgggcgccggcctcggcgtgcagtggctg4020gacgccggt3ccggccggcccgaactgctcgtctacggcggctggctcggccgccccgac楊Ogccttcaccgccgtgctcgscgccttcgtccgcgatgccggcagcgcccccctcscccgc4140tcggtcgaggagcagccgtaccsgssggcgatgatgtcctactacggctgcgccgacctg4200acgccggaccagtgccscacggtcgggtactcgcccgaggccgcggtaccgcgc3ccssc4260ttcgcgatcg3ccgcagcaggctcttttcgaaggccatcccggactccgg4320gccctggsggccttcgtcgccascccgcgcgccggcc3gttccgcttcctcsgcttcttc4380gcgctgggcggcgcggccssggagcccggccgcsccg3C3ccgccttcgtgc3ccgcg3c4440accgagttctacgccggctactcgctgggcctcaacgaagcgaagtscacggccgsggsc4500gsggcggcgggscgggcctgggccgsggccggcttccgcgcc3tcg3cccgttctccsac4560cgggagagctaccagaacttcstcgscccctccctgacggactggaagacggcctactac4620ggcgagaactacgcgcggcttctcacggtcaagcggaagtacgaccggcaccaggtcttc4680tccttcgcgcagggaatcgcctssttcccg3a3gatccc3accgagsggtgsascacatg4740gtcgccgagcagggcgcgccgagcgccasggcgggcaagaCCCtCCtC3Ccctgttgtgc4800ctggcccagttcatgctgstcctggacgtcgccgtggtcgcggtcgccgtcccgtcgatg4860cagtcgcsgctgaacatctccgcggccaacatccaatgggtcsgtsccgcgtacggcctg4920gcgttcggtggcttcctggtcgcttccggccgsgcggccgacctgttcggccccaagcgg4980atcctgctcatcgggctctcgctgttcacactcgcctcggggctgtgcggcatcgcgccc5040cgctgttcgtggcccgcgcggtgca.gggcttcggcgccgctctggtgtcg5100ccggcggccctgtcgctggtcacgsccagcttcggggagggcgaggagcg5160ctcgggctgtggggsgcggtctcgtccggcggtgccatcgccggccagctgctcggcggc5220gtgctcgcggscctcgcgggctggcggtcc3tCttCCtC3tcaacgtcccgctcggcctg5280atcgtgatcgtggccgtcgcccgcatggtccgcas.ggaccggtcgaacgacaccggccgc5340atcgacatggcgggcgccggcctgctcaccgccggagtggtgctcctcgtgacggccgtg5400tcgcaggccgccgsgcgcggtgtgggcctgcccgtcctg3tcagcggtgtgctcggsgcg5460gtcctgctggtcttcttcgtggtgc3cgaaggccgcgtcc3g33ccccgtcctgcgcctg5520tcgstgttccgC33CCCC33tgtccgttacggcas.catgatctgcatgctgscggccggs5580ggcgcgscggtctcggtgttcctcggc3ctctctacatgcagcaggtcatcgggatgagc5640ccgatgatggcsggcctgggcttcgctccggtcaccctgctgatcctgggcgtgtcgatg5700cgcatgggcccgctggtgcag3agttcggcgtga&gascctgctgctggcctcggccgcg5760ctgttcgcggtgggttcgctgctgctgagcttcgtcscggtcgscggctcgtsctgggtg5820caggtgttccccggcctcgccstcggaggcgtcggcgcggccctgsgcttcgccccctcg58803tg3tcctctgcsccsccggggtccccgacgsggdgcsgggcctggcttcggggctgctc5940ggcacctcgcagcagatcggtgcggcgctcttcct:ggcc3tcctgagcggggtcaccgcc6000gcggtggccgcctcgtccggcggcgacacccccgaggcgctgaccgsgggcttccaggcg6060gggttcctgtggtcgctggcgatcccggcactgatcgtcctgtgcctcctggccctgccc6120agg3ag33ggacgaaccgcaggtggsgccggcggcggagcggtctgaccc6180ctcstacggtccgscscgtcg33cggcccg3cccgtc3tacggcgcctccgggccscsag6240cgaaagcgctcggccggtacctcaccggccgagcgcttccgttcggcgtcsgccgcgtgg6300gacgsgccggcgcaggtggccggcggtggcgctgtcggccgcccggatctgctcgggggt6360gccctcggcgacgacgcgcccgccgtgctcgccggctccgggccccatgtcgacc3gcca6420gtcggcgctgtccgccsggtgcaggtcgtgctcggcgsccagcagggtgtggcccgcggc6480csgcagggcgtcgsaggcgctcsccatccgctgcacatcggccgggtgcsgcccggtcsg6540cggctcgtcccgccgccgcggccacccscggcgccccggcgtatcgcctc6600cgccsgcttgsggcgctgcgcctcgcccccggacaggtcggtggcgctctggccgagctg6660gaggtagccgaggccgacctgctggsgggcgtgcaggstggccgtc3gggtgcgcggttc6720gcggsagaactccgccgcctcgtcgacggtcsgcccgagcscctggtcgstggccaggcc6780ctggtsggtgacgg3C3ggscctcgggcgtgaacctgcgtccggcgc3C3cgtcacsggt6840cgcccacscgtcgggcsggsagtgcatgtccaccagcttctggccgtagcccgtgcacgc6900ctcgcagcgtcccgcggccgcgttgaaggsgaaggcggsggcgcccaggcccsgcgcgcg6960ggcgcggtccgtggccgcgtscagcttgcggsccgcgtcgaacgccttggtgtacgtggc7020cggattggaccgcggggtgcggccg3tcggactctggtcgacgcagtccaCCC3CgCC3C7080ggactcgstgccggtgsccgcccccsccgcaggatggcccgaaccggcgsgggcggcggt7140gagcgcgggggcg3cggc3tggcccaggsggctgctcttgccgcttccgctgscgccggt7200agggtgttcagcgggacgcggagctcgtccacccgtacggtgtgcgcccg7260gacgtccttg3acgtcagcc3cctgccgctgccgccgccggsccgggcgcggttgatgcg7320cggccggctgcggtccaggtagcggccggtcaccgagtcgggatgggccgcgagctcctg7380cgggggaccgcccggccgccsccgctgccggcgccggggccg3ggtcg3t7440gacccagtccgcctgggccscgacgtccgggtcgtgctcgcggtgttgcc7500cgagtcccgcsggtcttccaggatgtcgcgcsgcttctgcttgtccgcgggatgcsgtcc7560ggaccccagctcgtcgagcagcccgtcsgcgaggtgctgagctgsgcggc7620cgtacgggcgcgctggagctccccgccggsgagggtcggggccgtgcggccgagttcgag7680atgggccaggcccaaccggtcgsgcsgccgcagcc:ggttcgtcscctcgggcagcagcgc7740ctcgccgstctcgcgctggacgccgctgaggscgccgccgagctcggcggcccacttctg7800cacctcgtgcacccgc3gggcggscagctgctggtacgtgscgcccgccagggccaccgt7860scgggcggcctcgccgtsgccgctgccsccgcsctgcgcgcsggggatctggcgc3tgta7920cggcaggtagcgctgcttggcgttctcggtaccggccgcggcgsacagccgctcgacctc7980ggccsgcgcsccgcgcsgcggctggttgsgctggtsgaiggacctcggcgtcggsgttctt,0gttggggacctgcscgctgacgctcacggtttccgtgcccgttccgtacagcaaggcgtc8100gcggasggsctggggcsgctcccgccacggcsgggagaggtccgcgccgtggcgctccgc8160csggaeigggcaccgccacctgctccggcgagcgcagtttgtcg33ccacggcgstccgcc8220ctcctggsgggggsggccggggtgggtgatg3tcaggtcggcgtccgcscggggagtgcc8280gccggcgccgtggcsgacggggcagccgccctccgggctgtagcggctgasgtgcccgct8340ggacagctccggtccctccccggccaccgcgggcagccgggagtsgagcagcccg3ggta8400ggcgtcgacgccggtcatcgtggcgaccgtggsc(:gcgsatggcggttgaggcggcgctg8460gtcaacggcgsgggtggcgccgaggccgtcg3tgcggtccacctgcgggcggtccttggg8520ggtgatgtactggcggacga3gggcg333ggccctccaggtagcgcagttgggcttccgc8580gtggaccgtgtcgatcgcc3gtgsggtcttgccgctcccgctgacccccgtgasggcgsc8640gatcccgccccgtggsatgtccacggtgacgtccttcaggttgtgcgtccgggcgcgtac8700caccctgstccattccgtgctgctgtggggg33ggggctttccggcatgctgtttccgct8760cgttcgggggcgt3cgggsscaggcccccgggggaccgggggcctgctgaasatcagatc88203caccgccgcgggggccgcggccggtgcggccggccggcgccgcgccgcgccaggggcct8880ccggctcgccgcsccsgtggtgcsgsgcggcccgcaggccgggggtgtctccggcgtggc8940ccgcccaggcgscgtsgccgtccgggcgtaccagcaacgcctcgggcaggtccttcgggt9000tcttgcggcct3cgctcsccgaccgcaccgcggtgacgcggtccgcccagcccgcttcgg9060ccacctcggcgC3gaeigccgccgtgggsctggtcgagcsgC3Cg333CC3cccgtgcggs9120agtgggtgtagsggcggctgtccacgccgtcacgtgagcgcagcgcggtgtcgtgcaccc9180gtcggccgaccagccggtgcggtacccgtcccgtcgtgccggccaigggggcggtsgcggs9240gcttcagcccggacagctcctccsggacggagttctgtscgggcgccsgccccatcaggc9300gggtgctgsgcccgcgcsgcagccgggcccccgcggagtcggagacctcg33cttgt3g39360ccaggtccgtgcggcgcagggtcgccgcagcgatggggcggcgctcgcgctggtsggtgt9420cgaggsggtgggccggtgcccgccccgacaggtgggcggcg3gcttcc3cccgaggttca9480tggcgtcctggstccctgtctgcaggccctggccg.ccggacgggatgtgggtgtgcgcgg9540cgtcgcccgcaagcaggscccggccgagccggtagcggtcgctgagccgctggtcgctgc9600ggaagcgggacatccscagcggatcgtggagcccgaagtcggtgcccagtsccgcccggc9660agctctccctcsgctcgtcgaccgtgacgggctcgtccsccggcgcgccc3tccgcgcgt9720ggtcgagcgcgatgagccggaacgtgccgtcgccgaacgggaagagcscgsccatcccgc9780gcttgccggtccgggcgtacscctgcgggtccggcggggtggccagccggacgtcggcca9840cgatcagcgaactgtcgtsggaggatccgc3ccggcg3gttcgcgcaccg9900tgctgtgcaccccgtcgcsgcccscc3cg3agtccgcgtgctcggtgcgcggscccccgg9960gcccctcc3cctccacctcgacgccgccgtcgtgc:tggcgcagtccggtcacggtcgccc10020cgcgctcgatgacggcgcccgagtccsgggcccaggaggtcsgctgctcctcggtgcggc10080tctgcgggstgacgagcstgtscgggaaggggctgtcgacgtcgcggtsgtccagccagc10140gcttgccgtcgccgsgcgcggcstgcgcccaggg(:aggccctgcgccscgaacgsctcgg10200cgcgcccgcgcatgtcgagcagttcgagggtgcgggg.csgtccgascgcccgcgagt10260ggggcgagggggtgctccgcttgtcgagtacscggcsggcgatcccggccgcggagagct10320cggcggcgagggcgsgsccggtcggtcccgcgccgacgatcagcacggacgggg33ctgg10380ccatgaaatgctccttggtctgactcggactgactcggtccgacccgtcgcgtcgcggat10440ccgggcgscggcgggcggstcagccgttgcgggtcgcgtgggtg3cg3cgasggtgccgg10500ggacggtctccsgscgctcgggggtg3g3ccggcgtccgcgaccagcgtgcgcatcccgt10560cgscgtcgtacagccggggcagccgccccatggc(:ttg3gcagcagccggsgcagcactc10620cgggcggggcgtgctcggcgstcatcscggaaccgcccgggcgcaiggeitccggcgtattct10680cccgcagccccgccggggcgtccgaccagtggccgaacgcggtggtgctcsccacgsggt10740cgacgctcgcgtcggccggcggcsgccgctcggcgcggccgaccgtcagctcggcgccgg10800gcsgccgcccccgggcgatctCC3CC3tgCggtcggcggggtcgacgccgtacaggcggg10860cctgcggccagcggcggcccgcctcctccagcagctttccggtgccgcagccg3cgtcg310920gtacggaccggggctccsggcccgcctgggcggcccagtcgagtacgacggtgtgggcgg10980tctcgcagtccttgccgaacttggcgtcgtaccgcggtgcgatccggttg33ggtgtcg311040cgtgcccgggstccgccatggggttccgctccaggtctccgtgccg3tgggatttccacg11100ctaggaaggcggccgggcgcgcgggcgascggcg3c3ggtcgctgaaaggstcccgcccg11160cgccgccgccccgccgcgcggtcctggcggtgcgggtggaacccgcaggcgtcagggcag11220ctcggcccaggcgcgggcgcgctcctcgttgtccttgcgggcgctctcggcgatccgctc11280cagggccgcccsgtcgatgtcctggggggcggtcaggttccgctcctcc3ggasccgcgc11340ggtgtcccgttcgstcttctcc3cg3cacccgcccagcccccgggtccaagaccgggccc11400cggcagcacccgccggggccgccgtcgccgtgggcggcgaggtc3ccgg311460gscggcgsgggcctgc33cgagccgaactcctcgtcgtggctgtgccscagcagccacgc11520gggtccctccgcggccggcscgccctccggcsgccggsccaccstgagacccgtgtcccg11580gtcctgccggggacggcsgcgcaggctgaaggggsagccgtcggggtcccgg3aggag3g11640csccgggctgctgaagaggtgcaggtagtcgtcgatcttgtgccsgscgctcstgccsc311700cgctccagcgcggtgccgcccstcgcgtccttgcggasggcccsgsccttctccggcgtc11760accgtgatgcgc3ggcgcc3gtagaaactcggatggaactgcgcgcggatccgcgggtcg11820agcgcctcgsgcgcgccggcggg3cgcttgcggaacagctccgcccsga3gtcctccagg11880ccgtccacggtcatgatctcctctggggcggtggc:ccgcccctgcacgagtacggcgggs11940gcgctgccgggtagggcgctgcccgtgaastcgga.cstgsggagggagatcccggtcgtcg12000cggttgstgtgctccgccttccgggccaccgagatgctggaactgaggacgaactcgttc12060tcctccggcttccagsggasggccstcggccaggtcaccgggcggccggttctggcssgs12120gtggsgaactcacaggcacgtacacggcgc3g33cacgcttcacttcgccggsssgggtg12180gctggcatggttcctcttgatcgtcgggtgattcctgtcacgatgaggccgggacatggg12240t3ccscccggttttcttccggtcaaggctggwttcgsgcctcagtcgacggtctcggac12300cgtccgastscggctgacggattcctgaaaccggtcgaattccgaatgct12360C3tgscs3cgacgcccgcacccagctaccccttccccgcg333t3CCtCg12420scgaggccgccgs3Ctggccaggctgcgccaggaggsgcccstctc3cgggtcaccatgc12480cctacggcggcgaggcctggctggtgscccgtatggcggacgtcsaggaggtgctcgccg12540accccsggttcsgccgtcagctgcagaccgsggcggaccgcccgcgcttcttccccgagc12600ccgtggtcgagggcataggc3tcatgg3cccgccggagcsgacgaggctgcgccggctgg12660tcgccasggcgttcscggcgcgccgggtccaggagttcgggccgcgcgtgcsgacgatcg12720tcgacgaactgctcgscgcggtcgaggccaagggcgcccccgccgacctctscgccgact12780tctcctggcagctgccgggc3tctccatctgcgagttcatgggcgtcccctacgaggacc12840gggaccggttcgtgccgttcttcg3ctcggtcgtctcgacC3cgtcc3sgsccccggscg12900agatccgccsggccatcggcgscctccacgcgta(二ttcggcgagctcatcgagaggcgtc12960gcgccscccccggggscgacctgttcsccgctctgstccgggcgcgcgacgsggscgscc13020ggctctcggagaaggagctcatcgagatgagcttcggcctcctcgtcgccggcstggaga13080ccaccgcctcccsgctcgcc33CttCCtCtacctgctgttCCgC33CCCCgaccagctgg13140agctgctgcgcgccaagcccgaactggtgccgaacgccgtcgaggagctgctgcgcttcg13200tgccgctgctcgccgtcgaccsgccgtcggtggcgcgcgaggacgtcaccctgggcggcg13260tgctgatccgggccggcgsg3ccgtggtcccgtcgatgasctccgcgasccgcgacgccg13320acgtcttcgag33cccgc3gcgcatcgacgtcac'二cgggagsscaeicccgcacctcgcct13380tcsgccscggagcgcaccactgcgtgggcgcccagctggcgcggstggsgatggtcscgg13440cgctgcgctccctgctggtgcgcttccccggccttcaccaggcggtgccg13500tccgctggs3ggccggggtcgtcctgcggggcgtcgaggcgctgccggtggcgtggtagc13560cgtggccgacgag33cgcatggagcgteicgggtcgaccggcaggcctgcctgatgtcggg13620gctgtgcstgtcgctggccccc33cgtcttcggggagggcgscgsccsccgggtcgtggc13680gctgsccgaccggtccgcgcccgacgggcggctgctggaggccgcggsgttctgcccggc13740cgaggcgatcgcgstcagcagcctcgcgaccggggagacggtcgagggcaccggctgagg13800cgggccsctctcccgcggsaccscccgsgggcggcggaccggtgtgtccgccgccctcgg13860gcacgtttccggatcaggtg3tcgactgggcgasgcggaagaagcccttggggtcgtagg13920cgcgcttcaccgccgscsgccgctcgtagttgsccccgtagtsggcctcccgccagtccg13980cgaggtccgggtcgatgtagttctggtacgcctggcccgsggsccsggggtgcacccgtg14040cgtaggcccggtcgscccsctcctggcscgcgcgcctgcggtcgggcgccagcacctcgc14100cctccggcacgtcgstgccc3ccgsccagccggcgtsgaacagggcgttgcggtggacgt14160scgcggtggcggtcgcgggcacgcggttgtgggcgccgcccsgcccctggaiggtcgaagc14220tgcgggcctcgccctcgctgcgggcggccgtgaaggccttcsggsccgcgtcgatgcccg14280cactgtcgsgcggcctgtcgaagaagttgccgcgcgccagcgcgaatccgtsacgggcca14340gctgggcctgggggttgtgtccgscacggtggcacgcggccggttcgagggtgtcgcatc14400cgaaccagtggcgcstgccg33gcggtsggagtcggtgcgccgctcgctcgtcgcggggg14460cggtgcccacggcggcggtcaigccggtccagcsgcgggccgagcccgtccggcgsgccgs14520gccacscgccggsgsccgtcscacccggctcgacaccggcgtcggcgccgtacgtcgsgs14580tgttgagcagcggcgtcsgcgggtcgggggcctcggccgtccaccgctgccsggcggsc314640gcsccgcccggeicggaggcccacgtccagggctggtcstgtcgggggcgg14700gcaccggctcgasgtcgtactcggtgacgatcccgssgttgccgccgccaccgccgcgca14760gggcccsgaacaggtccgcgtgctggtgggcgctgctctcgaccacgcgcccgtccgcgs14820ggsccscctgggccctgcgcstgcggtcgcttgcgacgccgtacstccgggtcaggasgc14880ccagcccgccgccgagggcgaggccgccgatggcgaccgtgggacsccsgccggtcggca14940ccgcgaccccgtgcscggcgagcgcctcctccaggtcgacgagctggcagccgggctgga15000tccgggcgsccgsaccggcc3gccgc3ccgcgttcatgagagsgaggtcg3tg3cc3tgc15060cggtcgtcgtggagtatccggcgssggagtggccgccgctgcgcgccacggccggcacgc15120gatggcgcgcggcgaaccgtacggcctcgc3gacgtcctcgsccgtggtgcagcgcacca15180cggcctgcggcctgaccgsgtcgssctggcgcagggccagggagcggacggtgtcgtact15240gcgggtcgccgggcagcaccaccgtgccgtcgaggctccgctggagggcgtcccacggcg15300ggcctcccgccgsggcggtggcggccggcsgcccgcsgctgaggccgccg15360cggtccccagg貼3gtgcggcgtttcatacaggtctccccttttccgggcgcggtctgtc15420ggttccgscggcccsgsgtgtggtgactgtgttcgcgccacacggcggagatggcaggaa15480ttcgscagttctcggccacacs3gggggcggctcccgggtcsggggtgctccggcsgctc15540gatgatctcgttcaccgstccggtgscgcggtacgcccgcggagccggcccgtccggccc15600ggccaccagggcccggacgtgcccgtcggggaccgtggagatccgcacgscggg3tccac15660cacsgtcccggcgggcagcacgacaggtgcgcgctgcatgatgcgggtg3gcaccagctt15720gctctccsgcagggcgaggttgcgcccgstgcagttgtgctggcccsgsccgaaggggaa15780gtsctcgtgctgcgsgggctgggcggtctcccag(:gctccgggcggasgcgcaggggctc15840ggggaacgtcccggcgtcgcggtgggtgacgasggagctg3agatcgtcstggcgccctt15900ggggstcgggtggccgtcgstcscggcgccctcctccgcgtagcgcscgccgastccggc15960cggcggcatcagccgcacgctctccttgacgstccggtcc3gcscgggc3tccgtacgsg16020cacgtccggtcccggsgccgcgtcacccgcgscciccgccacctcggcgcgcsgcttctc16080CC3CC3CtCCcggtgctggtcgsgcsggsagsgggtccag3scagggacgsggcgscgct16140gtcgtggcacagcgcggtgtscgcctcgcccsgcagctcctcgtcgctcsactcctcggc16200gtcggcctccgcgtcgggcscggtcagcttgctc,agcaggtcgggcgcggccgctccggs16260gcgccgtgcggcgstcagcgcgcgcaggatgtcctcgatctcgcccgcgstccgggtcat16320gcggcggssgggcgtaccgggcagcggtgcctgtstC3ccgccgtgsgcgggtgccccgc16380cgcggcsgccagctcgaccatcttgtcgttgaggtggcgtgccgccgcctcgtcctccag16440accggccatggtggccsgcgagatccgcgtcacg,aggcgggcgsgctcctcgtgcaggtc16500cacggaacggcccggcgtccacccgtccsgcatgtcgtcgcgatggtgtc16560ggtgtagacggtgscgtggcgcggggagsaggcctgctgcstggcctg3Cggtgcttgcg16620gtggatggtgccgttcagccgcagcagcccactggtC3gcagcatcatcggcgsgccggg16680cggcsggcgcaggtggcggaaggsgtccg3ttcccggtc3gcscctggcg16740tgcctggtcgggcccgaacgcgsagaccgtgcgcgggttgcggccgggcatcgcgcsgac16800gtcgccgtaaccgctcgcgagctcctgcaggaaaggcagcgggcggcgcsggagattcag16860tccggccgagagcctgcgca3cccgagaggatctgccggtccggccsgtcggtccgccgg16920attctgcacsscaggggttccaggcacggtgsttctcctccgggaacggcatggatgaag169803CtCttCCttgtsccccsccgccgcccgcccgagccgccggccgaaagaaccccgcaags17040tggtcctcggcgacttctttcgggaatccctcatggttctctccggaggg17100tccgggcccccttt3gcttcgggsaggcgggcttcgggagcggactttccccccsccccg17160ctgsacgcggcgtstgtcaitcggc3g3cgggtggggacacaatgcaggcgsgcacgggcc17220gggagactgttctcggggcc3ttcgggccgtgtgcg3g3ccgscccgggtgcgscggcgg17280ccgttttccccgcccagccggcgstcggtctcaccgaggcggtcaccttcsgccgcgagg17340gactggaccgcgaggcccgggccctcgcggtc33gctgcgcgaggtggccg3gggccccg17400tactgctgctgatgcsgcccggcccggact3cctcaaggggttcctggccacggtctacg17460cgggsctgccagccgcaccggtctacccgccca3c.cccgccgacgtgcgccgcgscttca17520tccggctcggcgcgatcctcgccaagctgccggacgccaccctgctgaccgagccggggc17580tgctggagccgctgcgcg3gctcttcgcggagcggctgcccgacgt3cgtcccgaacggc17640tggtggacaccagcgtcccggccggcgccgaggac:gactggcgggcgcccgcggtccggc17700ccgaggasccgctgttcatcc3gttc3cctccggttcgsccggcsccccccgcggcgtgc17760tcgtctcgcaccgcascctgctcgccaacgtccgc:gccatcaccgaccggttcggcctgg17820acacgtcgsgcsccggtgccctgtggctgccgccgtaccacgscatgggactggtcggcg17880gsgtactcsctccgctcgtcsgcggcttcccgatccscctgctctcgccgctctccttcg17940tctccgacccgatgggctggctccggctggtctccgsgaccggsgccscccscsccgggg18000cgcccsacttcggctscgcgctggccscccgccg(:gcccgggscgaggscgtggccgcgc18060tcgacctcagccgsctccaggtggccttctccggt:gccgagccggtggacgcctcgaccc18120tgcgcgctttcgccgsgcgcttcgcccccgccggcctgsgcccggacgtcttcctgccct18180gttacgggctcgccgagtccaccctgatcgtcagcggcggccgccccggcgccggactgc18240gtacgcaccggctcgaccccgsagccctggcactgggcsgggccgaaccggcccggcccg18300gcgccccggcc3ccg3actcgtctcc3gcggcccggtggtggccggcsccgsggtscggs18360tcgccgaccccgtcsccggggcggcggccggscccggcgsgatcggcgaggtcttcgtcg18420cctcggacagcgtcgccgsggggtscttcgsggaccccgagg3g3cggcgcgcaccttcg18480gcgcgaccctggcgggctcgagtcgcagctggstgcgcsccggcgacctcggcttcctcg18540gcgccgacggcgacctcgtgccggtcgcccggatcaaggacgtcatcgtcgtccgcggcc18600gcs3cctgc3cccgc3gg3cstcgagcggacggtgcaggacsccgsccccgggatccgca18660agggctgtgtcgcggccgcgggcgtscccggcccggacggcggggaggagstcctggtcg18720tggcggsgctgcggcccgaggccgcggscg3cg33gccc3ggcccgcaggatcgcccggg18780aagtccgcgcagccgtcgcscgcgagcactccgc:ccggtgcgcggcatccacctgatcc18840gtccctcgaccctgcccssgscctccsgcggcaaggtgcagcgctccgccgcccgcgccg18900cccacctcgacggctccctggccaccgtcctgtgcgacaccgccgacsctgcggcgccga18960cgggcgcgcccctccccgccgggcccgcsgcgggcggctccgacgscctcgtcgtcascc19020tcgcagcgacggtcstC3ggcacttcgtcggccacggcggcsccsccggcgcactggact19080cgctcggcgcggcccggctcgccggggsggcgtcgcagatcttcaaggtcgcccttcccg19140tcaccgacatcctggccggtgtcgscgccgaggcgctcgccgcgctcstccggaccgccg19200cccccctcggcggccccgccgccgc33ccggctccgccgggctggccgaactccccgcga19260cggc3cgcc3ggaaaccctgtgcctgctccaggaactcgsiccccgccgcccccggcctcg19320ccctcggcatcgccttcgccctgcccgccacggccgscccgcagcgcgtcgccctggccc19380tgsccgctctcgtggcccgccscccggcsctgcgcscccgcctggtgcgccgcggcgaac19440sctggtcgcgcctcgtcgscccggccgacaccgccgccggtgcgccctggggccggtsct19500gcsccctcscccggctcggcgggstcgggg3gcggg3cctcg33g3ggccctcgccgagc19560gctgccgacgcgtscccgatctcgccgssggcccgctgttccaggccgaggtcgtcctcg19620cggscgggcscggcgcccscctggtgatcaccgtccaccacgccgtcgccgacctgtggt19680cgstcggtgtcctggcggccgagctcgcccacctgctcgccgccggtgatccggsggccg19740gggccctgaccctcccgccggccccggaggtctcctgcgacgtccccgscgaccggcgcc19800tggascgggcgtggaacttctggcagggsctgctc3ccg3cgggtgcggccccctccagc19860tgccgtccgccgacggcccggcgcsgsscgtcccggccggccgcgcgcggcsggccsgcc19920acgccccgctcsccctggacgcccggcgcaccgccgcgctcsccgccctcgccasggsgt19980gcggcgtcaccctgtacgcggtcctgctcgccgtgcsgtcgctcaccctgtcccggctc320040ccggaaccg3ccgcgtcccggtcgccgt3ccgctgcacggccggggsagcgccscgcscc20100gggccgtcgsctscctcgtgtccacggtcsccctgccgatggsgsccgccaccggcaccg20160tccgcgacctggtcggccgtgcggccgscgccctgcgcggcgcgctcgcccsccgcaccg20220tcggctaccccgsgctggtggccctgtccgcggcccgcggcggccccgscgt3cccgccc20280ccgacgcggcgctgctgctccagcsggeicacccccggcgcsccgcgcggggtcggcgccg20340gcctcctcggcggcggagtgcggctcggggacgtggaactgggcgtcgccgtggtcccgc20400ccsgcstcggccccttcggcctgaccaccctgctcacggagacggacggc3CCCtC3CCg20460ggcgcgtcgaggtcgaccccagccgccatgccgactggctggcaggccggttcgccgaicg20520ccttcctcgcC3tcgccgactcggtggccgccggcgcaggsctccccctcgc,ggcgt20580ccgcggtgggggcggaacaggccgsscggctgcgcggctggtcgcgctcgcccgtgcccg20640aggscggcgagggcsccctgcacgscctcgtcctgtcagcggcggcccgccacccgcggc20700gcaccgcggtggtcgccggcgacgggsgcctcagctacggcg33ctcgcscsccgctcgg20760ccgtcgtggcggccgcgctcgccgcgg33ggcgccggccccggctgcsccgtcggcgtgc20820tgatgcsgcgcggccgcgscctgcctgcggtcctgctcgg3gtactgcgctccggcgccg20880cctacctgcccctgg3cgcggccactccgcccgcccggctggccgccgtggtcgaggscg20940ccggctgccgccacgtcctggtgggcgacgtcccggtggaacgggggcagttcttccccg21000tccgcaccctggscgtggacgcggtcctcgcggcgggcccggccgsgccggtgccgccgc21060ggcccctgscC3cccccgscgaccccgcctscctgctcttcscctccggctccaccggcc21120ggcccaagggcgtggtgatcccgcaccgcggaccggtcaacctgatccgctgggcgggcc21180gggagttcggcscggacgccctcgcccgc3ccctcgccgtcscgcccdccaccttcgacc21240tgtccgtcttcgaactcttc3cgcccctggcgcacgggtgcgaggtgcggctgctcgacg21300gcgtcctggacctggtggacagccccgcgc3cgccgccgscgccsccctgCtC33C3CCg21360tcccctcggcggtggcgsgcctgctggascagga(:gcgcttcccgccggcctcagcgtgg21420tcaacgtggcgggcgaaccgctcaccgccg3SCtcgtCC3ctccgtgcaccggcgccttc21480ccggtgtccgcatggtcaacctctscggccccagcgsgacC3CC3CCt3Ctccacgtacg21540ccgsactgggcccggscsccsgcggcgccgtccccstcggccg3cccgtcggcggcacca21600ccctgagcgtcgtcgscgcctccctgcggccgctgccgcagggsgccsccggcgssctgc21660tcstcggcggcgcgggcgtcgccgtcgg3tecgcggggcgcccgggcstgaccgcggccc21720ggttcctgcccgsccccgsscscccgggccgcaggctctaccgcsccggcgscctcgtcc21780gctggcgggccgscggcctgctggagttcctcgggcgctccgaccaccaggtcssggtcc21840gcggcttccgcstcgaactcggcgacgtcgagcgcgcgctgaccggcctcgacgcggtcc21900gggaggccgtcgtcgtcgccctgggccsgggcaccgaccgccgcctggccgcctscctcg21960tgcccgagcggcccctggsaggcgsccccggcag(:tggctgcgcggcgtccgccggcgcc22020tcggccacgaactgcccgggtacatggtgccgggcgsgttcgcggtactcgacgaactgc22080cgcgcaaccggcacggcasgctcgaccgcggccggctcgcctccaccgcc3ccgttccgc22140tcgtceicgggcsigccggstcgccccgcgcaacga(:agcgsgcgccgtgtcgcggcctgct22200gggcgcsggcactgccggttcccgagstcggcgtgaccgacgagttcctcgscgtcggcg22260gacsctccctgstgctcaccscgstcgcgc3cctgatcgccagggagttctcggtccagg22320tcccgctgaccgccctgcgccggtggccga3c3ggccaggtacctcgacg22380agctcacctcggccgcgcccccgcacgccgcsggcccgcggtccgtgcg3cgcctggacc22"0gcagcaggtac3ccgcggccgggsscggccggsccscctgatgggccgcccccctcaggs22500ccgcccgaccccsgcccgscgaaacgttttggagcactcgtgaigcgaccggstcccaccc22560gacgcccatctcttccccgcctccccgggscagge.acggctctggttcctggsccggatg22620aacg3g3tggccggcssggcgtacggcctcgccgt.ccgcctggscctgcgcggcgccctg22680gsgcgcsccgccctccsgtccgccctcagcgccgtcgtcgascgccscgsggccctgcgc22740accggcctgcgccagstcgacggcscgctgacgcaggtcgtcgtccccggcgtcsccgtc22800tcgctgcccgtcgtcgscctcggcggccgcggsccggaccccgcgcsgctcgsccgcgsg22860gtgcggcgcctggcacgccaggaggcccsgcgcggctggaacctggcccagccgcccctg22920ctgcgcgggctgctggcccggctcgccgscg3ccdcc3cgtcctgctgctgtgcgtgcac22980cacgccgtgtgcgacggcctgtccctccsg3tcgt:gctgcgcg33ctgctggagaactsc23040accggcggcggtcccgccggtgccgacgagcccctccsgttcgcggsctscgtggtgtgg23100asc33cggcggtgaggagttcccggaccccgagtggtccgcgcgccgccsggccgcccgg23160gascactggsgcagcaccctcgcgggcgccccccaggttctcgscctgcccaccgaccgg23220cgccggccgcccctgcagtcctacgccggagccc(;cgttcccgtccggctggscaccgcg23280ttcgccgaccgggtgcgcgactggtccgcccsgcgcggggtgacccccttcaccacgctg23340ctggccgcctacaccgtcgtcctcgcccgc33cggcggcgccgacgacctgctggtcggc23400ctgcccgtcgccsaccgctcccacgcggacctcgccgggscggtcggctacctggccaac23460scctgcccgctccgcgccgscctgcgcgccgsccccsccctcggcgaactggtccgagsc23520ctgtacgsccggctcacgggcgtcctcgaacacgccgacctccccttcggcgagctggtg23580gaactgctggcsccgccgcgcstgcccgsgcgcsaccccgtcttccsggtcstgttcggc23640ctccagcaggacgtccggcgcggctgggscctgccaggcctgcgcgtggacgtcgaggac23700gtggactgcggcsscgcccgcgtcgacctgtccctgttcctcttcgaggsggctgscggg23760gcgatcgscggsttcctggagtscgcctcggcgctcttcgaccgcgccaccgccgagcgc23820ttcgccgaccagctccacaccgtgctccggcag3tcctgcgggscgcgcggatccccgtc23880tccgccgtcgacctcgtcggcaacggctccgCC3CC3CC3tcgsctccttcctcgacggc23940gggccgctggsacagccctggccgctggtgtggccgcggstccgcgaactcgcggcgcgc24000cgtccgtcggccgsggccgtccgcgacgacgcggsagccctggsctscgcctcgctcgtc24060gaccgcgtcgacgccgccgccgcgcggctcsccgctgccggcgcgggccccggcgsccgc24120gtcgccgtcctcgccgascgcggcgtccgcgccgtggtcgcgatgctcgcctgctggcgt24180gccggcggcgtgtscgtgcccgtcgacccggcggccccgctgccgcgccgggagctgatc24240ctcgaacsggccgctcccgccgtcctcgtctgcgsggsccccgscgagcagccgccgcac24300C3ccggsgccgsgcggtggcgstcggcgacctcaccgccgaggcggacgccggtgcgggc24360acsccggcggagccggccccgcggccgcacgatcccgcgtacctgatgttcacctccggc24420tccaccgggcgccccsagggcgtggccgtc3gccscgcc33CCtctCCtCgttcctgcac24480gcgctgsccggccgcctggcactggggccggccg'accggctgctggcgctgscgscgacg24540gcgttcgacstctcgctgctcgsactcctcggcc:cgctcgtcaccggcggcsccgtcgtc24600gtggcaccctcctccgcccsgcgcggcgccgccgacctcgccgcccggctcagctccccc24660ggaatcaccaccgcccsggccacaccggccgtctggcgcctggccctgtccgccggatgg24720cgcccccgggsigggcttcac3Ctcctgtgcg,gcgaggcsctgcccccggscctcgcc24780gscctgctggccgccscccccgccgaggcccacascctctacggtccgaccgagaccacc24840atctggtcctgcgccgcccggstccgcccgggtgagcccgtcaccatcggccgtcccatt24900cccggcacccgcgtcctggtggccgscgccgcgctgcggcccgtsccgcccggcgtctgc24960ggcgsactcctcgtcggcggccccggsgtggccctcggatacctggscgaccccgcccgc25020accgcggcgcggttcgtgcccgscccctsccaccccggcgaacggctctaccgcsccggc25080gacgtcgtacgcctgcgctccgacgggttgatcgsgttcgtgggccgcgtcgscgaacag25140gtcasggtacgcggccsccgcatcgaactcggcgagatcgagsgcgcgctgcgcgccctg25200cccggagtgcgcgscgccgccgcgsccgtgctcgacccgcgcggcseicgcccgtstcgcc25260ggctacctcgtcgccgacgacggcgccctcgscacggcaggscgggcggcccggctgcgt25320csggacctgtccgaggcsctgcccgcgtccatggtgccctccgagctgtacgccgtgccc25380gccatccccctC33CCCC33cggcaaggtcgsccgccgggccctgccgggcsccggtcgg25440cgcctggsgggcggcagcgagcgcgtcgccccctcgaccgatgccgagcacgccgtcgcc25500gcgctgtggtgcgssctcctcagccttccggssgtcggcgtccgstgaggacttcttcgga25560ctcggcggccactcgctgctcgccgccgscctcctgcagcgcctcgaacgcgscctgggc25620gcccgcgtccccgtcgccgsgttcttcatggaaccgacggtcgcccgcctcgcggcgacc25680gtcagccsgctcagcggtgccgtgc3cc3g3C3CC:g3CCCcgggcgtgcg25740gagcsgaccgcagsgccctcgcccgaggsccgcggggcag3cggtgscggatgggacttc25800cccaccgtgcgccggtccgtcacggtccccggctccgsccccgttcccctggaggtttcg25860ccgtgacccggcacgagcccaccgagacgttcaccgscaccctgcsccgcgtsctcgasg25920aacgcggcgcggsatcggccsccccgctgtccgtggaacagcgccgcctgtggctgctcg25980gcgggatcgccgacggctcctgggccgtcgtcaccggccgctaccggctccagggcgtcc26040cggacgccgcgcggctccasttgcggctggcctccctggtctcccgccacgaggccctgc26100gcagcgtgttcgtgcaggtcgccgsgcgtcccgtccgcctggtcctgccgttcgccgsgg26160tcgccctgcgcaicggtgsscgcacccgactccsccgsccccgcgcgggccgacgsgcacg26220tccgccscctcgtcgaggaggagttctccctcggccacggaccgctgctgcgcgccctgc26280tgctgcgctccgccgacggcggcgccgccgsact(:gtcctcgtcgggcsccgcctcgtcc26340tggscgccscctcactggacctgctcgtcgccgaactgctcggcgaggacgccccccacg26400gccgcgagggcgccgcggacaccgccggcgctctggsgaccaccctcgcggcggsacgcg26460agcgcctcgccgsccccgccctc3cgc33gcggt(jaccggccgggccgccgaactggccc26520tccccgccgccaccgagatccccggctscggccggcggcccg3g3tc3agggcacctcct26580acgcctccgtggcgctgcccctgccgctcgccctcgccccgcgcgccgccgsggccgccg26640sctgggaggccgcggtcgccgccgcctggctcgt(;gtcctcatgcgctcccaggccaccg26700gctccgccgtctgcggtgtccgcsccgcccgcggcgcggsgcaggccgggatcgtcggcc26760cgctcgacggcctgcgcctggtccgcgtcgacgacgcggscg3cgcgccgctgtcggacc26820tgctcgccgcggtcgggcgacagctgcgcgcccccgccgtggacgtgccgctcgcacacc26880tcctcgaagtggccccgccccgccgcgacatcagccgcaccccgtscgcccagsccgtcg26940tgcgggccgtggacgcccgcgsaiccgctcggcggggcgtcgggcaccggccggcsgatcg27000gcggagccgcctcgggcsccgagtacgscatcgaggtgscggtgcgcctccagcccggcc27060tcgccgtcgcacagstcgaictacgacgtccsgctgtsctccgaggsgcgggtccgcgccc27120tcggcaaccagctcgccgccgtcctcgacgccstcctgcccggcggcaccccggaggtca27180ccgccaccgtcgtcccgctgctcgacgccgagggcgcccggctcgccctggsggcgggcc27240gcggcgagcggaccgcgcccg3c3cccg3tcgctcgtcgacctcgtcgaggcccaggtgg27300ccgccgcaccggacgccgtcgccctgtggcccgagtcacgtacgcccagc27360tgtgggccgacgccactcgcctggccgscg3gctggccgcccgcggggtgcgccccggcg27420accgcgtcgcggtctggctgcgccgcggcccctcgscggtcsccgccctgctcgccgtgc27480tcgccgccggcgccgccttcgtgcccgtcgacgccgcctaccccgaggagcgggtgcgct27540acctgctctccgactcgcggccctcgctcgtggtcaccgagagctccgtgcacctgctcg27600gcgsactcggccttcccscgctgctgctggacgagctgtccggtgcgccggccgccgtgg27660acggcgcgcgccgccccgaccgggtcgccgccgscsctcccgcct3cctc3tCt3C3CCt27720ccggcaccacgggccgccccsagggcgtcgtcgtgcgccactccagcgtg27780ttgcctggcgccaggcgssctggcagctcaccgaagacgaccgggtgctgC3C33CC3C327840gcttctgcttcgacccctccgtctgggcggcgttctggccgctcgccaicgggcgccgcca27900tcgtcctcgccaccg3gg3gc3g3tg33ggaccccggcgagatga匕c3cgacgctccgcg27960accsccaggtcaccgtgctcggcggcgtgccctccctgctgtccctgctgctcgaccacc28020gcgatgcgggcacctgcacccgcgtccgcctggtq〖ctcagcggtggcgsacccctcsccg2謂0acsccctgctggsgtccgtcgagtcgscctggtcc:gccg3ggtcgcc33cctctacggcc28140ccaccgaggccaccatcgacgccsccggcc3ccgcgtsccgcgcggcgaccgcscggtcc28200cggttccgatcggccgcgcggtgtccaacaccgccgtccacgtcgtcgacgccgasctgc28260ggcccgtacccgaaggcgtccccggtgagatcgtggtcaccggcgcgggagtcgccgtcg28320gctaccacgaccggccggcgctgaccgccgcgcgc:ttcctgcccgcgcccttcgccgacg28380cgtccgacgacccgggtgcc3CCCtCt3CCg3accggcg3cctgggccgcaggctccccg28440acggctccgtccagttcttcggccgggtcg3cgs(:caggtC3sgstccgcggccaccgcg28500tcgaggtctccgaggtcgagtcggtgctcaaggc(:ctggccggggtccaggacgcggccg28560tggtcgccctggscgcgggt3ccg3g33cgcccg(;ctggccgccgccctggtccttccgg28620ccggctccg3cgccccgtccctcgaagacgtccgctcggcactcgccggcgaactccccg28680actacctcgtgccggsccgcttcgccgtcgtgga(:gagctgccgctc3ccgcc33cggc328740agacggaccgccgcggsgtcgccgaactgctctcccggcaggcagccgcaccggtcgccg28800gccaggacgggcccsccgsgccccgcsscgccctggsgcgctccgtcgccgaggccttcg28860cctccgtgctgcgcctccccgcgstcg3catccscgccgscttcttcgscgtgggcggc328920cctcgctc3tcctcgcgssgctggcctccctgctcggccgcaagcacgatgtcgagstcc28980cgctgcacgagttcttccgtacgccgaccgtcgccggcgtctccgagacgatcgsggtct29040accgccgcgagggcctcgccggcgtcctcggccgcaagcacgccgcgacgctggsgggcg29100acggcaccctcgacccctccatcagccccgagggccttcccgaggcggaatgggagsacc2916Qcgcggcgcgtcttcctcsccggcgccsccggctacctgggcctccacctcgtcgagcagc29220tcctgcgccgcaccgacgcggaggtcgtgaccctgtgccgggcccgcgscgsacsgcacg29280cgctcgaacgcctcssggagggcttcgccctctacgagatcgacgtcgaggsccsgctgc29340sccggstctcggccgtcatcggcgacctggccgagccgcgcctgggcctcacccaiggsgc29400agtgggacgacctggccgccaccgtggacgtgatctaccacascggcgcgctcgtcaact29460tcgtct3cccgtactcggcgctc33ggccgccaacgtcggcggcsctcagcgcgtactgg295206L3ctggcctgC3CC3CCCgCctc3sggccgtccaccacgtgtccsccstcgscaccctgc29580tcgccacccacstgccgcgccccttcctggagaacgacgcgccgctgcsctcggccgtcg29640gcgtcccggccggctacaccggcsgcasgtgggtggcggsgaaggtcgtcgacgaggccc29700gccgccgcggC3tcccggtcaccgtcttccgccccggcctC3tcctcggccacacgaags29760acggcgccscgc3geicc3tcgactacctgctggtcgcgctccgcggcttcctgccc3tgc29820gcatcctgcccgsctscccccgcatcttcgscgtcstcccggtcgactacgtggcctccg29880cc3tcgtgcscatatcccgg33gcgggasgccstcgscggCttCt3CC3Cctgttcaacc29940ccgcgccggtgccgctgctcsccttctgcgactgg3tcasgagctacggctacgagttcg30000scdtcgtgcccttcgsggagggcaggcgccgggccctcgg3gtcggcccc3gccscctcc30060tgtacccgctcgtcccgctc3tc33gg3cgcggaggccgsgccgcsccgggccctcgacc30120ccaagtacatggscgaggtccsgcccgccctcgaatgcgcggagscgctgcgcatgctgg30180ccggcagcgacatcgcctgcccgccgsccsccgaggccgacgcccscgccgtcatggact302403cctggtgcgcaccggcttcatgccggcgcccgcggscgtcgtccacgacgsgccgsgca30300gcscgctggaggagcgatgscggcsccggccgacaccgtccaccccgccggccsgcccg330360ctacgtggcccsggtggcc3ctgtgcccttccggctgggacggcccgaggaactgcccgg30420cacgctcgscgaactccgggccgccgtctcggcccgggcgggtgaggccgtgcgcggcct30480cwccgcccgggagcccgg3ccgscctggccgcactgctggcggcgsccg3gcggscccg305403gcggccctcgccccggtcggcgccggcccggtgggggacgacccctcggagtccgaggc30600caaccgcgacaacgacctcgccttcggcatcgtgcgcacccgcggcccggtggccgaact30660cctcgtggacgccgccctcgccgcgctcgccggaatcctggaggtcgccgtcgaccgcgg30720ctcgg3cctggaggacgcggcctggcsgcgcttC3tcggaggattcgacgcgctcctcgg30780ctggctcgccgacccgcacagcgcgccccggccggcgsccgtaccgggcgcaggcccggc30840cgggccgcccgtccsccaggscgccctgcgccgctgggtgcgcggacaccacgtcttcat30900ggtcctcgcccsgggctgcgcgctcgcgacggcctgcctgcgcgscagcgcggcccgcgg30960cgacctgcccggggcggaggcctcggcagccgcggccgaagcgctgstgcgcggctgccs31020gggsgcgctgctctscgccggcg3cgccsaccgggsgcsgtacascgagcagatccgccc31080c3ccctc3tgcccccggtcgC3CCgCCC3Sgatgagcggcctgcactggcgcgsccacgs31140ggtgctgatcaaggagctcgccggctcccgcgscgcctgggagtggctctccgcccaggg31200ctcggagcggcccgccaccttccgggcggccctcgccgag3cgtacgactcccscatcgg31260ggtgtgcggccacttcgtcggcgsccsgtcacccagcctgctggccgcgcatgggctcgac31320gcgttccgccgtcggcgtcatcggscagttccgcaagstcsggctgsgcgcgctccccgs3138Qacagcccgccacccagcagggagaaccctcgtgaaccgcggctcccgctccctgcgcgaaagcctggcccgctscggcccggccgccgcggcctgcgccgcggcggcctacgccggcacc31500tcgctcscctcaccgcgcccccgcccggcaagcgctccccaggagcagttctccgccgag31560cgcgccatgcgccscgtccgggccgtggccgccgaaccccacccggtcggctcgcgggcc31620gcggcccgcgtccgggsct3cctgctcgccacctgggcttcgagscggag31680gtccsggsggccgtcgcctcccacgacctggggcccaccccctacggaccccggtacctc317403ccggggg3gtggtgcgc33cgtcstcggccggctgccgggctccstccccggacstgcc31謂gtggcgctgstgacgcactacgsctcggtgtcccagggcccgggggccsgcgscgcgggc31860gt3ccggtggcsgcgctgctggaagcggcccgcgcactgcggaccgacggcgtgcsgccg31920gtcascgacctcctggtggtgttcaccgacggcgaggsagcgggactgctcggcgcccgg31980gccttcttcgsccggcacccgctggcgaagacggtgggcgcggccttc33cttcgaggcc32040cgcggcaccgagggccccgtcctg3tgttcgaggccggcccgggcaacggccccatgctg32100gsgga3ctggcccgcaccggcgtacccgtgttcgccagctccctcttcg3cgcc3tctsc32160cggcgcatgcCC33CgCC3Ccgacttcgccctcgtcasggagagggggatcccgggtctc32220aacttcgccccttcgccgcct3cca,tggccccctcgacgacatcgaccat32280gtcgaaccatcggccctccsgcaccagggcgs3ctcgc3ctggcgctggcccgccgcctc32340ggctccgccgacctggccgssctcaccggcgaggacgcggtcttcttccccgccggacgc32400ggaaagctgstcagggtccccggcsg3gccgtcgtccccctggccgtcgcggcaggcctg32460ctgggcgcgggcggcctcggscaicgccctgcgcaa.gggcggcctgaccggccgccggctg32520gccggcscc3gcgccctgctgctcggccggstcgccgccggcgcggccgcgggtacggcg32580ttcaccgccgccstgggcsa3ctggcgcccgagttccggcggcacggcgacttccacggc32640tccggcgscstctacgccgccgtcgccgggctcgcctgcgccgcggtcctcgccggtccc32700ggcgacgccccggcggcggccgcccggaccgccgcggcc3cactcccgctggccgccgcc32760tctgccgcgctggcctcgggactgcccggtggcagctacctgacgacctggccgctcgcc32820ggcgcggggctaggactgcggctgctcgactccgcgcsgccgacgggcsccggacgggtt32880ccgtccgtscgccacgcggtgggcgcggccgccgctgccgccccggccgccgccctcctc32940gtcccgctgtcccgsctcctcttcssggggctc3ccccgcgcctggccggcaccgccgtc33000gtcgccctccagctctgcggcgasctggccgcccccgcsctggccggactgccccggcgc33060acccgcaigggccgccgccgcgggcggcctgctgctcgccgggggcgtggccgcgcggcgc33120ttcctgcggcgcgggaagggtgctcccccgccggagaccatcagctacctcctggacccc33180gtgcactccacggcactgtggacgaatcggcctcggsgcggacccgggcc33240ttcctcgggg33c3gccgg3gaagggccgcctgccggagcscttccccggctggcsgcgg33300gscttcctgcacgcccccgcccccaggctcgacctccccgcaccggaagtgacggtgctg33360gsggscggggccggcgsggccggccgscgccggctgctgctccacctgcgttcsccgcgc33420ggcgcscgccagctgtccctcgccgtcgtcggcgcaggagtgcggcggtggcgcgtcgag33480gscggcccgtggtccgsgccgtccccggccg3ggscg3csgctgggagctgtggctccsc33540gccgtgccggacccgggcatccgggtcgaactcg£i3cttccgtcgtccggsscccggctc33600cgggtcctcgaccggatcgacggactgccgc3gg3gctcgcggcggctccgggcscagst33660cccgccaccgggstcccggagcac3tcagcccgg〔:3gccgccctcgacgtggacacctgg33720ggcaacgcctcactcgtcgtacggacggtgcaccttccggcccaggaagggccggc3tgs33780agtgscsgggttggccactttcctgggcgatgscg3cctcgtcccctgcg33840cggtcgtcgtcsscgscgccgcccagtactcgatctggccgcaggaccgtcccgtccccg33900ccggctggcggccggccgggttcgaaggcccgcgcagcgcatgcctggsacacatcgaga33960cggtatgggccggtcccscccccgcccccgccgcctg3ccgcdc3ccscc3402033CC3gC3CCatgaccatcagcctcgagsaC3ccsccgtcggccsg33ccccgccggtgg34080ccccccgdccggcaaggcccccctggscatggagggtctcgcctggatcctcttcggcgc34140ctccgccttcc3gt3cctca3cgccgcgtgcgsgctcsscctcttcgagctgctggagaa34200caagcccggtctgsccaagccgcagatcggcgccgsactcggcctggccgsccgcgccaa34260cgacatcctgctgctgggcgccsccgcgaccggcatgctcaccgtcgaggacggccgcts34320ccagctcgccscggtcctggccgagctcctC3sgacggacgactggcsgcgcttcaagga34380cacggtcggcttcgagcagtacgtctgctscgsgggccagstcgacttcaccgsgtcgct34"0gcgctccaacagcsacgtcggcctgcgccgggtccgcggctcgggccgtg3CCtgt3CC334500ccgcctgcacgagaacccgc3g3tgg3gc3ggccttctacaagtacatgcggtcctggtc34560ggagctggcc33CC3gc3cctggtcgaggtcctggacctgtccggcacctccssgctcct34620cgactgcggcggcggcgscgccgtcaactcgatcgcgctggcccaggccaacccgcacat34680cgaggccggcatcctggagatcccgccgactgccccgctg3gatcgccgs34740ggccggcctcagcgaccgcatcacggtcaagccgggcgacatgc3cscggacgagttccc34800caccggctscgacacggtgatgttcgcccsccagctcgtcatctggsccc34860cscggcgctgctgcgc33ggcgt3csacgccctgcccgsgggcggccgggtgatcatctt34920csactcgatgtccs3cgeicg3gggtg3cggcccggtcgtggcggcgctcgacagcgtcts34980cttcgccgcgctgcccgccgsgggcggcstgatctactcctgggccscct35040cctgaccsaggccggcttcsaccccgagaccttccsgcggatcgacttcccgggctggsc35100cccccacggcgtc3tc3tcgccsccsagta3tccacggcccgtgcsgcggcccggtcccg35160ccctctgggcggsggccgggccgctgcacgggcttccgcc3tgccgc3ccgcggsggscc35220tgatgctgaccsctccggactgaggatcgtccgctggccgctggssscgg35280ccaagcgggagcgatgcagggssgccggsgccctgcggatcctcctcgtcg3g33cggcg35340cgaagccaccgctgacggccgscctccgggsggactgggtcagggcccccgtcsgc3ggg35400aagacctcgcggcccgggtcacggccctgcgcgccaasgccggtgcctatcgggtcccca35460ccctcgacgtggscggcgeigctgcgcttcg3cggccgctcggtcgtgctctcccgcgcgg35520sggtcgccatcgtctccgtactcgtgcgcsstt3cggctccctggtcgcc3ggg3gtcgc35580tcgcgcscctgccggscgctccccg3gcgggcsgccgcs3cgcgctcgscctgcscgtga35640tgcgcatccgccggcgcstcgccccgctgggcctg.ggcatcagcacggcctggggccgcg35700gctscctgctccsggsgctgacgcaggasgcgcagggacaggscgcgggagg33cctagg35760gccgccccctccgtatccggcccggcccgcgacc5,ggcggssgcggcggg3gcggsascg35820gcggacggggcgctcsccgtcggctgscggctcgcgccccgctccgctccgcggcccgct35880cttcgcgggccggccctcttC3C3CCCtCCcggccsccggatccgctgacatggactccc35940ccagcaccggccsgccgcgctccgcc3gcc3gtccg3cagcatggacgcc3tgccggcgt36000cgaccgagaggscggcgsggccgacggggggtcctcgatcC3C3g3tCCt36060cg3cgttcscgttcaactcgctC3C3tccgccata(:3gccgggccagttccccctcgcggt36120ccgtgaggagcacctggagcttgtcgtacgcgcgt:cctgccgccccgtscttctgcggga36180tcsgc3gctgtcccgcccggCCCtCCtCC3ggag〔cggcggsgcagctcgcggtsggccg36240gggcctcggcggcctcggcggcccccggggasgcggcgagcgcgtgcsggst3cgacggg36300tc3tctccsggtcgtgctgcaggttttccagggcgccgagcsccgcggtggcgttggatc36360cgaggatgtcggtccsc3ggtcggggtcgccggccgcgstcctggtgaagtc3cgcagcc36420cggagccgg3cagctccagcgcccggtggtcgtaggaccgcsgcaggcgggccatcagac36480tgctgacgsggtgcggcsggtgggaggtcatggcg3cggcccggtcgtgtgcctccggct36540gcatcgtcaccacgtccgccccgcacaggccgatcagccggcgggccgtggcgatcgtct36600cctgcgaggtgtcagcgctgggggtgaggatccacggcttggccctgsacstgtcgctgc36660gggccgcgccgggccccgggtgttcccggccggcg3tggg3tgtcccccgstgasccggg36720aggcgtcccsgccgaggtccgcccccgccgcggggagcgccgccttgacgctggccacgt36780cggtgtagtggcgggcgatccgccgtcgctgggcgtcagcgsgsgccgttcgcacgctct36840gcggaggcscggcgatgsccgccaggtccgtctggcgctcgggggcgagcgcggtgcccg36900cccccctggcctcggcgstccgcaccgattccgcatggacgtcctggsggtgg3cccggs36960caccgt5ictcggtgagtgccaggccgstggaggttccgatcsggccggtgccgstcscgg37020tgscgctgstcsgcgsaccggcgtcttcggtgtttccgstggsggtcacctggtggtccg37080tccgttccacggccgtcagtsgcggtsgtggtcgggcttgtaggggccctcgscctggsc37140gccgatgtaggcggcctgctccgggcgcagggtggtgagcttgacgccgagcgcgtcgag37200gtggaggcgggcgaccttctcgtcgaggtgcttggggagcacgtagacgtcggtcgggta37260ctcggacggcttggtgaagsgctcgatctgggccagggtctggtccgcgaaggagttcgs37320catcacgaacgaggggtggccggtcgcgttgcccaggttgsgcaggcggccctcggsgsg37380C3cg3tcsggsccttgccgtcggggaacttccsggtgtggacctgcggcttg3cctcgtc37440cttgacgatgccctcg3tctttgccaggccggccatgtcgatctcgttgtcgssgtggcc37500gatgttgcccscgatcgcctggtgcttcatcttggcc3tgtccgaggccatgatgstgtc37560cttgttgccggtcgtggtgatgaagatgtccgcgatctccaccacgtcgtccagcgtggc37620gacctggtagccgtccatcgccgcctgcagcgcgcagatcgggtcgatctccgtgacgst37680gscccgcgcgccctgcccgcgcsgcgactccgcgcagcccttgccgacgtcgccgtaacc37740gcacacgaccgcgaccttgccgccgatcagcacgtcggtggcgcggttgstgccgtcgat37800cagggagtggcggcsgccgtacttgttgtcgs3Cttcgacttcgtcacggcgtcgttcac37860gttgatcgccgggaacsgcagggtgccctcggccatcatctcgtagagscggtggscscc37920ggtggtggtctcctccgtcacsccgcggatctcggacgcgsgctgcgtccacttctgcgg37980ggtctcgccg3gggtgcggttgeigcagcgtgsggatgtggccgtactcctcggsigtccgc3,0ggtggscgggtccggggccgcgcccgccttctcgasctcgacgcccttgt38100ggtggcgtcaccsccgtcgtcgsggstcatgttcgggccgccggtgggggtgttcggcc338160ggtcagcgcctgctccgtgcaccaccagt3ctcctccsgcgtctcccccttcc3ggcg3338220gscggggscgccctgcgggttctccggggtgccgtccgggccgsccgcgsitggcggccgc38280ggcgtggtcctgggtggsgsagatgttgcaggaggcccagcggscstccgcgccgsgcgc38340gacgagggtctcgstgsggscggccgtctggsccgtcatgtgcagggagccggtgstacg38400ggcgccggccagcggctgcgccgcggcgtactccttgcggatcgacatcaggcccggcst38460ctcgtgctcggccsgggtgstctccttgcggccgasggcgggtcggcgac38520cttgsaatcggcgggggacaggttcgtcatcgttgctccgtgsggcggtgagggscggtt38580cagtggcgggctgcggcgggctgccgggcaggcgacagaccgaggttccgatggstctcg38640acggtcgcggtggsccggttgsgggtgstgtagtgcaggccgggcgcsccttcggccsag38700柳cggtcggccstggcggtggcgtggtcgacgccgatgcggtggccctcggsggggtcg38760tcgcgggcggcgtcgagccgtcgggcg3gttcgtcggggaacgscgcgttgctgagttcc38820gcgasgcggcggatctggcgtacgtcggtggcgggcstgatcgcggggatgatcggcgtg38880ttgcagccgacgsccgcgaccctgtcgcgcsgccggaggtagtcgtcgscgcgga3g33c38940atctgggtgatggcgtagtcggcgccggcccggcacttggcgscgsagtgscggatgtcg39000ctgtcccstgtccggcgsgcgtggstgccgctcgggsasggccgccacgcc39060tctcccagttccctcgtgagtcagcggcatgacggsgcccgtcgggstgg39120gggacccactcgcccttgggstcgcccggcgggtcgccccgcagagccaggscatcccgg39180atgccggcgtcggcgtactggccgstgstgtggcgtsgttgggcgatcgagtggccgacg39240gcggtcaggtgcgcgsccgggcgcsgcgtggtgtccgccgcgatccgcttggtcacctcg39300accgtccggtcccggg3gg3acctccggccccgtacgtgacggsgacgascgscggtgag39360aaggcctcgatgcggcggatcgccctccacsgcgtccgctcgccggctgccgtcttcgga39420gggaagsactcgaacg33t3cgacttctgccccgtgcggagcaggtcccccaggcggccc39480ggacgggcgcggt3cggg33ctccgcggtctctgctgccstggtggtccttcgggsttcg39540gtacgggggsgccggatcsgcgcgcgttgsagtacgtggcgtccgggtggtggatgscga39600tggcgtcggtggactgctcggggtggagctggaactcctccgagaggtggacgccgatcc39660gctccggctggaggaggtcggcgstcttcgcgcggtcctccsggtcggggcaggcgccgt39720agccgsgcgagssgcgggcgccgcggtscttgaggtcgaacatgtcctcgaccttctcgg39780ggtcctcgccgccg33gccgagctcggcgcggacgcgggcgtgccsgtactcggcgaggg39840cctcggcgagctggacggagsgcccgtgcagctccaggtactcgcggtaggagtcggcct39900cgascagcttggccgtggcctcgccgstcctcgaaccgacggtgaccacctggaggccga39960tgacgtcggtctcgccggsctcctccgggcggasgaagtcggcgaggcacagacgccggc40020cccggcgctggcgggggaisggtg33gcgggtccgctcgttgccctgctcgtcg3gg3tga40080tcaggtcctcgcccttggsg3cgceicgggsagta>gccgta_geiccscggccgcttccsgca40140ggttctccgtgtggagcttctccaggaggccgcgcaggcgcggacggccctcgctctcgs40200cgagctcctcgtsggtggccccgccggcacgggcctgcttgaggccccactggcccttga40260acagggcgccctcgtccagccaggaggcgtagtccttgagcgggatgcccttgatcacgc40320gggtcccccag33gggcggggtgggcaccgggttgtcgacggccacgtcggagcggccgc40380cggtctcctccggctcctccscctggagggccggggtgtcccgcttggcgacgcggcgct40440gcttgaggggcgggsgttcggcgccggggattccgcgcttgacggcgatgsgcgcgtccs40500tcagccgcagaccctcgaacgcgtcccgggcgtagcggacctcgccttcgtagstctcgt40560ggsggtcctgctccacgtsggcgcgggtgagggcggcgccgccgsggstgaccgggteigt40620cggctgccsgcttgcgctggttgsgctcctccaggttctccttcstgstcaccgtcgact40680tgscgaggsgtccggacatgccgatcacgtcggccttgtgttcctgcgcggcctccsgga40740tcgcggagaccggctgcttgatgccgatgttcacgacgttgtagccgttgttggtgsgga40800tgatgtcgscgaggttcttgccgatgtcgtggacgtcaccgcggacggtggcc3ggacga40860tggtgcccttgccgtcggcgtcggtcttctccatgtgcggttcgsgataggccaccgccg40920tcttC3tC3Cctcggccgsctgcaggacgsacggcagctgcatctgcccg40980gctcaccgacgsccttcatgccctccaggagggtgtcgttcacgstgtccagcgccggac41040gggatgcgagggcctcgtccsggtcggcctccaggccgttcttctccccgtcgstgstcc41100gccgctggagccgctcgtccagcggcagcgcgaggagctcctcggcscggcccgccttcs411603cg3cttcgtgttgacgccctcgaagsgggccatcagcttctgcagggggtcgtagtcgc41220cctcacgccggtcgtagatcagatcgagtgcggtgctgacctgctcctcgtcgsaccggg41280cgatcggcsggstcttcgacgcgtgcacgatcgccgastccagacccgccttcacacsct41340cgtccaggaaasccgsgttcagcagcac3cgggcggccgggttcsscccg41400tcgscagscccagcgtcgtctgc^cctccgggtgacggcgcttgagctcccggatcgact41460cgstcgtcgcgatcccgtccttgcgggactcctc(:tgaccggtgcagatcgtgascgtca41520gcgtgtcgatcsggatgtccgactcccggatcccccsgttactggtgaggtcctcgettca41580gccgctcsgcgstggccsccttgtgctcgaccgagcgggcctggccctcctcgtcgatcg41640tcagcgcgatcagcgcagc3ccstgctcctgcgccsgccgggtcsccttcgcgsascggg41700actccggcccgtcaccgtcctcgtagttgscggagttgatgsccgcacgcccgcccagct41760tctccagscccgcctggstcaccgggacctcggtcgagtccagcacgatggg33gggtgg41820aggcggtggcg貼gcggccggcgagttcctgcatgtccgcgacaccgtcscggcccscgt41880agtccscgcacsaatccagc3tgtgcgcgccctc<acgg3tctggtcccgggccatctcc3"940cacsgtcgtcccagcgggcctccagcstcgcctcacggaacttcttcgacccgttcgcgt42000tcgtccgctcaccgatcgccatgtscgsggtgtcctgccggaacggcsccgtctgatacs42060gcgacgccgcacccggctccggctgcggcgaccgctcggtgaccgccgtgccccggaccc42120gctcc3cg3cctgccgcaggtgctccggcgtcgtaccgcagcsgccacccsccsgcgscs42180gcccatactcccggscgascgtctcctgcgcgtccgccagctcggccgccgacagcggst42240agtgcgccccctgcttcgtcagcaccggcagacccgcgttcggcatgcacgacagcggaa42300tccgcgcattgcgggccsgatagcggaggtgctcgctcatctccgccggacccgtcgcac42360agttcagaccgatcatgtcgatccccsgcggctccaacgccgtcagcgccgcdccgstct42420ccgaaccgagcsgcstcgtgccggtcgcctccaccgtcsccg3gc3gatg3ccggcatgt42480cttccccgcagttgctcaacgcccggcgggcgccgatgatcgcggccttcgtctgcagga42540ggtcctgggtcgtctccsccsgcagggcgtcggcgccgccggagstcsggccctcggcgt42600tgscctggtaggcgtcgcggatctgggcgt3gtcg3tgtggcccagcgtcggcagcttcg42660tgcctgggccceitcgsccccagcscccagcgctgccgcccggtcgsggccgtgaactcgt42720cggcgacctggcgggcgatgcgggcgccggcctcggscsgctcg33gttgcgttcgggga42780tgtcgtsctcggccagcgccgcgasgttcgcgccgaacgtgttggtctccscgcagtcga42840cgcccacggcgaagtsctcgcggtgcacgttggccacgatgtcgggccgcgtcacgttca42900agacctcgttgcagccctcc3gctgctgg3agtcctccatcgacgggtcctgcgcctgca42960gcatcgtccccatcgccccgtcggcgacgsccacgcgggtcgcgagcgcctctcgcaggg43020cgtcacttcgtgcgctcacggcttggcctggaggtggggcgtgatctcggcggccggtcc43080gtcaccgtacgcgtccgcgatccgggccgtgsgggacgsccggtccagccggtattcctg43140cgtgcctaccg3ctcc3gg3cggttgtggccagcgcgcagcccagctgggctgcgcgctc43200ctgcgacagcccctggccgstccccgccsggtscccggcgcgg33cccgtcgccgscgcc43260cgtcgggtcggcgacgttcttggcggggacggccggcscgctgscggscggggtgtcggt43320gttgtggatcgtgacgccgtcaccgccgcaggtgatgatccaggtgcccacccgctggsg43380gscctggtgtgcggtccatccggtgcsctcctggsgcsgggcggcctcgtactcattggt43440tcggcgccgtccaccagacgccgggtctcgtcgccattcsgccgcgcgag43500ctgctgcgacggatccgcagcgsgggg33tgcccagttcacgggtcttcgccgagtgccg43560csgcatcgcctcggggtcgtccggcgcgacgacgscccggtcctgcggtcccg3scggcg43620csccacatcggccaggtcgstctcgcgggcctcgaccatggcaccggtgt43680gatctggttctggtccgcgtccgtggtgc3catgaaacgggccgtgtgccgtgtggcgct43740caccctcacggstgccggtgtcgtgttcgcg33gccsggsgtcgtactcggggaagtcggt43800gccggcggcgccgaccsgcstcccagscgtcccsggccgaatgcgatgtt43860cgccgcgacgccgccgcggcgcacctccsggtcgtccaccaggaaggacascgagstcgt43920gtccsgccggtcgggtatcagctgctcggcgaascggccgggg,gtcgtcagatggtc43980ggtggcgatggstccggtgatcgcgatgcgcatgtcagacccccgcgaccctgcgcsggg44040cgtcggcgcggtcggtgcgttcccacgtgasgtcggtcsgctcgcggccgsagtggccgt44100acgcggcggtctgcgcgtsgatgggccgcagcsggtccsggtcgcggatgatggcggcgg44160ggcgcaggtcccgatggcctcttcgatcttcgaggtgtcgaccttggcgg44220tgccgaaggtCtCC3Cg33Caggccgscgggctcggccttgccgstcgcgtscgcgacct44280gcacctcgcsgcgcgsggcgagcccggcggcgsccacgttcttcgcgscccagcgcatcg44340cgtacgcggccgascggtcgsccttggacgggtccttgccggagaacgcgccscc3ccgt44400gaicgggcc3tcccgccgtacgtgtcgatgatgat':ttgcggccggtcagacccgcgtcgc44460ccatcgggccgccgatctcgaatcggccggtcgggttC5ccagcagccggtagccgtcgg44520tgtcgagcttgatcccctcctggscgagctgcttcaggacgtgctcgaccacgtgctcgc44580ggacgtcgagggtgaggagggcgtccsggtcgacgtccgcggcgtgctgggtggagacga44640cgacggtgtcgsggcgc3C3gccctgtcaccgtcgtactcgstcgtgacctgcgtcttgc44700cgtcgggacgcsggtsiggggatggtgccgtccttgcggacctcggacagccggcgggaga44760gccggtgggcgaggtggatcggcsgcggcatcagctcgggcgtctcgtcgcsggcgtscc44820cgaacatcaggccctggtcgccggcgccctgcttgtcc3gctcgtcgccctcgccctcg344880cgcgctgctcgtaggcggtgtccscgccctgggcgatgtcgggcgactgcgcaccgatcg44940agatcgacacgcc3cagg33gcgccgtcgsagcccttggcggsggsgtcgtagccgatgt45000cgagaatcgcgtcgcggscgsgctgcgctdtcggcgcgtacgccttcgtcgtC3cctcgc45060cggcgacatgcacctggccggtggtgatcagggtctcgaccgcgaccctgctgctcgggt45120cctcgcgcagcsgggcgtcgaggstggtgtcgctgstccggtcggcgatcttgtcggggt45180gaccttcggtcsccgattC3gscgtgaiagsgacggcccatgacgagttcctccaggattc45240tcsggcggcgcggaccgcscgsatgacgttcatgccactgggcagggsgtccacacgggt45300aatgcggsacccggccttgctcsgcsggacttcgacctgcgccgcagtccgttcctggcc45360gccgttgccgccg3scsgggagagcstgtagaggtccstc3gggccgcggtgtgcagccg45420ctcgtcccgctgtccgcggttcgcggcgatcaggtcgacgaccatcagcgtcgcgtcgtc45480ggacatggcggcgcgacsggtgcgcaggatctgcsccgctcgatcgtcgtcccagtcatg45540caggstgtgcagaggtcgccccc(:ctcggggaggctgtcg45600ccgttgtcacgggsgtsgcggccctccsgcggs-二ccgcggtggggcgctccgcctggagg45660gacgtcgagccgggcsgatcgaagacgatgccgctgatgtcgggatgtgctgacaggstc45720ctgcctgcaggtcgsctctagaggstccggttccgtcgctgccgcctacg45780tccscgaccgtgggataagc3tcgcgcgcaggatctcgtccagctccaga45840gccsgtccgcggccctgggagtcgtcgaagaccgcgcgssgctcctcgtcctgctccatg45900tgggtgaagagcgagscgcctcggctgcgctcgaacggcg3ctgtcccgtccgcacggtc45960tccagcsagtcgccccaggcggcaccgsactgccctgcceiccagcgtggccgacggasgt46020gcggsagccgggtgcccggaggtgagggcctcaccgagcggsgtcsgcgagtagacctgt46080tcggcgccctcctcgaacacgccgaacacggcgagsccscgcsgcsggcgggcgagtgcc46140acggcgtccgtgctggtccgctccgcgagggtcgagatgtccgcsggcccctccgcgagg46200atgtccgcgatccccaactgcgcgacgscgscc3gcgcscgagaaggaagctgggcatsc46260agcatccgtctcstgscggc3gccgcctcccgctgttccgcgctcaactcgtscacggtc46320gcctcctgactggtctcctcgcgccaccgggatggagacgccccacgatggtgaccgtcg46380gcggaccggcccttccgccacgggccsgatcccgaaceigatttccccggsccccgctcgc46440gccggtccgagsagcccccsccacccggttccggctccaccgcgcttccgtcsggactcg46500gtcstccgttgagcggccct3CCC3C3CtC3tggcggcccggcaccctgtcggccgctga46560sccgagaggggggsggssccggtgscggsgaacccgcgacgcccgcccctggtggtgggg46620gcgsgcgccgcgggcctgacggtcgcccacgaactggcccggcgtggsctgcccgtgcgc46680gtcatcg3agcctcsgccggaccggccacgacsgggcccgcggtgacactccagccgcgt46740accctggssgtgctcgaccagatggcggtcatcgacggcttcctccgccacggacgcaag46800ggcgagsccctcgccgaccacacgacgctgCCC3CCCttC3cccgtscacgctcgtC3tc46860gaccgggcccgggccgcggccctgctccgcgsagccgtcgtcgacctcggcgtaagcgtc46920gsggggggcgcgcgggsccggcaccctgccgccccggcttgtctg3tc3Cctgcgacggc46980ttcccgcagcgctcggstccggtcggccgccgcctcsccgtcgtggcggaccscgscccc47040gacsccgggatccggcaggctt3C33CCtCgcctggsaactggcgstggtcgsgcagggc47100ctcgtgggtccgcgcttgctggscacctacgacgaggagcatcctgcggagcgaggccac47160cggccgggg3tccgtgcsctcctc33cgctgtccccgctctccggcgcactcatccasgc47220csccgctcsgggctggggctcacctaccgg33c3gctccctgsccacgcggaacaccctc47280gacttcctctccggttaccacgtcggcgcggtgggsttcgasccgactcccgtccccgcg47340cccggttcccgcgtcactgaggccgggccggcctgcaacctgctgcgctccgsgctgsgg47400gsicccscgatggtccctgctgttcgcgacggacccagccggcacccsgggctcaccggsc47460tccgtggccgccaccgccgctgcccagtacagcgcctacttgtcggtccgcatcctggga47520gccgcttccctccccgaccccctctgccgactgcgcgccgcgctgggcacgcctccgggc47580ggctggctcctgatccggccggacggctacgtggctgcacgcggcagcctcctgsccagc47640caagccctcgaccgggcsctgtccccgctctccgtcgcccCtCt33g33tcggagccasc47700cc3tgtggctgatgtggttcttccgccgtctcgtgacaacctccgcggacgtacgcccct47760accatcacgagcagccccggcsccgccgccgccscctgcggcgggccactgccggccgac47820gcgeictgscggcctgcccggtca_tgtttcaiscgcaLcatcgsgccgagtgatgccstga_ccgg47880gcccccgsaccggcgcggttgtcgggacgacccgcsgscgccgctgcggc47940tgcggcgactga_cgggctgcgcagagacggcggtcgaaggcgttgsgggs3gtctccgcg48000gcsgccacatgccggccggscscggggtcgcatcggc3cgcccttgsgccggattccgct48060cgttctcacctcagsccstgcacagtcgatgccctgccsccsgsctC3gcgcc3cggtc348120C3ccgeitcggcaccacgacgtcgsigcgtgctgcccaagccgcccgcgscg祖80gcctgcgggsgssgcsgtccccgcactatgtgsttctgscgctctccggtgscgggtcgt48240cgatctcagtgg3gca3c3ggg3ccggg3gaccgctacgagttcctcgacaaactggccg48300gg3ggsgctgccgctacggcctgtgtaccgtgccct:ccaagstgggcgaa3c3c33gtcg48360tcttcgtccgctggacccctgtggacgcgcccgccgaagaasagacgctctatgactcct48420acgaggageigcgcccggcaagcgctcactgctccccaggacgtggctccgcgcaccgscc48480gctccgasccgcsggaccttcgcgggcgccgasc3cctgsggcaacccgagggcggtg3348540sggcccagg3ccctgaccgccacacggcccgtctgagcgtgsggtctcgcccggccaccc48600gaccacctggaccatgccttcgccccgsaccscasgggg3tgccggtgcgcgacggsgcc48660cccgsgcccgsccgggcaccgcccactgggtgatg,agcggcgtgtcggttcatgctgccg48720ccsggctttggtgaggcggacgacgcgagcgggggtggcg3tgccccgcscggtcgcgag48780cttgccgtccgsgstgtcgsscgtcscggcgccg3cg3ccctgccgtcgg48840aatggcgagtgcgtagtggacagcggagacgccgccgagccgctgcttcgcgggagttgs48900cctgaagccggcccgtgcgatggcggcgatgcgctgaggagtgtcgcaccgcsgcagctt48960ctcggtcsggccggctccgtcggagstcgcggtcgcgtcgtcggtgagcagtgccaccsg49020gtgttcggtcttccccgctgccgccgcgccgcgccccgacgstgcgctgccgggcccggt49080ggaggtgctgctggcttgcggactcggtgstagcgaggatctcgacgatctcggcgcggg49140tgcaggagascgcttcgcgcsggacggtgcgttcgcggcgggcctgtgccg3gcgg3gct49200tgacg3ggc3cagtgaccsgcgtcgsagaccsctgttctcccgacccgcgsggccgggcc49260sctggccgcctcgccgtcgagscctgcgttgccgcscctcgcgactgcctgcgcasccag49320tcgcccastggcgcgctggcggagcccacccgsctcgg3tcc49362<210>2<211>195<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1ValAsplieLeuArgAlaAla15AlaGlyLeuThrLeuAlaAla20GlyThr!LeuLeuGinArgPhe35lieAlaArgGluSerAlaLys5055ArgAspGluHisAlaThrAla6570ThrSerMetGlyValMetThr85AlaPheLeuCyslieAspLeu100LeuThrlieHisGinAlaGin11552AlaAspVallieGlyArgValGlyPro1015ValAlaGlyGluValGlyLeuValPro2530GlySerLysArgGlyLeuMetLeuAla4045GluAlaGluSerValTyrGluSerMet60LeuMetAlaValThrAspLeuAlaVal7580LysProGluValTyrAlaAsnHisLeu9095ThrAspProGluPheHisAlaHisAla105110ArgArgValTyrGluAlaLeuLeuThr120125GluAlaValAspAsnGlyGluLeuLeuAlaGlyThrAsnValAlaAla130135140LeuAlaThrThrLeuGinAlaValValAlaGlyAlaGlyLeuThrTrp145150155160AlaLeuAspArgGluGlyThrLeuProHisArgLeuGluArgGluVal165170175lieThrAlaLeuAlaProHislieThrProGinGlyAspSerAlaGly180185190GluGlySer195<210>3<211>304<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1ValThrAspGlyValArgThrLeuAlaGlyLysValAlaLeuValAla151015GlyGlyThrArgGlyGlyGlyArgGlylieAlaValGlu!LeuGlyAla202530AlaGlyAlaThrValTyrValSerGlyArgSerSerSerAlaAsnGly354045SerSerAspMetGlyArgProGluThrlieGluGluThrAlaArgAla505560ValThrAlaAlaGlyGlyValGlylieProValArgThrAspHisSer65707580SerProGluGluValArgAlaLeuValAspArglieAspAlaGluGin859095GinGlyArgLeuAspValLeuValAsnCysValTrpGlyGlyAspArg100LeuThrAspTrp115LeuArgLeuLeu130TyrAlaValArg145ValThrAspGlyValAlaLysSer180His105AsnArgProLeuTrpGin120ArgGinAlaValGluThr135LeuMetAlaAlaArgArg150AsnThrAlaArgTyrArg110GinAspLeuAspLysGly125HisVallieThrSerArg140SerGlyLeuVal155GlySerPhePheLeuLysGlu195ArgSerGlu210ArgAspGly225AlaTyr!Leu165ThrVallieArgGly170LeuAlaPheAla185ValAlalieThrValAlaAla200AlaMetLeuGluHisPheGlyValThr215220ValAlaArgAspProAspPheAlaHisSerGlu230235AlaValAlaAlaLeuAlaAlaValGlu160TyrAsp175GinAlaAlaAsp190ProGlyPheLeu205GluAspAsnTrpGlyArg245MetAlaLysThrGlyArgAlaLeu260ThrAspTyrGlyPhe275TrpAlaArgThrLeu290lieAspGly280GluProArg295AlaLeu250AlaThrTrpGly265SerGinProAspLeuGlyProLeu300ThrPro240AspProAsp工le255LeuTyr!LysGlu270PheAlaAlaHis285GlyThrProLeu<210>4<211>169<212>PRT<213>Str邵tomyceslavendulae314(NRRL11002)<400>1ValLeuAsnAspAlaAlaAlalie5ArgArg20LeuValLeuGluMaGlyValGluSerLysValGluLeu工leArgTrp10AspAlaAlaGluAsp50lieAsp65AspPro35TrpCysArg25GluAla40ProValArgThrProValSerArg55LysAlaLeuArgAsnGlyProThrGluValValProSer130ValLys145ValMetThr100ProArg115ArgAsnVal85AspVal70:LeuGlySerAlaArgTyrArgLeuValAsnGluValLeuAl3lieuLeu120lieuArg105GluArg90LeuAlaGinGlyValValCysThr45SerAspLeu60AspPheLeu75MetSerValAlaGluSerGluSerAla165150AspAlaAlaSerMet30AspProVal15ArgLeuValArgArgAlaArgHislieAsp135ProLeuAspLeuAlaLeuArgThrProValPhe110LeuSerArgThr125MetArglieArg140ValTrpGlyArgThrVal80!LeuSer95SerHisSerGinArgArgGlyTyr155160<210>5<211>512<212>PRT<213>Str鄰tomyceslavendulae314(NRRL11002)<400>1MetLeuGluArgArgAlaValLeuArgMet1510ValAlaValGlyGlySerMetValAlaGly2025AspThrGlyValArgArgSerAlaGlyVal3540SerArgLeuSerGlyAspLeuValLeuPro5055GinAlaSerArgLeuAlalieGlyGinPhe6570AlaValAlaTyrCysGinSerGluGinAsp8590PheAlaGinAspAsnSerLeuArgLeuVal100105SerPheGlyGlyTyrSerThrThrThrGly115120SerAlaGlyAlaAlaAla15ValAlaSerAlaAlaAla30AspTrpSerArgLeuSer45AlaAspAlaArgTyrGlu60AspAlalieArgProGin7580ValArgThrAlaLeuAla95ProArgSerGlyGlyHis110LeuValLeuAspValSer125ArgLeuAsnSerValArgLeuThrAlaAspThrValValValGlyPro140LeuAlaProArgGlyVal155160CysProGlyGlyPhelie130135GlyLeuGinGinValAspAlaLeuThrGin145150GinValValSerGly!>euCysProGlyVal165170175GinGlyGlyGlyLeuGlyTrpGinThrArgLysPheGlyMetAlaLeu180185190AspArgLeuValSerAlaArgValValLeuAlaAspGlyArgAlaVal195200205ThrAlaSerAlaLysGluAsnCysAspLeuPheTrpAlaLeuArgGly210215220GlyGlyGlyGlyAsnPheGlyValValThrSerPheGluValGluPro225ThrHisValPro230SerlieValAsn245VallieAlaAla235PheAsnLeuThrTrpPro250GinArgTrpAlaAlaGinLys260SerArgAspLeuGlyAlaGlyLeuGlyValGinTrp275ThrGlyArgPro290GluTrp265LeuGly280ValTyrGlyGly240TrpGlu255GlyGlyAspAla305AlaProLeuThrLeuLeu295PheThrAlaValLeuAspAla310SerValGluGluGinProTyrGinLysArg325MetSerTyrTyrGlyCysAlaAsp340ValGlyTyrSerProGluAla355SerArg]二eu345ValPheVal315GinPro330ThrProTrp300ArgVallie270LeuAspAlaGly285LeuGlyArgProAspAspAlaGlySer320AlaMet335HisThrAspArgSerArg!Leu370LysAlaLeuGluAla385PheLeuSerPheAla360LysAlalieProGinCys350ProArgThrAsnPheAlalie365SerGlylieGluPhe405ThrAspThrAlaPheValHisArg420Ser!LeuGlyLeuAsnGluAla435AlaTrpAlaPheSerLysAlalieProAsp375380PheValAlaAsnProArqAla390AlaLeuGlyGlyAlaAlaLysGluProProArg395AlaAla410ThrGluPheAsp425LysTyrThrAlaGlu440GlyPheArgAlaGlyArgAlaTrpAlaGluAla450455AsnArgGluSerTyrGinAsnPhe:IleAspProGlyGinPheArg400GlyArg415TyrAlaGlyTyr430AspGluAlaAla445lieAspProPheSer460SerLeuThrAspTrp465470475480LysThrAlaTyrTyrGlyGluAsnTyrAlaArgLeuLeuThrValLys485490495ArgLysTyrAspArgHisGinValiheSerPheAlaGinGlylieAla500505510<210>6<211>479<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1MetValAlaGluGinGlyAlaProSerAlaLysAlaGlyLysThrLeu151015LeuThrLeuLeuCysLeuAlaGinPheMetLeulieLeuAspValAla202530ValValAlaValAlaValProSerMetGinSerGinLeuAsnlieSer354045AlaAlaAsnlieGinTrpValSerThrAlaTyrGlyLeuAlaPheGly505560GlyPheLeuValAlaSerGlyArgAlaAlaAspLeuPheGlyProLys65707580A:rglieLeuLeulieGlyLeuSerLeuPheThrLeuAlaSerGlyLeu859095CysGlylieAlaProAsnGluLeuThrLeuPheValAlaArgAlaVal100105110GinGlyPheGlyAlaAlaLeuValSerProAlaAlaLeuSerLeuVal115120125ThrThrSerPheGlyGluGlyGluSluArgAsnLysAlaLeuGlyLeu130135140TrpGlyAlaVal145GlyValLeuAlaValProLeuGly180LysAspArgSer195LeuLeuThrAla210AlaGluArgGly225AlaProAsnLeuAla305MetLeuVallieSerSerGlyGlyAlalieAlaGlyGinLeu150155AspLeuAlaGlyTrpArgSerliePheLeu165170LeulieVallieValAlaValAlaArgMet185190ValLeuLeuValLeuArg260MetlieCys275GlyThr!Leu290Gly!LeuGlyArgMetGlyAlaSerAla340ThrValAsp355GlyGlyValAsnAspThrGly200GlyValValLeu215ValGlyLeuPro230ValPhePheVal245LeuSerMetPheMetLeuThrAla280TyrMetGinGin295PheAlaProVal310ProLeuValGin325AlaLeuPheAlaGlySerTyrTrp360GlyAlaAlaLeuArglieAspMetAlaGly205LeuValThrAlaValSer220ValLeulieSerGlyVal235GluGlyArgValValArg:265GlyHis250AsnProAsn!LeuGly160lieAsn175ValArgAlaGlyGinAla!LeuGly240GinAsn255TyrGlyValArg270GlyAlaThrValSerValPhe285SerProVallieGlyMet300Thr!Leu!LeulieLeuGly315GlyValLysVal345ValPhe330GlySerLeuGinValPheSerPheAlaProLysAsnLeulieu350ProGly365SerMetMetMetValSer320:LeuLeu335SerPheLeuAlalie:Leu<table>tableseeoriginaldocumentpage60</column></row><table>ProGinValAspArglieAspGlyLeuGly8590GinArgArgLeuAsnArgHisSerArgSer100105GlyValAspAlaTyrLeuGlyLeuLeuTyr115GluGlyProAlaGly130ProGluGlyGlyCys145AlaAspLeu120SerSerGlyHisGluLeu135ProValCysHisGly150lielieThrHisAlaThrLeuAlaValAsp95ThrValAlaThrMetThr110SerArgLeuProAlaVal125SerArgTyrSerAlaAspLeulielieThrHisPro165170GlyGlySerProTrpPheAspLysLeuArg180185ValProPheLeuAlaGluArgHisGlyAla195LeuProGinArgGlu210ThrGluThrValSer225AlaGluSerHis200ArgAspAlaLeuPhe140AlaGlyGlyThrProArg155160GlyLeuProLeuGinGlu175SerProGluGinValAla190AspLeuSerLeuProTrpPhe215SerValGinValVal230GinLeuAsnValLeuTyi:245GluValGluArgLeuPheAlaAla260ArgTyrLeuProTyrMetArg275TyrGlyGluAlaGinPro250AlaGlyThr265lieProCysLeu220ProAsnLysAsn235!LeuArgGlyAla205TyrGlyThrGlySerGlyTyrGlyGluAlaAla290295TyxGinGinLeuSerAlaLeu305310Gin280ArgThrValAlaArgValHisGlu315SerAsp240LeuAla255GluAsnAlaLysGin270AlaGinCysGlyGly285LeuAlaGlyValThr300ValGinLysTrpAla320AlaGluLeuGlyGlyValLeuSerGlyVal325330AlaLeuLeuProGluValThrAsnArgLeu340345GlyLeuAlaHisLeuGluLeuGlyArgThr355360GlyGluLeuGinArgAlaArgThrAlaAla370375ThrGly工leValPheValLeuAspGluLeu385390AlaAspLysGinLysLeuArgAsplie405ThrValLeu420TrpVallie435MetPheSerThrGlyArgSerGlyGly485HisThrVal500CysPheThr515GlyHisAlaValAla530HisProAlaValGly545GlyAsnAlaAspArgVal450SerVal465AlaArgArgAlaLeuThrLeuValAspLeuGlyPro455TyrLeu470GlySerArgValGlyValProAla535AlaValGluHis425GlyPro440ProGinLeu410AspGlyGluGinA:rgGlulieGlyGlu335ArgLeuLeuAspArgLeu350AlaProThrLeu365GinLeuSer380GlySerGly395GluAspLeuSerGlyThrSerLeuLeuArgVal430GlyAspArgSerlieProAspValAlaGlySer445LeuAlaAla460ArgProArg475GlyArgTrpLeuThrPheLys490AspGluLeuArgValProLeu505510SerGlySerGlyLysSerSer520525LeuThrAlaAlaLeuAlaGly540ThrGly工leGluSerValAlaHisPro400AspSer415AlaGinGlyGlyHisProAspAsnArg480AspVal495AsnThrLeuLeuSerGlyTrpVal550555560AspCysValAspAlaThrTyrThr580ThrAspArgAla595AsnAlaAlaAla610LeuValAspMet625CysAlaGlyArgGlyLeuAlalie660GluPhePheArg675GinGinValGly690LeuSerGlyGly705ArgGlyAlaValGinSerProlieGlyArgThrProArgSerAsnPro565570575LysAlaPheAspAlaValArgLysLeuTyrAlaAla585590ArgAlaLeuGlyLeuGlyAlaSerAlaPheSerPhe600605GlyArgCysGluAlaCysThrGlyTyrGlyGinLys615HisPheLeu630ArgPheThr645AspGinValGluProArgLeuGlyTyr695GluAlaGin710GlyGlyArg725LeuThrGlyLeuHisProAla740AspAlaLeuLeuAlaAlaGly755LeuHisLeuAlaAspSerAla770775AlaGlyGluHisGlyGlyArg785790620ProAspValTrpAlaThrCysAspVal635640ProGluValLeuSerValThrTyrGin650655LeuGlyLeuThrValAspGluAlaAla665670ThrLeuThrAlalieLeuHisAlaLeu680685LeuGinLeuGlyGinSerAlaThrAsp700ArgLeuLysLeuAlaGluAlalieArg715720GlyGlyValValLeuLeuAspGluPro730735AspValGinArgMetValSerAlaPhe745750HisThr!LeuLeuValAlaGluHisAsp760765AspTrpLeuValAspMetGlyProGly780ValValAlaGluGlyThrProGluGin795800lieArgAlaAlaAspSerAlaThrAlaGlyHisLeuArgArg805810ProArgGly<210>8<211>521<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1MetAlaSerSerProSerValLeulieValGlyAlaGlyProLeuAlaLeu:lsuAspLeuPhe65!LysPro50ValLys35ArgAla20Arg5AlaGluLeuSerSerThrProThrLeuGluArgTrpLeuAlaGinGlyLeu70AspTyr85ProGinSer100SerGlyAlaVal115ArgGinHisAspGlyGly130Leu55ProSer40LeuAla25Pro10AlaGlyHisSerA鄰MetArgTrpAlaHisArgAspValAla75SerLeuVallieLeuAspAsp90ArgThrGluGluGin105lieGluArgGlyAlaVal135120GluValGluValProArgThrGluHisAlaAspPhe'ValValGlylieAlaCys30ArgAlaPhe45GlyArgAla60AlaLeuGlyProPheProLeuThrSer110ThrValThr125GluGlyPro140CysAspGlyLeuVal815ThrGly15ArgValGlyLeuGluSerAspGly80TyrMet95TrpAlaGlyLeuGlyGlyValHis145150SerThrValArgAspSerSer!Leu180ProGinValTyr195ProPheGlyAsp210MetGlyAlaPro225SerCysArgAlaTrpMetSerArg260LeuGly1ArgVal275SerGlyGlyGin290GlyTrpLysLeu305LeuAspThrTyrArgArgThrAsp340AlaArgLeuLeu355ValGinAsnSer370ArgPro!LeuAlaGluLeuAla165工leValAlaAlaArgThrGlyThrPhe215ValAspGlu230ValLeuGly245PheArgSerIaeuLeuAl3GlyEeuGin295AlaAlaHis310GinArgGlu325LeuValTyrArgGlyLeuValLeuGlu375GlyThrThrGlyAspGly200ArglieSer170ValArg185LysArg155PheLeuGly160SerGlySerSerTyrL,eulieAlaProValThrThrAspAspGly280ThrGin265AspPhe250ArgVal235GlyLeuAlaAlaGlylieGinLeuSea:GlyArgArgLysSer360GluPhe345ThrArg315Prolie330GluValArgLeuLeuSerGlyGlyAirgValProAlaThrPro190MetValVal205LeuAspHis220AspGluLeuLeuHisAspSerAspArg270HisThrHis285AspAlaMet300AlaProAlaAlaAlaAlaSerAspSer350MetGlyLeu365LeuLysLeu380HisArgLeu175ProAspIjeuPheAlaArgArgGlu240ProLeu255TyrArglieProAsnLeuHisLeu320ThrLeu335AlaGlyAl3ProArgTyrValGly<formula>formulaseeoriginaldocumentpage66</formula>ValGly50ProGin65lieAlaGluArgAlaGlyArgArgTrp60AspArgMetValGlu80ThrValGlyArgAla95LeuValValSerThr110CysGlyThrGlyLys!LeuLeuGluGlu55AlaArgLeuTyrGlyValAspProAla7075ArgGlyArgLeuProGlyAlaGluLeu8590LeuProProAlaAspAlaSerValAsp100105ThrAlaPheGlyHisTrpSerAspMaProAlaGlyLeuArgGlulie115120125ArgArglieLeuArgProGlyGlySerValMetlieAlaGluHisAla130135140ProProGlyValLeuLeuArgLeuLeuLeuLysAlaMetGlyArgLeu145150155160ProArgLeuTyrAspValAspGlyMetArgThrLeuValAlaAspAla165170175GlyLeuThrProGluArgLeuGluThrValProGlyThrPheValVal180185190ThrHisAlaThrArgAsnGly195<210>10<211>160<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1MetSerValTrpHisLyslieAspAspTyrLeuHisLeuPheSerSer151015ProValLeuSerPheArgAspProAspGlyPheProPhe25ThrGlyLeuSer20ArgGinAspArgCysArgProArgGinAspArgAsp3540ProGluGlyValProAlaAlaGlu5055HisAspGluGluPheGlySerSerLeuArg30ValArgLeuMetVal45GlyProAlaTrpLeuLeuTrpHis60LeuAlaSerHisAspGluGluPheGlySerLeuGinAla657075AspLeuAlaAlaHisGlyAspGlyTrpSerPheArgPro8590LeuProGlyProGlyLeuGlyProGlyGlyTrpAlaGly100105ArgAspThrAlaArqPheLeuGluLyslieGluArgAspThrAlaArg115120AlaProGinAsplieAspTrpAlaAlaLeuGluValSerGly80ArgArgVal95ValValGlu110AsnLeuThrAla145Pro130ArgLysAspAsnGlu150Trp135GluArgAlaArgAla155GluArg125ArglieAlaGluSer140TrpAlaGluLeuPro160<210>11<211>165<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1MetProAlaThrLeuSerGlyGluValLysArgValLeuArgArgVal151015ArgAlaCysGluPheSerThrLeuAlaArgThrGlyArgProValThr202530TrpProMetAlaPheLeuTrpLysProGluGluAsnGluPheValLeuSerSer50AspAjrg65GlySerGlulie354045SerlieSerValAlaArgLysAlaGluHislieAsnArgAsp55ValSerLeuLeuMetSerAspPheThrAlaPro70AlaVal85ValAspMetThr100ArgLysArgProAlaGly115PheHisAlaGin130ProGlu145!LeuGluLysValArgValProSerTrpAla150Ala16575LeuValGinGlyArg90GlyLeuGluAspPhe105AlaLeuGluAlaLeu120PheTyrTrpArgLeu135PheArgLysAspAla15560GlySerAlaLeuPro80AlaThrAlaProGlu95TrpAlaGluLeuPhe110AspProArglieArg125ArglieThrValThr140MetGlyGlyThrAla160<210>12<211>395<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1MetThrThrThrProAlaProSerTyrProPheProAlaLysTyrLeu151015AspGluAlaAlaGluLeuAlaArgLeuArgGinGluGluProlieSer202530ArgValThrMetProTyrGlyGlyGluAlaTrpLeuValThrArgMet354045LeuAlaAspProArg55ProArg]hePheProPhe60GluProProGluGin90AlaArgArgVal105GinGluAlaAspValLysGluVal50GinThrGluAlaAspArg657075GlylieGlylieMetAspProProGluGinThrArg85ValAlaLysAlaPheThr100ValGinThrlieValAspGluLeuLeuAspAlaVal115120AlaProAlaAspLeuTyrAlaAsp130135SerlieCysGluPheMetGlyVal145150ValProPhePheAspSer165PheSerTrpProTyrGin140AspSerArgGinLeuProValValGlu80LeuArgArgLeu95PheGlyProArg110GluAlaLysGly125LeuProGlylieGlu155ValValSerThrThrSerLysThrArgAspArgPhe160170GlulieArgGinAlalieGlyAspLeuHisAlaTyr180185lieGluArgArgArgAlaThrProGly195200lieArgAlaArgAspGluAspAsp210215GluMetSerPheGlyLeuLeuVal225230GinLeuAlaAsnPheLeuTyrLeuLeu245GluLeuLeuArgAlaLysProGlu260LeuLeuArgPheValProLeuLeu275280AspAspLeuArgLeuSerPheGly190PheThr205GluLysGlu220AlaGlyMetGluThrThr235PheArgAsnProAsp250ValProAsnAlaVal270ValAspGinProSer285Leu265AlaProAsp175GluLeuAlaLeuLeulieAlaSer240GinLeu255GluGluValAlaArgGluAspVal290ValVal305AsnProPheSerProSerGinArgThrLeuGlyGlyValLeulieArgAlaGlyGluThr295300MetAsnSerAlaAsnArgAspAlaAspValPheGlu310lieAspVal325AlaHisHisHisGly340GluMetValThrAlaLeuArg355AlaValHisGin370LeuArg385GlyValProAspGlu375GluAla!Leu390315ThrArgGluAsnAsn330CysValGlyAlaGin345SerLeuLeuValArg360Glu]」euArgTrpLys380ProValAlaTrp395320ProHisLeuAls335LeuAlaArgMet350PheProGlyLeu365AlaGlyValVal<210>13<211>61<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<楊>1MetSerGlyLeuCysMetSerLeuAlaProAsnValPheGlyGluGly151015AspAspHisArgValValAlaLeuThrAspArgSerAlaProAspGly202530ArgLeuLeuGluAlaAlaGluPheCysProAlaGluAlalieAlalie354045SerSerLeuAlaThrGlyGluThrValGluGlyThrGly505560<210>14<211>504<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<糊>1MetLysArgGlyLeuLeuArgThrPheLeuGlyThrAlaAlaAlaSerAlaAlaGlyAspAlaProGin50ValArg65GluAlaLeu35TyrLeu20Gin5ProAlaAlaThrArgSerLeuAspThrValProGinAlaValArgSerGlyGlyPhe85SerVal70AlaArg55ValAsp40SerAla25Gly10SerAlaGlyGlyThrValValLeuAlaLeuLeu45GinPro30Pro15ProTrpGlyAspPheAspSerArg60ArgCysThrThrValGluAspValAlaArgHisPheAlaGlyHis100lieAspLeuSerLeuMetAsnTyr105ValArg90Ser75ValProAlaValThrThrThrArgLeuAlaArglie130ValHis145GlyGlyAla115120GinProGlyCysGinLeuValAspLeu135GlyValAlaValProThrGlyTrpGly110GlySer125GluGluAla140ProThrValAla95MetCys80ArgValValAlaLeuAlaLeuAlaLeu165Cys150155GlyGlyGlyLeuGlyPheLeuThrArg170AlaMet175lie160TyrGlyValAlaSerAspArgMetArgArgAlaGinValValLeuAlaAsp180GlyArgValValAlaLeu210AspPhe225TrpThrThrAla195ArgGlyGluGlyGluProValTrpAlaSer245ProSerSerAla200GlyGlyGly215ProAlaPro230ValArgAlaAspProLeu185190HisGinHisAlaAspLeuAsnPheGlyGluAla260TyrGlyAlaAspAlaGlyVal275GlySerProAspTrpLeu290AlaAla305TyrArgAlaAlaTyrGlySerAla370GluGly385AsnArgValGlyThrGly295ProGlu280LeuAspMetValLeu250ThrPro265ProGlyThr235Serlie220Ser205ValThrPheThrAlaTrpGin!LeuLeuValThrGlyProLieuAsn工le270ValSer285AspArgPheTrpGluTyrLeuThr240ArgTrp255SerThrGlyValAla310PheGlyMetArgHisTrpLeuAspArqLeuThr300AlaThrSerGluArgArgThrAspCysHis340PheAla355Glylie325ArgValGlyHisLeuAlaArgPheGly330AsnPro345AsnPhe315CysAspThrLeuGinAlaGlu335AlaSer320ProArgGly360AspAlaValLeuLysAlaPheGinLeu350PheAspArgProLeuAsp365AlaAlaGluAlaArgValProMa405Ser390Thr一Thr375380PheAspLeuGinGlyLeuGlyGlyAlaArgSerAlaThrAlaTyr410395ValHisArgAsnAla415His400LeuPheTyrAlaGlyTrpSerValGlylieAspValProGluGlyGluVal420425430LeuAlaProAspArgArgArgAlaCysGinGluTrpValAspArgAla435440445TyrAlaArgValHisProTrpSerSerGlyGinAlaTyrGinAsnTyr450455460lieAspProAspLeuAlaAspTrpArgGluAlaTyrTyrGlyValAsn465470475480TyrGluArgLeuSerAlaValLysArgAlaTyrAspProLysGlyPhe485490495PheArgPheAlaGinSerlieThr500<210>15<211>475<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<楊>1ValProGlyThrProValValGinAsnProAlaAspArgLeuAlaGly151015ProAlaAspProLeuGlyLeuArgArgLeuSerAlaGlyLeuAsnLeu202530LeuArgArgProLeuProPheLeuGinGluLeuAlaSerGlyTyrGly354045AspValCysAlaMetProGlyArgAsnProArgThrValPheAlaPhe505560GlyProAspGinAlaArgGinValLeuThrGlyLysSerPheValSer65707580AspSerPheArgHisLeuArgLeuProProGlySerProMetMetLeu859095:LeuThrSerGly100ArgGinAlaMet115ThrAspThrlie130ProGlyArgSer145ArglieSerLeuLeuLeuArgIjSuAsn105GlyThrGinGinAlaPheSerProArg120AlaArgHisProLeuArgMet210工leAla225LeuThrGluLeu:LsuAsn180ThrAla195ThrArgValValAla165AspArgPhe135AspLeu150ThrMetAspAspMetHisGlulieHisArg110HisValThr125LeuAspGly140AlaArgLeuLysMetValGluLeu155AlaGlyLeuGluAspGluAla170LeuAlaVallieGinlieAla200GluGlu185Pro!LeuProAlaGly215AlaArgArgSerGlyAlaAla230AlaGlulieGluAspValProAsp245GluAlaAspAlaTyrThrLeuGly260SerSerLeuPheTrpThrLeu275!LysLeuTrpGlu290ProGly305LysGluArgProAspValSerValArg325AlaGlu295LeuVal310LeuMetPhe280ValArgP:coAl3Pro235AlaGlu250AlaLeuCys265LeuLeuAspAlaGluValMetProVal315ProAlaGly330AlaAlaAla190GlyThrPro205lie:LeuArg220AspLeuLeuGluLeuSerHisAspSer270GinHisArg285AlaGlyAsp300LeuAspArgPheGlyValLysHisValTyrTrpThrValThr160AlaAla175GlyHisPheArgAlaL>euSerLys240AspGlu255ValAlaGluTrpAlaAlalieVal320ArgTyr335AlaGluMetThrGluGlyAlaVallieAspGlyHisProlieProLys340345350liePheSerSerPheValThrHisArgAspAlaGlyProSer385GlyGlu370GinArgMetGinValVal355ProLeuArgPheArg375HisGluTyrPhePro390Asn!LeuAlaLeuLeu405ArgAlaProValValAlaVal465Gly450Asn420Arglie435ProAspGlulieSerThrValGlyProAla455lieGlu!Leu470360365ProGluArgTrpGluThrAla380PheGlyLeuGlyGinHisAsn395GluSerLysLeuValLeuThr410LeuProAlaGlyThrValVal425430ProAspGlyHisValArgAla440445ProArgAlaTyrArgValThr460ProGluHisPro475GlyAlaThrPheGinProCyslie400Arglie415AspProLeuValGlySer<210>16<211>1836<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<■>1MetAspGluAspSerSerLeuTyrProThrAlaMaArgProSerArg151015ArgProLysGluProArgLysMetValLeuGlyAspPhePheArgGlu202530SerLeuMetValLeuLysGlulieLeuArgArgValArgAlaProPheSerPhe50GluArg65SerThr3540GlyLysAlaGlyPheGlySerGlyLeu55GlyValCysGlyThrAspProGlyLeuAlaArg130LeuLeu145ThrValThr115AlaArgGlu85GlyAla100GluAlaHis70ThrArgGinThrValLeuGlyThrAlaAlaGlyGly75Alalie90PheProVal105ValThrPheSerArgGlu120LeuAlaValLysLeuArgGluValLeuMetGin135GlyProAspPro150TyrAlaGlyLeuProAlaAlaAlaAspValLeuProArgGlu210ValAsp225AlaValThrGlyAsnValArg180AspAla195LeuPhe165ArgAspPhelieThrLeuLeuArg185GluTyrLeu155ProVal170LeuGlyProGlyThr200AlaGluArgLeuProAspValThrSerValArgProGlu245ThrProArg260ArgAlaliePro230Glu215AlaGlyAlaProLeuPheGlyValLeuGluAsp235lieGin250SerHisVal265ThrAspArgPheGlyLeu45SerProHisProAla60AspThrMetGinAla80ArgAlaValCysGlu95AlaGinProAlalie110GlyLeuAspArgGlu125AlaGluGlyProVal140LysGlyPheLeuAla160TyrProProAsnPro175AlalieLeuAlaLys190LeuLeuGlu205ArgProGlu220AspTrpArgPheThrSerArgAsnLeu270AspThrSerProLeuArgLeuAlaPro240GlySer255LeuAlsSerThrGlyAla290Val:Leu305LeuSerThrGlyThrArgLeuGin370ArgAla385PheLeu275LeuTrpLeuProPro295ThrProLeuValSer310PheValSerAsp325AlaThrHis340ArgAlaArgThrGlyAlaAspGlu280285TyrHisMetGlyLeuVal300GlyPheProlieHisLeuLeu315TrpLeuArgLeuVal330AsnAlaProMetGly355ValAlaPheSerPheAlaGluProGlyArgProLeuAlaGluLeu450AlaAsp465ValPheLeu435ValCysTyr405GlyAla420GlyArgArg390GlyGly375PheAsp360AlaPro345ValGluProAlaProAlaLeuAlaGluGlyLeuArgAlaGluThr425AlaSer410HisArgPheGlyTyrAla350AlaLeuAspLeu365ValAspAla380GlyLeuSer395ThrLeulieValSerSerGlyProValThrValGly470AspPro455AlaPro440ValAlaAlaGlySerValAlaAlaSer485GluGluThrAlaArgThrPheGly500SerTrpMetArgThrGlyGlySerPro320SerGlu335LeuAlaSerArgSerThrLeuProArgLeuAspProGlyAla445ValAlaGlyThrGlu460ProGlyGlu475GluGlyTyrPhe謂AlaThrLeuAlaGly505Pro430ProAspVal400SerGly415GluAlaAlaThrValArglielieGluSer510GlyGlu480AspPro495SerArgGlyAspLeuGlyPheLeuGlyAlaAspGlyAspLeuVal530AsnIj6U545Glylie515ProHisArg520ValAlaArglieLys535ProGinAsplieGlu550LysGlyCysValAlaAspvalArgThrGlyGlyGluAspAspValAla610ProSer625AlaArgGlu595Arg565Glulie580AlaGinGluHisThrLeuProAlaAla!LeuValValAlaArgArg600SerAla615LysThr630LeuAspAlaAla585lieAla570GlulieVal555Gly525ValVal540GinAspArgThrValProGlyLeuArgProAlaArgProValArgGlySerSerGlyGluVal605lieHis620ValGinGlu590ArgGlyArgAspPro560ProAsp575AlaAlaMaAlaLeulieArgArgHis—_645650ThrAlaAspThrAlaAlaProThrGlyAlaLys635GlySerLeuAlaThrValLeuAlaAlalieArg690LeuGly705AlaLeuGly675HisGly660665GlySerAspAspLeuValValAsn680GlyGlyThrThrProLeuProPheValGlyHis695AlaAlaArgLeuAlaGlyGluAla710ProValThrAsplieAlaAlaLeulie740Thr725ArgAla670LeuAlaAla685GlyAla!Leu700SerGinliePhe715LeuAlaGlyValAspAlaGluThrAlaAlaPro745730LeuGlyThrGlySerAlaGlyLeuAlaGluLeuProAlaGlyProAla750ThrAlaArgSerAla640CysAsp655GlyProThrValAspSerLysVal720AlsLeu735AlaAlaGinGlu755760765ThrLeuCysLeuLeuGinGluLeuAspProAlaAlaProGlyLeuAla770775780LeuGlylieAlaPheAla785790AlaLeuAlaLeuThrAla805ArgLeuValArgArgGly820AspThrAlaAlaLeuProAlaThrAlaAspProGinArgVal795畫LeuValAlaArgHisProAlaLeuArgThrGluHis810TrpSerArgLeuValAspGlyAlaPro835LeuGlyGly850CysArgArg865ValValLeuHisAlaValAlaHis!Leu915ProProAla930GluArgAla945ProLeuGinGlyArgAlaTrp840lieGlyGluArgAspLeuGlu855!Leu825GlyArgTyrCys830ThrLeu845LeuAlaValProAsp870AlaAspGlyHisGly885AspLeuGluAla860AlaGluGlyProLeuPheGin875LeuValAla900LeuAlaAlaGlyAlaHis890TrpSerlieGlyValLeu905ProGlulieThr815ProAlaThrArgGluArgAlaGlu880ValHis895GluLeuAsp920ProGluValSerCysAspVal935TrpAlaAla910AlaGlyAlaLeuThrLeu925AspArgTrpLeuArg980AsnPhe950ProSer965GinAlaProAsp940GinGlyLeuLeuThrAspGly955AlaAspGlyProAlaGinAsnVal970SerHisAlaProLeuThrLeuAsp985990ArgLeuCysGly960ProAla975AlaArgArgThrAlaAlaLeuThrAla:LeuAlaLysGluCysGlyValThrLeu99510001005TyrAlaValLeuLeuAlaValGinSerLeuThrLeuSerArgLeu10101015ThrGlyThrAspArgValProValAlaValPro10251030GlySerAlaThrHisArgAlaValAspTyrLeu10401045ThrLeuProMetGluThrAlaThrGlyThrVal10551060LeuArgGlyAlaLeu1075ValAlaLeuSerAla1090AspAlaAlaLeuLeu1105GlyValGlyAlaGly1120ValGluLeuGlyVal1135Gly!LeuThrThrLeu1150ArgValGluValAsp1165ArgPheAlaAspAla1180GlyAlaGlyLeuPro1195GinAlaGluArgLeu1210AspGlyGluGlyThrGlyArg1070ValGly1085ProAsp1100ThrPro1115GlyVal1130ProSer1145AspGly1160AlaAsp1175AlaAsp1190SerAla1205ArgSerAlaAlaAspAlaTyrProGluLeuValProAlaProGlyAlaProArgAxgLeuGlyAsplieGlyProPheThrLeuThrGlyTrpLeuAlaGlySerValAlaAlaValGlyAlaGluProValProGlu1020LeuHisGlyArg1035ValSerThrVal1050ArgAspLeuVal1065AlaHisArgThr1080AlaArgGlyGly1095]LeuGinGinAsp1110!LeuLeuGlyGly1125MaValValPro1140!LeuThrGluThr1155ProSerArgHis1170PheLeuAlalie1185LeuAlaGluAla1200ArgGlyTrpSer1215!LeuHisAspLeu12201225ValLeuSerAlaAlaAlaArgHisProArgArg12351240AlaGlyAspGlySerLeuSerTyrGlyGluLeu12501255AlaValValAlaAlaAlaLeuAlaAlaGluGly12651270CysThrValGlyValLeuMetGinArgGlyArg12801285ValLeuLeuGlyValLeuArgSerGlyAlaAla12951300AspAlaAlaThrProProAlaArgLeuAlaAla13101315AlaGlyCysArgHisValLeuValGlyAspVal13251330GlyGinPhePheProValArgThrLeuAspVal13401345AlaAlaGlyProAlaGluProValProProArg13551360ProAspAspProAlaTyrLeuLeuPheThrSer13701375ArgProLysGlyValVallieProHisArgGly13851390lieArgTrpAlaGlyArgGluPheGlyThrAsp14001405ThrLeuAlaValThrProThrThrPheAspLeu14151420LeuPheThrProLeuAlaHisGlyCysGluVal14301435GlyValLeuAsp!LeuValAspSerProAlaHis1230ThrAlaValVal1245AlaHisArgSer1260AlaGlyProGly1275AspLeuProAla1290TyrLeuPro!Leu1305ValValGluAsp1320ProValGluArg1335AspAlaValLeu1350Pro!LeuThrThr1365GlySerThrGly1380ProValAsnLeu1395AlaLeuAlaArg1410SerValPheGlu1425ArgLeuLeuAsp1440AlaAlaAspAla1445ThrLeu1460GinAsp1475GluPro1490ProGly1505ThrTyr1520ValPro1535AspAla1550Leu工le1565GlyMet1580ArgArg1595GlyLeu1610ArgGly1625GlyLeu1640GlyThr1655LeuGluLeuAsnThrValAlaLeuProAla1450ProSerAlaValAla1465SerValValGlyLeu1480LeuVal1495AsnIjSU1510GluLeu1525ValGly1540;LeuPro1555ValAla1570PheLeu1585GlyAsp1600GlyArg1615LeuGly1630GluAla1645AspArgArgLeuAlaMaTyrLeuVal1660SerTrpLeuThrAlaGluValArgMetValSerThrTyrAl3lieGlyArgProSerLieuArgProGlyGlyAlaGlyThrAlaAlaArgLeuTyrArgThrLeuGluPheLeuPheArglieGluAspAlaValArgHisSerValTyrGlyProGlyProAspGlyThrThrGinGlyAlaValGlyTyrProAspProLeuValArgSerAspHisAspValGluValValValGlyAspProGlyLeuArgGly1455SerLeuLeuGlu1470AsnValAlaGly1485HisArgArgLeu1500SerGluThrThr1515ThrSerGlyAla1530LeuSerValVal1545ThrGlyGlu!Leu1560AlaGlyArgPro1575GluHisProGly1590TrpArgAlaAsp1605GinVal!LysVal1620ArgAlaLeuThr1635AlaLeuGlyGin1650ProGluArgPro1665ValArgArgArg1670!LeuGly1685valLeu1700GlyArg1715Arglie1730TrpAla1745PheLeu1760HisLeu1775LeuArg1790GluLeu1805ValArg1820ArgThr1835HisGluLeuProAspGluLeuProLeuAlaSerThrAlaProArgAsnGinAlaLeuProAspValGlyGlylieAlaArgGluThrHisThrThrThrSerAlaAlaArgLeuAspArgThr1675GlyTyrMetValPro1690ArgAsnArgHisGly1705AlaThrValPro!Leu1720AspSerGluArgArg1735ValProGlulieGly1750HisSerLeuMet!Leu1765PheSerValGinVal1780ValAlaGluGinAla1795ProProHisAlaAla1810SerArgTyrThrAla18251680GlyGluPheAla1695LysLeuAspArg1710ValThrGlySer1725ValAlaAlaCys1740ValThrAspGlu1755ThrThrlieAla1770ProLeuThrAla1785ArgTy;rLeuAsp1800GlyProArgSer1815AlaGlyAsnGly1830<210>17<211>1082<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1MetAsnGluMetAlaGlyLysAlaTyrGlyLeuAlaValArgLeuAspLeuArgGlyAlaLeuGluArg20ValValGlu35GlyThrLeu50ValValAsp65GluValArgArgHisGluAlaThrAla25LeuArg4010LeuThrThrGinValValValProGly55LeuGlyGlyArgGlyProAspArg70AlaArgGinGluLeu85AlaGinProP:ro!LeuLeuArgGlyHisHisVal115SerLeuGin130GlyProAla145TrpAsnAsn100LeuLeuLeuCyslieValLeuGlyAlaGlyAsp150GluArg135GluVal120Glu!Leu105HisAla90LeuHisLeu:LeuProEeuGinGluPheProGly165ArgGinAlaAlaArgGluHisTrpGinSerAlaLeu30GlyLeuArgGin45ValThrValSer60ProAlaGinLeu75GinArgGlyTrpAlaArgLeuAla110AlaValCysAsp125GluAsnTyrThr140PheAlaAspTyr155ProGluTrpSer15SerAlalieAspLeuProAspAsn95AspArg80LeuAspGlyLeuGlyGlyValVal160ArgGinValLeu195180Asp!LeuProThrAspProGluTrpSerAla170175SerSerThrLeuAlaGlyAlaPro185190AspArgArgArgProProLeuGinSer200205TyrAlaGlyAlaArgValProValArgLeuAspThrAlaPheAlaAsp210215220ArgValArgAspTrpSerAlaGinArgGlyValThrProPheThrThr225230235240Ala280GlyLeuLeuAlaAlaTyrThrValVal245AspLeuLeuValGlyLeuProVal260AlaGlyThrValGlyTyrLeu275LeuArgAlaAspProThrLeu290295ArgLeuThrGlyValLeuGlu305310ValGluLeuLeuAlaProProArg325MetPheGly340LeuArgVal355LeuSerLeuPheLeuAla250AlaAsn265AsnThrArgAsnGlyGlyArgCysGluLeuValGinValProGlyValAsp370GlyPhe385ArgPheAlaArgThrThrP:coLeu450AlaGlu465LeuGinGinAspValHisAlaAspMetPro330AspVal345AspValGlu360Phe315GluArgAspGluGluAlaLeu375LeuGluTyrAlaSerAlaLeuPheAlaAspGin405lieProVal420工leAspSer435ValTrpPro390LeuHisThrSerAlaValPheLeuArgAsp440Arglie455MaValArgAspAspAla470Asp395ValLeuArg410AspLeuVal425GlyGlyProGluLeuAlaGluAlaLeu475SerHisAla270ProLeuArg285ArgAspLeu300ProPheGlyArgAsnProArgGlyTrp350CysGlyAsn365AspGlyAla380ArgMaThrGinlieLeuGlyAsnGly430乙euGluGin445AlaArgArg460AspTyrAlaAlaAsp255AspLeuAlaAspTyrAspGluLeu320ValPhe335AspLeuAlaArglieAspAlaGlu400ArgAsp415SerAlaProTrpProSerSerLeu480ValAspArgValAspAlaAla485GlyProGlyAspArgValAla500ValValAlaMet515ValAspProAla530AlaAlaProAla545HisHisArgSerAlaAlaArg490ValLeuAla505LeuAlaCysTrpArgAla520ProArgA:rgAspAlaGlyAla580ProAlaTyrLeu595ValAlaValSer610GlyArgLeuAla625ThrAlaPheAspGlyGlyThrVal660AspLeuAlaAla675ThrProAlaVal690GluGlyPheThr705AlsProLeu535Val!LeuValCysGluAsp550ArgAlaValAlalieGly565570GlyThrProAlaGluPro585MetPheThrSerGlySer600HisAlaAsnLeuSerSer615LeuGlyProAlaAspArg630lieSerLeu645ValValAlaLeuThrAlaAlaGlyAla495GluArgGlyValArgAla510GlyGlyValTyr525GluLeulieLeu540ProAspGluGin555LeuThrAlaAspAlaThrPhe!Leu635LeuLeuGluLeu650ProSerSerAla665ArgLeuSerSerPfoGlylie680TrpArgLeuAlaLeuSerAla695LeuLeuCys710GlyGlyGluAla715ValProGluGinProPro560GluAla575ProArgProHisAsp590GlyArgProLysGly605LeuHisAla!LeuThr620!LeuAlaLeuThrThr640GlyProLeuValThr655GinArgGlyAlaAla670ThrThrAlaGinAla685GlyTrpArgProArg700LeuProProAspLeu720AlaAspLeuLeuAlaAlaThrPro725ProThrGluThrThrlieTrpSer740GluProValThrlieGlyArgAlaGluAlaHisAsnLeuTyrGly730735CysAlaAlaArglieArgProGly755AlaAspMaMaLeu770LeuValGlyGlyPro785ArgThrAlaPro760Val745lieProArgPro775GlyValAlaLeu790PheValProAspAlaArg805LeuTyrArgThrGlyAspValVal820GlyArgValAspGluGinValLysGluPheVal835lieGluLeuGlyGlu850ArgAspAlaAlaAla865AlaGlyTyrLeuVal885AlaAlaArgLeuArg900ValProSerGluLeu915GlyLysValAspArg930GlyGlySerGluArg945Arg825GluGin840lieGluSerAla855ThrValLeuAsp870AlaAspAspGlyGinAspLeuSer905TyrAlaValPro920ArgMaLeuPro935ValAlaProSer950750ProGlyThrArgValLeuVal765ProGlyValCysGlyGluLeu780GlyTyrLeuAspAspProAla795800ProTyrHisProGlyGluArg810815!LeuArgSerAspGlyLeulieSerAspGly830ValArgGlyHisArg845LeuArgAlaLeuProGlyVal860ProArgGlyAsnAlaArglie875880AlaLeuAspThrAlaGlyArg890895GluAlaLeuProAlaSerMet910AlalieProLeuAsnProAsn925GlyThrGlyArgArgLeuGlu940AlaGluHisThrAsp955AlaVal960AlaAlaLeuTrpCysGluLeuLeuSerLeuProGluValGlyValArg965970975GluAspPhePheGlyLeuGlyGlyHisSerLeuLeuAlaAlaAspLeu980985990LeuGinArgLeuGluArgAspLeuGlyAlaArgValProValAlaGlu99510001005PhePheMetGluProThrValAlaArgLeuAlaAlaThrValSer101010151020GinlieuSerGlyAlaValHisGinThrProThrGinAspProAsp102510301035GlyArgAlaGluGinThrAlaGluProSerProGluAspArgGly104010451050AlaAspGlyAspGlyTrpAspPheProThrValArgArgSerVal105510601065ThrValProGlySerAspProValProLeuGluValSerPro107010751080<210>18<211>1485<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>ValThrArgHisGluProThrGluThrPheThrAspThrLeuHisArg151015ValLeuGluGluArgGlyAlaGluSerAlaThrProLeuSerValGlu202530GinArgArgLeuTrpLeuLeuGlyGlylieAlaAspGlySerTrpAla354045ValThrGlyArgTyrArgLeuGinGlyValProAspAlaAlaArg505560GinLeuArgLeuAlaSerLeuValSerArgHisGluAlaLeuArg707580ValPheValGinValAlaGluArgProValArgLeuValLeuPro859095AlaGluValAlaLeuArgThrValAsnAlaProAspSerThrAsp100105110AlaArgAlaAspGluHisValArgHisLeuValGluGluGluPhe115120125!LeuGlyHisGlyProLeuLeuArgAlaLeuLeuLeuArgSerAla130135140GlyGlyAlaAlaGluLeuValLeuValGlyHisArgLeuValLeu150155160AlaThrSerLeuAspLeuLeuValAlaGluLeuLeuGlyGluAsp165170175ProHisGlyArgGluGlyAlaAlaAspThrAlaGlyAlaLeuGlu180185190ThrLeuAlaAlaGluArgGluArgLeuAlaAspProAlaLeuThr195200205AlaValThrGlyArgAlaAlaGluLeuAlaLeuProAlaAlaThr210215220lieProGlyTyxGlyArgArgProGlulieLysGlyThrSerTyr230235240SerValAlaLeuProLeuProLeuAlaLeuAlaProArgAlaAla245250255AlaAlaAspTrpGluAlaAlaValAlaAlaAlaTrpLeuValVal260265270MetArgSerGinAlaThrGlySerAlaValCysGlyValArgThrValLeu65SerPheProSerAsp145AspAlaThrGinGlu225AlaGluLeu275280285AlaArg290ArgLeu305!LeuAl3LeuAlaThrProAspAla315AlaPro330ProArgPro300Pro!LeuSerGlyAlaGluGinAlaGlylieValGlyProLeuAspGlyLeu295ValArgValAspAspAlaAsp310AlaValGlyArgGinLeuArg325HisLeuLeuGluValAla340TyrAlaGinThrValPro345ArgAlaValAspAlaValAspArgLeu355GlyAlaSerGly370ThrGluTyrAsplieGluValThrGly385AlaValAlaGinGlyThr375lieGlu390AspTyrVal360GlyArgGinlieAsplie350AlaArg365GlyGlyAla380LeuGinProAspLeu320ValPro335SerArglie405ValArgAlaLeuGlyAsnGinLeu420ProGlyGlyThrProGluVal435GlyAlaValArg395AspValGinLeuTyrSerGlu410AlaValAlaArgGlu450AlaProAspThrArg465AlaAlaProAspLeuAla455SerLeuValAsp470ValAlaLeuTrpAlaAlaValLeu425ThrAlaThrValVal440LeuGluAlaGlyGluProAlaSerGlyLeu400GluArg415lieLeuAspAla430ProLeulieuAsp445GlyGluArgThrAla485TrpAlaAspAlaTyrAlaGinLeuTrpAlaAspAlaThr500505AlaArgGlyValArgProGlyAspArgValAlaVal515520Arg460LeuValGluAlaGinValAla475糊GinGlyAspThrArgValThr490495ArgLeuAlaAspGluLeuAlaAspGlu510TrpLeuArgArg525GlyAla545LeuPro530PhelieuHis!LeuSerGlyAlaArg625AlaAla610ProTrpSerThrValThrAla535ValProValAspAla550SerAspSerArgPro565LeuGlyGlu580AlaProAla595AspThrProAlaTyr615HisAsnProLeuLysVal705AspProLeuGluValLeuGlyLeuLeuLeuAlaValLeuAlaAla540AlaTyrProGluGluArgVal555SerLeuValValThrGluSer570ProThrLeuLeuLeuAsp585590AlaValAspGlyAlaArgArgProAsp600605LeulieTyrThrSerGlyThr620LysGlyValValValArgHisSerSerValValAsnAsp690!LeuAlaArgGinAla645HisSerPhe660AlaThrGly675ProGlyGluGlyGlyValGlyThrCys725ThrAspThr740AlaAsnLeu630635AsnTrpGinLeuThrGluAspAspArg650ProSerValTrpAlaAla665670ValLeuAlaThrGluGlu685ThrLeuArgAspHisGin700LeuSerLeuIjeuLeuAsp715ArgLeuValLeuSerGly730SerValGluSerThrTrp745750ThrGluAlaThrlieAsp765CysPheAspAlsAlslie680MetlieThr695ProSerLeu710ThrArgValLeuLeuGluTyrGlyProGlyAlaArgTyr560SerVal575GluLeuArgValThrGlyAsnlie640ValLeu655PheTrpGinMetValThrHisArg720GlyGlu735SerAlaAlaThr755760GlyHis770ArgAla785ProValValAlaLeuProTyrArg850PhePhe865GluValAspAlaAlaAlaAspVal930AspArg945ThrAspProValArgSerArgValProArgGlyAspArgThrVal775ThrAla790ValProValSerAsnValHisValProGluGly805ValGlyTyr820AlaProPhe835ThrGlyGlyHisAspArgAlaAspGlulie810ProAla825SerAspVal795ValLeuAspAla840AspLeuGlyArgArgLeuPro855GlyArgValAspAspGinValLyslie870875ProValPro780AspAlaGluValThrGlyThrAlaAla830ProGlyAla845AspGlySer860ArgGlyHislieGlyAlaVal900AlaLeu915ArgSer890AlaLeuAspAlaGlyThrValLeuProAlaLeuPheAlaValVal950Ala935AspAla920Gly905GlySerAspGluLeuProGluLeuProLeu955AlaGly980ValAla995965GinAspGlyProThr985970GluLeuAla815ArgArg800GlyPheThrLeuValGinSerGluValGluSerValLeuLysAlaLeuAlaGly885ValGluAsnAla910AlaProSer925AspTyrLeu940ThrAlaAsnArgVal895ArgVal880GinLeulieuGluValProArgArgGlyValAlaGluLeuLeuSerArgGinAlaProGlyAla975LeuLys960AlaGluGluAlaPheArgAsnAla990AlaSerValLeuArgLeuProAlalie10001005Asplie1010LeuAla1025liePro1040SerGlu1055LeuGly1070AspPro1085AsnPro1100LeuHis1115ValThr1130LeuLys1145LeuHis1160LeuGly1175AspVal1190TyrSer1205LeuGlu1220HisAlaAspPheLys!LeuAlaSer!LeuHisGluPheThrlieGluValArgLysHisAlaSerlieSerProArgArgValPhe!LeuValGluGinLeuCysArgAlaGluGlyPheAlaArglieSerAlaLeuThrGinGlulieTyrHisAsnAla!LeuLysAlaLeuAlaCysThrPheAsp1015LeuLeu1030PheArg1045TyrArg1060MaThr1075GluGly1090LeuThr1105LeuLeu1120ArgAsp1135LeuTyr1150Vallie1165GinTrp1180GlyAla1195AlaAsn1210ThrArg1225ValGlyGlyGlyArgLysThrProThrArgGluGlyLeuGluGlyLeuProGluGlyAlaThrArgArgThrGluGinHisGlulieAspGlyAspLeuAspAspLeuLeuValAsnValGlyGlyLeuLysAlaThrSer1020HisAsp1035ValAla1050LeuAla1065AspGly1080AlaGlu1095GlyTyr1110AspAla1125AlaLeu1140ValGlu1155MaGlu1170AlaAla1185PheVal1200ThrGin1215ValHis1230LeulieValGluGlyValGlyValThrLeuTrpGluLeuGlyGluValGluArgAspGinProArgThrValTyrProArgValHisValSerThr1235LeuGlu1250GlyTyr1265AlaArg1280lie!Leu1295LeuLeu1310AspTyr1325SerAla1340PheTyr1355CysAsp1370PheGlu1385LeuLeu1400ProHis1415AlaLeu1430lieAla1445lieAspThrLeuAsnAspAlaProThrGlySerLysArgArgGlylieGlyHisThrLysValAlaLeuArgProArgliePhelieValHislieHisLeuPheAsnTrplieLysSerGluGlyArgArgTyxProLeuValArgAlaLeuAspGluCysAlaGluCysProProThrLeuAlaThrHisMet1240LeuHisSerAlaVal1255TrpValAlaGluLys1270ProValThrValPhe1285AsnGlyAlaThrGin1300GlyPheLeuProMet1315AspVal1330SerArg1345ProAla1360TyrGly1375ArgAla1390ProLeu1405ProLys1420ThrLeu1435ThrGlu1450lieProValLysArgGluProValProTyrGluPheLeuGlyVallieLysAspTyrMetAspArgMetLeuAlaAspAlaProArgProPhe1245GlyValProAla1260ValValAspGlu1275ArgProGlyLeu1290ThrlieAspTyr1305ArglieLeuPro1320AspTyrValAla1335AlalieAspGly1350LeuLeuThrPhe1365AsplieValPro1380GlyProSerHis1395AlaGluAlaGlu1410GluValGinPro1425AlaGlySerAsp1440HisAlaValMet1455AspTyrLeuValArgThrGlyPheMetProAlaProAlaAspVal146014651470ValHisAspGluProSerSerThrLeuGluGluArg1475l徹1485<210>19<211>365<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1MetThrAlaProAlaAspThrValValAlaGinValAlaThrValPro20LeuProGlyThrLeuAspGlu35AlaValArgGlyGlyGlu50AlaAla65ValGlyLeuLeuAlaLeu55ThrLeu40AsnHisPro10PheArg25ArgAlaAlaGlyGinProAspTyr15LeuGlyArgGluGluPro30AlaValSerAlaArgAlaArgProGlyGluArgThrAla70AlaGlyProValGlyAspAspArgAspAsn85LeuAlaPheGlyAsp100AlaGluLeuLeuValAspAlaGluVal130115AlaValAspArgGly135Ala120Serlie105LeuPro90ValArg75SerAla60Ala45ArgThrAspLeuAlaLeuAlaGluSerGluArgThrArgAlaAlaLeuAsp!LeuGluAsp140Ala125AlaGly110GlyAla95ProPro80AsnVallieLeuAlaTrpGinArgPhelieGly145HisSerAlaProProProValHis180ValPheMetVal195ArgAspSerAla210AlaAlaAlaGlu225AlaGlyAspAlaLeuMetProPro260AspHisGluVal275GluTrp!LeuSer:290AlaLeuAlaGlu305ValGlyAspGinSerAlaValGly340LeuProGluGin355GlyPheAspAlaLeuLeuGlyTrpLeuAla150ArgProAlaThrValGly155GlyAlaGlyPro165GinLeuAlaAlaAsn245ValLeuAlaThrSer325ValProPro170ArgTrpValAspPro160AlaGly175HisHisAspAlaLeuArgArgTrpValArgGly185190AlaGinGlyCysAlaLeuAlaThrAlaCysLeu200205ArgGlyAspLeuProGlyAlaGluAlaSerAla215220LeuMetArgGlyCysGinGlyAlaLeu230ArgGluGinTyrGin235GluGinlieArgAsn250MetSerGlyLeuLeuTyr240ProThr255TrpArgAlaProProLysMetSerGlyLeuHis265270lieLysGluLeuAlaGlySerArgAspAlaTrp280285GinGlySerGluArgProAlaThrPheArgAla295300TyrAspSerHislieGlyValCys310ProSerLeuLeuGlyGly315AlaGinGlySerlieGlyGinPhe345AlaThrGinGinGlyGlu360Ala330ArgLyslieArgLeu350HisPhe320ThrArg335SerAlaProSer365<210>20<211>789<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1ValAsnArgProAl3Al3ThrSerAlaGlu50ProVal65GluLeuPro35ArgGlySerArgSerLeuArgGluSerLeuAlaArg5Ala_25SerAla20Arg10CysAlaAlaAlaAlaTyrAlaGlyProArgProAlaAla40AlaMetArgHisValArgAla55AlaGlySerArgLysAspAla70GlyAlaArgValPheGluThrLeu85SerHisAspLeuGlyProThrPro100GlyValValArgAsnVallieTyr105ArgGlu90Gly:LeuProGinGlu45ValAlaAla60ArgAspTyr75ValGinGluProArgTyr115HisAlaValAla130GlyAlaSerAsp145ArgAlaLeuArgValPheThrAsp180LeuMetAlaThr165GlyGly150AspThr135ValGly120HisProValAlaGlyValGinGluGluAlaThr30GinTyrGly15SerLeuPheSerGluProHisProGlySer125TyrAspSerValSer140AlaLeuLeu155ProValAsnAsp170Gly!LeuLieuGlyAla185LeuAlaLeu110lieLeuAla80ValAla95ThrGlyProGlyGinGlyProGluLeuArg190AlaAla160LeuVal175AlaPhePheAspArgHisProLeuAlaLysThrValGlyAlaAlaPheAsnPhe195200205GluAlaArgGlyThrGluGlyProValLeuMetPheGluAlaGlyPro210215220GlyAsnGlyProMetLeuGluGluLeuAlaArgThrGlyValProVal225230235240PheAlaSerSerLeuPheAspAlalieTyrArgArgMetProAsnAla245250255ThrAspPheAlaLeuValLysGluArgGlylieProGlyLeuAsnPhe265GlyPheAlaAlaTyrHisGly280ProSerAlaLeuGinHisGin295ArgLeuGlySerAlaAspLeu310315PhePheProAlaGlyArgGly325330ValValProLeuAlaValAla345GlyHisAlaLeuArgLysGly360ThrSerAlaLeuLeuLeuGly375ThrAlaPheThrAlaAlaMet390395HisGlyAspPheHisGlySer405410AlaValAlaGlyLeuAlaCysAlaAlaValLeu260AlaHislieGly275AspHisValGlu290AlaLeuAlaArg305GluAspMaValProGlyArgAla340AlaGlyGlyLeu355ArgLeuAlaGly370AlaAlaAlaGly385GluPheArgArg270ProLeuAspAsplie285GlyGluLeuAla!Leu300AlaGluLeuThrGly320L>ysLeulieArgVal335AlaGlyLeuLeuGly350GlyLeuThrGlyArg365ArglieAlaAlaGly380Gly!LysLeuAlaPro權(quán)GlyAsplieTyrAla415AlaGlyProGlyAsp420425430AlaProAlaAlaAla435AlaAlaSerAlaAlaAlaArgThrAlaAlaAlaThr440SerGlyLeuPro450ThrThr465SerAlaValGlyLeuSerAlaVal530AlaGly545LeuLeuTrpProLeuGinProThr485AlaAlaAla500ArgLeuLeuLeuAla455AlaGly470GlyThrAlaGlyGly460LeuLeuP:ro445GlySerArgLeuAlaAlaAlaPheLysLeuGly475GlyArgValProSerValArg490AlaAlaPro505JLeuAlaLeuAlaTyrLeuLeuAsp480HisAla495ValProGly515520ValAlaLeuGinLeuCysGlyGluLeuLeuLeu510ThrProArgLeuAlaGlyThr525AlaPro535LeuProArgArgThrArgArg550ValAla540AlaAlaGlyAlaGlyGly565GlyAlaProProP:roGluThrlie580SerThrAlaLeuTrpAlaAla555AlaAlaArgArgPheLeuArgArg570TyrLeuArgAla610PhePro625AspLeuAla595PhelieSerSer585AsplieuAla!LeuGlyLeu560GlyLys575ValHisSer600LeuGlyGluGinProGluLysGlyAspPro590GluSerAlaSerGluArgThr605LeuPro615GlyTrpGinArgAspPheLeu630GluArg620ProAlaProProAlaPro645AlaGlyArgArgArgLeuLeuLeu660HisAla635ValThrValLeuGluAspGlyAla650LeuArgHis665SerProArg670GluHisArgLeu640GlyGlu655GlyAlaArgGinLeuSerLeuAlaValValGlyAlaGlyVal675680ValGluAspGlyProTrpSerGluProSerProAla690695700TrpGluLeuTrpLeuHisAlaValProAspProGly705710715lieuGluLeuProSerSerGlyThrArg!LeuArg725730AspGlyLeuProGinGluLeuAlaAlaAlaProGly740745ThrGlylieProGluHislieSerProAlaAlaAla755760ThrTrpGlyAsnAlaSerLeuValValArgThrVal770775780GinGluGlyProAla785ArgArg685GluAsplieArgValLeuAspThrAsp750LeuAsp765HisLeuTrpArgAspSerValGlu720Arglie735ProAlaValAspProAla<210>21<211>73<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1MetThrThrLysGlyValThrGlyLeuAlaThrPheLeuGlyAspAsp151015AspLeuValProCysAlaValValValAsnAspAlaAlaGinTyrSer202530lieTrpProGinAspArgProValProAlaGlyTrpArgProAlaGly354045101PheGluGlyProArgSerAlaCysLeuGluHislieGluThrValTrp505560AlaGlyProThrProAlaProAlaAla6570<210>22<211>366<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1MetThrlieSerLeuGluAsnThrThrValGlyGinAsnProAlaGly1510GlyProProThrGlyLysAlaProLeuAspMetGluGlylieLeuLeuGin65!LeuAsn50liePhe35L>eu20GlyAlaSerAlaPheGluLeuPhe40Glu25GinTyr!LeuAsnLysProLeu55GlyAlaGluLeuGlyLeuAlaAspLeuGlyAlaTyrGinLeuGinArgGlyGin130Ala100PheLys115lieAspThr85Thr70AlaThrGlyMetValLeuAlaAspThrValPheThrGlu135Gly120SerGlu105PheLeu90LeuArg75ThrAsnAla45Gly!Leu60AlaAsnLeu30Ma15AlaTrpCysGluThr:LysProAsplieValGluAspLeu!LysThrGluGinAsp110CysGly95Asp!Leu80ArgTrpTyrGluTyrVal125LeuArgSerAsnSerAsnValGly140LeuArgArgValArgGlySerGlyArgAsp145150GluAsnProGinMetGluGinAlaPheTyr165170SerGluLeuAlaAsnGinHisLeuValGlu180185ThrSerLysLeuLeuAspCysGlyGlyGly195200AlaLeuAlaGinAlaAsnProHislieGlu210215ProProThrAlaProLeuThrGluLysLys225230SerAspArglieThrValLysProGlyAsp245250ProThrGlyTyrAspThrValMetPheAla260265ThrProGluGluAsnThrAlaLeuLeu275280ProGluGlyGlyArgVallieliePheAsn290295GlyAspGlyProValValAlaAlaLeuAsp305310LeuProAlaGluGlyGlyMetlieTyr325SerLeuThrLysAlaGlyPheAsnPro340345PheProGlyTrpThrProHisGlyVal355360!LeuTyrHisArgLeuHis155160LysTyrMetArgSerTrp175ValLeuAspLeuSerGly190AspAlaValAsnSerlie205AlaGlylieLeuGlulie220lieAlaGluAlaGlyLeu235240MetHisThrAspGluPhe255HisGinLeuVallieTrp270LysAlaTyrAsnAlaLeu285SerMetSerAsnAspGlu300SerValTyrPheAlaAla315320SerTrpAlaThrTyrGluGlu330335GluThrPheGinArglieAsp350lielieAlaThrLys365Arg<210>23<211>178<212>PRT<213>Streptomyceslavendulae314(NRRIil1002)<400>1Met1LeuLeuThrThrProAspGluAspLysValArglieValArgTrpProGluThr5!LysArgGluArgAla20lieLeu!>euValGluAsnGlyArgArg65!>euGlu50Val35AspTrpValArgThrAlaLeuAspValAspArg70GluAla55AlaAla40ProCys25Lys10ArgGluAlaGlyProProValSerArgLysAlaGlyLeuArgPheGly85SerArgAlaGluValAlalieValSerLeuAlaArg145TyrGly130lie100ValAla115SerArgArgGluSerAsnAlaAlaProLeuGly150Leu135LieuLieu120AspSer105AlaAsp90ValAla75GlyLeuThr45GluAsp60TyrArgAla30Ala15!LeuArgAspLeuLeuAlaAlaValProArgSerValLeuValArgHisLieuLeuHisValGlylieSerThr155ProAsp125MetArg140AlaTrpAsn110AlaVal95TyrThr80LeuGlyProArglieArgAo:gGlyArgLeuLeuGinGluLeuThrGinGluAlaGinGlyGinAsp165170Ala175Gly160GlyGlyThr<210>24<211>389<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1ValThrSerlieGlyAsnThrGluAspAlaGlySerLeulieSerValThrVallieThrGluValArg65ThrArg50GinTyr35工leGly20Gly510ThrGlyLeulieGlyThrSerlieGlyValArgValHis4025LeuGinAspValHis45Leu30Ala15AlaLeuGluSerAlaGluAlaArgGlyAlaGlyThrAlaLeuAlaProGlu6055ThrAspLeuAlaVallieAlaValAlaLeuAla70AlaGinArgArgAsp85AspValAlaSerValLysAlaAlaLeu100105GlyTrpAspAlaSerArgPhelieGlyArg90ProPro75lieProGinSerValAlaArgHisAlaAlaGlyGlyHisProGluPro145ThrHis130Trp_lie115120ProGlyProGlyAlaAlaArgSerAsp135lieLeuThrProSerAlsAspThrAlaArgArgLeu165150lieGlyLeuCysGly170Ser155AlaAla110lieAla125MetPheArg140GinGluThrAspValValTyr95AspArg80Thr!LeuGlyArgAlaLyslie175Ala160MetGinProGluAlaHisAspArgAlaValAlaMetThr180185HisLeuValSerSerLeuMetAlaArgLeuLeuArgLeu195ArgAlaLeuGluLeuSerGly210215AlaAlaGlyAspProAspLeu225230ThrAlaValLeuGlyAla245ThrArgArglieLeuHis260AlaAlaGluAla275ArgAla200SerGlyLeuArgAsp220TrpThrAsp工leLeu235GinHisAlaGluGly290GlyArg305GluLeuGluAspGluAsnLeu250LeuAlaAlaSerPro265ProAlaTyrArgGluLeuLeuArg280LeulieGlyGinAlaTyrLeu295LeuGinValLeuAspLys310AlaArgLeuLeuAlaAspVal325lieGlu!LeuTrp340AspSerAla345ProGinLys300LeuThr315SerGlu!Leu330AspSerProValLeuSerValAspAlaGlyMetAlaSerMetLeuAlaGlu370ValAla385355ArgGlyTrpProVal375360!LeuGlyGluSerMet380SerHisLeuPro190SerTyrAspHis205PheThrArglieGlySerAsnAla240AspLeuGluMet255GlyAlaAlaGlu270Arg!LeuLeuGlu285TyrGlyAlaAlaAspArgGluGly320AsnValAsnVal335ValGlyLeuAla350SerAspTrpLeu365SerAlaAspProGlyArgVal<210>25<211>484<212>PRT<213〉Streptomyceslavendulae314(NRRL11002)<400>1MetThrAsnLeuSerProAlaAspPheLysValAlaAspLeuSerLeu1510AlaAlaPheGlyArgLysGlulieThrLeuAlaGlu2025GlyLeuMetSerlieArgLysGluTyrAlaAlaAlaGlyLeu65CysAla50lie__Glu3540ArglieThrGlySerLeuHisMetThr55Val60HisGlu30GinPro45GinThr15MetProLeuAlaAlaValGluThrLeuValAlaLeuGlyAlaAspValArgTrpAla7075AsnliePheSerThrGinAspHisAlaAlaAlaAlalie9085GlyProAspGlyThrProGluAsnPro100105GinGlyValProVal110Ala95PheSer80ValAlaTrpLysGlyGluThrLeuGluGluTyrTrpTrpCysThrGluGinAlaLeuAsp145Lys115120ThrTrpProAsnThrProThrGlyGlyPro130135GlyGlyAspAlaThrLeuLeuValHis125AsnMetlie140GlyValGluLeuAspAlaGlyAlaGlyHislie!Leu180Ala165ThrLys150155ProAspProSerThrAlaAspSerGluLeuLeuAsn170ArgThrLeuGlyGluThr185190PheGlu175ProGlu160TyrGinLysTrpThrGin!LeuAlaSerGlulieArgGlyVal195200GlyValHisArgLeuTyrGluThrThr210!LeuPheProAlalie225AspAsnMetMetLysTyrGly245AlaThrAspValLeulieGlyGly260GlyAspValGlyLysGlyCys275lieValThrGluArgLeu215AsnValAsnAsp230CysArgHisSerLeulieAspGlylieLys265GluAlaVal235Leulie250ValAlaAla220ThrThrGluGluThr205GluGlyThrLeuValLysSerLysPhe240AsnArg255GlyTyrArgVallieValThrGlulie290295MetAspGlyTyrGinValAlaThrLeu305310AspliePhelieThrThrAla280AspProlieCysValCys270SerLeuArgGlyGinGlyAla285LeuGinAlaAlaAla300ValValGluThr325Met!LysHisGinAspMetAlaLys340PheAspAsnGlulieAspMet355AspGluValLysAspAspValValGlulieAla315320ThrGlyAsnLysAsplielieMetAlaSer330335lieValGlyAsnlieGlyHisValLys370GlyLysValLeulie385AsnAlaThrGlyPro375ValLeuSerGlu390ProSerPheValAla345AlaGlyLeuAlaLyslieGluGlylie360GinValHisThrGinThrLeuAla420His405GinlieGluLeuPhe425Asnlie350lieGlu365T:rpLysPhe380GlyArgLeuLeuAsn395MetSerAsnSerPhe410ThrLysProSerGlu430ProAspLeuGly400AlaAsp415TyrProThrAspValTyrValLeuProLysHisLeuAspGluLysValAlaArg435440445LeuHisLeuAspAlaLeuGlyValLysLeuThrThrLeuArgProGlu450455460GinAlaAlaTyrlieGlyValGinValGluGlyProTyrLysProAsp465470475480HisTyrArgTyr<210>26<211>313<212>PRT<213>Streptomyceslavendulae314(NRRIA1002)<■>1MetAlaAlaGluThrAlaGluPheProTyrArgAlaArg1015LeuGlyAspLeuLeuArgThrGlyPheProArgArg50AlaGly65AlaAlaPro35lie20LysThrAlaAlaGluAlaPheGlySerSerAspThrThr85Arg70LeuSer55AspGly40ProGin25Glu!LysSerTyrSerArgThrLeuSerPheValArgThrValArgProValAla90Glu75HisSer60ValTrp45ValThr!LeuThrHisSerlieAlaGinLeuArgHislielieGlyGinTyr100105ProGlyArg15PheGluPhe30ArgAlalieThrTyrGlyLysArglie80AlaValGly95AlaAspAla110GlylieLysGly130LeuVal145AlaPheArgHisGinMetAlaVal210ThrAsp225PheProSerGluArgLeuArgSer290AlaArg305ArgAspValLeuAla115GluTrpValProHis135SerLeuThrArgGlu150ProGluArgHisPro165PheValAlaLysCys180PhePheArgValAsp195ValGlyCysAsnThr215lieArg230MaArg!LeuArgGly120ProAspGlyAspProLeuArg140ProGly125HisAlaLeuGlyAspPheThrValGly155AspArgSerPro170ArgAlaGly185AspTyrLeu200ProTrpAspSerAlaAspArgLeulielieProTyrAla190ArgAsp205工leMetValArgGinAspGluLeu245GlyHisArg260LeuAlaGlu275ThrAlaThrGinProAlaAla220ArgPheAlaGluLeuSerAsn235AlaArgLeuAsp250lieGlyValAspHisAlaThr265GlyLeuAlaArgAspGlyAlaValGlu295AlaAla310Pro280lieHisTyrHisArgAsnLeu300AlaMet270lieThr285GlyLeuAspProAlaGluValAla160Asplie175lieThrArgValProAlaAlaSer240AspPro255AlaAspIjSUAsnSerProArgHis<210>27<211>1127<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1MetGluAspPheThrArgProAsp20GlyValAspCys35LeuAlaGluTyr50GlyAlaArglie65GlyArgGinArgProThrLeuGly100ValAsnAlaGlu115GluThrThrGin130ArgArgAlaLeu145ValThrValGluGin5lieValAspAlaTrp85HisGlyAspSerAla165AlaAlaLeuThrAla180AsnCysAlaThrGlyGinLeuGluGlyValAlaAsnGluThrAsn40lieProGlu55ArgGinVal70ValLeuGlyVal25ThrCys10HisPheArgAsnAsnGluValLeuAsnVal15ArgGluTyrPheAlaVal30GlyAlaAsnPheAlaAla45LeuSerGluAlaPheGlu60AlaAspGluPheThrAlaSer75GlyProSerGlyAlaAspThrLysMet90lieAspTyrAlaGinlieArg105LeulieSerGly120LeuLeuGin135AsnCysGly150ThrGlyThrLeuGluProProAlaGluGluAspMetLeu185MetAlaAla140MetPro155LeuLeuGly170GlylieAspSerGluHisGlyThrAspAla110Ala!>eu125lielieLys95TyrThr80LeuGinLeuValGlyAlaVallieCys195200SerGluMetlie190LeuArg205lie175GlySer160GlyLeuTyrLeuAsnAlaArgLeuThr!LysAspAlaGin245GlyCysCys260ValArgGly275AlaAlaSerMetAlalieGluAlaMet325AspGinlie340ValGlyArg355ArgPheAlaThrAla370ValProVallieGin385lieAsnSerValAsn405AlaLysValThrArg420LeuThrlieAspGluAlaArg210ProVal225LeuAlaValGlyGluArgProGly290SerTyr305PheArgAlaArgAspTyrlieProLeuSerCysMet215GinGlyAlaHisTyrPro230235GluThrPheValArgGlu250ThrProGluHisGlyThrThrAlaLeuTyr295GlyGlu310LeuGluVal280GinArgAlaArgGluGlyAspGlyVal360SerThrEeu375AlaGlyLeu390TyrGluAspLeuAlaGinGluGlyGin265ThrGluArgThrValProThrAsnAla315ArgTrpAsp330AlaHisMet345AlaAspMetProlieValGluLysLeu395GlyAspGly410GluHisGly425AlaArgSerPro220LeuTyrLeuSerPhe300AsnGlySerAsnAlaGlyLeuSerAlaAlaGlu240GlyLeuSer!Leu255ArgGinValVal270ProGinProGlu285ArgGinAspThr!LysLys320GluMet335CysValAlaGlyThrGluAlaVal400ArgPhe415lieAlaLysVal435440AspCysValLeuAspLeu350GinGlu!Leu365LeuAspSer380GlyGlyArgProGluSerAlaAlaLeu430ValGluHis445AlalieAlaGlu450Arg465GlyGluSerAspGinGluGluArgGluSerAsnSerVal545ValLeuLys500lieSer515PheLeuArgLeulielieLeulie470SerArgLys485ArgArgHisGluAspLeuThrAspThrLeuAspGlyProlie490ValThr475AlaGinVal530HisSerThrAlaGlu505PheGlyLeuAsnProAlaAla520ValAspLysAlaGlyGluCys535AlaSerLyslieLeuProlieAla550AspLeuSerAsnTrpGlylie460PheThrlieCysThr480ThrlieGluSerlie495ThrThrLeuGlyI<eu510ArgValLeuLeuAsn525LeuAspSerAlalie540PheAspGluAspProLeu565LysArg555lieTyrAspArgArgGluGlyLeuMetAlaLeuGin580SerLeuLysAlaGlyArgAlaGlu595600LeuGinArqArcrlielieLeu585Glu570PheGluLeuLeuGluAla625AsnArg610AspLeuAspGluAspThrLeuSerGlylieu645GinGinMet660LysThrAlaValAla675Arglie615AlaLeuAla630GluGlyMetLeuProPheTyrLeuGlu680AspGlyGluSerArgGlyValAsn590AlaLieuPro605LysAsnGly620AlaLeuAspPro635L>ysValValGlyGluLeuVal665Pro650LeuGinHisMetSerAlaGlu670GluLysThr685GluGin560AspTyr575ThrLysLeuAspLeuGlulieVal640PheGly655ValMetAspAlaAspGly690Asplie705AsnValAlaAlaValLysArgLys770ThrArg785ValArgLeulieLysGinGluPro850ValPro865LeuLysGinTrpLeuValLysGlyThrGlyLysAsnValAsn工le725GinGluHis740SerThrVal755LeuAlaAlaMaTyrValTyrAlaArg805AlaValLys820ArgArgVallieValLeu695LeuValAsp710GlylieLysLysAlaAsplieMetLys760AspTyrPro775GluGinAsp790AspAlaPheArgGly工leAlaLysArgAlaThrlie工leGinPro730Vallie745GluAsnValArg700LeuThr715ValSerGlyMetLeuGluGlyAspValHisVallieLeuLeuHisGluPro825AspGly810GlyGlu795LeuGly780lieAsnAsnAlalieSerGly750GluLeu765GlyAlaTyrGluArgLeuMetAlaGluThrProAla835840GluGluThrGlyGlyArg855ThrProProPheTrpGly870AspTyrAlaSerTrpLeu885GlyLeuLysGinAlaArg900GluSerGluGlyArgPro915920SerAspValThrArgAspAla905ArgVal875GluGly890GlyGlyLeuArgLeuPro830LeuGin845AlaValAsp860工leLysGlyAlaLeuPheAlaThrTyr910Gly:LeuLeu925GlyTyr720!LeuGlu735;Leu:LeuAsnGinAlsLeuGlyGlu800AspAla815ProIjeuValGluAsnProliePro880LysGly895GluGluGluLysLeuHisThrGluAsnLeu930ProCysValSerLysGly945950AsnGluArgThrArgPhe965LeuCysLeuAlaAspPhe980VallieGlyLeuGinVal995ThrAlaLysLeuPhe1010LeuHis1025TrpHis1040ProGlu1055ArgPhe1070AlaLys1085LeuSer1100lieVal1115GlyLeuSerAlaArgValLysValGluSerLeuGlylieAlaAspGluGluPhe工leHisHisLeuGluAlaAlaValValTyrGlyTyrPhe935940GluAspLeulielie!LeuAspGluGinGly955960ThrPheProArgGinArgArgGlyArgArg970975PheArgProGluGluSerGlyGluThrAsp985990ValThrValGlySerArglieGlyGluAla10001005GluMaAspSerTyrArgGluTyrLeuGlu10151020ValGinLeuAlaGluAlaLeuAlaGluTyr10301035ArgAlaGluLeuGlyPheGlyGlyGluAsp10451050AspMetPheAspLeuLysTyrArgGlyAla10601065TyrGlyAlaCysProAspLeuGluAspArg10751080LeuLeuGinProGluArglieGlyValHis10901095GinLeuHisProGluGinSerThrAspAla11051110ProAspAlaThrTyrPheAsnAlaArg11201125<210>28<211>325<212>PRT<213>Streptomyceslavendulae314(NRRIj11002)<400>1MetArglieAlalieThrGlySer工leAlaThrAspHisLeuThrThr1PheProlieSerValAla50LeuLeu65LeuArgGlyLeu35AlaArg20Ser5PheAlaGluGinPheLeuValLeu25Asp10工leProAspLeuGluValAsp40AsnlieAlaPheGlyLeuGlyArgArgLeu30ArgArg45GlyPhe15AspThrGlyGlyValGlyAla———LeuGlyPheSerPro5560GlyThrAspPheProGluTyrAspSerAla7075GluHisAspThrGlySerValArgVal8590SerAlaThrThrAlaArgPheMetCysThrThrAspAlaAspGin100ThrSerPheAspVal130AspAsp145GlylieTyr115Val105GlyAlaMetValGluAlaArgGlu120ArgArgSerGlyProGinAspArg135ProGluAlaMetLeuArgHisSerAsnGin110lieAsp125ValValVal140LysThrArgArg95lieTrp80HisAlaLeuAlaAlaProProLeu—Ala150155AlaAspProSerGinGinLeuAlaArgAla_165170GlyAspGluThrArgArgLeuValAspGlyAla180185GluAlaAla!LeuLeuGinGluCys200AsnGluTyr195GluLeuLeu190ThrGlyTrp205GluLeu175PheLeu160AsnThrThrAlaHisGinVal210AspGlyVal225AlaValProPheArgAlaArgAlaAla275ValGlyThr290lieAlaAsp305LeuGinAlaLeuGinArgValGlyThrTrplielieThr215220ThrlieHisAsnThrAspThrProSerVal230235AlaLysAsnValAlaAspProThrGlyVal245250GlyTyr!LeuAlaGlylieGlyGinGlyLeu260265GinLeuGlyCysAlaLeuAlaThrThrVal280285GinGluTyrArgLeuAspArgSerSerLeu295300AlaTyrGlyAspGlyProAlaAlaGlu工le310315LysPro325CysGlyGlySerValPro240GlyAspGly255SerGinGlu270LeuGluSerThrAlaArgThrProHis320<210>29<211>401<212>PRT<213>Streptomyceslavendulae314(NRRL11002)<400>1MetGlyArgLeuPheThrSerGluSerValThrGluGlyHisProAsp151015LyslieAlaAspArg工leSerAspThrlieLeuAspAlaLeuLeuArg202530GluAspProSerSerArgValAlaValGluThrLeulieThrThrGly354045GinValHisValAla50AlaGinLeuValArg65AlaLysGlyGluVal55AspAlalieLeuAsplie7075GlyAlaSerCysGlyValThrThrLysAlaTyrAlaProlie60GlyTyrAspGlyPheAspGlyAlaSerCysGly8590AlaGinSerProAsplieAlaGinGlyValAsp100105ArgValGluGlyGluGlyAspGluLeuAspLys120AlaCysAspGlu115LeuMetPheGinGlyLeuMetPheGlyTyr130135ProLeuProlieHisLeuAlaHisArgLeu145150ValArgLysAspGlyThrlieProTyrLeu170ArgSer155ArgValValVal195LeuAspValLeuThr210LeuValGinGluGlylieLysLeuAspThr225ValAsnProArgGlu215lieLys230ArgPheThrGly245GlyLeuThrGlyArgLyslielie260GlyGlyAlaPheArgHisGly275Ser280SerlieSecAspGly165GinValThrlieGluTyrAspGlyAspArgAla180185SerThrGinHisAlaAlaAspValAla200HisValValGluAsp235GlyGlulieGly250lieAspThr265GlyLysAspSerSer80lieGly95ThrAlaTyrGluGin110GinGlyAlaGlyAsp125ThrProGluLeuMet140ArgArgLeuSerGlu160ProAspGlyLysThr175ValArgLeuAspThr190AspLeuAspAlaLeu205HisValLeuLysGin220GlyTyrArgLeuLeu240ProMetGlyAspAla255TyrGlyGlyMetAla270ProSerLysValAsp285ArgSerAlaAlaTyr290LeuAlaSerAla305LysGlyAlaValAspArgProGin370TrpAlaThr385Val<210><211><212><213>GluProVal325ThrSerLys340AlaAlalie355ThrAlaAlaGluArgThr30334PRTStreptomycesAlaMet295ArgCys310GlyLeulieGlulieArgTyrGly375AspArgArgTrpValAlaGluValGinVal315PheValGluThr330GluAlalieGly345AspLeuAspLeu360HisPheGlyAxgAlaAspAlaLeuLysAsnVal300AlaTyrAlaPheGlyThrGinValPhe350!LeuArgPro365GluLeuThr380ArgArgVal390395ValAlalieGly320AlaLys335AspLeu工leTyrAspPheAlaGly400lavendulae314(NRRL11002)<400>1MetArgArgMetLeuTyrAlaGinLeuProSerArgAlaLeuValVal151015ValAlaGinLeuGlylieAlaAsplieLeuAlaGluGlyProAlaAsp202530lieSerThrLeuAlaGluArgThrSerThrAspAlaValAlaLeuAla354045ArgLeuLeuArgGlyLeuAlaValPheGlyValPheGluGluGlyAla505560GluGinValTyr65HisProAlaSerGlyAlaAlaTrp100ProPheGluArgSerArgGlySerLeuThrProLeuGlyGluAlaLeuThrSerGly707580AlaLeuProSerAlaThrLeuValAlaGlyGinPhe859095GlyAspLeuLeuGluThrValArgThrGlyGinSer105PheTh:r115Glu!LeuArgAspGlu130AlaLeuGlu!LeuAsp145ProThrValValValSerLeu120AlaValPheAspAsp135GlulieLeuArgAla150ValGlyAspValGlyGlySerAsp165170lieLeuSerAlaHisProAsplieSerGlylieVal185GlySerThrSerLeuGinAlaGluArgProThrAla200GlyAspPheSerGin140lieAspPheSer155GlyThrPheLeu110HisMetGluGin125GlyArgGlyLeuSerAla180ThrSer195TyrSerValGlyArg210GlyGlyAspLeuTyr225AspArgAlaValGin245AspAlaThrLeuMetValVal260ArgAspGluArgLeuHisThr275PheGlyGlyAlaThr215LeuLeuSerHislie230工leLeuSerLeu290AsnGly295ArgThrCys250AspLeulieAlaAla265AlaAlaLeuMetAsp280GlyGinGluArgThr300PheAsp220LeuHisAspTrp235ArgAlaAlaMetAlaTyr160ArgArg175PheAspLeuPro190AspProLeuGlu205SerLeuProGluAspAsp240SerAsp255AsnArgGlyGin270LeuTyrMetLeu285AlaAlaGinValGluValLeuLeuSerLysAlaGlyPheArglieThrArgValAspSer305310315320LeuProSerGlyMetAsnVallieArgAlaValArgAlaAla325330<210>31<211>376<212>PRT<213>Streptomyceslavendulae314(NRRIL11002)<400>1ValThrGluAlaGlyLeuArgValThrAsp65ThrLeu50Glylie35GinAsnThr20GluProArg5ValAlaArgProProLeuValValGlyAlaSerAla10HisGluLeuAlaArgArgGlyAlaSerAlaGly4025ProAlaThrThrGly45Leu30Pro15ProValAlaValProArgThrLeuGluValLeuAspGinMetAlaValliePhe!LeuArgThr!LeuProArgAlaAlaValGlulieThr130Ala100GlyGly115CysAspThr85LeuHis70Leu55GlyArgLysGly60ThrLeuAlaAspGlu75HisProTyrThrLeuVallieAspLeuArgGluAlaArgAspAla105His90ValValAspLeuProAlaAlaAxg120GlyPheProGinArgSerAsp135Pro140Gly110ProAla125ValGlyArg95ValHis80AlaSerCysLeuArgArgLeuThrValValAlaAspHis145150TyrAsnLeuAlaTrpLysLeuAlaMet165ProArgLeuLeuAspThrTyrAsp180GlylieArgAlaAspProAspThrGlylieArg155GluGinGlyLeuVal170GluHisHisArgPro195HisProSerGlu185LeuAsnAlaArgThr210SerSer225ValGlyLeu200SerGlyLeuGlyHisArg215LeuThrThrArgAsnThrLeuAsp230PheGluProThrGinAla160ValGly175ProAlaGluArgGly190ValProAlaLeuArg205LeuThrTyrArgAsn220LeuSerGlyAlaValGly245ArgValThrGluAlaGlyProAla260ArgTrpSerLeuPro250AsnPhe235ValProAlaProLeuArgAspPro275GinGlySerProAsp290AlaTyrLeuSerVal305LeuCysArgCys265PheAlaThr!LeuArg325l>eulieArgProAspGlyTyrVal340LeuAspArgAlaLeu280SerValAlaAlaThrAla295ArglieLeuGlyAla310AlaAlaLeuGlyTyrHis240GlySer255LeuArgSerGluLeu270AspProAlaGlyThr285AlaAlaGinTyrSer300Ser!LeuProSerGinAla355ArglieGlyAlaAsn370ProCys375Leu360GlyThr330AlaAlaArg345SerProLeuAla315ProProGlyGlyAspPro320TrpLeu335GlySer!LeuLeuThr350SerValAlaProLeu36權(quán)利要求1.一種抗腫瘤活性的抗生素-番紅霉素(Saframycin)的生物合成基因簇,其特征在于編碼番紅霉素生物合成所涉及的30個基因,具體為1)非核糖體聚肽合成酶基因,即sfmA,sfmB,sfmC共3個基因sfmA位于基因簇核苷酸序列第16971-22481個堿基處,長度為5511個堿基對,編碼非核糖體聚肽合成酶,1836個氨基酸;sfmB位于基因簇核苷酸序列第22618-25866個堿基處,長度為3249個堿基對,編碼非核糖體聚肽合成酶,1082個氨基酸;sfmC位于基因簇核苷酸序列第25863-30320個堿基處,長度為4458個堿基對,編碼非核糖體聚肽合成酶,1485個氨基酸;2)前體分子3-羥基-5-甲基-O-甲基-酪氨酸的生物合成基因,即sfmO5,sfmM2,sfmM3,sfmD共4個基因sfmO5位于基因簇核苷酸序列第37069-35900個堿基處,長度為1170個堿基對,編碼預(yù)苯酸脫氫酶,389個氨基酸;sfmM2位于基因簇核苷酸序列第34031-35131個堿基處,長度為1101個堿基對,編碼甲基化酶,366個氨基酸;sfmM3位于基因簇核苷酸序列第46245-45241個堿基處,長度為1005個堿基對,編碼甲基化酶,334個氨基酸;sfmD位于基因簇核苷酸序列第30317-31414個堿基處,長度為1098個堿基對,編碼5-甲基-O-甲基-酪氨酸-3-羥化酶,365個氨基酸;3)肽骨架環(huán)合酶基因,即sfmcy1,sfmCy2共2個基因sfmCy1位于基因簇核苷酸序列第3166-4704個堿基處,長度為1539個堿基對,編碼氧化還原環(huán)合酶,512個氨基酸;sfmCy2位于基因簇核苷酸序列第15388-13874個堿基處,長度為1515個堿基對,編碼氧化還原環(huán)合酶,504個氨基酸;4)番紅霉素的修飾基因,即sfmM1,sfmO1,sfmO2,sfmO3,sfmK,sfmO4,sfmO6共7個基因sfmM1位于基因簇核苷酸序列第11059-10460個堿基處,長度為600個堿基對,編碼N-甲基化酶,199個氨基酸;sfmO1位于基因簇核苷酸序列第1370-2284個堿基處,長度為915個堿基對,編碼氧化還原酶,304個氨基酸;sfmO2位于基因簇核苷酸序列第10384-8819個堿基處,長度為1566個堿基對,編碼單氧化酶,521個氨基酸;sfmO3位于基因簇核苷酸序列第12372-13559個堿基處,長度為1188個堿基對,編碼細(xì)胞色素P-450羥化酶,395個氨基酸;sfmK位于基因簇核苷酸序列第13613-13798個堿基處,長度為186個堿基對,編碼鐵氧化還原蛋白,61個氨基酸;sfmO4位于基因簇核苷酸序列第16947-15520個堿基處,長度為1428個堿基對,編碼細(xì)胞色素P-450羥化酶,475個氨基酸;sfmO6位于基因簇核苷酸序列第46553-47683個堿基處,長度為1131個堿基對,編碼FAD依賴的氧化酶,376個氨基酸;5)S-腺苷化甲硫氨酸合成相關(guān)酶基因,即sfmS1,sfmS2,sfmS3,sfmS4,sfmS5共5個基因sfmS1位于基因簇核苷酸序列第38550-37096個堿基處,長度為1455個堿基對,編碼S-腺苷-L-高半胱氨酸水解酶,484個氨基酸;sfmS2位于基因簇核苷酸序列第39521-38580個堿基處,長度為942個堿基對,編碼亞甲基四氫葉酸還原酶,313個氨基酸;sfmS3位于基因簇核苷酸序列第42940-39557個堿基處,長度為3384個堿基對,編碼5-甲基四氫葉酸-高半胱氨酸S-甲基轉(zhuǎn)移酶,1127個氨基酸;sfmS4位于基因簇核苷酸序列第44013-43036個堿基處,長度為978個堿基對,編碼碳水化合物激酶,325個氨基酸;sfmS5位于基因簇核苷酸序列第45220-44015個堿基處,長度為1206個堿基對,編碼S-腺苷甲硫氨酸合成酶,401個氨基酸;6)調(diào)節(jié)基因,即sfmR1,sfmR2,sfmR3共3個基因sfmR1位于基因簇核苷酸序列第786-1373個堿基處,長度為588個堿基對,編碼TetR家族轉(zhuǎn)錄調(diào)節(jié)因子,195個氨基酸;sfmR2位于基因簇核苷酸序列第2487-2996個堿基處,長度為510個堿基對,編碼轉(zhuǎn)錄調(diào)節(jié)因子,169個氨基酸;sfmR3位于基因簇核苷酸序列第35223-35759個堿基處,長度為537個堿基對,編碼轉(zhuǎn)錄調(diào)節(jié)因子,178個氨基酸;7)抗性基因,即sfmG,sfmH共2個基因sfmG位于基因簇核苷酸序列第4738-6177個堿基處,長度為1440個堿基對,編碼移位酶,479個氨基酸;sfmH位于基因簇核苷酸序列第8748-6289個堿基處,長度為2460個堿基對,編碼紫外修復(fù)蛋白,819個氨基酸;8)還包括4個功能不明確的基因,即srmE,sfmF,sfmI,sfmJsfmE位于基因簇核苷酸序列第31411-33780個堿基處,長度為2370個堿基對,編碼肽酶,789個氨基酸;sfmF位于基因簇核苷酸序列第33777-33998個堿基處,長度為222個堿基對,編碼MbtH類似蛋白,73個氨基酸;sfmI位于基因簇核苷酸序列第11694-11212個堿基處,長度為483個堿基對,編碼未知功能蛋白,160個氨基酸;sfmJ位于基因簇核苷酸序列第12188-11691個堿基處,長度為498個堿基對,編碼吡哆胺-5’-磷酸氧化酶相關(guān)蛋白,165個氨基酸。2.根據(jù)權(quán)利要求1所述的番紅霉素的生物合成基因簇,其特征在于編碼的非核糖體聚肽合成酶包含模塊或結(jié)構(gòu)域?;o酶A連接酶結(jié)構(gòu)域AL、肽酰縮合酶結(jié)構(gòu)域C、腺苷化酶結(jié)構(gòu)域A、肽?;d體蛋白PCP、端基還原酶結(jié)構(gòu)域RE。3.—種根據(jù)權(quán)利要求1所述的番紅霉素的生物合成基因簇的用途,其編碼蛋白催化合成抗生素番紅霉素。4.根據(jù)權(quán)利要求3所述的番紅霉素的生物合成基因簇的用途,其編碼蛋白催化合成S-腺苷化甲硫氨酸。5.根據(jù)權(quán)利要求3所述的番紅霉素的生物合成基因簇的用途,其編碼蛋白催化合成雙四氣異喹啉生物堿核心骨架結(jié)構(gòu)。6.根據(jù)權(quán)利要求3所述的番紅霉素的生物合成基因簇的用途,其編碼蛋白催化合成3-羥基-5-甲基-0-甲基酪氨酸或5-甲基-0-甲基酪氨酸。7.根據(jù)權(quán)利要求3所述的番紅霉素的生物合成基因簇的用途,其中的基因在熒光假單孢菌/^wcto附owasyww^ceraA2-2中進行異源表達可以催化番紅菌素B(safracinB)的生物轉(zhuǎn)化得到氧化產(chǎn)物氨基化的番紅霉素S,該氧化產(chǎn)物經(jīng)氰化鉀處理可以得到番紅霉素Y3。8.根據(jù)權(quán)利要求3所述的番紅霉素的生物合成基因簇的用途,對其中的基因進行遺傳改造獲得的突變體經(jīng)生物發(fā)酵使得番紅霉素的產(chǎn)量發(fā)生改變。9.根裙權(quán)利要求3所述的番紅霉素的生物合成基因簇的用途,其中的NRPS基因在大腸桿菌中獲得異源表達,經(jīng)分離純化得到的重組蛋白SfmA、SfmB和SfmC分別識別并活化丙氨酸、甘氨酸和3-羥基-5-甲基-0-甲基酪氨酸。10.根據(jù)權(quán)利要求3所述的番紅霉素的生物合成基因簇的用途,其中的氧化還原酶、環(huán)合酶基因s/wO/、s/w(92、^nO、s/mO么<w06、s/附qw、s/wC>2,甲基化酶基因s力"Mh:>M2、^wMJ及5-甲基-0-甲基-酪氨酸-3-羥化酶基因s/mZ)均在大腸桿菌中獲得異源表達,經(jīng)分離純化得到的重組蛋白SfmOl、Sfm03、Sfm04、SfmM2、SfmM3和SfmD。全文摘要本發(fā)明是四氫異喹啉生物堿家族中一種鏈霉菌產(chǎn)生的具有抗腫瘤活性的抗生素——番紅霉素的生物合成基因簇的克隆、測序、分析、功能研究及其應(yīng)用。整個基因簇共包含30個基因3個非核糖體聚肽合成酶基因、4個前體分子3-羥基-5-甲基-O-甲基-酪氨酸合成基因、2個肽骨架環(huán)合酶基因、7個番紅霉素的修飾基因、5個與S-腺苷化甲硫氨酸合成相關(guān)基因、3個調(diào)節(jié)基因、2個抗性基因和4個功能不確定的基因。通過對上述生物合成基因的遺傳操作可阻斷番紅霉素的合成,可使其產(chǎn)量發(fā)生改變;還可得到番紅霉素的結(jié)構(gòu)類似物。該基因簇可用于四氫異喹啉生物堿家族化合物的基因工程、蛋白表達、酶催化反應(yīng)等,也可用于尋找和發(fā)現(xiàn)用于醫(yī)藥、工業(yè)或農(nóng)業(yè)的化合物或基因。文檔編號C12N15/31GK101157929SQ20071003708公開日2008年4月9日申請日期2007年2月2日優(yōu)先權(quán)日2007年2月2日發(fā)明者文劉,唐功利,磊李申請人:中國科學(xué)院上海有機化學(xué)研究所