ID   SPAC323    standard; DNA; FUN; 15458 BP.
XX
AC   AL109988;
XX
SV   AL109988.1
XX
DT   24-AUG-1999 (Rel. 60, Created)
DT   07-MAR-2000 (Rel. 63, Last updated, Version 3)
XX
DE   S.pombe chromosome I cosmid c323.
XX
KW   ABC_tran domain; atp transporter; conserved hypothetical protein;
KW   metal transport atp-binding protein; methyltransferase;
KW   proteasome component PUP2 homolog; ThiF family;
KW   Uncharacterised protein family domian; UPF0013 domain.
XX
OS   Schizosaccharomyces pombe (fission yeast)
OC   Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes;
OC   Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces.
XX
RN   [1]
RP   1-15458
RA   Wood V., Rajandream M.A., Barrell B.G., Bothe G., Ramsperger U., Pohl T;
RT   ;
RL   Submitted (23-AUG-1999) to the EMBL/GenBank/DDBJ databases.
RL   European Schizosaccharomyces genome sequencing project, Sanger Centre, The
RL   Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail:
RL   barrell@sanger.ac.uk and GATC GmbH, Fritz-Arnold-Str 23, D-78467 Konstanz,
RL   Germany
XX
DR   GOA; O14084; O14084.
DR   GOA; Q9UT93; Q9UT93.
DR   GOA; Q9UT94; Q9UT94.
DR   GOA; Q9UT95; Q9UT95.
DR   GOA; Q9UT97; Q9UT97.
DR   SPTREMBL; Q9UT91; Q9UT91.
DR   SPTREMBL; Q9UT92; Q9UT92.
DR   SPTREMBL; Q9UT93; Q9UT93.
DR   SPTREMBL; Q9UT94; Q9UT94.
DR   SPTREMBL; Q9UT95; Q9UT95.
DR   SPTREMBL; Q9UT96; Q9UT96.
DR   SPTREMBL; Q9UT98; Q9UT98.
DR   SWISS-PROT; O14084; YER1_SCHPO.
DR   SWISS-PROT; Q9UT97; PSA5_SCHPO.
XX
CC   Notes:
CC   Details of yeast sequencing at the Sanger Centre are available on
CC   the World Wide Web.
CC   (URL, http://www.sanger.ac.uk/Projects/S_pombe/)
CC   During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced
CC   by the Sanger Centre. The sequencing of the S. pombe genome is now
CC   being continued with funding from The European Commission.
CC   Fourteen European sequencing laboratories, including the Sanger Centre,
CC   are participating in the project.
CC   Protein coding regions (CDS) have been predicted with the help
CC   of computer analysis using the Genefinder program in PomBase
CC   (an ACEDB database) with additional predictions for the
CC   branch-acceptor sites supplied by the program Sp3splice.
CC   CAUTION: It is possible that for any individual CDS we may have
CC   underestimated or overestimated the number of introns/exons or
CC   we may not have chosen the correct splice donor/acceptor sites.
CC   CDS are numbered using the following system eg SPBC25H2.01c.
CC   SP (S. pombe), B (chromosome 2), c25H2 (cosmid name),
CC   .01 (first CDS), c (complementary strand).
CC   The more significant matches with motifs in the PROSITE
CC   database are also included but some of these may be fortuitous.
CC   The length in codons is given for each CDS.
CC   IMPORTANT: This sequence MAY NOT be the entire insert of
CC   the sequenced clone. It may be shorter because we only
CC   sequence overlapping sections once, or longer, because we
CC   arrange for a small overlap between neighbouring submissions.
CC   Cosmid c323 is overlapped at the 3' end by cosmid c2F3,
CC   EMBL entry SPAC2F3, EMBL accession number Z99165.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..15458
FT                   /chromosome="I"
FT                   /db_xref="taxon:4896"
FT                   /organism="Schizosaccharomyces pombe"
FT                   /strain="972h-"
FT                   /clone="cosmid c323"
FT                   /map="IR"
FT   misc_feature    1..103
FT                   /note="nominal overlap with cosmid SPAC926, EM:AL110469 S.
FT pombe chromosome 1" FT CDS join(complement(1668..1788),complement(579..1543)) FT /db_xref="SPTREMBL:Q9UT98" FT /label=SPAC323.01c FT /note="SPAC323.01c, len:361, SIMILARITY:Saccharomyces FT cerevisiae, Q08928, chromosome xvi reading frame orf FT ypl188w., (414 aa), fasta scores: opt: 667, E():0, (37.0% FT identity in 373 aa)" FT /gene="SPAC323.01c" FT /product="hypothetical protein" FT /protein_id="CAB53404.1" FT /translation="MIRAANGFRISVRNTAVCLAPNFRQLKGFSIINLGSLQYFQCASP FT QSIGGKSNLKQLQWPKPPKNILILKKRMDERVDHCFETLVQHLQQTYPDICIITETDVA FT KKFSYLNLYTWTEISDLEQKVDAIITVGGDGTILHAASLFARSGMPPILSFSLGTLGFL FT LPFDFGSFQTAFADFYNSRSFVLMRMRLRVAMKTKLYNESIYAMNEMHIHRGLSPHMAV FT LKVFVNDKFLTEAVADGLIISTPTGSTAYSLSSGGPIVHPSINALLLTPICPNSLSFRP FT VLFPDTFKISIETSNKSRVRPQLSIDGRPLGLTDIGQRIDITSVKDNAIPCIIRSHKED FT DWVSDIVSLLRWNHPFHRKGW" FT CDS join(complement(3346..3364),complement(2866..3307), FT complement(2625..2727),complement(2382..2561)) FT /db_xref="GOA:Q9UT97" FT /db_xref="SWISS-PROT:Q9UT97" FT /label=SPAC323.02c FT /note="SPAC323.02c, len:247, SIMILARITY:Saccharomyces FT cerevisiae, PRCZ_YEAST, proteasome component pup2, (260 FT aa), fasta scores: opt: 1100, E():0, (66.7% identity in 246 FT aa)" FT /gene="SPAC323.02c" FT /product="proteasome component PUP2 homolog" FT /protein_id="CAB53405.1" FT /translation="MFMTRSEYDRGVNTFSPEGRLFQVEYAIEAIKLGSTAIGVKTKDA FT VVLGVEKRLTSPLMESHSVEKLFEIDSHIGCAISGLTADARTIIEHARVQTQNHRFTYD FT EPQGIESTTQSICDLALRFGEGEDGEERIMSRPFGVALLIAGIDEHGPQLYHSEPSGTY FT FRYEAKAIGSGSEPAKSELVKEFHKDMTLEEAEVLILKVLRQVMEEKLDSKNVQLAKVT FT AEGGFHIYNDEEMADAVAREQQRMD" FT misc_feature complement(2562..2578) FT /note="ctaacaatgtcgttcag, splice branch and acceptor" FT misc_feature complement(2619..2624) FT /note="gtacgt, splice donor sequence" FT misc_feature join(complement(2866..3221),complement(2643..2727)) FT /note="Match to PF00227 proteasome, Proteasome A-type and FT B-type Score 176.99" FT misc_feature complement(2728..2743) FT /note="ctaacttcctatacag, splice branch and 
acceptor" FT misc_feature complement(2860..2865) FT /note="gtaagt, splice donor sequence" FT misc_feature complement(3308..3318) FT /note="ctaatctttag, splice branch and acceptor" FT misc_feature complement(3340..3345) FT /note="gtaagt, splice donor sequence" FT CDS join(complement(5536..5597),complement(5010..5493), FT complement(4821..4957),complement(3733..4777)) FT /db_xref="SPTREMBL:Q9UT96" FT /label=SPAC323.03c FT /note="SPAC323.03c, len:575, SIMILARITY:Arabidopsis FT thaliana, Q9ZVM4, t22h22.10 protein., (530 aa), fasta FT scores: opt: 129, E():0.11, (24.1% identity in 307 aa)" FT /gene="SPAC323.03c" FT /product="hypothetical protein" FT /protein_id="CAB53406.1" FT /translation="MEEAINNVLLLLREDSISLETVLWETHYVLLNLHNEQNLRLVVAQ FT LIACGRIWDYWNEHRSEYFAFWVELISRKKVTNNGLPFSSFVKSIVGILEVDASNEILC FT FRRICLLCVFYKLLSCDHIVNLQYPLKRAVSKALSKQIKTHQFSGFEANFLLQQLFASV FT DSSASDISFDAFSLLPHLLKWEEIVWSGFINYLDTEHKRDSLPTAVLCHLLLRLSTYQQ FT ISIIKRLITLIDKAIPSWKSRSSDSKYNDHQFVKKNFFSIIMVLESLAKSQYRKSNVLA FT ADRSLCEYIILTLFHMEYLFSFVASSWSTLDFVITTCLSRVAQPAKFISETVREAIIDS FT IQLEGYVDLQKLSGSPVLVTLSFINNWQNLICRRLEKQTVNEKVITLSSTAKEISSLGL FT SFAEKLISQSDESTLCKHYVYASLYACLFCNLNEGSPKDYLDNDIYVHARCLFLLTKTL FT NLDSLKSILCSRVRLYYNIEIAYYFTDVLLKWFQPIIRYEFDNALIFYKASISLVSVLA FT PAAQKQFLSNYLNSIQGFSTETKEDLIFLVSSQIRQMPYQNATSLLSFWLSIVVGRAV" FT misc_feature complement(4778..4790) FT /note="ctaaatgatgcag, splice branch and acceptor" FT misc_feature complement(4815..4820) FT /note="gttagt, splice donor sequence" FT misc_feature complement(4958..4970) FT /note="ctaattgtctcag, splice branch and acceptor" FT misc_feature complement(5004..5009) FT /note="gtaagt, splice donor sequence" FT misc_feature complement(5494..5506) FT /note="ttaacaatccaag, splice branch and acceptor" FT misc_feature complement(5530..5535) FT /note="gtaagt, splice donor sequence" FT CDS join(5766..5909,6090..6474,6537..7048,7095..7517) FT /db_xref="GOA:Q9UT95" FT /db_xref="SPTREMBL:Q9UT95" FT /label=SPAC323.04 FT /note="SPAC323.04, len:487, 
SIMILARITY:Escherichia coli., FT MODF_ECOLI, putative molybdenum transport atp-binding FT protein modf, (490 aa), fasta scores: opt: 455, E():2e-22, FT (27.8% identity in 474 aa)" FT /gene="SPAC323.04" FT /product="putative metal transport atp-binding protein" FT /protein_id="CAB53407.1" FT /translation="MASFVKFANTTFLDARFPLFKNVSFELARKQNWAIIGNTGSGRTT FT FLRCIQGSFTPSPSTSFSYPFLKGKSDSPWQAIQLLDFKSSGQQRAAYYSERYHSFRDK FT EHDTTLEKWLLGAYRGNEKFASQHVQEAASMTQLSHLLPSSLINLSNGQSRRAMLASKL FT VQRPQLLLLDEPYAGLDVTSRSVLSSLLGEMSNHCSPKIVLSLRPQDKIPDFITHVLEL FT KNKKITYQGPKEQYIPMTSHSTNIPVKPQMKKSKPITIGKPLISMEHLNCVYWGRKVLS FT DINWTIREGERWALTGSNGSGKTTLLAYVVGDHPKLFASNIKFFGKSIGPGTGISIFDI FT QENIGHCSPEIHNHFPKQHTCFEALLSAWSTTFTIPKLTETRLAAISSILEEFELKDIK FT DKPLSSISVGMQRFILFCRAIVKQPRLVVLDEPFQGVDTKYVHMAHNYLNEKLSPSQAM FT VIISHYEDELPACVNRRAHIDNGKLVIHA" FT misc_feature 5853..6440 FT /note="Pfam match to entry PF00005 ABC_tran, ABC FT transporter, score 30.50, E-value 9.9e-08" FT misc_feature 5910..5915 FT /note="gtatgt, splice donor sequence" FT misc_feature 6076..6089 FT /note="ctaactcaatacag, splice branch and acceptor" FT misc_feature 6475..6480 FT /note="gtacgc, splice donor sequence" FT misc_feature 6518..6536 FT /note="ctaacagtattttttttag, splice branch and acceptor" FT misc_feature join(6875..7048,7095..7496) FT /note="Match to PF00005 ABC_tran, ABC transporter Score FT 79.06" FT misc_feature 6896..6919 FT /note="PS00017 ATP/GTP-binding site motif A (P-loop), FT [AG].{4}GK[ST], info count = 13.8" FT misc_feature 7049..7054 FT /note="gtaagt, splice donor sequence" FT misc_feature 7080..7094 FT /note="ctaacgactttttag, splice branch and acceptor" FT CDS join(complement(8398..8464),complement(8260..8353), FT complement(8159..8220),complement(7831..8117), FT complement(7598..7783)) FT /db_xref="GOA:Q9UT94" FT /db_xref="SPTREMBL:Q9UT94" FT /label=SPAC323.05c FT /note="SPAC323.05c, len:231, SIMILARITY:Saccharomyces FT cerevisiae, Q03920, putative methyltransferase., (221 aa), FT fasta scores: opt: 396, 
E():1.1e-19, (37.2% identity in 234 FT aa)" FT /gene="SPAC323.05c" FT /product="putative methyltransferase" FT /protein_id="CAB53408.1" FT /translation="MLSTPVTSQLRLKEFQDVYEPAEDTFALLDALEKDAKKLRQMAEM FT KNLLTAEIGCGSGCASSFLKSGILKNKPIVHFMSDISNSACRASKITALNNRELYKDDN FT GLFITVQTSFLDGIRLGNGVDILIFNPPYVPTEFEEIPSEAATIASAWAGGTDGMDVTS FT TLLNQLKDILSQDGVFYMVAVARNKLHSICEILQKDGFIVNETLKRKAGRETLSILRIY FT RIGNTIWDE" FT misc_feature complement(7784..7805) FT /note="ttaataactcatttatggatag, splice branch and acceptor" FT misc_feature complement(7825..7830) FT /note="gtaagg, splice donor sequence" FT misc_feature complement(8118..8128) FT /note="ctgatgtctag, splice branch and acceptor" FT misc_feature complement(8153..8158) FT /note="gtacgg, splice donor sequence" FT misc_feature complement(8221..8231) FT /note="ttaacctgcag, splice branch and acceptor" FT misc_feature complement(8254..8259) FT /note="gtaggt, splice donor sequence" FT misc_feature complement(8354..8369) FT /note="ctaattacacttttag, splice branch and acceptor" FT misc_feature complement(8392..8397) FT /note="gtatgt, splice donor sequence" FT CDS join(complement(10490..10524),complement(10340..10452), FT complement(10133..10296),complement(9906..10095), FT complement(9562..9866),complement(9333..9520), FT complement(9101..9291),complement(8858..9014), FT complement(8739..8814),complement(8608..8691)) FT /db_xref="GOA:Q9UT93" FT /db_xref="SPTREMBL:Q9UT93" FT /label=uba5 FT /note="SPAC323.06c, len:500, SIMILARITY:Arabidopsis FT thaliana, Q9ZV69, putative ubiquitin activating enzyme, FT (523 aa), fasta scores: opt: 985, E():0, (31.5% identity in FT 519 aa)" FT /gene="SPAC323.06c" FT /gene="uba5" FT /product="ubiquitin activating enzyme" FT /protein_id="CAB53409.2" FT /translation="MGTSAKMQKYDRQVRLWKAEGQNAIEKSHVCLLYANTVGCEALKN FT LILPGIGSFAVVDDTSVDFSMDGMNFFIQYDQEGKSRARCTASLLQQLNPNVEMEYLEM FT SPEALIDKNIEYFSKFSVVLSSNLKEKPLFRLEEYLRSHKIPLLHFNSVGFAGILRIST FT HEYTTTQSQPELPQDLRLKNPWPELINYVKSMDLDNMDSSSLSEIPYIVLIIHVLLKVS FT 
PAHAQNSQEADDCAMFRKIMEEYKGKCDSENIEEASSNSWKAFKEYKLPSNVYEVLHDT FT RCVKIQEDSESFWIMAHCLKMFYDETEFLPLSGLLPDMNCSTQQYVKLQVIYKEKSEND FT ILKFKKYVQQTLKRLNRSVEEITDLEIKHFSRNCLNIKVMDFKTMKEEYQPTSNSAFRI FT YDTILEKHGKNYKEAFSDTTKTISVAQSFLSQIGLEKFFDVVYTAIQELERADGHELHS FT ISSFIGGIVAQETIKLLAQQYLPLNNTFVFDGVHSRTETFKL" FT misc_feature complement(8692..8706) FT /note="ctaactttactttag, splice branch and acceptor" FT misc_feature complement(8733..8738) FT /note="gtatga, splice donor sequence" FT misc_feature complement(8815..8827) FT /note="ctaatagtttcag, splice branch and acceptor" FT misc_feature complement(8852..8857) FT /note="gtacgc, splice donor sequence" FT misc_feature complement(9015..9047) FT /note="ttgactcaaattctttacttccgtggtatttag, splice branch and FT acceptor" FT misc_feature complement(9095..9100) FT /note="gtacgt, splice donor sequence" FT misc_feature complement(9292..9303) FT /note="ctaatatttcag, splice branch and acceptor" FT misc_feature complement(9327..9332) FT /note="gtacgt, splice donor sequence" FT misc_feature complement(9521..9534) FT /note="ttaatatgaaatag, splice branch and acceptor" FT misc_feature complement(9556..9561) FT /note="gtattt, splice donor sequence" FT misc_feature complement(9867..9880) FT /note="ctaactaaagttag, splice branch and acceptor" FT misc_feature complement(9900..9905) FT /note="gtaagt, splice donor sequence" FT misc_feature complement(9925..10083) FT /note="Match to PF00899 ThiF_family, ThiF family Score FT 22.48" FT misc_feature complement(10096..10108) FT /note="ctaacgtttgtag, splice branch and acceptor" FT misc_feature complement(10127..10132) FT /note="gtatgg, splice donor sequence" FT misc_feature complement(10297..10313) FT /note="ctaaatagacgtaatag, splice branch and acceptor" FT misc_feature complement(10334..10339) FT /note="gtattg, splice donor sequence" FT misc_feature complement(10453..10462) FT /note="ttaatcatag, splice branch and acceptor" FT misc_feature complement(10484..10489) FT /note="gtatga, splice donor sequence" FT CDS 
join(complement(12196..12857),complement(11194..12133)) FT /db_xref="SPTREMBL:Q9UT92" FT /label=SPAC323.07c FT /note="SPAC323.07c, len:533, SIMILARITY:Saccharomyces FT cerevisiae, YD38_YEAST, hypothetical 77.8 kd protein in FT mrps28-hxt7 intergenic region., (695 aa), fasta scores: FT opt: 1082, E():0, (34.4% identity in 532 aa)" FT /gene="SPAC323.07c" FT /product="conserved hypothetical protein; UPF0013" FT /protein_id="CAB53410.1" FT /translation="MKRFFSKLFSKSPTSGRVPSPDSDYSEEEQRLLAEENGYFQDSNE FT YVEPNIPAVYGSMIPVAQQLQQHHVHTPGESFADNASGYPVIKHELSELLRLGSPTVIA FT YLLQSSEQFSTVFTLGHLGKEYLAASSLSTMTAAISAFSIFQGVISSLDTLATQAFGAN FT KPYNVAIYLQRCLLILAVLHIPVALIWLNLEHILIFLHQDPMVAHLCGRYMRVFILAAP FT GYAVFEALKRYLQAQGIFTPITYVLCFAAPLNILLNYLLVWHPTIGFGFLGAPVAVATT FT FWFQSICLILYICFSSTPIPWPGFSRQALKNLSPMLHFSFHGMLMIVTEWAAYEMTSLG FT AGYLGTAPLASQSILLTSTSLLFQIPFAFAVASSTRVGHLIGSGRANLARLCSRVAYSL FT ALCISIFDGSLIFCFRDVWGSLFTSDPEVLAVVKDIFPILSLFIVTDGLNAVGGGLLRG FT TGKQYIGGLISIGSSYLFALPVTVFVVVYFNTGLKGIWCGMILSSVTAITCQFTVLFNT FT DWHRVLQEARHRLTHV" FT misc_feature complement(11358..12185) FT /note="Pfam match to entry PF01554 UPF0013, Uncharacterised FT protein family, score 205.50, E-value 8.3e-58" FT misc_feature complement(12134..12151) FT /note="ctaactgtattcttttag, splice branch and acceptor" FT misc_feature complement(12190..12195) FT /note="gtaagt, splice donor sequence" FT CDS 14069..14704 FT /db_xref="SPTREMBL:Q9UT91" FT /label=SPAC323.08 FT /note="SPAC323.08, len:211" FT /gene="SPAC323.08" FT /product="hypothetical protein" FT /protein_id="CAB53411.1" FT /translation="MQELQYDVVLLQKIVYRNRNQHRLSVWWRHVRMLLRRLKQSLDGN FT EKAKIAILEQLPKSYFYFTNLIAHGQYPALGLVLLGILARVWFVMGGIEYEAKIQSEIV FT FSQKEQKKLELQSQDDIDTGTVVARDELLATEPISLSINPASTSYEKLTVSSPNSFLKN FT QDESLFLSSSPITVSQGTKRKSKNSNSTVKKKKKRARKGRDEIDDIFG" FT CDS 15347..15458 FT /db_xref="GOA:O14084" FT /db_xref="SWISS-PROT:O14084" FT /label=SPAC323.09 FT /note="SPAC323.09, len:37" FT /partial FT /gene="SPAC323.09" FT /gene="SPAC2F3.01" FT /product="hypothetical 
protein" FT /protein_id="CAB53412.1" FT /translation="MGRKCRKLLLKGIPICGVILLILWGYSLYNTLRFMVP" FT misc_feature 15409..15458 FT /note=" nominal overlap with cosmid SPAC2F3, EM:Z99165 S. FT pombe chromosome 1" XX SQ Sequence 15458 BP; 5127 A; 2790 C; 2793 G; 4748 T; 0 other; gatctgttag acagctagtt cccagactag gtcagttgtt gggaattggt aggatccact 60 aaacgcgctt ctaaactatt cccacacgaa gtggtcacgt gggtttccaa catttcgata 120 aatgtaagta taaatacaca aatttattta aaatattttt tggaaagtta taatggtata 180 cttgctgtat atgttttttc ttccaatcta ttgaccccgc gtcttttcac gccggttgct 240 gatatgaatt tagactaacg ataattgata agcctcagta taaatgatta acggatagta 300 gcaataaaca acgaggtgga gttcaaacat ggagaatcca tgctaaagac attcagctta 360 tcctctcgct agcaaaggca gaatcgagat tccttattac gaaattatta tcatatagat 420 attattatat taaagaattt tagtttataa cggttaattt atttatttac aaaatatctc 480 tagtgtaaag gaaagcaaac cttttaaaag tctataggca gtctaaagat tccttgcgtt 540 cctcaaatta gtaccaaata ctcgcctttt tgcccttctt accaaccttt tctatggaaa 600 ggatgattcc agcgtaacaa actaacaata tcactcaccc agtcgtcttc tttgtgcgat 660 cgaatgatgc aaggaattgc attgtctttt acactagtga tgtcgattcg ctgtcctatg 720 tctgtaagac caagaggtcg tccatctatc gaaagttgcg gtcgtactct tgatttgttt 780 gatgtttcaa tactgatttt aaaagtatcg ggaaacaaaa ccggtcggaa tgaaagcgag 840 ttggggcaaa tgggtgtgag caaaagagca ttgattgaag gatggacgat aggcccgccg 900 gaggaaagtg aatatgctgt agagccagta ggtgtagaaa tgataaggcc atctgcaacg 960 gcctctgtca aaaacttgtc atttacaaac acttttagca ctgccatgtg aggacttaaa 1020 cctcgatgga tatgcatttc attcatagcg taaattgatt cgttatacaa tttggttttc 1080 attgctacac gtagtctcat ccgcataagc acgaacgatc gtgagttgta aaaatcggcg 1140 aatgcagttt gaaacgatcc aaagtcaaat ggtagtaaaa atcctaacgt tcccaaagaa 1200 aatgagagta taggaggcat tcctgatctt gcgaataaag atgcagcatg caaaatagtt 1260 ccatcacccc ctacagttat aattgcatcc accttctgct ccaaatccga gatttctgtc 1320 cacgtgtaca gatttagata agaaaatttt tttgcgacgt ctgtttctgt aataatgcaa 1380 atatctggat aagtttgttg tagatgctgt accaaggtct cgaagcagtg atcaacacgt 1440 tcgtccattc ttttctttaa aatcaagatg 
tttttgggcg gcttcggcca ttgtagctgt 1500 ttcaaattag atttcccacc gattgattgt ggggatgcac attctattac ataagagtca 1560 gctttgcctg gatgaatttt tgttcacact acctttatac actggaacaa tgcgattctc 1620 taaggtgttt acaaggcgta ttgactttga atagaccgag ttataacgaa aatattgtaa 1680 agaacctaaa ttaataatcg aaaatccttt caattggcga aaattaggag ctaaacagac 1740 tgctgtattt ctaacactaa ttctaaagcc atttgccgct cttatcatta gattgatcca 1800 attttaataa aaatggtttt tcgaaaagtc tttatagaag taaagtggga tccggacctt 1860 caaattcaaa ggcaaagata tataatgaag caacacgtaa aaaacgctag ttagtcagaa 1920 atcctataca aatggtaaaa ttactgtaat aaaaaatttt gcctgtaaat cttttaaacg 1980 atcagaattt tgattttatt tactgtatat attttttttt attggaaatt ggatatttta 2040 tgacttccca cgatacaaac catatgaacc gataaatcga taaaatttga cttaaacctg 2100 gagactagta caaataaaag ctgtttatta gcaaaaggtt catttaaaga aaagatacgt 2160 tacatatact gatattacat atatggttaa tcacaattaa aatacaataa acaatcaacc 2220 gaacaaacac aaaaacggtt aaaagatgaa aggaatacat ttctaatcat agatactagg 2280 aaaaggataa aggtgatggc gaaacaagtc tgtagagtta tgtaacattc gactaatcat 2340 gctttatttg gtaatattgg cggaatttaa atcatcatag gttaatccat tctttgctgt 2400 tcacgtgcta cagcatcggc catttcctcg tcattataga tgtgaaatcc accctcagcg 2460 gtaactttgg caagttgaac atttttggaa tcaagctttt cttccattac ttgtcgaagt 2520 actttgagaa tcaatacttc agcttcctcg agagtcatat cctgaacgac attgttagtt 2580 taacaaaaaa ggaaaaagat aaagcaaata aagttaatac gtactttgtg aaattccttc 2640 actagttcgc ttttcgcggg ttcactacct gaaccaatag ctttagcttc ataacggaaa 2700 tatgtacctg aaggctcaga gtggtatctg tataggaagt tagcatctta acgaattaca 2760 acaaaaggaa tccaaacaat ggcaggtgta cgaagataat caaaaaacaa aagcattaac 2820 acaatagcca catcgttcca taaaaataaa ataattttga cttacagttg aggaccatgc 2880 tcatctattc cagcaattaa caaagcaaca ccaaacggtc ttgacatgat gcgttcttcg 2940 ccatcttctc cttcaccaaa gcgaagtgcg agatcacaaa ttgactgggt agtactttct 3000 ataccttgtg gttcatcata ggtaaaacga tgattttgtg tttgaactct agcatgttca 3060 ataattgtac gggcatcagc tgttaggcct gaaattgcgc aaccaatgtg tgaatcaatt 3120 tcaaacagct tttcaacaga atgagattcc ataagtggac 
tagttaatct cttttcaaca 3180 ccaaggacga cggcatcctt agttttcaca ccaatagcag tggatcctaa tttgatggct 3240 tcgattgcat attccacttg aaacaaccgg ccttcaggag aaaaggtatt tactcctcta 3300 tcatattcta aagattagct taagaaactt gtacagttta cttacctgat ctagtcataa 3360 acatttttaa aagagaacgt gtaaagtagt agttgctgac tgaccaaaac ccagccaaat 3420 gataatatcg ttaatattcc ctataatacg ttgcctaata tatcgtaaac cagattattc 3480 gttacactcc gttattagtg aatataaaaa ttaaataaaa atatgtacag aggagaaagg 3540 taaacaaatg tttataatag gtatattgaa aataataatt cgaatttgaa gataaaaaaa 3600 aaacgaaact gactaatata cacgtttcat gttacaataa ttgatattgt agtgtaacca 3660 aaaacaatcc aatagctcaa tagctttgga aaagaacgga aaatattatc cttcatctaa 3720 attatagagt ttttaaactg cacgacctac gacaatagat agccaaaagg atagcaagga 3780 agtagcgttt tgatagggca tttgacgaat ttgggaagat acaaggaaaa ttaagtcttc 3840 tttggtttca gttgaaaatc cttgaattga attcaaataa ttggataaga attgcttttg 3900 agcagcagga gcaagaacac ttactaaact gattgaagct ttataaaaaa taagggcgtt 3960 gtcaaattca tatcgtataa ttggctgaaa ccatttcaat aaaacatctg tgaaataata 4020 agctatttcg atgttatagt ataaccttac tctcgagcaa aggatgctct tcagagagtc 4080 gagattcaac gttttggtaa gcaagaagag acacctagca tgaacataaa tgtcattgtc 4140 tagatagtct tttggagagc cctcatttag attacaaaaa agacacgcat acagcgacgc 4200 gtaaacataa tgtttgcata atgtagattc gtcagattga gatataagtt tttcagcgaa 4260 actaagacct aatgatgata tttctttagc agtggagctt aacgttataa ctttctcgtt 4320 gactgtttgc ttttctaaac gtcgacatat gagattttgc caattattga taaaactcaa 4380 agttaccaaa actggagagc ctgatagctt ttgtaagtct acatatccct cgagttgaat 4440 tgaatctata atggcttccc gcacagtttc cgagataaac ttggctggtt gagcaacacg 4500 actcaagcat gttgttatta caaaatccaa tgtagaccat gatgatgcaa caaaagagaa 4560 tagatactcc atatgaaaca aagtcaaaat aatatactca cataaagatc ggtctgcagc 4620 caatacattg cttttacggt actgagattt tgccaatgat tctaaaacca taattattga 4680 gaaaaaattt tttttaacaa actgatggtc attatattta gaatcagagg aacgactttt 4740 ccaagaggga atcgccttgt caattagagt tattaacctg catcatttag tcttccagaa 4800 cagtgaagtc aaacactaac cttttaataa tgctgatttg ctgataagtt 
gataagcgta 4860 ataacagatg gcacaaaaca gctgtaggca aggaatcccg cttgtgttca gtatccaaat 4920 aattgatgaa accgctccaa acaatttctt cccatttctg agacaattag tataaagttt 4980 tttggagttc tatatttact gctacttacc aataaatgag ggagtaacga aaatgcatca 5040 aaagaaatat cgctcgctga tgaatcgaca gatgcaaaaa gttgttgcag taaaaaattg 5100 gcttcaaatc ctgaaaattg atgcgttttt atctgttttg ataaagcttt gctaactgct 5160 cttttcaagg ggtattgtaa gttgacgata tgatcgcaac tcaatagttt gtaaaaaacg 5220 caaaggagac aaattcttcg aaagcaaaga atttcattgc ttgcatctac ctcaagaatg 5280 cctacaatgc ttttcacgaa ggatgaaaat ggcaaaccgt tattcgttac ctttttcctg 5340 cttatcaatt caacccaaaa tgcaaagtat tcagaacgat gctcattcca atagtcccat 5400 attctaccgc aagcaatcaa ttgtgctacc accagtcgaa ggttctgttc attatgtaaa 5460 ttgagtaaga catagtgtgt ctcccatagg acacttggat tgttaacaaa atggattaaa 5520 aaccaccgga cttacgtttc taaagatatc gagtcctctc gtaacagaag aagcacgttg 5580 ttaatggctt cttccataat taaaacgcag tgttctggaa gccttattat ctttcgtacg 5640 atcactacca cagaatcaaa ctgcaactta taaattgact tatacggagt tttctcattt 5700 cattaattac tttaacaatc accatataag ctgctgagca tatagcggcg gccatatctt 5760 gcagcatggc gtcttttgtc aaatttgcca atacgacatt cttagatgcg cgatttcccc 5820 ttttcaagaa tgtgtctttt gaactagctc gaaagcaaaa ttgggccatt attggaaaca 5880 ctggaagtgg taggactaca ttcttgcgcg tatgttttgt aatttatgcg aagtgtttgg 5940 agtaaaaatg cgctaaagct ttagcaattc atcaaattta ctaaatcagc tttgtacaat 6000 tcatttttgt cccttagtcg gaaaccaaac actttttatt atcattaatt acatggtagt 6060 aactttacta aattgctaac tcaatacagt gtatacaagg ttcctttaca ccctcaccct 6120 ccactagttt ttcatatcca ttcttgaaag gcaaatctga tagcccatgg caagcaatcc 6180 aattgcttga ttttaaaagt tctggacagc agcgtgccgc atactactct gaaagatatc 6240 attcctttag agacaaagaa catgacacta cattggagaa gtggcttttg ggcgcttatc 6300 gagggaatga gaagtttgct tctcaacatg tacaagaggc cgcatctatg actcagttat 6360 ctcatttact tccttcgagt ttaataaatt tgagcaatgg acaatctaga agagcgatgc 6420 ttgcatcgaa acttgtgcaa aggccgcagc ttcttttact tgatgaacct tacggtacgc 6480 tggccactaa ggggtgcaat ttatgttaat acaattacta acagtatttt ttttagctgg 
6540 acttgatgtt acttcaagat ctgttttgtc ttcactttta ggtgaaatgt caaaccattg 6600 ttcgccaaag attgtacttt ccttacgtcc tcaggacaaa attcccgact ttatcacaca 6660 tgtattggaa ttgaaaaata aaaaaatcac atatcaaggg cccaaggaac agtatattcc 6720 tatgaccagt cactcaacta acatacctgt caagccccag atgaaaaaaa gcaagcctat 6780 tacaataggc aagcctttaa tttccatgga acatctcaac tgtgtttact ggggacgaaa 6840 agtactctct gacataaatt ggactattag agaaggagaa agatgggctt taacaggtag 6900 caatggaagc ggtaaaacta cacttttagc atatgtagtt ggagaccacc ccaagttatt 6960 tgctagcaat attaaatttt ttggaaaaag cattgggccc ggcactggaa tttctatatt 7020 cgatatccaa gaaaacatag ggcattgtgt aagtctataa aagtaaatag tctttaaaac 7080 taacgacttt ttagtctcca gagatacaca atcatttccc gaagcaacat acatgctttg 7140 aagctcttct tagcgcctgg agcaccacct ttaccatccc aaagttaacc gaaactcggt 7200 tggcagcaat tagtagcatc ttagaggaat ttgaactaaa ggatattaaa gataaaccat 7260 taagcagtat ctcggttggc atgcagagat ttattttgtt ttgtcgtgct attgtcaaac 7320 aacctcgctt agttgttttg gacgagcctt tccagggagt agataccaaa tatgtccata 7380 tggcgcataa ttacttaaat gaaaagctct ctccttccca agccatggtt ataatttctc 7440 attacgaaga tgaactccct gcctgcgtaa atagaagggc tcacattgat aatggaaaac 7500 ttgtcataca tgcttgatga tgaacatccc tattgaatgt gtataattct tcgactcctt 7560 tctaggattg ttttttgatt gaccccaaaa atttgaatta ttcatcccaa attgtattac 7620 caatccgata aattcttaga atacttagtg tttcacgccc agcttttctt tttaaagtct 7680 cattaacaat aaacccatcc ttttgcaata tttcacaaat agagtgtaac ttattgcgag 7740 caactgcaac catgtaaaat actccgtctt gtgaaagtat atcctatcca taaatgagtt 7800 attaatgtaa atataaaggt ctgaccttac ctttaattgg tttaacaaag tgctagtgac 7860 atccatacca tcggtaccac cagcccaagc cgaagcaata gtagcagctt cacttggtat 7920 ttcttcgaac tcggtaggaa catatggagg gttgaatatt aaaatatcaa ctccattccc 7980 gagtctaatt ccgtcaagga atgaagtttg tacggtaatg aacaacccat tatcgtcctt 8040 atacaactca cgattattca gagcagttat cttactagca cgacacgcag aatttgagat 8100 gtcggacata aaatgaacta gacatcagca tattatattt tgtggtttgc aaccgtacca 8160 attggcttgt ttttcaaaat cccacttttt aaaaatgagg atgcacagcc agatccgcag 8220 
ctgcaggtta atatcttcac gatttaacca aaaacctacc caatttccgc tgttaataag 8280 tttttcattt cagccatttg gcgcaatttt ttggcatcct tttcaagcgc atcgagcaaa 8340 gcaaacgtgt cctctaaaag tgtaattagc aagctttttt gaccttttat aacataccag 8400 ctggttcata aacatcttga aattctttta gacgcaactg ggatgtaact ggggtcgaaa 8460 gcattcgttc ttacgcgtac ctacttattt gaagaaggac gtcagactgt ggtgcaaatg 8520 atatgagaat gtttgatttg ggataggaac ttttggaata tgatgaattt ttttgaaatt 8580 tcatttaata ataaatactg attctaatta taatttaaac gtttctgtcc tgctatgaac 8640 accgtcaaag acgaatgtat tattcaaagg aagatattgt tgggctagta actaaagtaa 8700 agttagtcca taatagtatt tgtataaatt tttcatacct ttatagtctc ttgagctaca 8760 ataccaccta taaaagatga tattgaatga agttcatgac catcagcgcg ttctctgaaa 8820 ctattagtaa aaagtcagaa taagaaattt agcgtacaat tcttgtatag ctgtgtaaac 8880 aacgtcaaaa aatttctcta atccaatttg ggacaaaaat gattgagcaa cgctgattgt 8940 tttagttgta tcactgaacg cttctttgta attttttcca tgtttttcaa gtattgtatc 9000 atatatgcga aacgctaaat accacggaag taaagaattt gagtcaatcg aggatgattc 9060 gagaactacg gtcagacaaa agcgaatact gttaacgtac ctgagtttga tgtaggttga 9120 tactcctctt tcattgtttt gaaatccata acctttatat ttaaacaatt tcgagaaaaa 9180 tgcttgattt ctaaatcggt tatctcctcg actgatcggt ttaaacgttt cagcgtttgt 9240 tgaacgtatt ttttgaattt taaaatatca ttttcagatt tttctttgta gctgaaatat 9300 tagtaatggt aaaagttaaa caaaaaacgt acatgacctg tagctttacg tattgttgag 9360 tagaacaatt catgtcaggc aataatccag atagtggaag aaattctgtc tcatcataga 9420 acattttgag acaatgcgcc atgatccaaa agctttcaga atcttcctga attttaacgc 9480 agcgagtatc gtgtaaaact tcgtatacgt tgcttggcaa ctatttcata ttaatttatg 9540 cgttctaaaa aatacaaata ccttatattc cttaaaagcc ttccaagaat tcgatgaagc 9600 ctcttcgatg ttttctgaat cacattttcc cttgtattcc tccattattt ttcgaaacat 9660 agcacaatca tcagcttctt gtgaattttg cgcgtgtgct ggtgagacct ttaataagac 9720 atggatgatt aagactatat acggtatttc tgacagagaa gatgagtcca tattgtccaa 9780 atccattgat tttacataat taattagttc aggccacggg tttttcagac gcaaatcttg 9840 aggcaattct ggttgtgatt gagtagctaa ctttagttag atatccctct taaataggaa 9900 cttactggta 
tattcatgag tcgaaattcg aagtatgccg gcaaagccaa ctgaattgaa 9960 gtgtagcaga ggaatcttgt gcgatcgaag gtactcttct aagcgaaaaa gaggtttttc 10020 ctttaagttt gaggataaga ctacagaaaa ttttgaaaaa tactcaatgt ttttatctat 10080 taaagcttcg gggctctaca aacgttagag tatatcgtcc aaacagccat accatctcta 10140 aatattccat ttcgacgttt ggattaagtt gttgcaacaa agatgccgta cagcgagctc 10200 tggattttcc ttcttggtcg tactggataa aaaagttcat tccatccata ctgaaatcga 10260 cagaagtatc atcaacgaca gcaaaagagc caatccctat tacgtctatt tagtataatt 10320 tctttaaaat atccaatacc tggcaaaatt aaatttttta aagcttcaca ccctacagta 10380 tttgcataca aaagacacac atgcgacttc tcaattgcgt tctgtccttc agctttccaa 10440 agccgtactt gtctatgatt aatagtaaag gatttaatga gtttcatacc tatcgtattt 10500 ttgcattttc gcagaagttc ccatactagt tcaaaagtat gacaaaacag ctggaagtcg 10560 ttacgtgtaa attacaggta tgtatgtagg aaggaagaag gatgttaata agacagcttg 10620 aacaaaatct ctacgaaaat atatcaaatg aaataaaaat aaatttggct tgtaacaata 10680 aaaatggtaa ttaaagggaa ccgacgatat gactaggccg gctaagtttg atattctaaa 10740 attgtgaagc aaaatgaaac tacttccatt agggaaatat gggcttcttg aaaacaactg 10800 ttaaacttct gtacctaatc agcttttttt gagtatatgg caatagtata gagtcggcgt 10860 gatagtgact caaatactca tgaagaaaga caaaccaaca atcaaaaaag tagaaaagtc 10920 aaattttttt tggtaataag ataaatacta attaattgtt aagtctctaa attttttgca 10980 gaaaaaggaa tggtagaaac caattgacta catgtgtaaa ctgaatgaca ggctgacagg 11040 caaagataat caaacatata ttgtaagatt gagaggatac gtttttatcg cggatgtcta 11100 ttaaataaag taattattaa agcggaaaca aaagaagaac gagaattcgg cagtaaaata 11160 gaatatcaaa ggagtggccg aaacgaaaac ttattaaacg tgggtcaatc tatgacgagc 11220 ttcttgaaga actctatgcc aatcagtatt aaatagaaca gtgaactgac aagtaattgc 11280 ggtgacacta gacaaaatca ttccacacca aatccccttt agaccggtat tgaaatatac 11340 aacaacaaaa acggtgactg gtaaagcaaa caagtagctt gatccgatac taattaaacc 11400 tccaatatac tgtttaccag tgccacgaag aagtccacct cctacagcat ttaatccgtc 11460 agtgacaata aacaaggaca agatagggaa gatgtcctta acgactgcaa ggacttcggg 11520 atcactggta aacaagcttc cccaaacgtc tctaaaacag aaaattaaag acccatcaaa 
11580 aatggatatg cacaaagcta gcgaatatgc tactcgagag caaagacgag ctaagttggc 11640 tcttcctgaa ccaattaagt gacccacgcg tgttgacgag gcaacagcaa aagcgaaggg 11700 aatttgaaaa agcaaagaag tagaagtgag caaaattgac tgggatgcaa ggggtgcagt 11760 acccaaataa ccagcaccaa gactagtcat ttcataggca gcccactcag taacgatcat 11820 cagcattcca tggaaagaga aatgtaacat gggagataga tttttcaaag cctgcctaga 11880 gaaaccaggc cacgggatag gggtggagga gaaacaaata tacaatatta aacaaatgct 11940 ttgaaaccaa aaagtagtag caacggcgac tggagcaccg agaaagccaa atccaattgt 12000 gggatgccat acaagaaggt aatttaaaag aatattgaga ggagcagcaa agcataatac 12060 ataagtgata ggggtaaata tgccttgagc ttgcaagtaa cgcttcaatg cctcaaaaac 12120 ggcatatcca ggtctaaaag aatacagtta gtgaaaggag gagagagaac ggacaaattt 12180 taataaagta cttacgcagc caagataaac actctcatat aacgaccaca caaatgagca 12240 accataggat cttggtgaag aaagattaga atatgttcaa gattaagcca aatcaaagct 12300 acaggtatgt gtaaaacagc caaaataagc agacatcttt gtaggtagat ggctacattg 12360 tagggtttat tagcgccaaa ggcttgagtg gctaaggtat ctaaggagga gataacacct 12420 tggaaaatgg agaaagcact gattgcagca gtcatggtgg aaagggacga agctgccaag 12480 tattcctttc ccaagtgtcc caatgtaaag actgtagaaa attgttccga ggattgtaac 12540 aaataagcga tgacggtggg agatccaaga cgcaaaagtt ctgacaactc gtgctttata 12600 acagggtatc ccgatgcgtt atctgcgaag ctctcgccag gtgtatgaac atgatgctgc 12660 tgtaactgct gagctacagg aatcatagaa ccgtagacgg ctggaatatt aggctccaca 12720 tactcattgc tatcctggaa gtacccattc tcttctgcta gtaatctttg ttcctcttca 12780 gaataatcac tgtcgggaga aggaactctt ccagaggtcg gggatttgga gaagagtttg 12840 gagaaaaaac gcttcataat ttgagaatta accttgtctt cactattaat tttttaaaag 12900 caaaatattt caaattctct atattcgcct ttctaatatc ttttcttttt tgtgttatac 12960 ttaggatgaa gcttccaata attacaatta acaaagtctt taactatata aggggaagag 13020 aagtcaagtt tgatagtgtg tacaagacaa tggtaaacgg taagccgtgg aacggtgcga 13080 ggtttttaat cagctcaatc gtccccccta gtaccgactt gatatgcaag agataatcgg 13140 taccgtacat agattaaata caagggagaa ccaatcaaaa atcaaaagct ctcaactgag 13200 ccaatgatat atcattaaga taaaattaat ttttaaatcg 
cactaagcgg acaaatagcg 13260 tttagacttt ttatcagttt cgctattgaa aatttagtac cgacaatcag caaattttac 13320 aaatttcttt tcatggcaaa ctgctggctt aattgacaaa aaggaggata tttgacagga 13380 aaattaacgc agtttatgtc ttgaactgcg gaatcgatac gccgtgaaaa agaagccaga 13440 aacagccgac ggtagtggtg ctttttgaag gattcttaaa gtgaaaatta tctgctacac 13500 gagtatatta tggcggtatc gaatggcagg taccaaatgc tacaaaatct taaaacaggc 13560 atattaaaaa atgggaaaca aaaaaggaca caataaatca tgtaggattc taaaggatgc 13620 atcgaccagt agcggatttt attgctaagt attagaactc tttgtatttc ttttcattga 13680 tttgtacctt ttgtttttgt ttttgtctat gagctctcca tatgtttcta cttcgttccg 13740 cggaatgccc ggatcgactg ttacttcttc cactcccgta aaccaggtgt cgtttattat 13800 cttgatttac tcgctaatat attttcgtta agattactta tcgagagaaa ttgaaaatga 13860 taaaaatcgt taaagtcaaa tacgaacagt tcatttttaa cgtccaaatt ttgatttaac 13920 gagaacttga gcaacacttt atttcttttc atccatcaat gtatttaatg gatactgttc 13980 atacatacat actttcttcg ttcgtgttcg tgttcacatt gctcactcgt tgggtggttt 14040 gtacgaccta tttgtctagt ccaacgatat gcaggaattg caatacgatg tagttttatt 14100 gcaaaaaatc gtgtatagga atagaaatca gcatcgacta agtgtttggt ggagacacgt 14160 acgaatgctg cttcgaagac taaagcagtc gctagatgga aatgaaaaag cgaaaattgc 14220 tattttagaa caattgccga aatcgtactt ttattttaca aacttaattg cccatggtca 14280 gtatccagcc ttagggttag ttttgctggg tatcttagct cgcgtttggt ttgttatggg 14340 cggaatagag tacgaagcaa aaatacaatc ggaaatagtc tttagtcaaa aggagcaaaa 14400 aaaattggaa ttacagtctc aagatgacat agacactggg actgttgtag ctcgcgatga 14460 attgctagct acggaaccta tttcattgtc tataaatcct gcttctacta gttatgagaa 14520 actgactgta tcctctccta attcttttct caagaatcaa gatgaatctc tcttcttgtc 14580 ttcttctcct ataactgttt ctcaaggtac caaacgtaaa tccaaaaact caaattccac 14640 agtcaagaaa aaaaagaaac gtgcaagaaa gggacgagac gaaattgatg atatattcgg 14700 ataaataaac ctcaattagt ctttccattt tcaattgatt ttttcttttc tttacttttg 14760 ctttttgttt catctatttc tgacaccagt ctcgcaaaat ctctctattt ttattaagaa 14820 cattacgcta tgatacgaaa acgctgtcat tcatcataac accatttggg tctttgagtc 14880 tctactgcat ttcatccagc 
atcaaggcat cctgttgtat agaatcaaac atattttttt 14940 tctttgtttg tacatatctt ttatttatct gtgcatttga agtgctcgcg ggctttaatt 15000 tattcttttc gttacttgtc aagtttgaat atgcagcgat tctaagccat ccccataacg 15060 tatttattct tttcaaccta ttctgagctt ctaattctat taaaccttca atttattaat 15120 attgggtttt ctgttctact gaaacacttt tcaacactat cactgcatat attctggtta 15180 tttgtttttt atgcgcacat cgtgaacaaa ttaaatattc attttatttt tcaacggttt 15240 tcttattaga acaagtgaat tttgctcacc atatcacctt tcttgttttt tcgcttcttt 15300 tgttttatcg ttttgctgcc gaggtgaatt acatctgttt gcaatcatgg gaaggaaatg 15360 tcgcaagctt ctcttaaagg gaatacctat atgcggcgtt attttgttga tcctctgggg 15420 ctattccctt tataacacat tgagatttat ggtaccag 15458 //