ID SPBC1677 standard; DNA; FUN; 8044 BP. XX AC AL035581; XX SV AL035581.1 XX DT 01-MAR-1999 (Rel. 59, Created) DT 17-DEC-1999 (Rel. 62, Last updated, Version 2) XX DE S.pombe chromosome II cosmid c1677. XX KW 5s rRNA; C-terminal domain of Threonine dehydratase; KW Pyridoxal-phosphate dependant enzyme domain; threonine dehydratase; KW tRNA Asp anticodon GTC. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-8044 RA Saunders D., Harris D., Wood V., Rajandream M.A., Barrell B.G.; RT ; RL Submitted (28-FEB-1999) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk XX DR GOA; O94632; O94632. DR GOA; O94633; O94633. DR GOA; O94634; O94634. DR SPTREMBL; O94632; O94632. DR SPTREMBL; O94634; O94634. DR SWISS-PROT; O94633; DPM3_SCHPO. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC Cosmid c1677 is overlapped at the 5' end by c1604, EMBL entry SPBC1604, CC accession number AL034433, and at the 3' end by cosmid c26H8, CC EMBL entry SPBC26H8, AL031743. XX FH Key Location/Qualifiers FH FT source 1..8044 FT /chromosome="II" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /clone="cosmid c1677" FT /map="IIR" FT misc_feature complement(1..101) FT /note="nominal overlap with cosmid SPBC1604, EM:AL034433 FT S.pombe chromosome 2" FT CDS complement(1..698) FT /db_xref="GOA:O94632" FT /db_xref="SPTREMBL:O94632" FT /label=SPBC1677.01c FT /note="SPBC1677.01c, len:239, SIMILARITY:Synechocystis sp., FT P73759, hypothetical 38.0 kd protein., (337 aa), fasta FT scores: opt: 329, E():3.2e-15, (31.9% identity in 213 aa)" FT /partial FT /gene="SPBC1677.01c" FT /gene="SPBC1604.01" FT /product="hypothetical protein" FT /protein_id="CAB37620.1" FT /translation="MTEIENIGALEVLFSPESIEQSLKRCQLPSTLLYDEKGLRLFDEI FT TNLKEYYLYESELDILKKFSDSIANQLLSPDLPNTVIELGCGNMRKTKLLLDAFEKKGC FT DVHFYALDLNEAELQKGLQELRQTTNYQHVKVSGICGCFERLLQCLDRFRSEPNSRISM FT LYLGASIGNFDRKSAASFLRSFASRLNIHDNLLISFDHRNKAELVQLAYDDPYRITEKF FT EKNILASVNAV" FT misc_feature complement(2156..2276) FT /note="similar to HSDJ900E8 AL109623 Human DNA sequence FT from clone 900E8 on chromosome Xq25-27.1" FT LTR complement(2244..2323) FT /note="degraded Tf2-type LTR" FT CDS join(2729..2944,3010..3066) FT /db_xref="GOA:O94633" FT /db_xref="SWISS-PROT:O94633" FT /label=SPBC1677.02 FT /note="SPBC1677.02, len:90, SIMILARITY:Caenorhabditis FT elegans., Q9XVV5, f28d1.11 protein., (95 aa), fasta scores: FT opt: 138, E():0.0014, (37.5% identity in 80 aa)" FT /gene="SPBC1677.02" FT /product="hypothetical protein" FT /protein_id="CAB37621.1" FT /translation="MQRIHKVILYYVSLTILYRVTYLFDLEEPWSTLRPYTPYLFILAF FT GSYLGITLLYNVATTNDKPEAYVDLVKDIKEAQDALRSKGMTIED" FT misc_feature 2945..2950 FT /note="gtatgt, splice donor sequence" FT misc_feature 2997..3009 FT /note="ttaacaaaattag, splice branch and acceptor" FT CDS complement(3449..5251) FT /db_xref="GOA:O94634" FT /db_xref="SPTREMBL:O94634" FT /label=SPBC1677.03c FT /note="SPBC1677.03c, len:601, SIMILARITY:Arxula FT adeninivorans., THDH_ARXAD, threonine dehydratase FT precursor, (550 aa), fasta scores: opt: 1946, E():0, (54.9% FT identity in 543 aa)" FT /gene="SPBC1677.03c" FT /product="putative threonine dehydratase precursor" FT /protein_id="CAB37622.1" FT /translation="MTGTSFYTSVLRLGRLAQQGLKFQSVKHIRPSCFSSFGLQAKRWN FT STQQNDSSIDCLEPKLQGIIEDNISPSTAQKEISDIKFNIPKEMLLPDGTPDYLRLTLT FT SNVYEVIKETPLTKGVVISESTGVPVYLKREDLTPVFSFKIRGAHNKMASLDKQSLKNG FT VIACSAGNHAQGVAYSARTLGVKATIVMPQNTPEIKWRNVKRLGANVLLHGANFDIAKA FT ECARLAKEQNLEVIHPFDDPYVIAGQGTIGLEILHQIDLRKLDAIYCAVGGGGLIAGIA FT TYVKRIAPHVKVIGVETFDADALKKSLKDKKRVTLKEVGLFADGTAVKLVGEETFRLVS FT KNIDDVVLVDKDEICAAIKDVFLDTRSVVEPSGAMAVAGMKRYVAKHKPKNPNAAQVCI FT LSGANMDFDRLRFIAERADLGLNKEVFLSVTIPERPGSFEALHNIITPRSITEFSYRYD FT NDDYANIYTSFVVKDRATELPLILQQISEQNMVAEDISDNELAKTHARYLIGGKSSVSK FT ERLYRLDFPERPGALCKFLRSIKEVCSISLFHYRNCGGDIASVLAGLRVFDGQVEKLHS FT VLEEIGYNWVDETNNPVYLRYLRK" FT misc_feature complement(3458..3724) FT /note="Match to PF00585 Thr_dehydrat_C, C-terminal domain FT of Threonine dehydratase Score 75.97" FT misc_feature complement(3740..4009) FT /note="Match to PF00585 Thr_dehydrat_C, C-terminal domain FT of Threonine dehydratase Score 108.05" FT misc_feature complement(4043..4933) FT /note="Match to PF00291 S_T_dehydratase, FT Pyridoxal-phosphate dependent enzyme Score 335.96" FT misc_feature complement(4808..4849) FT /note="PS00165 Serine/threonine dehydratases FT pyridoxal-phosphate attachment site" FT misc_feature 6163..6292 FT /note="5s rrna, by similarity" FT tRNA 6812..6882 FT /note="tRNA Asp anticodon GTC, Cove score 70.49" FT misc_feature 7948..8044 FT /note="nominal overlap with cosmid SPBC26H8, EM:AL031743 FT S.pombe chromosome 2" XX SQ Sequence 8044 BP; 2741 A; 1437 C; 1363 G; 2503 T; 0 other; accgcattga cactagccaa aatattcttt tcaaactttt cagtaatacg ataaggatca 60 tcgtaagcta gttggactag ctcagccttg tttctatgat cgaaggagat taaaaggttg 120 tcatgaatat tcaaacgact ggcaaacgaa cgtaaaaatg atgctgcgga tttcctatca 180 aaattaccaa tcgaagcacc caagtacaac atgctaattc gactattggg ctcactacga 240 aacctgtcca aacattgtag caatctttca aagcaaccgc aaataccaga caccttaaca 300 tgctgataat tggtagtttg acgaagctcc tgcagtcctt tttgcaactc ggcttcatta 360 aggtcaaggg cgtaaaaatg cacatcacag cccttctttt caaacgcatc taaaagaagt 420 tttgttttgc gcatatttcc acaccctaat tctataaccg tgttaggaag atctggagac 480 agtaactggt tggcaatgga atcgctgaac ttcttcagaa tatcaagctc actttcatac 540 aggtagtatt cttttaaatt cgtaatctca tcaaacagtc gtaaaccttt ttcatcgtat 600 aataaagtgg aggggagttg acaccgtttg aggctctgct cgatggattc aggagagaag 660 agaacttcta atgcgccaat gttttctatt tctgtcattt tctattaaac agcgtctcaa 720 tgcttatgct tgctttttgt gttcccaaac gtcaagtgcc aatcgtatga attagtcacg 780 ttcaatgaaa cagaacttga acagtctctt cacttaatta tttaaaataa tggatagtag 840 tatgtgttta gagagttata taattctgta ttttggtgac ataaaacagg tgactaaaag 900 atgcgtcatg ttagtacaca aacatctttg tttacccttt ccaaaaatat cggaaagtaa 960 tgaatgctgc agtattaaac aaagaaacaa cattcatcgt cgtgagacga tctggacgag 1020 ctaacaagta agcacatccc taatgagatc agacatcttc gaaaacgtgg tcaaaggagt 1080 caatcgtgat ttgcaactcc tttgtcttgg ttttattgaa accaaataaa attaaagaat 1140 ccctacaact ggagaatcct acaacctagc cgggtaacag ggattctgcg gcaataatta 1200 ttgctatatc aaaggcacat cgcaaaacca actcaattga atggggttga gtaaaagata 1260 catcgcatac acttgaaaag aaacattaaa aaaacaaaaa caaaaaaaaa acaaatttgg 1320 ctcaatggaa aattagattt ctcaatattg ttacatatta gaggttggaa aggatgaatg 1380 agtctcatcg gcgcttgctc cactttcaac tatataatta acacccattg ctttgtataa 1440 tattataggt acttttcttt gcttagtcac ataattagga ccttgcgcat tgttacgaat 1500 attgttggga aattatcaac gtcttggtga tgttgagttc gttaaacaat gtacagtcaa 1560 agattataat gcctttttaa aatactatga tttcattcga tttctttaac tatttcaagt 1620 agttgtgtac cctttgattc atctaaccga attggtgtat gtacactgga gttttgtaaa 1680 cttttaggac agtgaataaa aaaggacaag tagaattgat atggattatg gtaaattgta 1740 gtaaattgga ttatataaac atgctcctac agtatctcca atattagttt gactgatatg 1800 agtgcatact gcagagcacc tattttcaca aacaattgtc ttatacgcta gtcatttgag 1860 ctcgcataag aaattaagta gatattctac agactaattt tgtattagaa tgatacgagt 1920 atttcacata atcttaagtt tacattgaat gggtaataat tgatttacat agtatatgtc 1980 tctacgtcta gcaattatta gctacgcatt gcaacgcatg caaatccaca gcggaacgga 2040 aatagaagaa actccattac attataagcg actcgttaga gtaaacacct agagagtttt 2100 ccgtgcaaag tagtggagtt aagtcactta gcgacttatt taattactga ctcggaacat 2160 gtaaagcttc cgaaacatat atatatatgt atatgtatat gtagatacat acatatatga 2220 agaattacag taagctcagt cattgccagc tacacaattg tcattcaaaa aaacgatacg 2280 tcgaattgcg gtgagtttct tcttgtaata acatataatc tatgactgca ataaaaatat 2340 acttccatta ttattacaat tactagaaac tcgccgcaat tgaacatact aacgtctcag 2400 cagtaatcgt agctggactt aacgtgctag gttttcgaaa gcagttgccg atttgcaaga 2460 agctaggtca gtggcggatc agtactgtgg gggtctcaaa agtctcaaga cgacttagct 2520 tcggctatac ctattaagaa tctaaacttt taatgaatac cgttgcactg tagtaaatta 2580 agaatagagc attacacaga gctaactctc aattctcgct tacaatttcg tatatacaaa 2640 agcttactag cactaacgta ttcaatccgt tcaccagttt cttcatcctt gttttttcta 2700 tttcttattt gttgaaaacc taatcaagat gcaaaggatt cacaaagtta tcctttatta 2760 tgtttcactg accatattgt atcgagtaac ttacctattt gatttggaag agccttggtc 2820 gactcttcgc ccttatactc cctacttgtt cattttggct ttcggcagct atttgggaat 2880 cactttgtta tacaatgtcg ctactactaa tgacaaacca gaagcttatg ttgatttggt 2940 aaaggtatgt atacttgcat gatgatgttc cggaaaagtt ggccaaagaa cgtttattaa 3000 caaaattagg atataaaaga agcacaggat gctttaaggt ccaagggtat gactattgaa 3060 gactaaagga gcagaataat agaaaatatt attaatatga atctaaattt tcccaataaa 3120 gtccacaatg tatttattaa tgaattgaga ataaagatac attttttcta gaaatggtaa 3180 tataaaacaa agatgttcta atcccaatgg aacaaaaaaa gtaaatagta ggatatatta 3240 caatgttgat gatacattct tagccttact aaaggctaaa cgcatttgat caaaagaaaa 3300 aaaaaggaaa gcagggaatg taaaaaaatc cagactacct aacaaaattt tgaggtttcg 3360 aataacaaaa ttttgagatt tcaaatagca aaatttttaa cattcgtctc gctgaaagcg 3420 tctataggat tcaaagatgg cgagccgtct atttacgaag atagcgcaag taaacgggat 3480 tatttgtttc gtccacccag ttgtatccaa tctcttccaa aactgaatga agtttttcca 3540 cttggccatc aaaaactcta aggccagcaa gcacactagc tatatctcca ccacaattac 3600 gataatggaa aagggaaatg ctgcaaactt cctttatact cctcaaaaac ttacataaag 3660 ctccagggcg ttcagggaaa tccaatcggt acaaacgctc ttttgaaaca gatgattttc 3720 ctccaataag ataacgggca tgagttttag caagttcatt atcgctgata tcttctgcaa 3780 ccatattttg ctcagagatt tgttgaagaa tcaaaggcaa ttcagttgca cggtccttta 3840 ccacaaacga tgtgtaaatg ttagcatagt catcattatc gtaacgataa gaaaattcgg 3900 taatactacg tggagtaata atgttgtgta gggcttcaaa tgaaccaggg cgctcaggaa 3960 tagtgacact caagaatact tccttgttca aaccaagatc agcacgctca gcaataaatc 4020 taaggcgatc aaagtccata ttggcaccac ttaagatgca aacctgagca gcattgggat 4080 ttttaggctt gtgtttagcg acataacgct tcataccagc aacagccata gctcctgatg 4140 gttcgaccac tgaacgggta tccaaaaaaa catccttaat ggctgcacaa atctcatctt 4200 tgtcaacaag aactacatcg tcaatattct tggagacaag acggaaggtt tcctctccaa 4260 caagtttcac agcagttcca tcagcgaata agccaacttc cttaagggtt acccgctttt 4320 tgtccttcaa agactttttt aaagcatcag cgtcaaatgt ctcgacacca atgaccttaa 4380 catggggagc aatacgctta acgtaagtag ctattccagc aattaaacca ccaccgccaa 4440 cagcgcagta aatagcatcc agcttgcgaa gatctatttg atgaagaatt tcaagtccaa 4500 tggttccttg tccagcaatt acataaggat cgtcaaaggg atgaataact tcgagatttt 4560 gctctttagc caaacgtgca cattctgctt tagcaatgtc aaaattagct ccatgtaaga 4620 gaacattagc gcccaatctc ttaacgttcc tccatttgat ttcaggagta ttctgaggca 4680 taacaatggt agcttttaca ccaagagtcc tagcggagta agcaacaccc tgggcgtgat 4740 tgccagcgga acaagcaatg actccatttt tcaatgactg cttatcaaga gaagccattt 4800 tattatgagc ccctcgaatt ttaaatgaaa acacaggagt gagatcttca cgttttaagt 4860 agactggaac accggtactt tcagaaatga caacaccctt tgtaagagga gtctccttga 4920 taacttcata cacgttagac gtgagagtca aacgtaaata atcaggagtt ccatctggaa 4980 gaagcatttc ctttggaata ttaaacttga tgtctgatat ttctttttgt gccgtcgagg 5040 gagaaatatt gtcttcaata attccttgca gcttaggttc taaacaatca atagaactat 5100 cattttgttg agtagagttc caacgtttag cttgtaatcc aaaagatgaa aaacatgatg 5160 gacgaatatg ttttacagat tggaatttta ggccctgttg agccaatcgt cccaatctga 5220 gtaccgaagt gtaaaaactc gttccagtca ttcacagtga attgatccct tctggattaa 5280 gcaaaagcaa aatccttgtg gaattatatg agtttggtga aggtcacaaa atatatttga 5340 tgctaccatt ttcgacgggt gaataacgag ataacgaagc gaacaatatg gggttgttat 5400 acacgaaggc cgaatagtta tgtatctaaa tatcacagaa atctttaatt ttattttatt 5460 ttttaataag atacggattt ttttcagcat taaaatcaaa gcttaaggaa agagatacat 5520 tgaataagtc gaatgttgaa aaagctacat ggcaaaaagt ttttgtaatg aatataagat 5580 tgtttaataa tatattctat actttacgaa ttgtatgaaa tgcattgatt agatttcacc 5640 agtgataact gaaaaaaaaa aatagttgtc ccattacttc ttttttaatc tgagataaag 5700 aggtaaacat tagaatatta atacaacaaa cggaggcaaa gctgttaagt agatccagta 5760 tataaaatta tattacaaac acaaaaggaa tttagcaaat aaataattca aaaaattatt 5820 attatttttt catagtattt taacaagctg agagaatttc aaagctattt ttgaaacttt 5880 ttctattttt ttttttaatt ttaaatgata taaaacactt tagcaaaaca accaatatcc 5940 gaatattaaa ataattcaaa accaatttgg ctagcttttt taaaatattt cctaaattaa 6000 attcgtacag acaagatttc aataattgta catctcgtgc aaagtataag aatatggata 6060 ttaaaaattt tacagttcat tgacaaaaga gtttagtata taacattaga gtatctcttt 6120 tatattggaa taatatatat agtgcaatta taagaacaag tcgtctacgg ccatacctag 6180 gcgaaaacac cagttcccgt ccgatcactg cagttaagcg tctgagggcc tcgttagtac 6240 tatggttgga gacaacatgg gaatccgggg tgctgtaggc ttcttttttt taaattccaa 6300 aacaatatta ttaattctct actttttaat gcactcattg ggtttgtgct atttagctca 6360 acaacatttg tcattcttat tttaatttcg tactcaatcg cattagcccg aattctatta 6420 acatttttac actaaaagct attttacggt atatttagta aaggtaaaga aacacagcta 6480 tgaaagtgct gccaaaaatt tcaattgacg aaatcagcca tcaatggctg taactattga 6540 attccattgt ctttcaactt ttgaattatc gcagcctaca acaaaaaatg aaattaatca 6600 caatctcttc aatcttcatc gttacagata aatataatac atgcgagacg tgtagggaag 6660 aagaccaaag aaagtaattt tagagataaa ttcttacctt ctttcttcac tgtatccgaa 6720 gccttcaaaa actgactaaa ctatttaagt tgctcatgca acacaaagta cacgacaatt 6780 atattttgca aattagtaat cctcacaata atctccttta gtataggggt agtacacaag 6840 cctgtcacgc ttgcagcccg ggttcgaatc ccggagggag agtttttaac taaaatattt 6900 tttaaattac catatttaac aaaagaggat aataatgagg atatgctagt aaaattttac 6960 tatcgaacct taatgagtcg aagaagcaaa tacaataaac cttcttggaa ggttgtatac 7020 attgctgcaa aatatctcat atattgaata atgaatatgt ttcattaata acaataagtt 7080 atgtgttcat taagactccg agcttaaaag ctctcgatta tgatggcatg ctatgtcttt 7140 accaccatct aaactggctt aatcgaaaca taagaatgtt atatcgattg aagttgagtt 7200 ttagaaaatc aaggtatctt tttttcatca tcagctgacc tttcaaccct tcaatttata 7260 tttatttgaa gactatgcaa atgctttgta taatagcatc catccttagt tggtgagcca 7320 ttggatatga ttttatatat aagccaaagt tacattaata acctgttagt tgctataaca 7380 gtcaaaaagt cattggcttg gaaatcacga tgaactatct aacgaaacac ccaagtctgt 7440 cttcaatcaa aaagggagat gtattgatgt ctatccgtat cacctggctc tttttttact 7500 ttccaatgag attgtaatct ttaatgacgg gtagttactg tgatgaggct tttattttta 7560 atttgaagac gccatctgta attcatcgcc gccgatcagt tacttggatg cgaacaaaat 7620 cagcaaataa tagctgcata ttcagatatt tatcatgtat caattacact tacgaatgca 7680 acccaatact cgccatactt gcaagcttcc gagtccgaaa cacttttatc aaatttgaaa 7740 tatggcaatt ttatgttaga aaagtctcta aacttttgtg gtactattga tcttaagtct 7800 aacattaaaa aattaatttt tttgacgcta attgtcttta aaattcgatc tctgttcaat 7860 attaatgctt gtgtcatatc atcagactac actatttcta tgcaaagatt tatataaaat 7920 gaccagaaaa gcttaaaaaa tcagatccac taatttgcaa ccgaaactgg aaatatgtaa 7980 ccgaaatttg atactacgga gttatgaacc cggcagccaa aggcaagttt gtttgtagtc 8040 acaa 8044 //