ID SPCP25A2 standard; DNA; FUN; 7410 BP. XX AC AL133440; XX SV AL133440.1 XX DT 10-DEC-1999 (Rel. 62, Created) DT 10-DEC-1999 (Rel. 62, Last updated, Version 1) XX DE S.pombe chromosome III p1 p25A2. XX KW 3-mercaptopyruvate sulfurtransferase; DNA repair and recombination protein; KW Helicases conserved C-terminal domain; Rhodanese-like domain; rhp26; KW SNF2 and others N-terminal domain. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-7410 RA Brown S., Harris D., Lyne M.H., Rajandream M.A., Barrell B.G.; RT ; RL Submitted (03-DEC-1999) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk XX DR GOA; Q9UR24; Q9UR24. DR GOA; Q9URT3; Q9URT3. DR SPTREMBL; Q9UR24; Q9UR24. DR SPTREMBL; Q9URT2; Q9URT2. DR SPTREMBL; Q9URT3; Q9URT3. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC p1 p25A2 is overlapped at the 5' end by cosmid c4B3, EMBL entry CC SPCC4B3, accession number AL132870, and at the 3' end by cosmid c550, CC EMBL entry SPCC550, accession number AL023592. XX FH Key Location/Qualifiers FH FT source 1..7410 FT /chromosome="III" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /clone="p1 p25A2" FT /map="IIIR" FT misc_feature complement(1..108) FT /note="nominal overlap with cosmid SPCC4B3 AL132870 S.pombe FT chromosome III" FT CDS join(complement(364..642),complement(1..306)) FT /db_xref="GOA:Q9URT3" FT /db_xref="SPTREMBL:Q9URT3" FT /label=SPCP25A2.01c FT /note="SPCP25A2.01c, len:194, SIMILARITY:Rattus norvegicus, FT THTM_RAT, 3-mercaptopyruvate sulfurtransferase, (296 aa), FT fasta scores: opt: 468, E():3.1e-24, (40.4% identity in 171 FT aa)" FT /partial FT /gene="SPCP25A2.01c" FT /gene="SPCC4B3.01" FT /product="probable 3-mercaptopyruvate sulfurtransferase" FT /protein_id="CAB62826.1" FT /translation="MFSVGKKISFILPIKGVLQKLKDNAQKTVLLDATWYLPTDTKNGK FT KEYLESRLPGAQYFDIDEAKDHKNPLPHMLPPADEFASYVGKLGIDRNTNVIIYDRKGF FT FSSPRVFWTFKVFGHEHVFLFPNAFNAWKTEGLELETGEPRTPKPVVYEGAKLNKDLVA FT SFDDIVKVIESPDAAGVHIVDARAHERFLGNV" FT misc_feature join(complement(364..573),complement(181..306)) FT /note="Match to PF00581 Rhodanese, Rhodanese-like domain FT Score 34.95" FT misc_feature complement(307..318) FT /note="ctaatattttag, splice branch and acceptor" FT misc_feature complement(358..363) FT /note="gtaagt, splice donor sequence" FT CDS complement(1453..4374) FT /db_xref="GOA:Q9UR24" FT /db_xref="SPTREMBL:Q9UR24" FT /label=rhp26 FT /note="SPCP25A2.02c, len:973" FT /gene="rhp26" FT /gene="SPCP25A2.02c" FT /product="DNA repair and recombination protein Rhp26p" FT /protein_id="CAB62827.1" FT /translation="MSVNEDLSHLGVFSVDQENLERDVTNTASEYIAHESREIEKKRLQ FT KVRKEISSVKEKIRRLDERIDSRLTKISVKENFRKQLSKFRDTLQSLQSDENDIKRRLN FT NEDSANAPGIGAFSTEELERQELIRTGKVTPFRNLSGLQKEVDFDDESSIREAVIKSEG FT TYYETAPHLSSEPSNIDHGIIPRDEKDEYVTVDAVTEKVVTAAIDDGDDLVYRQRLNAW FT CANRKELRDQASASENNKDRGEFEGKDEWLLPHPSKKGQTFEGGFTIPGDIRPHLFRYQ FT VTCVQWLWELYCQEAGGIIGDEMGLGKTIQIVSFLSSLHHSGKFQKPALIVCPATLMKQ FT WVNEFHTWWAPLRVVVLHATGSGQRASREKRQYESDASESEAEESKTSIKLRGASSSFH FT RYAKNLVESVFTRGHILITTYAGLRIYGDLILPREWGYCVLDEGHKIRNPDSEISISCK FT QIRTVNRIILSGTPIQNNLTELWNLFDFVFPGRLGTLPVFQNQFALPINIGGYANASNV FT QVQTAYKCACMLRDLISPYLLRRMKLDVAADLPKKSEQVLFCKLTPLQRKAYQDFLQGS FT DMQKILNGKRQMLYGIDILRKICNHPDLVTREYLLHKEDYNYGDPEKSGKLKVIRALLT FT LWKKQGHRTLLFSQTRQMLDILEIGLKDLPDVHYCRMDGSTSIALRQDLVDNFNKNEYF FT DVFLLTTRVGGLGVNLTGADRVILFDPDWNPSTDAQARERAWRLGQKKDVVVYRLMTAG FT TIEEKIYHRQIFKQFLTNKILKDPKQRRFFKMTDLHDLFTLGDNKTEGTETGSMFLGSE FT RVLRKDNSSRNGNEAEDIPARDRKKHKIHDKGKKVNSSKVFEKMGIASMEKYKPPQESN FT VTKTNSDSTLGDDSVLDDIFASAGIQSTLKHDDIMEASQTESILVEKEATRVANEALRA FT VSSFRRPPRQLIPPQQSTNVPGTSKPSGPITSSTLLARLKQRR" FT misc_feature complement(2158..2412) FT /note="Match to PF00271 helicase_C, Helicases conserved FT C-terminal domain Score 102.23" FT misc_feature complement(2560..3537) FT /note="Match to PF00176 SNF2_N, SNF2 and others N-terminal FT domain Score 429.59" FT CDS join(4613..4807,4867..5757,5813..6985) FT /db_xref="SPTREMBL:Q9URT2" FT /label=SPCP25A2.03 FT /note="SPCP25A2.03, len:752, possibility of 3' splicing, FT SIMILARITY:Homo sapiens, Q15219, protein p84., (657 aa), FT fasta scores: opt: 388, E():4e-16, (23.8% identity in 505 FT aa)" FT /partial FT /gene="SPCP25A2.03" FT /product="hypothetical protein" FT /protein_id="CAB62828.1" FT /translation="MEVQKGLIEAFYNTYPLEKAKELDKSPLCSEYELFIKELWPSIVE FT SFHNSTEFETAIRFCCYETARKSEIGLEERLKCLFAILDLLVIGNEINESFCDHLLPFL FT ILEELMDIHTVNECAKLYEYFETRPSLMKGIVSNRGRGPVLLRISNELLRRLSRQENSS FT FCGRIDILLSKAFPPEERSGANLRGDYNTVHSFGKVELSPPSTPISDRTDLSYHKKLNT FT LFTAYWDLQCMCSNPPKLLASDTLPKFIDAAGSAIQAFESILQNTFFNGKSNPTIDPNS FT SSLLSEKYITLDKGFPSKYIYSRSLFEYQLSDEDFRLQAILQLIIIFDFLLDHSKERIE FT RRTLEKWTNKAVIPIVILSDEDTSKLNELSKEAYSFLHTARCGSVQRTIKEIIHIEGNW FT KLWKGLGCPSLEKPLVDKAAIDEAVEGLKKLTNTPVKLRFAMGNAALSRLWEQAGENTL FT DDLKKEERYRIPSPESFLSGVKADKFEIEEAVRDDDKHFHEQSLATKTWRAFRSAINSH FT LQNFSDTGLGDVELLCNSIEGKPTTSKITPSIPPAFDIHIIEGEELLEEMKKRENVKHN FT SQNFASPMQTDAEGDIVQNEEEKESVEVEEGKHKNDLPKVSPKPPTEGVDSEVNGESLV FT QVNKVLKSEDDNTSEASKDPSSHVKSPENIEKLKQNDDHFEVTEEITSTINSKISEKQE FT NNVAETILEVTSSPKSSENSQKQSEITKKRGRDEEDEPSDLHSSPKRPKTGEDGEIVL" FT misc_feature 4808..4813 FT /note="gtatgt, splice donor sequence" FT misc_feature 4846..4866 FT /note="ctaattgttattttcgtgtag, splice branch and acceptor" FT misc_feature 5758..5763 FT /note="gtatgt, splice donor sequence" FT misc_feature 5802..5812 FT /note="ctaatatgtag, splice branch and acceptor" FT misc_feature 7312..7410 FT /note="nominal overlap with cosmid SPCC550 AL023592 S.pombe FT chromosome III" XX SQ Sequence 7410 BP; 2378 A; 1415 C; 1328 G; 2289 T; 0 other; aacgttgcca agaaaccttt catgagcgcg agcatcaacg atgtgaacgc ctgcggcatc 60 tggactttca atcaccttta caatgtcatc aaaggaagca accagatctt tgttcaactt 120 agcaccttcg taaacaacgg gctttggagt tctaggttct ccagtttcca attcaagacc 180 ctcagttttc catgcgttaa acgcattagg aaataagaaa acatgctcat gaccaaagac 240 tttaaaggtc cagaaaaccc taggagacga aaaaaaccct tttcggtcat aaatgataac 300 attggtctaa aatattagta aaattgaaga ctaagtaatt ggaatttcta aactactact 360 tacgtttctg tcaattccta attttccaac gtaagaggcg aattcgtcag ctggaggaag 420 catatgagga aggggatttt tgtgatcttt agcctcatct atatcaaagt attgagcacc 480 tggaagacga ctttctaaat actccttctt tccgttttta gtgtctgttg gcaagtacca 540 agtagcatcc aagagaacgg ttttttgagc attatccttt aatttttgca atacgccctt 600 aataggcaaa ataaaagaga tttttttacc aacagagaac attgtttgaa gcagagtctt 660 attttgtttc agcgaactac atagacttcg aatttcaagt aatcttacga aagtacagcg 720 ttggttgcag ctcccaaaat taattaacga gtcaagtttt tggtaaaacc aaatatgtgt 780 gtaaaaatgg ttagatgcgg ttgcattgct tatatacaaa tattaattta tataaaatga 840 aaaagtacta gtaataagtt cgccaataag caccaagaca tcaaaatttt tgaaccacta 900 gttttctaat taaaaggtct tttctgacac tgtcaacaaa catgactcta gcattctaat 960 aacgttgaca aaagaacgta gtaatttata gctacaaaaa gggttgcacg ggttgcacat 1020 accactaact aagccgcttt gagtgttcat acttttctca gccaactaga aagttattaa 1080 cgctaatttt aggtctcagg taaagcgaat ttgtcaagtg cttgcttaca ataacattaa 1140 cacactttaa ccaagtactt taacaaaagg gaaaaatgat caaagtaatt ccgacagccc 1200 tatcctcaac atggaattga catttccaat acttttcaag caaagtaaac ccacctagta 1260 ataaggataa ttcccattca aaaaatgaaa tgatcgcgtg tctatcaata aatccataaa 1320 taaagcttaa atcatgaaaa acttaatgta caaaataaag ggaaataata ataaataaat 1380 aaatgggtac aagattgggt ccgaaatttt aatggataca gcttactcta aggcgaactt 1440 cttctaatta attcatcgtc tctgttttag tcgggcaaga agagtgcttg aagtaatcgg 1500 acctgatggt ttagaagtac caggaacatt agtagattgc tgcggaggta taagttgacg 1560 aggaggtcgc cgaaaagatg aaactgcacg cagagcttca ttagcaacac gcgtcgcttc 1620 tttttcaacc aatatgcttt cagtctggga ggcttccatg atatcgtcgt gctttaaagt 1680 actctgaata ccagcactag caaaaatatc atcaaggacg gaatcatccc ctaaagtcga 1740 atcagagttt gtcttagtaa catttgactc ttgcggtggt ttatactttt ccatcgatgc 1800 aatccccatt ttttcaaaca ctttggagct gttaactttt ttacctttgt cgtgaatttt 1860 gtgctttttt cggtcacgag ctggaatatc ttcagcttca ttgccatttc ttgaggaatt 1920 atcctttcga agtactcgtt cagatcccaa aaacatgctg cctgtctcag tgccctcagt 1980 cttgttatcg cctaacgtaa acaaatcgtg caagtccgtc attttaaaaa accttctttg 2040 ttttggatct ttcaaaattt tgttagtcag aaactgctta aagatttgac gatgataaat 2100 tttttcttca atagttccag cagtcatcaa ccgataaact actacatctt tcttttggcc 2160 cagtctccaa gcacgttcac gtgcttgagc atccgttgag ggattccaat caggatcaaa 2220 aagaattacc ctgtcagcac cagtcaaatt gactcctaat ccaccaacac gagtagttag 2280 aagaaacaca tcaaaatact catttttatt aaagttgtcc accaaatctt gtcttaaagc 2340 aatagatgtg ctaccatcca tgcgacagta atgaacatcc ggcaaatcct ttaatcctat 2400 ttccagtatg tcgagcattt ggcgagtctg agaaaataga agagttctat gtccttgttt 2460 tttccacaaa gttagtaatg ctctaataac cttcagtttg ccagactttt caggatcacc 2520 atagttgtaa tcttctttat gtaacaagta ctctcgagtt acaagatctg gatgattaca 2580 aatttttcgt agtatatcaa ttccataaag catttgacgc tttccgttca agatcttttg 2640 catatcagag ccttgcaaaa aatcttggta agctttcctt tgtaaaggag tcagcttaca 2700 gaacaatact tgttcggact tcttaggtaa atctgcagca acgtcaagct tcattcttct 2760 aagtaaataa ggagaaatta aatctcgaag catgcaggcg catttataag cggtttgcac 2820 ttgaacattt gaagcattgg cataaccacc aatattaata ggtaaggcaa attgattttg 2880 aaataccggt aatgtaccca atcttccagg aaatacaaaa tcaaataaat tccaaagctc 2940 ggtaaggttg ttctggatcg gagtccctga gaggataatc ctattcacag tacggatttg 3000 cttacacgaa atagatattt cagaatctgg gtttcgtatt ttgtgccctt catccaaaac 3060 acaataaccc cattcccgtg gtaaaatcaa atcaccatat atccttaaac cagcatatgt 3120 tgtaattaaa atgtggcctc tagtgaagac actttcaact aagttcttag cgtatctgtg 3180 gaatgaactt gaagcacctc ttaattttat ggaggttttg ctttcttccg cttcactttc 3240 agaagcgtca gattcatact gtctcttttc gcgtgaagca cgctgtccgc tgccagtagc 3300 atgaagaaca acaacacgta atggagccca ccatgtatga aactcattaa cccactgctt 3360 cattaaagtc gctggacaaa cgataagtgc aggcttttga aatttgccag agtgatgcaa 3420 agacgaaagg aaagatacta tttgaatagt ttttcctagt cccatttcat caccaatgat 3480 tcctccggcc tcttgacaat agagttccca taaccattgt acgcaagtga cttgataacg 3540 gaaaagatgt gggcgaatat cacctgggat ggtaaatcca ccctcaaaag tctgtccttt 3600 tttggaagga tgaggtagta accactcatc tttaccttca aactctcctc tatctttatt 3660 gttttcagat gctgaggcct gatctcgtaa ttccttacga ttcgcacacc aagcattaag 3720 cctttgtcga tataccaaat catctccatc atctatagcg gcagttacta ctttctccgt 3780 aacagcatcc acagttacgt actcatcttt ttcgtctctt ggaataatcc catgatcaat 3840 atttgacggc tccgaactca aatgaggtgc cgtttcatag tatgttcctt cagactttat 3900 aacagcctca cgaatgctag attcatcatc aaaatcaact tctttttgta gtccagagag 3960 atttcgaaac ggtgtaacct taccagtacg gatgagctct tgtctttcca attcttcagt 4020 cgaaaatgca cctataccag gggcatttgc cgaatcttcg ttattaagtc ttctcttaat 4080 atcattttcg tctgattgta aactttgaag agtgtctcgg aatttagata attgtttgcg 4140 gaagttctcc ttgacactta tttttgtaag tcgactgtct atacgctcat caaggcgacg 4200 tattttttct tttacagaag atatttcttt cctgactttt tgaagtctct tcttttcgat 4260 ctctcgactt tcatgtgcaa tgtactcact agcagtattc gtaacatcgc gttccaaatt 4320 ctcttggtca accgaaaaaa cccctaaatg ggatagatct tcattgacac tcatatccta 4380 tcaatttctt aaataaagtc tcattttcag actcactgca aatttaagaa aatactttgt 4440 tgtttggtta agggcaaaat tacatattcc ttcatgctgc tcctgttata gcgagaagca 4500 atgaaccaca caaagcaata acgaataaat ccttatttct aaatgacatt gtcaatgcta 4560 gataaactca cgtttctgcg cgcggattgt atattcgcag acttttaaag taatggaggt 4620 gcaaaagggt ttaattgaag ctttttataa tacatatcca ttagaaaagg caaaagaact 4680 ggataagagt ccactttgtt ctgaatatga attattcatc aaagaattat ggccttcgat 4740 tgttgagagt tttcataact caactgagtt tgaaacggca ataagatttt gctgttatga 4800 aacagcagta tgtgcttttt ccataatgaa gttttcctta agttactaat tgttattttc 4860 gtgtagcgca aaagtgagat tggattggag gagcgattaa aatgtttatt tgcgattttg 4920 gatctcttag taatagggaa cgaaattaac gaaagcttct gtgatcacct gttgcctttt 4980 ctaatcttgg aagagctgat ggacattcac accgtcaacg aatgcgccaa attatatgaa 5040 tattttgaaa ccagacctag cttgatgaaa ggtattgtca gcaatcgtgg aagaggaccc 5100 gttttgcttc ggatatcaaa tgaattactc cggcgtcttt ctcggcaaga aaattcatct 5160 ttttgtgggc ggatcgatat tctgctaagt aaggctttcc cccctgaaga acggagtgga 5220 gccaatttac gtggagatta taatactgtt cattcttttg ggaaggtaga actgtcacca 5280 ccatccacac ccatttcgga ccgaacagac ctctcatatc acaaaaaact taatacctta 5340 tttactgctt attgggattt gcagtgtatg tgctccaacc ctcctaaatt actagcaagt 5400 gataccttac ctaagtttat agatgcagct ggttctgcaa tacaggcatt cgaatctata 5460 ttacaaaata cgttcttcaa tgggaagtca aaccctacaa ttgatccaaa ctctagctca 5520 ttactttctg aaaaatatat aactttggat aaaggatttc cttctaaata tatatattct 5580 cggtcgcttt ttgaatacca actttcagat gaagatttcc ggttacaggc tattttacaa 5640 ttaattatta ttttcgactt tttgcttgac cattctaaag aacgaattga aaggagaact 5700 cttgaaaaat ggacaaataa ggcggtaatt cccattgtta ttttgtctga cgaagatgta 5760 tgttcttttt tagactcgaa aaaaactgat tttattgatt gctaatatgt agacctcaaa 5820 gttaaatgaa ttatccaaag aagcatatag tttcttacat actgcccggt gtggttctgt 5880 gcaaagaacc attaaagaaa ttatacatat tgagggtaat tggaaattat ggaaagggct 5940 tggctgccct tcccttgaaa aacccctagt agataaagcg gctattgatg aagctgtcga 6000 aggcttaaaa aagcttacaa acaccccagt gaaattaaga tttgctatgg gaaatgctgc 6060 tttgtcgagg ctctgggagc aggctggaga aaataccctt gatgatctca aaaaggagga 6120 aaggtaccgg attccatcac ctgagagctt tttatctggc gttaaagcag acaagtttga 6180 aatagaagag gcagttagag atgatgacaa gcatttccat gaacagtcct tggcaacgaa 6240 gacatggcgt gcctttcgtt cagcaattaa tagccatctt caaaactttt cggataccgg 6300 actaggtgat gttgaattgc tttgtaattc cattgagggc aaaccgacta catcgaaaat 6360 tacaccctct attcctccag cttttgatat tcatattata gaaggagaag agttattaga 6420 ggaaatgaaa aagagggaga atgtcaagca caattctcaa aactttgcaa gtcctatgca 6480 aacagatgct gaaggtgata ttgtccaaaa tgaagaggaa aaagaaagcg ttgaggttga 6540 agaaggaaaa cataaaaatg acttgccaaa ggtatcacct aaacccccaa ctgagggtgt 6600 ggacagtgaa gtaaacggtg aatctttggt tcaggttaat aaggttctca aatcagaaga 6660 tgataataca tctgaagctt caaaggaccc atcttcacat gtgaaatctc ccgaaaatat 6720 tgaaaaacta aaacaaaatg atgatcactt tgaggtcaca gaagaaatta cttccaccat 6780 aaattctaaa atatctgaaa agcaagagaa taatgttgct gaaactatat tggaagtaac 6840 atctagtcca aaatcttctg aaaatagtca aaagcagtca gagattacta aaaagcgagg 6900 acgggatgag gaagatgaac cttctgacct tcattcaagc cctaaaaggc caaaaactgg 6960 agaggatgga gaaattgtat tataatcatt aaatcaatta cacatatgtg ataagtgctt 7020 agttatgatt gagtcagaaa atgtatatat agttaatgtt ttgatctagc gtgtttttgt 7080 atgcgcttat ataaaataaa attacatcct tttagtagag ctactgtaaa gatgaatttc 7140 attgtttatt gatgggtaaa ttacgcttat ttactaaact gcacacattt tttgacagtt 7200 gaatacgttc acatgttcaa tatgaaaaca ttctaaagtc aaatttttta tttttcaaca 7260 aatgcaacaa taaaagcggg ttataagtac agaacttttc caatatcgat ggatcagatt 7320 tgaaacagac ttatacaatt tgacaacaac gttgaattgt aaaatatgtt tttgcgtatt 7380 taaactccat cctatcaatt caatactaag 7410 //