ID SPAPJ698 standard; DNA; FUN; 7743 BP. XX AC AL356333; XX SV AL356333.1 XX DT 17-MAY-2000 (Rel. 63, Created) DT 17-MAY-2000 (Rel. 63, Last updated, Version 1) XX DE S.pombe chromosome I PCR product pJ698. XX KW 40s ribosomal protein s0B; activator 1 36 kd subunit; cut20; prp12; KW Replication factor C 36 KD subunit; rfc3; rps0-2; rpsa-2; sap130. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-7743 RA Saunders D., Harris D., Lyne M.H., Rajandream M.A., Barrell B.G.; RT ; RL Submitted (17-MAY-2000) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk XX DR GOA; O14003; O14003. DR GOA; Q9P546; Q9P546. DR GOA; Q9UTT2; Q9UTT2. DR SPTREMBL; Q9P545; Q9P545. DR SPTREMBL; Q9UTT2; Q9UTT2. DR SWISS-PROT; O14003; RFC3_SCHPO. DR SWISS-PROT; Q9P546; RS0B_SCHPO. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC PCR product pJ698 is overlapped at the 5' end by cosmid c27E2, CC EMBL entry SPAC27E2, accession number Z98978, and at the 3' end CC by cosmid c19G12, EMBL entry SPAC19G12, accession number Z97209. XX FH Key Location/Qualifiers FH FT source 1..7743 FT /chromosome="I" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /map="IR" FT misc_feature 1..74 FT /note="nominal overlap with cosmid SPAC27E2 S. pombe FT chromosome 1" FT CDS join(complement(618..691),complement(388..452), FT complement(125..338),complement(1..73)) FT /db_xref="GOA:O14003" FT /db_xref="SWISS-PROT:O14003" FT /label=rfc3 FT /note="SPAPJ698.01c, len:141" FT /partial FT /gene="rfc3" FT /gene="SPAPJ698.01c" FT /gene="SPAC27E2.10c" FT /product="Replication factor C 36 KD subunit; activator 1 FT 36 kd subunit" FT /protein_id="CAB92098.1" FT /translation="MSIEKGKGRAMDIDLPLGSESTLPWVEKYRPANLEDVVSHKDIIS FT TLEKFISSNRVPHMLFYGPPGTGKTSTILACARKIYGPNYRNQLMELNASDDRGIDAVR FT EQIKNFASTRQIFASTFKMIILDEADAMTLAAQNALRR" FT misc_feature complement(74..85) FT /note="ctaacaatttag, splice branch and acceptor" FT intron complement(74..124) FT /note="confirmed intron" FT misc_feature complement(119..124) FT /note="gtatgt, splice donor sequence" FT misc_feature complement(339..355) FT /note="ctaatcaaataaataag, splice branch and acceptor" FT intron complement(339..387) FT /note="confirmed intron" FT misc_feature complement(382..387) FT /note="gtaaga, splice donor sequence" FT misc_feature complement(453..463) FT /note="ttaacatttag, splice branch and acceptor" FT intron complement(453..617) FT /note="confirmed intron" FT misc_feature complement(612..617) FT /note="gtaagc, splice donor sequence" FT CDS complement(1632..2495) FT /db_xref="GOA:Q9P546" FT /db_xref="SWISS-PROT:Q9P546" FT /label=rpsa-2 or rps0-2 FT /note="SPAPJ698.02c, len:287, FT SIMILARITY:Schizosaccharomyces pombe, Q9Y7L8, 40s ribosomal FT protein s0., (292 aa), fasta scores: opt: 1569, E():0, FT (81.9% identity in 287 aa)" FT /gene="rpsa-2" FT /gene="rps0-2" FT /gene="SPAPJ698.02c" FT /product="40s ribosomal protein s0B" FT /protein_id="CAB92099.1" FT /translation="MAESIARPSVLNATDDDIKNLLAADSHIGSKNLEVRMENYVWKRR FT SDGIHIINLGKTWEKLVLAARVIATIENPADVCVISSRPYGHRAVLKFAAHTGATAIAG FT RFTPGNFTNYITRTYREPRLIIVTDPRADAQAIKEASFVNIPVIALCDTDSILNHVDVA FT IPINNKGYKSIGLAWYLLAREVLRLRGNISRTTAWEVMPDLYFYRDPEEIEREEEQKAA FT AAAAAEEEAQLAAQTAAAEFEVTDSAAGTVDPTILDNATAGQVGQTTWEGDAEWNITGA FT APSEWA" FT misc_feature complement(1929..2435) FT /note="Match to PF00318 Ribosomal_S2, Ribosomal protein S2 FT Score 285.62" FT CDS complement(3053..6673) FT /db_xref="GOA:Q9UTT2" FT /db_xref="SPTREMBL:Q9UTT2" FT /label=prp12 or sap130 FT /note="SPAPJ698.03c, len:1206" FT /gene="prp12" FT /gene="SPAPJ698.03c" FT /product="prp12p/sap130." FT /protein_id="CAB92100.1" FT /translation="MDTFPSLFLYSLTIQNSNYVQSSCAASLSGKKAQEIVIATESRLL FT IYKVDATDGRMNCILNQNCFGIIRNVAPLRLTGFKRDYLVVTSDSGRITILEYNVEKNK FT LVPIYQETFGKSGIRRVVPGEYLAIDAKGRAAMIASVEKNKLVYVLNRDSEANLTISSP FT LEAHKANNICFHLIGLDTGYANPIFAALEVDYSEIDHDSTREAFTSSEKVLSYYELDLG FT LNHVVKRWSKVVDRNSYMLIPVPGGNDGPSGTLVISNGWISYRHLQKAFHQIPILRRQA FT ASANAISTPWNQVNSNSANDGPLIVSAVLHKMKGSFFYLLQTGDGDLLKLTIEHDGQGN FT VVELRLKYFDTVPLAVQLNILKTGFLFVATEFGNHQLYQFENLGIDDDELEITSLDFQA FT QDNEVGTKNVHFGVRGLQNLSLVEEIPSLYSLTDTLLMKAPSSGEANQLYTVCGRGSNS FT SLRQLRRGLETTEIVASELPGAPIAIWTLKLNQTDVYDSYIILSFTNGTLVLSIGETVE FT EISDSGFLSSVSTLNARQMGRDSLVQIHPKGIRYIRANKQTSEWKLPQDVYVVQSAIND FT MQIVVALSNGELVYFEMSDDVEGGQLNEYQERKTLTANVTSLALGPVQEGSRRSNFMCL FT ACDDATVRVLSLDLYTTLENLSVQALSSPANSLCIIPMNVNGVSTLYLHIGLMNGVYLR FT TVIDVTSGQLLDTRTRFLGPRAVKIYPITMKNQNTVLAVSSRTFLAYSYQQNLQLSPIA FT YSAIDHASSFASEQCPEGIVAIQKNTLKIFTVDSLQDDLKSDIYPLICTPRKIVKHPNF FT PVLYILQSERNFDSFKYAQENGDVGSSYTKEKQNEHTSKSWVSFISVFDMISKKIIHES FT PLGDNEAAFSMTAAFFKNRDEFFLVAGSATNMDLECRTCSHGNFRVYRFHDEGKKLELI FT SHTEIDGIPMALTPFQGRMLAGVGRFLRIYDLGNKKMLRKGELSAVPLFITHITVQASR FT IVVADSQYSVRFVVYKPEDNHLLTFADDTIHRWTTTNVLVDYDTLAGGDKFGNIWLLRC FT PEHVSKLADEENSESKLIHEKPFLNSTPHKLDLMAHFFTNDIPTSLQKVQLVEGAREVL FT LWTGLLGTVGVFTPFINQEDVRFFQQLEFLLRKECPPLAGRDHLAYRSYYAPVKCVIDG FT DLCEMYYSLPHPVQEMIANELDRTIAEVSKKIEDFRVRSF" FT CDS complement(6943..7743) FT /codon_start=1 FT /db_xref="SPTREMBL:Q9P545" FT /label=cut20 FT /note="SPAPJ698.04c, len:>266" FT /partial FT /gene="cut20" FT /gene="SPAPJ698.04c" FT /gene="SPAC19G12.01c" FT /product="cut20 protein" FT /protein_id="CAB92101.1" FT /translation="IEHINETVIYIRHSLFRSKLTSYFMGTKPLQLRDPDYYSLKDFAN FT QDDSNSVDDFVSFKTLKESLRDSFNVIFSYPSLTCQKQWLKTGDLVLFEGTDWNVSSLI FT PKSCNEKNQLFSLFFRKDTPNIFLIISQLMENTMLPVSGCHFGLDYAELLGSSLLDFQP FT ATVLDMKLLNGSSILILGKLKEKCFLCEICLADVPLTFFEHQQKNSYLDAISHLPFIPL FT NSCLWLHEFEKDFLPSTLEYALSENSDYGVLISRESSRYRLFSF" FT misc_feature 7641..7743 FT /note="nominal overlap with cosmid SPAC19G12 S. pombe FT chromosome 1" XX SQ Sequence 7743 BP; 2538 A; 1462 C; 1402 G; 2341 T; 0 other; tcttcgtaaa gcattttgag ccgccaaagt catggcatcg gcttcatcta atataatcat 60 tttgaaagta gatctaaatt gttagtcaaa catatgttgg aactaaatat catcaacgac 120 atacgcaaag atttgccggg tgctggcaaa atttttgatt tgttccctca cggcatcgat 180 accacgatca tcgcttgcat ttaactccat caactgattt ctgtaatttg gtccataaat 240 ttttcgagca cacgcaagaa ttgtagaggt tttgcctgtg ccaggtggcc cataaaacag 300 catatgaggg actctatttg aactaatgaa tttttcaact tatttatttg attagaaaag 360 acagaaaaaa gtgatggtca atcttacgcg tagatatgat atctttgtga gaaaccacat 420 cctctaaatt agctggtcgg tacttttcca ccctaaatgt taacaagata taaaggaact 480 taaaaaaggt aaatcggcaa gtaaatattt agacacaaaa gacctcaaat ttgtaatttt 540 gaatggcatg acgagttttg cgcgaatgta aatatcgtgc aaaaacaaag taaccagaat 600 cactgtaatt agcttaccat gggagcgtac tctcacttcc caaaggaagg tcaatgtcca 660 ttgcacgccc tttacctttt tcgatagaca tgctggaaga ctgcaggttt gaggactgat 720 tgcacagtag atgtagatgc tagaggtcag ttacattggg ttgactaaat atcctcgcat 780 aaacactact ttcagatagt tttatgaaga taacatttaa atgtaattat gtttctgatt 840 ctatatatct attcaagttt tatcacttaa aagtttgcta aatacgattt tcatacaatg 900 tacggtggac acttaaaatt gaataaacac gttaatacct cctttatctg tctagaaaaa 960 tgcttcaatt gtttgtatga atgcttgaca tcgaaggttc ggagtctgaa cagttgtact 1020 gggcgaagag aagaatagtc tacatataag tgtttaaaag tttcaagaaa tttgattagt 1080 actgagttgt acatattagt gatgttctaa ggattgctaa tcagaaaacg aacagcacta 1140 aagttagtac ttcacattca aaacctacga ttttttcaag aattgacaat actttatctg 1200 ttcgaaattt gtcaaaataa agactcactg ataatttgta atcaataatg aactaaaatt 1260 ttaaaaacaa cattgggaat ctaactttta aacatatatg ggtagttaac attcgcatcg 1320 ccgatttcat acaatatcca caacactcaa tgttggttac actgtttcta caaataacac 1380 agctaatgag acgatgtttg ttcagttcat tttcttttat atttaccact acgacaagat 1440 tcaatatcca aatgataatc tattcattgt actgtaaaat aatcttgttt cagcactaca 1500 accaaagctt aaaattataa cagaaattta acacacctac ttataagtac tgcacgaatg 1560 gctcttattt attagcataa cgtgatctca aagatatggt tttttagctg catgaaatgg 1620 gaaataaaca attaagccca ttcagaaggg gcagcaccgg taatattcca ttcagcgtct 1680 ccttcccaag tagtttgacc aacctgaccg gcagtagcat tgtccaatat agttgggtca 1740 acggttccag cagcactgtc tgtgacttca aactcagcag cagcagtttg agcagccaat 1800 tgagcttctt cttcggcagc agcagcagca gcagcctttt gttcttcttc acgctcaatc 1860 tcttcaggat cacggtagaa gtaaagatca ggcatgactt cccaagcggt ggtgcgagaa 1920 atgttaccac gaagacgaag gacttcacga gccaaaagat accaggccaa accaatggac 1980 ttgtaaccct tgttgttgat aggaatagca acatcaacat ggttcaaaat ggagtcagta 2040 tcacacaagg caataactgg aatgttgacg aaggaagctt ccttgatagc ttgagcatca 2100 gcacgaggat cagtaacaat aataaggcga ggttcacggt aagtacgagt gatgtagttg 2160 gtgaagttac cgggagtgaa acggccagca atggcagtag caccagtgtg ggcagcgaac 2220 ttcaagacag cacggtgtcc atatggccgg ctagaaataa cacaaacatc agctgggttc 2280 tcaatggtag caattacacg agcagcaaga acaagctttt cccaagtctt gcctaagttg 2340 ataatgtgga taccatcaga acgacgcttc catacatagt tctccatgcg gacttccaag 2400 tttttggagc cgatgtggga gtcggcggct aagagattct tgatatcatc atcagtagcg 2460 ttgagaactg atggacgggc tatggattct gccattttgt tgcgtgcacc tcttggtggt 2520 tggaagaaat tccaatttca cctttgccat agtgttgata atttagtgac tgctttttcg 2580 gtcaaaactt tttttacaga gttctaagct gaggttaaag aagattaagg aattgtaacc 2640 catcgttaat tcgtaaacaa aagatttaca gtaatagctg ttttccaact tatagtatgc 2700 tttattttgg taggcattta cagaaaatcc tttcatggga actctgtata acagtcatta 2760 tttatttatt ttagtactaa acttttatat ttttttttat aattacccct atttccaaac 2820 ttactttgaa atcgaatggt cttaccagag aaaggattaa taattgcttc tttattaaaa 2880 acactagatg gaagaaattg ctatcgacat aagacgtata attatattaa aaagctgcca 2940 aacattgaag agtcgacatc aagaaacttt tgtttttcaa gaaggatagg cataaagaga 3000 tgccctttat aaatcttctt cgtcaaagtt cacggatttg taagctattt gtttaaaaac 3060 tacgaacacg aaaatcttct attttttttg aaacctctgc aattgttcta tcgagttcat 3120 tggcaatcat ctcctgaact ggatgaggta acgaatagta catctcgcag agatcaccat 3180 ctataacaca tttgactgga gcgtaataac tacgataagc aagatgatcc cttccagcca 3240 aaggaggaca ttcttttcgc aacaaaaatt ccagttgttg aaaaaatcga acatcttcct 3300 ggttaataaa aggggtaaat actccaacag tacccagaag accggtccat aacaaaactt 3360 ctcgagctcc ttcaaccaat tggacctttt gaagcgaagt aggaatatca tttgtaaaga 3420 aatgagccat taggtccaat ttgtgaggtg tggaatttaa aaagggcttt tcgtgaatta 3480 atttagattc agaattctct tcatcagcta gtttagaaac atgttcgggg cagcgtaaaa 3540 gccaaatatt accaaattta tccccccctg ctaaagtatc ataatcgaca aggacatttg 3600 tagtagtcca ccgatgaata gtatcatcag caaaagttag taggtgatta tcctcaggtt 3660 tatagacaac gaatctaact gagtattgcg aatctgcaac cactatccta ctagcttgaa 3720 ctgtaatatg tgtaataaat agtggtacag cagacaactc acctttacga agcatttttt 3780 tgtttcctaa atcataaatt cgtaaaaagc gtccaactcc agctagcata cgtccctgaa 3840 agggagttaa agccatagga attccatcga tttccgtatg gctgattaac tcaagctttt 3900 taccttcatc gtggaaccgg tatactcgaa agtttccatg agaacaggtt cgacattcaa 3960 gatccatatt agtggcagaa ccagctacca aaaagaattc atctctattt ttgaaaaaag 4020 cggctgtcat actaaaagca gcttcgttat cgcccagcgg actttcatgg attatttttt 4080 tagatatcat atcaaataca gatataaatg acacccaaga ttttgatgtg tgttcatttt 4140 gcttctcctt cgtatacgaa gaacccacat ctccattctc ttgagcatat ttaaatgagt 4200 caaagttcct ttcactttgc aaaatgtaca aaactggaaa gttgggatgt ttaacgattt 4260 ttcgaggggt acaaattaac ggataaatat cagattttaa gtcatcttgc agactatcga 4320 ctgtgaaaat ttttaaagta tttttttgaa tcgctacaat accttctgga cactgttcgc 4380 ttgcaaaaga tgaagcatga tcaattgcag aataagcgat tggtgatagt tgcaagtttt 4440 gttgataact atatgcaagg aaggtacgag aagacaccgc caacaccgta ttttgatttt 4500 tcatggttat tggatagatc ttgacagctc gtggacccaa gaatctagtc cttgtatcca 4560 aaagttgccc ggatgtaacg tcgataacag ttcgtaaata gacaccattc attaaaccta 4620 tatgtaaata gagagtactc acaccattta cgttcatcgg aattatgcat aaagaattgg 4680 caggagaact aagggcttgt acactcaaat tttctaaggt tgtgtacaaa tcgagagaca 4740 acaccctcac cgtagcatcg tcacatgcta agcacataaa attacttctt cttgatcctt 4800 cttgtacggg gcctaaagcc aaagaagtta cattagcagt aagcgttttt ctttcttgat 4860 attcgttgag ctgacctcct tcaacatcat cactcatttc aaaataaacg agttctccat 4920 tgcttaaagc aacaactatt tgcatatcat taattgccga ctgtacaacg tacacatcct 4980 gaggtaattt ccattcactt gtttgcttat tagccctaat ataacgaata cctttaggat 5040 gaatttgaac caacgaatcc cttcccattt gcctcgcatt tagagttgag actgatgata 5100 aaaatccact atcagatatt tcttcaaccg tctctccaat agaaagaacc aaggttccat 5160 ttgtaaacga aagaataata taagagtcat aaacgtcggt ttgatttaat ttcaatgtcc 5220 aaattgcaat aggggcaccg ggaagttctg acgctacgat ctctgtggtt tctaaacccc 5280 ttctcaactg acgcagagaa gagtttgagc ctcttccaca tactgtatat aattgattag 5340 cttcgcctga cgagggcgcc ttcataagta acgtatctgt taacgaatat aggcttggta 5400 tttcttcaac caaagaaagg ttctgtagac cgcggactcc aaaatgcaca ttttttgtac 5460 cgacttcatt gtcctgggct tggaaatcta aagaagtaat ttccaattca tcatcatcaa 5520 tacctaaatt ctcaaattga tacaattggt ggtttccaaa ttccgtggca acgaatagaa 5580 aaccagtttt caaaatattt aactgaactg ccaggggaac tgtatcaaaa tattttaacc 5640 ttagttcaac aacattccct tgaccatcat gctcgattgt cagcttcagc aaatccccgt 5700 caccagtttg taaaagataa aaaaacgatc ctttcatttt atggagaaca gcactaacaa 5760 tcaaaggacc atcatttgca gagtttgaat tgacttgatt ccatggagtg gatatggcgt 5820 ttgccgatgc agcttgacga cgcaaaatgg gaatttgatg aaatgccttc tgaaggtgac 5880 ggtaggaaat ccatccgtta gatattacta gtgtaccgga gggcccatca ttaccaccgg 5940 gaacagggat taacatataa gaatttctgt caacgacctt cgaccatctt ttgacaacat 6000 ggtttaatcc caaatccaat tcgtaatatg ataaaacctt ttccgaagat gtaaatgctt 6060 ctctagttga gtcatggtct atttcgctgt agtcgacttc taaggcagcg aagatgggat 6120 tagcataacc agtatctaac ccaattaaat ggaaacatat attattagct ttatgagctt 6180 caagtggtga agatattgtt aggttagcct ctgaatctcg atttagtacg taaaccagct 6240 tatttttctc cacagatgca atcatagcgg ccctcccctt cgcatctatg gctaagtatt 6300 caccaggaac aactcttcga attcctgatt ttccaaaagt ttcctggtat attggcacta 6360 atttattttt ttcaacgtta tattccaaaa tggttattcg tcctgaatcc gacgtgacta 6420 cgagataatc gcgtttaaaa ccagttaatc tcaaaggagc aacatttcga ataataccaa 6480 aacaattttg atttaaaata caattcattc tcccatcagt tgcatcaact ttatatataa 6540 gcaaccttga ttctgtagca attactatct cctgtgcctt tttaccagac aaggacgcgg 6600 cacacgaact ttgaacataa ttactattct gtattgttaa ggaatataga aacagagaag 6660 ggaaggtgtc catctctata ctttagctga aacacgatcc tcgctgagag aagaagtcag 6720 tgtatcaagg aaacttatag cagaaggtaa aaaacttcgt tatatgttga cttttctaaa 6780 tgaagttatt gcaattgaaa atattgtata aattttctgc ttttaaatta tattaaagaa 6840 ttaaaaaaaa gaaaagaaaa tttaaaagac cgctacaaaa aatgtaactt gctcttcaaa 6900 aacaagaaaa aaaaaacatt acgaccatta aacataagct cattaaaaag agaataaacg 6960 atatctcgaa gattcccttg aaataaggac tccgtaatcg ctattctcag aaagggcata 7020 ttcgagtgtt gaaggaagga aatccttctc aaattcatgc agccataggc aactgtttag 7080 tggaatgaaa ggaagatgac tgatagcatc aaggtaactg tttttttgct ggtgctcaaa 7140 aaaggttaag ggaacgtctg ctaagcaaat ttcacataag aagcacttct cctttagttt 7200 tccaagtatt aaaatactac ttccgtttaa aagcttcatg tctaaaacag ttgcgggctg 7260 gaaatccaac agactgcttc ctagtaattc tgcataatcc aagccaaagt ggcatccaga 7320 tacaggcaac atagtatttt ccataagttg ggaaattatt aagaatatat tcggagtatc 7380 ctttcgaaag aataatgaaa ataattggtt tttttcatta caggattttg gtataagaga 7440 tgatacattc caatcagtcc cttcaaacaa aacaaggtct ccggttttta accattgctt 7500 ttgacaagtt aaagaagggt agctaaaaat gacgttgaat gagtctctta aagattcttt 7560 caaggttttg aatgatacga aatcatcaac agaattactg tcgtcttgat ttgcaaaatc 7620 cttgagagaa taataatcag gatctcttag ctgaaggggt tttgtgccca taaagtaact 7680 tgttagtttc gagcggaata aagaatgacg aatataaatt acggtttcat taatatgttc 7740 aat 7743 //