ID   SPAC31F12  standard; DNA; FUN; 4623 BP.
XX
AC   Z99166;
XX
SV   Z99166.1
XX
DT   16-SEP-1997 (Rel. 52, Created)
DT   24-JUN-1999 (Rel. 60, Last updated, Version 2)
XX
DE   S.pombe chromosome I cosmid c31F12.
XX
KW   ubiquitin fusion degradation.
XX
OS   Schizosaccharomyces pombe (fission yeast)
OC   Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes;
OC   Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces.
XX
RN   [1]
RP   1-4623
RA   Badcock K., Churcher C.M., Barrell B.G., Rajandream M.A., Wood V.;
RT   ;
RL   Submitted (15-SEP-1997) to the EMBL/GenBank/DDBJ databases.
RL   Schizosaccharomyces pombe chromosome I sequencing project, Sanger Centre,
RL   Hinxton Hall, Hinxton, Cambridge CB10 1RQ E-mail: barrell@sanger.ac.uk
XX
DR   GOA; Q10435; Q10435.
DR   SPTREMBL; O14100; O14100.
DR   SWISS-PROT; Q10435; YDE1_SCHPO.
XX
CC   Notes:
CC   Details of yeast sequencing at the Sanger Centre are available on
CC   the World Wide Web.
CC   (URL, http://www.sanger.ac.uk/Projects/S.pombe)
CC   Protein coding regions (CDS) have been predicted with the help
CC   of computer analysis using the Genefinder program in PomBase
CC   (an ACEDB database) with additional predictions for the
CC   branch-acceptor sites supplied by the program Sp3splice.
CC   CAUTION: It is possible that for any individual CDS we may have
CC   underestimated or overestimated the number of introns/exons or
CC   we may not have chosen the correct splice donor/acceptor sites.
CC   CDS are numbered using the following system eg SPAC5H10.01c.
CC   SP (S. pombe), A (chromosome 1), c5H10 (cosmid name),
CC   .01 (first CDS), c (complementary strand).
CC   The more significant matches with motifs in the PROSITE
CC   database are also included but some of these may be fortuitous.
CC   The length in codons is given for each CDS.
CC   IMPORTANT: This sequence MAY NOT be the entire insert of
CC   the sequenced clone. It may be shorter because we only
CC   sequence overlapping sections once, or longer, because we
CC   arrange for a small overlap between neighbouring submissions.
CC   Cosmid c31F12 is overlapped at the 5' end by cosmid c637 and
CC   at the 3' end by cosmid c12B10.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4623
FT                   /chromosome="I"
FT                   /db_xref="taxon:4896"
FT                   /organism="Schizosaccharomyces pombe"
FT                   /strain="972h-"
FT                   /clone="cosmid c31F12"
FT                   /map="IR"
FT   misc_feature    1..3928
FT                   /note="overlap with SPAC637 S. pombe chromosome 1"
FT   CDS             34..2349
FT                   /db_xref="SPTREMBL:O14100"
FT                   /label=SPAC31F12.01
FT                   /note="SPAC31F12.01, len:771, SIMILARITY:some to
FT                   Saccharomyces cerevisiae, ZDS1_YEAST, zds1 protein, (915
FT                   aa), fasta scores: opt: 294, E():1.7e-07, (22.5% identity
FT                   in 755 aa)"
FT                   /gene="SPAC31F12.01"
FT                   /product="hypothetical protein"
FT                   /protein_id="CAB16275.1"
FT                   /translation="MSPPETEQDASTLFWVPANLHPELNPTGWKSFLDLQVKNLKSPTA
FT                   TDTSSSPLEHIRSLRRRKSLLSRQVKADDAVINYQDGSPIVEKAYLKRHRSLRLNELEH
FT                   LESLARDPHRMVSLVDGMSNGSPEDSPLLVSPNHFLQRSSRTTIRRTGASIRTIHRGKT
FT                   STLSGNRSHSILQKPTDTSPLHKIEPISADELVESDDSRTSALSNSQNPSDDVENQSDQ
FT                   ALEVLSLTNPPKIDNASADTTLHKETNKIDKLYVSENKAESAVASESSLSEGTLALKAP
FT                   APENKPEKSSTSKPPVPENKAEDSVVLKSSVPEDKSENSIASKPSATEGIPENAIALQS
FT                   SVPENKAEDSVVLKSSVPEDKSEDSVPSKSSVLEDKHENSVEIDKKADDSLPSNNKTEG
FT                   YTPSVVREEKNYSEPNASPSVIPPRVPTPVPGRTLSPKPTRIPTPIPSSLNVSLESSKK
FT                   PEIFHERHIPTPETGPNKPSKNNILKSTQVPVTPKQKSSTANKGSTSSPSPPSSESKKT
FT                   KRSWGRLFVSGDSDKEHKEHKKDKQKKKNDQISSSSKSASSFKKDRDKESIFGSLFGSK
FT                   KKQTEIPPVSSSPPHNDAPPKAKPISAPSELPNTTSVAEAKCQTVTDDEGTDQQSDEKS
FT                   TEPKTFIPDKDYYWSRFPICTERAIYRLSHIKLSNAHRPLFQQVLLSNFMYSYLDLISR
FT                   ISSNRPMNNVQQSTAKPIRKDINGQQRRSEFSAENVKNELENLSYQFGDQRKRNLNRKG
FT                   STIHTVSQNIQKVSKNAK"
FT   misc_feature    784..1306
FT                   /note="identical to EM_FUN:SPDNAM31 Y09285 S.pombe DNA frag
FT                   ment M31; Genomic DNA fragment which when fused to GFP give
FT                   s a nuclear localization"
FT   CDS             complement(2724..4623)
FT                   /codon_start=2
FT                   /db_xref="GOA:Q10435"
FT                   /db_xref="SWISS-PROT:Q10435"
FT                   /label=SPAC31F12.02c
FT                   /note="SPAC31F12.02c, len:632, SIMILARITY:Saccharomyces
FT                   cerevisiae, UFD4_YEAST, ubiquitin fusion degradation
FT                   protein 4, (1483 aa), fasta scores: opt: 826, E():0, (38.1%
FT                   identity in 616 aa)"
FT                   /partial
FT                   /gene="SPAC31F12.02c"
FT                   /product="probable ubiquitin fusion degradation protein"
FT                   /protein_id="CAB16276.1"
FT                   /translation="SRISVRNETGRRFSILREAGSLRESMSGSSRNSSGDYTDSMSQDA
FT                   PNHTTEPSERRDSSTSSHFEEHFVFSLMGKKVPRNKTIFRILYEYIQLSDDHTLDDFWK
FT                   TPVPIFYGEPDCIHNDMKGELNYENETEGFSINIREILDLLSILYYGIRDVHTLFPDKH
FT                   FRGNIENILTDFSNWKLSAKLNRQLEEQQLVVHGCLPSWCISLTSAYPFLVPFETRYLL
FT                   LQSTAFGLSRSVSFLLSRNPELSKNESSSILQFVSLHRQKIRISRKKIFNYALHLLATY
FT                   AASENILEIEYEDEVGSGLGPTLEFYTSVSKEFTLNSLDIWRNDQPNSKFVYQASGLFP
FT                   SPIPLLGSSPENERKISLFFALGQFVARSIYDSRIISIQFNPLFFARNIPLTISSVAKV
FT                   DKGLANSLRYLEKLIPGKNPTNAETDIKLEDLHLDFTLPGFPSIELIPDGASTPVTTFN
FT                   VSDYLNYVIDYTVGKGVQQQLEAFQNGFSSVFPYTSLQVLTEHELVTLFGTVDEDWSYA
FT                   TLMKSIVADHGYTMESPTIQRLLTLMSQMNFQEQRDFLQFITGSRKLPIGGFAGLNPPL
FT                   TVVRRLNEPPYVPDDYLPSVMTCVNYLKLPEYSSSEVLGSRLSKAILEGQGSFHLS"
FT   misc_feature    complement(2730..3701)
FT                   /note="Pfam match to entry PF00632 HECT, HECT-domain
FT                   (ubiquitin-transferase).,"
FT   misc_feature    complement(4537..4623)
FT                   /note="nominal overlap with cosmid SPAC12B10 S.pombe
FT                   chromosome 1"
XX
SQ   Sequence 4623 BP; 1487 A; 940 C; 859 G; 1337 T; 0 other;
     gatcctgaga ttccaactga ctggtcagtt gcgatgtccc cacctgaaac cgaacaagat        60
     gcatctaccc tattctgggt accagcaaat ttacatcctg aattgaaccc gactggttgg       120
     aaatcgtttt tggaccttca agttaaaaac cttaagtccc ctactgctac agacacatct       180
     tcaagtcctc ttgagcacat tcgttcccta agaagaagaa agtcattgtt gtcacgacag       240
     gttaaagctg acgatgctgt aattaattac caagatggca gtccgatcgt ggagaaagcg       300
     tatctcaaaa ggcatagatc tttacgcttg aatgagttag agcatttgga atctcttgca       360
     cgtgatcccc ataggatggt ttctttagtt gatggcatga gcaacggctc tcccgaagat       420
     tcgcctttac tagtatctcc taatcatttt cttcaacgtt cctcaaggac gacaattcga       480
     aggacaggtg cttcgattcg gactatccac cggggtaaaa catcaactct atctggcaac       540
     cgtagtcact ctattttgca aaagccgacc gatacttcgc ctctacataa aatagagccc       600
     atatctgctg atgaacttgt tgagtcggat gactctagaa cttcggcgct gtcaaactct       660
     caaaatccat cggacgatgt tgaaaaccaa tctgaccaag ctctagaggt tttatccttg       720
     acgaatccac caaagattga taacgcctct gcagatacaa ccttacataa agagacgaat       780
     aagatcgata agctgtatgt ttcggagaat aaagctgaaa gtgctgttgc atctgaatct       840
     tcgttatccg aaggtactct cgcattgaaa gcaccggcac cggaaaacaa acctgaaaag       900
     tctagtacat cgaaaccgcc agtaccagaa aataaggctg aagattctgt tgtattaaag       960
     tcgtcggtac cagaagataa atctgaaaat tctattgcgt cgaaaccgtc ggcaacggaa      1020
     ggtataccgg aaaatgctat tgcgttgcaa tcatcggtac cagaaaataa ggctgaagat      1080
     tctgttgtat taaagtcgtc ggtaccagaa gataaatctg aagattctgt tccgtcgaag      1140
     tcgtcggtac tggaggataa acatgaaaat tctgttgaga ttgataaaaa agcagatgac      1200
     tcattacctt ctaataataa aactgagggt tatacacctt cagtagtccg tgaggaaaag      1260
     aattattctg aaccgaacgc ttctccttcc gtaatacctc ctagagtccc tactcctgtg      1320
     cctggtagga ctttaagtcc aaagcctact aggataccca cgcctattcc ttcttctttg      1380
     aacgtttctt tagagtcctc aaaaaagcca gagatttttc atgagcgtca tattccgact      1440
     cctgaaaccg gtcctaataa gccatcgaaa aacaatattt taaaaagcac tcaagttccc      1500
     gttactccaa aacaaaaatc atccactgct aacaaaggtt caacttcttc accaagtcca      1560
     ccctcttctg aatcaaagaa aaccaagagg tcctggggcc gtctttttgt gtctggtgat      1620
     tctgataaag aacacaaaga acacaaaaaa gataaacaga aaaagaagaa tgaccaaatt      1680
     tcttcttctt caaagtctgc ttcttctttt aaaaaggacc gagacaagga aagcattttt      1740
     ggatctttat ttggctcgaa gaaaaagcag acggaaatac cccctgtatc ttcttctcca      1800
     ccgcataatg atgccccacc aaaagcaaag ccaatttccg cacctagtga attgcccaat      1860
     accacttctg tcgcagaagc taagtgtcag acagtgacag atgacgaagg taccgatcaa      1920
     caaagtgatg aaaagagtac ggaaccaaaa acgtttattc ctgataagga ttattactgg      1980
     agtcgttttc caatttgtac tgaacgtgcc atttatcgtc tctctcatat aaagttgtca      2040
     aacgcgcatc gcccattgtt tcaacaagtt ctcttaagta actttatgta ttcatactta      2100
     gatttaattt ctcgaattag ctcaaatcgc ccgatgaata atgtacaaca gtctactgct      2160
     aaaccaatac gcaaagacat taatggacaa caaaggcgtt cagagttttc agcagaaaat      2220
     gtaaaaaacg agctggagaa tctcagttat caatttggag atcagcgtaa gcgaaatctt      2280
     aataggaaag gatcgactat tcataccgtt tctcaaaata ttcaaaaagt ttcaaagaat      2340
     gctaaataat cattttaccc tactgctatt cgttggtcta acatttttcc tcattggtta      2400
     attattagca tttggcgact tttgttatat gttacagggt cttcattgtc tttaaaaatt      2460
     tgtatcttgt tcaccaacac tttctggaag caggtggtta tttattttca tgagttaaat      2520
     ataaaattct tactcaatgc ataagtttct aatgaatttc attcgctatg ctactttaca      2580
     ttgtaagggg agtacgaagt tacccattct ataataaatt accaattctt tttcgcttta      2640
     tatttgtaca tagcctctat tttgaagtaa tcatagaacg ctactgaagt gctatacaat      2700
     aacttttttt ttctaaacaa tgcttaggat aaatgaaaac tgccctgtcc ttctaaaata      2760
     gccttagaaa gtcgacttcc taaaacctca cttgaagaat actcaggtag tttcaaataa      2820
     ttcacacaag tcatcacgct tggaagataa tcatcaggta cataaggagg ttcatttaag      2880
     cgcctaacaa cagttaacgg gggatttagg ccagcaaatc ctccaatagg aagttttcgg      2940
     cttccagtaa taaattgcaa aaagtctctt tgttcttgaa aattcatttg actcatcaaa      3000
     gtcagtaatc tctggattgt aggactttcc atggtgtaac cgtgatctgc tactatagat      3060
     ttcatcaagg tcgcgtaact ccaatcctca tcgacagtac cgaatagtgt gaccaattca      3120
     tgttctgtca aaacttgcaa agaagtataa ggaaaaacgc tggaaaaccc attttgaaat      3180
     gcttctaatt gttgttgaac accctttcca acagtatagt caatgacata atttaaataa      3240
     tcactcacat tgaatgttgt cacaggtgta ctagctccat caggtatgag ttcaatagac      3300
     ggaaatcccg gtagagtaaa atcgagatga aggtcttcaa gcttaatgtc cgtctcggcg      3360
     tttgtagggt ttttacccgg tattaatttc tctaaatatc gtaacgagtt agctaagccc      3420
     ttgtcaactt tcgctacaga ggaaatggtt aaaggtatat ttcgtgcaaa gaaaagtgga      3480
     ttaaactgaa tgcttattat ccttgaatca taaatcgagc gtgcaacgaa ctgtccaaga      3540
     gcaaagaaca acgaaatttt tctttcattt tctggactgg accctaataa tggaatagga      3600
     gaaggaaaaa gtccagacgc ttggtacaca aatttagagt tgggttgatc attccgccaa      3660
     atatcgagag aatttaaggt aaattcttta gatactgaag tatagaactc caaagtgggc      3720
     ccaagacctg aaccaacttc atcttcgtat tctatttcaa gaatattttc agaagccgca      3780
     taagttgcaa gcaaatgcaa agcgtaatta aatatttttt tccgagaaat acgaattttc      3840
     tggcgatgta agctgacaaa ttgaagtatg gaggaacttt cgttttttga tagctcagga      3900
     tttcgggaca aaagaaaaga aacggatctt gataaaccaa aagcagtaga ttgcaataaa      3960
     agatatcgag tttcaaacgg tactaggaat ggatatgcag aggttagtga tatacaccag      4020
     gagggtaaac aaccgtgaac aacaagttgt tgctcttcaa gttgtctatt caatttggca      4080
     ctcaacttcc agttgctaaa atctgtaaga atgttttcaa tgttgccacg gaaatgtttg      4140
     tcaggaaaga gggtatgaac gtctctaatc ccataataca aaatactcaa taaatccaat      4200
     atttctctaa tatttataga aaatccctct gtttcattct catagtttaa ctcccctttc      4260
     atatcattgt gaatgcaatc aggctcacca taaaatattg ggacaggagt tttccaaaaa      4320
     tcatcgagag tatgatcgtc agacaactgt atatactcat ataaaattct gaatattgtt      4380
     ttgtttcggg gtaccttttt acccatcaaa ctgaatacaa aatgttcctc gaaatgtgat      4440
     gatgtcgagg agtcacgtcg ctctgatggc tctgttgtgt gattaggagc atcttgagac      4500
     attgagtcag tatagtctcc tgaagagtta cgagatgatc cactcattga ctccctcaaa      4560
     gaaccggcct cgcggaggat tgaaaaacga cgtccagttt cattacgcac agaaatgcga      4620
     ctg                                                                    4623
//