ID   SPAC683    standard; DNA; FUN; 6399 BP.
XX
AC   AL441621;
XX
SV   AL441621.1
XX
DT   16-SEP-2000 (Rel. 65, Created)
DT   12-JUN-2001 (Rel. 68, Last updated, Version 2)
XX
DE   S.pombe chromosome I cosmid c683.
XX
KW   C2-H2; zinc finger.
XX
OS   Schizosaccharomyces pombe (fission yeast)
OC   Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes;
OC   Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces.
XX
RN   [1]
RP   1-6399
RA   Aert R., Robben J., Volckaert G., Wood V., Rajandream M.A., Barrell B.G.;
RT   ;
RL   Submitted (14-SEP-2000) to the EMBL/GenBank/DDBJ databases.
RL   European Schizosaccharomyces genome sequencing project, Sanger Centre, The
RL   Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail:
RL   barrell@sanger.ac.uk and Katholieke Universiteit Leuven, Laboratory of
RL   Gene Technology, Kardinaal Mercierlaan 92, B-3001 Leuven, Belgium
XX
DR   GOA; Q9HFF2; Q9HFF2.
DR   SPTREMBL; Q96VG3; Q96VG3.
DR   SPTREMBL; Q9HFF2; Q9HFF2.
DR   SPTREMBL; Q9HFF3; Q9HFF3.
XX
CC   Notes:
CC   Details of yeast sequencing at the Sanger Centre are available on
CC   the World Wide Web.
CC   (URL, http://www.sanger.ac.uk/Projects/S_pombe/)
CC   During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced
CC   by the Sanger Centre. The sequencing of the S. pombe genome is now
CC   being continued with funding from The European Commission.
CC   Fourteen European sequencing laboratories, including the Sanger Centre,
CC   are participating in the project.
CC   Protein coding regions (CDS) have been predicted with the help
CC   of computer analysis using the Genefinder program in PomBase
CC   (an ACEDB database) with additional predictions for the
CC   branch-acceptor sites supplied by the program Sp3splice.
CC   CAUTION: It is possible that for any individual CDS we may have
CC   underestimated or overestimated the number of introns/exons or
CC   we may not have chosen the correct splice donor/acceptor sites.
CC   CDS are numbered using the following system eg SPBC25H2.01c.
CC   SP (S. pombe), B (chromosome 2), c25H2 (cosmid name),
CC   .01 (first CDS), c (complementary strand).
CC   The more significant matches with motifs in the PROSITE
CC   database are also included but some of these may be fortuitous.
CC   The length in codons is given for each CDS.
CC   IMPORTANT: This sequence MAY NOT be the entire insert of
CC   the sequenced clone. It may be shorter because we only
CC   sequence overlapping sections once, or longer, because we
CC   arrange for a small overlap between neighbouring submissions.
CC   Cosmid c683 is overlapped at the 5' end by cosmid c26H5,
CC   EMBL entry c26H5, accession number Z99126 and at the 3' end
CC   by cosmid c694, EMBL entry SPAC694, accession number AL138666.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..6399
FT                   /chromosome="I"
FT                   /db_xref="taxon:4896"
FT                   /organism="Schizosaccharomyces pombe"
FT                   /strain="972h-"
FT                   /clone="cosmid c683"
FT                   /map="IR"
FT   misc_feature    1..756
FT                   /note="Nominal overlap with SPAC25B8 S. pombe chromosome
FT                   1"
FT   CDS             complement(133..1701)
FT                   /db_xref="SPTREMBL:Q9HFF3"
FT                   /label=SPAC683.01c
FT                   /note="SPAC683.01c, len:522"
FT                   /gene="SPAC683.01c"
FT                   /gene="SPAC25B8.19c"
FT                   /product="hypothetical zinc finger protein"
FT                   /protein_id="CAC08551.1"
FT                   /translation="MSSDNTPSINRRNNENPPQSSLPTTSGIVYNMFPACPYPHVQNPA
FT                   FHGSVDVPQVAQKAFDPQAATVSESANVSRPTPAPVPPAGNTNTPTTSNSNQNLENNVT
FT                   SAASMPAILNAAGQLEFSPSTNNALCCCTTVHGHPHMPPNLLAASSARLRLPPISTILG
FT                   GTFADPTFLAAAAAAAVPHYAHATTGSATDASNTSNGNSNPAAVPAGFLSGYSPLSYAY
FT                   FVAKANELALANQRQSSEAAEQPSSKNNTSGANPPSSNNQEVTSAPIPAAPIVLPFQQP
FT                   FYPIVCPGCAQGALPQHIPVPHNTEFAQYQPSSRDLQNHPTVDESRLSSVAPPASNTLN
FT                   HANGNQAENASESSTSQSNDSQGPANTSYPVSVPLPNDAENNHTLSRNPYIPSLNFKDN
FT                   MSAELSVVATLASNSAQAHPMGQQSDSNYSDHHNNDKRAHVSRRHSTSRKIAQSHTGSS
FT                   STSSAANVRYRCTECLQGFSRPSSLKIHTYSHTGERPFVCDYAGCGKAFNVRSNMRRHQ
FT                   RIHGL"
FT   misc_feature    complement(142..216)
FT                   /note="Match to PF00096 zf-C2H2, Zinc finger, C2H2 type
FT                   Score 36.60"
FT   misc_feature    complement(232..300)
FT                   /note="Match to PF00096 zf-C2H2, Zinc finger, C2H2 type
FT                   Score 28.82"
FT   CDS             3133..3450
FT                   /db_xref="SPTREMBL:Q96VG3"
FT                   /label=SPAC683.03
FT                   /note="SPAC683.03 len:105aa"
FT                   /gene="SPAC683.03"
FT                   /product="very hypothetical protein"
FT                   /protein_id="CAC41386.1"
FT                   /translation="MEKCINRYKDILPLNRYSDGSGKCWGLLGCEISSVVGQTGVKRLL
FT                   LGSITKFVYCSMRLFSLYTIVQYFKSLRCLRNGRTEVRLYTHYKRHLPRSVNSIYLQTN
FT                   C"
FT   CDS             complement(5150..5806)
FT                   /db_xref="GOA:Q9HFF2"
FT                   /db_xref="SPTREMBL:Q9HFF2"
FT                   /label=SPAC683.02c
FT                   /note="SPAC683.02c, len:207"
FT                   /gene="SPAC683.02c"
FT                   /gene="SPAC694.01c"
FT                   /product="hypothetical zinc-finger protein"
FT                   /protein_id="CAC08552.1"
FT                   /translation="MARITNMGKRKRFLEATPYESKVLEQPKNSSNTNEESSSQDNMKA
FT                   SFGSSKRYDERQKKKRSEYRRLRRINQRNRDKFCFACRQQGHIVQDCPEAKDNVSICFR
FT                   CGSKEHSLNACSKKGPLKFAKCFICHENGHLSGQCEQNPKGLYPKGGCCKFCSSVHHLA
FT                   KDCDQVNKDDVSFGHVVGVAGTTGADEDVYHEYAKTVAAPTKKRPVKPVKKLVTF"
FT   misc_feature    5305..6399
FT                   /note="Nominal overlap with SPAC694 S. pombe chromosome 1"
XX
SQ   Sequence 6399 BP; 2042 A; 1140 C; 1303 G; 1914 T; 0 other;
     gccaactgct atgtggtttt attcaataaa tttttatata tcatgaaatt aatatcgtca        60
     gaagttgtag tagtcgcaat aaatggcttt caagtaacgg ctaataattg ctatcagaca       120
     ggttaaaatt ggctacaaac catgaatgcg ttgatgtcgc cgcatattac tgcgtacatt       180
     gaacgccttt ccacatccag cgtaatcgca gacaaacggc ctttctcctg tatgggaata       240
     tgtatgaatt tttagactag aaggcctaga aaatccttgt aaacattccg tacatctata       300
     tcgaacgttg gcagcggaag aggtagacga agaaccggta tgggattgtg caattttgcg       360
     tgaagttgaa tgtcgccgcg atacatgagc tctcttatcg ttgttgtgat gatcagaata       420
     atttgaatca gattgctgac ccattggatg agcctgcgcg ctgttgctag ccaaagtagc       480
     aacaacacta agttccgcgc tcatgttatc cttgaaattt aaagaaggaa tatatggatt       540
     tctggataaa gtgtgattgt tttcagcgtc attaggaagc ggaaccgaga ctggataaga       600
     ggtgtttgcc gggccttgag aatcatttga ttgcgaggtt gagctttcgg aagcgttttc       660
     ggcttgatta ccatttgcat gatttaaagt attacttgca ggaggagcaa ccgatgaaag       720
     acgtgattca tcgactgtgg gatgattttg aagatctcgg gagctgggtt gatactgtgc       780
     aaattctgta ttatgcggaa cagggatgtg ctggggcagt gcaccctgag cacatccagg       840
     acatacaatt ggataaaatg gctgttgaaa aggaagtact atgggagcgg caggaatagg       900
     agcagatgta acttcttgat tattagaaga cggaggattt gcgcctgagg tattgttctt       960
     ggaactaggc tgctctgcag cttcagaaga ttgcctttga ttagcaagcg ccagttcgtt      1020
     agcctttgca acaaaatatg cataggaaag aggggagtaa ccagacaaaa atccagcagg      1080
     aacagcagcc ggattagagt ttccattact agtgttagaa gcatcagtcg ccgagcctgt      1140
     agtcgcgtga gcataatgcg ggacggcagc agctgctgct gctgccaaga aagtcgggtc      1200
     ggcgaatgtc ccacctaaaa tagtagaaat gggaggaagt ctaagtcgag cgcttgatgc      1260
     tgccaagagg tttggaggca tatgaggatg gccgtgaaca gtagtacaac aacaaagtgc      1320
     gttgtttgta gaaggagaaa actctaactg accggcagca ttcaggatag caggcatgct      1380
     tgcggctgat gtgacattgt tctccaaatt ttgattagaa tttgaagtag tcggggtgtt      1440
     agtattacct gcaggaggaa ctggtgccgg tgttggacga gatacatttg cagattccga      1500
     aacagtagca gcttgaggat cgaaagcttt ttgagctact tgtggaacat ctaccgagcc      1560
     atgaaaagca ggattctgca catgtgggta tggacaagca ggaaacatgt tatatactat      1620
     tccagaagta gtgggaagag aagattgagg agggttctcg ttattacgtc tattgattga      1680
     aggagtgtta tctgagctca tcacttttaa aaacagcaaa agcctaaaga aatgaaaaaa      1740
     ttgaactgaa ccaaagtaag tcagatcgtt ctattcggag tagcaaagaa gcgtcactgg      1800
     atagaaagtg agttataatg agactcggtt aaatatattt aatttctgtc gtgagattga      1860
     acacgatatt acgagaaagt cctctatatt tgaggggtga aagtgtccga tatttgattg      1920
     acacagaatt gctatcctac gaagtaaaag aatatctcag tttagccaaa gcaagaacaa      1980
     ttaaaagagg caatgcaaga gacaaaagaa taacgggatc caaggatgat caatgtaaga      2040
     ataaaaaagc aattgatcga ctgattaacc ctacagtaaa tttttttttg ggggtttttt      2100
     ttataataaa tagcggaaat gagtgagcgg aacaaacaaa caaagcaaag aaaaaacaaa      2160
     aacttcctac aacctttacg aagaggaagg aaagagtgaa aaaactagaa aaaccaactc      2220
     aaaggcgctc taggacaaaa aaaatatggt caatcgtttc cacgtgacca aattccaacg      2280
     aaccaagaaa cacaaataaa aaaactagga gaatacagaa aaattaaaat gagatgcagc      2340
     aacaaaggga acgaaaagta atttgtttgc tctccacaag ccgacaattt ttacagcact      2400
     ctctttcctc gaagataaat ccaataaacc gctaccttcc aaaataccag aacgtgaaaa      2460
     aggcaggaaa caaagaaaat taagatatcc gtatgccaaa cgagtggact cgaaagggca      2520
     aacaggataa accaagagaa ccaagctaca attcagttgt tggtcaaatg atgctgccag      2580
     agaaaggata acacgaaaat ataagaaaat attacaaata attgggaaaa taccactgga      2640
     aacttggtgg attccttgat ttgcaattta cctagtggtg ttgttgttat acagtaggag      2700
     tggtaaacgg ggaaatccgg taaatatctt tatataaacg tgagagatgg caaagctttt      2760
     ttaacggtga tcggcgtaat gttacctaaa actttgttgg taatctcaat acggggcgag      2820
     taacaatttt ctccagataa cgaattacag ctttgccggc cgaccccgct ttagctaaaa      2880
     tttttttagg ggaactaata tagactcaat ggaaggagca aatcgacaga aacaatatag      2940
     tctgtaatgt aaagaagtct acgaaaaaac cagcaagacc ttgatgttct ttactttctg      3000
     gattcgatgt ggtggaatac gtcaaacgat cggaatttgt attttgggac ccctttgacc      3060
     catttaccca ttcccatatg atcgacgaat gatctgcgtt gaagtaacag gattactata      3120
     tcagatatct cgatggaaaa atgcattaac cgttataagg atatattgcc gttaaaccgc      3180
     tatagcgacg gtagtggcaa atgctgggga ttactgggct gtgaaatatc gagtgtagta      3240
     ggacagactg gagtaaagag attattattg ggatcaatta ctaagtttgt atactgttct      3300
     atgaggttat tttcattgta tactattgta caatatttta agtcattgcg ttgtctacga      3360
     aatgggcgga cagaagtgcg cctgtataca cattacaaga gacatcttcc ccgttcagtg      3420
     aattccatct atttacaaac aaattgttga acgattattc acaatttaga aatggccgat      3480
     cggcccaaca gcgattccga gcccggtgtt ttcaagtgcc atgcttcatt tttctgcatg      3540
     ggagtatatc gctgtgtaag aatccatcac aatagcaaat ggacatgcga tccaacgacg      3600
     atggggcggc agtgtattga tcacacatcg ccagaaaaac tcatctctcg ctaacgtaag      3660
     cagataaaac aatgtcctca attcactaat gactattctt tagcgagagt tgcgctagat      3720
     tgcctaacac ttggcgtaca ataggcggat taccaatgga tatcccatcg tatggccaac      3780
     attgggctca caattcaatt taaaggatcg taaggaattg caacatagta caggaataag      3840
     caacgtactt gaagctatcg tacagtatca cccgtataac aaatgctact gaacgatcga      3900
     tcagcgaggc aagaaagtgg attgaaaact tgacttgaaa tcgatgaaaa ggcttattca      3960
     aaataaaaaa atttttttta cgatttacat atgagtcaaa cgcatatcaa gtacttactt      4020
     ccgtcttttc gttaacattg catacttctt gggttagctc ctatattaac gcattgttta      4080
     ccaatctcag taccagtttg ttaatccatg ttaaatcctt tcattataat taatacctat      4140
     tttccagctc aaggtccatt ctttatttta ttttatttta tttattttat ttttatttta      4200
     ctattattat tattattttt aattttacga aattcatata aaaactttat gcactcttgt      4260
     aaattcacaa ttccgattaa ttcacattta agttcgatta attcgcaatt catgtatagt      4320
     taggcatatt gcaggcatat accatataca caaagaacca aaaagtcgct gtaaaagtcc      4380
     actgctggat aatccttctt gcaactgtca gtcacataac ggtgtgtcac tttgggtgga      4440
     taaggattac agtatagaga gggggcgatc attagtcagt aaatgtattt ttacatccat      4500
     gaatggaccc ttgaaacggg atttgcattg tcttctgtca agtgataccg catatcacgt      4560
     gcaatgaaat ccatgatcta tatttcggca tttgttttct cagctggcaa tcgcagcctg      4620
     ccagtattac gagttgagta ggggtttcca tgattcccta cgtagagaga gagtatcgca      4680
     catatacatt aaggtaataa tttgtatgaa atgagtcgag ttgtcgaaac caaattacaa      4740
     atagaaattt aatggtcaga tatctaatat ctaattgatt acatgtattg tttgttaatt      4800
     cattttaata taagcaatta aaaaattaac gctttgaatt tttgctaaga atatattgtt      4860
     tttggtattc ggatttgtag gattacatta aaagtgtcag ctagctatat tgtcccaaca      4920
     caacaagaca gcaaaaatat tttgcttttt actcttctct tttctattat ttctttaatt      4980
     ttttcaattt attataaaat attaattatt tttcatacac atcaacttca aaattcgcat      5040
     atgtaatcga aaacgtacca aaagctgatt atctgttaca gttactatac catggcttgc      5100
     cttctggtct agtccaattc ctttccccgt ttacctttcc aaaacccttt taaaacgtaa      5160
     ccaacttttt cacaggtttc actggtcttt tcttagtagg cgctgcaacc gttttggcgt      5220
     attcatgata tacgtcttca tcagctccag tggtaccagc gacaccaaca acatggccaa      5280
     aggaaacatc atccttgttc acttgatcac aatcctttgc taaatgatgt acagaggaac      5340
     aaaatttaca acaaccacct ttggggtaca accctttggg gttctgttca cattgtcccg      5400
     acaagtgacc attttcatgg caaatgaagc atttagcaaa cttcaatggt ccttttttag      5460
     agcaggcatt tagagaatgc tctttgctac cacaacgaaa acatattgaa acattatcct      5520
     tagcctctgg gcagtcttgt acaatgtgtc cctgttgccg gcaggcaaag caaaatttgt      5580
     ctctatttct ttggttgatt cttcgaagtc ttcgatactc agagcgcttc tttttctgac      5640
     gctcatcata tcttttcgac gaaccaaaag atgctttcat attatcttga gaagatgatt      5700
     cttcatttgt attggaggaa ttcttgggtt gttccaaaac ttttgactcg tatggcgtag      5760
     cttcgaggaa tcgtttcctt ttacccatgt ttgttattcg tgccattttt tttaattctt      5820
     atgccaatga acgattacac caaagaaatg ccagaaaaga gtattttcta taattaaaat      5880
     tgtttaaaaa atgaatccaa caaggaaata gacttttaga attatacaat ccacagacaa      5940
     ttctaattga aaagttttcc taaatctcta gcctttctgt acctgaagac gaacactttc      6000
     accaaatata aattatactg gtacgcatgt actaaaatga aatattcact tctttaaaaa      6060
     gtcgaacagt cctctctatt tggtttatgg tagaaggttt gttgtgtctt tcagttataa      6120
     aaaaatttcc taaatttcta cacttaagcc attaggttta aggtgtgatt gttaagcgag      6180
     caagcgaaat ttttataatt acgatgttaa gtgatgcatt ttcatgtgta taataaaata      6240
     cattgttgct acatgtttct gcttttgact aatatgtcaa tggcggacaa gacatgcagg      6300
     gacaagcagc atggagttct ttggacttgt gtttgagcct tttagaaacc tttgatcatt      6360
     cagataatac gggtgtacca atgaccacca tttccttct                             6399
//