ID   SPCC790    standard; DNA; FUN; 7000 BP.
XX
AC   AL031855;
XX
SV   AL031855.1
XX
DT   09-OCT-1998 (Rel. 57, Created)
DT   09-FEB-2000 (Rel. 62, Last updated, Version 2)
XX
DE   S.pombe chromosome III cosmid c790.
XX
KW   vacuolar membrane protein; zinc finger.
XX
OS   Schizosaccharomyces pombe (fission yeast)
OC   Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes;
OC   Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces.
XX
RN   [1]
RP   1-7000
RA   Lyne M., Rajandream M.A., Barrell B.G., Hilbert H., Duesterhoeft A;
RT   ;
RL   Submitted (08-OCT-1998) to the EMBL/GenBank/DDBJ databases.
RL   European Schizosaccharomyces genome sequencing project, Sanger Centre, The
RL   Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail:
RL   barrell@sanger.ac.uk and QIAGEN GmbH, Max-Volmer-Str 4, D-40724 Hilden,
RL   Germany
XX
DR   GOA; O74924; O74924.
DR   SPTREMBL; O74924; O74924.
DR   SPTREMBL; O74925; O74925.
DR   SPTREMBL; O74926; O74926.
DR   SPTREMBL; O75002; O75002.
XX
CC   Notes:
CC   Details of yeast sequencing at the Sanger Centre are available on
CC   the World Wide Web.
CC   (URL, http://www.sanger.ac.uk/Projects/S_pombe/)
CC   During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced
CC   by the Sanger Centre. The sequencing of the S. pombe genome is now
CC   being continued with funding from The European Commission.
CC   Fourteen European sequencing laboratories, including the Sanger Centre,
CC   are participating in the project.
CC   Protein coding regions (CDS) have been predicted with the help
CC   of computer analysis using the Genefinder program in PomBase
CC   (an ACEDB database) with additional predictions for the
CC   branch-acceptor sites supplied by the program Sp3splice.
CC   CAUTION: It is possible that for any individual CDS we may have
CC   underestimated or overestimated the number of introns/exons or
CC   we may not have chosen the correct splice donor/acceptor sites.
CC   CDS are numbered using the following system eg SPBC25H2.01c.
CC   SP (S. pombe), B (chromosome 2), c25H2 (cosmid name),
CC   .01 (first CDS), c (complementary strand).
CC   The more significant matches with motifs in the PROSITE
CC   database are also included but some of these may be fortuitous.
CC   The length in codons is given for each CDS.
CC   IMPORTANT: This sequence MAY NOT be the entire insert of
CC   the sequenced clone. It may be shorter because we only
CC   sequence overlapping sections once, or longer, because we
CC   arrange for a small overlap between neighbouring submissions.
CC   Cosmid c790 is overlapped at its 5' end by cosmid c1919 and
CC   at its 3' end by cosmid c1840, EMBL entry SPCC1840,
CC   EMBL accession number AL031179.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..7000
FT                   /chromosome="III"
FT                   /db_xref="taxon:4896"
FT                   /organism="Schizosaccharomyces pombe"
FT                   /strain="972h-"
FT                   /clone="cosmid c790"
FT                   /map="IIIR"
FT   CDS             1..301
FT                   /codon_start=2
FT                   /db_xref="GOA:O74924"
FT                   /db_xref="SPTREMBL:O74924"
FT                   /label=SPCC790.01
FT                   /note="SPCC790.01, len:99, SIMILARITY:Schizosaccharomyces
FT                   pombe, O74563, hypothetical zinc finger protein., (680 aa),
FT                   fasta scores: opt: 244, E():4.7e-16, (36.7% identity in 90
FT                   aa)"
FT                   /partial
FT                   /gene="SPCC790.01"
FT                   /product="hypothetical zinc finger protein"
FT                   /gene="SPCC1919.15"
FT                   /protein_id="CAA21291.1"
FT                   /translation="HQKLFHLQEEHTILSIKASYNKKESHLINQAYETQEAQVYKGMLK
FT                   CSVCNFSNWKSKLIPNCGHAFCSNCMEPFYEHKTSTCPQCETPFSVSDILTIHL"
FT   misc_feature    complement(1..310)
FT                   /note="nominal overlap with cosmid SPCC1919, EM:AL035075 S.
FT                   pombe chromosome 3"
FT   misc_feature    136..256
FT                   /note="Pfam match to entry PF00097 zf-C3HC4, Zinc finger,
FT                   C3HC4 type (RING finger)"
FT   misc_feature    complement(281..460)
FT                   /note="low complexity region with similarity to Rice, Mouse
FT                   and Human genomic sequence"
FT   CDS             join(753..1002,1052..1516,1567..2240,2289..3602)
FT                   /db_xref="SPTREMBL:O74925"
FT                   /label=SPCC790.02
FT                   /note="SPCC790.02, len:900, NOTE: May have an additional
FT                   C-term exon, SIMILARITY:Drosophila melanogaster, DOR_DROME,
FT                   deep orange protein., (1002 aa), fasta scores: opt: 677,
FT                   E():4.7e-25, (24.9% identity in 947 aa)"
FT                   /gene="SPCC790.02"
FT                   /product="putative vacuolar membrane protein"
FT                   /protein_id="CAA21292.1"
FT                   /translation="MSLAEDWIDPNSSEDSDIQEDAELEYTADNPEKEQRGVFSLEKVQ
FT                   LQFPVSIRCLAVENNILVMALTSDKLMIVDLERPEDIIDIELPKKVLALGLTYKIFLDP
FT                   SGHYIFVTTTAGDNCLFTPSHQGRVLTKLKGHTVEAVQWNLNGGNILELLIASKSGVLL
FT                   ELVLTLDSANLKRIEKSINTLYSFPFMESPMGILKNIQDDSMTIVTNKRILRFEPKTSR
FT                   GKDQLYFSPAFQGSMKEILSFSEEETAQCFSYSPFPKNLAEPYTLALKTSKRIIYLDIM
FT                   NPVNPDIQDYEFNESPKLSVPTVEMNMILTSFHLAFLDLDTLYIVNRVNGKESYQQRVN
FT                   LSPHEEILGLCCDHEKNTYWLYTTDSLHELVVNNETREASLVFLEKGDFEKALECANTA
FT                   KVRNTVLVGYAEFLMEHEEYERAATLYAETLKSVEEVALKFIELNQKDVLRLYLWKKLR
FT                   SYKSTMKIQKSLLVNWLLELMLAKLNSLDEKERLELFPENVMQQRQQVQREFSTLLNQY
FT                   KDEINREAAYNLANNYGKEEQLLQIATVMKDQSYIMHYWVQRENYEKALETLNEGVSQE
FT                   TLIQHATALLTHRPNETVSIWERQTDLDVHALIPSLLSYNQRSHVPVEENAAIRYLRYV
FT                   TGVLGCVDPSIHNTLFCIYACHSSSNESYLMNYIEQQGNHPLYDMDLGIRLCLQFNCRR
FT                   SAVKILVLMKLYSQGVELALEADDCELAATIANIPEEDVVLKKTLWQTIAKYMFSKKSG
FT                   IKETLRFLENSEVLQLPELIRLLPEDIKLDDLSDNVCDELDHCMKRIEQLDFEIGQASE
FT                   VAHEIQTNAENMRNRYIVLEPNESCWHCNQPLFSEPFVLFPCQHAFHRSCMLEKTYKLA
FT                   SEKNILKECQLCGPSYAVRLINEPFSTDF"
FT   misc_feature    1003..1008
FT                   /note="gtaagc, splice donor sequence"
FT   misc_feature    1038..1051
FT                   /note="ctgaccttcgttag, splice branch and acceptor"
FT   misc_feature    1517..1522
FT                   /note="gtacgt, splice donor sequence"
FT   misc_feature    1552..1566
FT                   /note="ttaacatgatttaag, splice branch and acceptor"
FT   misc_feature    2241..2246
FT                   /note="gtaagt, splice donor sequence"
FT   misc_feature    2272..2288
FT                   /note="ctaacataatatattag, splice branch and acceptor"
FT   CDS             join(4217..4373,4420..4696,4851..5163)
FT                   /db_xref="SPTREMBL:O74926"
FT                   /label=SPCC790.03
FT                   /note="SPCC790.03, len:248, SIMILARITY:Saccharomyces
FT                   cerevisiae, YPL246C, Q12270, chromosome xvi reading frame
FT                   orf ypl246c, (262 aa), fasta scores: opt: 250,
FT                   E():2.7e-07, (27.0% identity in 211 aa)"
FT                   /gene="SPCC790.03"
FT                   /product="hypothetical protein"
FT                   /protein_id="CAA21293.1"
FT                   /translation="MILGRSKEFILKLPIWTQIITYIAILVYALSFFGISTGVLSLSWI
FT                   GLLQKRQLYEIITYVTLHLSMLHIVFNFVSLLPAMSQFEKKQGTLACILVTVIPYTLFP
FT                   GIMHLIVYHFFLRKDYVSIAGLSGWAFAFISASCVHSPQRLISFFNLFSIPAYCFPIIY
FT                   LIMTTILVPKASFIGHASGAVMGYCTPFMLGSIPLKSWAQNVDPIFQSWVKNYHSFDQL
FT                   SHAQLPIAEPLSTFSSFPGKGTRLGG"
FT   misc_feature    4374..4379
FT                   /note="gtaagt, splice donor sequence"
FT   misc_feature    4401..4419
FT                   /note="ttaataacggagagagtag, splice branch and acceptor"
FT   misc_feature    join(4401..4696,4851..5007)
FT                   /note="Match to PF01694 Rhomboid, Rhomboid family Score
FT                   142.31"
FT   misc_feature    4697..4702
FT                   /note="gtaagc, splice donor sequence"
FT   misc_feature    4835..4850
FT                   /note="ttaacttattttttag, splice branch and acceptor"
FT   misc_feature    5074..7000
FT                   /note="nominal overlap with cosmid SPCC1840, EM:AL031179 S.
FT                   pombe chromosome 3"
FT   CDS             complement(5840..6412)
FT                   /db_xref="SPTREMBL:O75002"
FT                   /label=SPCC790.04c
FT                   /note="SPCC790.04c, len:190, SEE spcc1840.01c"
FT                   /gene="SPCC790.04c"
FT                   /product="hypothetical 20.9 kd protein."
FT /gene="SPCC1840.01c" FT /protein_id="CAA21294.1" FT /translation="MVQLFGGALCADFPPKFLDASVLRQIPDNQEVFLQDSKENLTVII FT ELLEKIEKPFDGSVAAYHFNSIAFDNDASQRVIWRDKSLGEDDFEGMRSEKASGSSVQG FT CQRVLEKGKRNPESATNVAIFVNVITLIDFQTDIVISVNAPLPNTSSVPSSVENIPPSD FT QSIVRAALETIQRVTRSLVLVDKTVFA" XX SQ Sequence 7000 BP; 2193 A; 1221 C; 1226 G; 2360 T; 0 other; gcatcagaag ttgtttcatt tgcaagaaga gcatactatt ttatccataa aagcttcgta 60 taataagaaa gagtctcatc ttataaacca ggcatatgaa acacaagagg ctcaagtata 120 caaaggaatg ttaaaatgct ccgtgtgcaa cttttctaat tggaaatcta aactcattcc 180 aaactgtggt catgcctttt gttctaattg tatggaacct ttttatgagc ataaaacaag 240 tacatgtccg cagtgtgaaa cacctttttc ggtttccgac attttgacca ttcatcttta 300 attctagatc atcttttata tgattaacgt aatgtttttg ttattttggt acataatata 360 ttttttggca tcaatagtta ttgaaagaat agtcaatcct atatacatat atatatatat 420 atatatatat ttatttattt atttacatat aaatatgtat gcttttttta ataaaaaact 480 agtaatatct atcaatttga ttaacttatt ccattaagca acttcgttca ttcatatgct 540 agtttcgcaa gtctagcatc atgactcact ctccattatc catgtgccaa aaatagaggg 600 tgtgtatctg tagcgactgt aactagcgaa ataggaaaca attaattact attaattatt 660 tttttaaata ttggtaaatt aatttatatt ggcagcggaa ttataataat tcatctcatt 720 ttctattgat acgcgtgcaa tcccattgaa caatgagtct ggccgaggat tggattgatc 780 ctaattcttc tgaagattca gatattcaag aagatgcgga attggaatac acagctgata 840 atcccgaaaa ggagcaaaga ggtgtatttt ccttagagaa ggtgcaactg cagttccctg 900 tttcaataag atgccttgca gtcgaaaata acatccttgt aatggccttg actagtgata 960 aattaatgat tgttgattta gagcgcccag aggacattat aggtaagcac atacttaacc 1020 agttttcatt gttcttgctg accttcgtta gatattgaat tgccaaaaaa agtcctagca 1080 cttggtttaa cttataaaat atttttggat ccttctggac attacatttt tgttactaca 1140 actgctggtg ataattgttt attcacacct tcacaccaag gtcgcgtgtt gacaaagcta 1200 aaaggtcata ccgtggaggc tgttcaatgg aacttaaatg gaggaaatat tttagagtta 1260 ttaattgcat ctaaatctgg agttctttta gaattagtac tcactttgga cagtgctaat 1320 ctcaaaagaa tcgaaaagtc catcaatacc ctatattcat ttccatttat ggagtctcca 1380 atgggaatct taaaaaacat tcaggatgat tcaatgacta 
tagttaccaa caaacggata 1440 ctgaggttcg agccaaagac atctaggggc aaagaccaat tgtatttttc tcctgctttt 1500 caggggtcta tgaagggtac gtttagtttt tatacgattt tatcactttt tttaacatga 1560 tttaagaaat actttcattt tccgaagaag aaactgccca atgcttctca tactctccat 1620 ttccaaaaaa tttagcggag ccttatactt tagccctaaa gacgagtaaa cgaattattt 1680 atttagatat tatgaatcct gttaatccag atattcaaga ttatgaattt aatgaatctc 1740 ctaagctgtc tgtccccact gtggaaatga atatgatttt aacctcattt catcttgctt 1800 ttctcgatct cgatactctt tacattgtaa atagagttaa cggaaaagaa tcatatcagc 1860 aacgagtcaa tctttctcca catgaagaga ttttggggct ttgttgtgac catgaaaaaa 1920 atacatattg gttgtatacc accgatagcc ttcacgagct agttgttaac aatgaaacaa 1980 gagaagcgtc tcttgtgttc ttagagaaag gggattttga aaaagcactt gagtgtgcga 2040 atactgctaa agttcggaac acggtattag ttgggtatgc tgaattttta atggaacacg 2100 aggaatacga gcgtgctgcc actttgtacg cagaaacgct taaatctgtg gaagaggtag 2160 ccttaaaatt tattgaatta aatcaaaagg atgtattgag gctatatttg tggaagaagc 2220 ttcgatctta taagagcact gtaagtcacc aatacaaaaa actttaaatt tctaacataa 2280 tatattagat gaaaattcaa aaatcgttgt tagttaactg gcttttagag ttaatgctgg 2340 ccaaattgaa ttctcttgat gaaaaagagc gtctcgaatt atttcctgaa aatgtaatgc 2400 aacagcgcca acaggttcaa cgggagtttt caactcttct taatcaatac aaagatgaga 2460 taaatcgtga agcagcttat aatcttgcta ataactatgg taaggaagaa cagttacttc 2520 aaattgctac cgttatgaaa gaccagtctt acataatgca ctattgggta caaagagaga 2580 actatgaaaa agcgttggag acattaaatg aaggcgtcag tcaagaaacc ctcattcagc 2640 atgcgactgc ccttttaact catcgaccga atgaaactgt cagcatttgg gaacgtcaaa 2700 cagatttgga cgtacatgca ttgataccat ccctcttgag ttataatcag cgttctcacg 2760 taccggtaga ggaaaatgcg gccataagat accttagata cgttactggt gttttggggt 2820 gcgtagatcc atctattcat aatacattgt tctgcattta cgcttgtcat tcgagttcta 2880 atgagtctta cttaatgaat tatattgaac agcaaggaaa tcatcctttg tacgatatgg 2940 atttaggaat tcgattatgc ttgcaattta actgtcggag aagtgcagta aaaattctgg 3000 tgcttatgaa gctgtatagc caaggagtgg aattagcctt agaagcagat gattgtgaat 3060 tagctgctac gatagcaaat attccagagg aagatgttgt gttgaaaaaa 
actttgtggc 3120 agactattgc aaaatatatg ttttccaaaa aaagtggaat taaagaaacc ttgagatttt 3180 tagagaattc cgaggtttta caacttcccg agcttattcg cctattgcca gaagatataa 3240 aactggacga tttaagtgat aatgtgtgtg atgaactgga tcattgcatg aaaagaattg 3300 aacagctgga cttcgaaatc ggacaagcat cagaagttgc gcatgaaata caaactaacg 3360 ctgaaaatat gcgtaatagg tacattgttt tggagcccaa tgaatcatgc tggcactgca 3420 accaacctct tttttccgaa ccatttgttt tatttccttg tcagcatgct tttcatcgca 3480 gttgtatgct agaaaaaact tacaaactgg catcagaaaa aaacattttg aaagaatgtc 3540 agttgtgtgg tccttcttat gctgtaagac ttattaatga acctttttcg acggattttt 3600 gagtgccatt taaattacca ttattatgtg ttttcgagtc gtttattgct tgcaccgaac 3660 agccttaacg accttgactt aaacagtttt agactcaatg aaatttatac taatgactag 3720 ttttttcact catattgtct ttgacatcga tctccctagt cgttaccacc cctgaaaaat 3780 ttgatatttc taatataata ttagcattga aattggaagc tacaacttct tatatctaga 3840 atttttaatt gtttttaaag tgctattcgt tttcgaatca cgactaaaag tcataattct 3900 tttctcatat atattcttaa accggttata ttgttttttg tgctaatgag tagaaatagc 3960 tcacactcca cacggtaagc gtgcgatttt aaaataaaac caatattttt gaataaaacc 4020 cttggctttc ctttggtttg tctgcaaacg taaggcgagc tctcatattc gcttcaggtg 4080 accggatctt ccttcgtgaa ccatatgccg cattagcgta tccagatcgt atttcagtcc 4140 cctttgaact aattttgaaa aatttattaa atcaaaattg tacggatatt aaattagtag 4200 cttctcgatg attttcatga ttctaggacg atcaaaggag ttcattctga aattaccgat 4260 atggacgcaa ataataactt atatcgccat tctggtgtat gcactttcct tctttggaat 4320 atccacgggg gttctgtctc ttagctggat aggattgttg caaaaaaggc agcgtaagtt 4380 aaaattcctt ggtactttaa ttaataacgg agagagtagt gtatgaaatt ataacttatg 4440 taactcttca tctttcaatg ttgcatattg tatttaactt tgtatccctg ttgccagcaa 4500 tgtctcaatt tgaaaagaag caaggcactt tggcgtgcat tttggtgacg gtcattcctt 4560 acactctttt cccgggtatt atgcatttaa ttgtctacca ttttttcttg agaaaagatt 4620 acgtttctat tgctggactt agtggatggg cttttgcttt tatctctgct tcctgtgttc 4680 actctccaca acgattgtaa gcatttctta taattatggt tgcagatgca tacttccaca 4740 cgtcttcttt cctcacaatg ttaagaaaca acctggtttg tatgattttt tgcaaacgtc 
4800 gaacctctct tgtcttttaa atgatggcgg tttattaact tattttttag aatcagcttt 4860 tttaatcttt tcagcattcc tgcatattgt tttcctatta tttacctaat tatgactacc 4920 atattggttc caaaagctag ctttatcggg catgcatccg gagctgttat ggggtattgt 4980 actcctttca tgcttggttc tatcccctta aaaagctggg ctcaaaatgt ggacccaatc 5040 ttccagtctt gggtaaaaaa ctatcactct tttgatcaac tgtcgcatgc tcaactcccc 5100 attgcggaac ctctgtcgac cttttcttct ttccccggaa aaggaacccg tcttggggga 5160 tgaactttct gtcttttact tcttttctgt tcttcttcgt atgatgatat cctaaatgtt 5220 tgttttcttt attgcaaagg gtggaaaagc tcctctcaat gaagagatct tctatctttt 5280 ggtaattctt attcagtatt ttctagtcag tcgtttacag gtttcaatca taaagattgt 5340 ttcattcgct atatttgttc tttacttgca cggagttagg gacaattaga aagtatttta 5400 atgtttaata attctttatg attaacttgt tttaatcacc actgtcctac agcggcagtg 5460 taaaggacaa aagctgtaca atcaaaacac cttgaaattg taatttccgt tattctttgg 5520 aaactagcgt tttacgttgt actctcaaat cataaaacat taaactgtaa agacacagtg 5580 aaacttatac tattaacaat ggaagaatgg gaaaattaaa ataatagcag aaaaagtgaa 5640 aaccattgct aatcggttat caagtatcag atgtttttat taaaaccctt catacaagtt 5700 ggaataaccg gtttttataa ataagagaaa caattccttt cggataaaga aaaataaaac 5760 ttgaaaagta tttccctaga tgaaaatctt aattaagtgt tgtgggctag ttaatttgct 5820 tgcctatgtc catcatcctt taagcaaata ctgttttatc gacaagtaca agagatcgag 5880 taactcgctg tattgtttcg agagcagcgc gaacgatgct ttgatcagaa gggggaatgt 5940 tttcaacaga agagggaact gaggaagtgt taggtaaagg agcatttacg gagattacga 6000 tatctgtttg aaagtcaatc aaagttatca cattcacgaa gattgccaca ttggtagcgg 6060 attcaggatt tcgctttcct ttttcaagga ccctctgaca accttgtaca gaagatccac 6120 tagccttttc gcttctcatg ccctcaaagt cgtcttcacc aagacttttg tctctccaaa 6180 ttactctttg agatgcatca ttatcgaatg caatgctatt gaaatggtaa gctgccacac 6240 ttccatcaaa tggcttctca attttttcca aaagctcaat gatcactgtt aggttttcct 6300 tggaatcttg caaaaacacc tcttggttgt caggaatctg acgcaaaacg ctagcatcga 6360 gaaatttagg agggaaatca gcacaaagag ccccaccgaa tagctgtacc attactataa 6420 aaatataaaa acctaataga tttgtttgtc tcaagaacca gattggtatt cgaaccacaa 6480 
aaggattctt tcctattttc actccatttc attcgactgc ggtaacaagt tagtgcgtgt 6540 tattttatgt ttcattatta tatttatcgc tagtaaaacg ctttaaatga gcctagattc 6600 aattttcaat atttacgcac tgcacggtaa ctactaaact gatttaaaat taaaataaaa 6660 agaaattcaa attgatgatt tttaatgaaa atgatatgtt aaaaataaaa ttttgtacac 6720 atatatagtt tctaatatcg ttttgacaac tagtatatca aatggatgaa aggacgctcg 6780 taatcttgta atcatgagtc cattcagata tccataacta atacaaggat agggcgtctg 6840 ataactttta cgattaagtt gattagttgt tcagccttcc acttgtgtga tacaaaagag 6900 aaatggaatt gcacactaga gtggttattt caaactttct attgatctga aaaacctaaa 6960 ttaaacgttc actagataat aagtattcaa ccaaaagatc 7000 //