ID SPBC18A7 standard; DNA; FUN; 4999 BP. XX AC AL080287; XX SV AL080287.1 XX DT 25-JUN-1999 (Rel. 60, Created) DT 18-APR-2000 (Rel. 63, Last updated, Version 2) XX DE S.pombe chromosome II cosmid c18A7. XX KW dipeptidase; metallopeptidase family. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-4999 RA Wood V., Rajandream M.A., Barrell B.G., Moreno S.; RT ; RL Submitted (25-JUN-1999) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk and Instituto de Microbiologia Bioquimica, CSIC RL Universidad de Salamanca, Edificio Departamental, Campus Miguel de Unamuno, RL 37007 Salamanca, Spain XX DR GOA; Q9UUD8; Q9UUD8. DR GOA; Q9Y7Y9; Q9Y7Y9. DR SPTREMBL; Q9UUD8; Q9UUD8. DR SPTREMBL; Q9Y7Y9; Q9Y7Y9. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC Cosmid c18A7 overlapped at the 5' end by cosmid c4F6, CC EMBL entry SPBC4F6, accession number AL031534, CC and at the 3' end by c336. XX FH Key Location/Qualifiers FH FT source 1..4999 FT /chromosome="II" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /clone="cosmid c18A7" FT /map="IIR" FT misc_feature 1..1619 FT /note="nominal overlap with cosmid SPBC4F6, EM:AL031534 S. FT pombe chromosome 2" FT CDS 892..2247 FT /db_xref="GOA:Q9UUD8" FT /db_xref="SPTREMBL:Q9UUD8" FT /label=SPBC18A7.01 FT /note="SPBC18A7.01, len:452, SIMILARITY: Pyrococcus FT horikoshii, O58691, 56AA long hypothetical dipeptide, (356 FT aa), fasta scores, opt:546, E():2.1e-28, (33.1% identity in FT 362 aa overlap)" FT /gene="SPBC18A7.01" FT /gene="SPBC4F6.19c" FT /product="hypothetical dipeptidase; metallopeptidase FT family" FT /protein_id="CAB45933.1" FT /translation="MVSFESSFERGTDFLNRNFKKCLFACISIFIFALLALSFLSLLQP FT DTVQRLYQCAVPSMIYVPPMINEAISIQHEEFNNRRRRLSAALREDKLDALIMEPTVSM FT DYFANITTGSWGLSERPFLGIIFSDDEPYPGDVASRIYFLVPKFELPRAKELVGKNIDA FT KYITWDEDENPYQVLYDRLGPLKLMIDGTVRNFIAQGLQYAGFTTFGVSPRVASLREIK FT SPAEVDIMSRVNIATVAAIRSVQPCIKPGITEKELAEVINMLFVYGGLPVQESPIVLFG FT ERAAMPHGGPSNRRLKKSEFVLMDVGTTLFGYHSDCTRTVLPHGQKMTERMEKLWNLVY FT DAQTAGIQMLSHLSNTSCAEVDLAARKVIKDAGYGEYFIHRLGHGLGLEEHEQTYLNPA FT NKGTPVQKGNVFTVEPGIYIPDEIGIRIEDAVLASDVPILLTNFRAKSPYEP" FT misc_feature 1552..2214 FT /note="Match to PF00557 Peptidase_M24, metallopeptidase FT family" FT CDS join(complement(3779..3847),complement(3647..3725), FT complement(2370..3595)) FT /db_xref="GOA:Q9Y7Y9" FT /db_xref="SPTREMBL:Q9Y7Y9" FT /label=SPBC18A7.02c FT /note="SPBC18A7.02c, len:458, FT SIMILARITY:Schizosaccharomyces pombe, O13989, putative FT mannosyltransferase c26h5.07c, (424 aa), fasta scores: opt: FT 363, E():2.1e-16, (24.3% identity in 350 aa), also FT SIMILARITY:Saccharomyces cerevisiae, PTM1_YEAST, protein FT PTM1 precursor, (531 aa), fasta scores: opt: 360, FT E():4e-16, (23.3% identity in 460 aa overlap)" FT /gene="SPBC18A7.02c" FT /product="putative mannosyltransferase" FT /protein_id="CAB45934.1" FT /translation="MKLLISLLWSIFFSIVYSEKTLLNFKHYELCNGIYSKSESGGSLN FT PAIYVNWTEPWGQEDEVEVLIFNWKEIRKLGAFRSDDQFTYICDYDAVYTDHLCEADQL FT GLYLWNSTSAKSIRSYIIPTNADPQNIIYEISQSGYYCIWSHSKKMLPYQALVNWQNAY FT GGLPASQFPRMPISGGITIAYSVILALWMFFRFQYKHSIVTVQKAIMFLLIFSCAQQAV FT TSIVLDTENLRNRGNFTWLGETLVSILFACQLVLDLALLLILSWGYTRYSTNMRDRLFT FT EAKIPLIICFFALFVVRFFAITIQSIHLGLWFCFFFLTACISALYILFGAFVALPSTLR FT ALVEQRYYTLHSIYKIFRIMVLCGVVTIFSFSLVALIFCSNTNNNSTNKLWKIRWYFLD FT GWIDGVHLTYLITLSSLWRPSQENPDLDPTGLSYPVLDPRLEEELDLLEEDIRADKSK" FT misc_feature complement(3596..3608) FT /note="cttacgacgttag, splice branch and acceptor" FT misc_feature complement(3641..3646) FT /note="gtatgt, splice donor sequence" FT misc_feature complement(3726..3746) FT /note="ctaactgcggcttcttgctag, splice branch and acceptor" FT misc_feature complement(3773..3778) FT /note="gtaaaa, splice donor sequence" FT misc_feature complement(4165..4999) FT /note="nominal overlap with cosmid SPBC336, EM:AL121815 S. FT pombe chromosome 2" FT misc_feature 4930..4999 FT /note="SPBC336.01, Complete CDS in c336" XX SQ Sequence 4999 BP; 1717 A; 819 C; 900 G; 1563 T; 0 other; atgaccgtga ctgaaaagaa cctttgaatt ttttttaaaa tttttcactt tttttattat 60 cttatatact atgggctatc aattatcgcg tcaattatat aggtaactcg ttacatttgt 120 tttgtaaatc acagtaccta aaagatcata aaccttttaa aaatgaagga ttaaacactc 180 ttatgctgca ggaactcaaa attgaaggat gaattaaccg aatgcgaatc aacaagttca 240 ctttcatagc ttattttgag taacaaggtg attttgaggt gtaagatgaa accttcattt 300 ctaattctat gacactgtga aaagtattca tgaatgtttt taatatttaa aaaagtggtc 360 aatttatagc tcttaaacaa ggcagtaaac tataaaataa taatagaaac ataaccatca 420 agagcatatg aaaccatgcg agatcaattg caaaaatgaa gcttcttttc aaatgtaaag 480 ctctttttaa gttaacttaa aatgagtacg tgaaacacta gggcttactg aaatttttgg 540 aattgaacta tacagcgact actttattat tgttgattaa cattcctttg aggtatttat 600 atttgacttg aagaccaact attaaagagt aagattttga ttgaaatagg acaggggaat 660 ttttcatttg ttaggaaact aattgataca ccaaattttc aaatctcata taggtttttt 720 ttataaaact aataaacgaa ataaagcttt ttaatttaat aaaaatatat acaaaagcga 780 cgatattgta gcatcttcgg ttaactcaat gctgtaatct gatagtcagt aatatctgac 840 atactatata gttcagcacc acctctcttc aatcagcaga attgctcaaa aatggtttct 900 tttgaatcaa gttttgaacg aggtaccgat tttttgaata gaaattttaa gaaatgtctc 960 ttcgcatgca tttccatctt catttttgcg cttttagcat tatcattcct atcactgtta 1020 caaccagata cagtgcaaag gctttaccaa tgcgctgttc cctcaatgat ttatgttcca 1080 ccgatgatta atgaagcaat ttcaatacaa catgaagaat tcaacaatcg taggcggaga 1140 ctttcggctg cacttcgtga agataagttg gatgctctaa ttatggagcc tacagtttca 1200 atggattatt ttgcaaatat cactactggc tcctggggtc tttctgagcg accttttttg 1260 ggaataatat tttcagacga tgaaccctat ccaggagacg tggcatcaag aatatatttt 1320 ctagtaccta agtttgagct accacgtgct aaagaattgg ttggcaaaaa cattgatgcg 1380 aaatacatca cttgggatga agacgaaaac ccttatcaag ttttatatga ccggttagga 1440 cctttgaagt taatgataga tggtactgtg cgaaatttca ttgcacaagg tttacagtat 1500 gctggattta ccacatttgg ggtgtccccc cgtgttgcga gtttacggga aataaaaagt 1560 ccagctgagg tagatattat gtcaagggtc aatattgcaa ctgtggcagc tatacgatct 1620 gtccaaccgt gtattaaacc tggtattacg gagaaagagt tggcggaagt gattaacatg 1680 ctttttgttt acggtggatt gcctgtgcaa gagtcaccga tagttttgtt cggtgaaaga 1740 gcagcaatgc cacatggagg tccatcgaat cgtcgattga aaaaaagcga atttgtcttg 1800 atggatgttg gtactacttt gtttggctat catagcgatt gcacccgaac tgtacttccc 1860 catggccaaa agatgactga gaggatggag aagctttgga atttggttta cgatgcccag 1920 acggccggca tccagatgct ttcccattta tcgaacacaa gttgtgcaga agttgaccta 1980 gcagccagaa aagtaattaa agatgctggt tatggcgagt attttataca cagattggga 2040 cacgggttag gattagaaga gcatgaacaa acttacctta atcctgccaa taaaggtacg 2100 cctgtacaaa agggaaatgt gtttactgtt gagcctggaa tttacatacc tgacgaaatt 2160 ggaattcgta ttgaagatgc tgttttagca tctgatgttc ctattctatt aacaaacttt 2220 agggcaaaaa gtccctacga gccttaatat ttatgtgatt ttttgtgatt gtaacctgaa 2280 ttggatcaaa taatttaatt catactatta acattagtag taattgaaat aaaatcaact 2340 catcaacgaa agaaattgaa cataaaaaac tattttgatt tatccgccct gatatcttcc 2400 tctaataaat ctaactcttc ttccaaacga ggatctaaaa ccggataact aaggcctgta 2460 ggatctaaat cggggttttc ctgagatggt cgccataaag aagataatgt aattaagtaa 2520 gtcaaatgca caccatcaat ccatccatct aagaaatacc agcgtatttt ccagagtttg 2580 tttgttgagt tattattagt gtttgagcaa aatatcaatg caactagaga aaatgagaaa 2640 atggtaacaa caccacacaa taccattatt cggaaaattt tatagatgct atgcaacgta 2700 tagtaccgtt gttcaaccaa cgcgcgaaga gttgaaggta acgcaacaaa ggcaccaaac 2760 aagatatata gcgctgaaat acaagcggtg aggaagaaaa agcaaaacca aaggccgagg 2820 tgaatagatt gaatggtaat tgcaaagaag cgaactacaa aaagagcgaa gaaacaaatt 2880 attaaaggaa tttttgcttc agtaaatagt cgatcacgca tgttagttga ataacgggta 2940 tatccccaag agagtattaa taataacgca aggtccaaga ctaactgaca cgcaaacaaa 3000 attgacacca gagtctctcc taaccatgtg aagttccccc gatttctcaa attttctgta 3060 tccaacacaa tagaagtgac agcctgttga gcacacgaaa agattagcaa gaacataatt 3120 gccttttgaa cggtaacaat cgaatgttta tattgaaatc ggaaaaacat ccaaagtgca 3180 agaattacac tataagcgat tgtgattcca ccagatatag gcattctagg gaattgagaa 3240 gcgggaagcc caccatacgc attttgccaa ttgacaagag cttgataagg taacattttt 3300 ttcgaatgag accaaatgca ataataacct gactgagaga tttcatatat aatgttttga 3360 gggtcagcat ttgtcggaat tatataagaa cgaatggatt ttgctgacgt tgagttccac 3420 aaatatagtc ccagttgatc ggcttcacat aggtgatccg tgtataccgc atcgtaatcg 3480 caaatgtaag tgaattgatc atcagaacgg aatgcaccta acttgcggat ttctttccaa 3540 ttgaatatca aaacctcaac ttcatcttct tgaccccatg gttcagtcca attcactaac 3600 gtcgtaagtt ttttaaataa aagcgtaaaa aaagatatgt acataccata aattgctgga 3660 tttaaagacc ctccagattc cgatttcgag taaatgccat tgcacaattc ataatgcttg 3720 aaattctagc aagaagccgc agttagcaaa tgatacgcaa gaaatctgat tgttttacca 3780 gtaatgtttt ttctgaataa actatggaaa agaatatact ccatagaaga gatatcagta 3840 atttcatggt ttatgatgag agcattagat tttttatact tcgaaggact gggaacaaaa 3900 attaaatacc aatgttatgt aaatcaatgt acaaaatatt ttttcccaaa accgattcac 3960 ggattatctg ccaaagttga gattcagaag ttggtggagg ttattgacta cttaataaat 4020 agtaatttat ttatagttag gcgattagca aggttcaaaa aatagctgag aaaggtgttg 4080 attcgagcct ggacatcaac attttaatta attttttctt tctataaaca attttgcttg 4140 acttgctgaa tttcgctaag gatcaacatc attgactact ggtgaattct ttctttgtta 4200 tcggcccaga aattaccaac cgtaaaatgt attttccgca attaaagaaa attgtgagga 4260 tttatccttc tattcggttt catttaataa attgagagca ttgatgaata aatagactag 4320 gtgtagaaaa aaatttagtt gacgaagccg aatttaatac cgcatgttgt ctggtctaaa 4380 cttctatttt tggtgaaacc ggtttccatt aaaatatcaa caaaagtgat attgctaaac 4440 gtggcattta tagtgtctaa agaaagtata ataaaaacca accaagaggt cgttgatgag 4500 aaataattat ataaataatt agttaaatca tgagaataag aatgatacat caattaataa 4560 aaaattacaa tttacaaaat aaagtagagg attttttcaa cacaaaaagt aatagagtct 4620 ttttgagtta attatttatt caatgaaact tacttttaca taagaaatta tgggaattct 4680 ttcaatttgc agatggttaa agtatttctg taacgacagc atgcttcgga gagaaataat 4740 tatataggtt aaggaatgta ttttatttta aaagattgga tttattttaa tgttcgttgc 4800 tcattcataa actagttctg gtatttttac tactcttaaa tttgcacata cccaaccaac 4860 aagcaattct tctaatgcaa tttgttcttt cttttcagtc aagcacccag tcagcaaatt 4920 cttagttata tgagtgctca acatttacat agctgcaaat tttatagact gccacttgaa 4980 attatcccat tgatctgca 4999 //