ID SPBC418 standard; DNA; FUN; 4459 BP. XX AC AL096876; XX SV AL096876.1 XX DT 22-JUL-1999 (Rel. 60, Created) DT 22-JUL-1999 (Rel. 60, Last updated, Version 1) XX DE S.pombe chromosome II cosmid c418. XX KW glutamine amidotransferase/cyclase; n-terminal acetyltransferase. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-4459 RA Wood V., Rajandream M.A., Barrell B.G., Brown S., Harris D.; RT ; RL Submitted (25-JUN-1999) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk XX DR GOA; O94303; O94303. DR GOA; Q9Y7X2; Q9Y7X2. DR SPTREMBL; O94635; O94635. DR SPTREMBL; Q9Y7X2; Q9Y7X2. DR SWISS-PROT; O94303; HIS5_SCHPO. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC Cosmid c418 is overlapped at the 5' end by cosmid c887, EMBL CC entry SPBC887, accession number AL033388 and at the 3' end by CC cosmid c16D10, EMBL entry SPBC16D10, accession number AL035637. XX FH Key Location/Qualifiers FH FT source 1..4459 FT /chromosome="II" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /clone="cosmid c418" FT /map="IIR" FT misc_feature 1..194 FT /note="nominal overlap with SPBC887 S. pombe chromosome 2" FT CDS join(complement(161..280),complement(1..54)) FT /db_xref="GOA:O94303" FT /db_xref="SWISS-PROT:O94303" FT /label=SPBC418.01c FT /note="SPBC418.01c, len:57, SIMILARITY:Arabidopsis FT thaliana, CAB36536, glutamine amidotransferase/cyclase., FT (592 aa), fasta scores: opt: 194, E():8.1e-09, (51.8% FT identity in 56 aa)" FT /partial FT /gene="SPBC418.01c" FT /gene="SPBC887.20c" FT /product="glutamine amidotransferase/cyclase" FT /protein_id="CAB51352.1" FT /translation="MIVSIVDYGSGNVRSLINAVRYLGFETQWIRNPHDIEKAECLIFP FT GVGNFGFVCDSLA" FT misc_feature complement(55..71) FT /note="ctaatctatactttcag, splice branch and acceptor" FT misc_feature complement(155..160) FT /note="gtatgt, splice donor sequence" FT CDS 1713..3800 FT /db_xref="GOA:Q9Y7X2" FT /db_xref="SPTREMBL:Q9Y7X2" FT /label=SPBC418.02 FT /note="SPBC418.03c, len:175, SIMILARITY:Schizosaccharomyces FT pombe, O74985, n-terminal acetyltransferase 1, (729 aa), FT fasta scores: opt: 356, E():2e-16, (30.3% identity in 696 FT aa)" FT /gene="SPBC418.02" FT /product="n-terminal acetyltransferase" FT /protein_id="CAB51353.1" FT /translation="MSKLSEKEAFLFDRSIDQFEKGQYSKSLKTIQSVLKKKPKHPDSV FT ALLGLNLCKLHDSRSALLKCGYASSIDPKSQFCWHALAIVYRETKDYNNSLKCYQNALA FT ISPNNESLWYDAAYLQAQLGLYQPLFDNWNRLLQLDSSNLEYRLCFTLSAFLSGNYKES FT LEQIQYLISSCNLSPLVVSRLISFLPRICEHIENGSQTVLEILLMNQNSFLNNFNFEHI FT KADFAFRQKNYEESIYLYARLLIKFPNRLDYSEKYLNSLWNFYKSGGLALDLLLKRTDS FT LIKTFSEILQTGISVLIFLLSKNLDYDFCLNHLISYSMHHFIPSFISLLKIPLKTNDAF FT SKKLITMLSNFREGDSAKNIPTHKLWCTYCLCLAHYKLGDYEESNYWLNLAIDHTPTYP FT ELFLAKAKIFLCMGEIEEALCSFKRSVELDKSDRALASKYAKYLIRMDRNEEAYIVLSK FT FSRFRFGGVCNYLAETECVWFLVEDGESLLRQKLYGLALKRFHSIYQIYKKWSFLKFDY FT FTQCAEDGEFQEYVELVEWSDNLWSSTDYLRATLGALTIYLLLFESKFNMYGNKAEEIS FT HMSEVEQIAYAREDNKKIMKLQKIEEDKIKSYIPSESEEPLVIDEDYFGHKLLITDDPL FT TEAMRFLQPICWHKIKGWGFLKILSSKLYKLKGIIFCYYALTNNIEGLHQKANSLETSV FT L" FT CDS complement(3932..4459) FT /codon_start=1 FT /db_xref="SPTREMBL:O94635" FT /label=SPBC418.03c FT /note="SPBC418.03c, " FT /partial FT /product="hypothetical protein" FT /gene="SPBC418.03c" FT /gene="SPBC16D10.01c" FT /protein_id="CAB51354.1" FT /translation="AEALQTLASMRISQQKIEEAKDALSKCLQSISRAATEDSVDLPTY FT AVRTSIVRLLIEVEMHEEAHQLLVYLQKEDDQILDIWYLLGWNCYVEAQNLQEQGNGSE FT EEIKELLMNAKFYFISALGVYQKIGWDDEGIKSHIQELLEILNGLGVPNMDEENEEAEW FT ETSENEEEMDED" FT misc_feature 4231..4459 FT /note="nominal ovrlap with SPBC16D10 S. pombe chromosome 2" XX SQ Sequence 4459 BP; 1448 A; 750 C; 721 G; 1540 T; 0 other; tgctaaggaa tcgcatacga aaccaaagtt tccaacacca ggaaaaatca agcactgaaa 60 gtatagatta gtatttgatg aaaacgaaaa aaaaaaccga tttgacttag cctttataat 120 gtagatggta acaacatttt taaaaacaat aatgacatac ttcagctttt tcaatatcat 180 ggggattcct gatccattgt gtttcaaacc ctaaatacct aacggcattg attaaggatc 240 gaacgtttcc actaccataa tcaactattg aaacaatcat aatatgagta atgcaaaatg 300 taacttcaaa tttgcttata atgtatactc atggcacacc gtcgattgca ttacgatagc 360 gtattgcgat caacttcatt atcttacaga gtctgtagca cataaactac tacaaaactc 420 attgatgcgt ccaaagtatt ttttttgttc aatgtcagac aatatacgag tgatgcaatg 480 cctaaagggg taaggtattt tataaagctt tatcaaagta atgtaactat attttcatca 540 atcaacgtat tttttaaaac aactttatag acgttccgtt atttgtcaat cttgaattaa 600 gatgctagca ttatttggta ttgctaccgc tagtataatt cattgttatt tgaaggtttt 660 gctactttga gtaatgcagt gtatgccaac aatagtagat ttatttgtat gatcttttgc 720 agaaaataga agtaaattag agcaaaagac atttattatc ttttcgttga tattctttat 780 gtcttaaatt gttaattctt gattagccat aatgagtaaa ggaaatacaa cttaacatgg 840 ctaatactga gacttagtat ttaagaagca caatcaaaca gtgatgaatg tgatttaacc 900 gacataattc ttgtaaattt tattcgatat ttgattgaat cccttacgtt catactatca 960 agaaataatc ttttgtctat ttagaatact aacgattttt cccttacggc aaaacccctg 1020 gaatgcaaat gtgaaaagct ttggattttc tatgataaaa cgattaaaga attatagtta 1080 ggttccgtta ccgtttctat tcctttgctc aatccttaaa tattttagga aattaaaatt 1140 tttcagccaa tcttcttttt ttttctaact aatgcgaaac tacaaagtcg ttagtatctt 1200 ttgcaaaatt attgtttttt ataataaaat taagacactc ttccaaaagg tattggaata 1260 taaattggag atagggtaga catttgttaa taattataca atttagttgt cgcttagaac 1320 caggaaaaaa ccttcagctt tcactcgact aagttgatga ggaacactta aaaataaaat 1380 cagtgatact tttacttggg tatttatgag taactatata agtttagtac caatttattg 1440 atcttttttt ggttatttat gcgtaaatca cttttcttgg ctcagccgat cgttcgttga 1500 tgcgttggta aaatatattt tcttctaaaa tattcaggtt caggttactt gtttgtaaca 1560 tttttacata tttttggagt aattatattt tcagatcaaa ttgttatacg cactgaagta 1620 tttactgtta caattcctat agggcgcatg caatattata aacgtgatta ttgacttagc 1680 agcagggtgt tcaattaaag tcttatcctt cgatgagtaa attgtccgaa aaagaagctt 1740 ttctttttga tcgttctata gatcaatttg aaaagggaca atattccaaa agtctcaaaa 1800 ctatccagtc agtattaaaa aaaaagccta aacatcccga ctccgtagct ttgctaggtt 1860 taaatctttg caaactgcat gatagtcgta gtgcactttt gaaatgtggg tatgcttctt 1920 ctatcgatcc taaatctcaa ttttgttggc atgctttagc tattgtttac agagaaacca 1980 aagattacaa caactcactt aaatgctatc agaatgcatt agcaatatct cctaacaatg 2040 aatctttgtg gtacgatgcc gcgtatttgc aagctcaact tggtttgtat caaccccttt 2100 ttgataattg gaatcgacta ttgcaactgg attcaagtaa tttagaatat cgattatgtt 2160 ttacgttgtc tgcttttctt tctgggaatt ataaagagtc gttggaacaa attcaatatt 2220 taatcagttc atgtaattta tcccctttag ttgtttcgag gcttatttca tttttaccta 2280 gaatatgcga gcatatcgaa aacggtagtc aaaccgtgtt agaaatttta ttaatgaatc 2340 aaaattcatt cttaaataac ttcaactttg agcatattaa ggctgacttt gcatttcgcc 2400 agaaaaacta cgaagaatcg atatacctat acgctcgttt gctcatcaag tttccaaatc 2460 gcttagatta ttcagagaaa tatttgaata gtttatggaa tttttacaaa agtggaggtt 2520 tagctcttga tctcttgcta aagagaactg attccttaat taaaacattt tcagagattc 2580 ttcagactgg tatatctgtt ttaattttct tgctctccaa aaatctggac tacgattttt 2640 gtttaaatca cttgatctcg tattccatgc atcattttat tccttccttc atcagtttac 2700 ttaaaatccc acttaaaaca aatgatgcgt tctctaagaa gttaattact atgttgagta 2760 attttagaga aggcgattct gctaaaaata taccaacaca taaactatgg tgtacctatt 2820 gcttatgcct cgcacactat aagctaggtg attatgaaga atcgaattat tggctcaatt 2880 tagcaattga tcatacaccg acttatccag aacttttttt ggcaaaagca aaaattttct 2940 tgtgcatggg tgaaattgaa gaggcactgt gctcatttaa acggtcagtg gaacttgata 3000 aaagtgaccg tgctctagct tctaaatatg caaaatattt aatacgtatg gatagaaatg 3060 aggaagctta tattgtcttg tctaagttct cacgattcag atttggcggg gtatgcaatt 3120 atttagcaga aactgaatgt gtttggtttt tagttgaaga tggtgagtcc ctattacggc 3180 agaaattata tgggctagct ttgaaaaggt tccattctat ttatcaaatt tacaaaaaat 3240 ggtcattttt gaaatttgat tatttcacac aatgtgcaga ggacggtgaa tttcaagaat 3300 acgttgaatt agtagagtgg tccgataacc tgtggtcttc aacagattat ttgagagcta 3360 ctcttggtgc tcttaccata tacctattgt tgtttgaatc caagtttaat atgtacggaa 3420 ataaagctga agagatatcc catatgtcgg aagttgaaca aattgcgtac gctagagagg 3480 acaataagaa aatcatgaaa ttgcaaaaaa tagaggagga taagattaaa agttacatcc 3540 cttctgaaag tgaagagccg ttggttattg atgaagacta ctttgggcat aagctgctaa 3600 tcactgatga tccattaaca gaagctatga ggttcttaca gccaatttgc tggcataaaa 3660 tcaaaggctg gggattctta aaaattttat cttctaaact ttataaactt aaaggtataa 3720 tattctgtta ttacgcacta actaacaaca ttgaaggatt acaccaaaaa gctaatagtc 3780 tagaaacttc agtattgtaa gaaccccttt tcggatacac agttatgaag gatgtgatgt 3840 cgacggacta tatttatatt ttgataacct ataataccca aaaattaaaa ccctaaaagt 3900 gacaacttat tcaaggaaac cacacaaaat tctaatcttc atccatttcc tcttcgtttt 3960 cagaagtttc ccactcagct tcttcgtttt cttcgtccat attcggtaca ccaaggccat 4020 tcaaaatttc gagtagttcc tgaatatgac tcttgatgcc ctcatcatcc catcctattt 4080 tttgatatac acctaaggcc gaaatgaaat aaaactttgc attcatgaga agctctttta 4140 tttcttcctc agacccatta ccctgttctt gtaggttttg agcttctaca taacagttcc 4200 agcccaataa gtaccatata tccaaaattt gatcatcctc tttctgaagg tataccaaaa 4260 gttgatgagc ctcttcatgc atttcaactt caattaatag cctcacaatg gatgttcgaa 4320 ctgcataggt agggagatca acactatcct ccgtagccgc cctagaaata gattgcaaac 4380 atttagataa agcatctttg gcttcctcta ttttttgttg actaatacgc atagatgcta 4440 gtgtctgtaa agcttcagc 4459 //