ID   SPBC1306   standard; DNA; FUN; 4495 BP.
XX
AC   AL133303;
XX
SV   AL133303.1
XX
DT   01-DEC-1999 (Rel. 61, Created)
DT   01-DEC-1999 (Rel. 61, Last updated, Version 1)
XX
DE   S.pombe chromosome II cosmid c1306.
XX
KW   elongation factor G, mitochondrial 1 precursor (EF-G);
KW   Elongation factor Tu family; G-beta repeat; GTP_EFTU; WD domain;
KW   WD-repeat protein; WD40.
XX
OS   Schizosaccharomyces pombe (fission yeast)
OC   Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes;
OC   Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces.
XX
RN   [1]
RP   1-4495
RA   Brown S., Harris D., McDougall R.C., Rajandream M.A., Barrell B.G.;
RT   ;
RL   Submitted (30-NOV-1999) to the EMBL/GenBank/DDBJ databases.
RL   European Schizosaccharomyces genome sequencing project, Sanger Centre, The
RL   Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail:
RL   barrell@sanger.ac.uk
XX
DR   GOA; Q9USZ1; Q9USZ1.
DR   SPTREMBL; Q9USZ0; Q9USZ0.
DR   SWISS-PROT; Q9USZ1; EFG1_SCHPO.
XX
CC   Notes:
CC   Details of yeast sequencing at the Sanger Centre are available on
CC   the World Wide Web.
CC   (URL, http://www.sanger.ac.uk/Projects/S_pombe/)
CC   During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced
CC   by the Sanger Centre. The sequencing of the S. pombe genome is now
CC   being continued with funding from The European Commission.
CC   Fourteen European sequencing laboratories, including the Sanger Centre,
CC   are participating in the project.
CC   Protein coding regions (CDS) have been predicted with the help
CC   of computer analysis using the Genefinder program in PomBase
CC   (an ACEDB database) with additional predictions for the
CC   branch-acceptor sites supplied by the program Sp3splice.
CC   CAUTION: It is possible that for any individual CDS we may have
CC   underestimated or overestimated the number of introns/exons or
CC   we may not have chosen the correct splice donor/acceptor sites.
CC   CDS are numbered using the following system eg SPBC25H2.01c.
CC   SP (S. pombe), B (chromosome 2), c25H2 (cosmid name),
CC   .01 (first CDS), c (complementary strand).
CC   The more significant matches with motifs in the PROSITE
CC   database are also included but some of these may be fortuitous.
CC   The length in codons is given for each CDS.
CC   IMPORTANT: This sequence MAY NOT be the entire insert of
CC   the sequenced clone. It may be shorter because we only
CC   sequence overlapping sections once, or longer, because we
CC   arrange for a small overlap between neighbouring submissions.
CC   Cosmid c1306 is overlapped at the 5' end by cosmid c409, EMBL entry
CC   SPBC409, accession number AL109822, and at the 3' end by cosmid c4,
CC   EMBL entry SPBC4, accession number AL121863.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4495
FT                   /chromosome="II"
FT                   /db_xref="taxon:4896"
FT                   /organism="Schizosaccharomyces pombe"
FT                   /strain="972h-"
FT                   /clone="cosmid c1306"
FT                   /map="IIL"
FT   misc_feature    1..119
FT                   /note="nominal overlap with cosmid SPBC409 S. pombe
FT                   chromosome 2"
FT   CDS             complement(1..755)
FT                   /db_xref="GOA:Q9USZ1"
FT                   /db_xref="SWISS-PROT:Q9USZ1"
FT                   /label=SPBC1306.01c
FT                   /note="SPBC1306.01c, len:251, SIMILARITY:Saccharomyces
FT                   cerevisiae, EFG1_YEAST, elongation factor g 1,
FT                   mitochondrial precursor, (761 aa), fasta scores: opt: 598,
FT                   E():1.9e-30, (62.0% identity in 255 aa)"
FT                   /partial
FT                   /gene="SPBC409.22c"
FT                   /gene="SPBC1306.01c"
FT                   /product="putative elongation factor G, mitochondrial 1
FT                   precursor (EF-G)"
FT                   /protein_id="CAB62091.1"
FT                   /translation="MLKLSFRSLTSRLPRLSTLVVRGYASVANTGIEASNTSENNLNIQ
FT                   EQLNDNDKKRLKQIRNIGISAHIDSGKTTFTERVLYYTGRIKDIHEVRGKDNVGAKMDF
FT                   MELEREKGITIQSAATHCTWERTVDQIEANEKQKTDFEKSYNINIIDTPGHIDFTIEVE
FT                   RALRVLDGAVLVLCAVSGVQSQTITVDRQMRRYNVPRISFVNKMDRMGADPWKVIQQIN
FT                   TKLKIPAAAVQIPIGQEDKLEGVVDLIQMR"
FT   misc_feature    complement(3..587)
FT                   /note="Pfam match to entry PF00009 GTP_EFTU, Elongation
FT                   factor Tu family,"
FT   intron          complement(736..943)
FT                   /note="Intron predicted by HMM_intron, Score=10.50"
FT   CDS             join(1407..2452,2506..3154,3208..4467)
FT                   /db_xref="SPTREMBL:Q9USZ0"
FT                   /label=SPBC1306.02
FT                   /note="SPBC1306.02, len:984, LOW SIMILARITY:Saccharomyces
FT                   cerevisiae, Q08924, chromosome xvi reading frame orf
FT                   ypl183c., (1013 aa), fasta scores: opt: 423, E():7.2e-21,
FT                   (25.8% identity in 1049 aa)"
FT                   /gene="SPBC1306.02"
FT                   /gene="SPBC4.08"
FT                   /product="hypothetical WD-repeat protein"
FT                   /protein_id="CAB62092.1"
FT                   /translation="MLVEGDKGVLEPVNFVGPVTSLALIHNDKFLCCAQGPCLRIYDVS
FT                   ESRHESDLIKSILLPFHNRIHGMLPCKQGLFVWGGTYFAVVDVETSQVYYDRISDWIFN
FT                   AAELGEENKFCVVTSKNQVHLLSLSKGSWEVTNTISCEKTPLLFCATVQISGNEIYIAS
FT                   ASAFQQIYVWKFNYNNPPSCVSIDTYLVGHEGACFDLKFSSDGRYLCSVSEDRTLRLWS
FT                   IESSPFLIATGFGHSARVWRCVFLPNKEIATVSEDLTLRLWTWNDKTLTNTYTYTGHRG
FT                   KHIWSLVVSSANPIIYTGGNDGSVRSWDYKTRIQESCISVSSLFTSKTLLASFCFVKCQ
FT                   KLLCLAKNGEMSIIDNKGKKAIPALTTTSDMSIIRSWKNSFFVAAASNKGTIYVFSYQN
FT                   TNLLKAIDLHRGKIQQLLTYQVKERYFLLSKSFVGKNQFRVYLQEFDFEADKVNLISTI
FT                   ELELPATFDPTSLTVDEEYSNFLIGSRTGSLALYNIGESNYLSVWRRIHEFDAISSIHI
FT                   KSSKRDYLLIQTVGRDGYVNLFSIPRVEKSAVPQELCSKKCCDGILSGGLCKKLKNGKE
FT                   NQWVWGFHASNFFLRDETNETNVFFIDCGGSHRPWAFSMEMDIQSFASYRANQLYIYST
FT                   ALNLLQRNSVLQNGLHGREIRAMDFNPSGDLLLSGSEDTKVTLLEFQSNGNIIPLNSVK
FT                   FHNSGIQSLTWFSDDILFSTGGLEELNVYRLIQKPEAYLKRIVHEKLCKPNKSNKTYDG
FT                   DLRITDISVVKATHLGERVLWINTVQSDSTIKAFTYNVDTKQLNCIKSWKYKTVCLVFA
FT                   EAVIFGSHLLLVVASTDGNVAIWNTFWRDPQIEPLIIWIKTVHQFCVKSLNISRDQDVL
FT                   TISTGGDDGAISRSILLLEQISENSVVVNSLHNYFFAEAHASSVTGVLSITKELLLSVS
FT                   IDQRILLWKVEGDSLKPILERYTHVADVGGMIRNNDFLIIYGIGCEVFRLGSLR"
FT   misc_feature    1974..2072
FT                   /note="Match to PF00400 WD40, WD domain, G-beta repeat
FT                   Score 33.63"
FT   misc_feature    2109..2195
FT                   /note="Match to PF00400 WD40, WD domain, G-beta repeat
FT                   Score 26.94"
FT   misc_feature    2223..2333
FT                   /note="Match to PF00400 WD40, WD domain, G-beta repeat
FT                   Score 22.29"
FT   misc_feature    2453..2458
FT                   /note="gtaagc, splice donor sequence"
FT   misc_feature    2488..2505
FT                   /note="ctaacccaattaatttag, splice branch and acceptor"
FT   misc_feature    3155..3160
FT                   /note="gtgagt, splice donor sequence"
FT   misc_feature    3194..3207
FT                   /note="ctaacatatgctag, splice branch and acceptor"
FT   misc_feature    3463..3546
FT                   /note="Match to PF00400 WD40, WD domain, G-beta repeat
FT                   Score 23.51"
FT   misc_feature    4395..4495
FT                   /note="nominal overlap with cosmid SPBC4 S. pombe
FT                   chromosome 2"
XX
SQ   Sequence 4495 BP; 1332 A; 765 C; 875 G; 1523 T; 0 other;
     cgcatttgta taagatcaac cacaccttca agtttgtctt cttgacctat aggaatttga        60
     acagctgcag cgggaatttt caattttgtg tttatttgct gaatgacctt ccacggatct       120
     gcacccatac gatccatttt gttaacaaaa gagatcctgg ggacattgta gcggcgcatt       180
     tgtcgatcaa ctgtgatagt ttgcgattga acaccggaga cagcacagag taccaacact       240
     gcaccatcta gcacacgtaa ggcacgctcc acctcaatgg taaaatcaat gtgaccaggt       300
     gtatcaataa tattaatgtt ataacttttc tcaaagtcag ttttttgttt ttcgttagct       360
     tcaatctgat ccacagtacg ttcccaagtg caatgggtag cagccgattg aatggtaatt       420
     cctttttccc gttcaagttc catgaaatcc atttttgctc cgacattatc ctttcctctg       480
     acttcatgaa tgtcttttat ccttcctgta tagtacaaaa ctctttcagt aaaggtagtc       540
     ttgccggaat caatatgggc agatattcca atatttcgaa tttgtttaag tcgcttttta       600
     tcgttatcat tcaactgttc ttggatattc aaattatttt ctgatgtatt agaagcttca       660
     atgcccgtat ttgctactga cgcgtaacca cggacgacta aagtagaaag tctggggagg       720
     cggcttgtca gcgacctaaa tgataatttt agcattttat attatttcaa tagaatataa       780
     aaacagccat gaataacata aacaataaat tcctagcttt gggaacacgc tgaggtggtg       840
     agagtcgtcg agcaaggtgg gaggagcaaa gtcgctttgt ttacgtgtcg cgagttaccg       900
     cggtaacata ttaaatttat ttaaacgtaa atcaaatact tacttaacgc atttttaatg       960
     aataacaaaa tatacatttt atatggagaa tatatatgta tttgcaattg aatgttgctg      1020
     attatttaaa agtagcttgc ttatcgtcaa gaaaaaagcg acttcgttct gtattcggta      1080
     tctccatgtt taaatgattc atgaggcggt ccaagcatag tacttataaa agaaatacaa      1140
     ctattttgtc atgaagtatt ttgggcattg tatcagaaag cgcgcttttc tagtatccac      1200
     aatcggtttg aacctcacca ccatcgcaat ctatttctac tactagggtt ttttgctctt      1260
     tgcagttttt taagctgagc ttattcaatc cttactgctt aatcggttaa taccattttt      1320
     tttctaattc ttctcacggt ttttatgacg gcgtattata tttagcttga agaatctgga      1380
     cttaaaaatc taagaagggt ttaagaatgc ttgttgaggg tgataaaggt gttttagagc      1440
     cagttaactt tgtgggtcct gttactagct tggcccttat tcacaatgat aagtttttat      1500
     gctgtgctca aggaccttgt ttaagaattt acgatgtttc tgaaagtcga cacgaaagtg      1560
     atttaattaa atcaatttta cttccatttc ataatagaat acatggaatg cttccttgta      1620
     aacaaggatt atttgtttgg ggaggaacct attttgcagt tgttgatgtt gaaacgagcc      1680
     aagtttacta tgaccgtata tctgactgga tatttaatgc agcggagctt ggtgaagaaa      1740
     ataaattttg cgtcgttaca tcgaaaaatc aagttcatct tctttcgtta agtaaaggtt      1800
     cgtgggaagt tacaaataca atatcttgcg agaaaacgcc tcttttgttt tgcgcaacgg      1860
     tacaaataag tggaaatgaa atttacatag cttctgcttc cgcttttcag caaatttacg      1920
     tgtggaaatt caattataac aatccacctt cttgtgtttc cattgacacc taccttgttg      1980
     gacacgaagg tgcttgcttt gatctcaagt tttcatcgga tggacgttac ctttgctcag      2040
     tttctgaaga tcgtacccta aggctttggt cgatcgagtc gtcacccttt ttaatagcga      2100
     caggctttgg tcactcagct cgtgtttggc gttgcgtgtt tctacccaac aaagaaatag      2160
     ccacagtttc tgaagatttg accttacgtc tctggacttg gaatgataaa acgctcacta      2220
     atacttacac atatactggt catcgcggga aacacatatg gtcactagtt gtctcctctg      2280
     cgaacccaat aatatatacc ggtggaaatg atggatcagt taggtcttgg gattacaaaa      2340
     ctcgtattca ggaatcctgc atctcagtat cctcattgtt cacctcaaag actttgcttg      2400
     cttctttttg ttttgttaaa tgtcaaaaat tattatgcct tgccaaaaat gggtaagcga      2460
     tttgcatgct atcaaatatt catctgtcta acccaattaa tttagagaaa tgtcaattat      2520
     agataataaa ggaaaaaaag ctattcctgc tttgactacc acatcggaca tgtctataat      2580
     tcgttcttgg aaaaactctt tttttgttgc tgctgcctcc aacaaaggta ctatatatgt      2640
     gttttcatat caaaatacga atttattaaa agctattgat ttgcataggg gaaagattca      2700
     gcaactgctt acttatcaag ttaaagaaag atattttcta ctatcaaaat catttgtagg      2760
     aaaaaaccaa tttcgagttt acttacagga atttgatttt gaagccgata aggtcaacct      2820
     aatttcaaca atagaactgg agttgccggc aacatttgat ccgacttctc ttacagttga      2880
     tgaagaatat tcaaattttt taattggttc tcgaacagga tcgctggctt tgtacaatat      2940
     tggtgaaagc aactaccttt ctgtatggag gagaattcat gaatttgatg caatttcatc      3000
     cattcacatc aagtcgtcaa aacgtgatta cctcttaatt caaacagtcg gacgtgatgg      3060
     ttatgtcaat ttattttcaa ttccgagggt agaaaaaagt gctgttcctc aggagctgtg      3120
     ttcgaaaaaa tgttgtgatg gtatactaag cggggtgagt attatcttta agaatatacg      3180
     gttcatggtt cagctaacat atgctagggt ttgtgtaaaa aattaaaaaa tggaaaggaa      3240
     aatcaatggg tttggggttt tcatgcatca aatttttttt taagggatga gacaaatgaa      3300
     actaatgttt tctttattga ttgtggaggt agtcatcgtc catgggcttt ttcaatggag      3360
     atggacattc aaagttttgc atcttataga gcaaatcaac tttatattta tagcactgct      3420
     ttaaatcttt tgcaaagaaa ttctgtatta caaaatggtt tgcatgggcg tgagattcga      3480
     gcaatggatt tcaatccttc aggggatctt ttattatcag gatctgaaga tactaaagta      3540
     accctgcttg agtttcagag taatggaaac ataattcctc tgaattctgt caagtttcat      3600
     aattcaggaa tccaatcgct gacctggttt agcgatgata ttttattttc cactggtggt      3660
     cttgaagagt taaatgttta taggctaata caaaagccag aggcatatct gaaacggatc      3720
     gttcatgaaa aactctgcaa accaaataaa agtaacaaga cttacgatgg ggatcttcga      3780
     ataactgaca tttctgttgt aaaggcaacc catctgggtg aaagagtgtt atggattaat      3840
     actgtacagt ccgattcgac tatcaaagca ttcacttata acgtagatac gaagcagcta      3900
     aattgtataa aaagttggaa atataagaca gtttgtttgg tttttgccga ggctgttatt      3960
     ttcgggtctc atttattgtt ggtggttgct tcgactgatg gtaatgttgc tatttggaac      4020
     acattttgga gggatcctca aattgagcca ttaataattt ggataaaaac agtgcatcaa      4080
     ttttgtgtta agtcattaaa tattagcagg gatcaagatg tgttaacgat ttctacgggt      4140
     ggtgatgatg gtgcaatttc tcgatctata cttttgttag aacaaataag tgaaaattca      4200
     gtagtagtca attctttaca taactacttt tttgctgaag ctcatgcttc cagtgttaca      4260
     ggtgttttat caattaccaa agagcttttg ctgagtgtat ccattgacca aaggatatta      4320
     ctatggaaag tcgagggtga ttcattgaaa ccgatcctgg aaagatatac tcacgtagct      4380
     gatgtgggtg gcatgatccg aaataatgat tttttgataa tttacggaat tggttgcgaa      4440
     gtttttagac tgggttcttt acgatgatta tttgctcaat aatgtctact actcg           4495
//