ID SPAC1527 standard; DNA; FUN; 7813 BP.
XX
AC AL355653;
XX
SV AL355653.1
XX
DT 08-MAY-2000 (Rel. 63, Created)
DT 08-MAY-2000 (Rel. 63, Last updated, Version 1)
XX
DE S.pombe chromosome I cosmid c1527.
XX
KW cell wall alpha-glucan synthase; confirmed intron; ER to Golgi transport;
KW membrane protein; mok11; vesicular transport; yeast SRO9.
XX
OS Schizosaccharomyces pombe (fission yeast)
OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes;
OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces.
XX
RN [1]
RP 1-7813
RA Seeger K., Harris D., Wood V., Rajandream M.A., Barrell B.G.;
RT ;
RL Submitted (08-MAY-2000) to the EMBL/GenBank/DDBJ databases.
RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The
RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail:
RL barrell@sanger.ac.uk
XX
DR GOA; Q09854; Q09854.
DR SPTREMBL; Q9P6K0; Q9P6K0.
DR SPTREMBL; Q9P6K1; Q9P6K1.
DR SWISS-PROT; Q09854; MOKB_SCHPO.
XX
CC Notes:
CC Details of yeast sequencing at the Sanger Centre are available on
CC the World Wide Web.
CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/)
CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced
CC by the Sanger Centre. The sequencing of the S. pombe genome is now
CC being continued with funding from The European Commission.
CC Fourteen European sequencing laboratories, including the Sanger Centre,
CC are participating in the project.
CC Protein coding regions (CDS) have been predicted with the help
CC of computer analysis using the Genefinder program in PomBase
CC (an ACEDB database) with additional predictions for the
CC branch-acceptor sites supplied by the program Sp3splice.
CC CAUTION: It is possible that for any individual CDS we may have
CC underestimated or overestimated the number of introns/exons or
CC we may not have chosen the correct splice donor/acceptor sites.
CC CDS are numbered using the following system eg SPBC25H2.01c.
CC SP (S.
pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC Cosmid c1527 is overlapped at the 5' end by cosmid c23D3, CC EMBL entry SPAC23D3, accession number Z64354, and at the CC 3' end by cosmid c30, EMBL entry SPAC30, accession number AL136538. XX FH Key Location/Qualifiers FH FT source 1..7813 FT /chromosome="I" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /clone="cosmid c1527" FT /map="IR" FT CDS 1..3695 FT /codon_start=3 FT /db_xref="GOA:Q09854" FT /db_xref="SWISS-PROT:Q09854" FT /label=SPAC1527.01 FT /note="SPAC1527.01, len:1230, FT SIMILARITY:Schizosaccharomyces pombe, BAA34054, FT alpha-glucan synthase mok1., (2410 aa), fasta scores: opt: FT 2278, E():0, (50.8% identity in 1263 aa)" FT /partial FT /gene="mok11" FT /gene="SPAC1527.01" FT /gene="SPAC23D3.15" FT /product="cell wall alpha-glucan synthase Mok11p" FT /protein_id="CAB90796.1" FT /translation="GLGVMAQLMARHLEHEDIIWVIPCVGDVSYSNVEEDDPIEVVIID FT QTYFINVYKYVIGNIIYILLDAPIFRRQTSGKPYPSRADDLSSAIFYSAWNQCIASVIS FT RNNIDLYHMNDYHGSLAPLYLLPKIIPVALSLHNAEFQGLWPLRNSSEKEEVCSVFNIS FT KSVCSKYVQFGNVFNLLHAGASYIRIHQKGYGVVGVSSKYGKRSWARYPIFWGLRKIGK FT LPNPDPADNGTNFKDLDANSMNEFENIKAKHKRSAQEWANLNIDPEADLLIFVGRWTLQ FT KGIDLIADITPTLLENFNSQIVVVGPVIDLYGKFAAEKFTALMKKYPGRIYSRPLFTQL FT PSYIFSGADFALIPSRDEPFGLVAVEFGRKGTLGIGAKVGGLGQMPGWWYTIESNTTAH FT LLCQFEEACRQALTSSKSVRTKLRAISTIQRFPVSEWVSKLDTHVRNCIKFSHKQNLEE FT DFIHEPVIDVDEFAISSSKDIDADEDLEIIGSSDNKAIDSNGEGFLIEKDNIGTGSYSN FT QQSFDFKSSESDSFPQKSPSVESFSIIDNDNPFHEGQNSSTGYKDIVQGLLAENGVSEA FT 
GVDVMTSIVSSTIPIVSNHQTEGSQMFNEISSVSSIHVYHDESQPPVEMPAESDTPLQN FT KLYHPGMWSSSDIRIPNNSSQLSIDSVRSGMRPFSLSKVPHQFDDEEGKALQIFREKLK FT DLNCKNSMNEMCIENFLMKCTRKYFDEVRKLRLGTLKPENLQFVKDPSSLALETNLLPA FT SDTIEEKNDVGNKQVNSHILDPGFLKEEECVYEFEQLHGLRKALQVEIYGWPLYTILLA FT IGQVLAATSFQLNLFSDTPDQPEYQSYIVCSVYISASLFWYILHSLVPNIYALSTPFII FT YATAFALAGLTTFSSLGDSRLWISRVATWIYSIASGSQALFFSLNFGNEGRYDILYWIV FT RACFIQGSQQVWSAALWYWGSYSTDKPNRMGVSGKLNPQPWMAAITWPIALVLLAIAFV FT VYRGLPNFYRQCPSKIPAFYRSLFRRRLIMWFFISVFLQNYWMSSVYGRSWAFMWSAHN FT VHKWAMFLLVILFYVAIWIALVALLASLSRYHSWILPILGLGFGAPRWLQTLWGTSNIG FT IYVPFLGKGAPYLSRMLWLYLGLLDTVQTVGIGIILLQTLTREHITVVLITGQIVGAIA FT SMVGKASSPSKFGPGDVFIDFTHWSVKDGPQILASIPFWVCLICQLSVIIGYFLFFRRE FT NLSRP" FT CDS join(4206..4214,4268..4596,4753..5020) FT /db_xref="SPTREMBL:Q9P6K1" FT /label=SPAC1527.02 FT /note="SPAC1527.02, len:201, SIMILARITY:Saccharomyces FT cerevisiae, SFT2_YEAST, sft2 protein., (215 aa), fasta FT scores: opt: 420, E():4.5e-21, (39.2% identity in 194 aa)" FT /gene="SPAC1527.02" FT /product="putative membrane protein required for ER to FT Golgi transport by similarity to yeast sft2" FT /protein_id="CAB90797.1" FT /translation="MEGSFQSRLQSIIQRTGETTAESTNSWYNRLRTSMPWSNDYTEIP FT TNASGGNSYFQSSEFSLSRWERYMLFGICLLGSLACYAIACFMFPVLVLKPRKFVLLWT FT MGSLLAVLGFAIVQGFVAHFRQLTTMERLPITLSYFVTLLATIIATIKIKSTILSIVFG FT VLHILSFVAYLIAFFPFGTRTVSLGTRMASRSLSNWLP" FT misc_feature 4215..4220 FT /note="gtacgg, splice donor sequence" FT intron 4215..4267 FT /note="confirmed by mRNA" FT misc_feature 4251..4267 FT /note="ttaactagactatgtag, splice branch and acceptor" FT misc_feature 4597..4602 FT /note="gtatgt, splice donor sequence" FT intron 4597..4752 FT /note="confirmed by mRNA" FT misc_feature 4736..4752 FT /note="ctaacttatttctttag, splice branch and acceptor" FT CDS 5662..7089 FT /db_xref="SPTREMBL:Q9P6K0" FT /label=SPAC1527.03 FT /note="SPAC1527.03, len:475, SIMILARITY:Saccharomyces FT cerevisiae, SRO9_YEAST, sro9 protein., (466 aa), fasta FT scores: opt: 293, E():2.6e-09, (25.5% 
identity in 404 aa)" FT /gene="SPAC1527.03" FT /product="putative involvement in vesicular transport by FT similarity to yeast SRO9" FT /protein_id="CAB90798.1" FT /translation="MSHKEPSAAETSTVVHSEEDKAFCIGTGAVISEEREKEVLKNLQN FT SLTGKTAEENLNDEANHTSSDKSKSEDYQPSNVNVWALRKEKMIPKKHSHVKQEKRFSK FT SLQLQDPNVWPSPEIAEKQVEDRKLSDDSQKPLAPKANGKEKWVTITPNFTHTPISNRK FT SSRSRNDGSRRNGNGRRRGNYSSHGSNKRQTNYSREKDASRSIDSSNPSAEYRDDINNT FT FGSQTVSSANGKEVPQTSEDSSSQAPHHSSSSGHAPSQQGGNKHSYKKSDSQQSFHHKG FT RNTRKGQRHNNGFYRNIANNIQGPFPNYPVVVNGNGVNPYLCDVQAFLTSQLEYYFSIE FT NLCKDMFLRKHMDDEGYVPLAFLASFNRIKSFSTDLNLLHAACKASDIIDVAIDLQSPM FT SIKVRRKETWSPWILPSESRLKFEMAKYPQINSSSSMSPLASSISNLTISPPFIPSSVD FT SIIKRDVQTEVEDKLTV" XX SQ Sequence 7813 BP; 2352 A; 1363 C; 1442 G; 2656 T; 0 other; gcggattggg cgtcatggcg caactaatgg cccgacattt agaacatgaa gacatcattt 60 gggtgatacc ttgtgtcggg gatgtgtctt actcaaacgt tgaagaagat gatcctatcg 120 aagtagtaat catcgaccaa acgtatttca ttaacgttta taaatatgtg atcggaaaca 180 taatatacat actccttgac gctccaattt tccgaagaca aacaagcgga aaaccgtatc 240 catctcgtgc tgacgatcta agctcggcta ttttctattc tgcttggaat caatgcattg 300 ctagtgtaat atcaagaaac aatatcgatt tatatcacat gaatgattat cacggttctc 360 ttgctccgct atatttgctt ccaaaaataa tcccagtggc actgtctttg cacaatgctg 420 aattccaagg actttggccc cttaggaact cttctgagaa agaggaggtt tgttcggttt 480 tcaatatatc taaatctgtt tgttctaaat atgttcaatt tggtaatgtt tttaatctgt 540 tgcatgctgg cgcctcatat attcgaattc atcaaaaagg ctacggagtg gtgggtgttt 600 ccagtaaata cggaaaacga tcttgggcta gatatcccat attttgggga ttgaggaaaa 660 tcggtaaact cccaaatccc gacccagcag ataatggtac caattttaaa gaccttgacg 720 cgaattccat gaatgagttt gaaaacatca aagcaaaaca taagcgttca gctcaagaat 780 gggcaaatct caatattgat cctgaggcag atcttttgat atttgtgggt cgatggacgt 840 tgcaaaaggg cattgattta attgcagaca tcactcctac cctattagaa aatttcaatt 900 cccaaattgt tgtggttggc cctgtcattg atctttatgg aaaatttgct gctgaaaagt 960 tcacggcttt aatgaagaag tatcctggtc gaatatactc taggcctctc tttactcaac 1020 ttccttctta catattcagt ggcgcagatt ttgctttgat accctcaaga 
gatgaacctt 1080 ttggtttggt ggctgttgaa tttggtagga aaggaacatt gggtattggt gccaaagttg 1140 gaggccttgg acagatgccg ggatggtggt ataccatcga atctaacaca actgcacatt 1200 tgctctgcca gtttgaagaa gcctgtcgtc aagccttaac atcttcaaag tctgtgcgca 1260 ccaaacttcg agctatttct actattcagc ggtttcctgt ttctgaatgg gtctcaaagc 1320 ttgatactca tgtgagaaat tgtatcaagt tcagccataa acaaaactta gaagaagact 1380 ttattcatga acctgtaata gatgttgatg aatttgcgat atcgtcatca aaagatattg 1440 atgcagatga agatttagaa attatcggaa gctcagataa caaagcgatt gattcaaacg 1500 gtgaaggatt tttaattgaa aaagataata ttggaacggg aagctactca aatcaacaat 1560 catttgactt taaaagtagc gaaagtgatt catttcccca gaaaagccct tcggtcgaaa 1620 gtttttctat aattgataac gataatcctt tccacgaagg acaaaattct tctactggtt 1680 acaaggacat tgtgcaaggg cttttggctg aaaatggtgt ttcagaagct ggagttgacg 1740 ttatgacttc tatagtttca tccactattc caattgtatc aaaccatcaa actgaaggtt 1800 ctcaaatgtt taatgaaatc tcatctgtgt cgtctattca tgtataccat gatgaatcac 1860 aaccgccagt tgaaatgcca gctgaaagtg acactccctt acaaaataaa ttataccatc 1920 cagggatgtg gtcgtcttct gacattagaa taccaaataa tagctctcaa ttgagtattg 1980 attcagttcg atccggtatg cgaccatttt cactttccaa ggtacctcat caatttgatg 2040 atgaggaggg aaaagcgctt caaatatttc gagagaaatt aaaagatctt aattgcaaaa 2100 attctatgaa cgaaatgtgt attgaaaatt ttctcatgaa atgtactaga aaatattttg 2160 atgaagtccg aaaattacga cttggtactt taaaacctga aaatcttcaa tttgttaagg 2220 acccatctag tttggctttg gaaactaacc ttttaccagc atccgacaca attgaggaaa 2280 agaacgacgt aggtaataaa caggtgaatt cgcatatatt agatccaggg tttctcaaag 2340 aagaagaatg tgtatatgaa tttgagcaac ttcatggact tcgaaaggct ttacaagttg 2400 aaatttatgg atggccatta tatacaattt tgctagctat tggacaagtg ttagctgcta 2460 cctcttttca actgaattta ttttcagata cgcctgatca gccagaatac caatcctata 2520 tcgtgtgtag tgtttatatc agtgcatctc tattctggta tattttgcat tctctcgtac 2580 ccaatattta cgcactttca accccattca taatttacgc tactgcgttt gctttagctg 2640 gccttacaac gttttcaagc ctcggtgatt cacgtctttg gatttcacga gttgcaacct 2700 ggatttattc aattgcctct gggagtcaag ctttgttttt ttctttaaat tttggtaatg 
2760 aaggaagata tgatatcttg tattggattg tgagagcttg ttttattcaa ggatcgcaac 2820 aagtatggtc ggcagcactt tggtattggg gttcttactc aacggataaa ccaaatagaa 2880 tgggagtttc gggaaaatta aacccacagc catggatggc agccattaca tggccaatag 2940 ctttggtttt actcgcgatc gcttttgttg tgtatcgagg tttgcctaat ttttacagac 3000 aatgtccttc aaaaatacca gctttctatc gttccttgtt tcgccgtcgg ttgataatgt 3060 ggttttttat tagcgttttt cttcaaaatt actggatgtc atcggtgtat ggtcgatcgt 3120 gggcctttat gtggtctgca cataatgtac ataaatgggc tatgttttta ctagtcattt 3180 tgttttatgt tgctatatgg attgctttgg tggcattact tgcttctttg tccagatacc 3240 attcttggat cctacctatt ttgggattag gatttggagc tcctcgctgg ttacaaacat 3300 tatggggcac gtcaaatatt ggcatatacg taccgttttt aggaaaaggt gcaccctatt 3360 tatcaagaat gctgtggctc tatcttggct tattggatac agttcaaact gtcggtattg 3420 gaattatatt attgcagaca ctaacaagag agcacattac tgttgtttta ataaccgggc 3480 aaattgtcgg tgctattgca agtatggtag gtaaagccag ctccccatca aaatttggac 3540 caggggatgt ctttatagat ttcacacatt ggagcgttaa ggatggtccc cagatccttg 3600 caagcattcc attttgggta tgcttgattt gtcaattatc tgttattatc ggttacttcc 3660 tgttctttcg ccgtgaaaat ttaagtcgac cttagatatt tacttttctt tgaatagaat 3720 agtgtcaaat tttatatata gaattaatga ttacttaata ttacctctgt tattagcagt 3780 tttagccgta acagcaatgt aaagtaaatg aaaactttga ttttataaat agactaaaca 3840 aatcattcat aaagcggcaa aaaagtaatt cctatcgatt catatataat tgataagtgc 3900 tttttgaata gtcataggac taatactagg ttttggtagg actttgaacg aaattggtac 3960 ttaatataat aggtaagaac ttcaatgtgc gtttttatgt ataactgtaa taatgaaata 4020 tggaaaatat aatagaaaag ctaccaaatg tcttattaca tgtaagcgtt gagatataat 4080 gaacttttat tataatggac gcgttgagtt gttacatgaa tattcgcagc cattcgctat 4140 tcacttaaca ccgacgtgct ttgggacttg aattagtaaa ccaaagctac attatcttta 4200 gaaaaatgga aggagtacgg tctgattttt attgttatag aattgaaatg ttaactagac 4260 tatgtagagt tttcagtctc gtctgcaatc tataattcaa cgaactggtg aaacgacagc 4320 tgaatcgact aattcttggt ataataggtt aagaacctca atgccatggt caaatgacta 4380 tacagaaatc cctacaaatg catcaggtgg aaattcttat tttcaaagct cagaatttag 4440 
tttatctaga tgggaaaggt atatgctgtt tggcatatgc cttctaggaa gtttagcatg 4500 ctacgcgatt gcttgtttta tgtttcctgt tttagtactg aagcctcgaa agttcgtatt 4560 gttgtggact atgggaagtt tgttggcagt tctggggtat gttatagaaa agccaaattc 4620 aattttggta caatctcttt tttgaaatgt atacatcccc tattgtagaa catgggtttc 4680 gtattataac tttgcagtaa tctggcctct atgattgcta cttttttttc gattgctaac 4740 ttatttcttt agctttgcaa ttgttcaagg tttcgtagct cattttcgac aattaactac 4800 tatggaaaga ttgcctatta cgctatcata ttttgttact ctattagcta ctatcattgc 4860 gaccattaaa atcaagtcta ctattcttag catcgttttt ggagtcttac atatattatc 4920 atttgttgcg tatttgattg cgttcttccc atttggaaca agaacggtat cccttggcac 4980 tagaatggca tctcggtctc tttcgaactg gttgccctga aacttgttac gatttaataa 5040 ttttttctta cttataatat cgactttagg tattccaaat tgctgacgaa gtcgatactt 5100 aatagttacc gctctttcat tatgatttat ccccatatcc tattgaccga atctgttaat 5160 atttcctact cgcattaaaa tagttggttc ttcttccctg ttataagctc agtttacaat 5220 gatagatatg atttgatttt ttaataaaat gtagaattgt attggtacaa cccatttttc 5280 tgaataacta ataatcgaac tttcattact tactgaatac ataaggtagt actgtaaaac 5340 tcgctattta cttttgtcta atataaaatg catttttgca tttggaaaag ccgttttagt 5400 ggtgaactta ggtgaacctt accttatttc tttaaaacca attgaaaatt gcttttcgat 5460 agttgaattg gatttttgat ttttaaattt gcttttaacc gcctctttta ttaaagtgtt 5520 catttctcat cactacatca aagcaatttt catttgaacc tgaaattttt tgcaagaacg 5580 tcggatattc tattttgtgt tattggtctt taattagttt accgattttc tattatctgg 5640 taatttattt tagtaaaaaa aatgtctcat aaagaaccaa gtgctgctga aacttctacg 5700 gttgtccatt ctgaagaaga caaagccttt tgtataggta ctggtgctgt aatttcagaa 5760 gagcgtgaaa aagaagtatt gaaaaattta caaaactcct taacgggaaa aaccgctgag 5820 gagaatttaa atgatgaagc caatcatacc tcttcagata aaagcaaaag cgaggattat 5880 cagccttcaa acgtgaacgt ttgggcgctt cgaaaagaaa aaatgatacc taaaaagcat 5940 agtcatgtga aacaagaaaa aagattttcc aaaagtttac agcttcagga tccaaatgtt 6000 tggcctagcc ctgaaatagc tgaaaaacag gttgaagata ggaaattgtc agatgattca 6060 caaaagcctt tagcacccaa agctaatgga aaggagaaat gggttacgat taccccaaat 6120 ttcacgcata 
cacctatcag caaccgcaaa agtagtcgtt caagaaatga tggctcacgc 6180 agaaacggaa atggaagacg tagaggaaat tattcttctc atgggtccaa caaacgacaa 6240 actaattatt cacgagaaaa agacgcttca cgttcaatcg attcatcaaa cccatcagct 6300 gaatatcgtg atgatataaa taacactttt ggttcccaaa ctgtgtcaag tgccaatggt 6360 aaagaagtgc ctcagacttc ggaagattcc agttctcaag cccctcatca ctcttcctct 6420 tcaggtcatg ctccgtctca gcaaggtggt aataagcatt catataaaaa atccgactct 6480 caacaatcct tccatcacaa gggaagaaat acacgtaaag gacaacgaca taataatggt 6540 ttctaccgaa acattgccaa taatatacaa ggtccatttc caaactatcc tgtcgtcgtc 6600 aatggtaatg gcgttaatcc ttacctttgt gatgtccaag catttctcac ttcacaacta 6660 gagtattatt tttcgattga aaacttgtgc aaagatatgt ttcttcgtaa acatatggac 6720 gatgaaggtt acgttccttt agcatttttg gcttccttca atcgaattaa gagcttttct 6780 acagatttga atcttttaca tgcagcttgc aaggcttcag atattattga tgttgctatt 6840 gatctccagt ctcctatgtc cattaaggtt agaagaaagg aaacttggtc tccttggatt 6900 ttaccttcag agtcacgctt gaagtttgaa atggccaagt atcctcagat aaactctagc 6960 tccagcatgt cccctcttgc cagttccatt tcaaatttga caatttctcc tccattcatt 7020 ccttcttctg tggattcaat tataaaaaga gatgttcaaa ctgaggttga ggataaactt 7080 actgtttaga atttttgata ttttttttat ttctcgaggc atcaatattt ttcgttcatt 7140 ctgggtctgt tcttttattg acgcctgtac ttttttttat ttaaaaggca aggattgttt 7200 attgaaactt gacttctata gtttggtttt ggttactgaa gcgtaatggg tttatttctt 7260 tcggttaaaa gaaaaaaaaa aaaaaaaaaa aaaaaagaaa atattgtact tattataaga 7320 gtatatgtcg ttttcgataa atgggattac tggattaaat gattatctaa agtgatgtgt 7380 catttatgct aaattctaat ttttttttac ttaaaaaaaa tcattgttta tgatttgcat 7440 atagtttctg ttattatttt caactgtgat acgctgtaac ttcatgcttc ctgattatat 7500 cttccaaatc actggctccc tttattattt tgtcatggca aaattttgtg agtttaatat 7560 gttttctgta aagtaaattt accatttgta ccttgatgct tcaactcata atgtactcgt 7620 ctttacttta agctattcta tatttaaagg ccgcagggac atgaaatgaa taatagaaga 7680 ggatatttca cgaaaaatct ttaattgatc gttaaaaaaa aggtctctgg tgtgtgtggc 7740 agtagcatct tccttctggc gttggattac tacaaatact gattagctta catgcagaat 7800 attaagaggt tgt 7813 //