ID SPAC1639 standard; DNA; FUN; 5601 BP. XX AC AL117213; XX SV AL117213.1 XX DT 06-SEP-1999 (Rel. 61, Created) DT 06-SEP-1999 (Rel. 61, Last updated, Version 1) XX DE S.pombe chromosome I cosmid c1639. XX KW GNS1/SUR4 family; potassium transport; SUR4 family protein. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-5601 RA Wedler H., Duesterhoeft A., McDougall R.C., Rajandream M.A., Barrell B.G.; RT ; RL Submitted (06-SEP-1999) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk and QIAGEN GmbH, Max-Volmer-Str 4, D-40724 Hilden, RL Germany XX DR GOA; Q9UR34; Q9UR34. DR GOA; Q9UTH7; Q9UTH7. DR SPTREMBL; Q9UR34; Q9UR34. DR SPTREMBL; Q9UTH7; Q9UTH7. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC Cosmid c1639 is overlapped at the 5' end by cosmid c806 and at CC the 3' end by cosmid c1F5. XX FH Key Location/Qualifiers FH FT source 1..5601 FT /chromosome="I" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /clone="cosmid c1639" FT /map="IL" FT misc_feature 1..2282 FT /note="nominal overlap with cosmid SPAC806 S. pombe FT chromosome 1" FT CDS join(complement(2083..2096),complement(1530..1890), FT complement(864..1475)) FT /db_xref="GOA:Q9UR34" FT /db_xref="SPTREMBL:Q9UR34" FT /label=SPAC1639.01c FT /note="SPAC1639.01c, len:328, SIMILARITY:Saccharomyces FT italicus, AAB87766, v-snare bypass mutant., (347 aa), fasta FT scores: opt: 905, E():0, (49.2% identity in 309 aa)" FT /gene="SPAC1639.01c" FT /gene="SPAC806.09c" FT /product="SUR4 family protein" FT /protein_id="CAB55289.1" FT /translation="MLDCWSWLNSLADATFGKRPSSFEFIVNKTRFSSAPVVATIIISY FT YLLILVGGRIMRNRQPIRLQKIFQYYNLTFSIASAILALLIFEQVAPAIYKHGFFFSIC FT NEKAWTQPLVFLYYCAYISKFLELTDTFFLVLRKKPLQFLHCYHHGATAVLVYTQIVGR FT TSISWLIIEINLLVHVTMYYYYYLVAKGIRVPWKKWVTRFQIVQFFADLGFIYFAVYTE FT VAYRLKFYKACMGHCSGHPLAAFCGLATISSYLVLFIVFYHNTYKKNAALKMKAKAAAA FT TKGNSSESSKNADLKRLSKSNASIAEVKCNNIVTNLYPISSGLNNEK" FT misc_feature join(complement(1517..1910),complement(1047..1462)) FT /note="Match to PF01151 GNS1_SUR4, GNS1/SUR4 family Score FT 315.69" FT misc_feature complement(1476..1491) FT /note="ctaacatacgttttag, splice branch and acceptor" FT misc_feature complement(1524..1529) FT /note="gtacgt, splice donor sequence" FT misc_feature complement(1891..1904) FT /note="ctaacaatttttag, splice branch and acceptor" FT misc_feature complement(2077..2082) FT /note="gtatta, splice donor sequence" FT misc_feature complement(4257..5601) FT /note="nominal overlap with cosmid SPAC1F5 S. pombe FT chromosome 1" FT CDS join(complement(5565..5583),complement(5483..5508), FT complement(5168..5272)) FT /codon_start=1 FT /db_xref="GOA:Q9UTH7" FT /db_xref="SPTREMBL:Q9UTH7" FT /label=SPAC1639.02c FT /note="SPAC1639.02c, len:50, SIMILARITY:Schizosaccharomyces FT pombe, TRK_SCHPO, potassium transport protein., (841 aa), FT fasta scores: opt: 151, E(): 8.4e-06, (70.0% identity in 30 FT aa)" FT /partial FT /gene="SPAC1639.02c" FT /gene="SPAC1F5.12" FT /product="putative potassium transport protein" FT /protein_id="CAB55290.1" FT /translation="IRGRHRGLPSALDRAVLMPSDKNFDREEEDYMRRHGKKNTNRADP FT VPSS" FT misc_feature complement(5273..5293) FT /note="ctaacggtttatttactttag, splice branch and acceptor" FT misc_feature complement(5477..5482) FT /note="gtaagt, splice donor sequence" XX SQ Sequence 5601 BP; 2010 A; 971 C; 1050 G; 1570 T; 0 other; gtgtatattt cgtatactat cgcgaatgaa ggaaaatttt tgtttgacat gagagaaatg 60 agttgagtca atttatgact gaagggggtt aatcattatt tgccgttaag ctgtgtaagt 120 catttataat agtaataaaa aaaaaactaa tatattttgt tttaaacgta tttactgtta 180 ttagattaac cggcgccgct ccctaactga aatagtaaga aaacttcaaa ataaataatg 240 aattccgtat gctcacttgc aatgactaca agttatacag agaaatttac caaccttaca 300 caactgtagt actgttatta gaacgtatat tttcttacta atttataccc atttttcata 360 agttgtgcat tagtattacc atggtttttt aatataaaat attaaacaaa tggactttta 420 atctttggaa aaaaagattc agaacattat attgaagaaa agattttggc tctgattttt 480 ttaattatat tatcacacca tcaagtttga ccttaagata ccacgaagtt aacaaatcta 540 tacttagtag aggtttagct aaaaagttta gaacatgtaa agacgtcgat aaatcttgat 600 taaggaatgg gaaaaaggat tcactggata aaaggtagta atcgaatata gacaaggatg 660 taaagctaac ggacaattaa gtactttcaa aattcacaaa atcgtcgccc gtaagtcact 720 ccaactatta attggtaatt ctaaatgcaa cgttaagcag aacaaagaaa aaaaaggcac 780 ttgtagactt tagggctcaa tatcaagtta taagtaagag caaaatcggg taattaaaaa 840 ccataagagg taccacgaaa acattatttt tcattgttta aacctgaaga aataggatac 900 agattagtaa ctatattatt gcatttaacc tcggcaatag aagcgttact ttttgagaga 960 cgctttaagt ccgcattttt agaagattct gatgagttac cttttgttgc agctgcagcc 1020 ttagctttca ttttgagagc agcatttttc ttgtaagtgt tatggtaaaa gacgataaac 1080 aaaacaaggt atgaagagat tgttgcgaga ccacagaacg ccgctaaagg atgtcctgag 1140 caatggccca tacaggcttt gtaaaacttc aaccgatacg caacctccgt gtatactgcg 1200 aaataaataa aacctaaatc agcaaagaac tgaacgatct gaaagcgggt gacccatttc 1260 ttccaaggaa cccgaatacc tttggctaca agatagtagt aataatacat agtaacatga 1320 accaataaat tgatttcaat gatcaaccaa ctgatggaag ttctgccaac aatttgagta 1380 tatacaagaa cggcagtggc accatgatgg tagcaatgaa ggaattgaag gggttttttg 1440 cggaggacaa ggaaaaaagt atctgtcaat tcaagctaaa acgtatgtta gaatgaaaaa 1500 agtatggtca aaaaacgatg gagacgtacg aattttgaaa tataagcaca gtagtataaa 1560 aataccagcg gttgagtcca tgctttctca ttgcagatag aaaagaaaaa accgtgcttg 1620 taaatagcag gggctacttg ttcaaaaatc aaaagtgcaa ggatagctga agcgatggaa 1680 aaagtcaaat tataatattg gaaaatcttt tgaagacgga taggttgcct attacgcata 1740 attctacctc cgacaagaat taaaagatag taagatatga tgatggttgc aacgaccggg 1800 gcggacgaaa agcgggtttt gtttacgata aattcaaagg aagatggccg ctttccgaat 1860 gtggcatctg ccagggaatt taaccatgac ctaaaaattg ttagtaaaaa tcaacatgaa 1920 agcctctcaa tcgcattaaa actagtgaaa tttctattgt tctttcagac gcttgctttc 1980 atgtttggta gtacttaacg actacacaca aaagtaacac atgatgtgcc tcaaaaaaag 2040 acacacccca gagatgaaca ttccacaatc aatcagtaat accagcaatc caacatcagc 2100 taaaaagtca cagagtttaa cttgagatga aaaagttttg gatgaaaaaa atttcagtaa 2160 acctaacgaa attttcaccc ttgttgattc cacctactct gccgagtcta tcgggccgtc 2220 agcgaaaatg attggctaag agctcgcttt acgaaccgtt aagcatccac atggtggaga 2280 tcttaataaa caacggaatg ataaaaaaca acaacgaaaa ctagaaatac cgatccgctt 2340 tttatcaatt gaatatctaa cagaggatgc tgaacggaac gatatagcga gcgacagtgt 2400 aacaacggta agcaaagaaa ggacgacaac aaagaccggt atgctggtag gagagtgtca 2460 aaaacaaaga aaactcaaaa cagaaagtgg ctggtctaca gtcaaaattt cagaaaattc 2520 aaaataaaca gcgccggaag tagaaaggca ttgaacaagg ataaacgaaa aatatatact 2580 gattgatttg tgtggaaaaa aagcaagcat cttctagaag agaaattcac gtacgcatca 2640 ctatgtaggc acattagtga gcaaagctat ggcaaacatg gaaaacagtt aaatagctta 2700 gcggaggcat aatatcgaca aactcaccag atgggatact gaaattgtcc taccgaaaca 2760 gatgccatgc cttgatgatt attgttagag gttagagacc cgttctcaag cccaataata 2820 tggttatggt gaagggtagg tgagtcaggc ataatgaaaa cgtatttgag atttgcaggg 2880 tacaaatggc ggagactagc aagtaacgaa aacaaataat accgtaaaat acggacgaag 2940 tttaagaatt tggtatcgag ttctttaaat gaaaatacgg atgagttcag cccgctcact 3000 gcgttgtgat atttgcacaa aaaaagcgag gaaggggatg cgggtaggtg gaagaaaaaa 3060 gaaaaagact attccaaagc aagggaataa ggaacggaag ctgaaacaaa aagaaacgaa 3120 ggttccagga gtatgtaagg ggttaaactg tacagactgc aatcacagaa aaaccgtggg 3180 aatggatgga cgggatgaaa aaataaaata acaatatccc tgacgatatc gagaaataag 3240 gaaacgagat tcgatctctg cttggagttg aaaaaacgtg gattcaacgg atcgacttgg 3300 taaacaaaga tttgccaaca aaataaaaga gcttagtcgc acaaaaagag tacttggaac 3360 aagtaggtac gtatgatact aagaaaaact taactgtgaa tgaggaaaag gaggtcttct 3420 tcaatacgtt tctcgagtgt tataagaaga aggatgaaat gatgagccaa acgattgctt 3480 gttatttctt tgaatacaca aagaatgcaa acaatgagtt agtgtcgttc aacaataata 3540 acgctatagc aataagaata aggtcactgc aaacaatcgc aagtcacctg gcacgaaaat 3600 acacgagacc gaaaatggcc tgataccgac tttaaaaaca aaaaaaaaga tccgatatat 3660 tgtagaaaat cagctgaaaa attctcgcac cgttgggaat tgccggaacg gtatatatac 3720 tctcactttt cttttatagc gtcatttcca aacttgcata tagcggaaat catgtcatgc 3780 ttccgaatct ggggataagg taacggaagc catggtgcgg gacccgcgtt ttcgtatgta 3840 tggttgaaac aagaacgtta tgttaaaatg taaacaaaaa ataataatag caacgctggt 3900 caccaagcgt aaatattaag agaaagcgat tgcgcaaaaa gtcaaccgta tgaaaccagg 3960 ttgaaaagga aagtttgcat tgcttttcga tagcgactaa tgtaatccgt acgactctaa 4020 tgaatttttt atactcttcc actcttaccc ttgttttaag gtattgtatg gtaataaagt 4080 tcgtgcgaca tgttataatt attaagtaat attgtgagcg cttcctttgt gcgcgctgag 4140 catttcttgg aattttgttg atcaaatatt ggcatggaaa gcggtggaca aatccaaaat 4200 ccccaaaata aatacctcaa cggagttctc tatcaatgcg aattctgcca gaaattgatc 4260 ggtcctttct taggggcgta tatactttcc gaactttcat ggtttccttt tcttcctatt 4320 cggctcaagt ttcatcattt ttacttgcgg ttccaaagga tttgccatcg atgaacagta 4380 cgcgccaatt tttcaatcgg tctgtgattt actaacgtct ttttccatgt tttaccgcat 4440 attttattga attatctaag gaggaaaaat gcaattgaat ttgcagtcga cttggtgttc 4500 accgtaatgt ctgataattt catgtccgac ataaaaaaaa agaaaagctg tgtttcattt 4560 tgatccgttt aatgctttat tttttcattc cctttgcttt tgctaccgaa actgtcggca 4620 aacaccaccg ttcatttcat attcgtcatt taccgaattt tttttcatcc cgcaaatcgt 4680 cgatctcata attatcgggt agcgtttgaa cgattaaaaa catataagca tcatcccaaa 4740 tcgatttaag cattaaacac tacaccctaa tcttgtccta caaacatact ataaaatcaa 4800 tgcgacaata caaacacttt acagtacagt gactttttgt gagctcgtca caattcccgc 4860 aggcgggcat taccctcatt tcttcccgaa tatactttga ttacactaat aactatttac 4920 aaataaattg atcaaatttc ttattgacca caaattctaa acatgaaaac tttaacgaat 4980 gtagtgaaaa caagtaaatg tacataccaa acaaaagtgt aaaataaaat aaaacaaaaa 5040 gttcttttaa ggcccaatca gttaaaatgg cacagaacca gaagatacag tacaagtttc 5100 attagctcct gaccttgcct tgcattttca tagaaagtaa gtctaatcaa ccgtgaggtt 5160 ataatcatta agaactgggt accgggtctg ctctattagt attttttttc ccgtgacgtc 5220 tcatataatc ctcttcttcc cggtcaaagt ttttatccga aggcattagc acctaaagta 5280 aataaaccgt tagtagattc atttttaagt gtatcttcta actatagaaa gaaggtgtgt 5340 tgcgatgaaa ataaaaacct ttctcaatta taaggttttg tcattgctaa atggattttt 5400 tgaacacatc aaaactgcac atctcatgac atttcaaaat catctgctaa caccctttaa 5460 atcgtgttta gttcaaactt actgctctat ctaatgcact tggaagtcct gtttttttaa 5520 aatcgttaat acccttctct cttgataaca tcttagtgac tcacctctat gtcgtccacg 5580 aatctaaacc attattagac a 5601 //