ID SPCC2H8 standard; DNA; FUN; 6221 BP. XX AC AL049567; XX SV AL049567.2 XX DT 30-MAR-1999 (Rel. 59, Created) DT 19-JUL-2000 (Rel. 64, Last updated, Version 2) XX DE S.pombe chromosome III cosmid c2H8. XX KW inorganic phosphate transporter. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-6221 RA Lyne M., Rajandream M.A., Barrell B.G., Volckaert G.; RT ; RL Submitted (29-MAR-1999) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk and Katholieke Universiteit Leuven, Laboratory of RL Gene Technology, Kardinaal Mercierlaan 92, B-3001 Leuven, Belgium XX DR GOA; Q9Y7Q9; Q9Y7Q9. DR SPTREMBL; Q9UUN4; Q9UUN4. DR SPTREMBL; Q9Y7Q9; Q9Y7Q9. DR SPTREMBL; Q9Y7R0; Q9Y7R0. DR SPTREMBL; Q9Y7R1; Q9Y7R1. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC Cosmid c2H8 is overlapped by cosmid c1393, EMBL entry SPCC1393, CC accession number AL035592 at the 5' end and by cosmid c63 at the 3' end. XX FH Key Location/Qualifiers FH FT source 1..6221 FT /chromosome="III" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /clone="cosmid c2H8" FT /map="IIIL" FT misc_feature 1..270 FT /note="Originally marked as SPCC2H8.01, but no homology FT evidence, and no MET" FT misc_feature 1..441 FT /note="nominal overlap with cosmid SPCC1393, EM:AL035592 S. FT pombe chromosome 3" FT CDS 1791..3542 FT /db_xref="GOA:Q9Y7Q9" FT /db_xref="SPTREMBL:Q9Y7Q9" FT /label=SPCC2H8.02 FT /note="SPCC2H8.02, len:583, SIMILARITY:Schizosaccharomyces FT pombe, O42885, inorganic phosphate transporter, (274 aa), FT fasta scores: opt: 1082, E():0, (61.8% identity in 267 aa)" FT /gene="SPCC2H8.02" FT /product="MFS inorganic phosphate transporter" FT /protein_id="CAB40203.1" FT /translation="MGFKLNPFSKKPKDEEPLPLEQYEASEQKILGLVTKKEAKLLAIA FT GTGFLLDSYDLFIINLVSPILAYLYWGGLTGHQDYPSGIRGVVNAATNIGNIMGQLLFG FT FLGDFFGRKFVYGKEMMVVIIATILIICLPDRIPTPTAKMMWLFAFRVMLGIGIGGDYP FT MSASITSEQSLINRRGALLAWIFSNQGWGTLAGCVATLIILACFEKPLNDRGEYTKLNG FT VWRIQFGIALFPAVIVLIPRLRMQESEQFKNSKNMKSPGEGDLDSASQIELHDFKKKES FT LTAEFTTSSPSTASLSDKKNPGSVHIRPNNEVAPSSAPSRAPSTTSVESNTEGKESDIQ FT TGSSFVSYFKEWRHAKLLIGSALSWFLLDIAFYGINLNQSVILQEMGFNKGVNEYHILQ FT KNAIGNLIIAVAGYIPGYWVSVVLIEVMGRKWIQIQGFLICCLLFGVLAGTWETISTGG FT RFACVALAQFFFNFGPNTTCFVIPAEVFPSRVRAFSHGICAACGKAGAILSALLFNKLT FT EVIGFGNVLWIFFGCMVAGAVVTLILPETANRDADLIDRLEIAAMQQGRTSIIDRSEKW FT AWWKHGI" FT misc_feature 1908..3464 FT /note="Pfam match to entry PF00083 sugar_tr, Sugar (and FT other) transporter" FT CDS join(3734..3743,3805..4022) FT /db_xref="SPTREMBL:Q9Y7R0" FT /label=SPCC2H8.03 FT /note="SPCC2H8.03, len:75" FT /gene="SPCC2H8.03" FT /product="very hypothetical protein" FT /protein_id="CAB40205.1" FT /translation="MCSGSRLHIYGSSLADYCLNAKYGQRYATFSILKLVEYNCFPGFI FT SSFHSLLTYRKKPHRQNTSNMQGLMRLVAC" FT misc_feature 3744..3749 FT /note="gtcagt, splice donor sequence" FT misc_feature 3773..3804 FT /note="ctaactcgacactgtacttattattattatag, splice branch and FT acceptor" FT CDS join(4423..4629,4694..5287) FT /db_xref="SPTREMBL:Q9Y7R1" FT /label=SPCC2H8.04 FT /note="SPCC2H8.04, len:266" FT /gene="SPCC2H8.04" FT /product="hypothetical protein" FT /protein_id="CAB40207.1" FT /translation="MSAAWRKKITNAHRNECLQVYRSLLRSCDEIQNRPLGEKLSKIVK FT ARTRHYQNVGNAYKAEALLKDAKKQCETLKSALNGKLPEVNIESHMLSKYRKYLNEGGK FT QGDVGITKPHTKTKKLKSNSIFNYVRNGEVLYHLVSTTGGDTFLRPKFWPQSQRISGML FT KKRITMKENKHRLIQSMSCMLRHAKLEDEFMKAYISEEDVGYADEIHKCIQMVLSDLDS FT IYKRESKNWHIVSQFYNMLELEAARQKLENRYLNFWKVSRRSSP" FT misc_feature 4630..4635 FT /note="gtaagg, splice donor sequence" FT misc_feature 4682..4693 FT /note="ctgatgacttag, splice branch and acceptor" FT CDS complement(5592..6221) FT /codon_start=1 FT /db_xref="SPTREMBL:Q9UUN4" FT /label=SPCC2H8.05c FT /note="SPCC2H8.05c, len:203, SIMILARITY:Caenorhabditis FT elegans, Q22010, r186.4 protein., (643 aa), fasta scores: FT opt: 139, E():0.28, (29.2% identity in 168 aa)" FT /partial FT /gene="SPCC2H8.05c" FT /product="hypothetical coiled coil protein" FT /gene="SPCC63.01c" FT /protein_id="CAB40209.2" FT /translation="QPCVSWKVSPLKAKAMSPDDVSRLFCKSSTSSATRKHDPFHKLWD FT RLQPKAQSTIQRTSSLPVPSSSNFKERLNNIGGLKRSRTLESSYEDETETANKLSRVSS FT LVSVIRQTIDRKKSLERRVREEQEEKTDNEDDNDVEISTQESLENNGLAEKKDDTSSLA FT TLEDDIEGQEFSFDDQDLQMLQDIEDQWLSSQKQQGSPLTSDHISK" FT misc_feature 5606..6221 FT /note="nominal overlap with cosmid SPCC63, EM:AL049522 S. FT pombe chromosome 3" XX SQ Sequence 6221 BP; 1803 A; 1057 C; 1129 G; 2232 T; 0 other; actgtcaata cataccaaac tgaattacaa atacccttgc ataacctcga aagtataaaa 60 aacaaatgtc tttcatttgg tgctcacgcc caaacgttaa accagcaaca atttgtgtta 120 atttctctcc cgctttcttt tacagatatc aagtttcaaa aataccaagt attaacaaaa 180 aataaatact tggattttgc aacgactatt tacaatcctt tacatttttc tttaacttct 240 gtgattttct atgaaacgca gttgacatga ttttattttt ctctttattt ttttttaaaa 300 agaagcaatg tctagtattg ttgaaatttt agcttgatgt ttgttgaagc aaatcatcat 360 tttgcaagct gaatgagccc gagtgttact tctatttact attaatctat ctgattttgt 420 attgctttag gtgttaagat catgtaatct ttggtataca gaaggttatg aaactggagg 480 ttcaattttg cttttctttt tttttttttt tttttttttg aaaaagcacg catctcacaa 540 taaatcaata atttttgtag ctctttctaa ttttatatat aaacgtgttt gtcttctttg 600 caaatgagaa agactttgtc ttgattttcc tgtttattta tttactttct tttttctgtt 660 acattgttct ttcggcattt tccgttcaaa agttactctt tattttaagg tacttagaat 720 ttggtgaatc tttagttttc ttttcttgtt gctaattcaa tattgatgaa tttcgttgct 780 ttttaaaaat atttatttat ttatgttagc attcgttttc gaagtcttca gcaaaatttc 840 aatagttcaa tagatctttt tgcgtgccaa ttgaaaaaaa aaacgataat aataataata 900 accataatgg aatgagcatc gtttgattaa acaacaacat caatccgttt actaaatgaa 960 tctatatcct atgccatctt ataagtaaaa aatgaagata caaattacag cactctggtg 1020 gaacacgata aaccgattaa gaatttatta ttcagcagac gttatttaat ttttgataat 1080 ctctgctcca ctttgctaac gtaacgactc gtcattgtcg atgtctttaa cgacctttgt 1140 tacggtcctt taaaaagttt aaccaataag gccgaggcca tctttgagcc aaaatatacc 1200 atcaaatttc actttgcctc ttcacatacc caattcctta gggattttta aaattttaga 1260 gtattagtgt tattcacttg ttgttacgca tatttttata gacctgttta ttccccaagg 1320 tttacttgtg taattccctt tttccgaggt cgaaaagtga aatttcttat gatttaatga 1380 gtacattaaa ttgattaaga aatcaaggaa gggaatttaa aaaagtatcg ctcaaattgt 1440 ttttaactga caacttattt ttatgaagtt gtttgagcga aaaaattaat atatcctatc 1500 gcttgttgaa tttttataca ttgtttggga ttgtttgttt agtttagtca ttcaagtttg 1560 tttgtatcta aaagcaaaag tctttttttg ttgcagcagt aaattctccc tgcattgttg 1620 aataaatact tatctgatca ttgtggttta gcttattttt gctttattta attcatcaat 1680 ttcgttgaaa gtgaaagtct ccttcatttt cagcttattt gctaattttg ttactgttta 1740 atacaagacc gtttcactca acttcagagg tttcactagt atagcggaaa atgggcttca 1800 agttgaatcc attttcaaaa aagcccaaag atgaggagcc tttaccctta gagcaatatg 1860 aagcttctga gcaaaaaatt ttaggccttg tcactaaaaa ggaggccaaa cttcttgcaa 1920 ttgctggtac tggttttctt ttggacagtt acgatctttt cattattaac ttggtttcgc 1980 caattcttgc ttatttgtat tggggtggat tgacaggaca tcaagattat ccttctggta 2040 ttcgcggtgt tgtaaatgct gctaccaaca ttggaaatat tatgggtcaa cttttatttg 2100 ggtttttggg tgatttcttc ggtcgtaaat ttgtctatgg caaagaaatg atggttgtca 2160 ttatcgctac tattttaatc atttgtctcc ctgatcgtat tcctacaccg actgctaaaa 2220 tgatgtggct ttttgctttc cgtgttatgt tgggtattgg tattggtggt gattatccta 2280 tgtctgcatc catcacttca gagcagtcat tgattaaccg ccgtggtgct ttgctcgctt 2340 ggattttctc caaccaagga tggggtactc tggctggctg tgttgctact ttgattattt 2400 tggcttgttt cgaaaaacct ttgaatgatc gcggtgagta tacgaagctg aatggcgttt 2460 ggcgcattca gtttggcatt gctctctttc cagctgtgat cgttttgatt ccacgcttgc 2520 gtatgcagga atcagaacaa tttaagaatt caaaaaacat gaaatctccc ggggagggtg 2580 atcttgattc tgcctctcaa atcgaattac atgatttcaa gaaaaaagag tccttaactg 2640 cggagtttac aacttcttcc ccttctacag cttccttgtc cgataaaaag aacccaggtt 2700 ccgttcatat ccgtcccaat aacgaagttg ctcccagcag tgctcctagt cgtgctcctt 2760 ctactacctc tgtcgaatct aatacggaag gtaaggagtc tgatattcaa accggttctt 2820 cgtttgtttc atactttaaa gagtggagac atgctaaact tttaattggg tctgcgcttt 2880 catggttttt gcttgatatt gctttttacg ggataaattt gaatcagtct gttattcttc 2940 aagaaatggg gtttaacaaa ggtgtcaacg aatatcatat tcttcaaaag aatgcaatcg 3000 gtaatctgat tattgcagtt gctggctaca ttcctggata ttgggtgagt gtcgtattga 3060 ttgaggtaat gggccgaaaa tggattcaaa ttcaaggttt ccttatttgc tgtcttttat 3120 ttggcgtttt agctggaaca tgggaaacta tttcaactgg tggtcgcttt gcgtgcgtcg 3180 ctcttgccca atttttcttc aactttggtc caaacaccac ctgctttgtc attcctgcgg 3240 aagtgtttcc ctccagagtt cgtgcttttt cacatggcat atgcgctgca tgtggtaagg 3300 ccggtgccat tttatctgct cttctattca ataaacttac cgaggtaatt ggttttggca 3360 atgttttgtg gatattcttc ggatgcatgg ttgctggagc tgttgttact ttaattttgc 3420 ctgaaactgc taatcgtgat gcggatttga ttgatcgtct tgaaattgct gccatgcaac 3480 aaggtagaac ttctattatc gatagaagcg agaagtgggc ctggtggaaa catggcattt 3540 aaagcatttc cgagttttgc ttatatgaaa gccatacact taggtgcttt agtaattttg 3600 ctggactaaa ttagcccttt tttttgctaa gggataaacg ggtttccttg gttctctaca 3660 gaggctgctt tgactaatct attagtattc ctattctgct cgcgctataa tatttcggcg 3720 tgataaaggt gttatgtgct caggtcagtg ttagagtgga tcgttttgat ggctaactcg 3780 acactgtact tattattatt ataggttcaa gattacacat ttacggttcc agccttgcag 3840 attattgttt gaatgcaaag tatggacaaa gatatgctac cttttctatt ttaaaactcg 3900 tggaatataa ttgctttcct ggttttatta gttcatttca ttccctgtta acttatagaa 3960 aaaagcctca taggcaaaat acttctaaca tgcaggggct tatgcgttta gttgcttgtt 4020 gaaggacttt acggatttgt ttttgccagt tttgaacatt gtactgaaaa gcagtgtcga 4080 aacgtaacaa aaaaaaaatt tgggattttt ttaaaaatta tgattggggt ttaatttagc 4140 agtttcattt agttagcttt tatgttaaaa aatttgatgt ttatttgttt cattttatac 4200 gttaaaagta ggaatctaat tcagtaaata tggggcaagt aagcaattcc taataatcag 4260 taatttttta taaactatat attgatatat atgtttttac aaaaactgtt tttagacgcc 4320 ttgtttaatc tgatatttac aataatgttt ttgttgtaat ctttaaatgt gtagcacttc 4380 ctgtctacga aatatgtact ccagctgagg tttgctatat ctatgagcgc agcttggcgg 4440 aaaaaaataa cgaatgccca tcgaaacgag tgtttacaag tttacagaag tcttctcaga 4500 tcttgtgatg aaatacaaaa tcgccctctt ggggaaaagc tctcaaaaat agtgaaagca 4560 agaacacggc attatcaaaa tgtagggaac gcctacaaag ccgaagcact tttaaaagat 4620 gctaagaaag taaggatata gtaaattata gtgagaactg tagaaaatga aataactaaa 4680 gctgatgact tagcaatgcg agactctcaa gtcagcatta aacggtaaat tgcctgaggt 4740 aaatatagaa tctcatatgc tatctaagta tcgaaaatac ttaaatgaag gtggaaaaca 4800 aggagatgtt gggataacaa aaccgcatac gaaaacaaaa aagctaaagt ccaattcaat 4860 atttaattat gtgcgcaatg gtgaagtgct atatcattta gtaagtacta ctggaggaga 4920 tacgttttta cgccccaaat tctggcctca aagtcagaga ataagtggaa tgttaaaaaa 4980 aaggataact atgaaagaga ataaacatcg tctcatccaa agcatgagct gtatgctacg 5040 ccatgctaaa cttgaagatg aatttatgaa ggcatacatt tcagaggagg atgttgggta 5100 tgctgatgag attcataaat gcatacaaat ggttttaagt gatctcgatt ctatttacaa 5160 acgagaatca aaaaattggc atatcgtttc gcaattttac aacatgcttg agttagaagc 5220 agctagacaa aaactcgaaa atagatatct aaacttttgg aaggtatctc gacgctcttc 5280 accttgatgt atttaattct ttaattagct tctgagagaa gtttacgacg aatatcataa 5340 ttattattat tttgtatata tcaaacataa aaaaatttta gaaacataca ctcgccttgg 5400 agacgtaata aagttgtaaa gcaagcaatt cgtgcattgc agttcttaat cgcggcagta 5460 catatacagt agtattcaca ttgtgcttgc gaaactcttt tcccaaaata acaagcttaa 5520 aaatttacat ggtatacttt gcggaaatac aacaatcatt agcccaaccg tcaaactacg 5580 aatagcgttt tctatttgct tatatgatct gatgttagag gagaaccttg ctgtttttga 5640 gaactgagcc attggtcttc tatgtcctga agcatttgta aatcttggtc atcaaacgaa 5700 aattcttggc cctcaatgtc atcttccaac gtagcaagtg atgaagtatc atccttcttt 5760 tccgctaaac cattgttttc cagggattcc tgagtagata tttcgacatc attatcatcc 5820 tcattatctg tcttctcttc ttgttcttcc cttacccttc tttctagtga ttttttgcga 5880 tctatagttt gtcgtatcac gctcaccaaa ctagaaaccc gtgacagctt atttgcagtc 5940 tctgtttcat cttcatagga gctttccaat gtccttgacc tcttaagtcc tcctatatta 6000 tttaatctct ctttaaaatt acttgacgaa ggaacaggca atgaagatgt ccgttgaata 6060 gtggattgtg cttttggttg taagcgatcc catagcttat gaaaaggatc atgcttccga 6120 gtagcgctac tagtactgga tttacaaaaa agccttgaga catcatcagg agacatggct 6180 ttcgctttta aaggagaaac tttccatgag acgcatggtt g 6221 //