ID SPCC1529 standard; DNA; FUN; 4250 BP. XX AC AL132664; XX SV AL132664.1 XX DT 26-OCT-1999 (Rel. 61, Created) DT 14-MAR-2001 (Rel. 67, Last updated, Version 2) XX DE S.pombe chromosome III cosmid c1529. XX KW low-complexity gene-free region; membrane transporter; sugar transporter. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-4250 RA Seeger K., Harris D., McDougall R.C., Rajandream M.A., Barrell B.G.; RT ; RL Submitted (26-OCT-1999) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk and XX DR GOA; Q9USN4; Q9USN4. DR SPTREMBL; Q9USN4; Q9USN4. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC Cosmid c1529 is overlapped at the 3' end by cosmid c794, EMBL CC entry SPCC794, accession number AL023595. XX FH Key Location/Qualifiers FH FT source 1..4250 FT /chromosome="III" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /clone="cosmid c1529" FT /map="IIIL" FT misc_feature 1..2701 FT /note="low-complexity gene-free region" FT CDS join(2702..3071,3123..3228,3287..3681,3736..4250) FT /db_xref="GOA:Q9USN4" FT /db_xref="SPTREMBL:Q9USN4" FT /label=SPCC1529.01 FT /note="SPCC1529.01, len:462, SIMILARITY:Penicillium FT olsonii., CAB46647, putative multiple drug resistance FT protein., (652 aa), fasta scores: opt: 870, E():0, (33.7% FT identity in 466 aa)" FT /partial FT /gene="SPCC1529.01" FT /gene="SPCC794.14" FT /product="MFS multidrug efflux transporter" FT /protein_id="CAB59612.1" FT /translation="MDLEKKPQDKVETAEIDVKEDPFLVTWQSPTDPKNPKNWIYARKW FT TQLILVSAFALLGPMASSMVAPCLDQIADRFHIQNSTEKALILSIYLLVFAISPMISAP FT LSEVFGRRMLLQVGNVIFIVFNMACGLARTKAQMYIFRFLAGFGSATPMGLGSGTISDL FT FTPDERGKAVAVMSLAPLLGPTIGPVVSGFIAEYTTYKWIFWSTTIFSGFIFALSLPLL FT AETYPKTLLGEKARKLNKSEKTNKYHTEWNLEHIPRLKLVTPALMRPIRMLFTQPIVIL FT CSTYMAIQYGILYLVLTTYPTLWTEEYHERPSIAGLNYIASGIGLIFGSQASGIFIDKT FT FRYLKRRNNGKMAPEFRVPVILLGTFFFPAGLFIYGWTAQYHTHWIGPDIGAAMFNIGL FT MLGWRGIQTYLIDSFMIYAASSTAVACCVRSIAAFAFPLFGQDMYDTLGYGWGNSLLAF FT MYVF" FT misc_feature 2834..4087 FT /note="Pfam match to entry PF00083 sugar_tr, Sugar (and FT other) transporter," FT misc_feature 3072..3077 FT /note="gtaagt, splice donor sequence" FT misc_feature 3106..3122 FT /note="ctaaccttttgtcaaag, splice branch and acceptor" FT misc_feature 3229..3234 FT /note="gtaagt, splice donor sequence" FT misc_feature 3267..3286 FT /note="ctaacgttttctttattcag, splice branch and acceptor" FT misc_feature 3682..3687 FT /note="gtaggt, splice donor sequence" FT misc_feature 3723..3735 FT /note="cttaccgttatag, splice branch and acceptor" FT misc_feature 4150..4250 FT /note="nominal overlap with cosmid SPCC794, EM:AL023595 S. FT pombe chromosome 3" XX SQ Sequence 4250 BP; 1243 A; 820 C; 875 G; 1312 T; 0 other; gatcgtaaac aaatgggtat gcgattatac tgcttccacg cttacgcttt tacgcttacg 60 attgttggat tggaataggc agttgaatga gagagaatgg gaatcaacca aagatacagc 120 cacaagaagc aaaaaaaaaa agaaattgca ggtgtattgg tatggtgtga tatgtaccgc 180 tagtgtgaat taaccttgaa aacgaattca acgaaaactg tgatcacaaa taagaaaaac 240 gatccttgtt cagcagggta tatatatacc tgtgaggtta ggattgatat tatacctgcg 300 aagggtaagt ttaggaaaga tcagagaatg tagctcgaaa gatgccttta ccttccaact 360 ttttcacact ttatatatgt tatggaatag agcctccaac tcaaccaaat gcggggatac 420 tgcaaattca atctaccatt tctacgtagt attgtttagc ttttcatatc aacctgggtc 480 gagacactta gtgtgcaaaa tagtcatcct aaacaaactt tccacaacac cctttaggaa 540 gtgtaatgtt gcataccaag cccaaatcca ttcacttcct ctatgcctgt taaactcaaa 600 cgcaaatttg cctaccaacg atgcgggaca ccagtccaaa tggttgtgag aatttagtag 660 ggaacaagtt actatacgta agggtcgatg ctgcagcccc aactccccgc taactatggc 720 tgggcagaaa aagtctgggg caatagtagt agaaagttag ttaatcggtt gagtgggatg 780 aatgggattt gagagaaagg ttatgggtat gaaagtggtg gatagagagt gaaaaaaaaa 840 ggttagagga aaagtagaaa aatagagaag gaaagtagat aagaaaaata atcaataaag 900 gtagggcaag acagggttaa acgaaaaggt gatagtttga acattgttta catgaaaaga 960 aaaaaaaaag gcgaccgttg tatgaacatt tttgtcgatc gatcgatcaa tcgatcaatc 1020 gattctccta cctaccaacc acccacttcc aactattcct ctctacctta ctcccctata 1080 ccttatcctt tgcatcggag ggttttcctc ggttagtggg caagaaaatc catggtgacc 1140 acgttataga gtggggttcc ttcatgatgg cgtgttacat ggggttttgg ggtatgagcc 1200 tattacaccc aaagcattat gctttgatac gttgtcacaa ccccgctcat tacttccact 1260 cggttggcag gacgaatgat cgttggctaa gcgatcgttt aggtgcggag aagaaggatc 1320 gcttctgcca tagtgattgc ggcacctctt tagtagtaac ttaaaaaaaa acaaagtttc 1380 gctatgggaa gatgatgaaa ttcgctaaag acaagtttcc ggaaaatcca atgattggtg 1440 atgttgaaaa gcggggattg ccatgcattg atgcaaatcg cctaatgcaa attttccgtc 1500 caattcacca ttagctcatc caggtaagat gtctgaacac acatcggatc acctccacca 1560 atgtttgctt taggtaaaca ctcccatcga atgagaaaag ggttgttgac aactaaactt 1620 tgctgtgcac cggggaagtt gcggggcctg taaatttgac agtagcggaa aaacttcaaa 1680 ggactgaagg tttgagaaat aagcttttct ccagtcatgc agaaaacgaa ataaaaacgc 1740 acagtttcgc cgggtttcct tcgggttcct ttcctttggc ttttacaaca ttctcgtctt 1800 gacatcatca cgaaccaact ctcgatccaa tttaaggtaa aaacgaatgt gtgggtccaa 1860 tgcgggcaac gcaaaatgac tgttgcgtag gataaagttg gcgcttgaaa agtatcattg 1920 tgtttactgt gagatactgt aaagcattta aggtgatgga tggtaaatga gcgcaacgaa 1980 cggtgaatta aggtgaacta aggtgatggt tggtaaaacg taaacaatag atagtgaaac 2040 tgtgggatac tataaatcaa cattacaact gcaacaagaa atgttctttg gccatcatta 2100 acacaatacg tcaaatacac actaagagga cgaaaccctg ttcatcctcc ctggggtggg 2160 gggagggggg ataggtaatg tcgaagaaaa gaaaagaaaa aaaaaaaaaa atagcaaaaa 2220 gaaaagaaaa aaaaaatttt ttttcttttt cttaatcttt ttgctagttt ttctttctat 2280 gtgaagggag cggacgagtg tcctttcttc aaccgtcttt tctttcttcc cttttgatca 2340 atcatcataa atggatggat acacgtaccc cctcgttgca tggtgaaaaa aaaaatttcc 2400 tccccccaaa aggcttacac atgtgtttgt gctaagtagg tgaatttttg caaaattttt 2460 taccccttcc tctctctttc ttttgttttg ttgtattttc ttttgtattc atttagtttt 2520 attaaactcg ttatttgctt tgcttggcat ctcttgactg cgtaaacttt atttttttca 2580 tctcaatcgc taccattttt tcgcaaatca aatacaaaaa ttttcatcct taatcgagtt 2640 tcatcgcgta cgttttggtg attgttggtc gcttaatttt tttctttcac aaaattcttt 2700 tatggatcta gaaaagaaac ctcaagacaa agtagagact gcagagattg acgtgaaaga 2760 agaccctttc ttagttactt ggcagtctcc aaccgatcct aaaaacccta aaaattggat 2820 ctatgctcgt aaatggaccc agctaatctt ggtttctgcc tttgctctct tgggtcccat 2880 ggcttcttcc atggtggctc cttgtttgga tcaaattgca gatcgttttc atatccaaaa 2940 ttctacggaa aaggccctta ttttaagtat atatttgctt gtgtttgcaa tttcacccat 3000 gattagcgct cctttatcgg aagtttttgg acgtcgtatg ctattgcaag ttggaaatgt 3060 gattttcata ggtaagtctt tcttctcctt ttcacttctt tccagctaac cttttgtcaa 3120 agtcttcaac atggcatgcg gtcttgcaag gacaaaagcg caaatgtaca tttttcgatt 3180 cctcgctggt tttggtagtg ccactccaat gggtctcggt agtggaacgt aagtatatat 3240 attttttccc ttaaaagatg aaatgactaa cgttttcttt attcagtatc agtgatctgt 3300 tcactcctga cgagagagga aaggccgttg cagtgatgtc ccttgctcct ctattgggtc 3360 ctactattgg ccctgtcgtt agcggattca ttgcagaata tacaacttac aagtggatct 3420 tttggtctac aaccatcttc agtggcttta tatttgccct ttctctgccc ttgcttgcag 3480 aaacatatcc caaaactctg ctgggagaaa aagctcgaaa gctcaacaag tctgaaaaga 3540 caaataagta tcatactgag tggaatctcg agcatatccc taggcttaaa cttgtcactc 3600 cagctttaat gagaccaata cgaatgcttt tcacccaacc tatcgtgata ctatgttcta 3660 cgtatatggc tattcaatat ggtaggtaaa atagttatac tttcttcaat gaaacacctc 3720 aacttaccgt tataggtatt ctctatcttg ttttaactac ataccccaca ttgtggaccg 3780 aagagtatca tgaacgacct tccatcgctg gcctaaacta cattgcttcg gggataggcc 3840 ttatctttgg cagtcaagca tctggtatat ttattgataa aacgtttcga tacctaaaac 3900 gtagaaacaa tggaaagatg gcacctgaat ttcgagtccc cgtaatttta ttgggaactt 3960 ttttctttcc agctggcttg tttatttatg gttggactgc gcagtatcac actcactgga 4020 ttggtccaga tattggcgca gccatgttta atataggttt gatgttaggc tggcgtggta 4080 tccaaactta tttaattgat tcttttatga tttatgcagc atcgtcgaca gcagttgcct 4140 gttgtgttcg atcgattgct gcttttgcat tccctctttt tggccaggac atgtatgata 4200 ccttgggata tggatggggt aactctcttt tggcatttat gtatgttttt 4250 //