ID SPBC1652 standard; DNA; FUN; 5454 BP. XX AC AL163525; XX SV AL163525.1 XX DT 07-APR-2000 (Rel. 63, Created) DT 07-APR-2000 (Rel. 63, Last updated, Version 1) XX DE S.pombe chromosome II cosmid c1652. XX KW aap1; amino acid permease; yeast sin3p-binding protein. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-5454 RA Davis P., Churcher C.M., Lyne M.., Rajandream M.A., Barrell B.G.; RT ; RL Submitted (07-APR-2000) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk XX DR GOA; Q92367; Q92367. DR SPTREMBL; Q9P7B0; Q9P7B0. DR SWISS-PROT; Q92367; AAP1_SCHPO. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC Cosmid c1652 is overlapped at the 5' end by cosmid c16G5, EMBL CC entry SPBC16G5, accession number AL023554, and at the 3' end by CC cosmid c16A3, EMBL entry SPBC16A3, accession number AL021748. XX FH Key Location/Qualifiers FH FT source 1..5454 FT /chromosome="II" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /clone="cosmid c1652" FT /map="IIR" FT misc_feature 1..451 FT /note="nominal overlap with cosmid SPBC16G5 EM:AL023554 S. FT pombe chromosome 2" FT CDS 445..1605 FT /db_xref="SPTREMBL:Q9P7B0" FT /label=SPBC1652.01 FT /note="SPBC1652.01, len:386, LOW SIMILARITY:Saccharomyces FT cerevisiae, Q12427, hypothetical 56.3 kd protein, FT sin3p-binding protein (Sin3p broad spectrum transcriptional FT regulator complexes), (513 aa), fasta scores: opt: 200, FT E():9.2e-05, (28.6% identity in 248 aa)" FT /gene="SPBC1652.01" FT /product="hypothetical protein" FT /protein_id="CAB86886.1" FT /translation="MDLEMTTASDFAEKKESVDVKRENPSFTNSIKALPIQSELSAMMS FT KDNTLNHVKQEPSDGFSSVKWASTSFDSTVTAHKEETPYSSSLGSHDSSSLPSSTNNRY FT SSVLKELCNTYLPSILSTYGSLPIRRLLHHLSLMLPSFNELTPTQQRRLLTRALESKKG FT IQFEKIGWGRWVLRDSTIPANSHPQSLPSNSIPKTEPLDTPSLSNSQERFSKSPSDGQN FT VRSRAKKIPSGMSAEESDELLSSSGRKTRSFNTPFSSFFASLEETPGSYTAHLGGVLSP FT REQTPALFTGQYPDNGVYYEEEEEEEDDNDLFDEHEYGYGTLPPFVFDEDQMLDGGEST FT DEEDWRAIGTEALLRKVNTKKRRPSRVLVHRDQVAVEAMLMLSGSV" FT CDS 3674..5454 FT /db_xref="GOA:Q92367" FT /db_xref="SWISS-PROT:Q92367" FT /label=aap1 FT /note="SPBC1652.02, len:>593" FT /partial FT /gene="aap1" FT /gene="SPBC16A3.20c" FT /gene="SPBC1652.02" FT /product="amino acid permease" FT /protein_id="CAB86887.1" FT /translation="MTSFTESKKLDEIESPVPEIIVPSTTNGNGTIESYKEKSVGTSFL FT DFFRSYKLRPDNEFAEVHNSEDFLKPRHLQMIAIGSCIGTGLFVSTGKSLKNAGPGSLM FT INFIILSAMILALILSLGEMCCFLPNQSSITMYTGRLLNNNIGFAQSWLYFWIWLTVLP FT SEISAACEVVDFWTTQHLNPAIWVTIFLAYVVLVNAFGARSYGECEFVSSFLKVVIVII FT FFFVAIIINCGAAPKGGYIGAHYWHHPGSFRNGFKGFCSVFISSAYSLSGTENIGTAAG FT NTSNPQRAIPSAVKKVFYRMGFFYIITIFLITLVVPYDNPDLGNVSPFIIAIKNGGIHV FT LPHITNAVILVSVLSVGNAAVFAASRNAMALVKQGWAPRFLGRVDQKGRPVISYLCSLA FT MACIAYVNAAPDGSVVFDWLMSVSGGGAFVIWGLSFIDHIRLRYAMKAQKIPDTVLPYK FT FPGSVYLSYYGVLINFLALCALVYISIFPVTHEKPSAYGFFVSFLGPSVFIAYLLISPI FT FVKPTFQSLKDVDLTTGRYDLVNSQMYVAESSTSELSEKDLTKPNLQSNDNKNSEDLES FT NTPPQKKSALQKVADFL" FT misc_feature 3879..5234 FT /note="Match to PF00324 aa_permeases, Amino acid permease FT Score 533.70" FT misc_feature 5390..5454 FT /note="nominal overlap with cosmid SPBC16A3 EM:AL021748 S. FT pombe chromosome 2" XX SQ Sequence 5454 BP; 1574 A; 1072 C; 992 G; 1816 T; 0 other; tatctgtcca gtcacgctgc taagagacat attggtaaat acggtggctg attgatttaa 60 cgatacagat tgtgcggaat tgggttgttc cgtctccatt taattattga cgtttatctg 120 tccttgttgt gtacccgctt atttaatacg gtgttctggg gccgatttcc ttagggtgtt 180 ttgttcaagg agctcggctg gcatattgtg ttatcgacaa ataagcggct gtactgcagt 240 atattttata taaaaatcgg acgcgcagga gttggtgttt catttactac atatcttccg 300 aactattctg agcatatttt ctgtttgcct ctttaaatta gttcccatta cattcttttc 360 ctaatttttc cctattaatt tctttgtgct taatttctcg agtctttctc ttaattactc 420 tacttcacaa tgaatatcga taacatggat cttgaaatga ctactgcttc tgactttgct 480 gagaagaaag aaagtgttga tgttaagaga gaaaacccga gttttacgaa ttccatcaaa 540 gctctgccta tacaaagtga actatctgcg atgatgtcga aggataacac tttgaatcat 600 gtcaaacaag agccttccga tggcttttcg tctgtcaagt gggcatccac ctcatttgac 660 agcacggtga ctgcacacaa agaagaaacc ccatattctt cttcgttagg ttcccatgac 720 agttcatctt tgccatcgtc aacgaacaat cgttattcat ctgtgttaaa ggagctctgt 780 aatacatatt tgccttcaat tctgtcgact tatggctcct tgcccatccg ccggctgttg 840 caccatttat ccctgatgtt gccttccttt aacgagttaa cgccgaccca gcaacgcagg 900 ctgttaactc gtgctctcga atcaaaaaag ggcatccaat tcgagaaaat tggctggggg 960 cgttgggtac ttagggactc gacgataccc gcaaacagcc atcctcagag tttgccctct 1020 aattcaatac cgaagactga gcctctcgac actccctcct taagcaattc tcaggagcgt 1080 ttttcaaaaa gtccgtccga cggacaaaat gttaggtccc gagccaagaa aataccgtcc 1140 ggcatgtccg ccgaagaatc tgatgagtta ttgtcgtcat ctggtagaaa aaccagatct 1200 tttaatactc cattttcctc tttttttgca tcgttagaag agactcctgg cagttacaca 1260 gctcatttag gaggggtatt gagtccaagg gaacaaacac ctgctctgtt taccggtcaa 1320 tatcctgata atggtgttta ttatgaagaa gaagaagagg aagaagacga taatgattta 1380 tttgatgaac atgaatatgg atatggtacc ttgccacctt ttgtttttga cgaagatcaa 1440 atgttagacg gaggcgaaag tactgatgaa gaagattggc gagcaattgg taccgaagca 1500 cttttgcgaa aagtgaatac gaaaaagcga cgtccctcaa gggttttggt ccatcgggat 1560 caagttgccg tggaagccat gttaatgcta agtggatctg tttgaagcta attatttatg 1620 cagcctgaaa tgcgatcgaa ggcagtataa atttttttaa catttgttaa tgattgtttt 1680 ttcaatcatg aatgttacga tattagttta tagaaaaatt tgttttagta atcaaggttt 1740 atgatgcttt atcatcatca gcctattaca catatacatc catttttaaa ttccagttta 1800 gattttgttt tactggtatt ttatacacgg ttagacagac tttctgtcta atggagatga 1860 atgattgata aaagggtttg ctttcctatc aacactggaa agcttgaatt actgcttatc 1920 aactagcaaa ttacaaattt ttgaagtttt ccttgaaata acctcatctt tttcgtcata 1980 tgttttgatt tagttgtctg atcaagagtc ctttcggtcc ttgaaataac gtacggatcc 2040 attttttgtt ggattaattg gcgacgactc aagagtgaag tttaaaatcc ttcaatccta 2100 aattgatggc cgggttattc acaagaatac cttttatcca gcttgactcg gaagcttgtt 2160 ttatatacaa ttgctttact atgccttgtc ttttaatgtt tgttatgacc tttatgaccc 2220 acgtacttac cttatcctac cctactcttc actaattttt atttttgtaa tgcgttatta 2280 cttttaatta tctctccttt aattattaat taatcatagc taataacaaa acgaatacat 2340 ttcaatcaat tacttgctta aatcacatta taaataatgg tggaaggggg atcaaaaaat 2400 gctggaactg tatgaaggct gttatataaa ttgtaacatt tttaaattat atgcaccatc 2460 aatattgaat tcaataacat tttattacaa aactaagcat cataaaatta acacatagat 2520 ttgtttatat tccgtatgac tagacgcttt ctaaaacgca acagcaaatt tatagaataa 2580 ttaaaattat ctttacaaaa gtacattcgg ttgctttgct gttagcacta gccttgatgt 2640 atcaattaca gatgaatgat gacattcatc tttcacttga aaatcagttt tgtactcata 2700 attccactat tcctttgcaa accttcttcc ttttttatac attggcaggt tgattcgtta 2760 ggttattgat tagattttac tatcaacagt tgggactaag cattcagacg caaaggcatt 2820 attaactaag cacagtaacg gactaaacta agtacacttt agttactgag atttcttgtt 2880 gcccccacat agcactttta agtgaataca cagtctcaag tctaaaatag tctatcccac 2940 atcccccact tagacccaaa tattgtgaag gttgactaaa cagttctact aggtaattat 3000 tggtgcaaac ttttagatat attttctaaa accatcacaa aagaatctct aaaaatcaaa 3060 cattatcctg ttttttaatt atactcctac tgacataatc aagcttctat aaaaaccgaa 3120 agcgtttatc gaacttttgc aacagctcgt taattattat ccgtttcgct aacatcccaa 3180 ccgaaaccaa agcgggatcg ccctgcgcaa aatcaccttt cccaaaactc gagaattgtt 3240 agtctctcag gaaaaccaag tcagctaatc aaattgccga ctatatctta cagttctata 3300 tttataaaat tagagttaat taataaacgg gctgttaata agagggattg ggcaaaacat 3360 cacgaacaaa aagttttcaa tattcatagt aaaaatcatt aaaagagatt tacttgaggg 3420 ctgctcgaaa acaaccaaaa tatcgattaa ttcactgttt tgctttttat ttttttaaga 3480 aagaattata agctacatat tttctgacga ccaccattat tactctttaa aatcacctgc 3540 acctgccaat ttgaattttt tccccgtctt tcccataaac atcagttgaa tagtgtgcag 3600 ttttttttat ttactttttt ttagtgtcgt tgtttaataa actgtaaaaa aaaaataaaa 3660 cttatcatac aatatgactt cgtttacgga atcaaaaaaa ctggatgaaa tagagtcgcc 3720 agttccggaa attattgttc caagtaccac caatggaaac ggaacaattg aatcttacaa 3780 agaaaaatcg gttggaacct ctttcttgga tttttttcgc tcctacaagt tgcgcccaga 3840 caatgagttt gctgaagtgc acaacagtga ggactttttg aaaccgaggc atttgcaaat 3900 gatcgccatt ggtagctgta ttggtacggg tctttttgtt tccactggca aatctttaaa 3960 aaacgctgga ccgggaagtc taatgattaa tttcattatc ctcagcgcaa tgattctcgc 4020 tttaattctt tctctaggag aaatgtgttg tttcctacca aatcaaagta gtatcacaat 4080 gtacactgga cgattgttaa ataataacat tggttttgcg cagtcatggt tatatttttg 4140 gatatggctg accgtcttgc ctagtgaaat ctctgccgct tgcgaagtag ttgacttttg 4200 gacaacacag cacttaaacc cagctatttg ggtaactatc ttcttggctt atgttgtctt 4260 agttaatgca tttggagccc gtagctatgg tgaatgtgaa ttcgttagtt catttttaaa 4320 agtcgtcatt gttatcatct tcttcttcgt tgcaattatc atcaactgtg gagctgctcc 4380 aaaaggtggc tacattggtg cgcactattg gcatcatcca ggatcgtttc gtaacggttt 4440 caaaggtttt tgctccgtat ttatttcctc agcctattcg ttgtcgggta ctgagaacat 4500 aggtacggct gctggaaata ccagcaatcc tcaacgtgca attccaagtg ctgtgaaaaa 4560 ggtgttctac cgtatgggct tcttctacat catcaccatc ttcttgatca ctttagtagt 4620 cccgtatgat aacccagacc tgggtaatgt cagtccattc attattgcga tcaagaatgg 4680 aggaattcat gtactaccac atatcaccaa tgctgttatt ctcgtttccg ttctctctgt 4740 aggtaatgca gctgtattcg cagcgtctcg caatgcaatg gcgttggtaa agcaaggatg 4800 ggcaccacgt ttcctaggtc gtgtagatca aaagggacgt cccgtaattt catacttgtg 4860 ttcgctagcc atggcttgta ttgcttacgt taatgctgca cctgacggtt ctgttgtgtt 4920 cgactggcta atgtctgtct cgggtggagg agccttcgta atttggggac tttcttttat 4980 cgatcacatc cgtttgcgat atgcaatgaa ggctcagaaa attccagaca ctgtcttacc 5040 atataagttt cctggaagcg tttatcttag ctactatgga gtccttatca attttttagc 5100 actctgtgca cttgtataca tttcgatttt cccagtaaca catgagaagc cgagtgcata 5160 cggcttcttt gtttcatttt taggaccgtc tgtttttatc gcttatcttc tgattagtcc 5220 gattttcgtg aagccaacgt ttcaatccct aaaggatgta gaccttacaa cgggccgtta 5280 tgatttggta aattcacaaa tgtatgtagc tgagtcgagc acttcggaat taagtgaaaa 5340 agatctaacc aagccaaacc tccaaagcaa tgataataaa aatagtgaag atctcgaaag 5400 caatactcct ccacaaaaga aaagcgcgtt acagaaagtt gccgacttcc tttg 5454 //