ID SPBPJ758 standard; DNA; FUN; 2044 BP. XX AC AL162532; XX SV AL162532.1 XX DT 28-MAR-2000 (Rel. 63, Created) DT 28-MAR-2000 (Rel. 63, Last updated, Version 1) XX DE S.pombe chromosome II PCR product pJ758. XX KW RNA recognition motif(x2); rna-binding protein. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-2044 RA Harris D., Wood V., Rajandream M.A., Barrell B.G.; RT ; RL Submitted (28-MAR-2000) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk XX DR GOA; O60176; O60176. DR SWISS-PROT; O60176; YG41_SCHPO. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC PCR product pJ758 is overlapped at the 5' end by cosmid c21C3, CC EMBL entry SPBC21C3, accession number AL157918 and at the 3' CC by cosmid c23G6, EMBL entry SPBC23G6, accession number AL023287. XX FH Key Location/Qualifiers FH FT source 1..2044 FT /chromosome="II" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /map="IIR" FT misc_feature 1..103 FT /note="nominal overlap with cosmid SPBC21C3, EM:AL157918 S. FT pombe chromosome 2" FT CDS complement(1244..2011) FT /codon_start=1 FT /db_xref="GOA:O60176" FT /db_xref="SWISS-PROT:O60176" FT /label=SPBC23E6.01 FT /note="SPBC23E6.01c, SIMILARITY:Schizosaccharomyces pombe, FT CSX1_SCHPO, rna-binding post-transcriptional regulator FT csx1., (632 aa), fasta scores: opt: 509, E():1.9e-25, FT (39.0% identity in 223 aa)" FT /partial FT /gene="SPBPJ758.01" FT /gene="SPBC23E6.01c" FT /product="rna-binding protein" FT /protein_id="CAB83010.1" FT /translation="MTDPQTNVSRGYGFVRFTDENDQKSALAEMQGQICGDRPIRVGLA FT TPKSKAHVFSPVNVVPVSMPPVGFYSAAQPVPQFADTANSTVFVGGLSKFVSEEELKYL FT FQNFGEIVYVKIPPGKGCGFVQFVNRQSAEIAINQLQGYPLGNSRIRLSWGRNQNPIAA FT PALNYQSQVSQTTIPATSLFPAMSLPPQAQFSPYPAVAPSPLALQTRGAPIGMEISIGS FT PALVPDQMHIPENGNSDTMPVPNTQGKHLSAEE" FT misc_feature complement(1559..1753) FT /note="Match to PF00076 rrm, RNA recognition motif. (a.k.a. FT RRM, RBD, or RNP domain) Score 79.60" FT misc_feature complement(1886..2041) FT /note="Match to PF00076 rrm, RNA recognition motif. (a.k.a. FT RRM, RBD, or RNP domain) Score 38.13" FT misc_feature 1945..2044 FT /note="nominal overlap with cosmid SPBC23E6, EM:AL023287 S. FT pombe chromosome 2" XX SQ Sequence 2044 BP; 705 A; 351 C; 381 G; 607 T; 0 other; ttatgttgtc tgcttctttg gatcggtgag taaattattt agtagatcaa aattcattta 60 taagctggat ttcaataatg tgtacacaat tatgttaatg atcgttccag gtctagagat 120 taaaactttt tcacaggtta aaaaataaaa aataaaatta attataagaa ttgtcgcttt 180 ccagagtctt atttaagagt atcgagccag gttaataaag tgtaacgtca aaggttgaat 240 actaatcaca gcgttgacaa agtatataaa cgcacgagtt aagcccgttt atacttaatc 300 gcaaaactga atttgatgat gtattcgtat taatgggaaa ggctacgatt taaaaagatt 360 ttgagcctat actataagtg cgttttctgt cagttgttta ttgcgacagt cgcccccctg 420 gcgtagtaga agttgcaagt cgcgttgtta ctctgtaaac aagagcttta aaaatggagt 480 agatttaaat ggatgaaaag ttaataatag attttaattg aatggatttt tcgataaata 540 gggagtatca ttcaaataaa attgaattta gaggcggttg atttttggtt gaatgaaatg 600 ttttgaaaat ggcgctgatc atttaactaa actcatacct aaaagaacga atgattaaac 660 cagaagcaac ttaaaagttg tttatcaatg ggaaatctca ataagcactt acataccttg 720 cctaaactgc attctccagc gacctaaaaa tctcagtgcg ataattctga agtccatcat 780 tactattaaa tttaatcttt tatatacgtc gtatgaaaac ctttttcttg actataaaaa 840 tcatgacaaa tgaaaatttc ggctaaaaaa acgtagcatt tcatatcata aacagggaga 900 gatgaatata aaaaccgtca atccttagaa aggaaacgta atgaagctat gtccaacaaa 960 aatttagttc aagcttttct tcgataattc gaattcatga acacatgtaa aatgcttatt 1020 aatcaattat gctaagggac tctcttgatt taacgtccaa aataacagat agagtttaca 1080 agaactctca cgtttctcga aacaataatt ctccgtgatt cattatgaac atcaaactta 1140 ttagaataga tggaaaatta ttcatacttt ataaaaacca agaatgatga taaaaagtat 1200 gagacatgcc attataaaaa attcaatcgc aagtactttt acattactct tcagctgata 1260 aatgctttcc ttgtgtgtta ggaacaggca tggtatcaga gttgccgttt tcaggaatat 1320 gcatttgatc aggtacgagg gcaggagatc caatcgaaat ttccatgcca atcggtgcac 1380 ccctcgtttg cagggctaag ggggaaggtg caacagccgg atatggacta aactgagcct 1440 ggggtggtaa actcattgcc ggaaataatg aggtagcagg aatagtagtt tgtgaaacct 1500 gagactgata attcaaggca ggagccgcaa ttggattctg attccttccc caagataatc 1560 ggatacgaga attgcccaaa ggataaccct gcaactgatt gatagcaatt tcagcagact 1620 gtcggttgac aaattgcaca aatccacaac cttttccagg tggaattttg acataaacaa 1680 tttctccaaa attttgaaaa agatacttaa gctcctcttc agaaacaaat tttgagagac 1740 cgccaacaaa tacggttgag tttgctgtgt cagcaaactg aggaacaggc tgtgcagcac 1800 tataaaatcc aactggtggc atactaacag gtacaacgtt cacaggacta aaaacatgag 1860 ccttactttt aggagtggct aatcccaccc gaataggacg atcaccacaa atttgaccct 1920 gcatttctgc caatgcagat ttctgatcat tttcgtccgt aaaacggaca aaaccatatc 1980 ctcttgacac atttgtttga ggatcagtca tgattttggc cgacttgcat gagttatatc 2040 gaga 2044 //