ID   SPAC1705   standard; DNA; FUN; 2358 BP.
XX
AC   AL163529;
XX
SV   AL163529.1
XX
DT   07-APR-2000 (Rel. 63, Created)
DT   07-APR-2000 (Rel. 63, Last updated, Version 1)
XX
DE   S.pombe chromosome I cosmid c1705.
XX
KW   confirmed intron; human 4F5S homolog; retrotransposable element; tf2-4;
KW   tf2-type transposon.
XX
OS   Schizosaccharomyces pombe (fission yeast)
OC   Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes;
OC   Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces.
XX
RN   [1]
RP   1-2358
RA   Brown S., Harris D., Wood V., Rajandream M.A., Barrell B.G.;
RT   ;
RL   Submitted (07-APR-2000) to the EMBL/GenBank/DDBJ databases.
RL   European Schizosaccharomyces genome sequencing project, Sanger Centre, The
RL   Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail:
RL   barrell@sanger.ac.uk
XX
DR   SPTREMBL; Q9P7A7; Q9P7A7.
DR   SPTREMBL; Q9P7A8; Q9P7A8.
DR   SPTREMBL; Q9UTF0; Q9UTF0.
XX
CC   Notes:
CC   Details of yeast sequencing at the Sanger Centre are available on
CC   the World Wide Web.
CC   (URL, http://www.sanger.ac.uk/Projects/S_pombe/)
CC   During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced
CC   by the Sanger Centre. The sequencing of the S. pombe genome is now
CC   being continued with funding from The European Commission.
CC   Fourteen European sequencing laboratories, including the Sanger Centre,
CC   are participating in the project.
CC   Protein coding regions (CDS) have been predicted with the help
CC   of computer analysis using the Genefinder program in PomBase
CC   (an ACEDB database) with additional predictions for the
CC   branch-acceptor sites supplied by the program Sp3splice.
CC   CAUTION: It is possible that for any individual CDS we may have
CC   underestimated or overestimated the number of introns/exons or
CC   we may not have chosen the correct splice donor/acceptor sites.
CC   CDS are numbered using the following system eg SPBC25H2.01c.
CC   SP (S. pombe), B (chromosome 2), c25H2 (cosmid name),
CC   .01 (first CDS), c (complementary strand).
CC   The more significant matches with motifs in the PROSITE
CC   database are also included but some of these may be fortuitous.
CC   The length in codons is given for each CDS.
CC   IMPORTANT: This sequence MAY NOT be the entire insert of
CC   the sequenced clone. It may be shorter because we only
CC   sequence overlapping sections once, or longer, because we
CC   arrange for a small overlap between neighbouring submissions.
CC   Cosmid c1705 is overlapped at the 5' end by cosmid c167, EMBL
CC   entry SPAC167, accession number AL035248, and at the 3' end by cosmid
CC   c23H4, EMBL entry SPAC23H4, accession number Z98977.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2358
FT                   /chromosome="I"
FT                   /db_xref="taxon:4896"
FT                   /organism="Schizosaccharomyces pombe"
FT                   /strain="972h-"
FT                   /clone="cosmid c1705"
FT                   /map="IL"
FT   misc_feature    1..97
FT                   /note="nominal overlap with cosmid SPAC167, EM:AL035248
FT                   S.pombe chromosome 1"
FT   CDS             complement(1..452)
FT                   /db_xref="SPTREMBL:Q9P7A8"
FT                   /label=tf2-4
FT                   /note="SPAC1F2.03"
FT                   /partial
FT                   /gene="tf2-4"
FT                   /gene="SPAC1705.01c"
FT                   /gene="SPAC1F2.03"
FT                   /product="retrotransposable element; tf2-type transposon"
FT                   /protein_id="CAB86944.1"
FT                   /translation="MSYANYRYMKARAKRWRPENLDGIQTSDEHLINLFAKILSKHVPE
FT                   IGKFDPNKDVESYISKLDQHFTEYPSLFPNEHTKRQYTLNHLEELEQQFAERMFSENGS
FT                   LTWQELLRQTGKVQGSNKGDRLTKTFEGFRNQLDKVQFIRKLMSKA"
FT   misc_feature    302..2358
FT                   /note="Nominal overlap with old SPAC1F2 Z98976 S.pombe
FT                   chromosome I cosmid c1F2 (removed from EMBL as redundant)"
FT   LTR             complement(619..967)
FT                   /note="Tf2-type LTR"
FT   CDS             join(1098..1104,1299..1395,1451..1538)
FT                   /db_xref="SPTREMBL:Q9UTF0"
FT                   /label=SPAC1705.02
FT                   /note="SPAC1F2.02c, len:63, SIMILARITY: H. sapiens, O75919,
FT                   4F5S., (62 aa), fasta scores,
FT                   opt:147, E(): 0.00051, (43.1% identity in 58 aa overlap)"
FT                   /gene="SPAC1705.02"
FT                   /gene="SPAC1F2.02c"
FT                   /product="human 4F5S homolog"
FT                   /protein_id="CAB86945.1"
FT                   /translation="MSRGNQRDVDRARNLKKSQASKKKQAGDPTKRLEAQAEIMRAKQR
FT                   AADERKAAEANGGSKGKK"
FT   misc_feature    1105..1110
FT                   /note="gtaagt, splice donor sequence"
FT   intron          1105..1298
FT                   /note="confirmed intron"
FT   misc_feature    1286..1298
FT                   /note="ctaatgttactag, splice branch and acceptor"
FT   misc_feature    1396..1401
FT                   /note="gtaagt, splice donor sequence"
FT   intron          1396..1450
FT                   /note="confirmed intron"
FT   misc_feature    1438..1450
FT                   /note="ctaatattttaag, splice branch and acceptor"
FT   CDS             complement(2230..2358)
FT                   /codon_start=1
FT                   /db_xref="SPTREMBL:Q9P7A7"
FT                   /label=SPAC1705.03c
FT                   /note="SPAC1F2.01"
FT                   /partial
FT                   /gene="SPAC1705.03c"
FT                   /gene="SPAC1F2.01"
FT                   /gene="SPAC23H4.19"
FT                   /product="hypothetical serine-rich protein"
FT                   /protein_id="CAB86946.1"
FT                   /translation="SYSSDSSASSSSSSSHESSAASNGFTAGALVLGSLLVAALAM"
FT   misc_feature    2258..2358
FT                   /note="nominal overlap with cosmid SPAC23H4, EM:Z98977
FT                   S.pombe chromosome 1"
XX
SQ   Sequence 2358 BP; 788 A; 390 C; 378 G; 802 T; 0 other;
     tttgcttttg acatgagttt ccttataaat tgaactttgt ccaattgatt tctaaaacct        60
     tcaaatgttt tagttaaacg atcacctttg ttggatcctt gtactttccc tgtttgtctg       120
     agtaattctt gccatgtaag acttccattc tcagaaaaca tgcgttcagc gaattgttgc       180
     tctaattctt ctaggtgatt caatgtatac tgtcttttag tatgctcatt tgggaataat       240
     gaagggtatt cagtaaagtg ttgatcaagt tttgaaatgt aactttcaac atccttatta       300
     ggatcgaatt tccctatctc tggtacatgc ttcgataata tttttgcaaa aaggtttatt       360
     aaatgttcgt ctgatgtttg aattccatcc aaattctctg gtctccatcg ttttgctctt       420
     gctttcatat aacgataatt tgcgtaggac attgtatatg tttccctttg ggagatttca       480
     aaagggataa cagtccttcc aaagttcttt ctcctttgaa cccagaagga gagaaatata       540
     tatatttatg atatatattt attcaataat tcctttttcc taaaaggagt taattgtaga       600
     tcacaagagt tcagttattg taagctacgc agtttggtat ctgatttaag gatacgtaga       660
     actgcggtga gttttccttg tgatctatta tattacaata cacaggttgt ataagtagca       720
     actgagtata ggtattgtat taactgggtt ataatgttac ctatcactaa tatagctcat       780
     aactgaactg aggaacgagg ttcagcagta gctctattta tagtattcag ataatattga       840
     tcttaataga ttatcattaa gatcaatctc catatcgtat catacatggt acgatatata       900
     attagaacat gtgacatata gtgatactca acgtagtgta ttatagcgta gtgtagtatt       960
     gctgacaaat atttcacatt aacgatatat gctatatttc gttaagctat ggtcaccacc      1020
     ataaattgcc cgtttttctt ttttacctca taagaagtgt gagttgcagt ttgattgatg      1080
     atacaatttt tgctaagatg tctcgtaagt tttaaacctt tgagaatgaa gaagaaattc      1140
     taaaaattat tatccccctc tcagcaggga atagctgcgt tgaactatct cttttatact      1200
     tgcttattga tttttagtat tcttttagcc tttttcacat tctgatattc attaaaatta      1260
     aaactatatc tataatttgt aattactaat gttactaggt ggaaatcagc gtgatgttga      1320
     ccgtgctcgc aatttaaaga agtctcaagc ttctaaaaag aaacaagccg gagatcctac      1380
     caaacgtctt gaagcgtaag tggaatggaa tcaggatttt gtttaaaaac caattagcta      1440
     atattttaag tcaagctgaa attatgcgtg ctaaacaacg tgcagctgat gagcgcaagg      1500
     cagcggaagc caacggtggc tcaaaaggaa aaaaataact atattcgaaa ttttgttaaa      1560
     gggcagttta ttccatcctt ttaagtggtg tttttccaca gtacattggc tctttattct      1620
     gttatttttc ttcaatttta cctctatttt gataaaaaca ataaaatcta tatttctaag      1680
     taaagagtaa ataaactcgg tagtcttgca atctataata acatcccatt aagttactta      1740
     atcatttact cgtaatttct taactctcgg ttatgcatat catacgcctt atgtctcatc      1800
     tttgccaatc aaattatttt attactaact accgcgcgcc ttgtattaaa aattaaaaaa      1860
     atcaaattca agatatacta aatgcaaata ttcgatagac taattataaa aaaggcaaaa      1920
     tcaataactt tgaaaatagc ctaagagcaa ctcatataaa taatttaaag ctcaaattaa      1980
     agaaaaggaa catcaaacct tctcaacact tttctaagat catatagaag ttaatctctt      2040
     ctaaagatat atgtttaatc acacccgatt ataaagaaga aaataaaacg gcctgtatta      2100
     ataaacattt cagtaaccca catcaccaga gataagccag ctcatctcgc caaataagta      2160
     tcggtaacat tcaaatatta attttctata ctttcgataa aaaaagtaac aaattgttca      2220
     tgacgacact tacatagcaa gagcagcaac caaaagagat cccaaaacta aagcaccagc      2280
     agtgaagccg ttagaagcgg ctgagctttc atgggaggaa gaggaagagg aagatgcaga      2340
     agaatcagaa gagtatga                                                    2358
//