ID SPAP4C9 standard; DNA; FUN; 8806 BP. XX AC AL360094; XX SV AL360094.1 XX DT 01-JUL-2000 (Rel. 64, Created) DT 01-JUL-2000 (Rel. 64, Last updated, Version 1) XX DE S.pombe chromosome I P1 p4C9. XX KW DNA repair protein, yeast rad50 homolog. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-8806 RA Saunders D., Harris D., Wood V., Rajandream M.A., Barrell B.G.; RT ; RL Submitted (30-JUN-2000) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk XX DR GOA; Q9P3T5; Q9P3T5. DR SPTREMBL; Q9P3T5; Q9P3T5. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC P1 p4C9 is overlapped at the 3' end by cosmid c1556, CC EMBL entry SPAC1556, accession number AL132984. XX FH Key Location/Qualifiers FH FT source 1..8806 FT /chromosome="I" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /clone="cosmid p4C9" FT /map="ICEN" FT misc_feature 1..5432 FT /note="Right hand side of chromosome 1 centromere" FT mRNA 4497..4860 FT /note="mRNA from spc00798 133 368; not attached to CDS; may FT be UTR of SPAP4C9.01c" FT CDS join(complement(7179..8806),complement(6767..7141)) FT /codon_start=3 FT /db_xref="GOA:Q9P3T5" FT /db_xref="SPTREMBL:Q9P3T5" FT /label=SPAP4C9.01c FT /note="SPAP4C9.01c, len:658, SIMILARITY:Saccharomyces FT cerevisiae, RA50_YEAST, dna repair protein rad50, (1312 FT aa), fasta scores: opt: 1115, E():0, (34.8% identity in 679 FT aa)" FT /partial FT /gene="SPAP4C9.01c" FT /gene="SPAC1556.01c" FT /product="putative DNA repair protein, yeast rad50 homolog" FT /protein_id="CAB96041.1" FT /translation="SYSGTFASMISEIKALESEIEENRKTLHSLQFGSTFYEKAIEICV FT DQHACQLCQRSLDKEEEKLFVEHCHSMIDVIPSKSAEVYSHLETLTKTFKNLSEAKPIF FT DEIELLDKRLSETKTELSDLQGDLQGLDIRKDEIQSELDTLYLRRANLEKLQLLVKDIS FT NLEEEIRTIDRETEVLRIELPSSIAHHNLDEIYAEREKLLEKRGYLRKQIERTKLEETS FT FKKKIDDAVLANNEQKLKLTKLNFQVNELEQLEKDINKSSEDCDLQKKKLLEVSSKQGS FT QAPFLNELESEYEKLEADIQEMAQKSRTEILEANEYLHQLNEWNSELRIDVSTKFKCIK FT EKKSNIGEEVRIIASKIESTDDNLRKLQERLADLRTRERNASDNLRLRALMRQLEEAVT FT QKNYLLSQQSHDDRESFRERMQILKSKYGALNAERAGLLGECKQLENSITKDKEELNME FT FKDADERFRRQLIKTKTTGKANEDLGKYAKALDVAIMQLHSMKMNEINRIVDELWKQTY FT CGTDIDTILIRSDSEGKGNRTYNYRVCMVKGDAELDMRGRCSAGQKVLACIIIRLALAE FT CLGVNCGILALDEPTTNLDEENICSLAKNLSRIVEFRRKQANFQLIVITHDEQFIRLVN FT SDAYCSYYYRVKRDTNQKSMVRFANTVIVLT" FT misc_feature complement(7142..7151) FT /note="ctaacaacag, splice branch and acceptor" FT misc_feature complement(7173..7178) FT /note="gtacgt, splice donor sequence" FT misc_feature 8704..8806 FT /note="Nominal overlap with SPAC1556 S. pombe chromosome FT 1" XX SQ Sequence 8806 BP; 2799 A; 1560 C; 1366 G; 3081 T; 0 other; gatcgtcaat tttctacgga atcgatgaat cgaatgcata tgttcaaatc gctcgccttg 60 ccaatggcct aaaattgcga acctgaaact gaaatatcaa aagaagactc aactaatact 120 taaaaggaaa tagaaggaag atttcactca tacctaccga acgtatgatt agcataacat 180 ttcaaatata tatatccctt attataataa ctccctatta ggattagccc gatataaaaa 240 catattctct ttggttataa aaatccaaga aaggacattt ttgttactaa atatactaag 300 gcaaaaagcc tcaatcaaaa attgcgatcc caattattca gaatgataaa tttgattgga 360 tttatttttg gctatcatgg taaggatata ctagaggttt aatctatttg aacatttctc 420 agattaaaaa gtcaaggaca tatgtctcca tgttgttcgg aaactattta tttttccgtc 480 taaattataa ttattcaagt gctcaatgtt atttagctcg atcgatttct cttggttttc 540 aataatgtcg atcaatttga ctaaacactc attcaatctt taactttttc tcttccaatc 600 tctactttag actgttagaa tgtatataca aagcattaac tttattattt tcaatgatta 660 attaccaaat tgtttatgaa tttttagtaa gtatacaagt aatgctgatg ttcgcgatat 720 aaaagtttag caaaatactg actctaatta tatagccaat tggggtgttc aagtataatg 780 tatataatgt atataatgta tataatgtat ataatgaatg tatatatata tatatatata 840 ttttataagt aatcattatc caatcatgtt gaaaaatttg aagattgacg atatgaaata 900 gttgtaggag aataaaaaaa catattatta ttattattat cattatcatt atcattatca 960 ttattatcac tatcattctt ccaaagcaaa tagtctaatg atcaaatgta aatacatact 1020 ttacataaat cccttcaaat agcttgttga cataatgaag accaaataat aaacgactgc 1080 tattgtcgct ttgttgtcgt ggactattaa agtttacgga cgaaataatg tctagttgtc 1140 atgttgcaaa atattaattc catttcttat cgccgtgttt tttataccta tttgtccctt 1200 atttacacgt atgtattaat atcatttgaa cgaatgcatt aatatatgag ttatagaaaa 1260 ttggcggatt atcttcgaaa aatattttga ctcaatttga atcgtgttac tcaacccatt 1320 ataataaacg aacctcatga aatcgtttac cgcttctcct taatccattt gtgtaatact 1380 gaatgctaag taagagtgga aagcttccac catccaccag accattacaa gcactacata 1440 cgccatcttc aatatcgtat attttcagta gtcaacttga ctagctctta ttggcggttt 1500 tattcaagag aagattcatc cggtgaaaat tcaagcaact attactgcac tagcaattgg 1560 atcggtaaat aggcgagatc tgaaagtttc ggtaaagttt tagattacat ggcttagttt 1620 cacacgctct ttatattctc aaccttccga cgcaaatcac cagattcaat aaatgagtcc 1680 tactcctaca cctactctta tcacttgtaa tccaaaaaag ctccatttct ttcttgcacg 1740 ctaattcact attaaataga aatgtatatg caaaacagcg agaaatttgt agatacgttg 1800 aatgttgttg ctttcacatt tgcaactctt agtagttgtt caaaacaaca atgtcgagca 1860 aagatgtttg cgatgtttga atgattcttg gattgtattt ggattccatc ggtactatgg 1920 cctgttgatt cggcaccttt gtcattttag caacttttgt tttcttttct gtgtcgatgt 1980 agttctctat accatattca tgttcatatt catgttcata ttcatgttca tatcaatgtc 2040 catatcaagg tccatatcaa tgtccatatc aatttcccat gttccattac atatcccgtg 2100 ttctttcctt cttccagctt ttacatgcta gccttttata caatgattcc tctcatctcg 2160 actcgcttga tgaatgggcg tttagttttt gcaacaacat atgcgttggg ttatctcata 2220 tcgggaaaca ctttctgcca cttttaactt gaacaaagtt ggagaaataa agtgatggca 2280 gatattgcaa gttgtttata gtgtgcctgt agaaatgtaa aaacgctatg tttcgtgaat 2340 gcccagcctg taaatcgaaa taagatttcg tatttattaa gcattggaag atcacaattg 2400 gaaaaaacac gatacggttg gtctcttcac agtcgttctc cagtaatcac aaacatgaaa 2460 cctaaatata cggtaacttt gcacctatta ggacaacgcc tcaagtgact gcattaaagc 2520 ataataccga agcactgaca taacagagag gttcagaaag gaatttaaag attgactttt 2580 tcgacaaact tcatgttaca agtcttatcg tttgtgttta taaatcatca gcctctctct 2640 atatctctat atctctatat atcagatata aagatgcaag ttttgaagta gacattccgc 2700 acaaagtcta gtacaccatg tctcttgtgc tcaggctggt tgtcctgcaa aaatgtacaa 2760 caggcaaatg gatgtttcat caaagatgga ctcctttgtc tcatacttat tgatggcgaa 2820 gctagatccg ttatcatctt gagaaaacac atcgttgtct tcagagtagt gatagttctt 2880 atcgttgtag ttatagttgc agttatagtt atagttgtag ttataattat agttgtagtt 2940 gtaataactc tttcaacaag tcctgaatct tggcaaacag accctcatac agtgctgtca 3000 gctcactcaa gtccaatcgc accatgttaa ttaaacgcat ctccaatgca cgggtacata 3060 gaattacttc gagactgaaa caatacggtg cttgggctta gtccttgtag tgtattttgc 3120 atacataccc ttctgttcga atgatatccc taatgacttt gtgggtttcg taaagaacat 3180 catatagatt tctttgacca agtgtaataa agcaaaacac acggacatag tatggatatg 3240 gacacagcat gggtatggac acaagcaaca ctaccagtcg tttgaaacga tgaaatgggc 3300 aacaagtcga tttgtttttc atctatcaat cgttgtaatt cacatgcagc tacaagacga 3360 cacgttatat ttagggtgca aagcaggtag agatttcata ggcatatttc gaagtaccag 3420 ctttatgcca aaacatgcat gcatgcaaga aactccataa cttcgtttgc gcaactcctg 3480 cttatcgtct tctttctata cccatactgt tggatcaatg aatgaacgta gcaatagata 3540 caagatatag cgccacactt ttgagcatat cctaatgaca gtaattcgtt ttgttctaag 3600 cgaccaagta ttcgctcttg caacactata gttctcactt tgaggaaaca ttcctttgat 3660 gcccatgttc attccacttg gatgacagaa tcctcgatag tattagtatt aatgaattaa 3720 ctgtcaggat gtgttgtcgt tcttgaagta agtcttgtga aagaaaccac cgagttaaaa 3780 atagagatct tgatggtttt atatataaaa gtttaatgct tgcctattta tacatttccc 3840 aaggactgct gaggtagatg cgttgttcag ttgactagga tttgtttgaa ttcaaagatt 3900 atctattcaa gtcttcttta tacgtttgca tgtgtttgat aagaattcca ttgattcaga 3960 aaaaaagtgc caacagtttt tccaacttga ctgacactac aaccttccaa tcatagccat 4020 actactatgg tataaaaata aaagtttaca attttactac tgttatatat atttcgatgg 4080 tagtgaagct ttaaaaaaaa gaaggaatag cataccgtca agtcgttagt tgatgggtag 4140 ggttgttagc ctaattatga gttttatttg taaaataggt ttaggtttta gttttagttt 4200 taaattatta gtccaatcat gagttgagct gctagtttag gcgtgcgttg accttttagc 4260 ttaggtgtat tgtcacggtt tggttttcat atctctgcgc cgcctatatt agcaaacagt 4320 tgacatagaa ttcacaacaa taacaaatta aaaatgcatc atcctgtaat taaaagaagg 4380 tatattgatg catcctttgc gtgaatcaga cacgtagaat catgtttctc tcatgacaaa 4440 gagcacttat agtagcatat ctagaatttc tctcagtttt aactttcttc ggtctcggtt 4500 tttcccttga caaagctgat aatatatcat gtacaatcag gcaacacatt tatttagcca 4560 acattctagt tcaaaagcaa ctttttgcta aatcaagact gtgatggtgt aaaaatattg 4620 cagtcaatat ttttgtacta aaaaaattga aagctttagt tgatacgtcc acggattgag 4680 tcctcgcgag cttggttcta tgtgttaatt attatcatta tcattatttt ttttggcaaa 4740 cttcaaggga gtaacttctt cacctcattt acaagctcca ctaaaaatta actcagcaat 4800 gattttacaa tgctcgagag taaaatgtta aattcgtatt aaatctactt gaaatttctt 4860 aaacacattt tatggttttg cgggatttta tattcgagta ctcaagaaca aggcaataaa 4920 tccataaaag aattgctgaa tgtaaccaac atcatagatg aagctaatat tgattatgtt 4980 aggtgaaacg caattgccaa ttgaactaag tgtgttgaaa tagaacatct gtagataatc 5040 attcattcga tacactgata gttgaaactg ggaaattata aagttagctt atagtgttaa 5100 cgcttttctt cagcactaat aggcaattga ggcatataaa atttgtttac taactatgcg 5160 tagcgaagct aacgtttatt tgtaaattta aataataagg tttttttttt tgttatcaat 5220 agtttttatt ttctgtaatt acacatgata ttcttactga taacccaagc agatagactg 5280 aaatttgtaa agcatttaca acaccatgtg taaaatttgc tattttattt aaatacttat 5340 caaatttcat atcagtgcag tgtttaccaa caagcgtacg atagcgtata cgttatcatt 5400 ttattcacat ttgctctaaa aaatatattt cttaaaaaga tacgtaaata ttgaagttat 5460 ttgagaggtt tcttatcgtg ctatcaacat aataaaagat gcgtttgcga ttctctgcaa 5520 tttactcatt tactgatacc cacattgaaa atttaattaa ataaaacacg aaacaaaaat 5580 gtttattgta ttgtttatta attagcaaaa gttagcgctc acactaagtt tacactctaa 5640 aatgcacatt tggtaagact aaaaaaagaa gactttacat taacaaatta tttaggattt 5700 ctaatttaat ataagtaatg aaaactataa tttcaatata atcgtgattt agaatatact 5760 ttgtgtaaaa cgtattttat atgtgtgtca agcaagaaag cttcatagat ttcattattc 5820 tgagttctcg cagctaccac ctactataca caagcaccgc accgaaatct aatctggcat 5880 aataaactga aatggaaagc tcgacaataa atgcaaaaaa aatatcagtt ctactaactc 5940 ttttttccat tattggatac actgcttatt ctgcacatga atcaattctt gaaattagac 6000 aagatggtaa acttccttta gacgtatgtc tttaccacat gcggcataaa taaactaaca 6060 tacagataaa atgtgaagtt atcttagtaa ctctactttt cacttttact accgttataa 6120 ttgcttctcc tttgcgtagt atacaactta ataaatggtc acaccagaga tcggaccttg 6180 catttctcaa ctcgagaacg aactttttaa ggataaaaga attgaaggaa aaaatagaaa 6240 aagtaaagaa ttaacatgct ttgtggaatt agaggaacct gtacttagat tatgttaaaa 6300 ccatgaatgt agtgctgttg atacttcgag gatctaaaat agtttatatt tattcttccg 6360 gcttcttgct tttgcattat tcaaattatt ttagcgagag aatattttgg atcaccttct 6420 ttttcccaat caatggctca tttatcaatt cagatatgtt tatagcttgt tatcaaaacc 6480 atgtatgatt ttgaacatac gaagctatat ttcattattt tcaaacaatg aaaaatgtat 6540 attcaaatca ttttgttata ataaagggtt ttagttcttg atgaaataaa aataaagtta 6600 cctcggtcat gttaaaaact actatatagt aaagatagac tacaaagaaa tagagaatcc 6660 aagtcaattt ataaagacta ttattcagta aatgctttaa ccaatatcta aaacgtaagg 6720 tcgaatatat tcatcattaa ttttaaagag gttctttaac aatctactaa gttagtacta 6780 taactgtgtt agcgaacctt accatgctct tctgattcgt gtcccgtttc accctataat 6840 aatagctaca ataagcatct gaattaacta accggataaa ttgttcatca tgcgtaataa 6900 caatgagttg gaagtttgct tgtttccttc taaattcgac aatccgagat aaattcttag 6960 ccaaactaca tatattctct tcatcgagat ttgtagttgg ctcatccaga gccaaaattc 7020 cacagtttac acctagacat tctgccaaag ccaaccttat aataatacac gctaaaacct 7080 tttgaccagc actgcagcgg cctcgcatat ctagctcggc atcaccttta accatacaga 7140 cctgttgtta gtaagtaaca gtagatatat gtacgtactc tataattgta agtcctattc 7200 cctttacctt cactatcaga gcgaattaaa atggtatcaa tgtcagttcc acagtaggtt 7260 tgtttccaaa gttcgtctac tattcgattg atctcattca ttttcattga atgtagttgc 7320 atgatagcga catcaagagc tttcgcatac ttgcccaaat cctcatttgc tttacccgta 7380 gtctttgttt ttataagctg tcgtcgaaat ctttcgtccg catctttaaa ctccatatta 7440 agttcttcct tgtctttagt aatggagttc tccaattgct tacattctcc taataagcct 7500 gcgcgctcag cgttgagtgc cccgtatttt gactttaaaa tttgcattcg ttcgcgaaat 7560 gattcccgat catcatgtga ttgctgggat aacaaataat tcttttgcgt cacggcttct 7620 tctaattgtc tcattaaagc gcgaaggcgt agattatcag atgcgttcct ttccctcgtg 7680 cgcaaatccg ctaagcgctc ctgtaacttt cttaaattat catcagtact ttcaatttta 7740 gaagcaataa tcctaacttc ttctccaata ttacttttct tctcttttat gcatttaaac 7800 tttgtggaaa cgtcaatcct caattcagag ttccattcat taagttgatg caagtattca 7860 ttagcttcaa ggatttctgt ccgacttttt tgagccattt cttgaatatc tgcttctaat 7920 ttttcatact ctgactctaa ttcgtttaaa aagggtgcct gagaaccttg tttagagcta 7980 acttccaaaa gctttttctt ttgaaggtca cagtcttcgg aggatttgtt aatatctttt 8040 tcgagttgtt ccaattcatt gacctgaaaa ttcaacttgg taagttttag tttctgctcg 8100 ttattggcta agacagcatc gtctattttc tttttaaaag aagtttcctc caatttagtt 8160 ctttctattt gtttccttaa gtatcccctc ttctcgagta atttttctct ctcagcataa 8220 atttcgtcta aattatgatg agcaatagaa gaaggaagtt caatacgtaa aacttcagtt 8280 tctctgtcaa ttgtacgaat ttcttcctct aaattagaaa tgtctttaac caacaattgt 8340 aatttctcta agtttgctct tcttaaatac aaggtatcca attctgattg aatttcgtcc 8400 ttacgtatat ccaaaccttg tagatcacct tgcaaatctg aaagttcagt tttcgtctct 8460 gaaagccttt tatcaagcag ttcaatctca tcgaaaatcg gtttagcttc agaaagattt 8520 ttaaaagttt tagtaagggt ttcaaggtga ctataaactt cagcagattt ggatggtata 8580 acgtcaatca tggagtgaca gtgttccaca aagagctttt cctcttcttt atcgaggctt 8640 ctttgacata gttgacaagc atgttggtca acacatattt ctatagcttt ctcataaaat 8700 gtagatccaa attgcaatga gtgaagggtt ttcctatttt cttctatttc actttccaaa 8760 gctttaattt cagatatcat agaagcgaag gtcccactat aagatt 8806 //