ID SPAPJ691 standard; DNA; FUN; 6161 BP. XX AC AL133302; XX SV AL133302.1 XX DT 30-NOV-1999 (Rel. 61, Created) DT 18-OCT-2000 (Rel. 65, Last updated, Version 4) XX DE S.pombe chromosome I PCR product p691. XX KW Sec7 domain. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-6161 RA McDougall R.C., Rajandream M.A., Barrell B.G., Brown S., Harris D.; RT ; RL Submitted (30-NOV-1999) to the EMBL/GenBank/DDBJ databases. RL European Schizosaccharomyces genome sequencing project, Sanger Centre, The RL Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, E-mail: RL barrell@sanger.ac.uk XX DR SPTREMBL; Q9HFF0; Q9HFF0. DR SWISS-PROT; Q10491; YDG1_SCHPO. DR SWISS-PROT; Q9URW3; YIPL_SCHPO. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S_pombe/) CC During 1995 to 1996 about 66% of S. pombe chromosome 1 was sequenced CC by the Sanger Centre. The sequencing of the S. pombe genome is now CC being continued with funding from The European Commission. CC Fourteen European sequencing laboratories, including the Sanger Centre, CC are participating in the project. CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPBC25H2.01c. CC SP (S. pombe), B (chromosome 2), c25H2 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC PCR product p691 is overlapped at the 5' end by cosmid c26F1, EMBL CC entry SPAC26F1, accession number Z73100. XX FH Key Location/Qualifiers FH FT source 1..6161 FT /chromosome="I" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /map="IR" FT misc_feature 1..109 FT /note="nominal overlap with cosmid SPAC26F1 S. pombe FT chromosome 1" FT CDS complement(1..922) FT /db_xref="SWISS-PROT:Q10491" FT /label=SPAPJ691.01c FT /note="SPAPJ691.01c, len:307, FT SIMILARITY:Schizosaccharomyces pombe, O94317, serine-rich FT protein., (534 aa), fasta scores: opt: 237, E():3e-06, FT (28.1% identity in 263 aa)" FT /partial FT /gene="SPAC26F1.01" FT /product="hypothetical Sec7 domain protein" FT /gene="SPAPJ691.01c" FT /protein_id="CAB62088.1" FT /translation="MDESSRIASSSALHGLDEMVSAHKPSPPLPSRRKGKSALRSALEK FT KNRKSKSKPKVTITSDTPKVSSQHSPVSSAYTGDSTTDLDSKSGHSSSQKLSNKVSSAL FT KLTIPKRWRSSKSSSSQCSSPFLPTSSSNGHGDDASLNLPDKKSRPSSQSSIFLSNWST FT IFSSNASPTDSQLSPTHTSTIAELAASTLSIFPSGSYAGSTFGSPSRSSIDSSTYLPRS FT KSVNSLSSNFSARTPASNQSSVSEDFGAAPNCDHKHNSVTLSDFALPDIDQHDTVETIL FT EKVEFTIPKQFTPAILSQGTSSLLKL" FT CDS 3385..3780 FT /db_xref="SWISS-PROT:Q9URW3" FT /label=SPAPJ691.02 FT /note="SPAPJ691.02, len:131, SIMILARITY:Arabidopsis FT thaliana, AAD32844, thioredoxin-like protein., (130 aa), FT fasta scores: opt: 283, E():2.8e-13, (40.6% identity in 101 FT aa)" FT /product="hypothetical zinc binding protein yipee-like" FT /gene="SPAPJ691.02" FT /protein_id="CAB62089.1" FT /translation="MGRYYPVHLKSRCYVCAKCKTHLAFKGHLLSHDYRGKNGPACLFK FT RVENVIEMEPKTEQMSTGRFIVRHIHCCRCHTYIGWKYVSSYEPSQKFKDGHYILEMQD FT AVLQRDDPEPDDCFIHPPITFLSSSFS" FT CDS join(4941..4982,5183..5267,5339..5472) FT /db_xref="SPTREMBL:Q9HFF0" FT /label=SPAPJ691.03 FT /note="SPAPJ691.03, len:86" FT /gene="SPAPJ691.03" FT /product="hypothetical protein" FT /protein_id="CAC14019.1" FT /translation="MSTSQSSEQTLNYQWDVCLSNMVVQSGIGLGAGIVSSVLFFRRAA FT WPVWGGLGFGLGKSYADSNARLRTFHAIPKQLPASSTQKKD" FT misc_feature 4983..4988 FT /note="gtatgt, splice donor sequence" FT misc_feature 5162..5182 FT /note="ctaacaacattgctgttttag, splice branch and acceptor" FT misc_feature 5268..5273 FT /note="gtattg, splice donor sequence" FT misc_feature 5317..5338 FT /note="ttaacgaaatttttttgttcag, splice branch and acceptor" FT LTR complement(5959..6161) FT /note="TF2 LTR" XX SQ Sequence 6161 BP; 1868 A; 1067 C; 1148 G; 2078 T; 0 other; aaagttttag aagtgatgaa gttccctgcg acaatatggc aggagtaaac tgcttaggaa 60 ttgtaaattc caccttctcc aagatggttt caacggtgtc atgctgatct atatctggca 120 gtgcgaaatc cgaaagagta acagaattat gtttatggtc acaatttggt gcagcaccaa 180 aatcttcaga tacgctagac tgatttgaag ccggcgttcg cgcactgaag tttgaagaaa 240 gagaatttac agacttcgaa cgaggtaagt aggtagagga gtcgattgaa gaacgggaag 300 gagatccgaa tgtcgagccc gcataagagc cggatggaaa aattgaaaga gtagaagcag 360 ctaactcagc gatggtagag gtatgcgttg gagataactg cgaatctgtg ggactagcat 420 ttgaggaaaa gattgtcgac caattggaaa gaaagattga agactgacta gaaggacgtg 480 attttttatc aggcaagttc aacgatgcat catcgccatg tccgttggag gaggaggttg 540 gtaagaaagg ggaggagcat tgtgacgagc tagatttgga agagcgccaa cgcttcggta 600 tggtgagctt caaggctgaa gagactttat tggaaagttt ctgggaagag gagtgaccag 660 attttgaatc caaatcggta gtcgaatctc cggtgtatgc ggaagagact ggggagtgtt 720 gagaggagac cttgggggtg tccgaagtga tcgtaacttt tggcttagat ttggactttc 780 tatttttttt ctccaaagcg ctacgcaggg cagacttacc ctttctccgg gaaggaaggg 840 gaggagacgg cttatgagca gaaaccatct catctaaacc atgcaaagcg gaagaagaag 900 ctatacgaga agattcgtcc atgttctatt aaaaagggaa ttttaaattt attaaaagaa 960 ataaaaaaaa aactttttta aaatttccta cgattaaaga gttcagaacc tggaaaacaa 1020 aaatgcgtct cgaaaaccct gagcaacaca aacgacatca aacttcatga agaagcaatt 1080 gcacaggatg attgtgatta actgataggg ttctcttttc taaacaaaaa aataaaagag 1140 tatcttttca gcgaaaaatg aagtttttaa ataataaaag cgtagatcat agccgcagtt 1200 cgttttggaa gaagagagcg attcttgaag agtataatta cagcagacga cggaatggca 1260 aagcagtaaa gaggaaaaga cttaaaaaac ctatttttcc ctactaagca caagaaatga 1320 ataaaactaa ttatgccttg aatgatagga tgaggggggt aaatattgac tgtggaagat 1380 gagtgagaaa cgatgtaacg attcaatggt acgcagcgag gcgggcattg gtaatactga 1440 acgcagaggc aagtgccccc ataaaaaaga ggacttatgt gggggtggat cgactagtta 1500 atcctaatgt aaacaatgga tggaaagaac taaaaaatga tgaagaagaa cattaagaga 1560 aaaaacatat agtaagttat atactattaa tgaaaaataa agaattgcat tgaaaataag 1620 tatttgtagg aggaattgaa ctttctaagc caattgtgcg aaacattgct agatgtagtg 1680 gataaataca aaaattgctt gcgaagaaaa atattctaaa ggagtacaaa cgaaatttcg 1740 attcacatag tattgaaagt ttttttagta agattgtttc ttgaatttcc tttaatttcc 1800 cttcaatctg catattggga actcgaattg atgaaaacat ttattccatt agtaattgct 1860 gagaagcata tattaataga aaaatccatc tctatttatg attttttttt cacactcaga 1920 acggttatga ttaaagatac tgtaagttta tgaaattaat tgacatgtaa attgaattgg 1980 tggttttccg aagaaaaagc atttctttca acaaccgttc gttcactcaa taatcttcat 2040 tgaatttctt tagcagtcat tttcttaaat cgcgttttat caattgcaaa acccttcttt 2100 tatcgttcat ctgcaaaact aactgactca cttacaatat tgaatttgcg cacttaagaa 2160 ttccctacca ctttagtatt ttacatttga agattcgtcg atctcaagtt tccaaaggtc 2220 acctattaag acaaagtatg atagctacag caccatcaag caattgcgtt tcgcacactt 2280 gaactaccgt ctctacaatt cattcaactc aacctttctc aatacttcca ttcctccgtc 2340 aatctcctac cgtacttctt tactatcttt tctatcaaac cttttctcta tttccactct 2400 tccgctcctg ctgccacgct acctaaatga gatggataag taacagttgc aagagttgta 2460 gaaacttatt ttcgccagct ataaagctca aaggaaacgc attcatcctt aaaattggca 2520 ggtcagtttg ctcgaattac taatatccaa gtgtaacgta gcggcatttc atacttagta 2580 ttgccattcc catactcaac actgctcgtt ggcgtggaag tctacttggc aattcttagt 2640 cggacattct gtcatcaatg gcaaattaat catgtttctc ggtattagta aagcaaggaa 2700 aatctggcat tgagggcgtc ataaatttat taccgaagta cgttagtaat tctcctacgt 2760 tagcagtaaa ctaagcagta atattggtgg tagcggtaga attcctctta ccttcttctt 2820 gctacatcta aacaggaagt gaagttggaa tcgagcgatt gtgagacgaa attgcttgcg 2880 aaaagaattt caaggactca attgagatca attgttaaat tttacttaat ctaagacttt 2940 ttttttttcc aaattaatcg ggtttatttg gatttcataa agttgcagaa ttgcaatttc 3000 atagattcgt ttgtagaagt aaattttttt tgtcatttac attattgcct tttattctag 3060 tttctcttct ccttcaggtc cccgtccttc tttccttagt gtattgtaat aaaacggctc 3120 atacgattcg ttgacttcgc tgtctgacgc tttccagcaa agtccataag ccatccgcat 3180 tgttaaataa cctcttttag ttaataaacc tcagctccta acatctagat ttacgaattg 3240 ggcttagctt cttttcgttg agtgtatcta ctggcgccac cgtcattctt tcctctttct 3300 atcgctgctt taaatattca attgagcaat tgacgtcttt tccctttttt ccttttcgta 3360 tttagttgta agtctttcat cgcaatgggt cgctactatc ccgtccatct caaaagtcga 3420 tgctatgttt gtgcaaagtg taaaactcat ttggctttta agggccacct gctcagtcac 3480 gattatcggg gtaaaaatgg tcctgcctgt ctgttcaaaa gggtagagaa tgtcattgag 3540 atggagccga aaacggaaca aatgagcacg ggtcgtttta ttgttcgtca tattcattgt 3600 tgtcgttgcc atacttatat tggatggaag tacgtttctt cctacgaacc atcccaaaag 3660 tttaaggatg gccattatat tttagagatg caagatgctg ttttgcaaag agatgatcct 3720 gaacccgacg attgtttcat tcatcctccc atcaccttcc tctcttcttc tttttcttga 3780 tgtttcttca tctgttatta ttcgcattca tcaccttttt taactagtgt ttttcttagg 3840 acaccgattg aacttattta cctaatttta cttttttttt gtttgttgtt tttcatgcta 3900 ctttcttttt tatccgcttt aggactccct gcttttagaa ttgcatccac attactcaga 3960 tattgaagca gtcgaggcaa ttgtaaacct tttttacttt tttaaagctt ctattatccc 4020 ttttggtggc tgcttttctc tcttccattt atatccagca tttttttttg caaattctcc 4080 aagttctgca aaacagatcc gtggatattt caactggtgt ttctttaata ttcatttaga 4140 atatcaaata attgtcatta cgtatcttta tcaaatttct ctttttaatc cttgagcttc 4200 aacttactat tatctttgtt aacaatcagt aaattgcttg atttactctt tagtgtcgag 4260 gaattctata gttttactct ttacatttca ttactggagt gattatcaat ttaatacaca 4320 gatcttttat ctaccaaagg gtcttacctc tagtcataaa ctattagtct tactaaatca 4380 actctattca aattttatca atttcatttg tagtagaaaa gctaaatata aaaaagaaag 4440 aaagtattat ttgtgtacta tcatgtaaag gttgtaagaa aagtgtgtcc atgaaatatg 4500 taatttgatt actagtaaac agaaaaatca tcactttttg agacttcgtt ttggaagtta 4560 atgatgagat tggcagagta atcactccta attatgctat tcctgacgat taatcatttg 4620 ttgaaggaat tttttatttt taactatttt attttattat tttttttaga atttgtttac 4680 tgaatactaa tgccctagtc taatctctgt tctacttcga ggtagcaatg ctactttact 4740 cttttgtaaa ggaatttaca tcatgaacga gcgagtgcga gtgcaagcgc ttgcggatgg 4800 gactcacgaa cttcttcttg aacaactcaa catcccccct ttccaaattc tttaggtttt 4860 actgatatat acgactttgt atattttttg aataaaataa attttacgct tccttcgagt 4920 ttcgctcttt cctttttaca atgtcaacct ctcaatcaag tgaacagaca ctcaattacc 4980 aggtatgtta tgcgtataat aatgaaacaa aataaaaaaa ggaattactg ttaaacatgg 5040 atatacattt tttaaggaca cctgaagaag ccagttctga aggtgatgcg gtaatacata 5100 tctgttgaaa gatacatatg attgcttcaa gtcgattgtt tcatgttact ttttactttt 5160 tctaacaaca ttgctgtttt agtgggacgt gtgtctttcc aacatggtcg tgcaatccgg 5220 tattgggtta ggcgcaggta ttgtgagctc tgtgcttttc tttcgtcgta agtaatattg 5280 agaacctgca ataagtttcc tgtttgatta tgattattaa cgaaattttt ttgttcaggt 5340 gctgcttggc cagtatgggg aggccttggt tttggtttag gaaaatcgta tgccgacagc 5400 aatgctagac ttcgcacgtt tcatgccatc cccaaacaac taccagcttc ttctactcaa 5460 aaaaaggatt aaatgaaaga aaaaggagat cctttacata aagattgatt tatgttgttc 5520 catggggaat taaatttgta tgaactatca taccaggcgt tacttcaatc ttgatagcag 5580 ctaataatgt ttgaatgaat gacgctttcc atctttgttt tcagttatta taaatcaatt 5640 tctttttctt ttcttttatt tttattttta ttttttggtt tagattttgt ttaaagtatt 5700 tcgtactcaa cttgattagt ctgcaaaggg gtaaagatcc aatgaaaaca atactattca 5760 tcgttgtaga attcctagtg gaagatataa gaagtataca atattaaagc tgagtttgga 5820 attattaaaa ttttagtgta cacacgtgtt cagcacttta gcatacaaga attcctccat 5880 tccaaaaaag tcctctagtt ttgtacgcac tttcgctcga ccatctcaaa taacgagaat 5940 atgatctcgg tcgataagtg taagctacgc agtttggtat ctgatttaag gatacgtaga 6000 actgcggtga gttttccttg tgatctatta tattacaata cacaggttgt ataagtagca 6060 actgagtata ggtattgtat taactgggtt ataatgttac ctatcactaa tatagctcat 6120 aactgaactg aggaacgagg ttcagcagta gctctattta t 6161 //