ID SPAC15F9 standard; DNA; FUN; 3910 BP. XX AC Z68197; XX SV Z68197.2 XX DT 12-DEC-1995 (Rel. 46, Created) DT 20-FEB-2003 (Rel. 74, Last updated, Version 5) XX DE S.pombe chromosome I cosmid c15F9. XX KW G-beta repeat; NFT2 homologue; nuclear pore protein; KW nuclear transport factor 2; SEC13 homologue; SEH1 homologue; WD40 domain. XX OS Schizosaccharomyces pombe (fission yeast) OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes; OC Schizosaccharomycetales; Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-3910 RA Devlin K., Churcher C.M., Barrell B.G., Rajandream M.A., Walsh S.V.; RT ; RL Submitted (07-DEC-1995) to the EMBL/GenBank/DDBJ databases. RL Schizosaccharomyces pombe chromosome I sequencing project, Sanger Centre, RL Hinxton Hall, Hinxton, Cambridge CB10 1RQ E-mail: barrell@sanger.ac.uk XX DR GOA; Q10099; Q10099. DR GOA; Q10100; Q10100. DR SWISS-PROT; Q10098; YAP1_SCHPO. DR SWISS-PROT; Q10099; SEH1_SCHPO. DR SWISS-PROT; Q10100; NTF2_SCHPO. XX CC Notes: CC Details of yeast sequencing at the Sanger Centre are available on CC the World Wide Web. CC (URL, http://www.sanger.ac.uk/Projects/S.pombe/) CC Protein coding regions (CDS) have been predicted with the help CC of computer analysis using the Genefinder program in PomBase CC (an ACEDB database) with additional predictions for the CC branch-acceptor sites supplied by the program Sp3splice. CC CAUTION: It is possible that for any individual CDS we may have CC underestimated or overestimated the number of introns/exons or CC we may not have chosen the correct splice donor/acceptor sites. CC CDS are numbered using the following system eg SPAC5H10.01c. CC SP (S. pombe), A (chromosome 1), c5H10 (cosmid name), CC .01 (first CDS), c (complementary strand). CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC The length in codons is given for each CDS. CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC Cosmid c15F9 overlapped 5' by c22H10. XX FH Key Location/Qualifiers FH FT source 1..3910 FT /chromosome="I" FT /db_xref="taxon:4896" FT /organism="Schizosaccharomyces pombe" FT /strain="972h-" FT /clone="cosmid c15F9" FT /map="IL" FT misc_feature 1..100 FT /note="nominal overlap with cosmid SPAC22H10, EM:Z69730 S. FT pombe chromosome 1" FT CDS join(complement(609..918),complement(180..553)) FT /db_xref="SWISS-PROT:Q10098" FT /gene="SPAC15F9.01c" FT /product="hypothetical protein; sequence orphan" FT /protein_id="CAA92378.1" FT /translation="MEQIRPFSDNYLDRLRTVEEDDYNDWEGSSDTKTLYSSVSDSKIG FT HKKSFHKDGKGLFKTLRNFLSLKTNKVPKNRKFSLLKGSKCNNVEIKVFIRNNDKQSNV FT HNFKGKDSIHEAWSQYLIFDSMGLSPLIHISGEDKYKVLCLFSIWRRKLGKCSIFTVHD FT PDIQKKIKSILSYLWELSNTNDPPHVIVWHTAKGTNVMTMGPLGLKDEDLWNYANERSN FT LNGTT" FT misc_feature complement(554..566) FT /note="ttaacagctgtag, splice branch and acceptor" FT misc_feature complement(603..608) FT /note="gtgcgt, splice donor sequence" FT CDS join(2151..2217,2259..3192,3247..3265) FT /db_xref="GOA:Q10099" FT /db_xref="SWISS-PROT:Q10099" FT /gene="SPAC15F9.02" FT /product="WD repeat protein; nuclear pore complex FT (putative); possibly involved in nuclear transport; similar FT to S. cerevisiae SEH1" FT /protein_id="CAA92379.1" FT /translation="MDDISATTIQTNHQDLVNDVTYDFYGRRMVSCSADQRVKVYDFND FT DTETWAITSEWRAGDASLMRVAWAHPSFGQVLAVCSLDRGVRIYEEQKKNFESKTWVEV FT AKLMDARSAVLDISFCPFQHGCKLAAVSADATLRIYEAMEPGNLTYWTLMNEIALMPSP FT PSRNEQPAFCVNWCPSRWREQYIAVGCMNDAYIYKQNSHGKWKKVAELPGHTDLIRDIC FT WAPSMGSSYYLIATACKDGNVRIFKVETLCEEVFQEEEDAGNSMTEDSNFNLNSLKVEL FT IGEYDNHKCQVWRCRFNVTGTILSSSGDDGCVRLWKASYANLFKCISVVSLEKKPEKL" FT misc_feature 2218..2223 FT /note="gtaagg, splice donor sequence" FT misc_feature 2247..2258 FT /note="ctaattgtcaag, splice branch and acceptor" FT misc_feature 2807..2932 FT /note="Match to PF00400 WD40, WD domain, G-beta repeat FT Score 27.52" FT misc_feature 3053..3142 FT /note="Match to PF00400 WD40, WD domain, G-beta repeat FT Score 21.90" FT misc_feature 3193..3198 FT /note="gtaagt, splice donor sequence" FT misc_feature 3233..3246 FT /note="ctaacttgtgacag, splice branch and acceptor" FT misc_feature 3597..3910 FT /note="nominal overlap with cosmid SPAC1B9, EM:AL109951 S. FT pombe chromosome 1" FT CDS join(complement(3876..>3910),complement(3690..3797)) FT /codon_start=3 FT /db_xref="GOA:Q10100" FT /db_xref="SWISS-PROT:Q10100" FT /gene="SPAC15F9.03c" FT /gene="SPAC1B9.01c" FT /protein_id="CAA92380.2" FT /translation="SVIVMVTGELLLDEEQMAQRYSQVFHLVNNNGNYYVLNDLFRLNY FT G" FT misc_feature complement(3798..3812) FT /note="ttaacctttttgtag, splice branch and acceptor" FT intron 3798..3875 FT /note="confirmed intron" FT intron complement(3798..3875) FT /note="confirmed intron" FT misc_feature complement(3870..3875) FT /note="gtaagt, splice donor sequence" XX SQ Sequence 3910 BP; 1292 A; 633 C; 660 G; 1325 T; 0 other; atattttttc gtttacaaat agcttagtaa tgaagtttcc ccaaaaaaaa atagtttaat 60 atttcgtagt cggtttgggt aacggtgatg ctctgcgatc atatcacatt actcgataag 120 tacaaagatt aaataaaaat aaatacacgg tatcttctct aaaaggcatt gcagtggtgt 180 caagtagtac cattaaggtt ggatctttcg ttggcataat tccaaagatc ttcatctttt 240 agtcccaaag gtcccattgt catgacatta gtgccttttg cggtatgcca cacgataaca 300 tgaggagggt catttgtatt actaagttcc cataagtatg acagtatgct ttttattttc 360 ttttggatat cggggtcatg gacagtaaaa attgagcatt tgccaagttt tcttctccat 420 atactaaata agcaaagaac tttgtattta tcttctccag atatgtgtat aaggggactc 480 aagcccatgc tatcaaaaat taaatattgg ctccatgctt catgaattga atcttttcct 540 ttaaaattat gaactacagc tgttaaaatt taagaaaaca aagtgtcatt atcctatttt 600 tgacgcaccg tttgattgtt tatcattatt ccttataaac accttaattt ccacattgtt 660 acatttagaa ccctttagta atgaaaactt tctgttttta ggtaccttat tagttttcaa 720 actcaaaaaa tttcgcaatg tcttaaataa gcctttgcca tccttatgaa aacttttttt 780 atgccctatt ttgctgtctg atacgcttga gtataacgtt tttgtatcag atgaaccctc 840 ccagtcatta taatcatcct cttcaactgt acgtagacga tccaagtagt tatccgagaa 900 cggcctgatt tgttccattt ctggccttga tcgaactaac tcagattagc gagtaaggga 960 taggctttgt tttttatttt tagaaaaggg tcgttttaca atactcaata taatcaaaat 1020 tccgtagagt caaaatggca ttgccaattg acattcagtg tataaaaaat aaaaaaaatt 1080 gaaaaatttc ccatttaaaa gatatactat acttgtggca ctgacgtgca attacaatca 1140 ttattacttg tacatgactt gcctattaag tcctgaagta ttgtagtata ataacacaca 1200 tgagacaaga acatattatg ggtcacaaaa gtcattttga aatctgtgat aatctgcaaa 1260 tacaaaactg aaaagaattg actggtagtg tcactatttt gttaaacact gactgcattt 1320 aataaataga caagcttata aattcaaaaa accaacactt taatgatatc aatgttacta 1380 ctatcgtttg tctctttgca aaatcaaata aaatgttttt atgaattttt ttgctaattt 1440 tgattacggc tcctcagcat tttattgtga agttcttatt ttacaaaaaa agtattaaca 1500 ttatatatgc atatttttta atgggggact aatattttca aattacctac ttcagttttg 1560 cattaattaa atcaataatt ttatcaaaag attttgtacc ttaaacaaaa tatcaataat 1620 acttttaact atactgaaac cattaaaaaa agcacataat ttcatatttg taaatgaagg 1680 taattaataa ctaactattg actgaggtat acataccact aacaactaaa taatttagta 1740 gtccggaaga ctatttgtat agtacttgaa actttcattt cattttttaa acgtcaatga 1800 acatattaga ttattatcgt tttggcttac caaatcactt caggtgaaat aacgtcaatg 1860 tggtatacaa atttgttatt accatctatt aagaatgaac tgagtgaaga ttatgtgcag 1920 tagatatttg ctgactattt tagtcgtttt gtatattctt tactatgatt gtcgccttcg 1980 atgagttaac ttgtcggacc gaaagttaag catgcgaaaa gtcttcttta caaaaaattt 2040 acttctttca ttttttctag acttaccctc tttttttttg cgctgatacc gataatttac 2100 atataaaatc ttttggccct tctttactag ctcacctcgt ctctaccaac atggatgata 2160 tttcggctac tacgattcaa acaaatcacc aagatttagt gaatgatgtt acatacggta 2220 aggaaatatt attttagatt ttgtagctaa ttgtcaagac ttttatggcc gtcgaatggt 2280 atcatgttca gctgatcaac gagtcaaggt ttacgatttt aatgatgata ctgaaacatg 2340 ggcaatcact tctgaatgga gggcaggaga tgctagtcta atgagggttg cttgggctca 2400 tccttcattt ggtcaagtgc tagctgtctg cagtttagac agaggggttc ggatctatga 2460 ggaacaaaaa aagaactttg aatccaagac atgggttgaa gttgcaaaat taatggatgc 2520 tcgtagcgcc gtattagaca tatcattttg cccttttcaa catggttgca aattagcagc 2580 agtttcggcg gatgctactt tacggattta tgaggccatg gaacccggta atttaacgta 2640 ttggacttta atgaatgaaa ttgctttaat gccgtcccct ccttcgcgca acgagcaacc 2700 agcgttttgt gtaaattggt gcccatcacg ttggcgagaa cagtacattg ctgttggatg 2760 catgaatgat gcttacattt acaagcagaa ctctcatgga aaatggaaaa aagttgctga 2820 gttacccgga catacagatc ttattcgtga tatttgctgg gctccgagta tgggcagtag 2880 ttattacctg atagctactg catgcaaaga tgggaatgtt cgtattttta aagttgaaac 2940 cctttgcgaa gaagtatttc aagaggaaga agatgctgga aattcaatga cagaagactc 3000 aaatttcaat ctaaattctt tgaaggtgga actcattggt gaatatgata accataaatg 3060 tcaagtgtgg agatgccgtt ttaatgtgac aggaactatt ttgagcagct cgggcgatga 3120 tgggtgcgtt cgtttatgga aagcgtccta tgcaaattta tttaaatgca tatctgtcgt 3180 ttctttagaa aagtaagttc agagatgctt tcggcctcat gaatatatga tactaacttg 3240 tgacagaaaa cctgaaaaat tgtgatttac ttagttcaat tttaatgtaa cagatacatt 3300 gaatggccga ccttgcttgg tttcctttct taaattattt catatttacc agctagatca 3360 taaccaaatt gtaaaaaagg agatattact ttataaattt ttggttttaa tttataattt 3420 tagacaagtg aactaacatt cgctttgaga ttaaattttc atttagcgta aaacttcgaa 3480 attagactaa gtcttaatgt aaacttttag gtagagttca attcgaaaat ttgttttaac 3540 gaactaaaca agcgatgggt tcaaaaagct atatctgcta caggttgtag tttgaaaatc 3600 cggtaatcgc tacaaaagca taaaaaaaaa gaaaatgaaa ttgcagataa tcaagatact 3660 tgtattaaat attttcacta tgatattctt caaccatagt tcaaacgaaa caggtcattt 3720 aagacatagt agttaccatt attgttaaca aggtgaaaaa cttggctata gcgctgggcc 3780 atttgctctt catccaacta caaaaaggtt aaaaaaagag caaagcaaaa aaaaaaaaaa 3840 tgaatagaat aagtaacatc aatcttataa cttaccaaaa gctcacctgt aaccataacg 3900 attacggatc 3910 //